SlideShare a Scribd company logo
HP Hadoop Platform
High Performance – High Throughput and Low Latency
HP Hadoop Platform
Reliable Scalable Cost Effective
• Execute High through put Transformations
• High Reliability and fault tolerance
• MR Engine for High Throughput
• TEZ for faster response
• Hive - High Productivity for SQL Developers
• Pig – Flow visualization makes understanding easy
• Pig and Hive Integration using HCatalog
HP Hadoop Platform
Hive/Pig– High Throughput Queries
• Presto is in-memory based- stores query results in memory and subsequent
operations on query result are faster.
• Presto works on top of HDFS and YARN
• Leverages Hive Meta store
• Offers flexibility in deployment- Can be deployed on few nodes in cluster
• HDP Certified release available
HP Hadoop Platform
Presto – Low Latency Queries
• HDFS Storage– Data Locality. Suited for Lambda architecture
• On top of HBase/Big Table columnar storage
• Consistency in CAP theorem
• Linear Scalability
• HTTP API for pulling Readings.
• Write with millisecond precision
• Integrates with Visualization tools
OpenTSDB – Time Series Solution
HP Hadoop Platform
• In Memory computation engine
• 100x faster than MapReduce.
• Ships with ML Lib – Machine Learning Library
• Usable with Scala/Python/R/Java
• Integrates with Kafka, Hive SQL, OpenTSDb
• GraphX/Graph Frame – Graph Query Capabilities
HP Hadoop Platform
Spark – Speed up Computation and ML
• Supported on Hortonworks Platform
• Based on Lucene Search engine
• HDFS Storage – Scalable storage
• Solr Cloud - Distributed architecture
• Near Real-Time Indexing
• Phonetic Matching supported
HP Hadoop Platform
Solr Indexing – Scalable Search

More Related Content

PDF
HBaseConAsia2018 Track3-3: HBase at China Life Insurance
PPTX
Rakuten techconf2015.baiji.he.bigdataforsmallstartupandbeyond
PPTX
HBaseConAsia2018 Track2-2: Apache Kylin on HBase: Extreme OLAP for big data
PPT
Schema Design
PPTX
HBaseConAsia2018: Track2-5: JanusGraph-Distributed graph database with HBase
PDF
HBaseConAsia2018 Track2-6: Scaling 30TB's of data lake with Apache HBase and ...
PDF
SAP DAY 2018 - Johan Francken
 
PDF
Apache spark on Hadoop Yarn Resource Manager
HBaseConAsia2018 Track3-3: HBase at China Life Insurance
Rakuten techconf2015.baiji.he.bigdataforsmallstartupandbeyond
HBaseConAsia2018 Track2-2: Apache Kylin on HBase: Extreme OLAP for big data
Schema Design
HBaseConAsia2018: Track2-5: JanusGraph-Distributed graph database with HBase
HBaseConAsia2018 Track2-6: Scaling 30TB's of data lake with Apache HBase and ...
SAP DAY 2018 - Johan Francken
 
Apache spark on Hadoop Yarn Resource Manager

What's hot (20)

PDF
Riak at shareaholic
PDF
Bigdata and Hadoop with Docker
PPTX
Change Data Capture using Kafka
PDF
#GeodeSummit: Combining Stream Processing and In-Memory Data Grids for Near-R...
PDF
Using Kafka as a Database For Real-Time Transaction Processing | Chad Preisle...
PPTX
How Alibaba Cloud scaled ApsaraDB with MariaDB MaxScale
PPTX
Kafka website activity architecture
PPTX
HBaseConAsia2018 Track3-5: HBase Practice at Lianjia
PDF
Spark as part of a Hybrid RDBMS Architecture-John Leach Cofounder Splice Machine
PDF
#GeodeSummit - Where Does Geode Fit in Modern System Architectures
PPTX
Apache Spark on Kubernetes
PPTX
RedisConf17 - Home Depot - Turbo charging existing applications with Redis
PPTX
DC Migration and Hadoop Scale For Big Billion Days
PDF
Streaming Data Analytics with ksqlDB and Superset | Robert Stolz, Preset
PPTX
Building a Scalable and Modern Infrastructure at CARFAX
KEY
From Batch to Realtime with Hadoop - Berlin Buzzwords - June 2012
PPTX
HBaseConAsia2018 Track3-7: The application of HBase in New Energy Vehicle Mon...
PPTX
Devops Days, 2019 - Charlotte
PPTX
Big Data Platform at Pinterest
PDF
Kafka at the core of an AIOps pipeline | Sunanda Kommula, Selector.ai and Ala...
Riak at shareaholic
Bigdata and Hadoop with Docker
Change Data Capture using Kafka
#GeodeSummit: Combining Stream Processing and In-Memory Data Grids for Near-R...
Using Kafka as a Database For Real-Time Transaction Processing | Chad Preisle...
How Alibaba Cloud scaled ApsaraDB with MariaDB MaxScale
Kafka website activity architecture
HBaseConAsia2018 Track3-5: HBase Practice at Lianjia
Spark as part of a Hybrid RDBMS Architecture-John Leach Cofounder Splice Machine
#GeodeSummit - Where Does Geode Fit in Modern System Architectures
Apache Spark on Kubernetes
RedisConf17 - Home Depot - Turbo charging existing applications with Redis
DC Migration and Hadoop Scale For Big Billion Days
Streaming Data Analytics with ksqlDB and Superset | Robert Stolz, Preset
Building a Scalable and Modern Infrastructure at CARFAX
From Batch to Realtime with Hadoop - Berlin Buzzwords - June 2012
HBaseConAsia2018 Track3-7: The application of HBase in New Energy Vehicle Mon...
Devops Days, 2019 - Charlotte
Big Data Platform at Pinterest
Kafka at the core of an AIOps pipeline | Sunanda Kommula, Selector.ai and Ala...
Ad

Similar to Hp hadoop platform (20)

PPTX
Cloudera Hadoop Distribution
PPTX
Big Data and Hadoop Training in Chandigarh
PDF
Introduction To Hadoop Ecosystem
PDF
Technologies for Data Analytics Platform
PPT
Etu Solution Day 2014 Track-D: 掌握Impala和Spark
PPTX
Big Data and Hadoop - History, Technical Deep Dive, and Industry Trends
PPTX
Concepts on Hadoop
PPTX
Analytics using big data technologies
PPTX
Big Data and Hadoop - History, Technical Deep Dive, and Industry Trends
PPT
Hadoop distributions - ecosystem
PPTX
Hadoop And Their Ecosystem ppt
PPTX
Hadoop And Their Ecosystem
PPTX
Indexing with solr search server and hadoop framework
PDF
Big Data Developers Moscow Meetup 1 - sql on hadoop
PPTX
Introduction to Hadoop
PPTX
hadoop-ecosystem-ppt.pptx
PPTX
SQL on Hadoop
PPTX
SQL Server 2012 and Big Data
PDF
Search On Hadoop
PDF
The Zoo Expands: Labrador *Loves* Elephant, Thanks to Hamster
Cloudera Hadoop Distribution
Big Data and Hadoop Training in Chandigarh
Introduction To Hadoop Ecosystem
Technologies for Data Analytics Platform
Etu Solution Day 2014 Track-D: 掌握Impala和Spark
Big Data and Hadoop - History, Technical Deep Dive, and Industry Trends
Concepts on Hadoop
Analytics using big data technologies
Big Data and Hadoop - History, Technical Deep Dive, and Industry Trends
Hadoop distributions - ecosystem
Hadoop And Their Ecosystem ppt
Hadoop And Their Ecosystem
Indexing with solr search server and hadoop framework
Big Data Developers Moscow Meetup 1 - sql on hadoop
Introduction to Hadoop
hadoop-ecosystem-ppt.pptx
SQL on Hadoop
SQL Server 2012 and Big Data
Search On Hadoop
The Zoo Expands: Labrador *Loves* Elephant, Thanks to Hamster
Ad

Recently uploaded (20)

PDF
Digital Logic Computer Design lecture notes
PPTX
Lesson 3_Tessellation.pptx finite Mathematics
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PPTX
MET 305 MODULE 1 KTU 2019 SCHEME 25.pptx
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PPT
Drone Technology Electronics components_1
PPTX
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PPTX
Construction Project Organization Group 2.pptx
PPTX
web development for engineering and engineering
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PDF
Operating System & Kernel Study Guide-1 - converted.pdf
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PPTX
Foundation to blockchain - A guide to Blockchain Tech
PDF
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PDF
ETO & MEO Certificate of Competency Questions and Answers
PDF
composite construction of structures.pdf
Digital Logic Computer Design lecture notes
Lesson 3_Tessellation.pptx finite Mathematics
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
MET 305 MODULE 1 KTU 2019 SCHEME 25.pptx
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
Drone Technology Electronics components_1
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
Model Code of Practice - Construction Work - 21102022 .pdf
Construction Project Organization Group 2.pptx
web development for engineering and engineering
UNIT-1 - COAL BASED THERMAL POWER PLANTS
Operating System & Kernel Study Guide-1 - converted.pdf
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Foundation to blockchain - A guide to Blockchain Tech
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
ETO & MEO Certificate of Competency Questions and Answers
composite construction of structures.pdf

Hp hadoop platform

  • 1. HP Hadoop Platform High Performance – High Throughput and Low Latency
  • 2. HP Hadoop Platform Reliable Scalable Cost Effective
  • 3. • Execute High through put Transformations • High Reliability and fault tolerance • MR Engine for High Throughput • TEZ for faster response • Hive - High Productivity for SQL Developers • Pig – Flow visualization makes understanding easy • Pig and Hive Integration using HCatalog HP Hadoop Platform Hive/Pig– High Throughput Queries
  • 4. • Presto is in-memory based- stores query results in memory and subsequent operations on query result are faster. • Presto works on top of HDFS and YARN • Leverages Hive Meta store • Offers flexibility in deployment- Can be deployed on few nodes in cluster • HDP Certified release available HP Hadoop Platform Presto – Low Latency Queries
  • 5. • HDFS Storage– Data Locality. Suited for Lambda architecture • On top of HBase/Big Table columnar storage • Consistency in CAP theorem • Linear Scalability • HTTP API for pulling Readings. • Write with millisecond precision • Integrates with Visualization tools OpenTSDB – Time Series Solution HP Hadoop Platform
  • 6. • In Memory computation engine • 100x faster than MapReduce. • Ships with ML Lib – Machine Learning Library • Usable with Scala/Python/R/Java • Integrates with Kafka, Hive SQL, OpenTSDb • GraphX/Graph Frame – Graph Query Capabilities HP Hadoop Platform Spark – Speed up Computation and ML
  • 7. • Supported on Hortonworks Platform • Based on Lucene Search engine • HDFS Storage – Scalable storage • Solr Cloud - Distributed architecture • Near Real-Time Indexing • Phonetic Matching supported HP Hadoop Platform Solr Indexing – Scalable Search