Diving	in	the	Desert—
Running	your	HDP	Cluster	with	
Helion OpenStack	and	Sahara
Alex	Tesch
Cloud	Evangelist,	Asia	Pacific	and	Japan
HPE
Which Workloads Are Running Today on OpenStack
These Projects Are Gaining in Popularity
Architecture
limitations
Single workload Hadoop cluster
infrastructure choices lead to
cluster sprawl and management
difficulties as newer varying
workloads arrive
Implementation
difficulties
Lack of expertise in Big Data
Infrastructure and Hadoop
architecture leads to significant risks
in the journey to Hadoop and
impedes time to value
Immaturity of Big Data
ecosystem
Hadoop ecosystem products vary
in levels of maturity, needing
careful integration of multiple
vendor products making DIY risky
for enterprise workloads
Challenges in Maximizing Value with Hadoop
How Sahara helps to Address These Challenges
Data processing services on OpenStack Cloud
• Sahara allows:
• Leverage on OpenStack multitenancy and run several clusters
• Deploy preferred distribution based on Sahara plugins available
• Choose the best topology for your cluster
• Configure only the services that you will use
• Run your jobs!!!
Please Join Us in our Booth for a Full Demo!!!
• Introducing Hybrid Hadoop
• HDP on OpenStack Sahara
• Containerized Hadoop
Find	more	#DWS17	sessions	and	slides	at:	
www.DataWorksSummit.com
8
T H A N K 	 Y O U

More Related Content

PPTX
Accelerate Your Big Data Analytics Efforts with SAS and Hadoop
PDF
Startup Case Study: Leveraging the Broad Hadoop Ecosystem to Develop World-Fi...
PPTX
Accelerating Big Data Insights
PPTX
Cloudy with a chance of Hadoop - real world considerations
PDF
DataOps with Project Amaterasu
PPTX
Apache Ignite vs Alluxio: Memory Speed Big Data Analytics
PPTX
Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...
PPTX
Cloudy with a Chance of Hadoop - Real World Considerations
Accelerate Your Big Data Analytics Efforts with SAS and Hadoop
Startup Case Study: Leveraging the Broad Hadoop Ecosystem to Develop World-Fi...
Accelerating Big Data Insights
Cloudy with a chance of Hadoop - real world considerations
DataOps with Project Amaterasu
Apache Ignite vs Alluxio: Memory Speed Big Data Analytics
Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...
Cloudy with a Chance of Hadoop - Real World Considerations

What's hot (20)

PPTX
How to Use Apache Zeppelin with HWX HDB
PDF
Running Zeppelin in Enterprise
PPTX
Successes, Challenges, and Pitfalls Migrating a SAAS business to Hadoop
PPTX
Hadoop in the Cloud - The what, why and how from the experts
PDF
Hortonworks Technical Workshop: HDP everywhere - cloud considerations using...
PDF
Hybrid is the New Normal
PPTX
Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL S...
PDF
Data in the Cloud Crash Course
PPTX
Built-In Security for the Cloud
PPTX
Big Data in the Cloud - The What, Why and How from the Experts
PPTX
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
PPTX
Treat your enterprise data lake indigestion: Enterprise ready security and go...
PDF
Realizing the Promise of Portable Data Processing with Apache Beam
PPTX
Cloudbreak - Technical Deep Dive
PPTX
Dancing Elephants - Efficiently Working with Object Stores from Apache Spark ...
PPTX
Build Big Data Enterprise solutions faster on Azure HDInsight
PPTX
Deep Learning using Spark and DL4J for fun and profit
PDF
Hortonworks tech workshop in-memory processing with spark
PPTX
Logical Data Warehouse: How to Build a Virtualized Data Services Layer
PPTX
An Overview on Optimization in Apache Hive: Past, Present Future
How to Use Apache Zeppelin with HWX HDB
Running Zeppelin in Enterprise
Successes, Challenges, and Pitfalls Migrating a SAAS business to Hadoop
Hadoop in the Cloud - The what, why and how from the experts
Hortonworks Technical Workshop: HDP everywhere - cloud considerations using...
Hybrid is the New Normal
Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL S...
Data in the Cloud Crash Course
Built-In Security for the Cloud
Big Data in the Cloud - The What, Why and How from the Experts
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
Treat your enterprise data lake indigestion: Enterprise ready security and go...
Realizing the Promise of Portable Data Processing with Apache Beam
Cloudbreak - Technical Deep Dive
Dancing Elephants - Efficiently Working with Object Stores from Apache Spark ...
Build Big Data Enterprise solutions faster on Azure HDInsight
Deep Learning using Spark and DL4J for fun and profit
Hortonworks tech workshop in-memory processing with spark
Logical Data Warehouse: How to Build a Virtualized Data Services Layer
An Overview on Optimization in Apache Hive: Past, Present Future
Ad

Viewers also liked (17)

PDF
Delivering Data Science to the Business
PDF
SparkR Best Practices for R Data Scientists
PDF
The Apache Way
PDF
Data Guarantees and Fault Tolerance in Streaming Systems
PDF
How Big Data and Deep Learning are Revolutionizing AML and Financial Crime De...
PDF
Beyond Big Data: Data Science and AI
PDF
MaaS (Model as a Service): Modern Streaming Data Science with Apache Metron
PDF
Apache Hadoop Crash Course
PDF
Next Generation Execution for Apache Storm
PDF
Data-In-Motion Unleashed
PDF
The Power of Intelligent Flows: Real-Time IoT Botnet Classification with Apac...
PDF
Data Science Crash Course
PDF
The Future of Data in Telecom and the Rise of Connected Communities
PDF
Apache Spark Crash Course
PDF
An Apache Hive Based Data Warehouse
PDF
Intelligently Collecting Data at the Edge - Intro to Apache MiNiFi
PPTX
Performance Update: When Apache ORC Met Apache Spark
Delivering Data Science to the Business
SparkR Best Practices for R Data Scientists
The Apache Way
Data Guarantees and Fault Tolerance in Streaming Systems
How Big Data and Deep Learning are Revolutionizing AML and Financial Crime De...
Beyond Big Data: Data Science and AI
MaaS (Model as a Service): Modern Streaming Data Science with Apache Metron
Apache Hadoop Crash Course
Next Generation Execution for Apache Storm
Data-In-Motion Unleashed
The Power of Intelligent Flows: Real-Time IoT Botnet Classification with Apac...
Data Science Crash Course
The Future of Data in Telecom and the Rise of Connected Communities
Apache Spark Crash Course
An Apache Hive Based Data Warehouse
Intelligently Collecting Data at the Edge - Intro to Apache MiNiFi
Performance Update: When Apache ORC Met Apache Spark
Ad

Similar to Driving in the Desert - Running Your HDP Cluster with Helion, Openstack, and Sahara (20)

PDF
OpenStack in Action! 5 - Dell - OpenStack powered solutions - Patrick Hamon
PPTX
How to Upgrade Your Hadoop Stack in 1 Step -- with Zero Downtime
PPTX
OpenStack & Cloud Foundry (OpenStack Fall 2012 Summit)
PDF
Webinar - Introduction to Ceph and OpenStack
PDF
Dell openstack cloud with inktank ceph – large scale customer deployment
PPTX
Introduction to the Hadoop EcoSystem
PPTX
Bridging Oracle Database and Hadoop by Alex Gorbachev, Pythian from Oracle Op...
PDF
Designing OpenStack Architectures
PDF
Running Hadoop as Service in AltiScale Platform
PDF
Applications on Hadoop
PDF
Hadoop Application Architectures Mark Grover Ted Malaska Jonathan Seidman Gwe...
PDF
CEPH & OPENSTACK - Red Hat's Winning Combination for Enterprise Clouds
PDF
[Azureビッグデータ関連サービスとHortonworks勉強会] Azure HDInsight
PDF
OpenStack in Action 4! Patrick Hamon - Architectures of reference for OpenSta...
PDF
Docker and OpenStack Boston Meetup
PPTX
Cloudera Manager Webinar | Cloudera Enterprise 3.7
PPTX
A glimpse into the Future of Hadoop & Big Data
PDF
Designing OpenStack Architectures
PDF
Hashicorp at holaluz
PPTX
Docker based Hadoop Deployment
OpenStack in Action! 5 - Dell - OpenStack powered solutions - Patrick Hamon
How to Upgrade Your Hadoop Stack in 1 Step -- with Zero Downtime
OpenStack & Cloud Foundry (OpenStack Fall 2012 Summit)
Webinar - Introduction to Ceph and OpenStack
Dell openstack cloud with inktank ceph – large scale customer deployment
Introduction to the Hadoop EcoSystem
Bridging Oracle Database and Hadoop by Alex Gorbachev, Pythian from Oracle Op...
Designing OpenStack Architectures
Running Hadoop as Service in AltiScale Platform
Applications on Hadoop
Hadoop Application Architectures Mark Grover Ted Malaska Jonathan Seidman Gwe...
CEPH & OPENSTACK - Red Hat's Winning Combination for Enterprise Clouds
[Azureビッグデータ関連サービスとHortonworks勉強会] Azure HDInsight
OpenStack in Action 4! Patrick Hamon - Architectures of reference for OpenSta...
Docker and OpenStack Boston Meetup
Cloudera Manager Webinar | Cloudera Enterprise 3.7
A glimpse into the Future of Hadoop & Big Data
Designing OpenStack Architectures
Hashicorp at holaluz
Docker based Hadoop Deployment

More from DataWorks Summit (20)

PPTX
Data Science Crash Course
PPTX
Floating on a RAFT: HBase Durability with Apache Ratis
PPTX
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
PDF
HBase Tales From the Trenches - Short stories about most common HBase operati...
PPTX
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
PPTX
Managing the Dewey Decimal System
PPTX
Practical NoSQL: Accumulo's dirlist Example
PPTX
HBase Global Indexing to support large-scale data ingestion at Uber
PPTX
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
PPTX
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
PPTX
Supporting Apache HBase : Troubleshooting and Supportability Improvements
PPTX
Security Framework for Multitenant Architecture
PDF
Presto: Optimizing Performance of SQL-on-Anything Engine
PPTX
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
PPTX
Extending Twitter's Data Platform to Google Cloud
PPTX
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
PPTX
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
PPTX
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
PDF
Computer Vision: Coming to a Store Near You
PPTX
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Data Science Crash Course
Floating on a RAFT: HBase Durability with Apache Ratis
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
HBase Tales From the Trenches - Short stories about most common HBase operati...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Managing the Dewey Decimal System
Practical NoSQL: Accumulo's dirlist Example
HBase Global Indexing to support large-scale data ingestion at Uber
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Security Framework for Multitenant Architecture
Presto: Optimizing Performance of SQL-on-Anything Engine
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Extending Twitter's Data Platform to Google Cloud
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Computer Vision: Coming to a Store Near You
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark

Recently uploaded (20)

PDF
Enhancing emotion recognition model for a student engagement use case through...
PPTX
2018-HIPAA-Renewal-Training for executives
PPTX
Configure Apache Mutual Authentication
PDF
A contest of sentiment analysis: k-nearest neighbor versus neural network
PDF
Consumable AI The What, Why & How for Small Teams.pdf
PDF
Produktkatalog für HOBO Datenlogger, Wetterstationen, Sensoren, Software und ...
PDF
CloudStack 4.21: First Look Webinar slides
PDF
NewMind AI Weekly Chronicles – August ’25 Week III
PDF
A Late Bloomer's Guide to GenAI: Ethics, Bias, and Effective Prompting - Boha...
PPTX
Benefits of Physical activity for teenagers.pptx
PDF
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
PDF
sustainability-14-14877-v2.pddhzftheheeeee
PDF
Getting started with AI Agents and Multi-Agent Systems
DOCX
search engine optimization ppt fir known well about this
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
Five Habits of High-Impact Board Members
PDF
Convolutional neural network based encoder-decoder for efficient real-time ob...
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PPTX
AI IN MARKETING- PRESENTED BY ANWAR KABIR 1st June 2025.pptx
PPTX
Custom Battery Pack Design Considerations for Performance and Safety
Enhancing emotion recognition model for a student engagement use case through...
2018-HIPAA-Renewal-Training for executives
Configure Apache Mutual Authentication
A contest of sentiment analysis: k-nearest neighbor versus neural network
Consumable AI The What, Why & How for Small Teams.pdf
Produktkatalog für HOBO Datenlogger, Wetterstationen, Sensoren, Software und ...
CloudStack 4.21: First Look Webinar slides
NewMind AI Weekly Chronicles – August ’25 Week III
A Late Bloomer's Guide to GenAI: Ethics, Bias, and Effective Prompting - Boha...
Benefits of Physical activity for teenagers.pptx
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
sustainability-14-14877-v2.pddhzftheheeeee
Getting started with AI Agents and Multi-Agent Systems
search engine optimization ppt fir known well about this
Zenith AI: Advanced Artificial Intelligence
Five Habits of High-Impact Board Members
Convolutional neural network based encoder-decoder for efficient real-time ob...
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
AI IN MARKETING- PRESENTED BY ANWAR KABIR 1st June 2025.pptx
Custom Battery Pack Design Considerations for Performance and Safety

Driving in the Desert - Running Your HDP Cluster with Helion, Openstack, and Sahara