Marrakech
October 2018
Jumbo
Presentation and future of
the Hadoop cluster bootstrapper
What is Jumbo?
● Hadoop cluster bootstrapper
1. Describe your cluster in minutes
2. Let Jumbo provision everything
3. Start using your cluster
What is Jumbo?
● Python 3 CLI tool
● Script generator
● Leverages trusted DevOps tools:
○ Vagrant
○ Ansible
○ Ambari
Why Jumbo?
Need for multiple development environments:
● Fast and easy definition
● Fast provisioning
● Reusable code
Jumbo
● User interface
● Script generation and tools control
Jumbo internals
● VM provisioning (VBox | KVM)
● Node configuration
● Hadoop installation
Script generation:
● Vagrantfile: Jinja templating
● Ansible playbooks: Python dict → YAML (Ansible group_vars)
● Ambari blueprint: Python dict → JSON
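The Python dict → JSON step for the Ambari blueprint could look roughly like the sketch below. It is not Jumbo's actual code: the function name `build_blueprint`, the input layout and the example values are assumptions made for illustration. Only the top-level keys (`Blueprints`, `host_groups`) follow the Ambari blueprint format.

```python
import json


def build_blueprint(cluster_name, stack_version, nodes):
    """Build an Ambari-blueprint-shaped dict from a simple node description.

    `nodes` maps a host group name to the list of component names it hosts.
    (Hypothetical input layout, chosen for this sketch.)
    """
    return {
        "Blueprints": {
            "blueprint_name": cluster_name,
            "stack_name": "HDP",
            "stack_version": stack_version,
        },
        "host_groups": [
            {
                "name": group,
                "cardinality": "1",
                "components": [{"name": c} for c in components],
            }
            for group, components in nodes.items()
        ],
    }


# Example values are illustrative, not taken from the slides.
blueprint = build_blueprint(
    "mycluster",
    "2.6",
    {
        "node01": ["NAMENODE", "DATANODE"],
        "node02": ["SECONDARY_NAMENODE", "DATANODE"],
    },
)

# The dict is serialized to JSON once, at the very end.
blueprint_json = json.dumps(blueprint, indent=2)
print(blueprint_json)
```

Keeping the description as a plain dict until the last moment means the same structure can be checked or modified in Python before any file is written.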
Abstraction layer
● service
● component
● cluster
● node
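[Diagram, abstraction layer: cluster "mycluster" running the HDFS service on node01, node02 and node03, with components NN (NameNode), SNN (Secondary NameNode), DN and DN (DataNode)]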
node types
● master: master components of master-slave services
● sidemaster: key components of non master-slave services
● worker: slave components of master-slave services
● edge: server components
● ldap: security components
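One way to picture the abstraction layer and the node types is as plain Python data classes. This is a sketch only: the class and field names (`Node`, `Cluster`, `types`, `components`) are hypothetical, and the placement of NN, SNN and DN on specific nodes is an assumption, since the slides do not spell it out.

```python
from dataclasses import dataclass, field

# Node types listed on the slide.
NODE_TYPES = {"master", "sidemaster", "worker", "edge", "ldap"}


@dataclass
class Node:
    name: str
    types: list
    components: list = field(default_factory=list)

    def __post_init__(self):
        # Reject node types that are not part of the known set.
        unknown = set(self.types) - NODE_TYPES
        if unknown:
            raise ValueError(f"unknown node types: {sorted(unknown)}")


@dataclass
class Cluster:
    name: str
    nodes: list = field(default_factory=list)
    services: list = field(default_factory=list)

    def components_of(self, node_name):
        """Return the components hosted on the named node."""
        for node in self.nodes:
            if node.name == node_name:
                return node.components
        raise KeyError(node_name)


# Assumed layout for the "mycluster" example.
cluster = Cluster(
    name="mycluster",
    services=["HDFS"],
    nodes=[
        Node("node01", ["master"], ["NN"]),
        Node("node02", ["master", "worker"], ["SNN", "DN"]),
        Node("node03", ["worker"], ["DN"]),
    ],
)
print(cluster.components_of("node02"))
```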
Available features
● Nodes (VM) definition
● Services topology
○ Automatic components placement
○ Automatic installation of service dependencies
○ Manual tuning with compatibility control
○ High Availability support
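Automatic component placement and dependency installation might be sketched as follows. Everything here is illustrative: the `SERVICES` catalogue, the YARN entry and its dependency on HDFS, and the placement policy (first matching node for master components, every worker node for worker components) are assumptions, not Jumbo's documented behaviour.

```python
# Hypothetical service catalogue: for each service, its dependencies and,
# for each component, the node type it should be placed on.
SERVICES = {
    "HDFS": {
        "requires": [],
        "components": {"NN": "master", "SNN": "master", "DN": "worker"},
    },
    "YARN": {
        "requires": ["HDFS"],
        "components": {"RM": "master", "NM": "worker"},
    },
}


def resolve(service, installed):
    """Return services to install, dependencies first, skipping installed ones."""
    order = []
    for dep in SERVICES[service]["requires"]:
        for name in resolve(dep, installed + order):
            order.append(name)
    if service not in installed and service not in order:
        order.append(service)
    return order


def place(service, nodes):
    """Map each node name to the components of `service` it should host.

    Assumed policy: master components go on the first node of the matching
    type, worker components go on every worker node.
    """
    placement = {name: [] for name in nodes}
    for component, node_type in SERVICES[service]["components"].items():
        candidates = [n for n, types in nodes.items() if node_type in types]
        if not candidates:
            # Compatibility check: no node can host this component.
            raise ValueError(f"no {node_type} node for {component}")
        targets = candidates if node_type == "worker" else candidates[:1]
        for name in targets:
            placement[name].append(component)
    return placement


nodes = {"node01": ["master"], "node02": ["worker"], "node03": ["worker"]}
for service in resolve("YARN", installed=[]):
    print(service, place(service, nodes))
```

Asking for YARN on an empty cluster first installs HDFS, then YARN, and each service's components land on nodes whose type matches.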
Available features
Supported services (shown as logos on the slide, grouped by category):
● Hadoop ecosystem
● Database
● Security
● Provisioning and monitoring
Available features
● Cluster templating
● Cluster security (Kerberos)
● Centralized versions and repositories management
● VM control within Jumbo
Jumbo in action
What’s next for Jumbo?
● HDP - Hortonworks Data Platform
○ Support for all the services of the stack
○ HA support for all the services
● Smart cluster topology based on available resources
● Create a bundle model for services to ease contribution
○ Service definition
○ Ansible playbooks
● Support other stacks
○ HDF - Hortonworks DataFlow
○ CDH - Cloudera Distribution for Hadoop
Get involved!
Facing problems using Jumbo?
Missing a service for your development environment?
github.com/adaltas/jumbo
jumbo.adaltas.com
Questions?