SlideShare a Scribd company logo
2015
Cloudman: Cluster
Management for Big Data
in the Cloud
Swati Singhi
December 3, 2015
#GHCI15
2015
2015
▪Fixed pre-provisioned capacity
▪Variable and Unpredictable workloads
▪Do not scale well
▪Expensive
▪On-site IT team
Challenges of On-premise Big Data Infra
2015
Cloud offers salvation...
▪Stretches with the workload
▪Pay-as-you-go
...but brings its own challenges
▪Moving data to the cloud
▪Security/Privacy
Big Data in the Cloud
2015
Qubole as Big Data Service
▪Enables Big Data on the cloud
▪Enterprise ready deployments
▪On major public clouds
▪Simple and Fast
2015
Cloudman
▪Qubole’s Cluster management software
▪Launches half a million nodes per month
▪Works across AWS, GCE and Azure
▪Provides higher level APIs
2015
Cloudman Goals
▪Automated cluster provisioning
▪Configure Big Data Stack
▪Manage cluster lifecycle
▪Highly optimized cost of compute
2015
UI SDK API
Cloudman
Layers of Big Data as a Service
2015
Architecture
2015
Challenges
▪ Autoscale based on workload
▪ Abstractions to address differences in
behaviors of each cloud provider
Examples
− Image creation and registration
− Configuring clusters
2015
▪ Launched automatically when needed
▪ Expands automatically if the load is high
▪ Terminate the cluster with no running jobs
▪ Remove nodes at billing boundary
Autoscaling Clusters
2015
insert overwrite table dest
select … from ads join campaigns on …group by …;
Map Tasks
ReduceTasks
Demand Supply
Progress
Master
Slaves
Job Tracker
Cloudman
Cloudman: AutoScaling
2015
Image registration in AWS vs. Azure
Image creation and registration
2015
▪ Image creation
▪ Public images in AWS
▪ Not well supported in Azure
▪ Images copied to user’s account in Azure
Image creation and registration
2015
▪ Configure credentials
−Storage and Compute keys
▪Configure the big data stack
−Start appropriate s/w, example JobTracker and
NameNode on Master and TaskTracker and
DataNode on Slaves
Cluster Configuration
2015
Optimizing cost of compute in Cloud
▪ Utilize ephemeral compute instances to lower cost
− AWS Spot Instances
− GCE Preemptible VMs
▪ Challenges
− Data loss
− Big data job failures
2015
Demo
2015
2015
Key Takeaways
▪ Highly efficient cluster management system
▪ Proven at scale in production
▪ Works on multiple clouds
2015
Got Feedback?
Rate and review the session on our mobile app – Convene
For all details visit: http://ghcindia.anitaborg.org
2015
Appendix
2015
Architecture
▪ ​QDS has a user interface, Python and Java SDKs
and APIs that allows users to talk to QDS and
analyze data sets without knowing cluster
management.
▪ A QDS user can submit primitive commands to
logical clusters.
▪ The middleware layer communicates to the cloud
orchestration layer called Cloudman
▪ Cloudman is responsible for spinning up clusters in
the concerned cloud
2015
▪ One such example is Image creation and registration
▪ Procedure
▪ Precreate a machine image with all the the
softwares to be deployed baked into it
▪ We start the cluster machines using this as the
underlying image
▪ Saves us the time in deploying the softwares on
the nodes after they are up
▪ This process is very different in all the cloud
providers
Image creation and registration
2015
Cluster Configuration
▪Another operation that had to be implemented
differently for each cloud
▪Startup scripts are used for to programmatically
customize virtual machine instances
▪AWS and Google cloud had support for this
▪Azure did not support automatic execution of
this script at the VM boot up time in the Centos
VMs
2015
▪Hadoop clusters in QDS come up automatically
when applications that require them are
launched
▪If the load on the cluster is high, then the
cluster automatically expands.
▪Cloudman automatically launches additional
nodes which eventually join the running cluster
and are able to pick up part of the workload
Autoscaling Clusters

More Related Content

PDF
Activiti Cloud Overview & BluePrint: Trending Topic Campaigns
PDF
Lessons Learned: From Java EE to Spring Cloud in the context of Activiti OSS
PDF
Activiti & Activiti Cloud DevCon
PDF
JJUG CCC 2018 : Lessons Learned: Spring Cloud -> Docker -> Kubernetes
PDF
Lessons Learned: Spring Cloud -> Docker -> Kubernetes
PDF
KCD Guatemala - Abstracciones sobre Abstracciones
PDF
Deployment Via Capistrano
PDF
Keptn Meetup Interoperable ci/cd ecosystem
Activiti Cloud Overview & BluePrint: Trending Topic Campaigns
Lessons Learned: From Java EE to Spring Cloud in the context of Activiti OSS
Activiti & Activiti Cloud DevCon
JJUG CCC 2018 : Lessons Learned: Spring Cloud -> Docker -> Kubernetes
Lessons Learned: Spring Cloud -> Docker -> Kubernetes
KCD Guatemala - Abstracciones sobre Abstracciones
Deployment Via Capistrano
Keptn Meetup Interoperable ci/cd ecosystem

What's hot (20)

ODP
Gluster d2.0
PDF
Azure Service Operator - Provision Your Resources in a Cloud-Native Way
PDF
SquareScale Munich Cloud Native Night
PDF
[WSO2Con USA 2018] Architecting for Container-native Environments
PDF
Orchestrating Cloud Events - Knative Meetup 2020
PPTX
Cloud Native Application Framework
PDF
Google Cloud Functions & Firebase Crash Course
PDF
Orchestrating Microservices
PDF
Buzzwords: Microservices, containers and serverless - real life applications ...
PDF
Servers? Where we're going we don't need servers.
PDF
QCon Plus From monoliths to k8s - Workshop
PPTX
TIAD : In a chocolate factory
PPTX
TIAD : Full stack automation
PPT
Gwt training presentation
PDF
Create A Mapping Web Part
PPTX
CloudStack User Group Overview And News - 12 feb 2015
PDF
The Future of Workflow Automation Is Now - Hassle-Free ARM Template Deploymen...
PPTX
Cloud Automation with ProActive
PPTX
Meetup 23 - 03 - Application Delivery on K8S with GitOps
PDF
Extending and Integrating QlikView
Gluster d2.0
Azure Service Operator - Provision Your Resources in a Cloud-Native Way
SquareScale Munich Cloud Native Night
[WSO2Con USA 2018] Architecting for Container-native Environments
Orchestrating Cloud Events - Knative Meetup 2020
Cloud Native Application Framework
Google Cloud Functions & Firebase Crash Course
Orchestrating Microservices
Buzzwords: Microservices, containers and serverless - real life applications ...
Servers? Where we're going we don't need servers.
QCon Plus From monoliths to k8s - Workshop
TIAD : In a chocolate factory
TIAD : Full stack automation
Gwt training presentation
Create A Mapping Web Part
CloudStack User Group Overview And News - 12 feb 2015
The Future of Workflow Automation Is Now - Hassle-Free ARM Template Deploymen...
Cloud Automation with ProActive
Meetup 23 - 03 - Application Delivery on K8S with GitOps
Extending and Integrating QlikView
Ad

Similar to GraceHopper 2015, Cluster Management for Big Data in the Cloud (20)

PDF
Introduction to GCP
PDF
Getting Started with Google Cloud Platform: A Beginner’s Guide
PDF
Serverless Days Ahmedabad - Dhaval Nagar.pptx.pdf
PDF
Making Money in the Cloud
PPTX
2015 cloud trend and cloud DR
PDF
Dhaval Nagar - ServerlessDays Bengaluru 2023
PDF
Cloud Migration Services | Mindtree
PDF
Smart Integration to the Cloud - Kellton Tech Webinar
PPTX
VisiQuate: Azure cloud migration case study
PDF
SRE & Kubernetes
PDF
User Group Presentation | AWS 2022 Latest Release
PPTX
An Introduction to Talend Integration Cloud
PDF
We are Net3 Technology
PDF
Serverless solutions on GCF
PDF
Building ISV Applications that run in the cloud with SQL Anywhere On-Demand E...
PPTX
What serverless means for enterprise apps
PDF
7 Myths about Cloud Computing
PDF
Tips For Building Private Cloud Architecture With Virtualization
PPTX
29Aug2024_CloudHub2_MuleSoft_Meetup.pptx
PDF
Right scale enterprise solution
Introduction to GCP
Getting Started with Google Cloud Platform: A Beginner’s Guide
Serverless Days Ahmedabad - Dhaval Nagar.pptx.pdf
Making Money in the Cloud
2015 cloud trend and cloud DR
Dhaval Nagar - ServerlessDays Bengaluru 2023
Cloud Migration Services | Mindtree
Smart Integration to the Cloud - Kellton Tech Webinar
VisiQuate: Azure cloud migration case study
SRE & Kubernetes
User Group Presentation | AWS 2022 Latest Release
An Introduction to Talend Integration Cloud
We are Net3 Technology
Serverless solutions on GCF
Building ISV Applications that run in the cloud with SQL Anywhere On-Demand E...
What serverless means for enterprise apps
7 Myths about Cloud Computing
Tips For Building Private Cloud Architecture With Virtualization
29Aug2024_CloudHub2_MuleSoft_Meetup.pptx
Right scale enterprise solution
Ad

Recently uploaded (20)

PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PDF
Hindi spoken digit analysis for native and non-native speakers
PDF
Heart disease approach using modified random forest and particle swarm optimi...
PPTX
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
PDF
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
PPTX
cloud_computing_Infrastucture_as_cloud_p
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PPTX
OMC Textile Division Presentation 2021.pptx
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Accuracy of neural networks in brain wave diagnosis of schizophrenia
PPTX
A Presentation on Touch Screen Technology
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
Hindi spoken digit analysis for native and non-native speakers
Heart disease approach using modified random forest and particle swarm optimi...
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
cloud_computing_Infrastucture_as_cloud_p
Building Integrated photovoltaic BIPV_UPV.pdf
NewMind AI Weekly Chronicles - August'25-Week II
Univ-Connecticut-ChatGPT-Presentaion.pdf
OMC Textile Division Presentation 2021.pptx
1 - Historical Antecedents, Social Consideration.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Zenith AI: Advanced Artificial Intelligence
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Encapsulation_ Review paper, used for researhc scholars
Accuracy of neural networks in brain wave diagnosis of schizophrenia
A Presentation on Touch Screen Technology
Unlocking AI with Model Context Protocol (MCP)
Group 1 Presentation -Planning and Decision Making .pptx

GraceHopper 2015, Cluster Management for Big Data in the Cloud

  • 1. 2015 Cloudman: Cluster Management for Big Data in the Cloud Swati Singhi December 3, 2015 #GHCI15 2015
  • 2. 2015 ▪Fixed pre-provisioned capacity ▪Variable and Unpredictable workloads ▪Do not scale well ▪Expensive ▪On-site IT team Challenges of On-premise Big Data Infra
  • 3. 2015 Cloud offers salvation... ▪Stretches with the workload ▪Pay-as-you-go ...but brings its own challenges ▪Moving data to the cloud ▪Security/Privacy Big Data in the Cloud
  • 4. 2015 Qubole as Big Data Service ▪Enables Big Data on the cloud ▪Enterprise ready deployments ▪On major public clouds ▪Simple and Fast
  • 5. 2015 Cloudman ▪Qubole’s Cluster management software ▪Launches half a million nodes per month ▪Works across AWS, GCE and Azure ▪Provides higher level APIs
  • 6. 2015 Cloudman Goals ▪Automated cluster provisioning ▪Configure Big Data Stack ▪Manage cluster lifecycle ▪Highly optimized cost of compute
  • 7. 2015 UI SDK API Cloudman Layers of Big Data as a Service
  • 9. 2015 Challenges ▪ Autoscale based on workload ▪ Abstractions to address differences in behaviors of each cloud provider Examples − Image creation and registration − Configuring clusters
  • 10. 2015 ▪ Launched automatically when needed ▪ Expands automatically if the load is high ▪ Terminate the cluster with no running jobs ▪ Remove nodes at billing boundary Autoscaling Clusters
  • 11. 2015 insert overwrite table dest select … from ads join campaigns on …group by …; Map Tasks ReduceTasks Demand Supply Progress Master Slaves Job Tracker Cloudman Cloudman: AutoScaling
  • 12. 2015 Image registration in AWS vs. Azure Image creation and registration
  • 13. 2015 ▪ Image creation ▪ Public images in AWS ▪ Not well supported in Azure ▪ Images copied to user’s account in Azure Image creation and registration
  • 14. 2015 ▪ Configure credentials −Storage and Compute keys ▪Configure the big data stack −Start appropriate s/w, example JobTracker and NameNode on Master and TaskTracker and DataNode on Slaves Cluster Configuration
  • 15. 2015 Optimizing cost of compute in Cloud ▪ Utilize ephemeral compute instances to lower cost − AWS Spot Instances − GCE Preemptible VMs ▪ Challenges − Data loss − Big data job failures
  • 17. 2015
  • 18. 2015 Key Takeaways ▪ Highly efficient cluster management system ▪ Proven at scale in production ▪ Works on multiple clouds
  • 19. 2015 Got Feedback? Rate and review the session on our mobile app – Convene For all details visit: http://ghcindia.anitaborg.org
  • 21. 2015 Architecture ▪ ​QDS has a user interface, Python and Java SDKs and APIs that allows users to talk to QDS and analyze data sets without knowing cluster management. ▪ A QDS user can submit primitive commands to logical clusters. ▪ The middleware layer communicates to the cloud orchestration layer called Cloudman ▪ Cloudman is responsible for spinning up clusters in the concerned cloud
  • 22. 2015 ▪ One such example is Image creation and registration ▪ Procedure ▪ Precreate a machine image with all the the softwares to be deployed baked into it ▪ We start the cluster machines using this as the underlying image ▪ Saves us the time in deploying the softwares on the nodes after they are up ▪ This process is very different in all the cloud providers Image creation and registration
  • 23. 2015 Cluster Configuration ▪Another operation that had to be implemented differently for each cloud ▪Startup scripts are used for to programmatically customize virtual machine instances ▪AWS and Google cloud had support for this ▪Azure did not support automatic execution of this script at the VM boot up time in the Centos VMs
  • 24. 2015 ▪Hadoop clusters in QDS come up automatically when applications that require them are launched ▪If the load on the cluster is high, then the cluster automatically expands. ▪Cloudman automatically launches additional nodes which eventually join the running cluster and are able to pick up part of the workload Autoscaling Clusters