SlideShare a Scribd company logo
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
System health 
Analytics 
Customer experience 
Insights 
Anomaly detection
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
User 
Job 1 
User 
Job 2 
User 
Job 3 
Discovery
Apache Mesos 
Apache Mesos 
Apache Mesos 
MManantitsis 
Mantis 
ASG 
ASG 
ASG 
Fenzo 
Mesos 
Framework
Job 
Job 
Job
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
FrmWrk1 FrmWrk2 
Mesos master Standby master Standby master 
Mesos slave 
FrmWrk2 executor 
TaTasksk 
Mesos slave 
FrmWrk1 executor 
TaTasksk 
FrmWrk2 executor 
Mesos slave 
FrmWrk1 executor 
TaTasksk
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
Instance 1 
Instance 1 
Instance 2 
Task A 
Task B 
Instance 1 
Task A 
Instance 1 
Task C Task D 
Task B 
Task A
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
Data 
stream 
Task1 
Host A 
Task2 
Host B 
Task3 
Host C
Data 
stream 
Data 
stream 
Task1 
Task2 
Task3 
Host X 
Task1 
Host A 
Task2 
Host B 
Task3 
Host C
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
Mantis 
Mantis 
Mantis 
Fenzo 
Mesos 
Framework
Mesos slave 
FrmWrk1 executor 
FrmWrk1 executor 
Mesos slave 
Framework executor 
TaTasksk Task 
Framework executor 
Task
Apache Mesos 
Apache Mesos 
Mesos Master 
Apache Mesos 
Framework 
Persistence
Apache Mesos 
Apache Mesos 
Mesos Master 
Apache Mesos 
Framework 
Persistence
.setName(name) 
.setFailoverTimeout(to) 
.setId(id) 
.setCheckpoint(true) 
.build();
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
Heterogeneous 
Autoscale 
Visibility 
Plugins for 
Constraints, Fitness 
High speed
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
Mesos master 
Mesos framework 
Tasks 
requests 
Available 
resource 
offers 
Fenzo task 
scheduler 
Persistence
Fitness 
Urgency
Speed Accuracy 
First fit assignment Optimal assignment 
Real world trade-offs 
~ O (1) ~ O (N * M)1 
1 Assuming tasks are not reassigned
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
✔
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
2150 
1950 
1750 
1550 
1350 
1150 
950 
750 
#Hosts 
No bin packing used 
#Full 
#Partial 
#Empty 
2150 
1950 
1750 
1550 
1350 
1150 
950 
750 
#Hosts 
With bin packing 
#Full 
#Partial 
#Empty
2050 
1950 
1850 
1750 
1650 
1550 
1450 
#Hosts 
No task runtime-based 
packer 
Different 
runtimes 
Same 
runtimes 
Unused 
2050 
1950 
1850 
1750 
1650 
1550 
1450 
#Hosts 
Using task runtime-based 
packer 
Different 
runtimes 
Same 
runtimes 
Unused
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
ASG/Cluster: 
mantisagent 
MinIdle: 8 
MaxIdle: 20 
CooldownSecs: 
360 
ASG/Cluster: 
mantisagent 
MinIdle: 8 
MaxIdle: 20 
CooldownSecs: 
360 
ASG/cluster: 
mantisagent 
MinIdle: 8 
MaxIdle: 20 
CooldownSecs: 360 
Fenzo 
ScaleUp 
action: 
Cluster, N 
ScaleDown 
action: 
Cluster, 
HostList
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
.withLeaseOfferExpirySecs(60) 
.withLeaseRejectAction( (lease) -> { 
mesosDriver.declineOffer(lease.getOffer().getId()); 
})
.withLeaseOfferExpirySecs(60) 
.withLeaseRejectAction( (lease) -> { 
mesosDriver.declineOffer(lease.getOffer().getId()); 
} ) 
.withFitnessCalculator( 
BinPackingFitnessCalculators.cpuBinPacker)
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
Heterogeneous 
Autoscale 
Visibility 
Plugins for 
Constraints, Fitness 
High speed
Apache Mesos 
Apache Mesos 
Apache Mesos 
MManantitsis 
Mantis 
ASG 
ASG 
ASG 
Fenzo 
Mesos 
Framework
AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud
Talk Time Title 
PFC-305 Wednesday, 1:15pm Embracing Failure: Fault Injection and Service Reliability 
BDT-403 Wednesday, 2:15pm Next Generation Big Data Platform at Netflix 
PFC-306 Wednesday, 3:30pm Performance Tuning EC2 
DEV-309 Wednesday, 3:30pm From Asgard to Zuul, How Netflix’s proven Open Source 
Tools can accelerate and scale your services 
ARC-317 Wednesday, 4:30pm Maintaining a Resilient Front-Door at Massive Scale 
PFC-304 Wednesday, 4:30pm Effective Inter-process Communications in the Cloud: The 
Pros and Cons of Micro Services Architectures 
ENT-209 Wednesday, 4:30pm Cloud Migration, Dev-Ops and Distributed Systems 
APP-310 Friday, 9:00am Scheduling using Apache Mesos in the Cloud
Please give us your feedback on this 
presentation

More Related Content

PDF
Resource Scheduling using Apache Mesos in Cloud Native Environments
PDF
ZooKeeper - wait free protocol for coordinating processes
PDF
Monitoring NGINX (plus): key metrics and how-to
PDF
Prezo at-mesos con2015-final
PDF
Ansible with AWS
PDF
Monitoring and Scaling Redis at DataDog - Ilan Rabinovitch, DataDog
PDF
Introduction to Meteor - revised edition
PDF
Integrating Puppet with Cloud Infrastructures-Remco Overdijk
Resource Scheduling using Apache Mesos in Cloud Native Environments
ZooKeeper - wait free protocol for coordinating processes
Monitoring NGINX (plus): key metrics and how-to
Prezo at-mesos con2015-final
Ansible with AWS
Monitoring and Scaling Redis at DataDog - Ilan Rabinovitch, DataDog
Introduction to Meteor - revised edition
Integrating Puppet with Cloud Infrastructures-Remco Overdijk

What's hot (18)

PDF
The Monitoring Playground
PPTX
Speeding up R with Parallel Programming in the Cloud
PPTX
RedisConf17 - Redis in High Traffic Adtech Stack
PPTX
Automating aws infrastructure and code deployments using Ansible @WebEngage
PDF
PPTX
R in Minecraft
PDF
Cortex: Prometheus as a Service, One Year On
PPTX
Stabilising the jenga tower
PPTX
Migrating Existing Open Source Machine Learning to Azure
PDF
Boyan Ivanov - latency, the #1 metric of your cloud
PPTX
Apache Incubator Samza: Stream Processing at LinkedIn
PPTX
Webinar patterns anti patterns
PPTX
Lifting the Blinds: Monitoring Windows Server 2012
PDF
London Hug 19/5 - Terraform in Production
PPTX
Scylla Summit 2018: Meshify - A Case Study, or Petshop Seamonsters
POTX
Mobile 3: Launch Like a Boss!
PDF
Leveraging Databricks for Spark pipelines
PPTX
MySQL Head to Head Performance
The Monitoring Playground
Speeding up R with Parallel Programming in the Cloud
RedisConf17 - Redis in High Traffic Adtech Stack
Automating aws infrastructure and code deployments using Ansible @WebEngage
R in Minecraft
Cortex: Prometheus as a Service, One Year On
Stabilising the jenga tower
Migrating Existing Open Source Machine Learning to Azure
Boyan Ivanov - latency, the #1 metric of your cloud
Apache Incubator Samza: Stream Processing at LinkedIn
Webinar patterns anti patterns
Lifting the Blinds: Monitoring Windows Server 2012
London Hug 19/5 - Terraform in Production
Scylla Summit 2018: Meshify - A Case Study, or Petshop Seamonsters
Mobile 3: Launch Like a Boss!
Leveraging Databricks for Spark pipelines
MySQL Head to Head Performance
Ad

Similar to AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud (20)

PDF
Practice and challenges from building IaaS
PDF
Survey of Percona Toolkit
PDF
Podila mesos con europe keynote aug sep 2016
PDF
Apache Flink® Meets Apache Mesos® and DC/OS
PDF
Flink Forward Berlin 2017: Jörg Schad, Till Rohrmann - Apache Flink meets Apa...
ODP
Nagios Conference 2012 - Dan Wittenberg - Case Study: Scaling Nagios Core at ...
PDF
Deep Dive on Amazon EC2 Instances (March 2017)
PDF
MesosCon 2018
PDF
Testing Persistent Storage Performance in Kubernetes with Sherlock
PDF
Podila QCon SF 2016
PDF
Performance Analysis and Optimizations of CAE Applications (Case Study: STAR_...
PPTX
Light-weighted HDFS disaster recovery
PPTX
Comparative Performance Analysis of AWS EC2 Instance Types Commonly Used for ...
PPTX
AWS SSA Webinar - Cost optimisation on AWS
PPT
Computing Outside The Box September 2009
PDF
Netflix container scheduling talk at stanford final
PDF
Reactive Stream Processing with Mantis
PDF
Tackling the Management Challenges of Server Consolidation on Multi-core Systems
PDF
Dynamic Hadoop Clusters
PDF
Upgrading your Private Cloud to Windows Server 2012 R2
Practice and challenges from building IaaS
Survey of Percona Toolkit
Podila mesos con europe keynote aug sep 2016
Apache Flink® Meets Apache Mesos® and DC/OS
Flink Forward Berlin 2017: Jörg Schad, Till Rohrmann - Apache Flink meets Apa...
Nagios Conference 2012 - Dan Wittenberg - Case Study: Scaling Nagios Core at ...
Deep Dive on Amazon EC2 Instances (March 2017)
MesosCon 2018
Testing Persistent Storage Performance in Kubernetes with Sherlock
Podila QCon SF 2016
Performance Analysis and Optimizations of CAE Applications (Case Study: STAR_...
Light-weighted HDFS disaster recovery
Comparative Performance Analysis of AWS EC2 Instance Types Commonly Used for ...
AWS SSA Webinar - Cost optimisation on AWS
Computing Outside The Box September 2009
Netflix container scheduling talk at stanford final
Reactive Stream Processing with Mantis
Tackling the Management Challenges of Server Consolidation on Multi-core Systems
Dynamic Hadoop Clusters
Upgrading your Private Cloud to Windows Server 2012 R2
Ad

Recently uploaded (20)

PDF
Understanding Forklifts - TECH EHS Solution
PDF
Softaken Excel to vCard Converter Software.pdf
PDF
Wondershare Filmora 15 Crack With Activation Key [2025
PPTX
VVF-Customer-Presentation2025-Ver1.9.pptx
PDF
How to Choose the Right IT Partner for Your Business in Malaysia
PDF
2025 Textile ERP Trends: SAP, Odoo & Oracle
PDF
medical staffing services at VALiNTRY
PPTX
Odoo POS Development Services by CandidRoot Solutions
PDF
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
PPTX
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
PPTX
Operating system designcfffgfgggggggvggggggggg
PDF
PTS Company Brochure 2025 (1).pdf.......
PPTX
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
PDF
top salesforce developer skills in 2025.pdf
PPTX
Introduction to Artificial Intelligence
PDF
Flood Susceptibility Mapping Using Image-Based 2D-CNN Deep Learnin. Overview ...
PPTX
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
PDF
Design an Analysis of Algorithms II-SECS-1021-03
PDF
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
PDF
System and Network Administration Chapter 2
Understanding Forklifts - TECH EHS Solution
Softaken Excel to vCard Converter Software.pdf
Wondershare Filmora 15 Crack With Activation Key [2025
VVF-Customer-Presentation2025-Ver1.9.pptx
How to Choose the Right IT Partner for Your Business in Malaysia
2025 Textile ERP Trends: SAP, Odoo & Oracle
medical staffing services at VALiNTRY
Odoo POS Development Services by CandidRoot Solutions
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
Operating system designcfffgfgggggggvggggggggg
PTS Company Brochure 2025 (1).pdf.......
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
top salesforce developer skills in 2025.pdf
Introduction to Artificial Intelligence
Flood Susceptibility Mapping Using Image-Based 2D-CNN Deep Learnin. Overview ...
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
Design an Analysis of Algorithms II-SECS-1021-03
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
System and Network Administration Chapter 2

AWS re:Invent 2014 talk: Scheduling using Apache Mesos in the Cloud

  • 12. System health Analytics Customer experience Insights Anomaly detection
  • 16. User Job 1 User Job 2 User Job 3 Discovery
  • 17. Apache Mesos Apache Mesos Apache Mesos MManantitsis Mantis ASG ASG ASG Fenzo Mesos Framework
  • 22. FrmWrk1 FrmWrk2 Mesos master Standby master Standby master Mesos slave FrmWrk2 executor TaTasksk Mesos slave FrmWrk1 executor TaTasksk FrmWrk2 executor Mesos slave FrmWrk1 executor TaTasksk
  • 24. Instance 1 Instance 1 Instance 2 Task A Task B Instance 1 Task A Instance 1 Task C Task D Task B Task A
  • 35. Data stream Task1 Host A Task2 Host B Task3 Host C
  • 36. Data stream Data stream Task1 Task2 Task3 Host X Task1 Host A Task2 Host B Task3 Host C
  • 41. Mantis Mantis Mantis Fenzo Mesos Framework
  • 42. Mesos slave FrmWrk1 executor FrmWrk1 executor Mesos slave Framework executor TaTasksk Task Framework executor Task
  • 43. Apache Mesos Apache Mesos Mesos Master Apache Mesos Framework Persistence
  • 44. Apache Mesos Apache Mesos Mesos Master Apache Mesos Framework Persistence
  • 45. .setName(name) .setFailoverTimeout(to) .setId(id) .setCheckpoint(true) .build();
  • 50. Heterogeneous Autoscale Visibility Plugins for Constraints, Fitness High speed
  • 52. Mesos master Mesos framework Tasks requests Available resource offers Fenzo task scheduler Persistence
  • 54. Speed Accuracy First fit assignment Optimal assignment Real world trade-offs ~ O (1) ~ O (N * M)1 1 Assuming tasks are not reassigned
  • 69.
  • 71. 2150 1950 1750 1550 1350 1150 950 750 #Hosts No bin packing used #Full #Partial #Empty 2150 1950 1750 1550 1350 1150 950 750 #Hosts With bin packing #Full #Partial #Empty
  • 72. 2050 1950 1850 1750 1650 1550 1450 #Hosts No task runtime-based packer Different runtimes Same runtimes Unused 2050 1950 1850 1750 1650 1550 1450 #Hosts Using task runtime-based packer Different runtimes Same runtimes Unused
  • 78. ASG/Cluster: mantisagent MinIdle: 8 MaxIdle: 20 CooldownSecs: 360 ASG/Cluster: mantisagent MinIdle: 8 MaxIdle: 20 CooldownSecs: 360 ASG/cluster: mantisagent MinIdle: 8 MaxIdle: 20 CooldownSecs: 360 Fenzo ScaleUp action: Cluster, N ScaleDown action: Cluster, HostList
  • 81. .withLeaseOfferExpirySecs(60) .withLeaseRejectAction( (lease) -> { mesosDriver.declineOffer(lease.getOffer().getId()); })
  • 82. .withLeaseOfferExpirySecs(60) .withLeaseRejectAction( (lease) -> { mesosDriver.declineOffer(lease.getOffer().getId()); } ) .withFitnessCalculator( BinPackingFitnessCalculators.cpuBinPacker)
  • 85. Heterogeneous Autoscale Visibility Plugins for Constraints, Fitness High speed
  • 86. Apache Mesos Apache Mesos Apache Mesos MManantitsis Mantis ASG ASG ASG Fenzo Mesos Framework
  • 88. Talk Time Title PFC-305 Wednesday, 1:15pm Embracing Failure: Fault Injection and Service Reliability BDT-403 Wednesday, 2:15pm Next Generation Big Data Platform at Netflix PFC-306 Wednesday, 3:30pm Performance Tuning EC2 DEV-309 Wednesday, 3:30pm From Asgard to Zuul, How Netflix’s proven Open Source Tools can accelerate and scale your services ARC-317 Wednesday, 4:30pm Maintaining a Resilient Front-Door at Massive Scale PFC-304 Wednesday, 4:30pm Effective Inter-process Communications in the Cloud: The Pros and Cons of Micro Services Architectures ENT-209 Wednesday, 4:30pm Cloud Migration, Dev-Ops and Distributed Systems APP-310 Friday, 9:00am Scheduling using Apache Mesos in the Cloud
  • 89. Please give us your feedback on this presentation