Distributed scheduler hell
Story of ..
How we moved 100(s) of virtual machines, onto containers
What is a distributed scheduler
Your cloud provider: Digital Ocean, AWS, Google.
For containers: Kubernetes, Nomad, Mesos
Why do we care?
Microservices...
Vulcan
Distributed timeseries database.
open source https://github.com/digitalocean/vulcan
Requirements
4 3 Gbits a second of Network traffic
4 20 TB Storage
4 Sub 100ms Read times
4 100k writes a second
Distributed scheduler hell (MicroXChg 2017 Berlin)
Single process
Hypervisor
Linux Scheduler
Kernel Provides
4 Virtual Memory
4 Process Isolation
4 Disk storage
4 Networking
4 CPU scheduling
Distributed Scheduler Provides
4 Container Deploy
4 Virtual Machine Deploy
4 Memory Quota
4 Disk storage
4 Networking
4 CPU scheduling
4 Scaling Instances
Mesos
Distributed scheduler hell (MicroXChg 2017 Berlin)
Distributed scheduler hell (MicroXChg 2017 Berlin)
Deploying our App On Mesos
Distributed scheduler hell (MicroXChg 2017 Berlin)
Marathon Architecture
Kafka
Custom Scheduler
Cassandra
Pinning to specific Mesos nodes
Mesos Failure Modes
Network Partition FAIL
Counter Example: Nomad
Nomad Architecture
Kubernetes
Distributed scheduler hell (MicroXChg 2017 Berlin)
Custom Json Language
{
"application": {
"name": "timeseries-ingestor",
"scale": 1,
"ingresses": {
"timeseries-ingestor-health": {
"scheme": "http",
"container_port": 8001
}
},
"containers": {
"timeseries-ingestor": {
"image": "docker.com/timeseries/ingestor",
"image_tag": "fcb24ca",
"ports": [
8001,
9090
],
"env": {
"CASSANDRA_ADDRESS_LIST": ""
},
"resources": {
"cpu": "4",
"memory": "4000"
}
}
},
"metrics": {
"port": 8001,
"path": "/debug/metrics"
}
},
"maintainer": "dummy@digitalocean.com"
}
Command Line Deploys
docc --contexts dev_env deploy timeseries-microservice1.json
Docc is our internal Kubernetes tool
Deployment tool with diffs to Kubernetes
"env": {
+++ "CASSANDRA_CQL_VERSION": "3.1.7",
--- "CASSANDRA_KEYSPACE": "staging_vulcan",
+++ "CASSANDRA_KEYSPACE": "staging_vulcan_123",
Simliar to KubeDiff
Final Architecture
Distributed scheduler hell (MicroXChg 2017 Berlin)
Load balancing
Pushing 3 Gbs to kubernetes using Flannel
Metrics
Distributed scheduler hell (MicroXChg 2017 Berlin)
Logging
Distributed scheduler hell (MicroXChg 2017 Berlin)
Upsides to Distributed Schedulers
How to choice your abstractation
Alles hat ein Ende nur die Wurst hat
zwei
('everything has one end, only the sausage has two')

More Related Content

PPTX
Seastar at Linux Foundation Collaboration Summit
PDF
Service Discovery in Prometheus
PDF
Monitoring with Prometheus
PDF
Seastar @ NYCC++UG
PDF
Solving some of the scalability problems at booking.com
PDF
Chronix as Long-Term Storage for Prometheus
PDF
Anatomy of an action
PDF
OSv presentation from Linux Foundation Collaboration Summit
Seastar at Linux Foundation Collaboration Summit
Service Discovery in Prometheus
Monitoring with Prometheus
Seastar @ NYCC++UG
Solving some of the scalability problems at booking.com
Chronix as Long-Term Storage for Prometheus
Anatomy of an action
OSv presentation from Linux Foundation Collaboration Summit

What's hot (20)

PDF
Максим Барышиков-«WoT: Geographically distributed cluster of clusters»
PDF
Grafana is not enough: DIY user interfaces for Prometheus
PPTX
Speeding up R with Parallel Programming in the Cloud
PDF
Openstack Scheduler and Scalability Issue
PPTX
Developing Scylla Applications: Practical Tips
PDF
Automatic Operation Bot for Ceph - You Ji
PDF
Resource Scheduling using Apache Mesos in Cloud Native Environments
PDF
Doing QoS Before Ceph Cluster QoS is available - David Byte, Alex Lau
PDF
Scaling metrics
PDF
1027 predictive models in 10 seconds, by David Pardo Villaverde, Corunet
PPTX
HighLoad Solutions On MySQL / Xiaobin Lin (Alibaba)
PDF
Build a Complex, Realtime Data Management App with Postgres 14!
PDF
RBD: What will the future bring? - Jason Dillaman
PDF
Solr on Docker - the Good, the Bad and the Ugly
PDF
Azure Functions - Get rid of your servers, use functions!
PDF
Cassandra Day Atlanta 2015: Recording the Web: High-Fidelity Storage and Play...
PDF
Cassandra at teads
PDF
On heap cache vs off-heap cache
PDF
Odoo Performance Limits
PPTX
RedisConf17 - Redis in High Traffic Adtech Stack
Максим Барышиков-«WoT: Geographically distributed cluster of clusters»
Grafana is not enough: DIY user interfaces for Prometheus
Speeding up R with Parallel Programming in the Cloud
Openstack Scheduler and Scalability Issue
Developing Scylla Applications: Practical Tips
Automatic Operation Bot for Ceph - You Ji
Resource Scheduling using Apache Mesos in Cloud Native Environments
Doing QoS Before Ceph Cluster QoS is available - David Byte, Alex Lau
Scaling metrics
1027 predictive models in 10 seconds, by David Pardo Villaverde, Corunet
HighLoad Solutions On MySQL / Xiaobin Lin (Alibaba)
Build a Complex, Realtime Data Management App with Postgres 14!
RBD: What will the future bring? - Jason Dillaman
Solr on Docker - the Good, the Bad and the Ugly
Azure Functions - Get rid of your servers, use functions!
Cassandra Day Atlanta 2015: Recording the Web: High-Fidelity Storage and Play...
Cassandra at teads
On heap cache vs off-heap cache
Odoo Performance Limits
RedisConf17 - Redis in High Traffic Adtech Stack
Ad

Viewers also liked (12)

PDF
Cloud in your Cloud
PDF
Making Wallstreet talk with GO (GO India Conference 2015)
PDF
DigitalOcean Microservices Talk Rocket Internet Conf '16
PDF
AliExpress’ Way to Microservices - microXchg 2017
PDF
presentation-chaos-monkey
PDF
Breaking Prometheus (Promcon Berlin '16)
KEY
NS2--Event Scheduler
PDF
Microservice Websites (microXchg 2017)
PDF
microXchg 2017: "Microservices: The People and Organisational Impact"
PDF
Continuous delivery of microservices with kubernetes - Quintor 27-2-2017
PDF
2017-03-11 02 Денис Нелюбин. Docker & Ansible - лучшие друзья DevOps
PDF
2017-03-11 01 Игорь Родионов. Docker swarm vs Kubernetes
Cloud in your Cloud
Making Wallstreet talk with GO (GO India Conference 2015)
DigitalOcean Microservices Talk Rocket Internet Conf '16
AliExpress’ Way to Microservices - microXchg 2017
presentation-chaos-monkey
Breaking Prometheus (Promcon Berlin '16)
NS2--Event Scheduler
Microservice Websites (microXchg 2017)
microXchg 2017: "Microservices: The People and Organisational Impact"
Continuous delivery of microservices with kubernetes - Quintor 27-2-2017
2017-03-11 02 Денис Нелюбин. Docker & Ansible - лучшие друзья DevOps
2017-03-11 01 Игорь Родионов. Docker swarm vs Kubernetes
Ad

Similar to Distributed scheduler hell (MicroXChg 2017 Berlin) (20)

PDF
Fully fault tolerant real time data pipeline with docker and mesos
PDF
OSDC 2015: Bernd Mathiske | Why the Datacenter Needs an Operating System
PPTX
Enabling Microservices Frameworks to Solve Business Problems
PDF
[AWS Dev Day] 앱 현대화 | AWS Fargate를 사용한 서버리스 컨테이너 활용 하기 - 삼성전자 개발자 포털 사례 - 정영준...
PDF
From Monolith to Docker Distributed Applications
PPTX
Containerization - The DevOps Revolution
PDF
From Monolith to Docker Distributed Applications. JavaOne
PPTX
DevOps in Age of Kubernetes
PDF
AWS reinvent 2019 recap - Riyadh - Containers and Serverless - Paul Maddox
PPTX
Weave User Group Talk - DockerCon 2017 Recap
PPT
Docker based Hadoop provisioning - Hadoop Summit 2014
PPTX
On Docker and its use for LHC at CERN
PDF
Containers Meetup (AWS+CNCF) Milano Jan 15th 2020
PPTX
State of the Container Ecosystem
PPTX
Techdays SE 2016 - Micros.. err Microcosmos
PPTX
Re:invent 2016 Container Scheduling, Execution and AWS Integration
PPTX
Deploy data analysis pipeline with mesos and docker
PPTX
Docker Devoxx UK - Never mind the bollocks here's the Linux Containers
PDF
Microservice Architecture on AWS using AWS Lambda and Docker Containers
PDF
Inside Triton, July 2015
Fully fault tolerant real time data pipeline with docker and mesos
OSDC 2015: Bernd Mathiske | Why the Datacenter Needs an Operating System
Enabling Microservices Frameworks to Solve Business Problems
[AWS Dev Day] 앱 현대화 | AWS Fargate를 사용한 서버리스 컨테이너 활용 하기 - 삼성전자 개발자 포털 사례 - 정영준...
From Monolith to Docker Distributed Applications
Containerization - The DevOps Revolution
From Monolith to Docker Distributed Applications. JavaOne
DevOps in Age of Kubernetes
AWS reinvent 2019 recap - Riyadh - Containers and Serverless - Paul Maddox
Weave User Group Talk - DockerCon 2017 Recap
Docker based Hadoop provisioning - Hadoop Summit 2014
On Docker and its use for LHC at CERN
Containers Meetup (AWS+CNCF) Milano Jan 15th 2020
State of the Container Ecosystem
Techdays SE 2016 - Micros.. err Microcosmos
Re:invent 2016 Container Scheduling, Execution and AWS Integration
Deploy data analysis pipeline with mesos and docker
Docker Devoxx UK - Never mind the bollocks here's the Linux Containers
Microservice Architecture on AWS using AWS Lambda and Docker Containers
Inside Triton, July 2015

Recently uploaded (20)

PPTX
Benefits of Physical activity for teenagers.pptx
PDF
August Patch Tuesday
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PDF
Taming the Chaos: How to Turn Unstructured Data into Decisions
PDF
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
PDF
STKI Israel Market Study 2025 version august
PPT
What is a Computer? Input Devices /output devices
PDF
A comparative study of natural language inference in Swahili using monolingua...
PPTX
O2C Customer Invoices to Receipt V15A.pptx
PDF
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
Getting started with AI Agents and Multi-Agent Systems
PDF
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
PPTX
Final SEM Unit 1 for mit wpu at pune .pptx
PDF
WOOl fibre morphology and structure.pdf for textiles
PPTX
Modernising the Digital Integration Hub
PPTX
Web Crawler for Trend Tracking Gen Z Insights.pptx
PPT
Module 1.ppt Iot fundamentals and Architecture
Benefits of Physical activity for teenagers.pptx
August Patch Tuesday
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
Taming the Chaos: How to Turn Unstructured Data into Decisions
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
STKI Israel Market Study 2025 version august
What is a Computer? Input Devices /output devices
A comparative study of natural language inference in Swahili using monolingua...
O2C Customer Invoices to Receipt V15A.pptx
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
Group 1 Presentation -Planning and Decision Making .pptx
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
Zenith AI: Advanced Artificial Intelligence
Getting started with AI Agents and Multi-Agent Systems
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
Final SEM Unit 1 for mit wpu at pune .pptx
WOOl fibre morphology and structure.pdf for textiles
Modernising the Digital Integration Hub
Web Crawler for Trend Tracking Gen Z Insights.pptx
Module 1.ppt Iot fundamentals and Architecture

Distributed scheduler hell (MicroXChg 2017 Berlin)