SlideShare a Scribd company logo
Kubernetes and Big Data
Services
@joerg_schad @gaunetes @dcos
Chris Gaun
PMM at Mesosphere /
Kubernetes Expert /
CNCF Ambasador
● Previous to that
Gartner analyst
covering public IaaS
● Kubernetes
community for 3 years
© 2018 Mesosphere, Inc. All Rights Reserved.
Mesosphere DC/OS at KubeCon EU
● Mesosphere - Platinum
Sponsor
● Many presentation:
container storage, ML,
HDFS
● Demoing smart city
application
3
Jörg Schad
Technical Community
Lead / Developer
● Core Mesos
developer at
Mesosphere
● Passions are deep
learning, distributed
data systems, and
data analytics
Bootcamp: Building Kubernetes-as-a-Service at
Scale, Anywhere
● Episode 1: Building Kubernetes-as-a-Service
at Scale
● Episode 2: Deploying Kubernetes at Scale
with DC/OS
● Episode 3: Kubernetes and Big Data
Services
● Episode 4: Operating Kubernetes at Scale
with DC/OS
● End-to-end components
and best practices
● Automated management
of Kubernetes
● Connecting Kubernetes
to Big Data services
● Delivering an entire
Kubernetes solution
6
Star / Clone Github
1. Go to Kubernetes DC/OS
quickstart
2. Search “DC/OS Kubernetes
Quickstart Github” or
https://github.com/mesospher
e/dcos-kubernetes-quickstart
3. Live demo
https://github.com/dcos/demos
/tree/master/flink-k8s/1.11
7
Sign Up For Slack
1. Slack URL: https://chat.dcos.io/
2. Join #kubernetes channel
3. OSS support / feedback
© 2017 Mesosphere, Inc. All Rights Reserved. 8
MapReduce is
crunching Data
Ancient
Times...
© 2016 Mesosphere, Inc. All Rights Reserved. 9
But then business
demanded
FAST DATA
We need to turn faster!
Today...
© 2016 Mesosphere, Inc. All Rights Reserved. 10
Fast Data
Batch Event ProcessingMicro-Batch
Days Hours Minutes Seconds Microseconds
Solves problems using predictive and prescriptive analyticsReports what has happened using descriptive analytics
Predictive User InterfaceReal-time Pricing and Routing Real-time AdvertisingBilling, Chargeback Product recommendations
© 2016 Mesosphere, Inc. All Rights Reserved. 11
The SMACK Stack
EVENTS
Ubiquitous data streams
from connected devices
INGEST
Apache Kafka
STORE
Apache Spark
ANALYZE
Apache Cassandra
ACT
Akka
Ingest millions of events
per second
Distributed & highly
scalable database
Real-time and batch
process data
Visualize data and build
data driven applications
Apache Mesos/ DC/OS
Sensors
Devices
Clients
Episode 3: Kubernetes and Big Data Services
© 2016 Mesosphere, Inc. All Rights Reserved. 13
The SMACK Stack
EVENTS
Ubiquitous data streams
from connected devices
INGEST
Apache Kafka
STORE
Apache Flink
ANALYZE
Apache Cassandra
ACT
Akka
Ingest millions of events
per second
Distributed & highly
scalable database
Real-time and batch
process data
Visualize data and build
data driven applications
Apache Mesos/ DC/OS
Sensors
Devices
Clients
© 2017 Mesosphere, Inc. All Rights Reserved. 14
Challenges
© 2018 Mesosphere, Inc. All Rights Reserved. 15
Datacenter
Typical Datacenter
siloed, over-provisioned servers,
low utilization
Kubernetes
Jenkins
Kafka
Spark
Cassandra
3 AM
Typical Datacenter
siloed, over-provisioned servers,
low utilization
Kubernetes
Jenkins
Kafka
Spark
Cassandra
DC1
DC2
DC3
3 AM
© 2017 Mesosphere, Inc. All Rights Reserved. 18
© 2018 Mesosphere, Inc. All Rights Reserved. 19
Datacenter
Typical Datacenter
siloed, over-provisioned servers,
low utilization
Mesos/ DC/OS
automated schedulers, workload multiplexing onto the
same machines
Kubernetes
Jenkins
Kafka
Spark
Cassandra
• Brings “as-a-Service”
automation to any application
technology on any
infrastructure
• Organizations Run All Types of
Container Management as-a-
Service Using Mesos:
"(Netflix) launches up to 500,000
containers and 200,000
clusters/day"
-Netflix OSS, on using Titus container
management ontop Mesos
© 2018 Mesosphere, Inc. All Rights Reserved. 21
DC/OS
PHYSICAL INFRASTRUCTURE
MICROSERVICES, CONTAINERS, & DEV TOOLS
VIRTUAL MACHINES PUBLIC CLOUDS
DATA SERVICES, MACHINE LEARNING, & AI
Security &
Compliance
Application-Aware
Automation
Multitenancy
Hybrid Cloud
Management
100+
MORE
DatacenterEdge
Datacenter and Cloud as a Single Computing Resource
Powered by Apache Mesos
20+
MORE
© 2018 Mesosphere, Inc. All Rights Reserved. 22
© 2018 Mesosphere, Inc. All Rights Reserved.
Two-level Scheduling
1. Agents advertise resources to Master
2. Master offers resources to Framework
3. Framework rejects / uses resources
4. Agent reports task status to Master
23
MESOS ARCHITECTURE
Mesos
Master
Mesos
Master
Mesos
Master
Mesos AgentMesos Agent Service
Cassandra
Executor
Cassandra
Task
Kubernetes
Scheduler
Spark
Executor
Spark
Task
Mesos AgentMesos Agent Service
Docker
Executor
Docker
Task
K8s Executor
Kubelet
Task
Marathon
Scheduler
Kafka
Scheduler
© 2018 Mesosphere, Inc. All Rights Reserved.
Distributed Systems are ...
HDFS Scheduler
© 2018 Mesosphere, Inc. All Rights Reserved.
Plans
dcos hdfs --name=hdfs plan status deploy
deploy (serial strategy) (COMPLETE)
├─ journal (serial strategy) (COMPLETE)
│ ├─ journal-0:[node] (COMPLETE)
│ ├─ journal-1:[node] (COMPLETE)
│ └─ journal-2:[node] (COMPLETE)
├─ name (serial strategy) (COMPLETE)
│ ├─ name-0:[node, zkfc] (COMPLETE)
│ └─ name-1:[node, zkfc] (COMPLETE)
└─ data (serial strategy) (COMPLETE)
├─ data-0:[node] (COMPLETE)
├─ data-1:[node] (COMPLETE)
└─ data-2:[node] (COMPLETE)
Server Server Server Server Server
Server Server Server Server Server
as-a-Service
Installation
KubernetesOne-Click
20+
MORE
Cloud Native
Services
ServerServer Server
as-a-Service
Automated Self Healing
Server Server Server Server Server
Server Server Server
KubernetesZero
Touch
20+
MORE
Cloud Native
Services
© 2018 Mesosphere, Inc. All Rights Reserved. 28
Why {Spark, HDFS, ..} on K8s today?
Kelsey Hightower
Kubernetes Thought
Leader
Ranked #1 K8s Influencer
Staff Developer
Advocate
PM & Chief Advocate
Today Big Data on K8s is more DIY
Top
Kubernetes
Advocate
© 2018 Mesosphere, Inc. All Rights Reserved. 29
SMACK Stack
Generator Display
1. Financial data created
by generator
2. Written to
Kafka topics
3. Kafka Topics
consumed by Spark or
Flink
4. Results written back into Kafka
stream (another topic)
7. Results displayed
30
Star / Clone Github
1. Go to Kubernetes DC/OS
quickstart
2. Search “DC/OS Kubernetes
Quickstart Github” or
https://github.com/mesospher
e/dcos-kubernetes-quickstart
3. Live demo
https://github.com/dcos/demos
/tree/master/flink-k8s/1.11
© 2018 Mesosphere, Inc. All Rights Reserved. 31
SMACK Stack
Generator Display
1. Financial data created
by generator
2. Written to
Kafka topics
4. Results written back into Kafka
stream (another topic)
5. Results displayed
3. Kafka Topics
consumed and analyzed
by Flink
Kubernetes Cluster
(running on top of DC/OS)
Download Now
https://mesosphere.com/resource/category/ebook/
© 2018 Mesosphere, Inc. All Rights Reserved. 33
THANK YOU!
ANY
QUESTION
S?
@dcos
users@dcos.io
/groups/8295652
/dcos
/dcos/examples
/dcos/demos
chat.dcos.io
https://github.com/mesosphere/dcos-kubernetes-quickstart
https://mesosphere.com/blog/another-kubernetes-service/

More Related Content

PDF
Apache Druid Auto Scale-out/in for Streaming Data Ingestion on Kubernetes
PDF
Extending DevOps to Big Data Applications with Kubernetes
PDF
Migrating pipelines into Docker
PDF
Novinky v Oracle Database 18c
PPTX
DevNexus 2015: Kubernetes & Container Engine
PDF
HDFS on Kubernetes—Lessons Learned with Kimoon Kim
PPTX
What's the Hadoop-la about Kubernetes?
PPTX
Operating Kubernetes at Scale (Australia Presentation)
Apache Druid Auto Scale-out/in for Streaming Data Ingestion on Kubernetes
Extending DevOps to Big Data Applications with Kubernetes
Migrating pipelines into Docker
Novinky v Oracle Database 18c
DevNexus 2015: Kubernetes & Container Engine
HDFS on Kubernetes—Lessons Learned with Kimoon Kim
What's the Hadoop-la about Kubernetes?
Operating Kubernetes at Scale (Australia Presentation)

What's hot (20)

PPTX
Java EE Modernization with Mesosphere DCOS
PPTX
Episode 2: Deploying Kubernetes at Scale
PDF
Resilient microservices with Kubernetes - Mete Atamel
PPTX
Serverless and Servicefull Applications - Where Microservices complements Ser...
PDF
Startup Case Study: Leveraging the Broad Hadoop Ecosystem to Develop World-Fi...
PDF
Leveraging docker for hadoop build automation and big data stack provisioning
PDF
Apache Spark on K8s and HDFS Security
PPTX
DevOps in Age of Kubernetes
PPTX
Hadoop on Docker
PPTX
Episode 1: Building Kubernetes-as-a-Service
PPTX
Deploy data analysis pipeline with mesos and docker
PPTX
Big Data Platform Industrialization
PPTX
Future Architecture of Streaming Analytics: Capitalizing on the Analytics of ...
PPTX
RedisConf17 - Redis Enterprise: Continuous Availability, Unlimited Scaling, S...
PPTX
SQL Server 2017 Machine Learning Services
PDF
Apache Spark on K8S Best Practice and Performance in the Cloud
PPTX
How to Use Telegraf and Its Plugin Ecosystem
PPTX
YARN Containerized Services: Fading The Lines Between On-Prem And Cloud
PPTX
Episode 4: Operating Kubernetes at Scale with DC/OS
Java EE Modernization with Mesosphere DCOS
Episode 2: Deploying Kubernetes at Scale
Resilient microservices with Kubernetes - Mete Atamel
Serverless and Servicefull Applications - Where Microservices complements Ser...
Startup Case Study: Leveraging the Broad Hadoop Ecosystem to Develop World-Fi...
Leveraging docker for hadoop build automation and big data stack provisioning
Apache Spark on K8s and HDFS Security
DevOps in Age of Kubernetes
Hadoop on Docker
Episode 1: Building Kubernetes-as-a-Service
Deploy data analysis pipeline with mesos and docker
Big Data Platform Industrialization
Future Architecture of Streaming Analytics: Capitalizing on the Analytics of ...
RedisConf17 - Redis Enterprise: Continuous Availability, Unlimited Scaling, S...
SQL Server 2017 Machine Learning Services
Apache Spark on K8S Best Practice and Performance in the Cloud
How to Use Telegraf and Its Plugin Ecosystem
YARN Containerized Services: Fading The Lines Between On-Prem And Cloud
Episode 4: Operating Kubernetes at Scale with DC/OS
Ad

Similar to Episode 3: Kubernetes and Big Data Services (20)

PDF
DevOps vs. Site Reliability Engineering (SRE) in Age of Kubernetes
PDF
Kubernetes One-Click Deployment: Hands-on Workshop (Munich)
PPTX
Enabling Microservices Frameworks to Solve Business Problems
PDF
OSDC 2018 | From batch to pipelines – why Apache Mesos and DC/OS are a soluti...
PPTX
Dealing with kubesprawl tetris style !
PDF
Kubernetes One-Click Deployment: Hands-on Workshop (Mainz)
PPTX
Webinar: Operating Kubernetes at Scale
PDF
JConWorld_ Continuous SQL with Kafka and Flink
PDF
CN Asturias - Stateful application for kubernetes
PPTX
MANTL Data Platform, Microservices and BigData Services
PDF
Flink Forward San Francisco 2018: Jörg Schad and Biswajit Das - "Operating Fl...
PDF
Jörg Schad - Hybrid Cloud (Kubernetes, Spark, HDFS, …)-as-a-Service - Codemot...
PDF
Jörg Schad - Hybrid Cloud (Kubernetes, Spark, HDFS, …)-as-a-Service - Codemot...
PDF
Kubernetes on DC/OS
PPTX
OpenEBS Technical Workshop - KubeCon San Diego 2019
PDF
Cloud-native .NET Microservices mit Kubernetes
PPTX
Doing Dropbox the Native Cloud Native Way
PPTX
Webinar: End-to-End CI/CD with GitLab and DC/OS
PDF
MayaData Datastax webinar - Operating Cassandra on Kubernetes with the help ...
PDF
Kubernetes for the PHP developer
DevOps vs. Site Reliability Engineering (SRE) in Age of Kubernetes
Kubernetes One-Click Deployment: Hands-on Workshop (Munich)
Enabling Microservices Frameworks to Solve Business Problems
OSDC 2018 | From batch to pipelines – why Apache Mesos and DC/OS are a soluti...
Dealing with kubesprawl tetris style !
Kubernetes One-Click Deployment: Hands-on Workshop (Mainz)
Webinar: Operating Kubernetes at Scale
JConWorld_ Continuous SQL with Kafka and Flink
CN Asturias - Stateful application for kubernetes
MANTL Data Platform, Microservices and BigData Services
Flink Forward San Francisco 2018: Jörg Schad and Biswajit Das - "Operating Fl...
Jörg Schad - Hybrid Cloud (Kubernetes, Spark, HDFS, …)-as-a-Service - Codemot...
Jörg Schad - Hybrid Cloud (Kubernetes, Spark, HDFS, …)-as-a-Service - Codemot...
Kubernetes on DC/OS
OpenEBS Technical Workshop - KubeCon San Diego 2019
Cloud-native .NET Microservices mit Kubernetes
Doing Dropbox the Native Cloud Native Way
Webinar: End-to-End CI/CD with GitLab and DC/OS
MayaData Datastax webinar - Operating Cassandra on Kubernetes with the help ...
Kubernetes for the PHP developer
Ad

More from Mesosphere Inc. (20)

PDF
Best Practices for Managing Kubernetes and Stateful Services: Mesosphere & Sy...
PPTX
Webinar: What's New in DC/OS 1.11
PPTX
Webinar: Déployez facilement Kubernetes & vos containers
PPTX
Webinar: Deep Learning Pipelines Beyond the Learning
PPTX
Running Distributed TensorFlow with GPUs on Mesos with DC/OS
PPTX
Manage Microservices & Fast Data Systems on One Platform w/ DC/OS
PDF
Jolt: Distributed, fault-tolerant test running at scale using Mesos
PDF
Deploying Kong with Mesosphere DC/OS
PPTX
Tech Preview: Kubernetes on Mesosphere DC/OS 1.10
PDF
Discover the all new Mesosphere DC/OS 1.10
PDF
Mesosphere & Magnetic: Take the pain out of running complex and critical serv...
PDF
Easy Docker Deployments with Mesosphere DCOS on Azure
PPTX
Mesos framework API v1
PPTX
Scaling Like Twitter with Apache Mesos
PDF
Elastic jenkins with mesos and dcos (2016 01-20)
PDF
Growing the Mesos Ecosystem
PDF
Doing Big Data for Real with Docker
PDF
Deploying Containers in Production and at Scale
PDF
Re-Platforming All the Things
PDF
Mesos Networking
Best Practices for Managing Kubernetes and Stateful Services: Mesosphere & Sy...
Webinar: What's New in DC/OS 1.11
Webinar: Déployez facilement Kubernetes & vos containers
Webinar: Deep Learning Pipelines Beyond the Learning
Running Distributed TensorFlow with GPUs on Mesos with DC/OS
Manage Microservices & Fast Data Systems on One Platform w/ DC/OS
Jolt: Distributed, fault-tolerant test running at scale using Mesos
Deploying Kong with Mesosphere DC/OS
Tech Preview: Kubernetes on Mesosphere DC/OS 1.10
Discover the all new Mesosphere DC/OS 1.10
Mesosphere & Magnetic: Take the pain out of running complex and critical serv...
Easy Docker Deployments with Mesosphere DCOS on Azure
Mesos framework API v1
Scaling Like Twitter with Apache Mesos
Elastic jenkins with mesos and dcos (2016 01-20)
Growing the Mesos Ecosystem
Doing Big Data for Real with Docker
Deploying Containers in Production and at Scale
Re-Platforming All the Things
Mesos Networking

Recently uploaded (20)

PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
cuic standard and advanced reporting.pdf
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPTX
Cloud computing and distributed systems.
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPTX
Big Data Technologies - Introduction.pptx
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
cuic standard and advanced reporting.pdf
Understanding_Digital_Forensics_Presentation.pptx
Cloud computing and distributed systems.
The Rise and Fall of 3GPP – Time for a Sabbatical?
Big Data Technologies - Introduction.pptx
Mobile App Security Testing_ A Comprehensive Guide.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
The AUB Centre for AI in Media Proposal.docx
Review of recent advances in non-invasive hemoglobin estimation
Network Security Unit 5.pdf for BCA BBA.
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Per capita expenditure prediction using model stacking based on satellite ima...
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Dropbox Q2 2025 Financial Results & Investor Presentation
Advanced methodologies resolving dimensionality complications for autism neur...
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Chapter 3 Spatial Domain Image Processing.pdf

Episode 3: Kubernetes and Big Data Services

  • 1. Kubernetes and Big Data Services @joerg_schad @gaunetes @dcos
  • 2. Chris Gaun PMM at Mesosphere / Kubernetes Expert / CNCF Ambasador ● Previous to that Gartner analyst covering public IaaS ● Kubernetes community for 3 years
  • 3. © 2018 Mesosphere, Inc. All Rights Reserved. Mesosphere DC/OS at KubeCon EU ● Mesosphere - Platinum Sponsor ● Many presentation: container storage, ML, HDFS ● Demoing smart city application 3
  • 4. Jörg Schad Technical Community Lead / Developer ● Core Mesos developer at Mesosphere ● Passions are deep learning, distributed data systems, and data analytics
  • 5. Bootcamp: Building Kubernetes-as-a-Service at Scale, Anywhere ● Episode 1: Building Kubernetes-as-a-Service at Scale ● Episode 2: Deploying Kubernetes at Scale with DC/OS ● Episode 3: Kubernetes and Big Data Services ● Episode 4: Operating Kubernetes at Scale with DC/OS ● End-to-end components and best practices ● Automated management of Kubernetes ● Connecting Kubernetes to Big Data services ● Delivering an entire Kubernetes solution
  • 6. 6 Star / Clone Github 1. Go to Kubernetes DC/OS quickstart 2. Search “DC/OS Kubernetes Quickstart Github” or https://github.com/mesospher e/dcos-kubernetes-quickstart 3. Live demo https://github.com/dcos/demos /tree/master/flink-k8s/1.11
  • 7. 7 Sign Up For Slack 1. Slack URL: https://chat.dcos.io/ 2. Join #kubernetes channel 3. OSS support / feedback
  • 8. © 2017 Mesosphere, Inc. All Rights Reserved. 8 MapReduce is crunching Data Ancient Times...
  • 9. © 2016 Mesosphere, Inc. All Rights Reserved. 9 But then business demanded FAST DATA We need to turn faster! Today...
  • 10. © 2016 Mesosphere, Inc. All Rights Reserved. 10 Fast Data Batch Event ProcessingMicro-Batch Days Hours Minutes Seconds Microseconds Solves problems using predictive and prescriptive analyticsReports what has happened using descriptive analytics Predictive User InterfaceReal-time Pricing and Routing Real-time AdvertisingBilling, Chargeback Product recommendations
  • 11. © 2016 Mesosphere, Inc. All Rights Reserved. 11 The SMACK Stack EVENTS Ubiquitous data streams from connected devices INGEST Apache Kafka STORE Apache Spark ANALYZE Apache Cassandra ACT Akka Ingest millions of events per second Distributed & highly scalable database Real-time and batch process data Visualize data and build data driven applications Apache Mesos/ DC/OS Sensors Devices Clients
  • 13. © 2016 Mesosphere, Inc. All Rights Reserved. 13 The SMACK Stack EVENTS Ubiquitous data streams from connected devices INGEST Apache Kafka STORE Apache Flink ANALYZE Apache Cassandra ACT Akka Ingest millions of events per second Distributed & highly scalable database Real-time and batch process data Visualize data and build data driven applications Apache Mesos/ DC/OS Sensors Devices Clients
  • 14. © 2017 Mesosphere, Inc. All Rights Reserved. 14 Challenges
  • 15. © 2018 Mesosphere, Inc. All Rights Reserved. 15 Datacenter Typical Datacenter siloed, over-provisioned servers, low utilization Kubernetes Jenkins Kafka Spark Cassandra
  • 16. 3 AM Typical Datacenter siloed, over-provisioned servers, low utilization Kubernetes Jenkins Kafka Spark Cassandra
  • 18. © 2017 Mesosphere, Inc. All Rights Reserved. 18
  • 19. © 2018 Mesosphere, Inc. All Rights Reserved. 19 Datacenter Typical Datacenter siloed, over-provisioned servers, low utilization Mesos/ DC/OS automated schedulers, workload multiplexing onto the same machines Kubernetes Jenkins Kafka Spark Cassandra
  • 20. • Brings “as-a-Service” automation to any application technology on any infrastructure • Organizations Run All Types of Container Management as-a- Service Using Mesos: "(Netflix) launches up to 500,000 containers and 200,000 clusters/day" -Netflix OSS, on using Titus container management ontop Mesos
  • 21. © 2018 Mesosphere, Inc. All Rights Reserved. 21 DC/OS PHYSICAL INFRASTRUCTURE MICROSERVICES, CONTAINERS, & DEV TOOLS VIRTUAL MACHINES PUBLIC CLOUDS DATA SERVICES, MACHINE LEARNING, & AI Security & Compliance Application-Aware Automation Multitenancy Hybrid Cloud Management 100+ MORE DatacenterEdge Datacenter and Cloud as a Single Computing Resource Powered by Apache Mesos 20+ MORE
  • 22. © 2018 Mesosphere, Inc. All Rights Reserved. 22
  • 23. © 2018 Mesosphere, Inc. All Rights Reserved. Two-level Scheduling 1. Agents advertise resources to Master 2. Master offers resources to Framework 3. Framework rejects / uses resources 4. Agent reports task status to Master 23 MESOS ARCHITECTURE Mesos Master Mesos Master Mesos Master Mesos AgentMesos Agent Service Cassandra Executor Cassandra Task Kubernetes Scheduler Spark Executor Spark Task Mesos AgentMesos Agent Service Docker Executor Docker Task K8s Executor Kubelet Task Marathon Scheduler Kafka Scheduler
  • 24. © 2018 Mesosphere, Inc. All Rights Reserved. Distributed Systems are ... HDFS Scheduler
  • 25. © 2018 Mesosphere, Inc. All Rights Reserved. Plans dcos hdfs --name=hdfs plan status deploy deploy (serial strategy) (COMPLETE) ├─ journal (serial strategy) (COMPLETE) │ ├─ journal-0:[node] (COMPLETE) │ ├─ journal-1:[node] (COMPLETE) │ └─ journal-2:[node] (COMPLETE) ├─ name (serial strategy) (COMPLETE) │ ├─ name-0:[node, zkfc] (COMPLETE) │ └─ name-1:[node, zkfc] (COMPLETE) └─ data (serial strategy) (COMPLETE) ├─ data-0:[node] (COMPLETE) ├─ data-1:[node] (COMPLETE) └─ data-2:[node] (COMPLETE)
  • 26. Server Server Server Server Server Server Server Server Server Server as-a-Service Installation KubernetesOne-Click 20+ MORE Cloud Native Services
  • 27. ServerServer Server as-a-Service Automated Self Healing Server Server Server Server Server Server Server Server KubernetesZero Touch 20+ MORE Cloud Native Services
  • 28. © 2018 Mesosphere, Inc. All Rights Reserved. 28 Why {Spark, HDFS, ..} on K8s today? Kelsey Hightower Kubernetes Thought Leader Ranked #1 K8s Influencer Staff Developer Advocate PM & Chief Advocate Today Big Data on K8s is more DIY Top Kubernetes Advocate
  • 29. © 2018 Mesosphere, Inc. All Rights Reserved. 29 SMACK Stack Generator Display 1. Financial data created by generator 2. Written to Kafka topics 3. Kafka Topics consumed by Spark or Flink 4. Results written back into Kafka stream (another topic) 7. Results displayed
  • 30. 30 Star / Clone Github 1. Go to Kubernetes DC/OS quickstart 2. Search “DC/OS Kubernetes Quickstart Github” or https://github.com/mesospher e/dcos-kubernetes-quickstart 3. Live demo https://github.com/dcos/demos /tree/master/flink-k8s/1.11
  • 31. © 2018 Mesosphere, Inc. All Rights Reserved. 31 SMACK Stack Generator Display 1. Financial data created by generator 2. Written to Kafka topics 4. Results written back into Kafka stream (another topic) 5. Results displayed 3. Kafka Topics consumed and analyzed by Flink Kubernetes Cluster (running on top of DC/OS)
  • 33. © 2018 Mesosphere, Inc. All Rights Reserved. 33 THANK YOU! ANY QUESTION S? @dcos users@dcos.io /groups/8295652 /dcos /dcos/examples /dcos/demos chat.dcos.io https://github.com/mesosphere/dcos-kubernetes-quickstart https://mesosphere.com/blog/another-kubernetes-service/