DOING BIG DATA FOR
REAL WITH DOCKER
MESOSPHERE DCOS
Elizabeth Lingg
elizabeth@mesosphere.io
AGENDA
1. Intro
2. Mesosphere, Docker, and DCOS Overview
3. Big Data Container Orchestration using DCOS and Docker
4. Demo
5. Q & A
INTRO
Engineering Manager @ Mesosphere
M.S. Computer Science with a Specialization in Artificial
Intelligence from Stanford
B.S. Computer Science with a Minor in Math, B.S. Policy
and Management from Carnegie Mellon
Experience in AI, Big Data, and Systems
Enjoys applying Distributed Systems to Manage and
Reason Over Large Amounts of Data
MESOS
Provides primitives to author datacenter-native apps.
PRIMITIVES
Resources (cpu, mem, disk, ports)
Asset fetching
Task state tracking
API for the datacenter
STATUS QUO IS STATIC
PARTITIONING
AND USE OF VIRTUAL MACHINES
MESOS LETS US TREAT A CLUSTER OF
NODES...
AS ONE BIG COMPUTER

Not as individual
machines

Not as VMs
BUT AS COMPUTATIONAL
RESOURCES LIKE CORES, MEMORY,
DISKS, ETC.
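A minimal sketch of the "one big computer" idea, assuming a toy cluster of two nodes. The node names, the `Resources` type, and the first-fit `place` helper are illustrative assumptions; this is not how Mesos itself allocates resources, only a way to picture pooled cores, memory, and disk.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class Resources:
    cpus: float
    mem: int   # MiB
    disk: int  # MiB

    def fits(self, need: "Resources") -> bool:
        """True if this pool can satisfy the requested amounts."""
        return (self.cpus >= need.cpus
                and self.mem >= need.mem
                and self.disk >= need.disk)

    def subtract(self, need: "Resources") -> None:
        """Reserve the requested amounts from this pool."""
        self.cpus -= need.cpus
        self.mem -= need.mem
        self.disk -= need.disk


def total(nodes: Dict[str, Resources]) -> Resources:
    """Sum the free resources of every node: the cluster seen as one computer."""
    return Resources(
        cpus=sum(n.cpus for n in nodes.values()),
        mem=sum(n.mem for n in nodes.values()),
        disk=sum(n.disk for n in nodes.values()),
    )


def place(nodes: Dict[str, Resources], need: Resources) -> Optional[str]:
    """First-fit placement (an assumed, simplified policy).

    Returns the name of the node that received the task, or None.
    """
    for name, free in nodes.items():
        if free.fits(need):
            free.subtract(need)
            return name
    return None


# Hypothetical two-node cluster.
cluster = {
    "node-1": Resources(cpus=4.0, mem=8192, disk=51200),
    "node-2": Resources(cpus=8.0, mem=16384, disk=102400),
}
```

A caller asks for cores, memory, and disk, and never names a machine or a VM; `place` decides where the task lands.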
WE LOVE CONTAINERS
MOST MODERN APPLICATIONS ARE A WEB OF
CONTAINERS
A CONTAINER ORCHESTRATION PLATFORM
Containerization in Mesos, a brief history
MESOSPHERE DCOS
Software to provide a complete OS: init, cron, apt-get,
discovery, routing
Beautiful web UI and CLI
Support
Ecosystem of DCOS Services
Mesos Master and Mesos Workers Running in Docker
Containers
DCOS UI
DCOS CLI
$ dcos
Command line utility for the Mesosphere Datacenter Operating
System (DCOS). The Mesosphere DCOS is a distributed operating
system built around Apache Mesos. This utility provides tools
for easy management of a DCOS installation.
Available DCOS commands:
config Get and set DCOS CLI configuration properties
help Display command line usage information
marathon Deploy and manage applications on the DCOS
node Manage DCOS nodes
package Install and manage DCOS software packages
service Manage DCOS services
task Manage DCOS tasks
BIG DATA DISTRIBUTED
APPLICATIONS ON DCOS
Mesos Master and Mesos Workers Running in Docker
Containers
Distributed Applications Running in Containers on the
Mesos Workers
Container Orchestration done by Apache Mesos
Resource Allocation and Scaling Managed by Apache
Mesos
BIG DATA DISTRIBUTED
APPLICATIONS ON DCOS
Popular Distributed Apps easily deployed on a single
DCOS Cluster
Kafka, Cassandra, HDFS, Spark, and other Big Data
Services
Health checks and failure recovery are automated
APPLICATION NETWORKING
Use the CLI or REST APIs to interact with the
services
Mesos DNS Resolution
Docker networking is mainly done through host-mode
networking, which works seamlessly
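A minimal sketch of name-based service lookup, assuming the Mesos-DNS convention of `<task>.<framework>.<domain>` with `mesos` as the domain. The task name `web` and port 8080 in the usage line are hypothetical, and `resolve` only returns an address when run on a machine that uses the cluster's DNS.

```python
import socket
from typing import Optional


def mesos_dns_name(task: str, framework: str = "marathon",
                   domain: str = "mesos") -> str:
    """Build a hostname of the form <task>.<framework>.<domain>."""
    return f"{task}.{framework}.{domain}"


def service_url(task: str, port: int, framework: str = "marathon") -> str:
    """Build an HTTP URL for a task reachable on the given port."""
    return f"http://{mesos_dns_name(task, framework)}:{port}"


def resolve(hostname: str) -> Optional[str]:
    """Resolve a hostname to an IPv4 address, or None if lookup fails."""
    try:
        return socket.gethostbyname(hostname)
    except socket.gaierror:
        return None


# Hypothetical task name and port, for illustration only.
print(service_url("web", 8080))
```

With host-mode networking the task shares the worker's network stack, so the resolved address is the worker node's own address.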
DATA SECURITY
Services storing secure data run on private worker nodes
in the cluster
Private nodes can only be accessed through VPN
As needed, services choose what is exposed through a
proxy running on a public node
Distributed applications can authenticate with the Master
using framework authentication (Kerberos option)
EXAMPLE: SIMPLE DOCKER APP ON
DCOS
{
  "id": "/mesosphere/cd-demo-app",
  "instances": 1,
  "cpus": 1,
  "mem": 512,
  "container": {
    "type": "DOCKER",
    "docker": {
      "image": "mesosphere/cd-demo-app:$tag",
      "network": "BRIDGE",
      "portMappings": [
        {
          "servicePort": 28080,
          "containerPort": 80,
          "hostPort": 0,
          "protocol": "tcp"
        }
      ]
    }
  }
}
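A sketch of how an app definition like the one on this slide could be submitted programmatically, assuming Marathon's `POST /v2/apps` endpoint. The base URL `marathon.example.com:8080` and the tag value `latest` are placeholders, not values from the talk; the request is built but deliberately not sent.

```python
import json
import urllib.request

# Hypothetical Marathon address; replace with your cluster's endpoint.
MARATHON_URL = "http://marathon.example.com:8080"


def build_app(tag: str) -> dict:
    """Return the app definition from the slide with $tag filled in."""
    return {
        "id": "/mesosphere/cd-demo-app",
        "instances": 1,
        "cpus": 1,
        "mem": 512,
        "container": {
            "type": "DOCKER",
            "docker": {
                "image": f"mesosphere/cd-demo-app:{tag}",
                "network": "BRIDGE",
                "portMappings": [
                    {
                        "servicePort": 28080,
                        "containerPort": 80,
                        "hostPort": 0,  # 0 asks for a dynamically assigned host port
                        "protocol": "tcp",
                    }
                ],
            },
        },
    }


def build_request(app: dict,
                  base_url: str = MARATHON_URL) -> urllib.request.Request:
    """Build (but do not send) the HTTP request that creates the app."""
    return urllib.request.Request(
        url=f"{base_url}/v2/apps",
        data=json.dumps(app).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_request(build_app("latest"))
print(request.get_method(), request.full_url)
# urllib.request.urlopen(request) would submit it to a reachable Marathon.
```

Scaling the app is then a matter of changing `instances` and resubmitting the definition.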
EXAMPLE: CASSANDRA DCOS
SERVICE
FEATURES
Managed node configuration
Health Monitoring
REST API
DNS Names for nodes
Multiple Rings in one cluster
INSTALL
$ dcos package install cassandra
CUSTOMIZABLE INSTALL OPTIONS
{
  "cassandra": {
    "cluster-name": "dev",
    "resources": {
      "cpus": 3.0,
      "mem": 6144,
      "disk": 30720
    }
  }
}
$ dcos package install cassandra --options=options.json
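A small sketch of generating `options.json` from code and assembling the install command shown above. The helper names `write_options` and `install_command` are illustrative; actually running the command assumes the `dcos` CLI is installed and pointed at a cluster, so it is left commented out.

```python
import json
import tempfile
from pathlib import Path
from typing import List


def write_options(path: Path, cluster_name: str,
                  cpus: float, mem: int, disk: int) -> Path:
    """Write a Cassandra options file with the layout shown on the slide."""
    options = {
        "cassandra": {
            "cluster-name": cluster_name,
            "resources": {"cpus": cpus, "mem": mem, "disk": disk},
        }
    }
    path.write_text(json.dumps(options, indent=2))
    return path


def install_command(options_path: Path) -> List[str]:
    """Return the argument list for the install command from the slide."""
    return ["dcos", "package", "install", "cassandra",
            f"--options={options_path}"]


with tempfile.TemporaryDirectory() as tmp:
    options_file = write_options(Path(tmp) / "options.json",
                                 cluster_name="dev",
                                 cpus=3.0, mem=6144, disk=30720)
    command = install_command(options_file)
    print(" ".join(command))
    # import subprocess; subprocess.run(command, check=True)
    # would run the install where the dcos CLI is available.
```

Generating the file per environment keeps values such as the cluster name and resource sizes in one place.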
INSTALLING
HEALTHY
REST API
GET /node/all
GET /health/cluster/report
POST /node/{node}/replace
POST /cluster/repair/start
POST /scale/nodes?nodeCount={count}
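A sketch of a thin client for the endpoints listed above. It only builds the HTTP method and URL for each call; the class name, the base URL `cassandra.example.com:8080`, and the node identifier in the usage lines are hypothetical, and request or response bodies are not modelled because the slide does not show them.

```python
from typing import Tuple
from urllib.parse import quote, urlencode

Call = Tuple[str, str]  # (HTTP method, full URL)


class CassandraServiceApi:
    """Builds method/URL pairs for the REST endpoints on the slide."""

    def __init__(self, base_url: str) -> None:
        self.base_url = base_url.rstrip("/")

    def list_nodes(self) -> Call:
        return ("GET", f"{self.base_url}/node/all")

    def cluster_health(self) -> Call:
        return ("GET", f"{self.base_url}/health/cluster/report")

    def replace_node(self, node: str) -> Call:
        # The node identifier is URL-escaped before being placed in the path.
        return ("POST", f"{self.base_url}/node/{quote(node, safe='')}/replace")

    def start_repair(self) -> Call:
        return ("POST", f"{self.base_url}/cluster/repair/start")

    def scale(self, node_count: int) -> Call:
        query = urlencode({"nodeCount": node_count})
        return ("POST", f"{self.base_url}/scale/nodes?{query}")


# Hypothetical service address and node identifier.
api = CassandraServiceApi("http://cassandra.example.com:8080/")
print(api.scale(5))
print(api.replace_node("node-0"))
```

Each pair can be handed to any HTTP library to perform the actual call.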
DEMO!
Q & A
THANKS!
LET'S CHAT!
WE'RE HIRING!
DCOS: mesosphere.com
Join: mesosphere.com/careers/
