Data focused Docker clustering
Luke Marsden
Founder, ClusterHQ
clusterhq.com
github.com/clusterhq/flocker
Background:
Container clusters where volumes
are a first class citizen
Motivations:
robustness
scalability
eliminate SPOFs
reduce complexity
Modern applications
are complex distributed systems
Containers and PaaS today
capture the easy, data-free bits
We should aim to capture the entire app
enabling larger class of apps to move to containers
Goals
Multi-node/clustered
Ease of operational manageability
Homogeneous servers
No single point of failure
Multi-region
Data services the cluster depends on must be
protected by/within the system itself
Containers should be portable, even when
they have state
(corollary: the system should expose
portable, resilient volumes)
Dependent services or applications
should not need to be reconfigured
when a service moves around
Stateful things scale
vertically, stateless things
scale horizontally
Proposed
implementation
Management layer
Strongly consistent group messaging semantics
Distributed consensus protocols
No single master
Storage
ZFS: storage analogue to containers for compute
Independently snapshotted and replicated
Sharding container volumes across a cluster
Redundancy invariant
Automatic slave allocation
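
One way to read the redundancy invariant: every volume keeps a fixed number of replicas ("slaves") on live nodes other than its master, and the cluster allocates replacements when a node disappears. The sketch below is a minimal Python illustration under that assumption. The function name, the `copies` parameter, and the least-loaded placement rule are hypothetical and are not taken from Flocker.

```python
from collections import Counter


def allocate_slaves(masters, slaves, live_nodes, copies=1):
    """Return a slave map that restores the redundancy invariant where possible.

    masters:    dict of volume -> node holding the writable copy
    slaves:     dict of volume -> set of nodes holding replicas
    live_nodes: set of nodes currently in the cluster
    copies:     replicas wanted per volume (assumed policy, not from the slides)
    """
    # Count replicas already hosted on each live node, so new replicas
    # can be placed on the least-loaded candidates.
    load = Counter()
    for nodes in slaves.values():
        for node in nodes:
            if node in live_nodes:
                load[node] += 1

    result = {}
    for volume in sorted(masters):
        master = masters[volume]
        # Keep replicas that are still alive and not co-located with the master.
        kept = {n for n in slaves.get(volume, set())
                if n in live_nodes and n != master}
        candidates = sorted(live_nodes - kept - {master},
                            key=lambda n: (load[n], n))
        # Allocate replacements until the invariant holds or nodes run out.
        while len(kept) < copies and candidates:
            chosen = candidates.pop(0)
            kept.add(chosen)
            load[chosen] += 1
        result[volume] = kept
    return result
```

For example, if node `c` held the only replicas of two volumes and then failed, calling `allocate_slaves` with the remaining live nodes would give each volume a new replica on a surviving node that is not its master.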
Networking
AwesomeProxy
any request to any node
Routes vs. Links
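
The "any request to any node" idea can be sketched as a routing table that every node shares: a request arriving anywhere is either served locally or forwarded to the node that currently hosts the container. The Python below is an illustrative sketch only. The class and method names are hypothetical, and the slides do not describe how the proxy is actually implemented.

```python
class RoutingTable:
    """Cluster-wide map of public port -> (node, container port). Hypothetical."""

    def __init__(self):
        self._routes = {}

    def set_route(self, public_port, node, container_port):
        # Called when a container starts or moves; dependants keep using
        # the same public port, so they need no reconfiguration.
        self._routes[public_port] = (node, container_port)

    def resolve(self, local_node, public_port):
        """Decide what the node that received the request should do."""
        node, container_port = self._routes[public_port]
        action = "local" if node == local_node else "forward"
        return action, node, container_port
```

With a route for port 5432 pointing at `node1`, a request that lands on `node2` resolves to a forward to `node1`. After the container moves and the route is updated to `node2`, the same request is served locally.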
Live migration
Pausing in flight requests
Quiesce filesystem
Replicate last snapshot
Unleash requests
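
The four live-migration steps above can be written out as an ordered plan. This is a hedged Python sketch that only builds the plan and runs nothing. The `zfs snapshot`, `zfs send -i` and `zfs receive` commands are standard ZFS tooling, but the slides do not say which commands are used, so their use here is an assumption. The pause, quiesce and unleash steps are placeholders for hypothetical proxy and container hooks, and the function and parameter names are illustrative.

```python
def migration_plan(dataset, prev_snap, new_snap, target_host):
    """Build the ordered steps for moving one volume to another node.

    dataset:     ZFS dataset backing the container volume, e.g. "pool/db"
    prev_snap:   name of the last snapshot already replicated to the target
    new_snap:    name for the final snapshot taken once writes have stopped
    target_host: node that will run the container after the move
    """
    previous = f"{dataset}@{prev_snap}"
    final = f"{dataset}@{new_snap}"
    return [
        # 1. Hold new requests at the proxy instead of failing them.
        ("pause", "hold in-flight requests at the proxy (hypothetical hook)"),
        # 2. Stop writes so the final snapshot is consistent.
        ("quiesce", "stop the container writing to the volume (hypothetical hook)"),
        # 3. Take the final snapshot and send only the delta since the
        #    previous one; the send output is piped into receive over ssh.
        ("snapshot", ["zfs", "snapshot", final]),
        ("replicate", (["zfs", "send", "-i", previous, final],
                       ["ssh", target_host, "zfs", "receive", dataset])),
        # 4. Point the route at the new node and release the held requests.
        ("unleash", "update the route and release held requests (hypothetical hook)"),
    ]
```

Because earlier snapshots are assumed to have been replicated already, only the final incremental snapshot has to cross the network while requests are paused, which keeps the pause short.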
Thank you - questions?
Check it out!
github.com/clusterhq/flocker
clusterhq.com

Editor's Notes

  • #5: So let’s look at what that means today by taking a sample modern application. There’s a huge macro trend towards modern applications being complicated distributed systems, as shown in this example. (And this example doesn’t even include monitoring, logging, caching or lock servers.) These sophisticated apps consist of lots of moving parts - typically, a stateless application tier, here shown on the right, and a data tier, shown on the left, indicated with a database icon. Some parts are horizontally scalable, as indicated by double-boxes. Current platforms do not make it easy for you to move these data services, and certainly not automatically. There’s this hugely important class of data services which (of course) depend upon having a resilient writable file system.
  • #6: Containers - and PaaS in general - are great at capturing the stateless (data-free) parts of applications. When an application is well designed according to Heroku’s guidelines, known as the 12-factor manifesto, the application tier is stateless - which btw was a great business move by Heroku, to avoid having to do the hard work. :) Docker has done a great job of helping people to agree on how to encapsulate their stateless app tiers so that they run in exactly the same way on your laptop, your staging server, or in production. This is the whole shipping container metaphor that everyone’s super excited about. However, these complex applications depend on stateful services too. In a public IaaS or PaaS, there exist external/outsourced data services, but these don’t exist in a standardised or consistent way in a private PaaS, let alone on your developer’s laptop, and that’s a huge problem for devops because it means they can’t guarantee consistency between dev and production. We’re here to change that. You can actually containerise these data services today, but as soon as you do, the data gets locked to the server it’s on, and from our years of experience doing ops we know that’s no good because every server fails eventually. Container platforms today have no answer to the problems of data portability, let alone resilience, failover and versioning, all of which are critically important problems to solve for critical data in challenging production environments. In other words, containers are great for dev environment consistency but they’re not ready yet for ops people who want to use them in production. Docker and the Deis and Flynn crowd are super early, and even the more established players like CloudFoundry and OpenShift don’t have answers to these problems. As Adrian says, “watch the hand-wave”. They only tackle the stateless app tier, which is a small component of just about every real application, because real applications have databases, queues and key value stores in them!
  • #7: The technology that we bring to the table, backed by 5 years of hard computer science, R&D and 7 patents, is how you make that data tier portable too, using resilient distributed storage, as well as how to encapsulate the whole application, including the data parts, in a self-organising cluster with no single point of failure. This is THE big problem that needs to be solved before Docker can be used to put data and applications together into production.