SlideShare a Scribd company logo
Javascript addict,
Experienced as technical
leader and being focused on
building high-quality,
distributed, scalable
applications.
VALI MALINOIU @0X4139
HOW TO - WITH CASSANDRA
★
Scaling with DOCKER
★
Clustering with COREOS
★
Querying with PRESTO
SCALING - DOCKER
★
What is docker ?
★
Why should i use it ?
★
How does it help me ?
Docker is an open platform
for developers and sysadmins
to build, ship, and run
distributed applications
WHAT IS DOCKER???
DOCKER
★ Why developers like it?
★ Why sysadmins like it?
★ Docker vs VM (VBox, VMWARE)
FOR DEVELOPERS
• Any language using any toolchain
•“Dockerized” apps are completely portable and can run
anywhere
•No more apt-get install, yum install
•Get started quickly using one of the 13000+ apps on
Docker Hub
FOR SYSADMINS
•Provides standardized environments for QA,DEV,PROD
•Reduces “works on my machine” finger pointing
•Deploy and run any app on any infrastructure, quickly and
reliable
Each virtualized application
includes not only the
application - which may be
10s of MBS and the necessary
binaries and libraries but also
the entire guest operating
system - which may weigh
10’s of GB
VIRTUAL MACHINES
The docker Engine container
comprises just the application
and its dependencies. It runs
as an isolated process in
userspace on the host
operating system, sharing the
kernel with other containers.
DOCKER
DOCKER DEMO
CLUSTER - COREOS
★
What is cores ?
★
Why should i use it ?
★
How does it help me ?
CoreOS is Linux for Massive
Server Deployments
WHAT IS COREOS???
COREOS
★ ETCD
★ FLEET
★ FLANNEL
ETCD
ETCD is a distributed key value store that provides a
reliable way to store data across a cluster of machines.
ETCD gracefully handles master elections during network
partitions and will tolerate machine failure, including the
master.
Your applications can read and write data into etcd.
FLEET
Easy Warehouse-scale Computing
Treat your CoreOS cluster as if it shared a single init
system.
Automatic migration of the units when a machine is
failing
FLEET - ADVANTAGES
	Deploy docker containers on arbitrary hosts in a cluster
	Distribute services across a cluster using machine-level
anti-affinity
	Maintain N instances of a service, re-scheduling on
machine failure
	Discover machines running in the cluster
	Automatically SSH into the machine running a job
Listing machines in a cluster
Listing units in a cluster
Listing unit files in a cluster
Creating a unit
FLANNEL
is an overlay network that gives a subnet to each machine
for use with Docker/Kubernetes
Allows for Docker containers to communicate even tough
the containers are located on different machines (magic)
Facebook uses Presto for interactive queries
against several internal data stores,
including their 300PB data warehouse. Over
1,000 Facebook employees use Presto daily
to run more than 30,000 queries that in
total scan over a petabyte each per day.
DISTRIBUTED SQL
ENGINE FOR BIG
DATA
PRESTO
by Facebook
HOW DOES IT WORK?
★
Coordinator-worker
architecture
★
Works with varios
connectors
★
Hadoop/HIVE
★
Cassandra
★
TCP-H (mostly for testing)
WHY??
★
Combine data from
multiple data sources
★
JOINS!!!!
★
Scalable
★
Maintained!!!
QUESTIONS?

More Related Content

PDF
Cassandra and docker
PDF
Docker Container Orchestration
PDF
Cassandra on Docker
PDF
Cassandra and Docker Lessons Learned
PPTX
Cassandra via-docker
PDF
Introduction To Docker
PDF
Docker Intro at the Google Developer Group and Google Cloud Platform Meet Up
PDF
Introduction to docker
Cassandra and docker
Docker Container Orchestration
Cassandra on Docker
Cassandra and Docker Lessons Learned
Cassandra via-docker
Introduction To Docker
Docker Intro at the Google Developer Group and Google Cloud Platform Meet Up
Introduction to docker

What's hot (20)

PDF
How we dockerized a startup? #meetup #docker
PDF
Basic docker for developer
PDF
Introduction to Containers and Docker
ODP
Docker - The Linux Container
PDF
Docker and containers : Disrupting the virtual machine(VM)
PDF
Optimizing Docker Images
PDF
Shipping Applications to Production in Containers with Docker
PDF
Introduction to Docker - Docker workshop @Twitter
PDF
Introduction to Docker and all things containers, Docker Meetup at RelateIQ
PDF
Docker Introduction + what is new in 0.9
PDF
Docker and Containers for Development and Deployment — SCALE12X
PPTX
Introduction to Docker
PPTX
Docker Introduction
PDF
Nebulaworks Docker Overview 09-22-2015
PDF
Understand how docker works
PDF
Orchestrating Docker containers at scale
PDF
Virtual Machines and Docker
PPTX
Docker for the new Era: Introducing Docker,its components and tools
PPTX
Docker Ecosystem on Azure
PDF
Wso2 con 2014-us-tutorial-apache stratos-wso2 private paas with docker integr...
How we dockerized a startup? #meetup #docker
Basic docker for developer
Introduction to Containers and Docker
Docker - The Linux Container
Docker and containers : Disrupting the virtual machine(VM)
Optimizing Docker Images
Shipping Applications to Production in Containers with Docker
Introduction to Docker - Docker workshop @Twitter
Introduction to Docker and all things containers, Docker Meetup at RelateIQ
Docker Introduction + what is new in 0.9
Docker and Containers for Development and Deployment — SCALE12X
Introduction to Docker
Docker Introduction
Nebulaworks Docker Overview 09-22-2015
Understand how docker works
Orchestrating Docker containers at scale
Virtual Machines and Docker
Docker for the new Era: Introducing Docker,its components and tools
Docker Ecosystem on Azure
Wso2 con 2014-us-tutorial-apache stratos-wso2 private paas with docker integr...
Ad

Viewers also liked (8)

PDF
[Hadoop] NexR Terapot: Massive Email Archiving
PDF
Archiving 2.0 - Retain Business Value
PPTX
Deep Dive Into Email Archiving Products
PDF
Exchange Architecture & Sizing
PPTX
EV.Cloud Email Archiving
PPTX
Introduction to DataStax Enterprise Graph Database
PPTX
Cassandra on Docker @ Walmart Labs
PDF
Hw09 Terapot Email Archiving With Hadoop
[Hadoop] NexR Terapot: Massive Email Archiving
Archiving 2.0 - Retain Business Value
Deep Dive Into Email Archiving Products
Exchange Architecture & Sizing
EV.Cloud Email Archiving
Introduction to DataStax Enterprise Graph Database
Cassandra on Docker @ Walmart Labs
Hw09 Terapot Email Archiving With Hadoop
Ad

Similar to Scaling and Managing Cassandra with docker, CoreOS and Presto (20)

PPTX
The world of Docker and Kubernetes
 
PDF
DockerPenang Meetup#1
PPTX
Docker 101 - all about Docker containers
PDF
Build cloud native solution using open source
PPTX
Docker-N-Beyond
PPTX
Docker & aPaaS: Enterprise Innovation and Trends for 2015
PDF
presentation @ docker meetup
PDF
Docker Application to Scientific Computing
PPTX
Lectre # 11 (VS&S). virtualization .pptx
PPTX
Docker Overview
PPTX
Bahrain ch9 introduction to docker 5th birthday
PPTX
Docker Enterprise Workshop - Intro
PDF
Docker Birthday #5 Meetup Cluj - Presentation
PDF
Docker-v3.pdf
PPTX
Docker Bday #5, SF Edition: Introduction to Docker
PDF
Docker Tips And Tricks at the Docker Beijing Meetup
PDF
Beyond static configuration
PDF
IAU workshop 2018 day one
PDF
EDB Postgres with Containers
 
PPTX
Docker intro
The world of Docker and Kubernetes
 
DockerPenang Meetup#1
Docker 101 - all about Docker containers
Build cloud native solution using open source
Docker-N-Beyond
Docker & aPaaS: Enterprise Innovation and Trends for 2015
presentation @ docker meetup
Docker Application to Scientific Computing
Lectre # 11 (VS&S). virtualization .pptx
Docker Overview
Bahrain ch9 introduction to docker 5th birthday
Docker Enterprise Workshop - Intro
Docker Birthday #5 Meetup Cluj - Presentation
Docker-v3.pdf
Docker Bday #5, SF Edition: Introduction to Docker
Docker Tips And Tricks at the Docker Beijing Meetup
Beyond static configuration
IAU workshop 2018 day one
EDB Postgres with Containers
 
Docker intro

Recently uploaded (20)

PDF
Odoo Companies in India – Driving Business Transformation.pdf
PPTX
L1 - Introduction to python Backend.pptx
PDF
Digital Strategies for Manufacturing Companies
PPT
Introduction Database Management System for Course Database
PDF
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
PDF
top salesforce developer skills in 2025.pdf
PDF
Navsoft: AI-Powered Business Solutions & Custom Software Development
PDF
Flood Susceptibility Mapping Using Image-Based 2D-CNN Deep Learnin. Overview ...
PPTX
Introduction to Artificial Intelligence
PDF
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
PDF
Softaken Excel to vCard Converter Software.pdf
PDF
Which alternative to Crystal Reports is best for small or large businesses.pdf
PDF
PTS Company Brochure 2025 (1).pdf.......
PPTX
ISO 45001 Occupational Health and Safety Management System
PPTX
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
PPTX
Operating system designcfffgfgggggggvggggggggg
PDF
System and Network Administration Chapter 2
PDF
How Creative Agencies Leverage Project Management Software.pdf
PDF
Adobe Illustrator 28.6 Crack My Vision of Vector Design
PPTX
history of c programming in notes for students .pptx
Odoo Companies in India – Driving Business Transformation.pdf
L1 - Introduction to python Backend.pptx
Digital Strategies for Manufacturing Companies
Introduction Database Management System for Course Database
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
top salesforce developer skills in 2025.pdf
Navsoft: AI-Powered Business Solutions & Custom Software Development
Flood Susceptibility Mapping Using Image-Based 2D-CNN Deep Learnin. Overview ...
Introduction to Artificial Intelligence
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
Softaken Excel to vCard Converter Software.pdf
Which alternative to Crystal Reports is best for small or large businesses.pdf
PTS Company Brochure 2025 (1).pdf.......
ISO 45001 Occupational Health and Safety Management System
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
Operating system designcfffgfgggggggvggggggggg
System and Network Administration Chapter 2
How Creative Agencies Leverage Project Management Software.pdf
Adobe Illustrator 28.6 Crack My Vision of Vector Design
history of c programming in notes for students .pptx

Scaling and Managing Cassandra with docker, CoreOS and Presto

  • 1. Javascript addict, Experienced as technical leader and being focused on building high-quality, distributed, scalable applications. VALI MALINOIU @0X4139
  • 2. HOW TO - WITH CASSANDRA ★ Scaling with DOCKER ★ Clustering with COREOS ★ Querying with PRESTO
  • 3. SCALING - DOCKER ★ What is docker ? ★ Why should i use it ? ★ How does it help me ?
  • 4. Docker is an open platform for developers and sysadmins to build, ship, and run distributed applications WHAT IS DOCKER??? DOCKER ★ Why developers like it? ★ Why sysadmins like it? ★ Docker vs VM (VBox, VMWARE)
  • 5. FOR DEVELOPERS • Any language using any toolchain •“Dockerized” apps are completely portable and can run anywhere •No more apt-get install, yum install •Get started quickly using one of the 13000+ apps on Docker Hub
  • 6. FOR SYSADMINS •Provides standardized environments for QA,DEV,PROD •Reduces “works on my machine” finger pointing •Deploy and run any app on any infrastructure, quickly and reliable
  • 7. Each virtualized application includes not only the application - which may be 10s of MBS and the necessary binaries and libraries but also the entire guest operating system - which may weigh 10’s of GB VIRTUAL MACHINES The docker Engine container comprises just the application and its dependencies. It runs as an isolated process in userspace on the host operating system, sharing the kernel with other containers. DOCKER
  • 9. CLUSTER - COREOS ★ What is cores ? ★ Why should i use it ? ★ How does it help me ?
  • 10. CoreOS is Linux for Massive Server Deployments WHAT IS COREOS??? COREOS ★ ETCD ★ FLEET ★ FLANNEL
  • 11. ETCD ETCD is a distributed key value store that provides a reliable way to store data across a cluster of machines. ETCD gracefully handles master elections during network partitions and will tolerate machine failure, including the master. Your applications can read and write data into etcd.
  • 12. FLEET Easy Warehouse-scale Computing Treat your CoreOS cluster as if it shared a single init system. Automatic migration of the units when a machine is failing
  • 13. FLEET - ADVANTAGES Deploy docker containers on arbitrary hosts in a cluster Distribute services across a cluster using machine-level anti-affinity Maintain N instances of a service, re-scheduling on machine failure Discover machines running in the cluster Automatically SSH into the machine running a job
  • 14. Listing machines in a cluster Listing units in a cluster Listing unit files in a cluster
  • 16. FLANNEL is an overlay network that gives a subnet to each machine for use with Docker/Kubernetes Allows for Docker containers to communicate even tough the containers are located on different machines (magic)
  • 17. Facebook uses Presto for interactive queries against several internal data stores, including their 300PB data warehouse. Over 1,000 Facebook employees use Presto daily to run more than 30,000 queries that in total scan over a petabyte each per day. DISTRIBUTED SQL ENGINE FOR BIG DATA PRESTO by Facebook
  • 18. HOW DOES IT WORK? ★ Coordinator-worker architecture ★ Works with varios connectors ★ Hadoop/HIVE ★ Cassandra ★ TCP-H (mostly for testing)
  • 19. WHY?? ★ Combine data from multiple data sources ★ JOINS!!!! ★ Scalable ★ Maintained!!!