SlideShare a Scribd company logo
#QConSF @ana_m_medina
Chaos EngineeringChaos Engineering
with Containers
1
Ana Medina

Chaos Engineer at
InfoQ.com: News & Community Site
• Over 1,000,000 software developers, architects and CTOs read the site world-
wide every month
• 250,000 senior developers subscribe to our weekly newsletter
• Published in 4 languages (English, Chinese, Japanese and Brazilian
Portuguese)
• Post content from our QCon conferences
• 2 dedicated podcast channels: The InfoQ Podcast, with a focus on
Architecture and The Engineering Culture Podcast, with a focus on building
• 96 deep dives on innovative topics packed as downloadable emags and
minibooks
• Over 40 new content items per week
Watch the video with slide
synchronization on InfoQ.com!
https://www.infoq.com/presentations/
chaos-engineering-gamedays
Purpose of QCon
- to empower software development by facilitating the spread of
knowledge and innovation
Strategy
- practitioner-driven conference designed for YOU: influencers of
change and innovation in your teams
- speakers and topics driving the evolution and innovation
- connecting and catalyzing the influencers and innovators
Highlights
- attended by more than 12,000 delegates since 2007
- held in 9 cities worldwide
Presented at QCon San Francisco
www.qconsf.com
#QConSF @ana_m_medina
2
Ana Medina
@ana_m_medina


Chaos Engineer @ Gremlin
Previously Software Engineer /
SRE @ Uber, Also worked/
interned @ SFEFCU, Google,
Quicken Loans, Stanford
University and Miami Dade
College.
College dropout.
Self taught engineer.
#QConSF @ana_m_medina
3
How many of you have
heard of Chaos
Engineering?
#QConSF @ana_m_medina
4
How many of have run
a Chaos Engineering
experiment?
#QConSF @ana_m_medina
5
Thoughtful, planned
experiments designed to reveal
the weakness in our systems.


Chaos Engineering
#QConSF @ana_m_medina
6
Inject something harmful to
build an immunity.
-@KoltonAndrus

Gremlin Founder and CEO
Chaos Engineering
#QConSF @ana_m_medina
7
Why?
● Microservices
● Systems are scaling fast
● Downtime is really expensive
● Our dependencies will fail
● Pager fatigue and burnout really hurts
#QConSF @ana_m_medina
8
“Chaos Engineering
Without Observability ...
Is Just Chaos”

-@mipsytipsy
Charity Majors
CEO of honeycomb

#QConSF @ana_m_medina
9
Prerequisite of Chaos Engineering
● Monitoring/Observability
● On-Call and Incident Management
● Cost of Downtime Per Hour
#QConSF @ana_m_medina
10
Use Cases for Chaos Engineering
● Outage reproduction
● On-call training
● Strengthen new products
● Battle test new infrastructure and
services
#QConSF @ana_m_medina
11
Use Cases for Chaos Engineering - Containers
● Testing Provider Specific Reliability
(eg: EKS vs AKS vs GKE)
● Auto Scaling
● Logs, Disk failure
#QConSF @ana_m_medina
Minimize the
Blast radius
12
#QConSF @ana_m_medina
Monitoring /
Observability
13
#QConSF @ana_m_medina
14
What to measure and monitor?
! System Metrics: CPU, Disk, I/O
! Availability
! Service specific KPIs
! Customer complaints
#QConSF @ana_m_medina
15
Demo
#QConSF @ana_m_medina
16
#1 - Battle Test Cloud infrastructure
Real World Scenario: company / user is evaluating cloud
provider managed kubernetes. which one is more reliable?
The Hypothesis: shutting down a container (1/1) should only
give a small delay before app is reachable again
The Experiment: shut down kubernetes dashboard
container
Abort Conditions: app is unreachable after 60 seconds
#QConSF @ana_m_medina
17
#QConSF @ana_m_medina
#QConSF @ana_m_medina
#QConSF @ana_m_medina
#QConSF @ana_m_medina
21
#2 - Shutdown of a Container
Real World Scenario: company / user is evaluating
containers. Are they as reliable as promised?
The Hypothesis: yes, they will come back up
The Experiment: shutdown container and wait a few
seconds and check if it’s up
Abort Conditions: app is unreachable after 60 seconds
#QConSF @ana_m_medina
22
#QConSF @ana_m_medina
23
#3 - Blackholing traffic to Catalog
Real World Scenario: company / user is working with their UI
team to provide a good user experience when there API/DB
issues
The Hypothesis: images will not load, but product listing will
The Experiment: blackhole all traffic from the front end to
REST API and DB ports
Abort Conditions: app is unreachable after 60 seconds
#QConSF @ana_m_medina
24
#QConSF @ana_m_medina
Case Study
25
#QConSF @ana_m_medina
26
Companies doing
Chaos Engineering
#QConSF @ana_m_medina
27
Tools you Can Use
Gremlin

Chaos Toolkit

Litmus

PowerfulSeal
#QConSF @ana_m_medina
28
Break Things Together
bit.ly/chaos-eng-slack

2,000+ members across the world
#QConSF @ana_m_medina
THANKS!
@ana_m_medina
ana@gremlin.com
Watch the video with slide
synchronization on InfoQ.com!
https://www.infoq.com/presentations/
chaos-engineering-gamedays

More Related Content

PPTX
Tests your pipeline might be missing
PPTX
DevOps in a Regulated and Embedded Environment (AgileDC)
PDF
Chaos Engineering, When should you release the monkeys?
PDF
Continuous Delivery in a Legacy Shop - One Step at a Time
PDF
DevSecOps: Taking a DevOps Approach to Security
PDF
You Build It, You Secure It: Higher Velocity and Better Security with DevSecOps
PPTX
30 days or less: New Features to Production
PDF
Web Application Security Testing: Kali Linux Is the Way to Go
Tests your pipeline might be missing
DevOps in a Regulated and Embedded Environment (AgileDC)
Chaos Engineering, When should you release the monkeys?
Continuous Delivery in a Legacy Shop - One Step at a Time
DevSecOps: Taking a DevOps Approach to Security
You Build It, You Secure It: Higher Velocity and Better Security with DevSecOps
30 days or less: New Features to Production
Web Application Security Testing: Kali Linux Is the Way to Go

What's hot (20)

PDF
Principles of Chaos Engineering
PPTX
Testing in a Continuous Delivery Pipeline - Better, Faster, Cheaper
PDF
Auto sre with keptn
PPTX
Scaling a Start-up DevOps team to 10x while scaling the system 50x
PDF
DevOps, Common use cases, Architectures, Best Practices
PPTX
Chaos Engineering when you're not Netflix
PDF
Better Security Testing: Using the Cloud and Continuous Delivery
PDF
Effective approaches to web application security
PPTX
A Guide to Event-Driven SRE-inspired DevOps
PDF
Releasing the monolith on a daily basis - CodeMash
PPTX
DevSecOps OWASP
KEY
DevOpsSec: Appling DevOps Principles to Security, DevOpsDays Austin 2012
PPTX
DevOps Days Toronto: From 6 Months Waterfall to 1 hour Code Deploys
PDF
How to adapt the SDLC to the era of DevSecOps
PPTX
Automation: The Good, The Bad and The Ugly with DevOpsGuys - AppD Summit Europe
PDF
Devops: Who Does What? - Devops Enterprise Summit 2016
PPTX
BsidesMCR_2016-what-can-infosec-learn-from-devops
PDF
Microservices 5 things i wish i'd known code motion
PDF
Attack-driven defense
PPTX
Conquering Chaos: Helix & DevOps
Principles of Chaos Engineering
Testing in a Continuous Delivery Pipeline - Better, Faster, Cheaper
Auto sre with keptn
Scaling a Start-up DevOps team to 10x while scaling the system 50x
DevOps, Common use cases, Architectures, Best Practices
Chaos Engineering when you're not Netflix
Better Security Testing: Using the Cloud and Continuous Delivery
Effective approaches to web application security
A Guide to Event-Driven SRE-inspired DevOps
Releasing the monolith on a daily basis - CodeMash
DevSecOps OWASP
DevOpsSec: Appling DevOps Principles to Security, DevOpsDays Austin 2012
DevOps Days Toronto: From 6 Months Waterfall to 1 hour Code Deploys
How to adapt the SDLC to the era of DevSecOps
Automation: The Good, The Bad and The Ugly with DevOpsGuys - AppD Summit Europe
Devops: Who Does What? - Devops Enterprise Summit 2016
BsidesMCR_2016-what-can-infosec-learn-from-devops
Microservices 5 things i wish i'd known code motion
Attack-driven defense
Conquering Chaos: Helix & DevOps
Ad

Similar to Chaos Engineering with Containers (20)

PPTX
16370 cics project opening and project update f
PDF
Accelerating Digital Transformation: It's About Digital Enablement
PDF
Continuous Delivery: Fly the Friendly CI in Pivotal Cloud Foundry with Concourse
PDF
Cloud Native CI/CD with Jenkins X and Knative Pipelines
PPTX
Deploying more technology to shift from agility to anti-fragility
PDF
Pivotal Developer-Ready Infrastructure Slides
PDF
CV - Abhijit
PPTX
Chapter 10 - Emerging Paradigms and Project Development.pptx
PPTX
Cloud native fundamentals
PPTX
Moving from Legacy Development Tools to transformative DevOps with Enterprise...
PPTX
The Reality of Managing Microservice Deployments at Scale: You Need a Spinnaker
PDF
Confluent Partner Tech Talk with SVA
PPTX
Lugano Tech Talks - Why Docker
PDF
How to Build and Operate a Global Behavioral Change Platform (Neil Adamson, V...
PDF
Guiding a Product Roadmap in a Chaotic World
DOCX
PPTX
The Evolution of Industrial Visualization
PDF
BayInfotech (BIT) ACI Portfolio
PDF
CI/CD for Machine Learning
DOC
Ravi-apr30
16370 cics project opening and project update f
Accelerating Digital Transformation: It's About Digital Enablement
Continuous Delivery: Fly the Friendly CI in Pivotal Cloud Foundry with Concourse
Cloud Native CI/CD with Jenkins X and Knative Pipelines
Deploying more technology to shift from agility to anti-fragility
Pivotal Developer-Ready Infrastructure Slides
CV - Abhijit
Chapter 10 - Emerging Paradigms and Project Development.pptx
Cloud native fundamentals
Moving from Legacy Development Tools to transformative DevOps with Enterprise...
The Reality of Managing Microservice Deployments at Scale: You Need a Spinnaker
Confluent Partner Tech Talk with SVA
Lugano Tech Talks - Why Docker
How to Build and Operate a Global Behavioral Change Platform (Neil Adamson, V...
Guiding a Product Roadmap in a Chaotic World
The Evolution of Industrial Visualization
BayInfotech (BIT) ACI Portfolio
CI/CD for Machine Learning
Ravi-apr30
Ad

More from C4Media (20)

PDF
Streaming a Million Likes/Second: Real-Time Interactions on Live Video
PDF
Next Generation Client APIs in Envoy Mobile
PDF
Software Teams and Teamwork Trends Report Q1 2020
PDF
Understand the Trade-offs Using Compilers for Java Applications
PDF
Kafka Needs No Keeper
PDF
High Performing Teams Act Like Owners
PDF
Does Java Need Inline Types? What Project Valhalla Can Bring to Java
PDF
Service Meshes- The Ultimate Guide
PDF
Shifting Left with Cloud Native CI/CD
PDF
Fault Tolerance at Speed
PDF
Architectures That Scale Deep - Regaining Control in Deep Systems
PDF
ML in the Browser: Interactive Experiences with Tensorflow.js
PDF
Build Your Own WebAssembly Compiler
PDF
User & Device Identity for Microservices @ Netflix Scale
PDF
Scaling Patterns for Netflix's Edge
PDF
Make Your Electron App Feel at Home Everywhere
PDF
The Talk You've Been Await-ing For
PDF
Future of Data Engineering
PDF
Automated Testing for Terraform, Docker, Packer, Kubernetes, and More
PDF
Navigating Complexity: High-performance Delivery and Discovery Teams
Streaming a Million Likes/Second: Real-Time Interactions on Live Video
Next Generation Client APIs in Envoy Mobile
Software Teams and Teamwork Trends Report Q1 2020
Understand the Trade-offs Using Compilers for Java Applications
Kafka Needs No Keeper
High Performing Teams Act Like Owners
Does Java Need Inline Types? What Project Valhalla Can Bring to Java
Service Meshes- The Ultimate Guide
Shifting Left with Cloud Native CI/CD
Fault Tolerance at Speed
Architectures That Scale Deep - Regaining Control in Deep Systems
ML in the Browser: Interactive Experiences with Tensorflow.js
Build Your Own WebAssembly Compiler
User & Device Identity for Microservices @ Netflix Scale
Scaling Patterns for Netflix's Edge
Make Your Electron App Feel at Home Everywhere
The Talk You've Been Await-ing For
Future of Data Engineering
Automated Testing for Terraform, Docker, Packer, Kubernetes, and More
Navigating Complexity: High-performance Delivery and Discovery Teams

Recently uploaded (20)

PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Modernizing your data center with Dell and AMD
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Machine learning based COVID-19 study performance prediction
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
cuic standard and advanced reporting.pdf
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
Big Data Technologies - Introduction.pptx
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPT
Teaching material agriculture food technology
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Advanced methodologies resolving dimensionality complications for autism neur...
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Modernizing your data center with Dell and AMD
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Machine learning based COVID-19 study performance prediction
NewMind AI Weekly Chronicles - August'25 Week I
cuic standard and advanced reporting.pdf
Agricultural_Statistics_at_a_Glance_2022_0.pdf
NewMind AI Monthly Chronicles - July 2025
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Dropbox Q2 2025 Financial Results & Investor Presentation
“AI and Expert System Decision Support & Business Intelligence Systems”
Unlocking AI with Model Context Protocol (MCP)
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Big Data Technologies - Introduction.pptx
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Teaching material agriculture food technology

Chaos Engineering with Containers