SlideShare a Scribd company logo
1
Tammy Butow - Principal SRE, Gremlin
Ana Medina - Chaos Engineer, Gremlin
Next Level Chaos Engineering
@tammybutow & @ana_m_medina
3
What is your next level for Chaos Engineering?
@tammybutow & @ana_m_medina
Chaos Engineer @ Gremlin.
Previously Software Engineer / SRE
@ Uber, worked on Chaos
Engineering and Cloud Infrastructure.
Also worked/interned @ SFEFCU,
Google, Quicken Loans, Stanford
University and Miami Dade College.
Ana Medina
Principal SRE @ Gremlin.
Previously SRE Manager @
Dropbox leading Databases, Block
Storage and Code Workflows.
IMOC (Incident Manager On-Call)
for Dropbox.
Also worked @ DigitalOcean, NAB
and QUT.
Tammy Butow
The Why, How & What Of CE
Focus on impact!
@tammybutow & @ana_m_medina
Why Practice Chaos Engineering?
@tammybutow & @ana_m_medina
10x
@tammybutow & @ana_m_medina
100%
@tammybutow & @ana_m_medina
IPO
@tammybutow & @ana_m_medina
Strengthen
New Products
Through
Failure Fridays
@tammybutow & @ana_m_medina
Battle Test
New Cloud Infra
Services Before You
Use Them
@tammybutow & @ana_m_medina
Battle Test
New Versions
Of Cloud Infra
Services Before You
Use Them
@tammybutow & @ana_m_medina
How Do We Practice
Chaos Engineering?
@tammybutow & @ana_m_medina
Chaos Engineering
Tools, Talks
& Guides
@tammybutow & @ana_m_medina
Make Failure Friday
Open To Your Entire
Company
@tammybutow & @ana_m_medina
On-Call Training
With Chaos
Engineering
@tammybutow & @ana_m_medina
Run Chaos
Engineering
Experiments 3x +
A Week Per Service
@tammybutow & @ana_m_medina
What Do We Do To Practice
Chaos Engineering?
@tammybutow & @ana_m_medina
What experiments do we run to practice Chaos Engineering
on new cutting edge software?
@tammybutow & @ana_m_medina
@tammybutow & @ana_m_medina
@tammybutow & @ana_m_medina
@tammybutow & @ana_m_medina
@tammybutow & @ana_m_medina
@tammybutow & @ana_m_medina
@tammybutow & @ana_m_medina
@tammybutow & @ana_m_medina
@tammybutow & @ana_m_medina
We have Security Engineering
bug bounty programs...
How can we get better at
creating a culture of doing the
same for reliability vulnerabilities?
Chaos Engineering!
@tammybutow & @ana_m_medina
@tammybutow & @ana_m_medina
@tammybutow & @ana_m_medina
@tammybutow & @ana_m_medina
33

More Related Content

PDF
Introduction to Chaos Engineering with Microsoft Azure
PDF
InfoQ Live - Reducing Uncertainty in Software Delivery - Building reliability...
PDF
Chaos Engineering in a Multi-Cloud World | Escape Conference 2019
PPTX
Chaos Engineering with Containers - QCon SF 2018
PDF
Chaos Engineering
PPTX
Introduction to Chaos Engineering
PDF
Chaos Engineering: Why the World Needs More Resilient Systems
PDF
Chaos Engineering: Injecting Failure for Building Resilience in Systems
Introduction to Chaos Engineering with Microsoft Azure
InfoQ Live - Reducing Uncertainty in Software Delivery - Building reliability...
Chaos Engineering in a Multi-Cloud World | Escape Conference 2019
Chaos Engineering with Containers - QCon SF 2018
Chaos Engineering
Introduction to Chaos Engineering
Chaos Engineering: Why the World Needs More Resilient Systems
Chaos Engineering: Injecting Failure for Building Resilience in Systems

What's hot (17)

PDF
Applying principles of chaos engineering to serverless (reinvent DVC305)
PPTX
Use GitLab with Chaos Engineering to Harden your Applications + OpenEBS 1.3 ...
PDF
5 Essential Techniques for Building Fault-tolerant Systems
PDF
Chaos Engineering - The Art of Breaking Things in Production
PDF
SecOps - Bringing Agility into Security
PDF
Chaos engineering intro
PDF
Applying principles of chaos engineering to Serverless (CodeMotion Berlin)
PDF
Chaos Engineering, When should you release the monkeys?
PDF
Top 10 Tips for Securing and Scaling Atlassian Cloud
PPTX
Splunk'ing JIRA for deep insights into application, database, and server heal...
PDF
Extending Trello - The Power-Up Opportunity
PDF
Embrace Chaos - Introducing Chaos Engineering to your Organization
PDF
Chaos Engineering – why we should all practice breaking things on purpose by ...
PDF
An Introduction to Chaos Engineering
PDF
Incident Management in the Age of DevOps and SRE
PPTX
JavaOne 2015 Devops and the Darkside CON6447
PDF
The Four Principles of Atlassian Performance Tuning
Applying principles of chaos engineering to serverless (reinvent DVC305)
Use GitLab with Chaos Engineering to Harden your Applications + OpenEBS 1.3 ...
5 Essential Techniques for Building Fault-tolerant Systems
Chaos Engineering - The Art of Breaking Things in Production
SecOps - Bringing Agility into Security
Chaos engineering intro
Applying principles of chaos engineering to Serverless (CodeMotion Berlin)
Chaos Engineering, When should you release the monkeys?
Top 10 Tips for Securing and Scaling Atlassian Cloud
Splunk'ing JIRA for deep insights into application, database, and server heal...
Extending Trello - The Power-Up Opportunity
Embrace Chaos - Introducing Chaos Engineering to your Organization
Chaos Engineering – why we should all practice breaking things on purpose by ...
An Introduction to Chaos Engineering
Incident Management in the Age of DevOps and SRE
JavaOne 2015 Devops and the Darkside CON6447
The Four Principles of Atlassian Performance Tuning
Ad

Similar to Next Level Chaos Engineering - Chaos Conf 2018 (20)

PPTX
MDM is Still Failing 2020
PDF
GDG Cloud Southlake #6 Tammy Bryant Butow: Chaos Engineering The Road To Res...
PDF
Ben Huh Keynote: LOLcats, FAILS, and Other Blunders from the Cheezburger Network
PDF
Big Data, Big Opportunity: A Primer for Understanding The Big Data Frontier
PPTX
Remote Work Readiness - A simple guide for remote work & management.
PDF
Introduction to Chaos Engineering | SRECon Asia - Ana Medina
PPTX
Data Con LA 2018 Keynote - The secret to your big data success by Tim Eusterman
PDF
2010 Snowpocalypse Operations Survey Results
PPTX
Cloud Worst Practices
PDF
WordCamp Nashville 2016: The promise and peril of Agile and Lean practices
PDF
Online Fraud Detection Using Big Data Analytics Webinar
PDF
Boston Ruby Meetup: The promise and peril of Agile and Lean practices
PPT
Value Presentation
PPTX
Proven achievement
PDF
AMP Accelerated Mobile Pages - The Next Generation SMX London 2017 Dawn Anderson
PPTX
Web 2.0 Beyond the Hype: Presentation March 27th, 2009
PPTX
Cloud lunchn learn_howtobecomeacloudarchitect_part1
PDF
Provenance in Production-Grade Machine Learning
PDF
Look Up
MDM is Still Failing 2020
GDG Cloud Southlake #6 Tammy Bryant Butow: Chaos Engineering The Road To Res...
Ben Huh Keynote: LOLcats, FAILS, and Other Blunders from the Cheezburger Network
Big Data, Big Opportunity: A Primer for Understanding The Big Data Frontier
Remote Work Readiness - A simple guide for remote work & management.
Introduction to Chaos Engineering | SRECon Asia - Ana Medina
Data Con LA 2018 Keynote - The secret to your big data success by Tim Eusterman
2010 Snowpocalypse Operations Survey Results
Cloud Worst Practices
WordCamp Nashville 2016: The promise and peril of Agile and Lean practices
Online Fraud Detection Using Big Data Analytics Webinar
Boston Ruby Meetup: The promise and peril of Agile and Lean practices
Value Presentation
Proven achievement
AMP Accelerated Mobile Pages - The Next Generation SMX London 2017 Dawn Anderson
Web 2.0 Beyond the Hype: Presentation March 27th, 2009
Cloud lunchn learn_howtobecomeacloudarchitect_part1
Provenance in Production-Grade Machine Learning
Look Up
Ad

More from Ana Medina (8)

PDF
Navigating Mental Health as a Human - Write/Speak/Code 2019
PDF
Chaos Engineering Bootcamp - QCon SF 2018
PDF
Velocity London - Chaos Engineering Bootcamp
PDF
The Practice of Chaos Engineering - Reactive Summit 2018 - Montreal, QC
PDF
DevOpsDays Kansas City - Getting Started with Chaos Engineering
PDF
#AllDayDevOps Getting Started with Chaos Engineering
PDF
Chaos Engineering with Kubernetes - Berlin / Hamburg Chaos Engineering Meetup...
PDF
SRECon Europe - Chaos Engineering Bootcamp | August 2018
Navigating Mental Health as a Human - Write/Speak/Code 2019
Chaos Engineering Bootcamp - QCon SF 2018
Velocity London - Chaos Engineering Bootcamp
The Practice of Chaos Engineering - Reactive Summit 2018 - Montreal, QC
DevOpsDays Kansas City - Getting Started with Chaos Engineering
#AllDayDevOps Getting Started with Chaos Engineering
Chaos Engineering with Kubernetes - Berlin / Hamburg Chaos Engineering Meetup...
SRECon Europe - Chaos Engineering Bootcamp | August 2018

Recently uploaded (20)

PDF
Well-logging-methods_new................
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PPTX
Construction Project Organization Group 2.pptx
PPTX
OOP with Java - Java Introduction (Basics)
PPTX
Foundation to blockchain - A guide to Blockchain Tech
PPTX
Artificial Intelligence
PDF
Human-AI Collaboration: Balancing Agentic AI and Autonomy in Hybrid Systems
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PDF
III.4.1.2_The_Space_Environment.p pdffdf
PPTX
Current and future trends in Computer Vision.pptx
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PPTX
additive manufacturing of ss316l using mig welding
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PPTX
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
Well-logging-methods_new................
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
Model Code of Practice - Construction Work - 21102022 .pdf
Construction Project Organization Group 2.pptx
OOP with Java - Java Introduction (Basics)
Foundation to blockchain - A guide to Blockchain Tech
Artificial Intelligence
Human-AI Collaboration: Balancing Agentic AI and Autonomy in Hybrid Systems
UNIT-1 - COAL BASED THERMAL POWER PLANTS
CYBER-CRIMES AND SECURITY A guide to understanding
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
III.4.1.2_The_Space_Environment.p pdffdf
Current and future trends in Computer Vision.pptx
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
additive manufacturing of ss316l using mig welding
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx

Next Level Chaos Engineering - Chaos Conf 2018

Editor's Notes

  • #3: Both of us
  • #4: Ana to talk about this! Reference Kolton’s keynote! Level 0 - Chaos Monkey Level 1 - Infra Failures Level 1.5 - Network Failures Level 2 - Application Failures https://docs.google.com/presentation/d/1uKroG_Hnf-w_VfOXpTdKvPVCE5bEorlj46FSVKYYayU/edit#slide=id.g405a4a4491_1_18
  • #5: ana
  • #6: tammy
  • #7: Tammy
  • #8: Tammy
  • #9: Tammy Achieved a 10x reduction in incidents @ Dropbox using CE
  • #10: Tammy Achieved a 100% reduction in SEV 0s for 12 months @ Dropbox using CE
  • #11: Tammy Achieved a 100% reduction in SEV 0s for 12 months @ Dropbox using CE
  • #12: Ana - ALFI Failure Fridays to strengthen products before launch @ Gremlin
  • #13: Ana New software can dramatically improve reliability, reduce engineering/business/support cost, improve engineering happiness and increase feature velocity New software/tools constantly being rolled out: EKS, AKS etc New versions of software frequently released
  • #14: Ana New software/tools constantly being rolled out: EKS, AKS etc New versions of software frequently released New software can dramatically improve reliability, reduce engineering/business/support cost, improve engineering happiness and increase feature velocity
  • #15: Tammy
  • #16: Tammy
  • #17: Tammy - open culture of chaos
  • #18: Ana
  • #19: Ana
  • #20: Ana
  • #21: Ana
  • #22: Ana Chaos Engineering For Cutting Edge Software
  • #23: tammy
  • #24: tammy
  • #25: Ana
  • #26: Ana
  • #27: tammy
  • #28: tammy
  • #29: tammy
  • #30: Tammy
  • #31: Ana
  • #32: Tammy
  • #33: Ana