SlideShare a Scribd company logo
a
CHAOS PATTERNS
BRUCE M. WONG | @BRUCE_M_WONG
LESSONS ABOUT FAILING WELL AND FAILING OFTEN
FAILURE HAPPENS
BRUCE M. WONG | @BRUCE_M_WONG
“EVERYTHING FAILS ALL THE TIME”
-WERNER VOGELS, CTO, AMAZON WEB SERVICES
HTTP://THENEXTWEB.COM/2008/04/04/WERNER-VOGELS-EVERYTHING-FAILS-ALL-THE-TIME/
BRUCE M. WONG | @BRUCE_M_WONG
THE ORIGINAL CHAOS MONKEY
CREATED BY NETFLIX CLOUD ARCHITECT, GREG ORZELL - @CHAOSSIMIA 2010
BRUCE M. WONG | @BRUCE_M_WONG
HTTPS://WWW.LINKEDIN.COM/IN/GORZELL
a
A STATE OF XEN
AWS EC2 REBOOT, 2014
BRUCE M. WONG | @BRUCE_M_WONG
HTTP://XENBITS.XEN.ORG/XSA/ADVISORY-108.HTML
HTTP://TECHBLOG.NETFLIX.COM/2014/10/A-STATE-OF-XEN-CHAOS-MONKEY-CASSANDRA.HTML
HTTP://AWS.AMAZON.COM/BLOGS/AWS/EC2-MAINTENANCE-UPDATE/
22COMPLETE NODE FAILURE
2700+ C* NODES, 218 REBOOTS
0DOWNTIME
BRUCE M. WONG | @BRUCE_M_WONG
LESSON #1 : TRUST YOUR RESILIENCE
BRUCE M. WONG | @BRUCE_M_WONG
SLOW IS HARD
BRUCE M. WONG | @BRUCE_M_WONG
SLOW IS HARD
BRUCE M. WONG | @BRUCE_M_WONG
UNBOUND QUEUES - ELASTIC ISN’T INFINITE
BRUCE M. WONG | @BRUCE_M_WONG
UNBOUND QUEUES - ELASTIC ISN’T INFINITE
BRUCE M. WONG | @BRUCE_M_WONG
SLOW IS HARD
BRUCE M. WONG | @BRUCE_M_WONG
LATENCY MONKEY
BRUCE M. WONG | @BRUCE_M_WONG
SLOW IS HARD
BRUCE M. WONG | @BRUCE_M_WONG
LATENCY TESTING 2.0 - FIT
HTTP://TECHBLOG.NETFLIX.COM/2014/10/FIT-FAILURE-INJECTION-TESTING.HTML
BRUCE M. WONG | @BRUCE_M_WONG
SLOW IS HARD
BRUCE M. WONG | @BRUCE_M_WONG
SLOW IS HARD
START SLOW
•ACCOUNT LEVEL
•+10MS BEFORE +100MS
•+1% ERRORS BEFORE +80%
ERRORS
DIAL IT UP
•A -> D NOT * -> D
BRUCE M. WONG | @BRUCE_M_WONG
LESSON # 2 : FIXING ONE FAILURE MODE
EXPOSES NEW ONES
BRUCE M. WONG | @BRUCE_M_WONG
WHATS SO SPECIAL ABOUT CHAOS
BRUCE M. WONG | @BRUCE_M_WONG
CHAOS IS A CHOICE
WHATS SO SPECIAL ABOUT CHAOS
BRUCE M. WONG | @BRUCE_M_WONG
OUTAGES VS CHAOS
BRUCE M. WONG | @BRUCE_M_WONG
OUTAGES VS CHAOS
Uncontrolled Controlled
Unpredictable Scheduled
Time to Detect: Minutes 0 Time to Detect
Time to Resolve: ???? Time to Resolve: seconds*
Analysis Time: ???? Root Cause Analysis: Intentional
MYTH OF RESILIENCE
NATION’S BUSINESS, 1977
BRUCE M. WONG | @BRUCE_M_WONG
LATENCY MONKEY
BRUCE M. WONG | @BRUCE_M_WONG
LESSON # 3 : THE CULTURE ASPECTS OF CHAOS
ARE HARD
BRUCE M. WONG | @BRUCE_M_WONG
BRUCE M. WONG | @BRUCE_M_WONG
MOST ENTERPRISES HIRE PEOPLE TO FIX THINGS. NETFLIX
HIRES PEOPLE TO BREAK THINGS….
…WE SHOULD EMBRACE NETFLIX'S CULTURE OF "CHAOS ENGINEERING"
THROUGHOUT ORGANIZATIONS OF ALL SHAPES AND SIZES.
BRUCE M. WONG | @BRUCE_M_WONG
SEEK PROGRESS OVER PERFECTION
TWILIO LEADERSHIP PRINCIPLE
BRUCE M. WONG | @BRUCE_M_WONG
GAME DAYS - BENEFITS
•Training New Engineers
•Discover Instrumentation gaps
•New Product Launches
•Incident Management Practices
BRUCE M. WONG | @BRUCE_M_WONG
GAME DAYS - THE SETUP
•Two “on-call” teams
•Separate rooms, separate slack
channels
•Master of Disaster
•Incident Commander
BRUCE M. WONG | @BRUCE_M_WONG
LEVERAGE EXISTING TESTBOTS
•Functionally test fallback code
•Early warning!
•Existing Integrations with
Telemetry, PagerDuty, Slack
•Incorporate into Canary
process
FUTURE
BRUCE M. WONG | @BRUCE_M_WONG
RECAP
Lesson # 1 : Trust your resilience
Lesson # 2 : Fixing one failure mode exposes new ones
Lesson # 3 : The culture aspects of Chaos are HARD
Get started today!
Game Days are your friend - do them early and often
Testbots + focus on developer productivity
BRUCE M. WONG | @BRUCE_M_WONG
WHEN YOU WISH UPON A BLUE MOON
BRUCE M. WONG | @BRUCE_M_WONG

More Related Content

PPTX
Social Networking for Public Works
PPTX
Personal Learning Networks
PDF
devopsdays Madison - opening remarks
PDF
blogs and wikis
PDF
The Canonically Bad (Digital) Humanities Proposal (and how to avoid it)
PDF
Social Media: Changing the Face of Response
PPT
The Socia Web - Ignite eTech
PPTX
The Networked Museum ("The Revolution Will Not Be Televised")
Social Networking for Public Works
Personal Learning Networks
devopsdays Madison - opening remarks
blogs and wikis
The Canonically Bad (Digital) Humanities Proposal (and how to avoid it)
Social Media: Changing the Face of Response
The Socia Web - Ignite eTech
The Networked Museum ("The Revolution Will Not Be Televised")

Viewers also liked (13)

PDF
Chaos Patterns
PPTX
The Case for Chaos
PDF
The Journey of Chaos Engineering Begins with a Single Step
PDF
Chaos Driven Development
PDF
From Sketch Mockup → WatchKit App
PDF
Elements of User Experience for Mobile Apps
PDF
Principles Of Chaos Engineering - Chaos Engineering Hamburg
PDF
"WE MAKE SPACE, SPACE MAKES US" - 김정태 MYSC 대표
PPTX
What's New in Java 8
PPTX
Intro to Netflix's Chaos Monkey
PDF
Scalable Microservices at Netflix. Challenges and Tools of the Trade
PDF
Mini-Training: Netflix Simian Army
PDF
Chaos Driven Development (Bruce Wong)
Chaos Patterns
The Case for Chaos
The Journey of Chaos Engineering Begins with a Single Step
Chaos Driven Development
From Sketch Mockup → WatchKit App
Elements of User Experience for Mobile Apps
Principles Of Chaos Engineering - Chaos Engineering Hamburg
"WE MAKE SPACE, SPACE MAKES US" - 김정태 MYSC 대표
What's New in Java 8
Intro to Netflix's Chaos Monkey
Scalable Microservices at Netflix. Challenges and Tools of the Trade
Mini-Training: Netflix Simian Army
Chaos Driven Development (Bruce Wong)
Ad

Similar to Chaos Patterns Twilio SIGNALCONF 2016 (20)

PDF
Chaos patterns - architecting for failure in distributed systems
PPTX
Chaos engineering - The art of breaking stuff in production on purpose
PDF
Becoming a catalyst for chaos
PPTX
Green Custard Friday Talk 19: Chaos Engineering
PDF
Jason Yee - Chaos! - Codemotion Rome 2019
PPTX
Embracing Failure - Fault Injection and Service Resilience at Netflix
PPTX
Introduction to Chaos Engineering
PPTX
Embracing Failure - AzureDay Rome
PDF
Applying principles of chaos engineering to serverless (reinvent DVC305)
PDF
An introduction to chaos engineering as part of DevOps at XP2019
PPTX
Chaos engineering
PDF
Chaos Engineering - The Art of Breaking Things in Production
PDF
Applying principles of chaos engineering to serverless (O'Reilly Software Arc...
PDF
Yan Cui - Applying principles of chaos engineering to Serverless - Codemotion...
PDF
Applying principles of chaos engineering to Serverless (CodeMotion Berlin)
PDF
Yan Cui - Applying principles of chaos engineering to Serverless - Codemotion...
PPTX
Kriss Rochefolle: "How to Convince Your Boss to Say "Yes!" to Chaos Engineeri...
PDF
Choose your own adventure Chaos Engineering - QCon NYC 2017
PDF
You shall not Fail! (in the face of turbulent conditions)
PDF
Using chaos to bring resiliency to your applications
Chaos patterns - architecting for failure in distributed systems
Chaos engineering - The art of breaking stuff in production on purpose
Becoming a catalyst for chaos
Green Custard Friday Talk 19: Chaos Engineering
Jason Yee - Chaos! - Codemotion Rome 2019
Embracing Failure - Fault Injection and Service Resilience at Netflix
Introduction to Chaos Engineering
Embracing Failure - AzureDay Rome
Applying principles of chaos engineering to serverless (reinvent DVC305)
An introduction to chaos engineering as part of DevOps at XP2019
Chaos engineering
Chaos Engineering - The Art of Breaking Things in Production
Applying principles of chaos engineering to serverless (O'Reilly Software Arc...
Yan Cui - Applying principles of chaos engineering to Serverless - Codemotion...
Applying principles of chaos engineering to Serverless (CodeMotion Berlin)
Yan Cui - Applying principles of chaos engineering to Serverless - Codemotion...
Kriss Rochefolle: "How to Convince Your Boss to Say "Yes!" to Chaos Engineeri...
Choose your own adventure Chaos Engineering - QCon NYC 2017
You shall not Fail! (in the face of turbulent conditions)
Using chaos to bring resiliency to your applications
Ad

Recently uploaded (20)

PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PPTX
Lecture Notes Electrical Wiring System Components
PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PPTX
Geodesy 1.pptx...............................................
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PDF
Operating System & Kernel Study Guide-1 - converted.pdf
PPTX
Artificial Intelligence
PDF
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PDF
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PPTX
UNIT 4 Total Quality Management .pptx
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PPTX
additive manufacturing of ss316l using mig welding
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PPTX
Current and future trends in Computer Vision.pptx
PPTX
bas. eng. economics group 4 presentation 1.pptx
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
Lecture Notes Electrical Wiring System Components
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
Geodesy 1.pptx...............................................
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
Operating System & Kernel Study Guide-1 - converted.pdf
Artificial Intelligence
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
UNIT-1 - COAL BASED THERMAL POWER PLANTS
UNIT 4 Total Quality Management .pptx
R24 SURVEYING LAB MANUAL for civil enggi
additive manufacturing of ss316l using mig welding
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
Current and future trends in Computer Vision.pptx
bas. eng. economics group 4 presentation 1.pptx

Chaos Patterns Twilio SIGNALCONF 2016