Plan to fail.
Embrace the Chaos.
Nice to meet you
Software Engineer @Colpatria Scotiabank
Making things happen!
Loves reading, programming and teaching.
@yurynino
yury.nino.roa
Agenda
● Why do we fear failure?
● What is chaos engineering?
● Principles of Chaos
● Phases of Chaos Engineering
● Chaos in Practice
● Who is using it?
● Review
Why do we fear failure?
Reality is inherently uncertain, and this uncertainty exposes our limitations.
This means that failure should be a perpetual consideration, even in the best-planned and most controlled environments.
Benefits of failure
1. It is an opportunity to learn.
2. It encourages creativity.
3. It keeps us hungry and humble.
4. It helps us overcome fear.
5. It makes us resilient!
We need highly available and
durable systems.
We need
Resilient Systems
What is Chaos
Engineering?
Netflix Architecture
Taken from https://medium.com/netflix-techblog
Chaos Engineering: Definition!
It is the discipline of experimenting on a distributed system in order to build confidence in the system’s capability to withstand turbulent conditions in production.
Taken from https://principlesofchaos.org/
Chaos Engineering: Definition!
It is deliberately inducing stress or fault into software and/or hardware as a way of learning/verifying things about systems.
Taken from https://www.gremlin.com/community
I learned more about chaos engineering from Kolton Andrus than anyone else; he called it failure injection testing.
Kolton Andrus
Chaos Engineer
A vaccine research computer scientist.
History of Chaos Engineering
2008: Chaos Engineering began at Netflix.
2010: Chaos Monkey was launched.
2014: The role of Chaos Engineer was created.
2018: The book Chaos Engineering was published.
How to start with Chaos Engineering?
https://www.oreilly.com/library/view/chaos-engineering/9781491988459/
How to start …
https://chaosengineering.slack.com
https://github.com/dastergon/awesome-chaos-engineering
https://www.infoq.com/chaos-engineering
https://groups.google.com/forum/#!forum/chaos-community
https://www.linkedin.com/groups/7057761/
https://www.gremlin.com/chaos-monkey/
Chaos Engineering
Because …
Writing many unit and integration tests is not enough.
● How does the application behave with network latency?
● What if one of our services breaks down?
Remember, the world is chaotic!
● Unpredictable events are bound to happen.
● Distributed systems contain many moving parts.
● Many things can go wrong.
○ Hard disks can fail.
○ The network can go down.
○ A surge in customer traffic can overload the system.
February 28th, 2017 will be remembered
● Amazon Simple Storage Service (S3) went down in US-EAST-1.
● The outage lasted about 4 hours.
● Hundreds of thousands of websites across the US were impacted.
How is it different from testing?
Testing vs Chaos Engineering
Tests are assertions whose outcomes are typically binary, while Chaos Engineering is a practice for generating new information.
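The contrast above can be made concrete with a small sketch. It assumes a toy client that retries a flaky call; `fetch_price`, `fetch_with_retry` and `experiment` are hypothetical names invented for illustration, not part of any chaos tool. The test returns a single pass/fail answer, while the experiment varies the injected failure rate and records how many calls are still served.

```python
import random


def fetch_price(fail):
    """Stand-in for a remote call; `fail` says whether this attempt breaks."""
    if fail:
        raise ConnectionError("service unavailable")
    return 42


def fetch_with_retry(failure_rate, rng, attempts=3):
    """Client under study: retries up to `attempts` times, then gives up."""
    for _ in range(attempts):
        try:
            return fetch_price(rng.random() < failure_rate)
        except ConnectionError:
            continue
    return None


def test_happy_path():
    # A test: one assertion, one binary answer.
    assert fetch_with_retry(0.0, random.Random(0)) == 42


def experiment(failure_rates, calls=1000, seed=1):
    # An experiment: vary the fault and record what is observed.
    rng = random.Random(seed)
    observations = {}
    for rate in failure_rates:
        served = sum(
            fetch_with_retry(rate, rng) is not None for _ in range(calls)
        )
        observations[rate] = served / calls
    return observations


if __name__ == "__main__":
    test_happy_path()
    for rate, served in experiment([0.1, 0.5, 0.9]).items():
        print(f"failure rate {rate:.0%}: {served:.1%} of calls served")
```

The experiment does not pass or fail by itself. Its output is a set of observations (here, the share of calls served at each failure rate) that can then be compared against what the team expected.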
I want to emphasize that both sides of the equation [unit, regression and chaos side] are required to get you the level of availability you want.
Chaos Engineering is about ...
● Simulating the failure of a datacenter.
● Injecting latency between services.
● Randomly causing exceptions.
● Time travel: changing system clocks.
● Emulating I/O errors.
http://principlesofchaos.org/
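Two of the faults listed above, injected latency and random exceptions, can be sketched as a small wrapper around a service call. This is a minimal illustration under stated assumptions, not the API of Chaos Monkey or any other tool: `chaos`, `InjectedFault`, `get_balance` and `call_many` are hypothetical names, and the wrapped function only stands in for a real remote call.

```python
import random
import time
from functools import wraps


class InjectedFault(RuntimeError):
    """Raised by the wrapper to simulate a failing dependency."""


def chaos(latency_s=0.0, failure_rate=0.0, rng=None, sleep=time.sleep):
    """Return a decorator that delays calls and makes some of them fail.

    Hypothetical helper for illustration only.
    """
    rng = rng if rng is not None else random.Random()

    def decorator(func):
        @wraps(func)
        def wrapper(*args, **kwargs):
            if latency_s > 0:
                sleep(latency_s)  # injected latency before the real call
            if rng.random() < failure_rate:
                # randomly caused exception
                raise InjectedFault(f"injected failure in {func.__name__}")
            return func(*args, **kwargs)
        return wrapper
    return decorator


@chaos(latency_s=0.01, failure_rate=0.3, rng=random.Random(42))
def get_balance(account_id):
    # Stand-in for a real remote call.
    return {"account": account_id, "balance": 100}


def call_many(n):
    """Call the wrapped service n times and count successes and failures."""
    ok = failed = 0
    for i in range(n):
        try:
            get_balance(i)
            ok += 1
        except InjectedFault:
            failed += 1
    return ok, failed
```

Passing the random generator and the sleep function as parameters keeps the sketch repeatable: a seeded generator gives the same fault pattern on every run, and a fake `sleep` lets the wrapper be exercised without waiting.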
Phases of Chaos Engineering
Taken from https://blog.codecentric.de/en/2018/07/chaos-engineering/
Chaos Engineering
Example - Chaos Monkey for Spring Boot
Taken from https://codecentric.github.io/chaos-monkey-spring-boot/2.0.0/
Example - Configuration
Taken from https://codecentric.github.io/chaos-monkey-spring-boot/2.0.0/
Chaos Engineering
Designing Experiments
1. Pick a Hypothesis
2. Choose the Scope
3. Identify the Metrics to Monitor
4. Notify the Organization
5. Run the Experiment
6. Analyze the Results
7. Automate
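The steps above can be sketched as a small data structure plus a run loop. Everything here is an assumption made for illustration: the `Experiment` class, its field names and the toy `state` dictionary are hypothetical, and a real experiment would read metrics from a monitoring system rather than from a local variable.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Experiment:
    hypothesis: str                             # 1. what should stay true
    scope: str                                  # 2. where the fault is injected
    metrics: List[str]                          # 3. what is monitored
    inject: Callable[[], None]                  # turns the fault on
    rollback: Callable[[], None]                # turns the fault off
    measure: Callable[[], Dict[str, float]]     # reads the metrics
    holds: Callable[[Dict[str, float]], bool]   # checks the hypothesis
    log: List[str] = field(default_factory=list)

    def run(self, notify: Callable[[str], None] = print) -> bool:
        # 4. notify the organization before anything is broken
        notify(f"Starting experiment on {self.scope}: {self.hypothesis}")
        baseline = self.measure()
        self.inject()                           # 5. run the experiment
        try:
            observed = self.measure()
        finally:
            self.rollback()  # always undo the fault, even if measuring fails
        ok = self.holds(observed)               # 6. analyze the results
        self.log.append(f"baseline={baseline} observed={observed} ok={ok}")
        return ok


# Toy system: a single latency figure that the fault pushes up.
state = {"latency_ms": 120.0}


def add_latency():
    state["latency_ms"] += 300.0


def remove_latency():
    state["latency_ms"] -= 300.0


exp = Experiment(
    hypothesis="latency stays under 500 ms when the cache is slow",
    scope="cache calls of one test instance",
    metrics=["latency_ms"],
    inject=add_latency,
    rollback=remove_latency,
    measure=lambda: dict(state),
    holds=lambda m: m["latency_ms"] < 500.0,
)
```

Step 7 then amounts to calling `exp.run()` from a scheduler or pipeline instead of by hand. The `try`/`finally` around the measurement is the important design choice: the fault is rolled back even when the observation step itself breaks.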
Who is using
Chaos Engineering?
Conclusion
It means: To ACCEPT
the uncertainty of life and
move forward anyway!
Take action despite not
knowing how things will
turn out.
Chaos Engineering
