Version 1.0
SRE 101
Introduction to Site Reliability Engineering
Hussain Mansoor
Topics / Agenda
1. Why & What is DevOps?
2. SRE
3. Relation between SRE & DevOps
4. SRE Details
What is DevOps? - AWS
DevOps is the combination of cultural philosophies, practices, and tools that increases
an organization's ability to deliver applications and services at high velocity: evolving
and improving products at a faster pace than organizations using traditional software
development and infrastructure management processes.
Why DevOps?
● To align on the mindset and activities that speed up software delivery
● Reduce human errors
● Consistency (because code)
● Reduce manual effort
How to DevOps?
*Generally, via DevOps principles
● Have CI/CD practices
● Shift Left
● Continuous Improvements
● Remove Silos
● Automate
● Shared Responsibilities
● Autonomous Teams
SRE
Guiding Principles
● You can’t improve what you can’t measure
○ SLI, SLO, Error Budget
● Embracing Risk
● Eliminate Toil
● Implementation-agnostic monitoring
● Automate
● Simplicity*
Relation between SRE & DevOps
Agile Manifesto: Scrum, Kanban, Lean, XP
DevOps: SRE, Systems Engineer, Platform Engineer, Automation Engineer, Cloud x Engineer
SRE vs DevOps
● Non-competing
● Class SRE implements interface DevOps
https://goo.gl/CKv3tV
● SRE is part of the whole DevOps umbrella
○ SRE defines the practices that DevOps suggests
○ And MORE
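
The "class SRE implements interface DevOps" bullet can be read almost literally as code. The following is a minimal sketch of that metaphor, not something taken from the slides: the method names are hypothetical, chosen from the DevOps principles listed earlier, and the strings returned by SRE are one illustrative mapping onto the SRE practices named in this deck.

```python
from abc import ABC, abstractmethod


class DevOps(ABC):
    """Hypothetical interface: DevOps says WHAT to do, not HOW to do it."""

    @abstractmethod
    def automate(self) -> str: ...

    @abstractmethod
    def continuous_improvement(self) -> str: ...

    @abstractmethod
    def shared_responsibility(self) -> str: ...


class SRE(DevOps):
    """One concrete implementation of the interface (illustrative mapping only)."""

    def automate(self) -> str:
        return "eliminate toil"

    def continuous_improvement(self) -> str:
        return "measure SLIs against SLOs and track the error budget"

    def shared_responsibility(self) -> str:
        return "incident management and postmortems"

    # "And MORE": practices that go beyond what the interface asks for.
    def chaos_engineering(self) -> str:
        return "inject failures deliberately to test resilience"


if __name__ == "__main__":
    team = SRE()
    print(isinstance(team, DevOps))  # SRE satisfies the DevOps interface
    print(team.automate())
```

The abstract base class cannot be instantiated on its own, which mirrors the point of the slide: DevOps suggests practices, and SRE is one way of making them concrete.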
SRE Details
SLI: Service Level Indicator
Availability, Throughput, Error Rate
SLO: Service Level Objective
E.g.: 99% availability
Error Budget
The amount of error that your service can accumulate over a certain period of time.
In effect, the limit of what users will tolerate before they become unhappy.
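
The relationship between the three terms can be sketched with a little arithmetic. This is a minimal illustration, assuming a request-based availability SLI and the 99% SLO from the example above; the function names and the sample request counts are hypothetical.

```python
def availability_sli(good_events: int, total_events: int) -> float:
    """SLI: fraction of events (e.g. requests) that were served successfully."""
    if total_events == 0:
        return 1.0  # assumption: no traffic counts as fully available
    return good_events / total_events


def error_budget_events(slo: float, total_events: int) -> float:
    """Error budget: how many events may fail in the window without missing the SLO."""
    return (1.0 - slo) * total_events


def budget_remaining(slo: float, good_events: int, total_events: int) -> float:
    """Fraction of the error budget still unspent (negative means the SLO is missed)."""
    budget = error_budget_events(slo, total_events)
    if budget == 0:
        return 0.0  # a 100% SLO, or no traffic, leaves no budget to spend
    failed = total_events - good_events
    return 1.0 - failed / budget


if __name__ == "__main__":
    slo = 0.99  # the "99% availability" objective
    good, total = 995_000, 1_000_000  # hypothetical counts for one window
    print(f"SLI: {availability_sli(good, total):.4f}")
    print(f"Budget: {error_budget_events(slo, total):.0f} failed requests allowed")
    print(f"Budget remaining: {budget_remaining(slo, good, total):.0%}")
```

With these sample numbers, a 99% SLO over 1,000,000 requests allows roughly 10,000 failures, so 5,000 failed requests consume about half of the budget.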
SRE Practices
● Remove Toil
● Defining criticalities (System, downtime, unavailability)
● System design (DR, multi- or poly-cloud, multi-region deployments)
● Observability
● Chaos Engineering
SRE Practices
● SREs are the ONLY people who can touch the production environment
● MTTR (mean time to recovery), MTBF (mean time between failures)
● Incident Management
● Postmortems
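
As a rough sketch of how MTTR and MTBF might be derived from incident records: the Incident type, its field names, and the sample timestamps below are hypothetical, and MTBF is taken here as total uptime in the observation window divided by the number of failures, which is one common convention.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class Incident:
    started: datetime   # when the outage began
    resolved: datetime  # when service was restored


def total_downtime(incidents: list[Incident]) -> timedelta:
    """Sum of outage durations across all incidents."""
    return sum((i.resolved - i.started for i in incidents), timedelta())


def mttr(incidents: list[Incident]) -> timedelta:
    """Mean time to recovery: average outage duration."""
    if not incidents:
        raise ValueError("MTTR is undefined without incidents")
    return total_downtime(incidents) / len(incidents)


def mtbf(incidents: list[Incident], window_start: datetime,
         window_end: datetime) -> timedelta:
    """Mean time between failures: uptime in the window per failure."""
    if not incidents:
        raise ValueError("MTBF is undefined without incidents")
    uptime = (window_end - window_start) - total_downtime(incidents)
    return uptime / len(incidents)


if __name__ == "__main__":
    start = datetime(2024, 1, 1)
    end = start + timedelta(days=30)
    incidents = [
        Incident(datetime(2024, 1, 5, 10, 0), datetime(2024, 1, 5, 10, 30)),
        Incident(datetime(2024, 1, 20, 2, 0), datetime(2024, 1, 20, 3, 30)),
    ]
    print("MTTR:", mttr(incidents))
    print("MTBF:", mtbf(incidents, start, end))
```

A lower MTTR and a higher MTBF both indicate a more reliable service; incident management mainly targets the former, and system design and chaos engineering the latter.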

Editor's Notes

  • #7: We define toil as mundane, repetitive operational work providing no enduring value, which scales linearly with service growth. A complex system that works necessarily evolved from a simple system that works. "Simplicity" goes into this topic in detail.