SlideShare a Scribd company logo
3
Most read
8
Most read
10
Most read
Introduction to SRE
SLO, SLA and SLI
ganesh@ganeshniyer.com
ganesh.vigneswara@gmail.com,
http://ganeshniyer.com
Dr Ganesh Neelakanta Iyer
Site Reliability Engineering
Wikipedia
• Discipline that
incorporates aspects of
software engineering and
applies them to
infrastructure and
operations problems. The
main goals are to create
scalable and highly
reliable software systems
Google
• SRE is what you get when
you treat operations as if
it’s a software problem
Dr Ganesh Neelakanta Iyer 2
Dr Ganesh Neelakanta Iyer
https://twitter.com/DEVOPSINST/status/1273678804951666688/photo/1
Never tell me the odds!!!
4https://twitter.com/nobl9inc/status/1272974728584663049/photo/1
Dr Ganesh Neelakanta Iyer
https://twitter.com/sureshgp/status/1273643362617159680/photo/1
What is common for all tech companies?
6
Users…
Whether you’re Google’s search engine, serving a billion
active monthly users who interact with your service for free,
or Salesforce, with 3.75 million paying subscribers, building
a technology product means serving people.
https://www.atlassian.com/incident-management/kpis/sla-vs-slo-vs-sli
Hence, it is important for the companies to understand SLOs,
SLIs and SLAs - three initialisms that represent the promises
we make to our users, the internal objectives that help us keep
those promises, and the trackable measurements that tell us
how we’re doing.
https://www.atlassian.com/incident-management/kpis/sla-vs-slo-vs-sli
The goal
• Get everybody—vendor and client alike—on the same
page about system performance
– How often will your systems be available?
– How quickly will your team respond if the system goes down?
– What kind of promises are you making about speed and
functionality?
– Users want to know—and so you need SLAs, SLOs, and SLIs
Dr Ganesh Neelakanta Iyer 10
https://www.atlassian.com/incident-management/kpis/sla-vs-slo-vs-sli
Dr Ganesh Neelakanta Iyer 11
https://www.atlassian.com/incident-management/kpis/sla-vs-slo-vs-sli
So what is the difference?
SLI
What we’ve chosen to measure progress towards our goal.
E.g., “Latency of a request.”
SLO
Stated objective of the SLI – what we’re trying to accomplish for either
ourselves or the customer.
E.g., “99.5% of requests will be completed in 5ms.”
SLA
Contract explicitly stating the consequences of failing to achieve your
defined SLOs.
E.g., “If 99% of your system requests aren’t completed in 5ms, you get a
refund.”
Dr Ganesh Neelakanta Iyer 12
https://www.circonus.com/2018/07/a
-guide-to-service-level-objectives/
SLI
• SLI are the parameters which indicates the successful
transactions, requests served by the service over the
predefined intervals of time
• These parameters allows to measure much required
performance and availability of the service
• Measuring these parameters also enables to improve them
gradually
• Key Examples are:
– Availability/Uptime of the service.
– Number of successful transactions/requests.
– Consistency and durability of the data
Dr Ganesh Neelakanta Iyer 13https://www.anshulpatel.in/post/sre_sli_sla_slo/
SLO
• SLO defines the acceptable downtime of the service
• For multiple components of the service, there can be different
parameters which defines the acceptable downtime
• It is common pattern to start with low SLO and gradually
increase it
• Key Examples are:
– Durability of disks should be 99.9%.
– Availability of service should be 99.95%
– Service should successfully serve 99.999% requests/transactions
Dr Ganesh Neelakanta Iyer 14https://www.anshulpatel.in/post/sre_sli_sla_slo/
SLA
• SLA defines the penalty that service provider should pay
in an event of service unavailability for pre-defined period
of time
• Service provider should clearly define the failure factors
for which they will be accountable(Domain of
responsibility)
• Key Examples are:
– Partial refund of service subscription fee.
– Additional subscription time added for free
Dr Ganesh Neelakanta Iyer 15https://www.anshulpatel.in/post/sre_sli_sla_slo/
References
https://landing.google.com/sre/books/
Dr Ganesh Neelakanta Iyer 16
Dr Ganesh Neelakanta Iyer
ganesh@ganeshniyer.com
ganesh.vigneswara@gmail.com 17

More Related Content

PDF
Site Reliability Engineering: An Enterprise Adoption Story (an ITSM Academy W...
PDF
Overview of Site Reliability Engineering (SRE) & best practices
PDF
Getting started with Site Reliability Engineering (SRE)
PPTX
SRE 101 (Site Reliability Engineering)
PDF
Service Level Terminology : SLA ,SLO & SLI
PPTX
A Crash Course in Building Site Reliability
PPTX
Site reliability engineering
PPTX
ChatGPT Deck.pptx
Site Reliability Engineering: An Enterprise Adoption Story (an ITSM Academy W...
Overview of Site Reliability Engineering (SRE) & best practices
Getting started with Site Reliability Engineering (SRE)
SRE 101 (Site Reliability Engineering)
Service Level Terminology : SLA ,SLO & SLI
A Crash Course in Building Site Reliability
Site reliability engineering
ChatGPT Deck.pptx

What's hot (20)

PPTX
Implementing SRE practices: SLI/SLO deep dive - David Blank-Edelman - DevOpsD...
PPTX
SRE-iously! Reliability!
PDF
Site Reliability Engineering (SRE) - Tech Talk by Keet Sugathadasa
PDF
Building an SRE Organization @ Squarespace
PPTX
What is Site Reliability Engineering (SRE)
PDF
SRE Demystified - 05 - Toil Elimination
PPTX
Site reliability engineering - Lightning Talk
PPTX
SRE-iously! Defining the Principles, Habits, and Practices of Site Reliabilit...
PDF
SRE 101
PPTX
Site (Service) Reliability Engineering
PDF
SRE in Startup
PPTX
SRE (service reliability engineer) on big DevOps platform running on the clou...
PDF
DevOps & SRE at Google Scale
PDF
Cloud-Native Observability
PPTX
SRE vs DevOps
PPTX
DevOps Challenges and Best Practices
PDF
DevOps Powerpoint Presentation Slides
PDF
DevSecOps
PDF
DevSecOps Implementation Journey
PDF
How to implement DevOps in your Organization
Implementing SRE practices: SLI/SLO deep dive - David Blank-Edelman - DevOpsD...
SRE-iously! Reliability!
Site Reliability Engineering (SRE) - Tech Talk by Keet Sugathadasa
Building an SRE Organization @ Squarespace
What is Site Reliability Engineering (SRE)
SRE Demystified - 05 - Toil Elimination
Site reliability engineering - Lightning Talk
SRE-iously! Defining the Principles, Habits, and Practices of Site Reliabilit...
SRE 101
Site (Service) Reliability Engineering
SRE in Startup
SRE (service reliability engineer) on big DevOps platform running on the clou...
DevOps & SRE at Google Scale
Cloud-Native Observability
SRE vs DevOps
DevOps Challenges and Best Practices
DevOps Powerpoint Presentation Slides
DevSecOps
DevSecOps Implementation Journey
How to implement DevOps in your Organization
Ad

Similar to SRE Demystified - 01 - SLO SLI and SLA (20)

PDF
Delivering A Great End User Experience
PPT
Capturing Measurable Non Functional Requirements
DOC
Jeevitha_Java_3.1_Yrs
DOC
shashank_project_Mgmt
PDF
Solution Manual for Software Engineering 10th Edition Sommerville 0133943038 ...
DOC
shashank_QA_02_june_2016
PDF
Mastering performance testing_ a comprehensive guide to optimizing applicatio...
PDF
Stay Two Steps Ahead of Your Auditor
PDF
Simplifying Analytics - by Novoniel Deb
PPTX
3. Rebalancing Support and Innovation_SAP UEM by KNOA Data Analysis_Jeff Shin
PDF
ITIL Implementation – Value addition to the IT industry
PDF
DMG Consulting's Donna Fluss Discusses Desktop Automation and Analytics - Opt...
PDF
OpenSpan Webinar Sept. 17th with Donna Fluss, DMG Consulting - Desktop Analyt...
PDF
Download Software Engineering 10th Edition Sommerville Solutions Manual ebook...
PDF
Software Engineering 10th Edition Sommerville Solutions Manual
DOC
Ramasundararajan MV_Resume
PDF
Project on multiplex ticket bookingn system globsyn2014
PPTX
Software developement life cycle ppt
PDF
Reliability Engineering Q&A - LCE
PPTX
Chapter 1 - Requirement Engineering
Delivering A Great End User Experience
Capturing Measurable Non Functional Requirements
Jeevitha_Java_3.1_Yrs
shashank_project_Mgmt
Solution Manual for Software Engineering 10th Edition Sommerville 0133943038 ...
shashank_QA_02_june_2016
Mastering performance testing_ a comprehensive guide to optimizing applicatio...
Stay Two Steps Ahead of Your Auditor
Simplifying Analytics - by Novoniel Deb
3. Rebalancing Support and Innovation_SAP UEM by KNOA Data Analysis_Jeff Shin
ITIL Implementation – Value addition to the IT industry
DMG Consulting's Donna Fluss Discusses Desktop Automation and Analytics - Opt...
OpenSpan Webinar Sept. 17th with Donna Fluss, DMG Consulting - Desktop Analyt...
Download Software Engineering 10th Edition Sommerville Solutions Manual ebook...
Software Engineering 10th Edition Sommerville Solutions Manual
Ramasundararajan MV_Resume
Project on multiplex ticket bookingn system globsyn2014
Software developement life cycle ppt
Reliability Engineering Q&A - LCE
Chapter 1 - Requirement Engineering
Ad

More from Dr Ganesh Iyer (20)

PDF
SRE Demystified - 16 - NALSD - Non-Abstract Large System Design
PDF
SRE Demystified - 14 - SRE Practices overview
PDF
SRE Demystified - 13 - Docs that matter -2
PDF
SRE Demystified - 12 - Docs that matter -1
PDF
SRE Demystified - 11 - Release management-2
PDF
SRE Demystified - 10 - Release management-1
PDF
SRE Demystified - 09 - Simplicity
PDF
SRE Demystified - 07 - Practical Alerting
PDF
SRE Demystified - 06 - Distributed Monitoring
PDF
SRE Demystified - 04 - Engagement Model
PDF
SRE Demystified - 03 - Choosing SLIs and SLOs
PDF
Machine Learning for Statisticians - Introduction
PDF
Making Decisions - A Game Theoretic approach
PDF
Cloud and Industry4.0
PDF
Game Theory and Engineering Applications
PDF
Machine Learning and its Applications
PDF
How to become a successful entrepreneur
PDF
Dockers and kubernetes
PDF
Containerization Principles Overview for app development and deployment
PDF
Game Theory and Engineering Applications
SRE Demystified - 16 - NALSD - Non-Abstract Large System Design
SRE Demystified - 14 - SRE Practices overview
SRE Demystified - 13 - Docs that matter -2
SRE Demystified - 12 - Docs that matter -1
SRE Demystified - 11 - Release management-2
SRE Demystified - 10 - Release management-1
SRE Demystified - 09 - Simplicity
SRE Demystified - 07 - Practical Alerting
SRE Demystified - 06 - Distributed Monitoring
SRE Demystified - 04 - Engagement Model
SRE Demystified - 03 - Choosing SLIs and SLOs
Machine Learning for Statisticians - Introduction
Making Decisions - A Game Theoretic approach
Cloud and Industry4.0
Game Theory and Engineering Applications
Machine Learning and its Applications
How to become a successful entrepreneur
Dockers and kubernetes
Containerization Principles Overview for app development and deployment
Game Theory and Engineering Applications

Recently uploaded (20)

PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Modernizing your data center with Dell and AMD
PDF
Machine learning based COVID-19 study performance prediction
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Approach and Philosophy of On baking technology
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
Big Data Technologies - Introduction.pptx
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PPT
Teaching material agriculture food technology
PPTX
Cloud computing and distributed systems.
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Spectral efficient network and resource selection model in 5G networks
Modernizing your data center with Dell and AMD
Machine learning based COVID-19 study performance prediction
“AI and Expert System Decision Support & Business Intelligence Systems”
Chapter 3 Spatial Domain Image Processing.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Approach and Philosophy of On baking technology
Network Security Unit 5.pdf for BCA BBA.
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Big Data Technologies - Introduction.pptx
Encapsulation_ Review paper, used for researhc scholars
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Teaching material agriculture food technology
Cloud computing and distributed systems.
NewMind AI Weekly Chronicles - August'25 Week I
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Building Integrated photovoltaic BIPV_UPV.pdf

SRE Demystified - 01 - SLO SLI and SLA

  • 1. Introduction to SRE SLO, SLA and SLI ganesh@ganeshniyer.com ganesh.vigneswara@gmail.com, http://ganeshniyer.com Dr Ganesh Neelakanta Iyer
  • 2. Site Reliability Engineering Wikipedia • Discipline that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems Google • SRE is what you get when you treat operations as if it’s a software problem Dr Ganesh Neelakanta Iyer 2
  • 3. Dr Ganesh Neelakanta Iyer https://twitter.com/DEVOPSINST/status/1273678804951666688/photo/1
  • 4. Never tell me the odds!!! 4https://twitter.com/nobl9inc/status/1272974728584663049/photo/1
  • 5. Dr Ganesh Neelakanta Iyer https://twitter.com/sureshgp/status/1273643362617159680/photo/1
  • 6. What is common for all tech companies? 6
  • 8. Whether you’re Google’s search engine, serving a billion active monthly users who interact with your service for free, or Salesforce, with 3.75 million paying subscribers, building a technology product means serving people. https://www.atlassian.com/incident-management/kpis/sla-vs-slo-vs-sli
  • 9. Hence, it is important for the companies to understand SLOs, SLIs and SLAs - three initialisms that represent the promises we make to our users, the internal objectives that help us keep those promises, and the trackable measurements that tell us how we’re doing. https://www.atlassian.com/incident-management/kpis/sla-vs-slo-vs-sli
  • 10. The goal • Get everybody—vendor and client alike—on the same page about system performance – How often will your systems be available? – How quickly will your team respond if the system goes down? – What kind of promises are you making about speed and functionality? – Users want to know—and so you need SLAs, SLOs, and SLIs Dr Ganesh Neelakanta Iyer 10 https://www.atlassian.com/incident-management/kpis/sla-vs-slo-vs-sli
  • 11. Dr Ganesh Neelakanta Iyer 11 https://www.atlassian.com/incident-management/kpis/sla-vs-slo-vs-sli
  • 12. So what is the difference? SLI What we’ve chosen to measure progress towards our goal. E.g., “Latency of a request.” SLO Stated objective of the SLI – what we’re trying to accomplish for either ourselves or the customer. E.g., “99.5% of requests will be completed in 5ms.” SLA Contract explicitly stating the consequences of failing to achieve your defined SLOs. E.g., “If 99% of your system requests aren’t completed in 5ms, you get a refund.” Dr Ganesh Neelakanta Iyer 12 https://www.circonus.com/2018/07/a -guide-to-service-level-objectives/
  • 13. SLI • SLI are the parameters which indicates the successful transactions, requests served by the service over the predefined intervals of time • These parameters allows to measure much required performance and availability of the service • Measuring these parameters also enables to improve them gradually • Key Examples are: – Availability/Uptime of the service. – Number of successful transactions/requests. – Consistency and durability of the data Dr Ganesh Neelakanta Iyer 13https://www.anshulpatel.in/post/sre_sli_sla_slo/
  • 14. SLO • SLO defines the acceptable downtime of the service • For multiple components of the service, there can be different parameters which defines the acceptable downtime • It is common pattern to start with low SLO and gradually increase it • Key Examples are: – Durability of disks should be 99.9%. – Availability of service should be 99.95% – Service should successfully serve 99.999% requests/transactions Dr Ganesh Neelakanta Iyer 14https://www.anshulpatel.in/post/sre_sli_sla_slo/
  • 15. SLA • SLA defines the penalty that service provider should pay in an event of service unavailability for pre-defined period of time • Service provider should clearly define the failure factors for which they will be accountable(Domain of responsibility) • Key Examples are: – Partial refund of service subscription fee. – Additional subscription time added for free Dr Ganesh Neelakanta Iyer 15https://www.anshulpatel.in/post/sre_sli_sla_slo/
  • 17. Dr Ganesh Neelakanta Iyer ganesh@ganeshniyer.com ganesh.vigneswara@gmail.com 17