SlideShare a Scribd company logo
Secrets of
top-performing
DevOps teams
at Google
and beyond
Nikhil Kaul
Nick Pan
cloud.google.com/
devops
Agenda
● Defining DevOps
● How to measure DevOps performance
● New insights in 2019
● Balancing speed with stability
● How do we improve
Defining DevOps
BusinessConcept Development Operations Market
Agile
solves this
Greater agility with DevOps
DevOps
solves this
Measuring DevOps performance
Speed Stability
Deployment
frequency
Lead time
for changes
Time to restore
service
Change
fail rate
Aspect of Software
Delivery Performance*
Elite High Medium Low
Deployment frequency On demand
(multiple deploys
per day)
Between once per
day and once per
week
Between once per
week and once per
month
Between once per
month and once
every six months
Lead time for changes Less than one day
Between one day
and one week
Between one week
and one month
Between one
month and six
months
Time to restore service Less than one hour Less than one day* Less than one day
Between one week
and one month
Change failure rate 0-15%b
0-15%a, b
0-15%a
46-60%
Speed and stability
are both possible
Adding availability
to metrics for
operational
performance
Availability is about ensuring a product
or service is available and accessible
Availability measures are significantly
correlated with performance
Availability reflects how well teams
define their availability targets and
learn from outages
Agenda
● Defining DevOps
● How to measure DevOps performance
● New insights in 2019
● Balancing speed with stability
● How do we improve
Industry
continues to
improve
2018 2019
48%
High
performers
7%
Elite
37%
Medium
performers
15%
Low
performers
20%
Elite
23%
High
performers
44%
Medium
performers
12%
Low
performers
Heavyweight
process & control
make enterprises
low performers…
20%
Low performers have highest concentration
of fully proprietary software
Low Medium High
A mix of proprietary tools, open source, and
commercial off-the-shelf (COTS) software
Mainly open source and COTS, heavily customized
Mainly open source and COTS, with little customization
Primarily COTS packaged software
Primarily developed in-house and proprietary
to my organization
Primarily open source, heavily customized
Primarily open source, with little customization
30% 34% 32%
Elite
33%
17% 8% 7% 10%
14% 21% 18% 20%
8% 12% 8% 4%
6% 5% 6%
6% 7% 5% 12%
5% 12% 24% 15%
Growing role of open source in
DevOps
Skaffold
Speed up local build, test,
deploy workflow.
Kubernetes
For management of
containerized software
Istio
For services management
Spinnaker
An open-source
multi-cloud CD platform
Tekton
Open-source, vendor-neutral
components for running CI/CD
workloads on Kubernetes to
standardize implementation
across the industry and increase
workload portability.
“Increasingly, Tekton is
becoming the universal, if
you like, pipeline glue which
Jenkins, Jenkins X, and
possibly, hopefully one day,
Spinnaker, can all use”
James Strachan,
CloudBees distinguished engineer
Tekton
Events UI Notifications
Kubernetes
Pipelines Tasks
Jenkins X
Elite performers are 24x
more likely to have met
all cloud characteristics
Only 29% of
respondents met all
5 characteristics of
cloud computing
On-demand self-service
Broad network access
Resource pooling
Rapid elasticity
Measured service
Clear wins from cloud
Fast autoscaling Cost visibility Security
Full-stack
serverless
on Google Cloud
for auto-scaling
ML & AI
Compute
Database &
Storage
Data Analytics
Messaging
DevOps
Smart
assistants &
chat
Cloud: Slashing cost of
research at Broad Institute
First GCP run Task splitting Preemptible VMs Persistent disks Algorithm tuning
0
0.25
0.5
0.75
1
Relativecost
Shopify: a story of security at scale
Every day 6,000 built container
images are deployed securely.
Over 330,000 images stored in
Container Registry
Agenda
● Defining DevOps
● How to measure DevOps performance
● New insights in 2019
● Balancing speed with stability
● How do we improve
Ship as
often as
possible!
Don’t break
anything,
ever!
Why not
both?
● Humans have never created a perfectly reliable system.
● Nature has not created a perfectly reliable system.
● If systems that have evolved under competitive
pressure over hundreds, or billions, of years aren’t
perfectly reliable, then perhaps perfect reliability isn’t
the best target?
While deciding
availability remember
perfection is unrealistic
Balancing speed with stability
Select availability level based on
Balancing speed with stability
Service level expected by users Does this service tie to our
or customer’s revenue?
Service for customers
vs enterprises
Paid vs free service What availability
competitors provide
Cost
● An objective metric that determines how unreliable the
service is allowed to be within a single quarter.
● Product management & SRE determine availability
target.
● If there is error budget remaining – new releases can
be pushed.
Error budget helps balance
availability with velocity
100 - availability target =
Error budget
Balancing speed with stability
Example
90% 36.5 days 9 days 3 days
95% 18.25 days 4.5 days 1.5 days
99% 3.65 days 21.6 hours 7.2 hours
99.5% 1.83 days 10.8 hours 3.6 hours
99.9% 8.76 hours 2.16 hours 43.2 minutes
99.95% 4.38 hours 1.08 hours 21.6 minutes
99.99% 52.6 minutes 12.96 minutes 4.32 minutes
99.999% 5.26 minutes 1.30 minutes 25.9 seconds
Availability level
Allowed unavailability window
Balancing speed with stability
landing.google.com/sre
/books/
Agenda
● Defining DevOps
● How to measure DevOps performance
● New insights in 2019
● Balancing speed with stability
● How do we improve
2 research models to drive
improvements
Performance
model
Productivity
model
Performance model
Clear change
process
Heavyweight
change process
Code
maintainability
Loosely coupled
architecture
Monitoring
Trunk-based
development
Deployment
automation
Culture of psychological
safety
Continuous
integration
Automated testing
Continuous delivery
Disaster
recovery testing
Cloud
Burnout
SDO Performance
Software delivery performance
Availability
Organizational
performance
Industry (control)
Enterprise (control)
(-)
(-)
(-)
3 guiding principles
1.
Increase developer
productivity
2.
Shift left
3.
Automated
Compliance
Technical Practices- Demo
Cloud
Code
GCR vulnerability
scanning
Binary
authorization
Trusted
images
Kubernetes
Engine
Cloud
Build
GitHub
Untrusted
images
Audit log
Local
development
Remote
development
Fixes
Driving productivity
improvements
Useful, easy to
use tools
Internal search
External search
Culture of psychological
safety
Productivity
Technical debt Work recovery
Burnout
Code maintainability
Loosely coupled
architecture
Monitoring
Years of experience
(control)
(-)
(-)(-)(-)
(-)
n.s.
Takeaway
The industry continues to improve,
particularly among the highest performers.
DevOps matters for both commercial goals
and non-commercial goals
Cloud continues to be a differentiator for
elite performers
100% availability is unrealistic & undesirable
Error budgets can help with measuring
availability
Download
the report
cloud.google.com/
devops
Questions?

More Related Content

PPTX
Modernizing Your Testing Tools
PPTX
Building Better Collaboration Between Development and Testing in a DevOps World
PPTX
BizDevOps – Delivering Business Value Quickly at Scale
PDF
How to Use KPIs in an Agile Delivery Environment
PPT
Linkedin Resource Focus
PDF
Saying Goodbye to Quality Center
PPTX
TUI & qTest: Why, How and Where Next
PPTX
Making the Switch from HP Quality Center to qTest
Modernizing Your Testing Tools
Building Better Collaboration Between Development and Testing in a DevOps World
BizDevOps – Delivering Business Value Quickly at Scale
How to Use KPIs in an Agile Delivery Environment
Linkedin Resource Focus
Saying Goodbye to Quality Center
TUI & qTest: Why, How and Where Next
Making the Switch from HP Quality Center to qTest

What's hot (20)

PPTX
Knowing Where to Tap
PPTX
Kick-Starting BDD for Your Organization
PDF
Agile metrics - Measure and Improve
PDF
Quality Jam 2017: Jesse Reed & Kyle McMeekin "Test Case Management & Explorat...
PDF
Agile Efficacy Presentation
PPTX
Steve Lawrence - Agile Metrics
PPTX
Breakdowns and Breakthroughs: Handoffs Between Sales and Marketing
PDF
2. garansys loves estimates for agile projects alexander vermeulen
PDF
Lightning talk Accenture
PDF
Andy singleton continuous delivery-fcb - nov 2014
PPT
Quantre Initial Solutions
PPTX
Efficient Performance Test Automation - Opitmizing the Jenkins Pipeline
PPTX
The Business Case for DevOps - Justifying the Journey
POTX
Simple Lean Agile KPIs
PPT
Agile Metrics
PDF
PPTX
Agile Metrics, Value, and Softwre
PPTX
State of continuous delivery in 2015 - Minsk 15-5-2015
PPTX
RGS - Project Overview 6 Months - Cloud.pptx
PDF
Application Transformation Workshop
Knowing Where to Tap
Kick-Starting BDD for Your Organization
Agile metrics - Measure and Improve
Quality Jam 2017: Jesse Reed & Kyle McMeekin "Test Case Management & Explorat...
Agile Efficacy Presentation
Steve Lawrence - Agile Metrics
Breakdowns and Breakthroughs: Handoffs Between Sales and Marketing
2. garansys loves estimates for agile projects alexander vermeulen
Lightning talk Accenture
Andy singleton continuous delivery-fcb - nov 2014
Quantre Initial Solutions
Efficient Performance Test Automation - Opitmizing the Jenkins Pipeline
The Business Case for DevOps - Justifying the Journey
Simple Lean Agile KPIs
Agile Metrics
Agile Metrics, Value, and Softwre
State of continuous delivery in 2015 - Minsk 15-5-2015
RGS - Project Overview 6 Months - Cloud.pptx
Application Transformation Workshop
Ad

Similar to [webinar] Secrets of Top-performing DevOps Teams -- at Google and Beyond (20)

PPTX
DevOps For Everyone: Bringing DevOps Success to Every App and Every Role in y...
PDF
Pivotal korea transformation_strategy_seminar_enterprise_dev_ops_20160630_v1.0
PPTX
Agile & DevOps - It's all about project success
PDF
GDG Cloud Southlake #2 Jez Humble DevOps Transformation:Building & Scaling H...
PPT
IBM Innovate 2013 Session: DevOps 101
DOCX
DevOps, Agile and Continuous Delivery: Creating a repeatable and reliable del...
PDF
Getting Started with DevOps
PPTX
What is DevOps? What is DevOps CoE?
PDF
Incluit - Studio DevOps
PPTX
DevOps State of the Union 2015
PDF
Intro to DevOps
PDF
DevOps, Common use cases, Architectures, Best Practices
PDF
PPTX
PDF
DevOps in Regulated Industries: Speed with Compliance
PDF
Understanding the Role of DevOps in Modern Software Development.pdf
PDF
DevOps - Top Trends In 2019
PPTX
Measure and Accelerate Your Software Delivery
PDF
DevOps for absolute beginners
PPTX
DevOps Culture transformation in Modern Software Delivery
DevOps For Everyone: Bringing DevOps Success to Every App and Every Role in y...
Pivotal korea transformation_strategy_seminar_enterprise_dev_ops_20160630_v1.0
Agile & DevOps - It's all about project success
GDG Cloud Southlake #2 Jez Humble DevOps Transformation:Building & Scaling H...
IBM Innovate 2013 Session: DevOps 101
DevOps, Agile and Continuous Delivery: Creating a repeatable and reliable del...
Getting Started with DevOps
What is DevOps? What is DevOps CoE?
Incluit - Studio DevOps
DevOps State of the Union 2015
Intro to DevOps
DevOps, Common use cases, Architectures, Best Practices
DevOps in Regulated Industries: Speed with Compliance
Understanding the Role of DevOps in Modern Software Development.pdf
DevOps - Top Trends In 2019
Measure and Accelerate Your Software Delivery
DevOps for absolute beginners
DevOps Culture transformation in Modern Software Delivery
Ad

More from Applitools (20)

PDF
Applitools Platform Pulse: What's New and What's Coming - July 2025
PDF
Code and No-Code Journeys: The Maintenance Shortcut
PDF
Code and No-Code Journeys: The Coverage Overlook
PDF
Creating Automated Tests with AI - Cory House - Applitools.pdf
PDF
Navigating EAA Compliance in Testing.pdf
PDF
AI-Assisted, AI-Augmented & Autonomous Testing
PDF
Code or No-Code Tests: Why Top Teams Choose Both
PDF
The ROI of AI-Powered Testing, presented by Applitools
PDF
Building No-code Autonomous E2E Tests_Applitools.pdf
PDF
Conquer 6 Testing Challenges_Applitools.pdf
PDF
Autonomous End-to-End Testing for Online Banking Applications Presented with ...
PDF
Playwright Visual Testing Best Practices, presented by Applitools
PDF
Cross-Browser and Cross-Device Testing | Applitools in Action
PDF
Advanced Debugging Techniques | Applitools in Action.pdf
PDF
AI-Powered Testing Strategies for the Seasonal Shopping Surge.pdf
PDF
Test Automation for Dynamic Applications _ Applitools in Action.pdf
PDF
Proven Approaches to AI-Powered E2E Testing.pdf
PDF
Applitools Autonomous 2.0 Sneak Peek.pdf
PDF
Building the Ideal CI-CD Pipeline_ Achieving Visual Perfection
PDF
Leveraging AI for Mobile App Testing on Real Devices | Applitools + Kobiton
Applitools Platform Pulse: What's New and What's Coming - July 2025
Code and No-Code Journeys: The Maintenance Shortcut
Code and No-Code Journeys: The Coverage Overlook
Creating Automated Tests with AI - Cory House - Applitools.pdf
Navigating EAA Compliance in Testing.pdf
AI-Assisted, AI-Augmented & Autonomous Testing
Code or No-Code Tests: Why Top Teams Choose Both
The ROI of AI-Powered Testing, presented by Applitools
Building No-code Autonomous E2E Tests_Applitools.pdf
Conquer 6 Testing Challenges_Applitools.pdf
Autonomous End-to-End Testing for Online Banking Applications Presented with ...
Playwright Visual Testing Best Practices, presented by Applitools
Cross-Browser and Cross-Device Testing | Applitools in Action
Advanced Debugging Techniques | Applitools in Action.pdf
AI-Powered Testing Strategies for the Seasonal Shopping Surge.pdf
Test Automation for Dynamic Applications _ Applitools in Action.pdf
Proven Approaches to AI-Powered E2E Testing.pdf
Applitools Autonomous 2.0 Sneak Peek.pdf
Building the Ideal CI-CD Pipeline_ Achieving Visual Perfection
Leveraging AI for Mobile App Testing on Real Devices | Applitools + Kobiton

Recently uploaded (20)

PPTX
Why Generative AI is the Future of Content, Code & Creativity?
PDF
Odoo Companies in India – Driving Business Transformation.pdf
PPTX
Transform Your Business with a Software ERP System
PPTX
Operating system designcfffgfgggggggvggggggggg
PPTX
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
PDF
Nekopoi APK 2025 free lastest update
PPTX
Reimagine Home Health with the Power of Agentic AI​
PDF
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
PDF
Product Update: Alluxio AI 3.7 Now with Sub-Millisecond Latency
PDF
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
PPTX
Introduction to Artificial Intelligence
PDF
Softaken Excel to vCard Converter Software.pdf
PDF
PTS Company Brochure 2025 (1).pdf.......
PPTX
Log360_SIEM_Solutions Overview PPT_Feb 2020.pptx
PDF
Designing Intelligence for the Shop Floor.pdf
PDF
top salesforce developer skills in 2025.pdf
PDF
wealthsignaloriginal-com-DS-text-... (1).pdf
PDF
Upgrade and Innovation Strategies for SAP ERP Customers
PPTX
assetexplorer- product-overview - presentation
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
Why Generative AI is the Future of Content, Code & Creativity?
Odoo Companies in India – Driving Business Transformation.pdf
Transform Your Business with a Software ERP System
Operating system designcfffgfgggggggvggggggggg
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
Nekopoi APK 2025 free lastest update
Reimagine Home Health with the Power of Agentic AI​
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
Product Update: Alluxio AI 3.7 Now with Sub-Millisecond Latency
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
Introduction to Artificial Intelligence
Softaken Excel to vCard Converter Software.pdf
PTS Company Brochure 2025 (1).pdf.......
Log360_SIEM_Solutions Overview PPT_Feb 2020.pptx
Designing Intelligence for the Shop Floor.pdf
top salesforce developer skills in 2025.pdf
wealthsignaloriginal-com-DS-text-... (1).pdf
Upgrade and Innovation Strategies for SAP ERP Customers
assetexplorer- product-overview - presentation
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...

[webinar] Secrets of Top-performing DevOps Teams -- at Google and Beyond

  • 4. Agenda ● Defining DevOps ● How to measure DevOps performance ● New insights in 2019 ● Balancing speed with stability ● How do we improve
  • 5. Defining DevOps BusinessConcept Development Operations Market Agile solves this Greater agility with DevOps DevOps solves this
  • 6. Measuring DevOps performance Speed Stability Deployment frequency Lead time for changes Time to restore service Change fail rate
  • 7. Aspect of Software Delivery Performance* Elite High Medium Low Deployment frequency On demand (multiple deploys per day) Between once per day and once per week Between once per week and once per month Between once per month and once every six months Lead time for changes Less than one day Between one day and one week Between one week and one month Between one month and six months Time to restore service Less than one hour Less than one day* Less than one day Between one week and one month Change failure rate 0-15%b 0-15%a, b 0-15%a 46-60% Speed and stability are both possible
  • 8. Adding availability to metrics for operational performance Availability is about ensuring a product or service is available and accessible Availability measures are significantly correlated with performance Availability reflects how well teams define their availability targets and learn from outages
  • 9. Agenda ● Defining DevOps ● How to measure DevOps performance ● New insights in 2019 ● Balancing speed with stability ● How do we improve
  • 11. Heavyweight process & control make enterprises low performers…
  • 12. 20% Low performers have highest concentration of fully proprietary software Low Medium High A mix of proprietary tools, open source, and commercial off-the-shelf (COTS) software Mainly open source and COTS, heavily customized Mainly open source and COTS, with little customization Primarily COTS packaged software Primarily developed in-house and proprietary to my organization Primarily open source, heavily customized Primarily open source, with little customization 30% 34% 32% Elite 33% 17% 8% 7% 10% 14% 21% 18% 20% 8% 12% 8% 4% 6% 5% 6% 6% 7% 5% 12% 5% 12% 24% 15%
  • 13. Growing role of open source in DevOps Skaffold Speed up local build, test, deploy workflow. Kubernetes For management of containerized software Istio For services management Spinnaker An open-source multi-cloud CD platform
  • 14. Tekton Open-source, vendor-neutral components for running CI/CD workloads on Kubernetes to standardize implementation across the industry and increase workload portability.
  • 15. “Increasingly, Tekton is becoming the universal, if you like, pipeline glue which Jenkins, Jenkins X, and possibly, hopefully one day, Spinnaker, can all use” James Strachan, CloudBees distinguished engineer Tekton Events UI Notifications Kubernetes Pipelines Tasks Jenkins X
  • 16. Elite performers are 24x more likely to have met all cloud characteristics
  • 17. Only 29% of respondents met all 5 characteristics of cloud computing On-demand self-service Broad network access Resource pooling Rapid elasticity Measured service
  • 18. Clear wins from cloud Fast autoscaling Cost visibility Security
  • 19. Full-stack serverless on Google Cloud for auto-scaling ML & AI Compute Database & Storage Data Analytics Messaging DevOps Smart assistants & chat
  • 20. Cloud: Slashing cost of research at Broad Institute First GCP run Task splitting Preemptible VMs Persistent disks Algorithm tuning 0 0.25 0.5 0.75 1 Relativecost
  • 21. Shopify: a story of security at scale Every day 6,000 built container images are deployed securely. Over 330,000 images stored in Container Registry
  • 22. Agenda ● Defining DevOps ● How to measure DevOps performance ● New insights in 2019 ● Balancing speed with stability ● How do we improve
  • 23. Ship as often as possible! Don’t break anything, ever! Why not both?
  • 24. ● Humans have never created a perfectly reliable system. ● Nature has not created a perfectly reliable system. ● If systems that have evolved under competitive pressure over hundreds, or billions, of years aren’t perfectly reliable, then perhaps perfect reliability isn’t the best target? While deciding availability remember perfection is unrealistic Balancing speed with stability
  • 25. Select availability level based on Balancing speed with stability Service level expected by users Does this service tie to our or customer’s revenue? Service for customers vs enterprises Paid vs free service What availability competitors provide Cost
  • 26. ● An objective metric that determines how unreliable the service is allowed to be within a single quarter. ● Product management & SRE determine availability target. ● If there is error budget remaining – new releases can be pushed. Error budget helps balance availability with velocity 100 - availability target = Error budget Balancing speed with stability
  • 27. Example 90% 36.5 days 9 days 3 days 95% 18.25 days 4.5 days 1.5 days 99% 3.65 days 21.6 hours 7.2 hours 99.5% 1.83 days 10.8 hours 3.6 hours 99.9% 8.76 hours 2.16 hours 43.2 minutes 99.95% 4.38 hours 1.08 hours 21.6 minutes 99.99% 52.6 minutes 12.96 minutes 4.32 minutes 99.999% 5.26 minutes 1.30 minutes 25.9 seconds Availability level Allowed unavailability window Balancing speed with stability
  • 29. Agenda ● Defining DevOps ● How to measure DevOps performance ● New insights in 2019 ● Balancing speed with stability ● How do we improve
  • 30. 2 research models to drive improvements Performance model Productivity model
  • 31. Performance model Clear change process Heavyweight change process Code maintainability Loosely coupled architecture Monitoring Trunk-based development Deployment automation Culture of psychological safety Continuous integration Automated testing Continuous delivery Disaster recovery testing Cloud Burnout SDO Performance Software delivery performance Availability Organizational performance Industry (control) Enterprise (control) (-) (-) (-)
  • 32. 3 guiding principles 1. Increase developer productivity 2. Shift left 3. Automated Compliance
  • 33. Technical Practices- Demo Cloud Code GCR vulnerability scanning Binary authorization Trusted images Kubernetes Engine Cloud Build GitHub Untrusted images Audit log Local development Remote development Fixes
  • 34. Driving productivity improvements Useful, easy to use tools Internal search External search Culture of psychological safety Productivity Technical debt Work recovery Burnout Code maintainability Loosely coupled architecture Monitoring Years of experience (control) (-) (-)(-)(-) (-) n.s.
  • 35. Takeaway The industry continues to improve, particularly among the highest performers. DevOps matters for both commercial goals and non-commercial goals Cloud continues to be a differentiator for elite performers 100% availability is unrealistic & undesirable Error budgets can help with measuring availability