SlideShare a Scribd company logo
What We Learned
from Three Years
Sciencing the Crap
Out of Devops
Jez Humble @jezhumble
Nicole Forsgren, PhD @nicolefv
© 2016 DevOps Research and Assessments LLC. CC-BY-SA
Outline
How to make your data suck less
● Writing good survey questions
● Making sure the survey questions are good - with
SCIENCE
What we found… that we did (AND didn’t) expect
Things about Continuous Delivery
Things about Management
Not all data is created equal
Who here thinks surveys are sh*t?
[Nicole should probably turn around]
Not all data is created equal
Who here thinks surveys are sh*t?
[Nicole should probably turn around]
Who here LOVES the data from their log files?
And who has seen sh*t data in a log file?
What is a Latent Construct?
We use
PSYCHOMETRICS
to make our survey data good*
*or give us a reasonable assurance that it’s telling us what we
think it’s telling us (& some of this can also apply to your log data)
Psychometrics includes:
Construct creation (manual)
● When possible: use previously validated constructs
● Based on definitions and theory, carefully and precisely
worded, card sorting task, pilot tested
Construct evaluation (statistics)
● Establishing Validity: discriminant and convergent
● Establishing Reliability
Psychometrics Writing Example:
Culture
● Does it matter to our study?
○ More than just intuition?
● What KIND of culture?
○ National identity and norms
○ Adaptive culture
○ Value learning (2014 study)
○ Value information flow and trust (2014 and 2015
studies -- Westrum culture)
Psychometrics Writing Example:
Culture
● Does it matter to our study?
○ More than just intuition?
● What KIND of culture?
○ National identity and norms
○ Adaptive culture
○ Value learning (2014 study)
○ Value information flow and trust (2014 and 2015
studies -- Westrum culture)
Westrum typology
Pathological
Power-oriented
Bureaucratic
Rule-oriented
Generative
Performance-oriented
Low cooperation Modest cooperation High cooperation
Messengers shot Messengers neglected Messengers trained
Responsibilities shirked Narrow responsibilities Risks are shared
Bridging discouraged Bridging tolerated Bridging encouraged
Failure leads to scapegoating Failure leads to justice Failure leads to inquiry
Novelty crushed Novelty leads to problems Novelty implemented
Try writing
items
yourself!
Use strong
statements
with clear
language.
Westrum Culture Items
● On my team, information is actively sought.
● On my team, failures are learning opportunities, and
messengers of them are not punished.
● On my team, responsibilities are shared.
● On my team, cross-functional collaboration is
encouraged and rewarded.
● On my team, failure causes inquiry.
● On my team, new ideas are welcomed.
Found to
be valid &
reliable
Predictive of
IT Performance &
Organizational
Performance
Psychometrics Analysis Example:
Notification of Failure
At my organization…
● We are primarily notified of failures by reports from
customers.
● We are primarily notified of failures by the NOC.
● We get failure alerts from logging and monitoring
systems.
● We monitor system health based on threshold warnings
(ex. CPU exceeds 100%).
● We monitor system health based on rate-of-change
warnings (ex. CPU usage has increased by 25% over the
last 10 minutes).
Original in
2014, but
there was a
surprise.
Can you
spot it?
Psychometrics Analysis Example:
Notification of Failure
At my organization…
● We are primarily notified of failures by reports from
customers.
● We are primarily notified of failures by the NOC.
● We get failure alerts from logging and monitoring
systems.
● We monitor system health based on threshold warnings
(ex. CPU exceeds 100%).
● We monitor system health based on rate-of-change
warnings (ex. CPU usage has increased by 25% over the
last 10 minutes).
Notification
from INSIDE
Notification
from
OUTSIDE
More data tests!
Plus, we test to make sure the survey doesn’t have other
problems.
● Common method variance (CMV) (aka CMB for Bias)
● Early vs. late responders
● Survey drop-off rates and bias
Okay NOW we can look at the data
IT performance matters!
“Firms with high-performing IT organizations were twice as
likely to exceed their profitability, market share and
productivity goals.”
http://bit.ly/2014-devops-report/
http://bit.ly/2015-devops-report/
IT performance
lead time for changes
release frequency
time to restore service
change fail rate
Continuous delivery
Okay NOW we can look at the data.
And now with added math
Forsgren, N., J. Humble (2016).
"The Role of Continuous Delivery in
IT and Organizational
Performance." In the Proceedings
of the Western Decision Sciences
Institute (WDSI) 2016, Las Vegas,
NV. Available at SSRN: http://ssrn.
com/abstract=2681909
some surprises
What’s not strongly correlated w/ ITPerf?
Third-party scripts
Homegrown scripts
Commercial configuration management tools
Open source
Golden images
Manual configuration management
What’s not strongly correlated w/ ITPerf?
Third-party scripts
Homegrown scripts
Commercial configuration management tools
Open source
Golden images
Manual configuration management
Which of these measure effective test practices?
Developers primarily create & maintain acceptance tests
QA primarily create & maintain acceptance tests
Primarily created & maintained by outsourced party
When automated tests pass, I’m confident the software is releasable
Test failures are likely to indicate a real defect
It’s easy for developers to fix acceptance tests
Developers share a common pool of test servers to reproduce failures
Developers create on demand test environments
Developers use their own dev environments to reproduce failures
Which of these measure effective test practices?
Developers primarily create & maintain acceptance tests
QA primarily create & maintain acceptance tests
Primarily created & maintained by outsourced party
When automated tests pass, I’m confident the software is releasable
Test failures are likely to indicate a real defect
It’s easy for developers to fix acceptance tests
Developers share a common pool of test servers to reproduce failures
Developers create on demand test environments
Developers use their own dev environments to reproduce failures
Change management
All production changes must be approved by an external body
(e.g. change approval board, manager, etc.) before
deployment or implementation (R)
Only high-risk changes, such as database changes, require
approval
We have no change approval process
We rely on peer review to manage changes
Change management
All production changes must be approved by an external body
(e.g. change approval board, manager, etc.) before
deployment or implementation (R)
Only high-risk changes, such as database changes, require
approval
We have no change approval process
We rely on peer review to manage changes
Now for management stuff
We all know managing WIP is important, right?
Now for management stuff
We all know managing WIP is important, right?
Correlation between WIP and ITPerf is negligible
Now for management stuff
We all know managing WIP is important, right?
Correlation between WIP and ITPerf is negligible
What’s going on?
Lean management SEM
Conclusions
Even if you think it’s obvious, TEST WITH DATA.
(if the results don’t surprise you, you’re doing it wrong)
(if you don’t also confirm some things you expected, you’re doing it
wrong)
We CAN have it all, or at least throughput AND stability.
DevOps culture & practices have a measurable impact on IT & org perf
Stay tuned!
Take the 2016 State of DevOps Survey late March
Read the 2016 State of DevOps Survey results late June
Want to know how your OWN DevOps Capabilities compare?
just pick up your phone and send an email
to: jezhumble@sendyourslides.com
subject: devops

More Related Content

PDF
How DevOps is Transforming IT, and What it Can Do for Academia
PPTX
2016 velocity santa clara state of dev ops report deck final
PDF
The Data on DevOps: Making the Case for Awesome
PDF
Sciencing the Crap Out of DevOps
PDF
2016 State of DevOps
PPTX
Nf final chef-lisa-metrics-2015-ss
PDF
Are We There Yet? Signposts On Your Journey to Awesome
PPTX
DevOps: The Key to IT Performance
How DevOps is Transforming IT, and What it Can Do for Academia
2016 velocity santa clara state of dev ops report deck final
The Data on DevOps: Making the Case for Awesome
Sciencing the Crap Out of DevOps
2016 State of DevOps
Nf final chef-lisa-metrics-2015-ss
Are We There Yet? Signposts On Your Journey to Awesome
DevOps: The Key to IT Performance

What's hot (20)

PDF
DevOps: What's Buried in the Fine Print
PDF
How Metrics Make Your DevOps Awesome
PPTX
Continuous Delivery + DevOps = Awesome
PPTX
How Continuous Delivery and Lean Management Make your DevOps Amazeballs
PDF
DOES 2016 Sciencing the Crap Out of DevOps
PPTX
DevOps: A Value Proposition
PPTX
Secrets and surprises of high performance: What the data says
PDF
If you don't know where you're going it doesn't matter how fast you get there
PPTX
Tools Won't Fix Your Broken DevOps
PDF
What I learned from 5 years of sciencing the crap out of DevOps
PDF
SOLVING MLOPS FROM FIRST PRINCIPLES, DEAN PLEBAN, DagsHub
PPTX
The Data Behind DevOps: What Does it Take to be a High Performer? Jenkins Wor...
PDF
Vmware2021 why even devop nicolefv
PPTX
Software as Craft
PPTX
The Key to High Performance - What the Data Says
DOCX
Successful writing at work copyright 2017 cengage learn
PPTX
Soaring in the Clouds - Don't be dragged down by ITIL bloat!
PDF
DevOps & Security from an Enterprise Toolsmith's Perspective
PPTX
DOES16 San Francisco - David Blank-Edelman - Lessons Learned from a Parallel ...
PDF
DevOps: The Future is Already Here — It’s Just Unevenly Distributed
DevOps: What's Buried in the Fine Print
How Metrics Make Your DevOps Awesome
Continuous Delivery + DevOps = Awesome
How Continuous Delivery and Lean Management Make your DevOps Amazeballs
DOES 2016 Sciencing the Crap Out of DevOps
DevOps: A Value Proposition
Secrets and surprises of high performance: What the data says
If you don't know where you're going it doesn't matter how fast you get there
Tools Won't Fix Your Broken DevOps
What I learned from 5 years of sciencing the crap out of DevOps
SOLVING MLOPS FROM FIRST PRINCIPLES, DEAN PLEBAN, DagsHub
The Data Behind DevOps: What Does it Take to be a High Performer? Jenkins Wor...
Vmware2021 why even devop nicolefv
Software as Craft
The Key to High Performance - What the Data Says
Successful writing at work copyright 2017 cengage learn
Soaring in the Clouds - Don't be dragged down by ITIL bloat!
DevOps & Security from an Enterprise Toolsmith's Perspective
DOES16 San Francisco - David Blank-Edelman - Lessons Learned from a Parallel ...
DevOps: The Future is Already Here — It’s Just Unevenly Distributed
Ad

Viewers also liked (18)

PDF
PPT
Análisis de vídeojuego de Luana Malfitano
PDF
Avant+Garde+2011+Full+Low+Quality
PPTX
Communication visuelle
PDF
Docket 03:27:2013
ODP
Análisis de videojuegos- Malen Soares
PDF
HOME - a film by Yann Arthus-Bertrand (2009)- A Writeup
PPTX
Vsm (value stream mapping))
PPTX
SMALL IS BEAUTIFUL
PPT
Analisis de video juego santa cruz
ODP
Análisis de vídeo juego Agostina Del Risco
PDF
Memphis Flyer - Contemporary Media Trump Cover Complaint
PPT
Analisis de videojuegos Longoni franco
PDF
Apresentação - Relatório de Inflação de 24 de Setembro de 2015
PPTX
Best Practices in Music on YouTube
PDF
Final final
PDF
Communication Visuelle - introduction
PDF
Devops Scorecard
Análisis de vídeojuego de Luana Malfitano
Avant+Garde+2011+Full+Low+Quality
Communication visuelle
Docket 03:27:2013
Análisis de videojuegos- Malen Soares
HOME - a film by Yann Arthus-Bertrand (2009)- A Writeup
Vsm (value stream mapping))
SMALL IS BEAUTIFUL
Analisis de video juego santa cruz
Análisis de vídeo juego Agostina Del Risco
Memphis Flyer - Contemporary Media Trump Cover Complaint
Analisis de videojuegos Longoni franco
Apresentação - Relatório de Inflação de 24 de Setembro de 2015
Best Practices in Music on YouTube
Final final
Communication Visuelle - introduction
Devops Scorecard
Ad

Similar to What we learned from three years sciencing the crap out of devops (20)

PDF
What We Learned from Three Years of Sciencing the Crap Out of DevOps
PDF
DOES16 San Francisco - Nicole Forsgren & Jez Humble - The Latest: What We Lea...
PDF
What We Learned from Four Years of Sciencing the Crap Out of DevOps - Nicole ...
PDF
The Secrets of High Performance: Science Edition - Nicole Forsgren - Codemoti...
PDF
2013 Velocity DevOps Metrics -- It's Not Just For WebOps Any More!
PDF
An Approach of Improve Efficiencies through DevOps Adoption
PDF
The caseforawesome
PDF
The Latest in DevOps: Elite Performance, Productivity, and Scaling - Google
PDF
Building and Scaling High Performing Technology Organizations by Jez Humble a...
PPTX
The Forgotten Secret to DevOps Success: Measurement
PDF
All daydevops 2016 - Turning Human Capital into High Performance Organizati...
PDF
ITIL Guide for DevOps
PDF
The Data Behind DevOps: Becoming a High Performer
PPTX
Keynote - DevOps and the bottom line.
PPTX
Determining Organizational Fit for Transformation Success
PDF
Continuous Delivery: Making DevOps Awesome
PPTX
DevOps: an efficient operating model
PDF
What I Learned from Four Years of Science-ing the Crap Out of DevOps
PPTX
2014 State Of DevOps Findings! Velocity Conference
PPTX
Measuring Performance: See the Science of DevOps Measurement in Action
What We Learned from Three Years of Sciencing the Crap Out of DevOps
DOES16 San Francisco - Nicole Forsgren & Jez Humble - The Latest: What We Lea...
What We Learned from Four Years of Sciencing the Crap Out of DevOps - Nicole ...
The Secrets of High Performance: Science Edition - Nicole Forsgren - Codemoti...
2013 Velocity DevOps Metrics -- It's Not Just For WebOps Any More!
An Approach of Improve Efficiencies through DevOps Adoption
The caseforawesome
The Latest in DevOps: Elite Performance, Productivity, and Scaling - Google
Building and Scaling High Performing Technology Organizations by Jez Humble a...
The Forgotten Secret to DevOps Success: Measurement
All daydevops 2016 - Turning Human Capital into High Performance Organizati...
ITIL Guide for DevOps
The Data Behind DevOps: Becoming a High Performer
Keynote - DevOps and the bottom line.
Determining Organizational Fit for Transformation Success
Continuous Delivery: Making DevOps Awesome
DevOps: an efficient operating model
What I Learned from Four Years of Science-ing the Crap Out of DevOps
2014 State Of DevOps Findings! Velocity Conference
Measuring Performance: See the Science of DevOps Measurement in Action

More from Nicole Forsgren (7)

PDF
DevEx Essentials
PDF
2016 metrics-as-culture
PPTX
DevOps: Next (ChefConf 2015 keynote)
PPTX
Winning the Budget Game: How to Get the Money You Need for IT Every Time
PPTX
Using Learning Analytics to Illuminate Student Learning Pathways in an Online...
PPTX
How to Speak "Manager"
PPTX
DevOps and the Bottom Line
DevEx Essentials
2016 metrics-as-culture
DevOps: Next (ChefConf 2015 keynote)
Winning the Budget Game: How to Get the Money You Need for IT Every Time
Using Learning Analytics to Illuminate Student Learning Pathways in an Online...
How to Speak "Manager"
DevOps and the Bottom Line

Recently uploaded (20)

PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
Tartificialntelligence_presentation.pptx
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Getting Started with Data Integration: FME Form 101
PDF
Accuracy of neural networks in brain wave diagnosis of schizophrenia
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
1. Introduction to Computer Programming.pptx
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Encapsulation theory and applications.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Empathic Computing: Creating Shared Understanding
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
Network Security Unit 5.pdf for BCA BBA.
Tartificialntelligence_presentation.pptx
gpt5_lecture_notes_comprehensive_20250812015547.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Getting Started with Data Integration: FME Form 101
Accuracy of neural networks in brain wave diagnosis of schizophrenia
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
NewMind AI Weekly Chronicles - August'25-Week II
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Unlocking AI with Model Context Protocol (MCP)
1. Introduction to Computer Programming.pptx
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Spectral efficient network and resource selection model in 5G networks
Encapsulation theory and applications.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
A comparative analysis of optical character recognition models for extracting...
Empathic Computing: Creating Shared Understanding
Group 1 Presentation -Planning and Decision Making .pptx

What we learned from three years sciencing the crap out of devops

  • 1. What We Learned from Three Years Sciencing the Crap Out of Devops Jez Humble @jezhumble Nicole Forsgren, PhD @nicolefv © 2016 DevOps Research and Assessments LLC. CC-BY-SA
  • 2. Outline How to make your data suck less ● Writing good survey questions ● Making sure the survey questions are good - with SCIENCE What we found… that we did (AND didn’t) expect Things about Continuous Delivery Things about Management
  • 3. Not all data is created equal Who here thinks surveys are sh*t? [Nicole should probably turn around]
  • 4. Not all data is created equal Who here thinks surveys are sh*t? [Nicole should probably turn around] Who here LOVES the data from their log files? And who has seen sh*t data in a log file?
  • 5. What is a Latent Construct?
  • 6. We use PSYCHOMETRICS to make our survey data good* *or give us a reasonable assurance that it’s telling us what we think it’s telling us (& some of this can also apply to your log data)
  • 7. Psychometrics includes: Construct creation (manual) ● When possible: use previously validated constructs ● Based on definitions and theory, carefully and precisely worded, card sorting task, pilot tested Construct evaluation (statistics) ● Establishing Validity: discriminant and convergent ● Establishing Reliability
  • 8. Psychometrics Writing Example: Culture ● Does it matter to our study? ○ More than just intuition? ● What KIND of culture? ○ National identity and norms ○ Adaptive culture ○ Value learning (2014 study) ○ Value information flow and trust (2014 and 2015 studies -- Westrum culture)
  • 9. Psychometrics Writing Example: Culture ● Does it matter to our study? ○ More than just intuition? ● What KIND of culture? ○ National identity and norms ○ Adaptive culture ○ Value learning (2014 study) ○ Value information flow and trust (2014 and 2015 studies -- Westrum culture)
  • 10. Westrum typology Pathological Power-oriented Bureaucratic Rule-oriented Generative Performance-oriented Low cooperation Modest cooperation High cooperation Messengers shot Messengers neglected Messengers trained Responsibilities shirked Narrow responsibilities Risks are shared Bridging discouraged Bridging tolerated Bridging encouraged Failure leads to scapegoating Failure leads to justice Failure leads to inquiry Novelty crushed Novelty leads to problems Novelty implemented Try writing items yourself! Use strong statements with clear language.
  • 11. Westrum Culture Items ● On my team, information is actively sought. ● On my team, failures are learning opportunities, and messengers of them are not punished. ● On my team, responsibilities are shared. ● On my team, cross-functional collaboration is encouraged and rewarded. ● On my team, failure causes inquiry. ● On my team, new ideas are welcomed. Found to be valid & reliable Predictive of IT Performance & Organizational Performance
  • 12. Psychometrics Analysis Example: Notification of Failure At my organization… ● We are primarily notified of failures by reports from customers. ● We are primarily notified of failures by the NOC. ● We get failure alerts from logging and monitoring systems. ● We monitor system health based on threshold warnings (ex. CPU exceeds 100%). ● We monitor system health based on rate-of-change warnings (ex. CPU usage has increased by 25% over the last 10 minutes). Original in 2014, but there was a surprise. Can you spot it?
  • 13. Psychometrics Analysis Example: Notification of Failure At my organization… ● We are primarily notified of failures by reports from customers. ● We are primarily notified of failures by the NOC. ● We get failure alerts from logging and monitoring systems. ● We monitor system health based on threshold warnings (ex. CPU exceeds 100%). ● We monitor system health based on rate-of-change warnings (ex. CPU usage has increased by 25% over the last 10 minutes). Notification from INSIDE Notification from OUTSIDE
  • 14. More data tests! Plus, we test to make sure the survey doesn’t have other problems. ● Common method variance (CMV) (aka CMB for Bias) ● Early vs. late responders ● Survey drop-off rates and bias
  • 15. Okay NOW we can look at the data
  • 16. IT performance matters! “Firms with high-performing IT organizations were twice as likely to exceed their profitability, market share and productivity goals.” http://bit.ly/2014-devops-report/ http://bit.ly/2015-devops-report/
  • 17. IT performance lead time for changes release frequency time to restore service change fail rate
  • 18. Continuous delivery Okay NOW we can look at the data.
  • 19. And now with added math Forsgren, N., J. Humble (2016). "The Role of Continuous Delivery in IT and Organizational Performance." In the Proceedings of the Western Decision Sciences Institute (WDSI) 2016, Las Vegas, NV. Available at SSRN: http://ssrn. com/abstract=2681909
  • 21. What’s not strongly correlated w/ ITPerf? Third-party scripts Homegrown scripts Commercial configuration management tools Open source Golden images Manual configuration management
  • 22. What’s not strongly correlated w/ ITPerf? Third-party scripts Homegrown scripts Commercial configuration management tools Open source Golden images Manual configuration management
  • 23. Which of these measure effective test practices? Developers primarily create & maintain acceptance tests QA primarily create & maintain acceptance tests Primarily created & maintained by outsourced party When automated tests pass, I’m confident the software is releasable Test failures are likely to indicate a real defect It’s easy for developers to fix acceptance tests Developers share a common pool of test servers to reproduce failures Developers create on demand test environments Developers use their own dev environments to reproduce failures
  • 24. Which of these measure effective test practices? Developers primarily create & maintain acceptance tests QA primarily create & maintain acceptance tests Primarily created & maintained by outsourced party When automated tests pass, I’m confident the software is releasable Test failures are likely to indicate a real defect It’s easy for developers to fix acceptance tests Developers share a common pool of test servers to reproduce failures Developers create on demand test environments Developers use their own dev environments to reproduce failures
  • 25. Change management All production changes must be approved by an external body (e.g. change approval board, manager, etc.) before deployment or implementation (R) Only high-risk changes, such as database changes, require approval We have no change approval process We rely on peer review to manage changes
  • 26. Change management All production changes must be approved by an external body (e.g. change approval board, manager, etc.) before deployment or implementation (R) Only high-risk changes, such as database changes, require approval We have no change approval process We rely on peer review to manage changes
  • 27. Now for management stuff We all know managing WIP is important, right?
  • 28. Now for management stuff We all know managing WIP is important, right? Correlation between WIP and ITPerf is negligible
  • 29. Now for management stuff We all know managing WIP is important, right? Correlation between WIP and ITPerf is negligible What’s going on?
  • 31. Conclusions Even if you think it’s obvious, TEST WITH DATA. (if the results don’t surprise you, you’re doing it wrong) (if you don’t also confirm some things you expected, you’re doing it wrong) We CAN have it all, or at least throughput AND stability. DevOps culture & practices have a measurable impact on IT & org perf
  • 32. Stay tuned! Take the 2016 State of DevOps Survey late March Read the 2016 State of DevOps Survey results late June Want to know how your OWN DevOps Capabilities compare? just pick up your phone and send an email to: jezhumble@sendyourslides.com subject: devops