Operational Insight
June 15, 2015
Roy Rapoport
@royrapoport / linkedin.com/in/royrapoport / rrapoport@netflix.com
Oh, The Places
We’ll Go!
Operational Insight: Concepts and Examples (w/o Presenter Notes)
John Boyd
Observe
Orient
Decide
Act
OODA
“This approach favors agility over raw power in dealing with human
opponents in any endeavor” - Wikipedia
This Is What We
Do
OODA KPI
Speed, Effort, Reliability
Winning
Speed
Effort
Reliability
Implications …
for Observation (aka measurement, telemetry, metrics)
• Make It Easy
• Make It Scalable
• Make it pluggable
• (Eventually) Ruthlessly Cull
“What decision will this help me make?”
A Joke
[Chart: % of servers in major region with an even IP address (values shown: 52, 48)]
Implications …
for Orientation (aka graphing, visualization)
• First-class product
• Different decisions require different viz
• Low cognitive load better than
  • High refresh rates
  • Deep data density
Better Like This …
Or Better Like That …
Implications …
for Decisions (aka alerting, real-time analytics, etc)
• You already have (some of) this
• Incremental improvement
• Sky’s the limit
  • For benefits
  • For cost
Implications …
for Action
1. Humans beat bureaucracy
2. Machines beat humans
3. Repeatability beats one-offs
4. Start with humans
5. If IFTTT, deprecate humans
Repeatable machine processes TROUNCE one-off human bureaucracy
Decision:
Do I Have Enough
Instances?
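One way to frame the "Do I have enough instances?" decision is as a simple headroom calculation. The sketch below is an illustration only and is not the method shown in the talk; the function names `instances_needed` and `have_enough`, the per-instance throughput input, and the 25% default headroom are all assumptions introduced here.

```python
import math


def instances_needed(peak_rps: float, rps_per_instance: float,
                     headroom: float = 0.25) -> int:
    """Return how many instances are needed to serve peak_rps with spare capacity.

    Hypothetical inputs: rps_per_instance would come from load testing, and
    headroom is the fraction of extra capacity to keep in reserve.
    """
    if rps_per_instance <= 0:
        raise ValueError("rps_per_instance must be positive")
    if peak_rps < 0 or headroom < 0:
        raise ValueError("peak_rps and headroom must be non-negative")
    # Inflate the expected peak by the headroom, then round up to whole instances.
    return math.ceil(peak_rps * (1 + headroom) / rps_per_instance)


def have_enough(current_instances: int, peak_rps: float,
                rps_per_instance: float, headroom: float = 0.25) -> bool:
    """Answer the decision directly: is the current fleet large enough?"""
    return current_instances >= instances_needed(peak_rps, rps_per_instance, headroom)
```

Called as `have_enough(12, 1000, 100)`, this returns a yes/no answer rather than a graph to interpret, which is the point of tying the measurement to a decision.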
Decision:
Is My Canary Good?
Been there. Done that. Manually. Artisanally.
• Started in the Data Center
• Manual, dashboard-driven
[Dashboard graphs: CPU, Requests, Errors]
Been there. Done that. Manually.
• Context vs Precision
• No …
  • Repeatability
  • Trending
• Manual effort is manual
So Now What?
• Automate Analysis
• Took Some Effort
• Approach and analytics
• Presentation matters
Pretty Pictures
[Diagram sequence: Version Control System, Build & Deployment System, Automated Canary Analysis, and Customers, with server groups at successive stages: 1000 servers @ 1.0.1; 1 server @ 1.0.2; 10 servers @ 1.0.2; 1000 servers @ 1.0.2]
Just The Stats
4-Week View
6309 canary analysis cycles
16% canaries failed
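To make "automate analysis" concrete, here is a minimal sketch of a canary check that compares a few metrics (CPU, requests, errors, as on the manual dashboard) between baseline and canary servers. It is not the algorithm behind the numbers above, which the slides do not describe; the relative-difference test, the 20% tolerance, the 0.95 pass score, and the names `canary_score` and `canary_is_good` are all assumptions.

```python
from statistics import mean
from typing import Dict, List, Tuple

Samples = Dict[str, List[float]]  # metric name -> observed values


def canary_score(baseline: Samples, canary: Samples,
                 tolerance: float = 0.2) -> Tuple[float, Dict[str, bool]]:
    """Return (score, per-metric verdicts).

    A metric passes when the canary mean is within `tolerance` (relative)
    of the baseline mean. The score is the fraction of metrics that pass.
    """
    if not baseline:
        raise ValueError("baseline must contain at least one metric")
    verdicts: Dict[str, bool] = {}
    for name, base_values in baseline.items():
        base_mean = mean(base_values)
        canary_mean = mean(canary[name])  # KeyError if the canary lacks a metric
        if base_mean == 0:
            # Avoid dividing by zero: only an equally-zero canary passes.
            verdicts[name] = canary_mean == 0
        else:
            relative_diff = abs(canary_mean - base_mean) / abs(base_mean)
            verdicts[name] = relative_diff <= tolerance
    score = sum(verdicts.values()) / len(verdicts)
    return score, verdicts


def canary_is_good(baseline: Samples, canary: Samples,
                   tolerance: float = 0.2, pass_score: float = 0.95) -> bool:
    """Turn the score into the decision: promote the canary or not."""
    score, _ = canary_score(baseline, canary, tolerance)
    return score >= pass_score
```

Returning the per-metric verdicts alongside the score keeps some context for the human who has to look at a failed canary, in the spirit of "presentation matters".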
Decision:
Do I Have an Outlier?
Outlier Detection
Would You Like to Play a
Game?
Spot the Outlier
The Outlier Is “A”
Just The Stats
4-Week View
739 Server Terminations
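The slides do not say which outlier-detection technique produced these terminations. As a stand-in, the sketch below flags servers whose metric sits far from the fleet median, using the median absolute deviation (MAD). The function name `find_outliers`, the 3.5 threshold, and the sample values are assumptions chosen for illustration.

```python
from statistics import median
from typing import Dict, List


def find_outliers(values: Dict[str, float], threshold: float = 3.5) -> List[str]:
    """Return names of servers whose value is far from the fleet median.

    Uses the modified z-score 0.6745 * |x - median| / MAD, which is less
    distorted by the outliers themselves than a mean/stddev test would be.
    """
    if not values:
        return []
    center = median(values.values())
    deviations = {name: abs(v - center) for name, v in values.items()}
    mad = median(deviations.values())
    if mad == 0:
        # At least half the fleet is identical; anything that differs stands out.
        return sorted(name for name, d in deviations.items() if d > 0)
    return sorted(name for name, d in deviations.items()
                  if 0.6745 * d / mad > threshold)


# Hypothetical per-server latencies (ms); server "A" is the odd one out.
fleet = {"A": 950.0, "B": 101.0, "C": 99.0, "D": 100.0, "E": 102.0, "F": 98.0}
suspects = find_outliers(fleet)
```

In an automated loop, the returned list would feed the "Act" step (for example, terminating the flagged servers) instead of waiting for a human to spot the odd line on a graph.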
In a Nutshell
Observe: Need This First
http://bit.ly/nflx-atlas-2013
http://metrics20.org
Orient: Understand the decision
http://bit.ly/nflx-qcon-aca-2014
Decide: Make it easier for humans
Act: Make machines do it
Higher speed
Lower effort
Higher reliability
Questions, Attributions, Feedback
@royrapoport
rsr@netflix.com
linkedin.com/in/royrapoport
?
