Customer-centric Metrics
Mark McBride
October 4th, 2017
Mark
• Twi)er - pla/orm/API team
during the great de-monolithing
• Nest - led service engineering,
launched their developer program
• Turbine Labs - turning pracCces
developed at Twi)er/Nest/etc.
into products you can buy
• @mccv on the Twi)er
Engineering RealiCes
Care vs. Anxiety
• Caring about whether your system is working is the heavy burden of
being a grown up soLware professional
• Caring about things you can’t quanCfy leads to anxiety
• Caring about too many things leads to anxiety
• Caring about a tractable number of quanCfiable things frees you from
anxiety
A Twi)er Metrics History
• 2009 - Ganglia
• Open source
• Limited scalability
• URL based UI
• Limited exploraCon
• Chart unCl the
system falls over
A Twi)er Metrics History
• Early 2010 - Reunion
• Custom built for
diagnosis
• ExploraCon oriented
• Limited scalability
• Only look at what
you need to resolve
the incident at hand
A Twi)er Metrics History
• 2010 - Cuckoo/Viz
• Custom built for
scalability
• Dashboard oriented
• Highly scalable
• Charts for everything
A Twi)er Metrics History
• The cardinality of services exploded
• Blindness was replaced with
cogniCve overload
Switch Focus to Customers,
Not System Internals
Simplified View of Service Health
• What do customers care about?
• Success Rate - I want it to work
• Latency - I want it to be fast
• RPS - I want to be able to use this much of the service
• What don’t customers care about?
• CPU burn, disk uClizaCon, leader elecCons, etc. etc. etc.
Success Rate
• Stuff obviously needs to work
• Necessary but not sufficient
• “Nines don’t ma)er if customers
aren’t happy” - @mipsyCpsy
Request Rate
• A contract about how much service
you’re willing to provide
• This is oLen implicit
• It’s sCll important
Latency
• Don’t (just) measure medians
• Measure the experience of your
worst N% of customers, then make
them happy
Puhng it All Together
• A simple view of customer happiness
• This is not a diagnosCc view
• This lets you know whether you need
to start diagnosing
• More importantly, it quickly lets you
know that you don’t need to worry
Not All AcCons are Equal
SegmentaCon
• Consider an online shop
• Customers want to look at stuff
• Customers want to buy stuff
• Aggregates Hide InformaCon
• Looks >>> buys
SegmentaCon
• Break down health by acCon
• This gives you a clearer view of all the
ways customers could be unhappy
• Scan quickly to ensure your service is
ok, get back to building features
But I Don’t Serve
Customer Traffic
Teammates Become Customers
• Teammates
• Have a relaCvely deep understanding of your system
• Are willing to be flexible with system changes
• Customers
• Don’t understand the internal details of your system
• Unwilling to adapt to incidents with your system
Teammates Become Customers
• As teams grow, specializaCon necessarily becomes the norm
• TreaCng customers like teammates is going to hurt
A Twi)er History
• 2011 - Viz Standards
• Pick good metrics
• Show them for your
service
• Measure them for all
your dependent
services
Embrace Customerhood
• Manage your internal systems like they serve customers
• Strong abstracCon boundaries (APIs)
• Measure customer experience, not system internals
Embrace Customerhood
• This means you get to manage your internal dependencies like you’re a
customer!
• You don’t have to worry about what’s behind the API
• But you can easily observe the behavior of the API you’re calling
• Common metrics create a common language for discussing system
behavior
• Common language reduces anxiety team wide
Observability the Easy Way
Network AbstracCon Boundaries
• Now that everything is behind APIs,
everybody is talking over the network
• Proxies make this a programmable
abstracCon boundary
• Proxies give you a common point of
observaCon and control
• CapabiliCes span languages and runCmes
• CapabiliCes can be applied globally
The Network
• Stop thinking of the network as (only) a tax
• AbstracCon boundaries are good things
• Programmable abstracCon boundaries are be)er
• Behind that boundary there is freedom to move
• Across those boundaries you can observe and control service
interacCon
Care Without Anxiety
• Watch the metrics that make happy customers
• Watch metrics for all customer acCons
• Develop a common language for discussing system health
• Treat teammates like customers
• Use your network as a common point of observability and control
Todos
@mccv
mark@turbinelabs.io
h)ps://www.turbinelabs.io

More Related Content

PDF
Traffic Control with Envoy Proxy
PDF
More Than Just URL Mappers - Proxies for Observation and Control
PDF
Solving trust issues at scale - Omer Levi Hevroni
PDF
Embrace DevOps: Delivery Value with Puppet Automation
PPTX
Performance: Key Elements to Consider in the Cloud - RightScale Compute 2013
PPTX
Strong Consistency in Databases. What does it actually guarantee? - Andy Goo...
PDF
THE PLEASURES OF ON-PREM, TOMER GABEL
PDF
Altitude San Francisco 2018: HTTP/2 Tales: Discovery and Woe
Traffic Control with Envoy Proxy
More Than Just URL Mappers - Proxies for Observation and Control
Solving trust issues at scale - Omer Levi Hevroni
Embrace DevOps: Delivery Value with Puppet Automation
Performance: Key Elements to Consider in the Cloud - RightScale Compute 2013
Strong Consistency in Databases. What does it actually guarantee? - Andy Goo...
THE PLEASURES OF ON-PREM, TOMER GABEL
Altitude San Francisco 2018: HTTP/2 Tales: Discovery and Woe

What's hot (19)

PDF
Sajan-SiteSyncReverseProxySolution
PPTX
Code Yellow: Helping Operations Top-Heavy Teams the Smart Way
PDF
Lean Kanban India 2016 | Stop fudging the numbers and Start Forecasting with ...
PPTX
Serverless Architecture
PDF
Service Levels and Error Budgets - Paweł Kucharski
PPTX
Why HTTP Won't Work For The Internet of Things (Dreamforce 2014)
PDF
Speed up your Serverless development flow
PPTX
Solving trust issues at scale
PPTX
Nov 2015 Webinar: Introduction to FileCatalyst v3.6
PPTX
Building a [micro]services platform on AWS
KEY
The guardian and app engine
PPTX
Building Real Time Web Applications with SignalR (NoVA Code Camp 2015)
PPTX
Aug2015 webinar-file catalyst v3.5
PDF
Creating Hyper Performant Web Apps with React
PPTX
Solving trust issues at scale - AppSec California
PDF
Object-oriented design for infrastructure-as-a-code - Anna Bankirer
PPTX
Monitoring microservices
PDF
Shifting is more than shifting left
PPTX
How to measure the business impact of web performance
Sajan-SiteSyncReverseProxySolution
Code Yellow: Helping Operations Top-Heavy Teams the Smart Way
Lean Kanban India 2016 | Stop fudging the numbers and Start Forecasting with ...
Serverless Architecture
Service Levels and Error Budgets - Paweł Kucharski
Why HTTP Won't Work For The Internet of Things (Dreamforce 2014)
Speed up your Serverless development flow
Solving trust issues at scale
Nov 2015 Webinar: Introduction to FileCatalyst v3.6
Building a [micro]services platform on AWS
The guardian and app engine
Building Real Time Web Applications with SignalR (NoVA Code Camp 2015)
Aug2015 webinar-file catalyst v3.5
Creating Hyper Performant Web Apps with React
Solving trust issues at scale - AppSec California
Object-oriented design for infrastructure-as-a-code - Anna Bankirer
Monitoring microservices
Shifting is more than shifting left
How to measure the business impact of web performance
Ad

Similar to Customer-centric Metrics (20)

PDF
QCon 2015 - Microservices Track Notes
PPTX
The 7 deadly sins of micro services
PPTX
Top 5 Java Performance Metrics, Tips & Tricks
PPTX
Service Architectures At Scale - QCon London 2015
PPTX
YTD Video Downloader Pro Crack [Latest 2025]
PPTX
Distributed systems: design, principles and experiencies
PDF
Jeremy Edberg (MinOps ) - How to build a solid infrastructure for a startup t...
PDF
Integration strategies best practices- Mulesoft meetup April 2018
PDF
Microservices - Scaling Development and Service
PDF
Retiring Service Interfaces: A Retrospective on Two 10+ Year Old Services
PPTX
Going Global with Itoc and AWS
PPTX
Service Architectures at Scale
PDF
cloud session uklug
PPTX
Standard Bank: How APM Supports DevOps, Agile and Engineering Transformation ...
PDF
Phil Green - We're migrating to the cloud - Who needs service management
PDF
Chris Munns, DevOps @ Amazon: Microservices, 2 Pizza Teams, & 50 Million Depl...
PPTX
ServiceNow + Precisely: Getting Business Value and Visibility from Mainframe ...
PPTX
Effective Microservices In a Data-centric World
PPTX
Driving Business Agility with AWS Serverless -Atlanta
PDF
From no services to Microservices
QCon 2015 - Microservices Track Notes
The 7 deadly sins of micro services
Top 5 Java Performance Metrics, Tips & Tricks
Service Architectures At Scale - QCon London 2015
YTD Video Downloader Pro Crack [Latest 2025]
Distributed systems: design, principles and experiencies
Jeremy Edberg (MinOps ) - How to build a solid infrastructure for a startup t...
Integration strategies best practices- Mulesoft meetup April 2018
Microservices - Scaling Development and Service
Retiring Service Interfaces: A Retrospective on Two 10+ Year Old Services
Going Global with Itoc and AWS
Service Architectures at Scale
cloud session uklug
Standard Bank: How APM Supports DevOps, Agile and Engineering Transformation ...
Phil Green - We're migrating to the cloud - Who needs service management
Chris Munns, DevOps @ Amazon: Microservices, 2 Pizza Teams, & 50 Million Depl...
ServiceNow + Precisely: Getting Business Value and Visibility from Mainframe ...
Effective Microservices In a Data-centric World
Driving Business Agility with AWS Serverless -Atlanta
From no services to Microservices
Ad

Recently uploaded (20)

PDF
Product Update: Alluxio AI 3.7 Now with Sub-Millisecond Latency
PDF
DNT Brochure 2025 – ISV Solutions @ D365
PPTX
Patient Appointment Booking in Odoo with online payment
PPTX
Why Generative AI is the Future of Content, Code & Creativity?
PPTX
Computer Software and OS of computer science of grade 11.pptx
PDF
The Dynamic Duo Transforming Financial Accounting Systems Through Modern Expe...
PDF
CCleaner 6.39.11548 Crack 2025 License Key
PPTX
Computer Software - Technology and Livelihood Education
PDF
iTop VPN Crack Latest Version Full Key 2025
PPTX
CNN LeNet5 Architecture: Neural Networks
PPTX
Tech Workshop Escape Room Tech Workshop
PDF
Autodesk AutoCAD Crack Free Download 2025
PDF
EaseUS PDF Editor Pro 6.2.0.2 Crack with License Key 2025
PDF
MCP Security Tutorial - Beginner to Advanced
PPTX
Monitoring Stack: Grafana, Loki & Promtail
PDF
Multiverse AI Review 2025: Access All TOP AI Model-Versions!
PDF
Cost to Outsource Software Development in 2025
PDF
Wondershare Recoverit Full Crack New Version (Latest 2025)
PPTX
Oracle Fusion HCM Cloud Demo for Beginners
PPTX
"Secure File Sharing Solutions on AWS".pptx
Product Update: Alluxio AI 3.7 Now with Sub-Millisecond Latency
DNT Brochure 2025 – ISV Solutions @ D365
Patient Appointment Booking in Odoo with online payment
Why Generative AI is the Future of Content, Code & Creativity?
Computer Software and OS of computer science of grade 11.pptx
The Dynamic Duo Transforming Financial Accounting Systems Through Modern Expe...
CCleaner 6.39.11548 Crack 2025 License Key
Computer Software - Technology and Livelihood Education
iTop VPN Crack Latest Version Full Key 2025
CNN LeNet5 Architecture: Neural Networks
Tech Workshop Escape Room Tech Workshop
Autodesk AutoCAD Crack Free Download 2025
EaseUS PDF Editor Pro 6.2.0.2 Crack with License Key 2025
MCP Security Tutorial - Beginner to Advanced
Monitoring Stack: Grafana, Loki & Promtail
Multiverse AI Review 2025: Access All TOP AI Model-Versions!
Cost to Outsource Software Development in 2025
Wondershare Recoverit Full Crack New Version (Latest 2025)
Oracle Fusion HCM Cloud Demo for Beginners
"Secure File Sharing Solutions on AWS".pptx

Customer-centric Metrics

  • 2. Mark • Twi)er - pla/orm/API team during the great de-monolithing • Nest - led service engineering, launched their developer program • Turbine Labs - turning pracCces developed at Twi)er/Nest/etc. into products you can buy • @mccv on the Twi)er
  • 4. Care vs. Anxiety • Caring about whether your system is working is the heavy burden of being a grown up soLware professional • Caring about things you can’t quanCfy leads to anxiety • Caring about too many things leads to anxiety • Caring about a tractable number of quanCfiable things frees you from anxiety
  • 5. A Twi)er Metrics History • 2009 - Ganglia • Open source • Limited scalability • URL based UI • Limited exploraCon • Chart unCl the system falls over
  • 6. A Twi)er Metrics History • Early 2010 - Reunion • Custom built for diagnosis • ExploraCon oriented • Limited scalability • Only look at what you need to resolve the incident at hand
  • 7. A Twi)er Metrics History • 2010 - Cuckoo/Viz • Custom built for scalability • Dashboard oriented • Highly scalable • Charts for everything
  • 8. A Twi)er Metrics History • The cardinality of services exploded • Blindness was replaced with cogniCve overload
  • 9. Switch Focus to Customers, Not System Internals
  • 10. Simplified View of Service Health • What do customers care about? • Success Rate - I want it to work • Latency - I want it to be fast • RPS - I want to be able to use this much of the service • What don’t customers care about? • CPU burn, disk uClizaCon, leader elecCons, etc. etc. etc.
  • 11. Success Rate • Stuff obviously needs to work • Necessary but not sufficient • “Nines don’t ma)er if customers aren’t happy” - @mipsyCpsy
  • 12. Request Rate • A contract about how much service you’re willing to provide • This is oLen implicit • It’s sCll important
  • 13. Latency • Don’t (just) measure medians • Measure the experience of your worst N% of customers, then make them happy
  • 14. Puhng it All Together • A simple view of customer happiness • This is not a diagnosCc view • This lets you know whether you need to start diagnosing • More importantly, it quickly lets you know that you don’t need to worry
  • 15. Not All AcCons are Equal
  • 16. SegmentaCon • Consider an online shop • Customers want to look at stuff • Customers want to buy stuff • Aggregates Hide InformaCon • Looks >>> buys
  • 17. SegmentaCon • Break down health by acCon • This gives you a clearer view of all the ways customers could be unhappy • Scan quickly to ensure your service is ok, get back to building features
  • 18. But I Don’t Serve Customer Traffic
  • 19. Teammates Become Customers • Teammates • Have a relaCvely deep understanding of your system • Are willing to be flexible with system changes • Customers • Don’t understand the internal details of your system • Unwilling to adapt to incidents with your system
  • 20. Teammates Become Customers • As teams grow, specializaCon necessarily becomes the norm • TreaCng customers like teammates is going to hurt
  • 21. A Twi)er History • 2011 - Viz Standards • Pick good metrics • Show them for your service • Measure them for all your dependent services
  • 22. Embrace Customerhood • Manage your internal systems like they serve customers • Strong abstracCon boundaries (APIs) • Measure customer experience, not system internals
  • 23. Embrace Customerhood • This means you get to manage your internal dependencies like you’re a customer! • You don’t have to worry about what’s behind the API • But you can easily observe the behavior of the API you’re calling • Common metrics create a common language for discussing system behavior • Common language reduces anxiety team wide
  • 25. Network AbstracCon Boundaries • Now that everything is behind APIs, everybody is talking over the network • Proxies make this a programmable abstracCon boundary • Proxies give you a common point of observaCon and control • CapabiliCes span languages and runCmes • CapabiliCes can be applied globally
  • 26. The Network • Stop thinking of the network as (only) a tax • AbstracCon boundaries are good things • Programmable abstracCon boundaries are be)er • Behind that boundary there is freedom to move • Across those boundaries you can observe and control service interacCon
  • 28. • Watch the metrics that make happy customers • Watch metrics for all customer acCons • Develop a common language for discussing system health • Treat teammates like customers • Use your network as a common point of observability and control Todos