chronosphere.io
chronosphere.io
Choose Your Own Adventure
Eric D. Schabell
Director Evangelism
@ericschabell{@fosstodon.org}
Cloud Native Observability Pitfalls
chronosphere.io
Cloud Native Observability
chronosphere.io
Cloud Native
chronosphere.io
Data volume
Experiment:
- Hello World app on 4 node
Kubernetes cluster with
Tracing, End User Metrics
(EUM), Logs, Metrics
(containers / nodes)
- 30 days == +450 GB
chronosphere.io
chronosphere.io
Cloud Native at Scale
chronosphere.io
Observability…
chronosphere.io
Cloud Native Observability
at Scale
chronosphere.io
O11y at Scale (need)
chronosphere.io
Picking Your Pitfalls
1. Ignoring existing landscape
2. Focusing on The Pillars
3. Sneaky sprawling mess
4. Controlling costs
5. The protocol jungle
6. Underestimating cardinality
(Click on a pitfall to jump to that section)
chronosphere.io
1. Ignoring existing
landscape
chronosphere.io
If they can’t
see me…
they can’t
hurt me...
chronosphere.io
chronosphere.io
Prometheus for metrics, alerting, queries
chronosphere.io
Prometheus auto discovery
chronosphere.io
Manual instrumentation (java client lib)
chronosphere.io
Short link: bit.ly/prom-
workshop
chronosphere.io
Applications (Java)
OTel Auto Instrumentation (libraries)
OTel API
OTel SDK
OTel Collector
OTLP
OTLP
OTLP
OpenTelemetry (Auto) instrumentation
chronosphere.io
Host
Observability Backend
(Prometheus, Jaeger, Fluent Bit, etc.),
Applications
OTel Auto Instrumentation
OTel API
OTel SDK
OTel Collector Agent
OTLP
OTLP
OTLP
OTLP
OTLP
OpenTelemetry Collector (agent)
chronosphere.io
Host
Host
Host
Observability Backend
(Prometheus, Jaeger, Fluent Bit, etc.),
Applications
OTel Auto Instrumentation
OTel API
OTel SDK
OTel Collector Agent
OTLP
OTLP
OTLP
OTLP
Collector (gateway)
OTel Collector Gateway
chronosphere.io
Short link: bit.ly/opentelemetry-
workshop
chronosphere.io
1. Ignoring existing landscape
2. Focusing on The Pillars
3. Sneaky sprawling mess
4. Controlling costs
5. The protocol jungle
6. Underestimating cardinality
(Click on a pitfall to jump to that section, or jump to end)
Picking Your Next Pitfall
chronosphere.io
2. Focusing on The Pillars
chronosphere.io
Pillars Phases
chronosphere.io
Developer
Technology
Bottom up
chronosphere.io
Pillar problems…
chronosphere.io
Car is on fire…
chronosphere.io
Better outcomes…
Faster remediation…
Easier detection…
Happier customers…
chronosphere.io
Phase 1
Know something is
happening as fast
as possible…
chronosphere.io
Phase 2
Triage with specific
information…
chronosphere.io
Phase 3
Understand to
ensure never
happens again…
chronosphere.io
chronosphere.io
chronosphere.io
1. Ignoring existing landscape
2. Focusing on The Pillars
3. Sneaky sprawling mess
4. Controlling costs
5. The protocol jungle
6. Underestimating cardinality
(Click on a pitfall to jump to that section, or jump to end)
Picking Your Next Pitfall
chronosphere.io
3. Sneaky sprawling mess
chronosphere.io
Over 66% of organizations
use more than 10 different
observability tools
– ESG report over exploding data volumes
chronosphere.io
chronosphere.io
Know
Triage
Understand
chronosphere.io
chronosphere.io
1. Ignoring existing landscape
2. Focusing on The Pillars
3. Sneaky sprawling mess
4. Controlling costs
5. The protocol jungle
6. Underestimating cardinality
(Click on a pitfall to jump to that section, or jump to end)
Picking Your Next Pitfall
chronosphere.io
4. Controlling costs
chronosphere.io
“It’s remarkable how common
this situation is, where an
organization is paying more for
their observability data, than
they do for their production
infrastructure.”
chronosphere.io
O11y data storage costs
are broken.
Keeping everything
model?
chronosphere.io
Know the cost of
observability
metrics data?
chronosphere.io
DATA
COLLECTION
CONTROL PLANE
PURPOSE-BUILT DATA STORES
PER TELEMETRY TYPE
CHRONOSPHERE LENS
Align cost to value Single Tenanted Architecture w/
99.99% Reliability
Turns raw data into generated
insights for each user
Customer Environment Chronosphere SaaS Platform
METRICS
|
LOGS
|
TRACES
|
EVENTS
Ingest all your data from
any source
chronosphere.io
chronosphere.io
1. Ignoring existing landscape
2. Focusing on The Pillars
3. Sneaky sprawling mess
4. Controlling costs
5. The protocol jungle
6. Underestimating cardinality
(Click on a pitfall to jump to that section, or jump to end)
Picking Your Next Pitfall
chronosphere.io
5. The protocol jungle
chronosphere.io
Without open standards,
you’ll not find a way back…
chronosphere.io
chronosphere.io
Host
Observability Backend
(Prometheus, Jaeger, Fluent Bit, etc.),
Applications
OTel Auto Instrumentation
OTel API
OTel SDK
OTel Collector Agent
OTLP
OTLP
OTLP
OTLP
OTLP
OpenTelemetry Collector (agent)
chronosphere.io
Prometheus for metrics, alerting, queries
chronosphere.io
1. Ignoring existing landscape
2. Focusing on The Pillars
3. Sneaky sprawling mess
4. Controlling costs
5. The protocol jungle
6. Underestimating cardinality
(Click on a pitfall to jump to that section, or jump to end)
Picking Your Next Pitfall
chronosphere.io
6. Underestimating cardinality
chronosphere.io
The struggle is real
“I don't yet collect spans/traces because I can hardly get our devs to care about basic metrics, let alone
traces.”
“This is a large enterprise with approx. 1000 developers. Cultivating a culture of engineering that cares
about availability is a challenge that we need to solve alongside any technical implementations.”
chronosphere.io
10 hours
on average, per week,
trying to triage and
understand incidents -
a quarter of a
40 hour work week
chronosphere.io
33%
said those issues
disrupted their
personal life
39%
admitting they are
frequently
stressed out
chronosphere.io
Cloud Native
Observability
at Scale
chronosphere.io
DATA
COLLECTION
CONTROL PLANE
PURPOSE-BUILT DATA STORES
PER TELEMETRY TYPE
CHRONOSPHERE LENS
Align cost to value Single Tenanted Architecture w/
99.99% Reliability
Turns raw data into generated
insights for each user
Customer Environment Chronosphere SaaS Platform
METRICS
|
LOGS
|
TRACES
|
EVENTS
Ingest all your data from
any source
chronosphere.io
1. Ignoring existing landscape
2. Focusing on The Pillars
3. Sneaky sprawling mess
4. Controlling costs
5. The protocol jungle
6. Underestimating cardinality
(Click on a pitfall to jump to that section, or jump to end)
Picking Your Next Pitfall
chronosphere.io
What should be
the #1 item on
cloud wishlist?
What should be #1 item on your
cloud native observability
wishlist?
chronosphere.io
chronosphere.io
Questions?
Eric D. Schabell
Director Evangelism
@ericschabell{@fosstodon.org}

More Related Content

PDF
3 Pitfalls Everyone Should Avoid with Cloud Native Observability
PPTX
KCD Porto: Choose Your Own Adventure - Cloud Naive Observability Pitfalls
PPTX
Checking the pulse of your cloud native architecture
PPTX
SRECon EU 2023 - Three Phases to Better Observability Outcomes
PPTX
Green Custard Friday Talk 19: Chaos Engineering
PDF
Trajectory 2022 - Shifting Cloud Native Observability to the Left
PPTX
3 Pitfalls Everyone Should Avoid with Cloud Data
PPTX
3 Pitfalls Everyone Should Avoid with Cloud Data
3 Pitfalls Everyone Should Avoid with Cloud Native Observability
KCD Porto: Choose Your Own Adventure - Cloud Naive Observability Pitfalls
Checking the pulse of your cloud native architecture
SRECon EU 2023 - Three Phases to Better Observability Outcomes
Green Custard Friday Talk 19: Chaos Engineering
Trajectory 2022 - Shifting Cloud Native Observability to the Left
3 Pitfalls Everyone Should Avoid with Cloud Data
3 Pitfalls Everyone Should Avoid with Cloud Data

Similar to Choose Your Own Adventure - Cloud Native Observability Pitfalls (20)

PPTX
3 Pitfalls Everyone Should Avoid with Cloud Native Data
PDF
Observability For You and Me with openTelemetry
PPTX
Open Source 101 - Observability For You and Me with OpenTelemetry
PDF
Chaos Engineering - The Art of Breaking Things in Production
PPTX
Chaos engineering
PDF
Practical Chaos Engineering
PPTX
How to Wrestle Your Observability Data Demons and Win!
PPTX
Observability For You and Me with OpenTelemetry
ODP
Logs And Backups
PDF
SRECon Coherent Performance
PDF
Unraveling mysteries of the Universe at CERN, with OpenStack and Hadoop
PDF
SCAM 2012 Keynote Slides on Cooperative Testing and Analysis by Tao Xie
PDF
Embracing the Monolith
PDF
Embracing the Monolith in Small Teams: Doubling down on python to move fast w...
PDF
Let's Test Together by Justin Hunter
PDF
Building a cutting-edge data processing environment on a budget
PDF
Three Pillars, No Answers: Helping Platform Teams Solve Real Observability Pr...
PPTX
Put Some SRE in Your Shipped Software
PDF
Using security to drive chaos engineering - April 2018
PPTX
Optimizing Observability Spend: Metrics
3 Pitfalls Everyone Should Avoid with Cloud Native Data
Observability For You and Me with openTelemetry
Open Source 101 - Observability For You and Me with OpenTelemetry
Chaos Engineering - The Art of Breaking Things in Production
Chaos engineering
Practical Chaos Engineering
How to Wrestle Your Observability Data Demons and Win!
Observability For You and Me with OpenTelemetry
Logs And Backups
SRECon Coherent Performance
Unraveling mysteries of the Universe at CERN, with OpenStack and Hadoop
SCAM 2012 Keynote Slides on Cooperative Testing and Analysis by Tao Xie
Embracing the Monolith
Embracing the Monolith in Small Teams: Doubling down on python to move fast w...
Let's Test Together by Justin Hunter
Building a cutting-edge data processing environment on a budget
Three Pillars, No Answers: Helping Platform Teams Solve Real Observability Pr...
Put Some SRE in Your Shipped Software
Using security to drive chaos engineering - April 2018
Optimizing Observability Spend: Metrics
Ad

More from Eric D. Schabell (20)

PPTX
Meet the New Kid in the Sandbox - Integrating Visualization with Prometheus
PPTX
Mastering Fluent Bit: Ultimate Guide to Integrating Telemetry Pipelines with ...
PPTX
Mastering Fluent Bit: Ultimate Guide to Integrating Telemetry Pipelines with ...
PPTX
Observability-as-a-Service: When Platform Engineers meet SREs
PPTX
Mastering Fluent Bit: Ultimate Guide to Integrating Telemetry Pipelines with ...
PPTX
When Platform Engineers meet SREs - The Birth of O11y-as-a-Service Superpowers
PPTX
Meet the New Kid in the Sandbox - Integrating Visualization with Prometheus
PPTX
Taking Back Control of Your Telemetry Data with Fluent Bit
PPTX
Finding observability and DevEx tranquility sailing the monitoring data seas
PDF
Meet the New Kid in the Sandbox - Integrating Visualization with Prometheus
PPTX
MTTS - Sleep more, slog less with automated cloud native o11y platforms
PPTX
Infobip Shift EU 2024: Platform Engineers Arise - Adding Observability to You...
PPTX
PromCon EU 2024: Meet the New Kid in the Sandbox - Integrating Visualization ...
PPTX
Taking Back Control of Your Telemetry Data with Fluent Bit
PDF
Observability For You and Me with OpenTelemetry
PPTX
Power Up with Podman - Cloud Native + K8s Meetup
PPTX
Choose Your Own Observability Adventure
PDF
Observability For You and Me with OpenTelemetry (with demo)
PDF
Observability For You and Me with OpenTelemetry
PDF
Roadmap to Becoming a CNCF Ambassador
Meet the New Kid in the Sandbox - Integrating Visualization with Prometheus
Mastering Fluent Bit: Ultimate Guide to Integrating Telemetry Pipelines with ...
Mastering Fluent Bit: Ultimate Guide to Integrating Telemetry Pipelines with ...
Observability-as-a-Service: When Platform Engineers meet SREs
Mastering Fluent Bit: Ultimate Guide to Integrating Telemetry Pipelines with ...
When Platform Engineers meet SREs - The Birth of O11y-as-a-Service Superpowers
Meet the New Kid in the Sandbox - Integrating Visualization with Prometheus
Taking Back Control of Your Telemetry Data with Fluent Bit
Finding observability and DevEx tranquility sailing the monitoring data seas
Meet the New Kid in the Sandbox - Integrating Visualization with Prometheus
MTTS - Sleep more, slog less with automated cloud native o11y platforms
Infobip Shift EU 2024: Platform Engineers Arise - Adding Observability to You...
PromCon EU 2024: Meet the New Kid in the Sandbox - Integrating Visualization ...
Taking Back Control of Your Telemetry Data with Fluent Bit
Observability For You and Me with OpenTelemetry
Power Up with Podman - Cloud Native + K8s Meetup
Choose Your Own Observability Adventure
Observability For You and Me with OpenTelemetry (with demo)
Observability For You and Me with OpenTelemetry
Roadmap to Becoming a CNCF Ambassador
Ad

Recently uploaded (20)

PPTX
The various Industrial Revolutions .pptx
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
Taming the Chaos: How to Turn Unstructured Data into Decisions
PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PDF
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PDF
CloudStack 4.21: First Look Webinar slides
PDF
A comparative study of natural language inference in Swahili using monolingua...
PDF
August Patch Tuesday
PDF
Unlock new opportunities with location data.pdf
PPT
Module 1.ppt Iot fundamentals and Architecture
PDF
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
PPTX
Chapter 5: Probability Theory and Statistics
PPTX
Final SEM Unit 1 for mit wpu at pune .pptx
PPTX
O2C Customer Invoices to Receipt V15A.pptx
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PDF
Getting Started with Data Integration: FME Form 101
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
The various Industrial Revolutions .pptx
Assigned Numbers - 2025 - Bluetooth® Document
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
Taming the Chaos: How to Turn Unstructured Data into Decisions
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
1 - Historical Antecedents, Social Consideration.pdf
Univ-Connecticut-ChatGPT-Presentaion.pdf
CloudStack 4.21: First Look Webinar slides
A comparative study of natural language inference in Swahili using monolingua...
August Patch Tuesday
Unlock new opportunities with location data.pdf
Module 1.ppt Iot fundamentals and Architecture
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
Chapter 5: Probability Theory and Statistics
Final SEM Unit 1 for mit wpu at pune .pptx
O2C Customer Invoices to Receipt V15A.pptx
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
Getting Started with Data Integration: FME Form 101
Group 1 Presentation -Planning and Decision Making .pptx

Choose Your Own Adventure - Cloud Native Observability Pitfalls