SlideShare a Scribd company logo
1
Logging and machine data at Scale
James Governor @monkchips co-founder.
2
“The lumber industry sells what used to be waste —
sawdust, chips, and shredded wood — for a pretty profit.
Today you’ll find these by-products in synthetic fireplace
logs, concrete, ice strengtheners, mulch, particle board,
fuel, livestock and pet bedding, winter road traction,
weed killing and more”
– Jason Fried, CEO Basecamp
3
4
5
6
7
8
9
Zero Users Care What The System Health
is
All Users Care What Their Experience Is
Nines Don’t Matter if Users Aren’t Happy
Charity Majors, Honeycomb.io
10
Observability bridges tracing, logging, and monitoring
Logs are no longer about history
Real time, streams, and adhoc queries
11
https://simplicable.com/new/machine-data
12
Use Cases
Compliance
DevOps, IT Service management/root cause analysis
ecommerce, digital marketing optimisation
Fraud detection
IoT and integration with data from physical assets
Healthcare
Real time security
13
Always Compliance
Anti-money laundering
PCI
GDPR is coming – fines potentially 4% of global turnover
14
Splunk
Dubai Airport instruments literally everything
– Even hand washing in the bathroom from electronic taps and soap
Rackspace ingests 3TB of data every day
– Security monitoring and management
Tactical Assault Light Operator Suit (TALOS)
– Monitoring including vital signs
15
ELK as a disruptive entrant
Started as text search, now as a default for many orgs
Sprint – 3bn events per day
– Digital Transformation 4.0 Project
– real-time data from Retail Management and Store Ops
– 200 dashboards represent events from logs, databases, emails,
syslogs, test messages, and internal and vendor application APIs.
Uber, Facebook, Netflix
16
17
18
Edge Filtering
IoT
Sampling and polling
19
Tracing at Facebook
2014 Mystery Machine took 2 hours to compute a model from 1.3M traces
2017 Canopy generates and processes 1.3 billion traces per day
(spanning end-user devices, web servers, and backend services, backs 129
performance datasets from high-level end-to-end metrics to specific custom use cases)
Source https://blog.acolyer.org/2017/11/22/canopy-an-end-to-end-performance-tracing-and-analysis-system/
20
Roll your own kind of sucks
21
Using machine data for business process optimization
Data isn’t on the balance sheet
But data as product is
Data optimized services and customer experiences
Data culture
Observability is the new hotness
22
Cloud Native Tools
Grafana – visualization graph and dashboards,
Prometheus – monitoring and alerting, time series
Fluentd – data collector and log aggregation
Graphite – time series and graphs

More Related Content

PPT
Convenience is the killer app
PPTX
Glenn Ricart of US Ignite: Gigabit Apps (Gigabit City Summit)
PPTX
Top 11 wireless advances
PDF
CI/CD for Serverless Applications on AWS
PPTX
Digital Journey
PPTX
Cloud Computing 101
PDF
Web Analytics Wednesday Melbourne Meet Up
PPTX
Highway to heaven - Microservices Meetup Munich
Convenience is the killer app
Glenn Ricart of US Ignite: Gigabit Apps (Gigabit City Summit)
Top 11 wireless advances
CI/CD for Serverless Applications on AWS
Digital Journey
Cloud Computing 101
Web Analytics Wednesday Melbourne Meet Up
Highway to heaven - Microservices Meetup Munich

Similar to Logging and machine data at Scale. re:Invent 2017 (20)

PDF
Data dynamics in IoT Era
PDF
Rethinking the Database in the IoT Era
PDF
ThoughtWorks Technology Radar Roadshow - Sydney
PPTX
Big Data Analytics PPT - S1 working .pptx
PDF
How Data-Driven Continuous Intelligence Benefits Aid the Development and Mana...
PDF
IOT_MODULE_4.pd easy to understand notes
PPTX
Building Scalable IoT Apps (QCon S-F)
PDF
Data Management At Scale Best Practices For Enterprise Architecture 1st Editi...
PDF
How a Cloud Computing Provider Reached the Holy Grail of Visibility
PDF
Using Spark and Riak for IoT Apps—Patterns and Anti-Patterns: Spark Summit Ea...
PDF
Companies in cloud ecosystem
PDF
ML & Data Processing for Industrial IoT with InfluxDB
PPTX
Unushs susus susujss. Ssuusussjjsjsit 4.pptx
PDF
The Cloud Data Lake Early Release Rukmani Gopalan
PPTX
Cloud and Data Analytics Architecture: Data Everywhere for Everyone
PDF
ThoughtWorks Technology Radar Roadshow - Melbourne
PPTX
Lecture1 BIG DATA and Types of data in details
PPTX
Dell hans timmerman v1.1
PPTX
Big Data Session 1.pptx
PDF
Architecting Agile Data Applications for Scale
Data dynamics in IoT Era
Rethinking the Database in the IoT Era
ThoughtWorks Technology Radar Roadshow - Sydney
Big Data Analytics PPT - S1 working .pptx
How Data-Driven Continuous Intelligence Benefits Aid the Development and Mana...
IOT_MODULE_4.pd easy to understand notes
Building Scalable IoT Apps (QCon S-F)
Data Management At Scale Best Practices For Enterprise Architecture 1st Editi...
How a Cloud Computing Provider Reached the Holy Grail of Visibility
Using Spark and Riak for IoT Apps—Patterns and Anti-Patterns: Spark Summit Ea...
Companies in cloud ecosystem
ML & Data Processing for Industrial IoT with InfluxDB
Unushs susus susujss. Ssuusussjjsjsit 4.pptx
The Cloud Data Lake Early Release Rukmani Gopalan
Cloud and Data Analytics Architecture: Data Everywhere for Everyone
ThoughtWorks Technology Radar Roadshow - Melbourne
Lecture1 BIG DATA and Types of data in details
Dell hans timmerman v1.1
Big Data Session 1.pptx
Architecting Agile Data Applications for Scale
Ad

More from James Governor (20)

PPTX
2020 progressive delivery, git ops, observability
PPTX
An introduction to progressive delivery
PPTX
DevOps World lisbon 2019
PPTX
CI/CD and Progressive Delivery. Reframing velocity vs risk.
PPTX
The Next Wave: 100M Developers Worldwide
PPTX
Goto Copenhagen: How beauteous technology is! O brave new world.
PPT
Sympathy for the DevRel
PPT
A Tale of Two ITs. Tech, Power, Responsibility
PPT
Progressive delivery at DevOps World
PPT
Progressive Delivery at Spring One Platform
PPT
The Quickening at Futurestack 2018
PPT
Future Platforms
PPT
Convenience is the killer app
PPT
Data transformation is the new digital transformation
PPT
Space cluster disrupt for red hat
PPTX
Disruptors and trends in app dev test industry
PPT
Space cluster disrupt, Open Innovation labs
PPT
Devops market opportunity
PPTX
Test 2020 HPE Discover 2016
PPTX
Hacking the robots
2020 progressive delivery, git ops, observability
An introduction to progressive delivery
DevOps World lisbon 2019
CI/CD and Progressive Delivery. Reframing velocity vs risk.
The Next Wave: 100M Developers Worldwide
Goto Copenhagen: How beauteous technology is! O brave new world.
Sympathy for the DevRel
A Tale of Two ITs. Tech, Power, Responsibility
Progressive delivery at DevOps World
Progressive Delivery at Spring One Platform
The Quickening at Futurestack 2018
Future Platforms
Convenience is the killer app
Data transformation is the new digital transformation
Space cluster disrupt for red hat
Disruptors and trends in app dev test industry
Space cluster disrupt, Open Innovation labs
Devops market opportunity
Test 2020 HPE Discover 2016
Hacking the robots
Ad

Recently uploaded (20)

PPTX
Big Data Technologies - Introduction.pptx
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Spectral efficient network and resource selection model in 5G networks
PPTX
sap open course for s4hana steps from ECC to s4
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Approach and Philosophy of On baking technology
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
Cloud computing and distributed systems.
PDF
Electronic commerce courselecture one. Pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
MYSQL Presentation for SQL database connectivity
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
Spectroscopy.pptx food analysis technology
PDF
Chapter 3 Spatial Domain Image Processing.pdf
Big Data Technologies - Introduction.pptx
Digital-Transformation-Roadmap-for-Companies.pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Spectral efficient network and resource selection model in 5G networks
sap open course for s4hana steps from ECC to s4
The AUB Centre for AI in Media Proposal.docx
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Approach and Philosophy of On baking technology
Per capita expenditure prediction using model stacking based on satellite ima...
Reach Out and Touch Someone: Haptics and Empathic Computing
Cloud computing and distributed systems.
Electronic commerce courselecture one. Pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
MYSQL Presentation for SQL database connectivity
gpt5_lecture_notes_comprehensive_20250812015547.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Spectroscopy.pptx food analysis technology
Chapter 3 Spatial Domain Image Processing.pdf

Logging and machine data at Scale. re:Invent 2017

  • 1. 1 Logging and machine data at Scale James Governor @monkchips co-founder.
  • 2. 2 “The lumber industry sells what used to be waste — sawdust, chips, and shredded wood — for a pretty profit. Today you’ll find these by-products in synthetic fireplace logs, concrete, ice strengtheners, mulch, particle board, fuel, livestock and pet bedding, winter road traction, weed killing and more” – Jason Fried, CEO Basecamp
  • 3. 3
  • 4. 4
  • 5. 5
  • 6. 6
  • 7. 7
  • 8. 8
  • 9. 9 Zero Users Care What The System Health is All Users Care What Their Experience Is Nines Don’t Matter if Users Aren’t Happy Charity Majors, Honeycomb.io
  • 10. 10 Observability bridges tracing, logging, and monitoring Logs are no longer about history Real time, streams, and adhoc queries
  • 12. 12 Use Cases Compliance DevOps, IT Service management/root cause analysis ecommerce, digital marketing optimisation Fraud detection IoT and integration with data from physical assets Healthcare Real time security
  • 13. 13 Always Compliance Anti-money laundering PCI GDPR is coming – fines potentially 4% of global turnover
  • 14. 14 Splunk Dubai Airport instruments literally everything – Even hand washing in the bathroom from electronic taps and soap Rackspace ingests 3TB of data every day – Security monitoring and management Tactical Assault Light Operator Suit (TALOS) – Monitoring including vital signs
  • 15. 15 ELK as a disruptive entrant Started as text search, now as a default for many orgs Sprint – 3bn events per day – Digital Transformation 4.0 Project – real-time data from Retail Management and Store Ops – 200 dashboards represent events from logs, databases, emails, syslogs, test messages, and internal and vendor application APIs. Uber, Facebook, Netflix
  • 16. 16
  • 17. 17
  • 19. 19 Tracing at Facebook 2014 Mystery Machine took 2 hours to compute a model from 1.3M traces 2017 Canopy generates and processes 1.3 billion traces per day (spanning end-user devices, web servers, and backend services, backs 129 performance datasets from high-level end-to-end metrics to specific custom use cases) Source https://blog.acolyer.org/2017/11/22/canopy-an-end-to-end-performance-tracing-and-analysis-system/
  • 20. 20 Roll your own kind of sucks
  • 21. 21 Using machine data for business process optimization Data isn’t on the balance sheet But data as product is Data optimized services and customer experiences Data culture Observability is the new hotness
  • 22. 22 Cloud Native Tools Grafana – visualization graph and dashboards, Prometheus – monitoring and alerting, time series Fluentd – data collector and log aggregation Graphite – time series and graphs

Editor's Notes

  • #7: In society we put a huge premium on convenience. Humans are fairly lazy when it comes to it. why did uber win, and why do so many Silicon Valley companies emerge that focus on convenience? Because convenience wins. Whatever you think of uber, it wins because of the way is packaged and serviced. It works and works well. I like black cabs but I hate messing around with receipts and cash.