SlideShare a Scribd company logo
Fluentd: 
Towards Unified 
Logging Layer 
Kiyoto Tamura 
@kiyototamura 
Treasure Data, Inc.
Pivotal Open Source:  Using Fluentd to gain insights into your logs
• Fluentd maintainer & 
community manager 
• data nerd 
• math nerd 
• nerd 
whoami
Pivotal Open Source:  Using Fluentd to gain insights into your logs
this talk isn’t about 
Big Data
it’s about 
Log Data
a motivating anecdote
The life of 
a data scientist 
(me in 2009)
http://cacm.acm.org/blogs/blog-cacm/169199-data-science-workflow-overview- 
and-challenges/fulltext
Acquire Data (or 
so you think) 
WUT!? Invalid 
UTF8? 
Fix the encoding 
issue… 
Yell at the 
engineers 
Some columns 
are missing!? 
Run the 
script…DIVISION 
BY ZERO!!!
Hmm…
Logging.priority 
=> :not_super_high
analytics.priority 
=> :very_high
analytics.needs? :logs 
=> true
outage.priority 
=> :super_high
outage.needs? :logs 
=> [“no”, “shit”]
Metrics and Monitoring 
(hint: you need logs)
Pivotal Open Source:  Using Fluentd to gain insights into your logs
Ops 
VPs 
Engineers 
Managers 
PMs 
More PMs
Pivotal Open Source:  Using Fluentd to gain insights into your logs
How can we do better?
How to Unify Logging (1) 
Common Interface + Decoupling 
Mobile Web IoT 
Message 
Queue 
Search 
Backend 
Analytic DB 
Archival 
Storage 
Unified 
Logging Layer 
Parse into a 
common data format 
Decouple from 
data sources
How to Unify Logging (2) 
Reliability & Scalability 
Mobile Web IoT 
Message 
Queue 
Search 
Backend 
Analytic DB 
Archival 
Storage 
Unified 
Logging Layer 
Need 
persistence/buff 
ering 
Robust retries 
and recovery
How to Unify Logging (3) 
Extensibility 
? Web IoT 
? Search 
Backend 
Analytic DB 
Archival 
Storage 
Unified 
Logging Layer 
Adding a new 
in/output must be 
easy 
Same for filters
Fluentd can help us unify logging
how it works
Pivotal Open Source:  Using Fluentd to gain insights into your logs
127.0.0.1 - - [05/Feb/2012:17:11:55 
+0000] "GET / HTTP/1.1" 200 140 "-" 
"Mozilla/5.0 (Windows NT 6.1; WOW64) 
AppleWebKit/535.19 (KHTML, like Gecko) 
Chrome/18.0.1025.5 Safari/535.19"
{ 
"host": "127.0.0.1", 
"user": "-", 
"method": "GET", 
"path": "/", 
"code": "200", 
"size": "140", 
"referer": "-", 
"agent": “Mozilla/5.0 (Windows…" 
}
Pivotal Open Source:  Using Fluentd to gain insights into your logs
Parse as JSON!
?
[“05/Feb/2012:17:11:55”,“web.access”,{ 
"host": "127.0.0.1", 
"user": "-", 
"method": "GET", 
"path": "/", 
"code": "200", 
"size": "140", 
"referer": "-", 
"agent": “Mozilla/5.0 (Windows…" 
}] 
timestamp tag 
record
?
web.mongodb 
web.file 
web.hdfs 
web.s3 
web.mysql
Demo: Bring Your Own A/B Testing
How A/B Testing Starts 
website 
<script>…</script> 
A/B Testing 
SaaS
How A/B Testing Evolves 
Android iOS 
<script>…</script> 
A/B Testing 
SaaS 1 
website 
A/B Testing 
SaaS 1 
A/B Testing 
SaaS 1 
<script>…</script> 
A/B Testing 
SaaS 1 
event.post()… 
<script>…</script> 
event.post()…
How A/B Testing Evolves 
Android iOS 
<script>…</script> 
A/B Testing 
SaaS 1 
website 
A/B Testing 
SaaS 1 
A/B Testing 
SaaS 1 
<script>…</script> 
A/B Testing 
SaaS 1 
event.post()… 
<script>…</script> 
event.post()… 
Looks Familiar?
Bring Your Own A/B Testing! 
Android website iOS 
A/B Testing 
SaaS 1 
A/B Testing 
SaaS 2 
Analytic DB 
Archival 
Storage
bit.ly/cf-fluentd
{ 
“install”: “gem install fluentd”, 
“website”: “www.fluentd.org”, 
“github” : “fluent/fluentd”, 
“twitter”: “@fluentd” 
}

More Related Content

PDF
Your WordPress Site Has Been Hacked: What Now?
PPTX
Treasure Data Overview
PDF
Fluentd unified logging layer
PDF
Packaging Ecosystems -Monki Gras 2017
PDF
Open source data ingestion
PPTX
Fluentd: Towards Unified Logging (SF Logging Meetup Jult 2014)
PDF
Fluentd meetup #2
PPTX
CSE3069 - FLUENTD real time analytics.pptx
Your WordPress Site Has Been Hacked: What Now?
Treasure Data Overview
Fluentd unified logging layer
Packaging Ecosystems -Monki Gras 2017
Open source data ingestion
Fluentd: Towards Unified Logging (SF Logging Meetup Jult 2014)
Fluentd meetup #2
CSE3069 - FLUENTD real time analytics.pptx

Similar to Pivotal Open Source: Using Fluentd to gain insights into your logs (20)

PDF
Fluentd - RubyKansai 65
PDF
Unifying Events and Logs into the Cloud
PDF
Unifying Events and Logs into the Cloud
PDF
Fluentd Overview, Now and Then
PDF
Fluentd Project Intro at Kubecon 2019 EU
PDF
Fluentd 101
PPTX
How fluentd fits into the modern software landscape
PDF
Fluentd and Docker - running fluentd within a docker container
PDF
Fluentd and Docker - running fluentd within a docker container
PDF
Centralized + Unified Logging
PDF
Fluentd at HKOScon
PPTX
Building a Unified Logging Layer with Fluentd, Elasticsearch and Kibana
PPTX
Big Data Day LA 2016/ Big Data Track - Fluentd and Embulk: Collect More Data,...
PDF
Centralized logging system using mongoDB
PDF
Collect distributed application logging using fluentd (EFK stack)
PDF
Logging in Action: With Fluentd, Kubernetes and more 1st Edition Phil Wilkins
PDF
Fluentd - CNCF Paris
PDF
VMworld 2013: Deep Dive into vSphere Log Management with vCenter Log Insight
PDF
The basics of fluentd
PDF
Log aggregation: using Elasticsearch, Fluentd/Fluentbit and Kibana (EFK)
Fluentd - RubyKansai 65
Unifying Events and Logs into the Cloud
Unifying Events and Logs into the Cloud
Fluentd Overview, Now and Then
Fluentd Project Intro at Kubecon 2019 EU
Fluentd 101
How fluentd fits into the modern software landscape
Fluentd and Docker - running fluentd within a docker container
Fluentd and Docker - running fluentd within a docker container
Centralized + Unified Logging
Fluentd at HKOScon
Building a Unified Logging Layer with Fluentd, Elasticsearch and Kibana
Big Data Day LA 2016/ Big Data Track - Fluentd and Embulk: Collect More Data,...
Centralized logging system using mongoDB
Collect distributed application logging using fluentd (EFK stack)
Logging in Action: With Fluentd, Kubernetes and more 1st Edition Phil Wilkins
Fluentd - CNCF Paris
VMworld 2013: Deep Dive into vSphere Log Management with vCenter Log Insight
The basics of fluentd
Log aggregation: using Elasticsearch, Fluentd/Fluentbit and Kibana (EFK)
Ad

Recently uploaded (20)

PPTX
MYSQL Presentation for SQL database connectivity
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
KodekX | Application Modernization Development
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Machine learning based COVID-19 study performance prediction
PPTX
Cloud computing and distributed systems.
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Encapsulation theory and applications.pdf
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Approach and Philosophy of On baking technology
MYSQL Presentation for SQL database connectivity
Reach Out and Touch Someone: Haptics and Empathic Computing
KodekX | Application Modernization Development
Diabetes mellitus diagnosis method based random forest with bat algorithm
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Mobile App Security Testing_ A Comprehensive Guide.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Digital-Transformation-Roadmap-for-Companies.pptx
Machine learning based COVID-19 study performance prediction
Cloud computing and distributed systems.
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Encapsulation theory and applications.pdf
The AUB Centre for AI in Media Proposal.docx
Building Integrated photovoltaic BIPV_UPV.pdf
Spectral efficient network and resource selection model in 5G networks
Per capita expenditure prediction using model stacking based on satellite ima...
Review of recent advances in non-invasive hemoglobin estimation
Advanced methodologies resolving dimensionality complications for autism neur...
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Approach and Philosophy of On baking technology
Ad

Pivotal Open Source: Using Fluentd to gain insights into your logs

Editor's Notes

  • #2: Thanks the organizers, Pivotal, audience
  • #5: so, I am a big fan of spoilers when it comes to tech talks. I think spoilers give the audience a much better idea of what to expect. so here it is.
  • #6: phew. I just said that. no, this talk is definitely not about “big data” besides poking fun around the buzzword… people can’t seem to agree on what it is. I want to talk about something far more concrete
  • #9: I worked as a quantitative analyst for three years
  • #12: definitely not just data engineers’ problem started to think more deeply about why logging becomes haphazard. talked to hundreds of people at treasure data. eventually, I had a couple of observations.
  • #14: and here is another observation
  • #23: the first requirement is common interface between data inputs and outputs. why? common interface -> one data can be stored into multiple places with the same semantics. You don’t know if you stick to the same backend system. You probably will need to piece together information from multiple data sources.
  • #24: data pipelines fail format changes volume spikes hardware/IaaS hiccups Scalability matters You need to be able to scale out the logging layer.
  • #25: New data sources/outputs come up Need to be able to extend your system
  • #26: So, here is a rather self-aggrandizing claim: Fluentd can be that unified logging layer. In the rest of the talk, I will show you how.
  • #27: yes, it is about log data!