Structured logging
Reliable forwarding
Pluggable architecture
http://fluentd.org/
Agenda
> Background
> Overview
> Product Comparison
> Use cases
Background
Data Processing
Collect Store Process Visualize
Data source
Reporting
Monitoring
Related Products
Store Process
Cloudera
Hortonworks
Treasure Data
Collect Visualize
Tableau
Excel
R
easier & shorter time
???
The basics of fluentd
Before Fluentd
Application
・・・
Server2
Application
・・・
Server3
Application
・・・
Server1
FluentLog
High Latency!
must wait for a day...
After Fluentd
Application
・・・
Server2
Application
・・・
Server3
Application
・・・
Server1
In streaming!
Fluentd Fluentd Fluentd
Fluentd Fluentd
Overview
> Open sourced log collector written in Ruby
> Reliable, scalable and easy to extend
> Using rubygems ecosystem for plugins
In short
It’s like syslogd, but
uses JSON for log messages
tail
insert
event
buffering
127.0.0.1 - - [11/Dec/2012:07:26:27] "GET / ...
127.0.0.1 - - [11/Dec/2012:07:26:30] "GET / ...
127.0.0.1 - - [11/Dec/2012:07:26:32] "GET / ...
127.0.0.1 - - [11/Dec/2012:07:26:40] "GET / ...
127.0.0.1 - - [11/Dec/2012:07:27:01] "GET / ...
...
Fluentd
Web Server
2012-02-04 01:33:51
apache.log
{
  "host": "127.0.0.1",
  "method": "GET",
  ...
}
Example (apache to mongo)
Event structure (log message)
✓ Time
> default second unit
> from data source or adding parsed time
✓ Tag
> for message routing
✓ Record
> JSON format
> MessagePack internally
> structured, not unstructured text
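A minimal Ruby sketch of how one access log line could become a Time / Tag / Record event. The regular expression, the `parse_line` helper, and the completed sample line are assumptions made for illustration; Fluentd's own apache parser is more thorough.

```ruby
require "json"
require "time"

# Simplified pattern for an Apache-style access log line (an assumption for
# this sketch, not the pattern Fluentd itself uses).
APACHE_LINE = /\A(?<host>\S+) \S+ \S+ \[(?<time>[^\]]+)\] "(?<method>\S+) (?<path>\S+)[^"]*"/

# Turn one log line into the [time, tag, record] triple that makes up an event.
# Returns nil when the line does not match the pattern.
def parse_line(line, tag)
  m = APACHE_LINE.match(line)
  return nil unless m

  # The slide's timestamps carry no zone, so UTC is assumed here.
  time = Time.strptime(m[:time] + " +0000", "%d/%b/%Y:%H:%M:%S %z").to_i
  record = { "host" => m[:host], "method" => m[:method], "path" => m[:path] }
  [time, tag, record]
end

# Hypothetical sample line, completed from the truncated ones on the slide.
SAMPLE = '127.0.0.1 - - [11/Dec/2012:07:26:27] "GET / HTTP/1.1"'

time, tag, record = parse_line(SAMPLE, "apache.access")
puts "#{Time.at(time).utc.strftime('%Y-%m-%d %H:%M:%S')} #{tag} #{JSON.generate(record)}"
# prints: 2012-12-11 07:26:27 apache.access {"host":"127.0.0.1","method":"GET","path":"/"}
```

Time is kept as an integer number of seconds, the tag is a dotted string used only for routing, and the record is a plain hash that serializes to JSON.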
Pluggable Architecture
Input (pluggable)
> Forward
> HTTP
> File tail
> dstat
> ...
Engine
Buffer (pluggable)
> File
> Memory
Output (pluggable)
> Forward
> File
> MongoDB
> ...
Output
> rewrite
> ...
Fluentd
# Ruby
Fluent.open("myapp")
Fluent.event("login", {"user" => 38})
#=> 2012-12-11 07:56:01 myapp.login {"user":38}
> Ruby
> Java
> Perl
> PHP
> Python
> D
> Scala
> ...
Application
Time:Tag:Record
Client libraries
Configuration and operation
> No central / master node
> HTTP include helps configuration sharing
> Operation depends on your environment
> Use your daemon management tool
> Use Chef in Treasure Data
> Apache like syntax and Ruby DSL
# receive events via HTTP
<source>
  type http
  port 8888
</source>

# read logs from a file
<source>
  type tail
  path /var/log/httpd.log
  format apache
  tag apache.access
</source>

# save access logs to MongoDB
<match apache.access>
  type mongo
  database apache
  collection log
</match>

# save alerts to a file
<match alert.**>
  type file
  path /var/log/fluent/alerts
</match>

# forward other logs to servers
<match **>
  type forward
  <server>
    host 192.168.0.11
    weight 20
  </server>
  <server>
    host 192.168.0.12
    weight 60
  </server>
</match>

include http://example.com/conf
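The match blocks in this configuration route events by tag. A rough Ruby sketch of that idea follows, assuming `*` stands for exactly one tag part, `**` for zero or more parts, and the first matching rule in config order wins. The helper names (`parts_match?`, `tag_match?`, `route`) are hypothetical, and other pattern forms that Fluentd supports are left out.

```ruby
# Compare a pattern and a tag, both already split into dot-separated parts.
def parts_match?(pattern, tag)
  return tag.empty? if pattern.empty?

  head, *rest = pattern
  if head == "**"
    # "**" may swallow zero or more leading tag parts.
    (0..tag.length).any? { |n| parts_match?(rest, tag.drop(n)) }
  elsif tag.empty?
    false
  else
    (head == "*" || head == tag.first) && parts_match?(rest, tag.drop(1))
  end
end

def tag_match?(pattern, tag)
  parts_match?(pattern.split("."), tag.split("."))
end

# Rules in the same order as the match blocks in the configuration.
RULES = [
  ["apache.access", "mongo"],
  ["alert.**", "file"],
  ["**", "forward"],
].freeze

# Return the output type of the first rule whose pattern matches the tag.
def route(tag)
  rule = RULES.find { |pattern, _output| tag_match?(pattern, tag) }
  rule && rule[1]
end

puts route("apache.access")  # prints: mongo
puts route("alert.disk")     # prints: file
puts route("myapp.login")    # prints: forward
```

Because `**` matches everything, the forward rule has to come last; placed first, it would capture the access logs and alerts as well.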
Reliability (core + plugin)
> Buffering
> Use file buffer for persistent data
> buffer chunk has an ID for idempotent writes
> Retrying
> Error handling
> transaction, failover, etc on forward plugin
> secondary for backup
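Retrying with an exponentially growing wait can be sketched in a few lines of Ruby. The initial wait, the cap, the retry limit, and the function names below are illustrative assumptions, not Fluentd's defaults or internals.

```ruby
# Wait before the n-th retry (n starts at 0): initial * 2**n, capped at max_wait.
def retry_wait(attempt, initial: 1.0, max_wait: 60.0)
  [initial * (2**attempt), max_wait].min
end

# Try to write a chunk with the given block, retrying when it raises.
# Returns true on success and false once the retry limit is exhausted;
# that is the point where a secondary (backup) output could take over.
def flush_with_retry(chunk, limit: 5, sleeper: ->(seconds) { sleep(seconds) })
  attempt = 0
  begin
    yield chunk
    true
  rescue StandardError
    return false if attempt >= limit

    sleeper.call(retry_wait(attempt))
    attempt += 1
    retry
  end
end

# Usage: a destination that fails twice, then accepts the chunk.
# The sleeper only records the waits so the example finishes immediately.
waits = []
calls = 0
ok = flush_with_retry("chunk-1", sleeper: ->(s) { waits << s }) do |_chunk|
  calls += 1
  raise "destination down" if calls < 3
end
puts "ok=#{ok} calls=#{calls} waits=#{waits.inspect}"
# prints: ok=true calls=3 waits=[1.0, 2.0]
```

The chunk stays in the buffer between attempts, which is why a file buffer matters: the data survives a restart while the destination is down.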
Plugins - use rubygems
$ fluent-gem search -rd fluent-plugin
$ fluent-gem search -rd fluent-mixin
$ fluent-gem install fluent-plugin-mongo
http://www.fluentd.org/plugins
in_tail
✓ read a log file
✓ read log files in directory
✓ custom regexp
✓ custom parser in Ruby
Apache
access.log
Fluentd
Supported format:
> apache
> apache2
> syslog
> nginx
> json
> csv
> tsv
> ltsv
> none
> multiline
Fluentd
out_mongo
Apache
buffer
access.log
✓ retry automatically
✓ exponential retry wait
✓ persistent on a file
Fluentd
out_webhdfs
buffer
✓ retry automatically
✓ exponential retry wait
✓ persistent on a file
✓ slice files based on time
2013-01-01/01/access.log.gz
2013-01-01/02/access.log.gz
2013-01-01/03/access.log.gz
...
HDFS
✓ custom text formatter
Apache
access.log
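Slicing files based on time amounts to deriving the output path from each event's timestamp. A small Ruby sketch follows, assuming hourly slices in UTC and the path layout shown on the slide; `sliced_path` and `slice_events` are hypothetical helper names, not part of out_webhdfs.

```ruby
# Build the hourly path for a timestamp, e.g. 2013-01-01/01/access.log.gz.
def sliced_path(time, name: "access.log")
  time.getutc.strftime("%Y-%m-%d/%H/") + name + ".gz"
end

# Group [unix_time, record] pairs by the file they would be written to.
def slice_events(events)
  events.group_by { |unix_time, _record| sliced_path(Time.at(unix_time)) }
end

puts sliced_path(Time.utc(2013, 1, 1, 1, 30))
# prints: 2013-01-01/01/access.log.gz
```

Events from the same hour land in the same file, so one buffer chunk per slice can be compressed and uploaded as a unit.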
out_copy + other plugins
✓ routing based on tags
✓ copy to multiple storages
Amazon S3
Hadoop
Fluentd
buffer
Apache
access.log
out_forward
apache
✓ automatic fail-over
✓ load balancing
Apache
access.log
Fluentd
buffer
✓ retry automatically
✓ exponential retry wait
✓ persistent on a file
Fluentd
Fluentd
Fluentd
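Load balancing with fail-over can be illustrated with a weighted random pick over the servers that are currently reachable. This is only a sketch of the idea, using the weights 20 and 60 from the configuration example; the `Server` struct, its `alive` flag, and `pick_server` are assumptions, and the actual out_forward plugin may balance differently.

```ruby
Server = Struct.new(:host, :weight, :alive)

# Pick a live server with probability proportional to its weight.
# Returns nil when no server is available.
def pick_server(servers, rng: Random.new)
  candidates = servers.select(&:alive)
  return nil if candidates.empty?

  point = rng.rand(candidates.sum(&:weight)) # integer in 0...total weight
  candidates.each do |server|
    return server if point < server.weight

    point -= server.weight
  end
end

servers = [
  Server.new("192.168.0.11", 20, true),
  Server.new("192.168.0.12", 60, true),
]

# With both servers alive, picks fall roughly 1:3 between the two hosts.
rng = Random.new(42)
counts = Hash.new(0)
8000.times { counts[pick_server(servers, rng: rng).host] += 1 }
puts counts.inspect

# Fail-over: once the second server is marked down, all traffic goes to the first.
servers[1].alive = false
puts pick_server(servers).host # prints: 192.168.0.11
```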
Forward topology
send/ack
Fluentd
Fluentd
Fluentd
Fluentd
Fluentd
Fluentd
Fluentd
send/ack
Nagios
MongoDB
Hadoop
Alerting
Amazon S3
Analysis
Archiving
MySQL
Apache
Frontend
Access logs
syslogd
App logs
System logs
Backend
Databases
filter / buffer / routing
td-agent
> Open sourced distribution package of fluentd
> ETL part of Treasure Data
> deb, rpm, dmg (since td-agent 2.0)
> Including useful components
> ruby, jemalloc, fluentd
> 3rd party gems: td, mongo, webhdfs, etc…
> http://packages.treasure-data.com/
v1
> New features without breaking compatibility
> Filter, Label and better error handling
> ServerEngine based: multi-process, signal, etc.
> New configuration and DSL format
> JRuby and Windows support
> github issue: Plan for v1 release #251
Use cases
Treasure Data
Frontend
Job Queue
Worker
Hadoop
Hadoop
Fluentd
Applications push
metrics to Fluentd

(via local Fluentd)
Librato
Metrics
for realtime analysis
Treasure
Data
for historical analysis
Fluentd sums up data minutes

(partial aggregation)
hundreds of app servers
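Partial aggregation here means summing metrics per minute on the Fluentd side before sending them on. A minimal Ruby sketch follows, assuming each event is a `[unix_time, metric_name, value]` triple; that shape and the `aggregate_by_minute` helper are hypothetical.

```ruby
# Sum metric values per (minute, name) bucket before forwarding them upstream.
def aggregate_by_minute(events)
  sums = Hash.new(0)
  events.each do |unix_time, name, value|
    minute = unix_time - (unix_time % 60) # floor the timestamp to the minute
    sums[[minute, name]] += value
  end
  sums
end

events = [
  [120, "login", 1],
  [130, "login", 2],
  [185, "login", 4],
  [121, "signup", 1],
]
aggregate_by_minute(events).each do |(minute, name), total|
  puts "#{minute} #{name} #{total}"
end
# prints:
# 120 login 3
# 180 login 4
# 120 signup 1
```

Four events collapse into three data points, so the downstream service receives one value per metric per minute regardless of how many app servers are reporting.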
sends event logs
sends event logs
sends event logs
Rails app td-agent
td-agent
td-agent
Google
Spreadsheet
Treasure Data
MySQL
Logs are available
after several mins.
Daily/Hourly
Batch
KPI visualization
Feedback rankings
Rails app
Rails app
Unlimited scalability
Flexible schema
Realtime
Less performance impact
Cookpad
✓ Over 100 RoR servers (2012/2/4)
http://www.slideshare.net/tagomoris/log-analysis-with-hadoop-in-livedoor-2013
NHN Japan
by @tagomoris
✓ 16 nodes
✓ 120,000+ lines/sec
✓ 400Mbps at peak
✓ 1.5+ TB/day (raw)
Web
Servers Fluentd

Cluster
Archive

Storage

(scribed)
Fluentd

Watchers
Graph

Tools
Notifications

(IRC)
Hadoop Cluster

CDH4

(HDFS, YARN)
webhdfs
Huahin

Manager
hive

server
STREAM
Shib ShibUI
BATCH
SCHEDULED
BATCH
Other usecases
> Collect sensor logs
> Embedded devices, Raspberry Pi, etc.
> Integrated with Elasticsearch and Kibana
> Integrated with Norikra CEP engine



http://www.fluentd.org/guides
Other companies
http://www.fluentd.org/testimonials
> Fluentd is a widely-used log collector
> There are many use cases
> Many contributors and plugins
> Keep it simple
> Easy to integrate your environment
Conclusion
