Open-Falcon
A Distributed and High-Performance Monitoring System
Yao-Wei Ou & Lai Wei
2017/05/22
Let us begin with a little story…
Grafana PR#3787
[feature] Add Open-Falcon datasource
“I'm sorry but we will not merge any new
datasources unless they are very popular.”
- Grafana
Open-Falcon is now one of the most
popular monitoring systems in China.
About us
Laiwei
✦ Technical director of DiDi, China.
✦ Founder of Open-Falcon software.
✦ Core maintainer of Open-Falcon
community.
✦ Focus on service reliability, DevOps,
Cloud computing, etc.
Yaowei
✦ Director of the WiFire overseas R&D center.
✦ Core maintainer of the Open-Falcon
organization.
✦ Leads development of the CDN
monitoring system at Fastweb.
✦ Focus on CDN and blockchain.
Outline
‣ Motivation
‣ Features
‣ Architecture
‣ Comparison
‣ Community
Motivation(1/3)
There are already many outstanding open source monitoring
systems. Why reinvent the wheel? And how did Open-Falcon
become the most popular monitoring system in China?
Motivation(2/3)
‣ Zabbix
• Difficult to scale out (~2000)
• The database can grow very large, very quickly, if not tuned properly.
• Not very resource friendly: a lot of connections are made.
• We had to build several Zabbix clusters to keep up with rapid business growth.
• A bit of a learning curve
• Tuning and tweaking can be a lot of work
• More complex to set up
• Hard to extend
• Manual configuration
‣ Open-Falcon was open-sourced in 2015 by Xiaomi SRE, China, under the Apache
License, Version 2.0
• To replace Zabbix
Motivation(3/3)
‣ Key factors of an enterprise-class monitoring system:
Scalability, Performance, High Availability, Flexibility, User-Oriented
Features
Scalability
‣ A scalable monitoring system is necessary to support rapid business
growth. Each module of Open-Falcon is easy to scale
horizontally.
‣ Supports up to hundreds of millions of transactions per minute
(query/judge/store/search).
‣ Can easily support over 100,000 hosts.
Performance
‣ With the RRA (Round Robin Archive) mechanism, one year of history
data for 100+ metrics can be returned in just a few seconds.
‣ Stores 10+ years of historical metrics.
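A minimal sketch of the round-robin archive idea, to show why long-range queries can stay cheap: each archive holds a fixed number of rows, and coarser archives store one consolidated value per several raw samples. The class name, the `steps`/`rows` parameters, and the averaging choice are illustrative assumptions, not Open-Falcon's or RRDtool's actual code.

```python
from collections import deque
from statistics import mean


class RoundRobinArchive:
    """Fixed-size archive: one consolidated row per `steps` raw samples."""

    def __init__(self, steps: int, rows: int):
        self.steps = steps              # raw samples per consolidated row
        self.rows = deque(maxlen=rows)  # oldest rows are overwritten when full
        self._pending = []              # raw samples not yet consolidated

    def update(self, value: float) -> None:
        self._pending.append(value)
        if len(self._pending) == self.steps:
            self.rows.append(mean(self._pending))  # AVERAGE consolidation
            self._pending.clear()


# One archive per resolution: fine-grained for recent data, coarse for history.
archives = [RoundRobinArchive(steps=1, rows=5), RoundRobinArchive(steps=3, rows=4)]
for sample in range(1, 13):  # 12 raw samples: 1, 2, ..., 12
    for rra in archives:
        rra.update(float(sample))

print(list(archives[0].rows))  # the 5 most recent raw samples
print(list(archives[1].rows))  # 4 averages, each over 3 raw samples
```

Because every archive has a bounded row count, a one-year query reads a small, fixed number of coarse rows instead of every raw sample.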
High Availability
‣ No critical single point of failure
‣ Easy to operate and deploy
Flexibility
‣ Falcon-agent already has 400+ built-in server metrics. Users can
collect their own custom metrics by writing plugins, or simply run
a script/program that relays metrics to falcon-agent.
‣ Extensible architecture.
‣ Customizable metrics.
‣ Abundant APIs.
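As an illustration of relaying a custom metric to falcon-agent from a script, here is a hedged sketch. The slides do not show the push interface, so the URL, the JSON field names, and the example endpoint and metric names below are assumptions to check against the Open-Falcon documentation for your version.

```python
import json
import time
import urllib.request

# Assumption: the local agent accepts pushed metrics over HTTP at this URL.
AGENT_PUSH_URL = "http://127.0.0.1:1988/v1/push"


def build_metric(endpoint, metric, value, step=60, tags="",
                 counter_type="GAUGE", timestamp=None):
    """Build one metric item in the JSON shape assumed for the push endpoint."""
    return {
        "endpoint": endpoint,
        "metric": metric,
        "timestamp": int(time.time()) if timestamp is None else int(timestamp),
        "step": step,            # reporting interval in seconds
        "value": value,
        "counterType": counter_type,
        "tags": tags,            # comma-separated key=value pairs
    }


def push(items, url=AGENT_PUSH_URL, timeout=5):
    """POST a list of metric items to the agent; return the HTTP status code."""
    body = json.dumps(items).encode("utf-8")
    req = urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}, method="POST"
    )
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return resp.status


# Hypothetical endpoint and metric names, fixed timestamp for reproducibility.
item = build_metric("web-01", "app.queue.length", 42,
                    tags="module=checkout", timestamp=1495411200)
print(json.dumps([item]))
# push([item])  # uncomment on a host where the agent is running
```

Keeping payload construction separate from the HTTP call lets the script be tested without a running agent.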
Efficiency
‣ For easier management of alerting rules, Open-Falcon supports
strategies, expressions, template inheritance, multiple alerting
methods, and callbacks for recovery.
‣ Auto discovery of endpoints and counters.
‣ API support.
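To make the notion of an alerting strategy concrete, the sketch below evaluates a threshold rule over the last few samples and fires a callback on each state change. The `Strategy` class, its parameters, and the "all of the last N points" semantics are hypothetical illustrations, not Open-Falcon's actual rule engine or syntax.

```python
import operator
from collections import deque

OPS = {">": operator.gt, ">=": operator.ge, "<": operator.lt,
       "<=": operator.le, "==": operator.eq, "!=": operator.ne}


class Strategy:
    """Alert when all of the last `points` samples satisfy `op threshold`."""

    def __init__(self, metric, op, threshold, points=3, callback=None):
        self.metric = metric
        self.compare = OPS[op]
        self.threshold = threshold
        self.history = deque(maxlen=points)
        self.callback = callback  # hypothetical hook, called on state changes
        self.firing = False

    def judge(self, value):
        """Feed one sample; return 'PROBLEM', 'OK', or None if state is unchanged."""
        self.history.append(value)
        if len(self.history) < self.history.maxlen:
            return None  # not enough data yet
        breached = all(self.compare(v, self.threshold) for v in self.history)
        if breached == self.firing:
            return None
        self.firing = breached
        status = "PROBLEM" if breached else "OK"
        if self.callback:
            self.callback(self.metric, status)
        return status


calls = []
rule = Strategy("cpu.idle", "<", 10, points=3,
                callback=lambda metric, status: calls.append((metric, status)))
events = [rule.judge(v) for v in [50, 8, 7, 5, 4, 30]]
print(events)
print(calls)
```

Requiring several consecutive breaches before alerting is a common way to avoid flapping on a single noisy sample.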
User-Oriented
‣ Supports Grafana datasource.
‣ Open-Falcon can present multi-dimensional graphs, including user-defined dashboards/screens.
Grafana with Open-Falcon Screenshot (1/3)
Grafana with Open-Falcon Screenshot (2/3)
Grafana with Open-Falcon Screenshot (3/3)
Architecture
Components
COLLECT, STORE, JUDGE, PRESENT, NOTIFY
Modules: Falcon-Agent, Proxy-Gateway, aggregator, nodata, graph, redis, MySQL, hbs, Falcon-Dashboard, Grafana, Falcon-API, alarm, judge
Center Status
Before: Too many modules
[Diagram: Agent1 … AgentN; modules shown: UIC, Dashboard, Portal, Alarm, Query, Graph, Transfer, Judge, HBS, Sender, Link, Nodata, Aggregator, Tasks, Alarm FE. Numbers shown next to modules include Graph 20, Transfer 60, Judge 20.]
After
[Diagram: Agent1 … AgentN; components shown: Central Status, Falcon-Plus, Alarm, API Gateway, Graph, Transfer (Queue), Judge, HBS (Control), Dashboard, UIC, Portal, Alarm FE, Grafana; InfluxDB, Cassandra, OpenTSDB, ELK]
Design Philosophy
‣ MICROSERVICES: 10 major modules, individually deployed
‣ PUSH: agent as a push proxy
‣ RRDTOOL: with consistent hash
‣ BINARY: Go static binary
‣ DEPLOYMENT: mass deployment by ops-updater
‣ GRAFANA: Open-Falcon datasource
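The slide pairs RRDtool storage with consistent hashing. One plausible reading is that each counter is routed to a single storage node, so that adding a node moves only a fraction of the counters. The sketch below shows a generic consistent-hash ring with virtual nodes; the node names (`graph-01` etc.), counter keys, and replica count are hypothetical, and this is not Open-Falcon's actual implementation.

```python
import bisect
import hashlib


class HashRing:
    """Consistent-hash ring with virtual nodes (replicas) per physical node."""

    def __init__(self, nodes, replicas=100):
        self._ring = sorted(
            (self._hash(f"{node}#{i}"), node)
            for node in nodes
            for i in range(replicas)
        )
        self._keys = [h for h, _ in self._ring]

    @staticmethod
    def _hash(key):
        return int(hashlib.sha256(key.encode("utf-8")).hexdigest(), 16)

    def node_for(self, key):
        """Return the node owning the first ring point at or after the key's hash."""
        idx = bisect.bisect(self._keys, self._hash(key)) % len(self._keys)
        return self._ring[idx][1]


counters = [f"host-{i}/cpu.idle" for i in range(1000)]
ring3 = HashRing(["graph-01", "graph-02", "graph-03"])
ring4 = HashRing(["graph-01", "graph-02", "graph-03", "graph-04"])

# Counters whose owner changes after adding a fourth node.
moved = [c for c in counters if ring3.node_for(c) != ring4.node_for(c)]
print(f"{len(moved)} of {len(counters)} counters moved")
print({ring4.node_for(c) for c in moved})  # every moved counter lands on the new node
```

With a plain `hash(key) % n` scheme, changing `n` would remap most counters; on the ring, only the counters that fall to the new node's points change owner.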
Comparison
Compared to Prometheus
OPEN-FALCON | PROMETHEUS
Abundant APIs | Metrics API
Push model: auto discovery | Pull model: manual configuration
Easy to scale out | Harder to scale out
Simple alert management in its own dashboard | Alertmanager offers grouping, deduplication, and silencing functionality
Faster query performance with RRA | Slower; recording rules
Simple shell script as plugin | A bit of a learning curve to write exporters and collectors
Limited expressions | PromQL
Community
Ecosystem (1/2)
✦ Banking
✦ IaaS, SaaS
✦ CDN
✦ O2O
✦ Social
✦ Entertainment
✦ …
Ecosystem (2/2)
OS
Plugins
Switch Hadoop HBase Docker Redis
MongoDB GPU RabbitMQ HAProxy Nginx
JMX LVS Tomcat WebSphere IIS
UI
Join us
GitHub: https://github.com/open-falcon
Homepage: http://open-falcon.org
Contact us: openfalcon-users@googlegroups.com
WeChat
Summary
✦ 100+ companies
✦ 40,000+ servers
✦ 400+ built-in metrics
✦ 5,000+ users
✦ 3 seconds
Q&A
https://github.com/open-falcon/open-falcon
