Samantha Wang [InfluxData] | Data Collection Overview | InfluxDays 2022
InfluxDB Data Collectors
Samantha Wang - Product Manager, InfluxData
• Connect: Hear from and meet developers from the InfluxDB Community
• Learn: Be inspired by use cases from our partners and InfluxDB engineers
• Build: Learn best practices that will help you build great experiences for your projects
InfluxDB provides multiple methods for ingesting data. Discover the newest options for data collection and what’s in store for the future of Telegraf.
Samantha Wang is a Product Manager at InfluxData, creator of InfluxDB, the leading time series platform. She is currently responsible for Telegraf, InfluxData’s open source data collection agent. Prior to InfluxData, she worked in product and engineering roles at Gracenote (a Nielsen Company) and Verizon. Samantha holds a degree in Industrial & Systems Engineering from the University of Southern California.
Agenda
1. Native Data Collection
2. Client Libraries
3. Telegraf
Multiple InfluxDB Ingestion Options
• Native Collectors: Interface directly with Cloud data sources (in this case, MQTT brokers)
• Telegraf: Offers over 300 plugins to collect data from many data sources
• Client Libraries: Available for many popular development languages; make API calls to write data to InfluxDB Cloud
• Scrapers: Scrape Prometheus-formatted metrics from an HTTP-accessible endpoint and store them in InfluxDB
• Ecosystem: Configure third-party technologies to send line protocol directly to InfluxDB
• InfluxDB API: Build your own with the InfluxDB API, which provides a programmatic interface for interactions with InfluxDB
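Several of the options above come down to sending line protocol to the InfluxDB write API. The following is a minimal sketch of that idea using only the Python standard library. It assumes the InfluxDB v2 write endpoint (`/api/v2/write`) with token authentication. The helper names `to_line_protocol` and `build_write_request`, the host, the organization, and the bucket are hypothetical and chosen for illustration only. The request is built but not sent.

```python
from urllib.parse import urlencode
from urllib.request import Request


def _escape(value: str, chars: str) -> str:
    """Backslash-escape every character of `chars` that occurs in `value`."""
    for ch in chars:
        value = value.replace(ch, "\\" + ch)
    return value


def _format_field(value) -> str:
    """Render one field value using line protocol type conventions."""
    # bool is checked before int because bool is a subclass of int.
    if isinstance(value, bool):
        return "true" if value else "false"
    if isinstance(value, int):
        return f"{value}i"  # integers carry an 'i' suffix
    if isinstance(value, float):
        return repr(value)
    escaped = str(value).replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'  # strings are double-quoted


def to_line_protocol(measurement, tags, fields, timestamp_ns):
    """Build one line: measurement,tag=... field=... timestamp."""
    if not fields:
        raise ValueError("line protocol requires at least one field")
    head = _escape(measurement, ", ")
    for key in sorted(tags):  # sorted tag keys are recommended, not required
        head += "," + _escape(key, ",= ") + "=" + _escape(str(tags[key]), ",= ")
    body = ",".join(
        _escape(key, ",= ") + "=" + _format_field(fields[key]) for key in fields
    )
    return f"{head} {body} {timestamp_ns}"


def build_write_request(base_url, org, bucket, token, lines):
    """Prepare (but do not send) a POST to the v2 write endpoint."""
    query = urlencode({"org": org, "bucket": bucket, "precision": "ns"})
    return Request(
        f"{base_url}/api/v2/write?{query}",
        data="\n".join(lines).encode("utf-8"),
        headers={
            "Authorization": f"Token {token}",
            "Content-Type": "text/plain; charset=utf-8",
        },
        method="POST",
    )


line = to_line_protocol(
    "cpu load", {"host": "edge-1"},
    {"value": 0.5, "count": 3, "ok": True},
    1667260800000000000,
)
# Hypothetical host, org, bucket, and token; replace with real values.
req = build_write_request("https://example.invalid", "my-org", "sensors", "TOKEN", [line])
print(line)
```

Passing `req` to `urllib.request.urlopen` would send the write. In practice a client library or Telegraf handles batching, retries, and escaping.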
What is Native Data Collection?
Native Collection
The ability to directly ingest data from popular data sources without the need to do any transformation to Line Protocol or run a Telegraf agent
MQTT Topic Subscriptions
• Subscribe to topics published by your own devices and sent through a message broker directly to the InfluxDB Cloud service
• No code or additional software installation required
MQTT Ingestion: Telegraf vs Native MQTT
Simple 3 Step Configuration
1. Configure Broker
• Broker URL
• Broker TCP/IP Port Number
• Broker Authentication Type
• Broker Authentication Parameters
2. Configure Topic
• Topic Name
• InfluxDB Bucket
3. Define Parsing Rules
• Configure rules for finding timestamps, measurements, tags, and field information in the MQTT messages
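To make the three configuration pieces concrete, here is a small sketch of what broker settings, a topic subscription, and parsing rules might look like as data, together with a function that applies the rules to a JSON message. Everything in it is hypothetical: the `SUBSCRIPTION` and `RULES` structures, their key names, and `parse_message` are illustrative only and do not reflect how InfluxDB Cloud stores or evaluates a native subscription.

```python
import json

# Hypothetical broker and topic settings mirroring the items listed above.
SUBSCRIPTION = {
    "broker_url": "mqtt.example.invalid",
    "broker_port": 1883,
    "auth_type": "basic",
    "auth_params": {"username": "device-user", "password": "change-me"},
    "topic": "factory/+/climate",
    "bucket": "sensors",
}

# Hypothetical parsing rules: which JSON keys hold which part of a point.
RULES = {
    "measurement": "sensor_type",
    "timestamp": "ts",
    "tags": ["device_id", "site"],
    "fields": ["temperature", "humidity"],
}


def parse_message(payload: bytes, rules: dict) -> dict:
    """Apply the parsing rules to one JSON-encoded MQTT payload."""
    doc = json.loads(payload)
    return {
        "measurement": doc[rules["measurement"]],
        # Tags are always strings; keys missing from the message are skipped.
        "tags": {k: str(doc[k]) for k in rules["tags"] if k in doc},
        "fields": {k: doc[k] for k in rules["fields"] if k in doc},
        "timestamp": doc[rules["timestamp"]],
    }


message = (
    b'{"sensor_type": "climate", "device_id": "d-42", "site": "lab", '
    b'"temperature": 21.5, "humidity": 40, "ts": 1667260800}'
)
point = parse_message(message, RULES)
print(point)
```

The split into tags (indexed strings) and fields (typed values) follows the usual InfluxDB data model. The rule format itself is an assumption made for this example.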
What about the Client Libraries?
Client Libraries
Improving the user experience
• Client library updates are released on the last Thursday of each month
• Added an error message when a query does not include pivot()
• Improved handling of timestamps in pandas DataFrames
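The pivot() item is easier to picture with a query in hand. Without `pivot()`, a Flux result has one row per field value. With it, each field becomes its own column, which is the shape a DataFrame usually expects. The sketch below shows a Flux query with `pivot()` and a simple check that warns when the call is missing. The function `check_pivot` and its regular expression are hypothetical and only illustrate the idea. They are not the client library's actual implementation.

```python
import re
import warnings

# Matches a call such as `pivot(` or `pivot  (` anywhere in the query text.
_PIVOT_CALL = re.compile(r"\bpivot\s*\(")

# Example Flux query that reshapes fields into columns.
QUERY = '''
from(bucket: "sensors")
  |> range(start: -1h)
  |> pivot(rowKey: ["_time"], columnKey: ["_field"], valueColumn: "_value")
'''


def check_pivot(flux_query: str) -> bool:
    """Return True if the query calls pivot(); otherwise warn and return False."""
    if _PIVOT_CALL.search(flux_query):
        return True
    warnings.warn(
        "query has no pivot() call; fields will arrive as separate rows",
        UserWarning,
        stacklevel=2,
    )
    return False


print(check_pivot(QUERY))
```

A text search like this can be fooled by comments or string literals inside the query, so it is only suitable as a hint to the user.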
What’s new with Telegraf?
PROJECT NUMBERS
300+ Plugins and Telegraf Capabilities
Recent improvements:
• Apple M1 and RISC-V support
• Precisely schedule collection with `collection_offset`
• Faster line protocol parsing
• Improved MQTT topic and payload parsing
• SNMP plugin can parse MIB files directly
• Test artifacts on pull requests
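As a rough illustration of what `collection_offset` does, the sketch below computes the next collection time for a plugin. It assumes collections are aligned to multiples of the interval (as with Telegraf's `round_interval` setting) and that the offset is simply added to each aligned boundary. The function `next_collection` is hypothetical and is a simplification, not Telegraf's scheduler.

```python
def next_collection(now: int, interval: int, offset: int = 0) -> int:
    """Return the first collection time strictly after `now`.

    All values are whole seconds. Collections are assumed to fire at
    k * interval + offset for integer k.
    """
    if interval <= 0:
        raise ValueError("interval must be positive")
    if not 0 <= offset < interval:
        raise ValueError("offset must satisfy 0 <= offset < interval")
    boundary = now - (now % interval)  # latest aligned boundary at or before now
    candidate = boundary + offset
    if candidate <= now:
        candidate += interval  # this slot already passed; use the next one
    return candidate


# Two plugins sharing a 10 s interval, staggered by 3 s.
print(next_collection(now=125, interval=10, offset=0))
print(next_collection(now=125, interval=10, offset=3))
```

Staggering plugins this way spreads load on devices that cannot answer many queries at the same instant.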
Telegraf Binary growth
• +18 MB: AWS EC2 Metadata processor plugin
• +10 MB: Aliyuncms input plugin
• +8 MB: SQL input plugin
Telegraf Maintainers
• Paweł Żak | @zak-pawel | Software Engineer, Intel Corporation
• Giovanni Luisotto | @Trovalo | DBA & BI Dev, Quantumdatis
• Thomas Casteleyn | @Hipska | Software Engineer, Super-Visions
What’s next for Telegraf?
Focus for Telegraf
• Empower users to create a customized Telegraf
• Reduce the size of the Telegraf binary
• Improve the CLI user experience to be logical and consistent
• Align with the future of InfluxDB: logging, tracing, and high cardinality
Telegraf Custom Builder
make build_tools
./tools/custom_builder/custom_builder --config ~/telegraf.conf
Enables users to easily build custom Telegraf agents that contain only the desired plugins
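Conceptually, a builder driven by `--config` has to work out which plugins a configuration actually uses. The sketch below shows one way to do that by scanning for `[[category.name]]` table headers. The function `plugins_in_config` and its regular expression are hypothetical and illustrate the idea only. They are not how the custom builder is implemented.

```python
import re

# Matches uncommented headers such as [[inputs.cpu]] or [[outputs.influxdb_v2]].
_SECTION = re.compile(
    r"^\s*\[\[\s*(inputs|outputs|processors|aggregators)\.([A-Za-z0-9_]+)\s*\]\]",
    re.MULTILINE,
)


def plugins_in_config(config_text: str) -> dict:
    """Group the plugin names found in a Telegraf-style config by category."""
    found = {}
    for category, name in _SECTION.findall(config_text):
        found.setdefault(category, set()).add(name)
    return found


SAMPLE = """
[agent]
  interval = "10s"

[[inputs.cpu]]
# [[inputs.sql]]
[[inputs.mqtt_consumer]]
  servers = ["tcp://localhost:1883"]

[[outputs.influxdb_v2]]
"""

print(plugins_in_config(SAMPLE))
```

Commented-out sections are skipped because the pattern requires the brackets to be the first non-blank text on the line. Only the plugins found would then need to be compiled in, which is where the binary size saving comes from.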
Secret Store in Telegraf
Adds a [[secretstores.xyz]] section to the Telegraf config. Secrets can be managed by a JOSE (JavaScript Object Signing and Encryption) based secret store or by your native OS keyring.
[[secretstores.os]]
  ## Unique identifier for the secret-store.
  ## This id can later be used in plugins to reference the secrets
  ## in this secret-store via @{<id>:<secret_key>} (mandatory)
  id = "secretstore"

  ## Keyring of the secrets
  ## In Windows, keys follow a fixed pattern in the form
  ## `<keyring>:<collection>:<key>`. Please keep this in mind
  ## when creating secrets with the Windows credential tool.
  # keyring = "telegraf"
  # collection = ""

  ## Allow dynamic secrets that are updated during runtime of telegraf
  # dynamic = false
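The `@{<id>:<secret_key>}` reference syntax described in the config comments can be pictured as a lookup and substitution step. The sketch below resolves such references against an in-memory dictionary that stands in for a real secret store. The function `resolve_secrets`, the allowed character set in the pattern, and the sample secret are assumptions for illustration and do not describe Telegraf's implementation.

```python
import re

# @{store_id:secret_key}; the allowed characters are an assumption for this sketch.
_SECRET_REF = re.compile(r"@\{([A-Za-z0-9_]+):([A-Za-z0-9_]+)\}")


def resolve_secrets(value: str, stores: dict) -> str:
    """Replace every @{id:key} reference in `value` with the stored secret."""

    def lookup(match):
        store_id, key = match.group(1), match.group(2)
        try:
            return stores[store_id][key]
        except KeyError:
            raise KeyError(f"unknown secret reference {match.group(0)}") from None

    return _SECRET_REF.sub(lookup, value)


# A plain dict stands in for the OS keyring in this example.
STORES = {"secretstore": {"influx_token": "s3cr3t"}}

print(resolve_secrets("Token @{secretstore:influx_token}", STORES))
```

Keeping only the reference in the config file means the file can be shared or committed without exposing the credential itself.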
CLI Rewrite
• Meet developers where they are
• Implement CLI best practices and consistencies
telegraf <subcommand> <options>
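The `<subcommand> <options>` shape is a common CLI convention. The sketch below shows that pattern with Python's `argparse`. The program name `agent`, the subcommands `run` and `config`, and their options are all hypothetical. They are not Telegraf's actual command set.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser of the form: agent <subcommand> <options>."""
    parser = argparse.ArgumentParser(prog="agent")
    subcommands = parser.add_subparsers(dest="command", required=True)

    # Hypothetical subcommand: start the agent with a given config file.
    run = subcommands.add_parser("run", help="start collecting")
    run.add_argument("--config", required=True, help="path to the config file")

    # Hypothetical subcommand: print a sample configuration.
    config = subcommands.add_parser("config", help="print a sample config")
    config.add_argument("--input-filter", default="", help="plugins to include")

    return parser


args = build_parser().parse_args(["run", "--config", "/etc/agent.conf"])
print(args.command, args.config)
```

Giving each subcommand its own options keeps help output short and makes invalid flag combinations impossible to express.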
Continue Releasing Plugins & Features
• OPC-UA events based input plugin
• CSV Serializer
• Azure Monitor Input Plugin
• PostgreSQL Output Plugin
• Plugin state persistence
Future Breaking Changes in next major release
• Plugins and settings deprecation
• Processors and Aggregator behavior
Changes in release schedule
Release type                   Current                 Future
Feature release (2.x)          End of every quarter    Every month
Maintenance release (2.x.x)    Every 3 weeks           Only if urgent fixes
InfluxData Community Resources
GitHub: https://github.com/influxdata
Slack: http://influxdata.com/slack
Community Forum: https://community.influxdata.com
InfluxDB University: https://university.influxdata.com
T H A N K Y O U
Additional Resources
Free InfluxDB: OSS or Cloud - influxdata.com/cloud
Forums: community.influxdata.com
Slack: influxcommunity.slack.com
Reddit: r/InfluxData
Influx Community (GH): github.com/InfluxCommunity
Book: awesome.influxdata.com
Docs: docs.influxdata.com
Blogs: influxdata.com/blog
InfluxDB University: influxdata.com/university
How-to guides: docs.influxdata.com/resources/how-to-guides/

