SlideShare a Scribd company logo
Sam Dillard
Sales Engineering
InfluxEnterprise
Architectural Patterns
Tim Hall
VP, Products
What we will be
covering
✓ Enterprise Overview
✓ Other Features
✓ Ingestion & Query Rates
✓ Deployment Examples
✓ Replications Patterns
✓ General Advice
Why InfluxEnterprise?
InfluxEnterprise
• Open Source Core
• High Availability
• Scalability
• Fine Grained Authorization
• Support from InfluxData
• OnPremise/Cloud Deployment Options
Signs You’re Ready for InfluxEnterprise
6. Your CPU average is >=70%
1. The sales team starts calling you on weekends
5. Increasing throughput causing write drops errors
4. Sprawling number of single node deployments
3. Horizontal scaling not providing further benefit
2. Data durability matters
What Problem Are You
Trying to Solve?
What are you dealing with?
• Metrics
• Events
• Log Data
• Sensors
• Apps
• Servers
• Long-Term Storage
• Vendor Replacement
• Time-Series Alerts
• Visualization
• Network Data
• Custom Solution
• Real-Time Analytics
• Virtualization Monitoring
• Managed Service (InfluxCloud)
InfluxEnterprise Overview
InfluxEnterprise Cluster Architecture: Meta Nodes
InfluxEnterprise Clustering: Data Nodes
InfluxEnterprise Cluster Architecture
InfluxEnterprise Cluster Architecture
Features
Security
• LDAP Support
– Enterprise customers can configure the database to use LDAP as a backing
authentication source for users, roles and permissions.
– Connection between DB and LDAP server secured once connected
• Fine-grained authorization
– Used to control access at a measurement or series level
(compared to limiting access at the database level)
– Enable authentication in your configuration file
– Create users through the query API
– Grant users explicit read and/or write privileges
– Set restrictions which define a combination of database, measurement, and
tags which cannot be accessed without an explicit grant
© 2018 InfluxData. All rights reserved.© 2017 InfluxData. All rights reserved.
Eventual Consistency
• Anti-Entropy Service
– Expands on capabilities to
detect and copy full shards
– Now allows for detection and
repair of inconsistent shards
• Hinted-handoff Queue
– Queue inbound points
destined to land on other
nodes in the cluster which
may currently be down
– Stored by node and shard
(10GB - default)
Backup and Restore
• Useful for: Disaster recovery, Debugging, Restoring clusters to a
consistent state
• What it does: Creates a copy of the metastore and the shard data
• Backup is compressed and is not human readable
• Export is not compressed but is human readable
• OSS and InfluxEnterprise ARE NOW compatible – aka portable
• Full or partial backup options
• Move data into a new database (with new Retention Policies, etc)
Ingestion & Query Rates
Cluster
Data Node 1 Data Node 2 Data Node 3 Data Node 4
Database X (Replication=1)
Shard 1 Shard 2 Shard 3 Shard 4
a b c d
X ≈ 4x ingest rate
≤ 1x concurrent query rate
a b c d
Cluster
Data Node 1 Data Node 2 Data Node 3 Data Node 4
Database X (Replication=4)
Shard 1 Shard 2 Shard 3 Shard 4
a a’ a’’ a’’’
X ≈ 1x ingest rate
replication
≈ 4x concurrent query rate
a
b
b b’ b’’ b’’’
Cluster
Data Node 1 Data Node 2 Data Node 3 Data Node 4
Database X (Replication=2)
Shard 1 Shard 2 Shard 3 Shard 4
a a’ b b’
X ≈ 2x ingest rate
replication replication
≤ 2x concurrent query rate
a b
Deployment Examples
How does
InfluxEnterprise Fit?
Example 1: Mothership
Data Center 1
Kapacitor
Telegraf InfluxDB
Ent
Enterprise Cluster
Data Node 1
Data Node 2
Data Node 3
Data Node n
Firewall/
LoadBalancer
Telegraf
Telegraf
Chronograf
Chronograf Kapacitor
Data Center 2
Kapacitor
Telegraf InfluxDB
EntTelegraf
Telegraf
Chronograf
Example 2: Durable Data Ingest
Telegraf Cluster
Telegraf
or other
source
Kafka
Queue
LoadBalancer
InfluxDB
Cluster
Telegraf
or other
source
Telegraf
or other
source
Telegraf
or other
sources
Telegraf
Telegraf
Telegraf
Telegraf
Put each Telegraf instance in
the same Kafka Consumer
Group
How Fast is Fast?
(ex): Six datanodes at 2.5M values per
second
Example 3: Influx with ElasticSearch
InfluxDB
Cluster
• Discover trends before and during the Error from metrics
• Perform Root Cause Analysis from Logs
LoadBalancer
Telegraf
ElasticSearch
Include common
Session ID
or other UID
Kapacitor
You
Metrics
Logs
Query using the common
Session ID or UID received
form Alert
Replication Patterns
How Are You Using InfluxDB?
Data Replication
Generally there are two types of data that we care about replicating:
• New Data – Data which is coming form our raw sources
• Derived Data – The output of a SELECT INTO, or TICK script
Replication of New Data – Pattern 1
Cluster 1
Firewall/
Load Balancer
Data Node 1
Data Node 2
Data Node 3
Data Node n
Telegraf
Cluster 2
Firewall/
Load Balancer
Data Node 1
Data Node 2
Data Node 3
Data Node n
Telegraf
Replication of New Data – Pattern 2
Cluster 1
Firewall/
Load Balancer
Data Node 1
Data Node 2
Data Node 3
Data Node n
Telegraf
Cluster 2
Firewall/
Load Balancer
Data Node 1
Data Node 2
Data Node 3
Data Node n
Telegraf
Kafka Queue
Replication of Derived Data – Pattern 3
Cluster 1
Load Balancer
Cluster 2
Data Nodes
Data Nodes
Data Nodes
Data Nodes
Data Nodes
Data Nodes
Data Nodes
Kapacitor
Load Balancer
Data Nodes
Data Nodes
Data Nodes
Data Nodes
Data Nodes
Data Nodes
Data Nodes
Kapacitor
Uses output of
Kapacitor to other
cluster
Telegraf Telegraf
General Advice
General Cluster Advice
• Batch your writes!
• The number of data nodes should be a multiple of your replication
factor
• Use a single node of InfluxDB to monitor your cluster
• Put a load balancer in front of each of your data nodes
• Higher replication factors result in higher query concurrency, but
higher write latency.
• Use Fine Grained Authorization instead of multiple databases
Thank You!
InfluxEnterprise Architecture Patterns by Tim Hall & Sam Dillard

More Related Content

PPTX
Setting Up InfluxDB for IoT by David G Simmons
PPTX
Container Monitoring Best Practices Using AWS and InfluxData by Gunnar Aasen
PDF
InfluxDB 2.0: Dashboarding 101 by David G. Simmons
PPTX
A Walkthrough of InfluxCloud 2.0 by Tim Hall
PDF
InfluxEnterprise Architectural Patterns by Dean Sheehan, Senior Director, Pre...
PPTX
InfluxData Internals by Ryan Betts
PDF
Monitoring, Alerting, and Tasks as Code by Russ Savage, Director of Product M...
PDF
InfluxDB IOx Tech Talks: Replication, Durability and Subscriptions in InfluxD...
Setting Up InfluxDB for IoT by David G Simmons
Container Monitoring Best Practices Using AWS and InfluxData by Gunnar Aasen
InfluxDB 2.0: Dashboarding 101 by David G. Simmons
A Walkthrough of InfluxCloud 2.0 by Tim Hall
InfluxEnterprise Architectural Patterns by Dean Sheehan, Senior Director, Pre...
InfluxData Internals by Ryan Betts
Monitoring, Alerting, and Tasks as Code by Russ Savage, Director of Product M...
InfluxDB IOx Tech Talks: Replication, Durability and Subscriptions in InfluxD...

What's hot (20)

PDF
Time Series Tech Stack for the IoT Edge
PDF
WRITING QUERIES (INFLUXQL AND TICK)
PDF
Spacecrafts Made Simple: How Loft Orbital Delivers Unparalleled Speed-to-Spac...
PDF
Optimizing InfluxDB Performance in the Real World by Dean Sheehan, Senior Dir...
PPTX
Tim Hall [InfluxData] | InfluxDB Roadmap | InfluxDays Virtual Experience Lond...
PDF
A True Story About Database Orchestration
PDF
Data Integration
PDF
InfluxDB Live Product Training
PDF
Setting up InfluxData for IoT
PPTX
In Flux Limiting for a multi-tenant logging service
PDF
Safer Commutes & Streaming Data | George Padavick, Ohio Department of Transpo...
PDF
Best Practices for Scaling an InfluxEnterprise Cluster
PDF
Gain Deep Visibility into APIs and Integrations with Anypoint Monitoring
PPTX
InfluxDB Community Office Hours September 2020
PDF
How Sysbee Manages Infrastructures and Provides Advanced Monitoring by Using ...
PPTX
RedisConf18 - Redis Fault Injection
PDF
Lessons and Observations Scaling a Time Series Database
PDF
Analyzing Petabyte Scale Financial Data with Apache Pinot and Apache Kafka | ...
PDF
Kafka Tiered Storage | Satish Duggana and Sriharsha Chintalapani, Uber
PDF
Kapacitor Manager
Time Series Tech Stack for the IoT Edge
WRITING QUERIES (INFLUXQL AND TICK)
Spacecrafts Made Simple: How Loft Orbital Delivers Unparalleled Speed-to-Spac...
Optimizing InfluxDB Performance in the Real World by Dean Sheehan, Senior Dir...
Tim Hall [InfluxData] | InfluxDB Roadmap | InfluxDays Virtual Experience Lond...
A True Story About Database Orchestration
Data Integration
InfluxDB Live Product Training
Setting up InfluxData for IoT
In Flux Limiting for a multi-tenant logging service
Safer Commutes & Streaming Data | George Padavick, Ohio Department of Transpo...
Best Practices for Scaling an InfluxEnterprise Cluster
Gain Deep Visibility into APIs and Integrations with Anypoint Monitoring
InfluxDB Community Office Hours September 2020
How Sysbee Manages Infrastructures and Provides Advanced Monitoring by Using ...
RedisConf18 - Redis Fault Injection
Lessons and Observations Scaling a Time Series Database
Analyzing Petabyte Scale Financial Data with Apache Pinot and Apache Kafka | ...
Kafka Tiered Storage | Satish Duggana and Sriharsha Chintalapani, Uber
Kapacitor Manager
Ad

Similar to InfluxEnterprise Architecture Patterns by Tim Hall & Sam Dillard (20)

PDF
Installing your influx enterprise cluster
PDF
InfluxDB Enterprise Architectural Patterns | Craig Hobbs | InfluxData
PPTX
ARCHITECTING INFLUXENTERPRISE FOR SUCCESS
PDF
3 Reasons to Select Time Series Platforms for Cloud Native Applications Monit...
PPTX
Tim Hall [InfluxData] | InfluxDB Roadmap | InfluxDays Virtual Experience NA 2020
PDF
Case Study : InfluxDB
PDF
3 reasons to pick a time series platform for monitoring dev ops driven contai...
PDF
Rethinking the Database in the IoT Era
PPTX
InfluxDB Roadmap: What’s New and What’s Coming
PDF
Virtual training intro to InfluxDB - June 2021
PDF
Roadshow September 2018
PDF
Intro to InfluxDB
PPTX
Lessons Learned Running InfluxDB Cloud and Other Cloud Services at Scale by T...
PDF
Gilmore, Palani [InfluxData] | Use Case: Monitoring / Observability | InfluxD...
PPTX
Announcing InfluxDB Clustered
PDF
Lessons Learned: Running InfluxDB Cloud and Other Cloud Services at Scale | T...
PPTX
Maksim Vazhenin [Dell Technologies] | InfluxDB for Storage System Monitoring ...
PPTX
Evan Kaplan [InfluxData] | InfluxDays Opening Remarks | InfluxDays Virtual Ex...
PDF
Monitoring InfluxEnterprise
PDF
Charles Mahler [InfluxData] | Use Case: Networking Monitoring | InfluxDays 2022
Installing your influx enterprise cluster
InfluxDB Enterprise Architectural Patterns | Craig Hobbs | InfluxData
ARCHITECTING INFLUXENTERPRISE FOR SUCCESS
3 Reasons to Select Time Series Platforms for Cloud Native Applications Monit...
Tim Hall [InfluxData] | InfluxDB Roadmap | InfluxDays Virtual Experience NA 2020
Case Study : InfluxDB
3 reasons to pick a time series platform for monitoring dev ops driven contai...
Rethinking the Database in the IoT Era
InfluxDB Roadmap: What’s New and What’s Coming
Virtual training intro to InfluxDB - June 2021
Roadshow September 2018
Intro to InfluxDB
Lessons Learned Running InfluxDB Cloud and Other Cloud Services at Scale by T...
Gilmore, Palani [InfluxData] | Use Case: Monitoring / Observability | InfluxD...
Announcing InfluxDB Clustered
Lessons Learned: Running InfluxDB Cloud and Other Cloud Services at Scale | T...
Maksim Vazhenin [Dell Technologies] | InfluxDB for Storage System Monitoring ...
Evan Kaplan [InfluxData] | InfluxDays Opening Remarks | InfluxDays Virtual Ex...
Monitoring InfluxEnterprise
Charles Mahler [InfluxData] | Use Case: Networking Monitoring | InfluxDays 2022
Ad

More from InfluxData (20)

PDF
Best Practices for Leveraging the Apache Arrow Ecosystem
PDF
How Bevi Uses InfluxDB and Grafana to Improve Predictive Maintenance and Redu...
PDF
Power Your Predictive Analytics with InfluxDB
PDF
How Teréga Replaces Legacy Data Historians with InfluxDB, AWS and IO-Base
PDF
Build an Edge-to-Cloud Solution with the MING Stack
PDF
Meet the Founders: An Open Discussion About Rewriting Using Rust
PDF
Introducing InfluxDB Cloud Dedicated
PDF
Gain Better Observability with OpenTelemetry and InfluxDB
PPTX
How a Heat Treating Plant Ensures Tight Process Control and Exceptional Quali...
PDF
How Delft University's Engineering Students Make Their EV Formula-Style Race ...
PPTX
Introducing InfluxDB’s New Time Series Database Storage Engine
PDF
Start Automating InfluxDB Deployments at the Edge with balena
PDF
Understanding InfluxDB’s New Storage Engine
PDF
Streamline and Scale Out Data Pipelines with Kubernetes, Telegraf, and InfluxDB
PPTX
Ward Bowman [PTC] | ThingWorx Long-Term Data Storage with InfluxDB | InfluxDa...
PDF
Scott Anderson [InfluxData] | New & Upcoming Flux Features | InfluxDays 2022
PDF
Steinkamp, Clifford [InfluxData] | Closing Thoughts | InfluxDays 2022
PDF
Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ...
PDF
Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022
PDF
Paul Dix [InfluxData] The Journey of InfluxDB | InfluxDays 2022
Best Practices for Leveraging the Apache Arrow Ecosystem
How Bevi Uses InfluxDB and Grafana to Improve Predictive Maintenance and Redu...
Power Your Predictive Analytics with InfluxDB
How Teréga Replaces Legacy Data Historians with InfluxDB, AWS and IO-Base
Build an Edge-to-Cloud Solution with the MING Stack
Meet the Founders: An Open Discussion About Rewriting Using Rust
Introducing InfluxDB Cloud Dedicated
Gain Better Observability with OpenTelemetry and InfluxDB
How a Heat Treating Plant Ensures Tight Process Control and Exceptional Quali...
How Delft University's Engineering Students Make Their EV Formula-Style Race ...
Introducing InfluxDB’s New Time Series Database Storage Engine
Start Automating InfluxDB Deployments at the Edge with balena
Understanding InfluxDB’s New Storage Engine
Streamline and Scale Out Data Pipelines with Kubernetes, Telegraf, and InfluxDB
Ward Bowman [PTC] | ThingWorx Long-Term Data Storage with InfluxDB | InfluxDa...
Scott Anderson [InfluxData] | New & Upcoming Flux Features | InfluxDays 2022
Steinkamp, Clifford [InfluxData] | Closing Thoughts | InfluxDays 2022
Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ...
Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022
Paul Dix [InfluxData] The Journey of InfluxDB | InfluxDays 2022

Recently uploaded (20)

PDF
Enhancing emotion recognition model for a student engagement use case through...
PPTX
TLE Review Electricity (Electricity).pptx
PDF
WOOl fibre morphology and structure.pdf for textiles
PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PDF
August Patch Tuesday
PPTX
cloud_computing_Infrastucture_as_cloud_p
PDF
Heart disease approach using modified random forest and particle swarm optimi...
PPTX
A Presentation on Touch Screen Technology
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Web App vs Mobile App What Should You Build First.pdf
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPTX
Tartificialntelligence_presentation.pptx
PDF
Hindi spoken digit analysis for native and non-native speakers
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Encapsulation theory and applications.pdf
PDF
Getting Started with Data Integration: FME Form 101
PDF
project resource management chapter-09.pdf
Enhancing emotion recognition model for a student engagement use case through...
TLE Review Electricity (Electricity).pptx
WOOl fibre morphology and structure.pdf for textiles
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
August Patch Tuesday
cloud_computing_Infrastucture_as_cloud_p
Heart disease approach using modified random forest and particle swarm optimi...
A Presentation on Touch Screen Technology
1 - Historical Antecedents, Social Consideration.pdf
Assigned Numbers - 2025 - Bluetooth® Document
Programs and apps: productivity, graphics, security and other tools
Web App vs Mobile App What Should You Build First.pdf
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
MIND Revenue Release Quarter 2 2025 Press Release
Tartificialntelligence_presentation.pptx
Hindi spoken digit analysis for native and non-native speakers
Building Integrated photovoltaic BIPV_UPV.pdf
Encapsulation theory and applications.pdf
Getting Started with Data Integration: FME Form 101
project resource management chapter-09.pdf

InfluxEnterprise Architecture Patterns by Tim Hall & Sam Dillard

  • 2. What we will be covering ✓ Enterprise Overview ✓ Other Features ✓ Ingestion & Query Rates ✓ Deployment Examples ✓ Replications Patterns ✓ General Advice
  • 4. InfluxEnterprise • Open Source Core • High Availability • Scalability • Fine Grained Authorization • Support from InfluxData • OnPremise/Cloud Deployment Options
  • 5. Signs You’re Ready for InfluxEnterprise 6. Your CPU average is >=70% 1. The sales team starts calling you on weekends 5. Increasing throughput causing write drops errors 4. Sprawling number of single node deployments 3. Horizontal scaling not providing further benefit 2. Data durability matters
  • 6. What Problem Are You Trying to Solve?
  • 7. What are you dealing with? • Metrics • Events • Log Data • Sensors • Apps • Servers • Long-Term Storage • Vendor Replacement • Time-Series Alerts • Visualization • Network Data • Custom Solution • Real-Time Analytics • Virtualization Monitoring • Managed Service (InfluxCloud)
  • 14. Security • LDAP Support – Enterprise customers can configure the database to use LDAP as a backing authentication source for users, roles and permissions. – Connection between DB and LDAP server secured once connected • Fine-grained authorization – Used to control access at a measurement or series level (compared to limiting access at the database level) – Enable authentication in your configuration file – Create users through the query API – Grant users explicit read and/or write privileges – Set restrictions which define a combination of database, measurement, and tags which cannot be accessed without an explicit grant
  • 15. © 2018 InfluxData. All rights reserved.© 2017 InfluxData. All rights reserved. Eventual Consistency • Anti-Entropy Service – Expands on capabilities to detect and copy full shards – Now allows for detection and repair of inconsistent shards • Hinted-handoff Queue – Queue inbound points destined to land on other nodes in the cluster which may currently be down – Stored by node and shard (10GB - default)
  • 16. Backup and Restore • Useful for: Disaster recovery, Debugging, Restoring clusters to a consistent state • What it does: Creates a copy of the metastore and the shard data • Backup is compressed and is not human readable • Export is not compressed but is human readable • OSS and InfluxEnterprise ARE NOW compatible – aka portable • Full or partial backup options • Move data into a new database (with new Retention Policies, etc)
  • 18. Cluster Data Node 1 Data Node 2 Data Node 3 Data Node 4 Database X (Replication=1) Shard 1 Shard 2 Shard 3 Shard 4 a b c d X ≈ 4x ingest rate ≤ 1x concurrent query rate a b c d
  • 19. Cluster Data Node 1 Data Node 2 Data Node 3 Data Node 4 Database X (Replication=4) Shard 1 Shard 2 Shard 3 Shard 4 a a’ a’’ a’’’ X ≈ 1x ingest rate replication ≈ 4x concurrent query rate a b b b’ b’’ b’’’
  • 20. Cluster Data Node 1 Data Node 2 Data Node 3 Data Node 4 Database X (Replication=2) Shard 1 Shard 2 Shard 3 Shard 4 a a’ b b’ X ≈ 2x ingest rate replication replication ≤ 2x concurrent query rate a b
  • 23. Example 1: Mothership Data Center 1 Kapacitor Telegraf InfluxDB Ent Enterprise Cluster Data Node 1 Data Node 2 Data Node 3 Data Node n Firewall/ LoadBalancer Telegraf Telegraf Chronograf Chronograf Kapacitor Data Center 2 Kapacitor Telegraf InfluxDB EntTelegraf Telegraf Chronograf
  • 24. Example 2: Durable Data Ingest Telegraf Cluster Telegraf or other source Kafka Queue LoadBalancer InfluxDB Cluster Telegraf or other source Telegraf or other source Telegraf or other sources Telegraf Telegraf Telegraf Telegraf Put each Telegraf instance in the same Kafka Consumer Group How Fast is Fast? (ex): Six datanodes at 2.5M values per second
  • 25. Example 3: Influx with ElasticSearch InfluxDB Cluster • Discover trends before and during the Error from metrics • Perform Root Cause Analysis from Logs LoadBalancer Telegraf ElasticSearch Include common Session ID or other UID Kapacitor You Metrics Logs Query using the common Session ID or UID received form Alert
  • 27. How Are You Using InfluxDB?
  • 28. Data Replication Generally there are two types of data that we care about replicating: • New Data – Data which is coming form our raw sources • Derived Data – The output of a SELECT INTO, or TICK script
  • 29. Replication of New Data – Pattern 1 Cluster 1 Firewall/ Load Balancer Data Node 1 Data Node 2 Data Node 3 Data Node n Telegraf Cluster 2 Firewall/ Load Balancer Data Node 1 Data Node 2 Data Node 3 Data Node n Telegraf
  • 30. Replication of New Data – Pattern 2 Cluster 1 Firewall/ Load Balancer Data Node 1 Data Node 2 Data Node 3 Data Node n Telegraf Cluster 2 Firewall/ Load Balancer Data Node 1 Data Node 2 Data Node 3 Data Node n Telegraf Kafka Queue
  • 31. Replication of Derived Data – Pattern 3 Cluster 1 Load Balancer Cluster 2 Data Nodes Data Nodes Data Nodes Data Nodes Data Nodes Data Nodes Data Nodes Kapacitor Load Balancer Data Nodes Data Nodes Data Nodes Data Nodes Data Nodes Data Nodes Data Nodes Kapacitor Uses output of Kapacitor to other cluster Telegraf Telegraf
  • 33. General Cluster Advice • Batch your writes! • The number of data nodes should be a multiple of your replication factor • Use a single node of InfluxDB to monitor your cluster • Put a load balancer in front of each of your data nodes • Higher replication factors result in higher query concurrency, but higher write latency. • Use Fine Grained Authorization instead of multiple databases