SlideShare a Scribd company logo
Copyright © 2015 Splunk Inc.
Splunk @ Level++
2
Splunk at the Next Level
Time to move beyond initial Splunk environment
• More use cases – how to tackle?
• More data – how do we scale?
• Splunk is mission critical == HA
• Global deployments
• Splunk user experience Screenshot here
3
Agenda
Use cases  Business Cases
Simple Scaling
Indexer Clustering (+Cross-site Clustering, Search Affinity)
Search Head Clustering
Distributed Management Console
Centralized Configuration Management
Splunk Cloud & Hybrid Deployments
Q&A
4
Growing your Splunk Deployment
Many customers start with a single use case…
• Ex: Monitor the web servers
• Help ensure up-time & response times
• Track usage, errors
• Provides business value
5
Growing your Splunk Deployment
Value statement for each overall service
Your services exist in a larger context than just one app, or one tier.
What is the value of the service as a whole?
What are CIO commitments for the service?
• The company’s web store is one of the most critical parts of the business.
• Performance of the overall environment must be maintained at all times.
• Failures in any portion of the web store must be quickly identified, send
notification to the appropriate parties.
• Dependencies on external processes must be monitored as well.
6
Growing your Splunk Deployment
The larger context
• Failure in one system cascades
• Map dependencies, estimate costs
• Use Splunk to track all dependencies.
• What happens when it is down?
Dependencies often include:
• Networking dependencies
• Shared storage
• Databases, middleware, custom apps
• Virtualization layer
Screenshot here
7
Scaling
Multiple factors
Indexer: IOPs, daily rate
Storage: Usage & retention
Search Head usage
8
Scaling - Indexers
Sizing for index performance
Indexers are usually storage-bound
Indexers: 150 to 250 GB per day each. (With suitable storage)
Ref HW: 12 cores (2 GHz+), 12 GB RAM, 800+ IOPs
Optimal HW (normal disk): 16 CPU cores, 48 GB RAM
Optimal HW (SSD): 24 CPU cores, 132 GB RAM
Questions?
9
SSD Advantage
http://blogs.splunk.com/2012/05/10/quantifying-the-benefits-of-
splunk-with-ssds/
• Low cost random seeks
• Writes are not that much faster – no great improvement with Indexing
• Significant improvements with Sparse/needle-haystack searches
• Dense searches become CPU bound
• Searches run faster allowing for more completed searches/min
10
Scaling - Storage
Simple storage to complex
Raw data rate  net compression of ~ 50% on disk.
Simple: rate * compression * retention
200 GB / day * 50% * 100 days = 10TB
Consider cold storage on NAS
– Changes storage story.
– Retention on fast, retention on slow
Clustering
– Changes storage story
11
Scaling - Storage
Sizing Calculator: http://splunk-sizing.appspot.com/
12
Scaling - Storage
RAID + SSD deep dive
• For spinning disks, Splunk recommends RAID 1+0 with 1k IOPs
• SSDs provide extremely high IOPs (45,000 +)
• RAID 5 SSD arrays give great Splunk performance in most
scenarios.
Additional details: Splunk Docs, Capacity Planning Manual
13
Forwarder Load Balancing
Have UF balance across multiple indexers
DNS round robin
Multiple hosts in outputs
LB not needed!
Geography-based routing
14
Indexer Clustering
High-Availability, Out of the Box
Splunk indexer clustering
Active-Active= better performance
Specific terms:
– Master Node
– Peer Node
– Search Factor
– Replication Factor
Additional details: Splunk Docs, Distributed Deployment Manual
15
Cross-site Clustering
Search Affinity by location
“Search locally”, “Store Globally”
DR scenarios
16
Scaling the Search Heads
Splunk Search is critical, too!
Splunk Search high availability needs
Scale to handle # of concurrent queries
17
SHP vs SHC
SHC
• SHP
• Available since v4.2
• Sharing configurations through NFS
• Single point of failure
• Performance issues
• No NFS
• Replication using local storage
• Commodity hardware
NFS
18
Search Head Clustering
19
Search Head Clustering
Use “Captain” for Master to avoid confusion with Index-Clustering
Minimum 3 nodes required. Odd is always preferred.
Cluster takes certain key decisions based on *majority* (consensus)
In multi-site setup have more nodes in main datacenter
20
Distributed Management Console
Manage Splunk 6.2 environments
Replaces Deployment Monitor App
Incorporates SOS app prior to 6.2
21
Deployment Server
Central management of Splunk Forwarders
Deployment Server manages Apps, Configs
Select one or more classes for each host
Class defines apps & configs
Works by phone-home
Notes:
DS does not push forwarder binaries
Use Cluster Master to manage indexers in cluster, not DS
22
Cloud & Hybrid
Scale without waiting for hardware
Thank You

More Related Content

PPTX
Taking Splunk to the Next Level - Architecture
PPTX
Taking Splunk to the Next Level - Architecture Breakout Session
PPTX
Taking Splunk to the Next Level - Technical
PPTX
Scale Splunk
PPTX
Taking Splunk to the Next Level - Architecture Breakout Session
PPTX
SplunkLive! Atlanta Mar 2013 - University of Alabama at Birmingham
PPTX
Taking Splunk to the Next Level - Architecture Breakout Session
PPTX
Taking Splunk to the Next Level – Architecture
Taking Splunk to the Next Level - Architecture
Taking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Technical
Scale Splunk
Taking Splunk to the Next Level - Architecture Breakout Session
SplunkLive! Atlanta Mar 2013 - University of Alabama at Birmingham
Taking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level – Architecture

What's hot (20)

ODP
Get involved with the Apache Software Foundation
PDF
Metrics-Driven Tuning of Apache Spark at Scale with Edwina Lu and Ye Zhou
PDF
Inside Solr 5 - Bangalore Solr/Lucene Meetup
PDF
tdtechtalk20160330johan
PDF
Searching The Enterprise Data Lake With Solr - Watch Us Do It!: Presented by...
PPTX
Building Efficient Pipelines in Apache Spark
PPTX
Building Effective Near-Real-Time Analytics with Spark Streaming and Kudu
PDF
Hive spark-s3acommitter-hbase-nfs
PDF
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Introduction to Kafka - Je...
PDF
Spark on Mesos
PDF
Portable UDFs: Write Once, Run Anywhere
PDF
Connect Code to Resource Consumption to Scale Your Production Spark Applicati...
PDF
Leveraging Databricks for Spark pipelines
PDF
Whirlpools in the Stream with Jayesh Lalwani
PDF
Solving Your Backup Needs Using MongoDB Ops Manager, Cloud Manager and Atlas
PDF
Spark Summit EU talk by Oscar Castaneda
PDF
Indicium: Interactive Querying at Scale Using Apache Spark, Zeppelin, and Spa...
PDF
Running Spark Inside Containers with Haohai Ma and Khalid Ahmed
PPTX
Securing Spark Applications by Kostas Sakellis and Marcelo Vanzin
PDF
Continuous Processing in Structured Streaming with Jose Torres
Get involved with the Apache Software Foundation
Metrics-Driven Tuning of Apache Spark at Scale with Edwina Lu and Ye Zhou
Inside Solr 5 - Bangalore Solr/Lucene Meetup
tdtechtalk20160330johan
Searching The Enterprise Data Lake With Solr - Watch Us Do It!: Presented by...
Building Efficient Pipelines in Apache Spark
Building Effective Near-Real-Time Analytics with Spark Streaming and Kudu
Hive spark-s3acommitter-hbase-nfs
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Introduction to Kafka - Je...
Spark on Mesos
Portable UDFs: Write Once, Run Anywhere
Connect Code to Resource Consumption to Scale Your Production Spark Applicati...
Leveraging Databricks for Spark pipelines
Whirlpools in the Stream with Jayesh Lalwani
Solving Your Backup Needs Using MongoDB Ops Manager, Cloud Manager and Atlas
Spark Summit EU talk by Oscar Castaneda
Indicium: Interactive Querying at Scale Using Apache Spark, Zeppelin, and Spa...
Running Spark Inside Containers with Haohai Ma and Khalid Ahmed
Securing Spark Applications by Kostas Sakellis and Marcelo Vanzin
Continuous Processing in Structured Streaming with Jose Torres
Ad

Similar to Taking Splunk to the Next Level – Architecture (20)

PPTX
Taking Splunk to the Next Level - Architecture Breakout Session
PPTX
Taking Splunk to the Next Level - Architecture
PPTX
Taking Splunk to the Next Level - Architecture
PPTX
Taking Splunk to the Next Level - Architecture Breakout Session
PPTX
Taking Splunk to the Next Level - Architecture Breakout Session
PPTX
Getting Started with Splunk
PPTX
Getting Started with Splunk Breakout Session
PDF
Getting Started with Splunk Enterprise
PDF
Deploying Splunk. Arquitetura e dimensionamento do Splunk
PPTX
Getting Started with Splunk Breakout Session
PPTX
Splunk Enterprise 6.3 - Splunk Tech Day
PPTX
Best Practices for a CoE
PDF
SplunkLive! Amsterdam 2015 Breakout - Getting Started with Splunk
PPTX
SplunkLive! Washington DC May 2013 - Splunk Enterprise 5
PPTX
Getting Started with Splunk Enterprises
PDF
SFBA Splunk Usergroup meeting March 13, 2024
PDF
Splunk Sales Presentation Imagemaker 2014
PPTX
Splunk Ninjas: New Features, Pivot, and Search Dojo
PDF
Splunk in Nordstrom: IT Operations
PDF
Splunk Data Onboarding Overview - Splunk Data Collection Architecture
Taking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture
Taking Splunk to the Next Level - Architecture
Taking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture Breakout Session
Getting Started with Splunk
Getting Started with Splunk Breakout Session
Getting Started with Splunk Enterprise
Deploying Splunk. Arquitetura e dimensionamento do Splunk
Getting Started with Splunk Breakout Session
Splunk Enterprise 6.3 - Splunk Tech Day
Best Practices for a CoE
SplunkLive! Amsterdam 2015 Breakout - Getting Started with Splunk
SplunkLive! Washington DC May 2013 - Splunk Enterprise 5
Getting Started with Splunk Enterprises
SFBA Splunk Usergroup meeting March 13, 2024
Splunk Sales Presentation Imagemaker 2014
Splunk Ninjas: New Features, Pivot, and Search Dojo
Splunk in Nordstrom: IT Operations
Splunk Data Onboarding Overview - Splunk Data Collection Architecture
Ad

More from Splunk (20)

PDF
Splunk Leadership Forum Wien - 20.05.2025
PDF
Splunk Security Update | Public Sector Summit Germany 2025
PDF
Building Resilience with Energy Management for the Public Sector
PDF
IT-Lagebild: Observability for Resilience (SVA)
PDF
Nach dem SOC-Aufbau ist vor der Automatisierung (OFD Baden-Württemberg)
PDF
Monitoring einer Sicheren Inter-Netzwerk Architektur (SINA)
PDF
Praktische Erfahrungen mit dem Attack Analyser (gematik)
PDF
Cisco XDR & Splunk SIEM - stronger together (DATAGROUP Cyber Security)
PDF
Security - Mit Sicherheit zum Erfolg (Telekom)
PDF
One Cisco - Splunk Public Sector Summit Germany April 2025
PDF
.conf Go 2023 - Data analysis as a routine
PDF
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV
PDF
.conf Go 2023 - Navegando la normativa SOX (Telefónica)
PDF
.conf Go 2023 - Raiffeisen Bank International
PDF
.conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett
PDF
.conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär)
PDF
.conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu...
PDF
.conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever...
PDF
.conf go 2023 - De NOC a CSIRT (Cellnex)
PDF
conf go 2023 - El camino hacia la ciberseguridad (ABANCA)
Splunk Leadership Forum Wien - 20.05.2025
Splunk Security Update | Public Sector Summit Germany 2025
Building Resilience with Energy Management for the Public Sector
IT-Lagebild: Observability for Resilience (SVA)
Nach dem SOC-Aufbau ist vor der Automatisierung (OFD Baden-Württemberg)
Monitoring einer Sicheren Inter-Netzwerk Architektur (SINA)
Praktische Erfahrungen mit dem Attack Analyser (gematik)
Cisco XDR & Splunk SIEM - stronger together (DATAGROUP Cyber Security)
Security - Mit Sicherheit zum Erfolg (Telekom)
One Cisco - Splunk Public Sector Summit Germany April 2025
.conf Go 2023 - Data analysis as a routine
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV
.conf Go 2023 - Navegando la normativa SOX (Telefónica)
.conf Go 2023 - Raiffeisen Bank International
.conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett
.conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär)
.conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu...
.conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever...
.conf go 2023 - De NOC a CSIRT (Cellnex)
conf go 2023 - El camino hacia la ciberseguridad (ABANCA)

Recently uploaded (20)

PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
KodekX | Application Modernization Development
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
Big Data Technologies - Introduction.pptx
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Unlocking AI with Model Context Protocol (MCP)
The Rise and Fall of 3GPP – Time for a Sabbatical?
Chapter 3 Spatial Domain Image Processing.pdf
Review of recent advances in non-invasive hemoglobin estimation
Advanced methodologies resolving dimensionality complications for autism neur...
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
MIND Revenue Release Quarter 2 2025 Press Release
NewMind AI Weekly Chronicles - August'25 Week I
KodekX | Application Modernization Development
Spectral efficient network and resource selection model in 5G networks
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Mobile App Security Testing_ A Comprehensive Guide.pdf
Network Security Unit 5.pdf for BCA BBA.
Big Data Technologies - Introduction.pptx
Understanding_Digital_Forensics_Presentation.pptx
Building Integrated photovoltaic BIPV_UPV.pdf
Diabetes mellitus diagnosis method based random forest with bat algorithm
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Unlocking AI with Model Context Protocol (MCP)

Taking Splunk to the Next Level – Architecture

  • 1. Copyright © 2015 Splunk Inc. Splunk @ Level++
  • 2. 2 Splunk at the Next Level Time to move beyond initial Splunk environment • More use cases – how to tackle? • More data – how do we scale? • Splunk is mission critical == HA • Global deployments • Splunk user experience Screenshot here
  • 3. 3 Agenda Use cases  Business Cases Simple Scaling Indexer Clustering (+Cross-site Clustering, Search Affinity) Search Head Clustering Distributed Management Console Centralized Configuration Management Splunk Cloud & Hybrid Deployments Q&A
  • 4. 4 Growing your Splunk Deployment Many customers start with a single use case… • Ex: Monitor the web servers • Help ensure up-time & response times • Track usage, errors • Provides business value
  • 5. 5 Growing your Splunk Deployment Value statement for each overall service Your services exist in a larger context than just one app, or one tier. What is the value of the service as a whole? What are CIO commitments for the service? • The company’s web store is one of the most critical parts of the business. • Performance of the overall environment must be maintained at all times. • Failures in any portion of the web store must be quickly identified, send notification to the appropriate parties. • Dependencies on external processes must be monitored as well.
  • 6. 6 Growing your Splunk Deployment The larger context • Failure in one system cascades • Map dependencies, estimate costs • Use Splunk to track all dependencies. • What happens when it is down? Dependencies often include: • Networking dependencies • Shared storage • Databases, middleware, custom apps • Virtualization layer Screenshot here
  • 7. 7 Scaling Multiple factors Indexer: IOPs, daily rate Storage: Usage & retention Search Head usage
  • 8. 8 Scaling - Indexers Sizing for index performance Indexers are usually storage-bound Indexers: 150 to 250 GB per day each. (With suitable storage) Ref HW: 12 cores (2 GHz+), 12 GB RAM, 800+ IOPs Optimal HW (normal disk): 16 CPU cores, 48 GB RAM Optimal HW (SSD): 24 CPU cores, 132 GB RAM Questions?
  • 9. 9 SSD Advantage http://blogs.splunk.com/2012/05/10/quantifying-the-benefits-of- splunk-with-ssds/ • Low cost random seeks • Writes are not that much faster – no great improvement with Indexing • Significant improvements with Sparse/needle-haystack searches • Dense searches become CPU bound • Searches run faster allowing for more completed searches/min
  • 10. 10 Scaling - Storage Simple storage to complex Raw data rate  net compression of ~ 50% on disk. Simple: rate * compression * retention 200 GB / day * 50% * 100 days = 10TB Consider cold storage on NAS – Changes storage story. – Retention on fast, retention on slow Clustering – Changes storage story
  • 11. 11 Scaling - Storage Sizing Calculator: http://splunk-sizing.appspot.com/
  • 12. 12 Scaling - Storage RAID + SSD deep dive • For spinning disks, Splunk recommends RAID 1+0 with 1k IOPs • SSDs provide extremely high IOPs (45,000 +) • RAID 5 SSD arrays give great Splunk performance in most scenarios. Additional details: Splunk Docs, Capacity Planning Manual
  • 13. 13 Forwarder Load Balancing Have UF balance across multiple indexers DNS round robin Multiple hosts in outputs LB not needed! Geography-based routing
  • 14. 14 Indexer Clustering High-Availability, Out of the Box Splunk indexer clustering Active-Active= better performance Specific terms: – Master Node – Peer Node – Search Factor – Replication Factor Additional details: Splunk Docs, Distributed Deployment Manual
  • 15. 15 Cross-site Clustering Search Affinity by location “Search locally”, “Store Globally” DR scenarios
  • 16. 16 Scaling the Search Heads Splunk Search is critical, too! Splunk Search high availability needs Scale to handle # of concurrent queries
  • 17. 17 SHP vs SHC SHC • SHP • Available since v4.2 • Sharing configurations through NFS • Single point of failure • Performance issues • No NFS • Replication using local storage • Commodity hardware NFS
  • 19. 19 Search Head Clustering Use “Captain” for Master to avoid confusion with Index-Clustering Minimum 3 nodes required. Odd is always preferred. Cluster takes certain key decisions based on *majority* (consensus) In multi-site setup have more nodes in main datacenter
  • 20. 20 Distributed Management Console Manage Splunk 6.2 environments Replaces Deployment Monitor App Incorporates SOS app prior to 6.2
  • 21. 21 Deployment Server Central management of Splunk Forwarders Deployment Server manages Apps, Configs Select one or more classes for each host Class defines apps & configs Works by phone-home Notes: DS does not push forwarder binaries Use Cluster Master to manage indexers in cluster, not DS
  • 22. 22 Cloud & Hybrid Scale without waiting for hardware