SlideShare a Scribd company logo
Copyright © 2015 Splunk Inc.
Getting Started
2
Why Splunk?
“Splunk takes machine data and makes it relevant for non-technical business users. ..Splunk
provide[s] both the up-to-the-minute and long-term trending data business users need to make
the decisions that impact revenue.”
“It's become a collaborative tool where everybody can gather around the same data and see
the same big picture.” “I'm sometimes so amazed by what we can do with Splunk, I wonder if
there's magic in there.”
Splunk selected by Symantec to help security intelligence operations.
Symantec centralizes, monitors and analyzes security-related data in Splunk Enterprise to help
investigate incidents and detect advanced threats. Symantec also uses Splunk software to ensure
comprehensive compliance with Sarbanes-Oxley (SOX) and the Payment Card Industry Data
Security Standard (PCI DSS). www.datacenterknowledge.com
3
Splunk Company Overview
3
Company
• Global HQs:
 San Francisco
 London
 Hong Kong
• 1,700 employees globally
• Annual Revenue for
FY14: $450M (YoY +50%)
• NASDAQ: SPLK
Products
• Free trial to massive scale
• Splunk products:
 Splunk Enterprise
 Splunk Cloud
 Hunk
 Splunk MINT
 Premium Apps
 VMWARE
 MS Exchange
 PCI Comp and ES App
Customers
• 9,000+ customers
• Across 100+ countries
• Small to large
organizations
• 80+ of the Fortune 100
• Largest license:
 400+ Terabytes/day
4
Our Plan of Action
4
1.Big Data - setting the stage.
2.How does Splunk fit in the landscape?
3.What differentiates Splunk?
4.Components that make up Splunk?
5.Demo - How it works?
5
The Accelerating Pace of Data
Volume | Velocity | Variety | Variability
GPS,
RFID,
Hypervisor,
Web Servers,
Email, Messaging,
Clickstreams, Mobile,
Telephony, IVR, Databases,
Sensors, Telematics, Storage,
Servers, Security Devices, Desktops
Machine data is the fastest growing, most
complex, most valuable area of big data
5
6 6
Making machine data accessible,
usable and valuable to everyone.
6
7
Big Data Landscape
Key/Value, Columnar or
Other (semi-structured)
Cassandra
CouchDB
MongoDB
NoSQL
7
Relational Database
(highly structured)
SQL &
MapReduce
RDBMS
Oracle,
MySQL,
IBM DB2,
Teradata
Teradata Aster Data
SQL on Hadoop
Distributed File System
(semi-structured)
Hadoop
HDFS Storage +
MapReduce
Temporal, Unstructured
Heterogeneous
Real-Time Indexing
MapReduce
8
Big Data Landscape
Key/Value, Columnar or
Other (semi-structured)
Cassandra
CouchDB
MongoDB
NoSQL
8
Relational Database
(highly structured)
SQL &
MapReduce
RDBMS
Oracle,
MySQL,
IBM DB2,
Teradata
Teradata Aster Data
SQL on Hadoop
Distributed File System
(semi-structured)
Hadoop
HDFS Storage +
MapReduce
Temporal, Unstructured
Heterogeneous
Real-Time Indexing
MapReduce
9
perf
shell
API
Mounted File Systems
hostnamemount
syslog
TCP/UDP
Event Logs
Performance
Active
Directory
syslog hosts
and network devices
Unix, Linux and Windows hosts
Local File Monitoring
Splunk Forwarder
virtual
host
Windows
Scripted or Modular Inputs
shell scripts
API subscriptions
Mainframes*nix
Wire Data
Splunk App for Stream
Efficient Time Based Indexing
Splunk Differentiators
11
Splunk Differentiators
11
• Universal Machine Data Platform
• Schema on the fly
• Agile Reporting and Analytics
• Real-time Architecture
• Scales from desktop to enterprise
12
Splunk Components
12
Data Collection Layer - Universal Forwarders, syslog, API, TCP, Scripts, Wire, etc.
Data Indexing Layer – Indexer(s).
Data Presentation Layer– Search Head(s)
Universal Forwarder
13
1.
2.
3.
4.
How to Get Started
Download
Install
Forward Data
Search
Databases
Networks
Servers
Virtual
Machines
Smart
phones
and
Devices
Custom
Applications
Security
WebServer
Sensors
Four steps:
14
Demo – How it Works
14
1. Installing and Starting Splunk
2. Ingesting Data
3. Search Basics
• Search Bar
• Time Picker
• Extracted Fields
4. Alerting
5. Statistics and Reporting
6. Dynamic Field Extraction
7. Search Language
8. Splunk Applications
15
Demo
15
Copyright © 2015 Splunk Inc.
Copyright © 2015 Splunk Inc.
Copyright © 2015 Splunk Inc.
Copyright © 2015 Splunk Inc.
Copyright © 2015 Splunk Inc.
21
Education Resources
21
Splunk Education
• www.splunk.com/education
Using Splunk, Searching and Reporting, Developing Apps,
Administering Splunk, and more!
Books
• Implementing Splunk: Big Data Essentials for Operational Intelligence
• Splunk Essentials
• Exploring Splunk
• Splunk Operational Intelligence Cookbook
22
Supplemental Information
22
Download
• www.splunk.com/download
Search Tutorial:
• docs.splunk.com/Documentation/Splunk/latest/SearchTutorial
Tutorial Data:
• docs.splunk.com/images/Tutorial/tutorialdata.zip
23
Things to Remember
23
1. Splunk is Free – Download and get started today
2. Quick Time to Value
3. Data Gold Mines – what informational fortune awaits?!
4. Leverage the Splunk Community
• splunkbase.com
• answers.splunk.com
• blogs.splunk.com
5. Happy Splunking
Thank You

More Related Content

PPTX
Getting Started with Splunk Enterprises
PPTX
Getting Started with Splunk Enterprise
PPTX
Getting Started with Splunk Enterprise
PPTX
Power of Splunk Search Processing Language (SPL)
PPTX
Power of Splunk Search Processing Language (SPL) ...
PPTX
Data Obfuscation in Splunk Enterprise
PPTX
SplunkLive! Customer Presentation - Cardinal Health
PPT
SplunkLive! Customer Presentation - Penn State Hershey Medical Center
Getting Started with Splunk Enterprises
Getting Started with Splunk Enterprise
Getting Started with Splunk Enterprise
Power of Splunk Search Processing Language (SPL)
Power of Splunk Search Processing Language (SPL) ...
Data Obfuscation in Splunk Enterprise
SplunkLive! Customer Presentation - Cardinal Health
SplunkLive! Customer Presentation - Penn State Hershey Medical Center

What's hot (20)

PDF
Splunking configfiles 20211208_daniel_wilson
PPTX
SplunkLive! Customer Presentation – HCA
PDF
Splunk Sales Presentation Imagemaker 2014
PPTX
Daten anonymisieren und pseudonymisieren in Splunk Enterprise
PPTX
Taking Splunk to the Next Level - Architecture Breakout Session
PPTX
Data Onboarding Breakout Session
PPTX
Splunk Enterpise for Information Security Hands-On
PPTX
Getting Started with Splunk Breakout Session
PPTX
SplunkLive! London 2016 Splunk Overview
PDF
Splunk @ Adobe
PPTX
SplunkLive! Paris 2018: Splunk Overview
PPTX
Webinar: Improve Splunk Analytics and Automate Processes with SnapLogic
PPTX
Getting Started with Splunk Enterprise Hands-On
PDF
Webinar: Was ist neu in Splunk Enterprise 6.5
PDF
Yahoo Enabling Exploratory Analytics of Data in Shared-service Hadoop Clusters
PPTX
How to Design, Build and Map IT and Business Services in Splunk
PDF
Machine Data 101
PPTX
SplunkLive! Customer Presentation - Cisco Systems, Inc.
PPTX
Customer Presentation
PDF
Advanced Splunk Administration
Splunking configfiles 20211208_daniel_wilson
SplunkLive! Customer Presentation – HCA
Splunk Sales Presentation Imagemaker 2014
Daten anonymisieren und pseudonymisieren in Splunk Enterprise
Taking Splunk to the Next Level - Architecture Breakout Session
Data Onboarding Breakout Session
Splunk Enterpise for Information Security Hands-On
Getting Started with Splunk Breakout Session
SplunkLive! London 2016 Splunk Overview
Splunk @ Adobe
SplunkLive! Paris 2018: Splunk Overview
Webinar: Improve Splunk Analytics and Automate Processes with SnapLogic
Getting Started with Splunk Enterprise Hands-On
Webinar: Was ist neu in Splunk Enterprise 6.5
Yahoo Enabling Exploratory Analytics of Data in Shared-service Hadoop Clusters
How to Design, Build and Map IT and Business Services in Splunk
Machine Data 101
SplunkLive! Customer Presentation - Cisco Systems, Inc.
Customer Presentation
Advanced Splunk Administration
Ad

Similar to Getting Started with Splunk Breakout Session (20)

PPTX
Getting Started with Splunk Breakout Session
PPTX
Getting Started with Splunk Breakout Session
PPTX
Getting Started with Splunk Enterprise
PPTX
Getting Started with Splunk Enterprise
PPTX
Getting Started with Splunk Enterprise Hands-On
PPTX
Getting Started with Splunk Enterprise Hands-On
PPTX
Getting Started with Splunk Enterprise
PPTX
Getting Started with Splunk Enterprise
PDF
Getting Started with Splunk Enterprise
PPTX
Getting Started with Splunk Breakout Session
PPTX
Getting Started with Splunk (Hands-On)
PPTX
Getting Started with Splunk Breakout Session
PDF
SplunkLive! Amsterdam 2015 Breakout - Getting Started with Splunk
PPTX
SplunkLive! Tampa: Getting Started Session
ODP
Splunk
PPTX
Getting Started with Splunk Breakout Session
PPTX
Getting Started with Splunk Enterprise
PPTX
Getting Started with Splunk Enterprise
PPTX
SplunkLive! Washington DC May 2013 - Splunk Enterprise 5
PPTX
Getting Started with Splunk Enterprise
Getting Started with Splunk Breakout Session
Getting Started with Splunk Breakout Session
Getting Started with Splunk Enterprise
Getting Started with Splunk Enterprise
Getting Started with Splunk Enterprise Hands-On
Getting Started with Splunk Enterprise Hands-On
Getting Started with Splunk Enterprise
Getting Started with Splunk Enterprise
Getting Started with Splunk Enterprise
Getting Started with Splunk Breakout Session
Getting Started with Splunk (Hands-On)
Getting Started with Splunk Breakout Session
SplunkLive! Amsterdam 2015 Breakout - Getting Started with Splunk
SplunkLive! Tampa: Getting Started Session
Splunk
Getting Started with Splunk Breakout Session
Getting Started with Splunk Enterprise
Getting Started with Splunk Enterprise
SplunkLive! Washington DC May 2013 - Splunk Enterprise 5
Getting Started with Splunk Enterprise
Ad

More from Splunk (20)

PDF
Splunk Leadership Forum Wien - 20.05.2025
PDF
Splunk Security Update | Public Sector Summit Germany 2025
PDF
Building Resilience with Energy Management for the Public Sector
PDF
IT-Lagebild: Observability for Resilience (SVA)
PDF
Nach dem SOC-Aufbau ist vor der Automatisierung (OFD Baden-Württemberg)
PDF
Monitoring einer Sicheren Inter-Netzwerk Architektur (SINA)
PDF
Praktische Erfahrungen mit dem Attack Analyser (gematik)
PDF
Cisco XDR & Splunk SIEM - stronger together (DATAGROUP Cyber Security)
PDF
Security - Mit Sicherheit zum Erfolg (Telekom)
PDF
One Cisco - Splunk Public Sector Summit Germany April 2025
PDF
.conf Go 2023 - Data analysis as a routine
PDF
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV
PDF
.conf Go 2023 - Navegando la normativa SOX (Telefónica)
PDF
.conf Go 2023 - Raiffeisen Bank International
PDF
.conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett
PDF
.conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär)
PDF
.conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu...
PDF
.conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever...
PDF
.conf go 2023 - De NOC a CSIRT (Cellnex)
PDF
conf go 2023 - El camino hacia la ciberseguridad (ABANCA)
Splunk Leadership Forum Wien - 20.05.2025
Splunk Security Update | Public Sector Summit Germany 2025
Building Resilience with Energy Management for the Public Sector
IT-Lagebild: Observability for Resilience (SVA)
Nach dem SOC-Aufbau ist vor der Automatisierung (OFD Baden-Württemberg)
Monitoring einer Sicheren Inter-Netzwerk Architektur (SINA)
Praktische Erfahrungen mit dem Attack Analyser (gematik)
Cisco XDR & Splunk SIEM - stronger together (DATAGROUP Cyber Security)
Security - Mit Sicherheit zum Erfolg (Telekom)
One Cisco - Splunk Public Sector Summit Germany April 2025
.conf Go 2023 - Data analysis as a routine
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV
.conf Go 2023 - Navegando la normativa SOX (Telefónica)
.conf Go 2023 - Raiffeisen Bank International
.conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett
.conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär)
.conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu...
.conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever...
.conf go 2023 - De NOC a CSIRT (Cellnex)
conf go 2023 - El camino hacia la ciberseguridad (ABANCA)

Recently uploaded (20)

PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
Cloud computing and distributed systems.
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
Big Data Technologies - Introduction.pptx
PDF
Empathic Computing: Creating Shared Understanding
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Electronic commerce courselecture one. Pdf
PDF
Machine learning based COVID-19 study performance prediction
PDF
NewMind AI Weekly Chronicles - August'25 Week I
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
A Presentation on Artificial Intelligence
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Building Integrated photovoltaic BIPV_UPV.pdf
Cloud computing and distributed systems.
Unlocking AI with Model Context Protocol (MCP)
Mobile App Security Testing_ A Comprehensive Guide.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Big Data Technologies - Introduction.pptx
Empathic Computing: Creating Shared Understanding
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Electronic commerce courselecture one. Pdf
Machine learning based COVID-19 study performance prediction
NewMind AI Weekly Chronicles - August'25 Week I
The AUB Centre for AI in Media Proposal.docx
A Presentation on Artificial Intelligence
MYSQL Presentation for SQL database connectivity
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Review of recent advances in non-invasive hemoglobin estimation
Advanced methodologies resolving dimensionality complications for autism neur...
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx

Getting Started with Splunk Breakout Session

  • 1. Copyright © 2015 Splunk Inc. Getting Started
  • 2. 2 Why Splunk? “Splunk takes machine data and makes it relevant for non-technical business users. ..Splunk provide[s] both the up-to-the-minute and long-term trending data business users need to make the decisions that impact revenue.” “It's become a collaborative tool where everybody can gather around the same data and see the same big picture.” “I'm sometimes so amazed by what we can do with Splunk, I wonder if there's magic in there.” Splunk selected by Symantec to help security intelligence operations. Symantec centralizes, monitors and analyzes security-related data in Splunk Enterprise to help investigate incidents and detect advanced threats. Symantec also uses Splunk software to ensure comprehensive compliance with Sarbanes-Oxley (SOX) and the Payment Card Industry Data Security Standard (PCI DSS). www.datacenterknowledge.com
  • 3. 3 Splunk Company Overview 3 Company • Global HQs:  San Francisco  London  Hong Kong • 1,700 employees globally • Annual Revenue for FY14: $450M (YoY +50%) • NASDAQ: SPLK Products • Free trial to massive scale • Splunk products:  Splunk Enterprise  Splunk Cloud  Hunk  Splunk MINT  Premium Apps  VMWARE  MS Exchange  PCI Comp and ES App Customers • 9,000+ customers • Across 100+ countries • Small to large organizations • 80+ of the Fortune 100 • Largest license:  400+ Terabytes/day
  • 4. 4 Our Plan of Action 4 1.Big Data - setting the stage. 2.How does Splunk fit in the landscape? 3.What differentiates Splunk? 4.Components that make up Splunk? 5.Demo - How it works?
  • 5. 5 The Accelerating Pace of Data Volume | Velocity | Variety | Variability GPS, RFID, Hypervisor, Web Servers, Email, Messaging, Clickstreams, Mobile, Telephony, IVR, Databases, Sensors, Telematics, Storage, Servers, Security Devices, Desktops Machine data is the fastest growing, most complex, most valuable area of big data 5
  • 6. 6 6 Making machine data accessible, usable and valuable to everyone. 6
  • 7. 7 Big Data Landscape Key/Value, Columnar or Other (semi-structured) Cassandra CouchDB MongoDB NoSQL 7 Relational Database (highly structured) SQL & MapReduce RDBMS Oracle, MySQL, IBM DB2, Teradata Teradata Aster Data SQL on Hadoop Distributed File System (semi-structured) Hadoop HDFS Storage + MapReduce Temporal, Unstructured Heterogeneous Real-Time Indexing MapReduce
  • 8. 8 Big Data Landscape Key/Value, Columnar or Other (semi-structured) Cassandra CouchDB MongoDB NoSQL 8 Relational Database (highly structured) SQL & MapReduce RDBMS Oracle, MySQL, IBM DB2, Teradata Teradata Aster Data SQL on Hadoop Distributed File System (semi-structured) Hadoop HDFS Storage + MapReduce Temporal, Unstructured Heterogeneous Real-Time Indexing MapReduce
  • 9. 9 perf shell API Mounted File Systems hostnamemount syslog TCP/UDP Event Logs Performance Active Directory syslog hosts and network devices Unix, Linux and Windows hosts Local File Monitoring Splunk Forwarder virtual host Windows Scripted or Modular Inputs shell scripts API subscriptions Mainframes*nix Wire Data Splunk App for Stream Efficient Time Based Indexing Splunk Differentiators
  • 10. 11 Splunk Differentiators 11 • Universal Machine Data Platform • Schema on the fly • Agile Reporting and Analytics • Real-time Architecture • Scales from desktop to enterprise
  • 11. 12 Splunk Components 12 Data Collection Layer - Universal Forwarders, syslog, API, TCP, Scripts, Wire, etc. Data Indexing Layer – Indexer(s). Data Presentation Layer– Search Head(s) Universal Forwarder
  • 12. 13 1. 2. 3. 4. How to Get Started Download Install Forward Data Search Databases Networks Servers Virtual Machines Smart phones and Devices Custom Applications Security WebServer Sensors Four steps:
  • 13. 14 Demo – How it Works 14 1. Installing and Starting Splunk 2. Ingesting Data 3. Search Basics • Search Bar • Time Picker • Extracted Fields 4. Alerting 5. Statistics and Reporting 6. Dynamic Field Extraction 7. Search Language 8. Splunk Applications
  • 15. Copyright © 2015 Splunk Inc.
  • 16. Copyright © 2015 Splunk Inc.
  • 17. Copyright © 2015 Splunk Inc.
  • 18. Copyright © 2015 Splunk Inc.
  • 19. Copyright © 2015 Splunk Inc.
  • 20. 21 Education Resources 21 Splunk Education • www.splunk.com/education Using Splunk, Searching and Reporting, Developing Apps, Administering Splunk, and more! Books • Implementing Splunk: Big Data Essentials for Operational Intelligence • Splunk Essentials • Exploring Splunk • Splunk Operational Intelligence Cookbook
  • 21. 22 Supplemental Information 22 Download • www.splunk.com/download Search Tutorial: • docs.splunk.com/Documentation/Splunk/latest/SearchTutorial Tutorial Data: • docs.splunk.com/images/Tutorial/tutorialdata.zip
  • 22. 23 Things to Remember 23 1. Splunk is Free – Download and get started today 2. Quick Time to Value 3. Data Gold Mines – what informational fortune awaits?! 4. Leverage the Splunk Community • splunkbase.com • answers.splunk.com • blogs.splunk.com 5. Happy Splunking

Editor's Notes

  • #3: [Why Splunk? What makes it special? SE presenting can tell their own story] What get’s me excited about Splunk? It’s disruptive! It’s impactful! In my 11 years of working with Enterprise software I have never seen any other other software that invigorates, inspires, and resonates with customers, myself included. So I figured I why not share some of Splunk’s testimonials.
  • #4: Splunk has more than 1200 employees worldwide, with our global headquarters in San Francisco. Our 7,900 customers in 100 countries are using Splunk software to improve service levels, reduce operations costs, mitigate security risks, enable compliance, enhance DevOps collaboration and create new product and service offerings. Our products are designed to fit your needs and are built to be as frictionless to deploy as possible. Simple download Splunk software, point it at your data, and you’ll up and running in minutes. Please always refer to latest company data found here: http://www.splunk.com/company.
  • #6: Data is growing and embodies new characteristics not found in traditional structured data: Volume, Velocity, Variety, Variability. "Big data" is a term applied to these expanding data sets whose size is beyond the ability of commonly used software tools to capture, manage, and process the data within a tolerable elapsed time. Machine data is one of the fastest, growing, most complex and most valuable segments of big data and embodies new characteristics not found in traditional structured data terms of Volume, Velocity, Variety, Variability. All the webservers, applications, network devices – all of the technology infrastructure running an enterprise or organization – generates massive streams of data, digital exhaust per say. It comes in an array of unpredictable formats that are difficult to process and analyze by traditional methods or in a timely manner. So why is this “machine data” valuable? Because it contains a trace - a categorical record - of user behavior, cyber-security risks, application behavior, service levels, fraudulent activity and customer experiences.
  • #7: Splunk’s mission is to make YOUR machine data accessible, usable and valuable to everyone. It’s this overarching mission that drives our company and products that we deliver.
  • #8: How has big data evolved over time. For a long time, ‘big data’ was was simply a large database. The database industry – in order to handle large data – moved to smaller databases, but many of them. Horizontal partitioning (Also known as Sharding) is a database design principle whereby rows of a database table are held separately (For example, A -> D in one database E -> H in a second database, etc ..) Hadoop was introduced by Google and was adapted as the de-facto big data system. Hadoop is an open source project from Apache that has evolved rapidly into a major technology movement. It has emerged as a popular way to handle massive amounts of data, including structured and complex unstructured data. Its popularity is due in part to its ability to store and process large amounts of data effectively across clusters of commodity hardware, particularly cheaply. Apache Hadoop is not actually a single product but instead a collection of several components. For the most part, Hadoop is a batch oriented system. ** Teradata Aster Data & SQL on Hadoop are SQL interface systems that can talk to Hadoop ** Cassandra & HBase are NoSQL databases that can process data using a Key / Value in real-time. Splunk = Temporal, Unstructured, Heterogeneous, real-time analytics platform. Besides relational databases, the technologies leverage a form of MapReduce – which is a programming model for processing and generating large data sets. So we’ll dig deeper in a bit to see what truly differentiates Splunk. Splunk can also enrich your machine data with several types of external data sources, included are databases, Hadoop, and NoSQL data stores.
  • #9: How has big data evolved over time. For a long time, ‘big data’ was was simply a large database. The database industry – in order to handle large data – moved to smaller databases, but many of them. Horizontal partitioning (Also known as Sharding) is a database design principle whereby rows of a database table are held separately (For example, A -> D in one database E -> H in a second database, etc ..) Hadoop was introduced by Google and was adapted as the de-facto big data system. Hadoop is an open source project from Apache that has evolved rapidly into a major technology movement. It has emerged as a popular way to handle massive amounts of data, including structured and complex unstructured data. Its popularity is due in part to its ability to store and process large amounts of data effectively across clusters of commodity hardware, particularly cheaply. Apache Hadoop is not actually a single product but instead a collection of several components. For the most part, Hadoop is a batch oriented system. ** Teradata Aster Data & SQL on Hadoop are SQL interface systems that can talk to Hadoop ** Cassandra & HBase are NoSQL databases that can process data using a Key / Value in real-time. Splunk = Temporal, Unstructured, Heterogeneous, real-time analytics platform. Besides relational databases, the technologies leverage a form of MapReduce – which is a programming model for processing and generating large data sets. So we’ll dig deeper in a bit to see what truly differentiates Splunk. Splunk can also enrich your machine data with several types of external data sources, included are databases, Hadoop, and NoSQL data stores.
  • #10: Getting data into Splunk is designed to be as flexible and easy as possible. In most cases you’ll find that no configuration is required; you just have to determine what data to collect and which method you want to use to get it into Splunk. Splunk is THE universal machine data platform. It goes beyond ingesting just log files, ingesting data from syslog, scripts, system events, API’s, even wire data! The result is beautifully indexed time-based series events, previously in disparate silos that can now be cross-correlated and made accessible to everyone your organization. Notice here that we are ingesting local files, data from syslogs, output from scripts and even wire data. Let’s see how the Splunk platform supports all this data collection.
  • #13: Three major tiers and components of Splunk Distribution Data Collection Layer -> The star of the show here is Splunk’s Universal Forwarder. Data Indexing Layer -> The Data Layer’s job is to collect and/or forward data to the Data Indexing Layer - Powered by Splunk Indexers. Indexers are containers of indexes, logical containers for data to reside in. Data Presentation Layer -> Powered by Search Heads is responsible for distributing searches to the indexing layer, aggregate the final results, and present it to the end user. Viewing the data -> No special or custom client needed! Simply use your favorite browser and point to your Search Head. Now, in modestly small deployments the data indexing and searching will be done with the same Splunk Instance.
  • #14: It only takes minutes to download and install Splunk on the platform of your choice, bringing you fast time to value. Once Splunk has been downloaded and installed the next step is to get data into a Splunk instance. The data then becomes searchable from a single place! Since Splunk stores only a copy of the raw data, searches won’t affect the end devices data comes from. Having a central place to search your data not only simplifies things, it also decreases risk since a user doesn’t have to log into the end devices. Splunk can be installed on a single small instance, such as a laptop, or installed on multiple servers to scale as needed. The ability to scale from a single desktop to an enterprise is another of our key differentiators. When installed on multiple servers the functions can be split up to meet any performance, security, or availability requirements.
  • #16: Start up a brand new Splunk Have a ready data set, typically use tutorial Literally drag and drop. Go back to components, what make them up Run two manual queries, paints picture of we can do. Patterns Create a data model (Use instant pivot) Create output Do something completely impressive. (create party on third party system, 3d graph, alert, something tangible outside of Splunk)   Highlight best Splunk 6 features, add data, patterns, instant pivot,
  • #22: Data is growing and embodies new characteristics not found in traditional structured data: Volume, Velocity, Variety, Variability. "Big data" is a term applied to these expanding data sets whose size is beyond the ability of commonly used software tools to capture, manage, and process the data within a tolerable elapsed time. Machine data is one of the fastest, growing, most complex and most valuable segments of big data and embodies new characteristics not found in traditional structured data terms of Volume, Velocity, Variety, Variability. All the webservers, applications, network devices – all of the technology infrastructure running an enterprise or organization – generates massive streams of data, digital exhaust per say. It comes in an array of unpredictable formats that are difficult to process and analyze by traditional methods or in a timely manner. So why is this “machine data” valuable? Because it contains a trace - a categorical record - of user behavior, cyber-security risks, application behavior, service levels, fraudulent activity and customer experiences.