Copyright © 2016 Splunk Inc.
Splunk Ninjas: New Features and Search Dojo
Richard Morgan - Splunk
Safe Harbor Statement
During the course of this presentation, we may make forward-looking statements regarding future events
or the expected performance of the company. We caution you that such statements reflect our current
expectations and estimates based on factors currently known to us and that actual events or results could
differ materially. For important factors that may cause actual results to differ from those contained in our
forward-looking statements, please review our filings with the SEC. The forward-looking statements
made in this presentation are being made as of the time and date of its live presentation. If reviewed
after its live presentation, this presentation may not contain current or accurate information. We do not
assume any obligation to update any forward-looking statements we may make. In addition, any
information about our roadmap outlines our general product direction and is subject to change at any
time without notice. It is for informational purposes only and shall not be incorporated into any contract
or other commitment. Splunk undertakes no obligation either to develop the features or functionality
described or to include any such feature or functionality in a future release.
Agenda
What’s new in 6.4 (and a few goodies from 6.3!)
– TCO & Performance Improvements
– Platform Security and Management
– New Interactive Visualizations
Harness the power of search
– The 5 Search Commands That Can Solve Most Problems
Tricks and tips
Splunk Enterprise & Cloud 6.4

Storage TCO Reduction
- TSIDX Reduction reduces historical data storage TCO by 40%+

Platform Security & Management
- Improved DMC
- New SSO Options
- Improved Event Collector

New Interactive Visualizations
- New Pre-built Visualizations
- Open Community Library
- Event Sampling and Predict
TSIDX Reduction
Provides up to 40-80% storage reduction
Retention policy on TSIDX files creates "mini" TSIDX files
Trade-off between storage cost and search performance
– Rare vs. dense searches
*Limited functionality loss (tstats cannot run on reduced buckets)
Original TSIDX files can be restored if needed
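TSIDX reduction is configured per index in indexes.conf. A minimal sketch, assuming a hypothetical index name and an illustrative 90-day threshold:

[web_archive]
# Enable tsidx reduction for this index
enableTsidxReduction = true
# Reduce tsidx files in buckets older than ~90 days (value is in seconds)
timePeriodInSecBeforeTsidxReduction = 7776000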
Management & Platform Enhancements

Management
– Distributed Management Console: new monitoring views for the scheduler, Event Collector, and system I/O performance
– Delegated admin roles

HTTP Event Collector
– Payloads no longer restricted to JSON
– Indexing acknowledgement to verify data delivery
– AWS IoT support

SAML Identity Provider Support
– Okta, Azure AD, ADFS, Ping Federate
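A minimal sketch of sending an event to the HTTP Event Collector with indexing acknowledgement enabled; the host, token, and channel GUID are placeholders:

# Send an event; the channel header is required when acks are enabled
curl -k https://splunk.example.com:8088/services/collector/event \
  -H "Authorization: Splunk 00000000-0000-0000-0000-000000000000" \
  -H "X-Splunk-Request-Channel: 11111111-1111-1111-1111-111111111111" \
  -d '{"event": "hello HEC", "sourcetype": "demo"}'
# The response carries an ackId; poll for it on the same channel
curl -k https://splunk.example.com:8088/services/collector/ack \
  -H "Authorization: Splunk 00000000-0000-0000-0000-000000000000" \
  -H "X-Splunk-Request-Channel: 11111111-1111-1111-1111-111111111111" \
  -d '{"acks": [0]}'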
Custom Visualizations
Unlimited new ways to visualize your data
15 new interactive visualizations useful for IT, security, IoT, and business analysis
Open framework to create or customize any visual
Visuals shared via the Splunkbase library
Available for any use: search, dashboards, reports…
New Custom Visualizations
Treemap, Sankey Diagram, Punchcard, Calendar Heat Map, Parallel Coordinates, Bullet Graph, Location Tracker, Horseshoe Meter, Machine Learning Charts, Timeline, Horizon Chart
Multiple use cases across IT, security, IoT, and business analytics
Event Sampling
• Powerful search option that returns unbiased sample results
• Useful for quickly determining dataset characteristics
• Speeds large-scale data investigation and discovery
Optimizes query performance for big data analysis
Predict Command Enhancements
• Time-series forecasting
• New algorithms:
  • Support bivariate time series with covariance
  • Predict multiple series independently
  • Predict missing values within a series
• 80-100x performance improvement
Forecast trends and predict missing values
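A minimal sketch of the predict command on web access data; the daily span and seven-day horizon are illustrative:

sourcetype=access*
| timechart span=1d count AS requests
| predict requests future_timespan=7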
Demo
Download the Overview App (6.4) & 6.x Dashboard Examples
Harness the Power of Search
Search Processing Language
search and filter | munge | report | cleanup

sourcetype=access*
| eval KB=bytes/1024
| stats sum(KB) dc(clientip)
| rename sum(KB) AS "Total KB" dc(clientip) AS "Unique Customers"
Five Commands That Will Solve Most Data Questions
eval - Modify or Create New Fields and Values
stats - Calculate Statistics Based on Field Values
eventstats - Add Summary Statistics to Search Results
streamstats - Cumulative Statistics for Each Event
transaction - Group Related Events Spanning Time
eval - Modify or Create New Fields and Values

Examples
• Calculation:
sourcetype=access*
| eval KB=bytes/1024
• Evaluation:
sourcetype=access*
| eval http_response = if(status == 200, "OK", "Error")
• Concatenation:
sourcetype=access*
| eval connection = clientip.":".port
stats – Calculate Statistics Based on Field Values

Examples
• Calculate stats and rename
sourcetype=access*
| eval KB=bytes/1024
| stats sum(KB) AS "Total KB"
• Multiple statistics
sourcetype=access*
| eval KB=bytes/1024
| stats sum(KB) avg(KB)
• By another field
sourcetype=access*
| eval KB=bytes/1024
| stats sum(KB) avg(KB) by clientip
eventstats – Add Summary Statistics to Search Results

Examples
• Overlay Average
sourcetype=access*
| eventstats avg(bytes) AS avg_bytes
| timechart latest(avg_bytes) avg(bytes)
• Moving Average
sourcetype=access*
| eventstats avg(bytes) AS avg_bytes by date_hour
| timechart latest(avg_bytes) avg(bytes)
• By created field
sourcetype=access*
| eval http_response = if(status == 200, "OK", "Error")
| eventstats avg(bytes) AS avg_bytes by http_response
| timechart latest(avg_bytes) avg(bytes) by http_response
streamstats – Cumulative Statistics for Each Event
Examples
• Cumulative Sum
sourcetype=access*
| reverse
| streamstats sum(bytes) as bytes_total
| timechart max(bytes_total)
• Cumulative Sum by Field
sourcetype=access*
| reverse
| streamstats sum(bytes) as bytes_total by status
| timechart max(bytes_total) by status
• Moving Average
sourcetype=access*
| timechart avg(bytes) as avg_bytes
| streamstats avg(avg_bytes) AS moving_avg_bytes window=10
| timechart latest(moving_avg_bytes) latest(avg_bytes)
transaction – Group Related Events Spanning Time
Examples
• Group by Session ID
sourcetype=access*
| transaction JSESSIONID
• Calculate Session Durations
sourcetype=access*
| transaction JSESSIONID
| stats min(duration) max(duration) avg(duration)
• Stats is Better
sourcetype=access*
| stats min(_time) AS earliest max(_time) AS latest by JSESSIONID
| eval duration=latest-earliest
| stats min(duration) max(duration) avg(duration)
Learn Them Well and Become a Ninja
eval - Modify or Create New Fields and Values
stats - Calculate Statistics Based on Field Values
eventstats - Add Summary Statistics to Search Results
streamstats - Cumulative Statistics for Each Event
transaction - Group Related Events Spanning Time
See many more examples and neat tricks at docs.splunk.com and answers.splunk.com
Tricks and tips
Use Ctrl + Enter to break up searches
Ctrl + Enter inserts a line break in the search bar, making long pipelines easier to read. The example below calculates indexing delay by host.
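The slide showed the search as a screenshot; a minimal sketch of the idea:

index=*
| eval delay=_indextime - _time
| stats avg(delay) AS avg_delay_seconds by host
| sort - avg_delay_seconds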
Using OR with time ranges
Time modifiers (earliest/latest) written as search terms can be combined with OR, so one search covers two windows. Great for comparing time ranges in a single search.
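The before/after searches were screenshots; a minimal sketch of the pattern, with illustrative windows comparing the last 24 hours against the same day a week earlier:

sourcetype=access* (earliest=-24h latest=now) OR (earliest=-8d latest=-7d)
| eval period=if(_time >= relative_time(now(), "-24h"), "last 24h", "a week ago")
| stats count by period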
Generate results without a search
Creates a list of ten random numbers (a sketch follows).
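The slide's search was a screenshot; a minimal sketch using makeresults (the 0-9 range is illustrative):

| makeresults count=10
| eval random_number=random() % 10
| table random_number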
How many days I’ve worked at Splunk
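Another screenshot; a hedged sketch of one way to compute it, with a hypothetical start date:

| makeresults
| eval start=strptime("2014-06-01", "%Y-%m-%d")
| eval days_at_splunk=round((now() - start) / 86400)
| table days_at_splunk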
How many months I've worked at Splunk
Uses timechart to generate the monthly periods, then counts the number of periods. More accurate than dividing the day count by 30!
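A sketch of the technique, with the same hypothetical start date: seed one event at the start date and one at now, bucket by month, then count the buckets:

| makeresults
| eval _time=strptime("2014-06-01", "%Y-%m-%d")
| append [| makeresults]
| timechart span=1mon count
| stats count AS months_at_splunk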
Mess with columns #1
• Generates 10 rows
• Adds three columns with random numbers between 0-10
• Computes the mean of the columns whose names start with "col" (a sketch follows)
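The slide's search was a screenshot; a minimal sketch of the three bullets using makeresults and foreach (field names are illustrative):

| makeresults count=10
| eval col1=random() % 11, col2=random() % 11, col3=random() % 11
| foreach col* [ eval total=coalesce(total, 0) + '<<FIELD>>' ]
| eval mean=round(total / 3, 2)
| table col* total mean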
Mess with Columns #2
• Generates 10 rows
• Adds three columns with random numbers between 0-10
• Renames the columns (a sketch follows)
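A sketch of the renaming variant, using rename's wildcard support (the new names are illustrative):

| makeresults count=10
| eval col1=random() % 11, col2=random() % 11, col3=random() % 11
| rename col* AS column*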
Use accelerated data models in searches
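Both slides showed the searches as screenshots; a minimal tstats sketch against a hypothetical accelerated data model named Web:

| tstats count from datamodel=Web where Web.status=404 by Web.uri_path
| sort - count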
Dynamically build search strings
A subsearch can emit a literal query string (via | format) that the outer search then executes.
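A minimal sketch of the pattern (index and field are illustrative): the subsearch returns rows that format turns into a literal boolean expression, which the outer search then executes:

index=web [ search index=web
  | top limit=3 host
  | fields host
  | format ]
| timechart count by host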
Mash it up!
• At 10 minute intervals how many users were logged into the server?
Horror query – SPL as a functional language
tag=minecraft logged
| rex field=_raw "\]: (?<name>[^ \[]+)( (?<logged_out>left the game)|(\[[/\d.:]+\] (?<logged_in>logged in)))"
| timechart span=10m values(name) as names
| eventstats values(names) as name
| fields - names
| mvexpand name
| search
[| pivot Minecraft_log_messages
Login_location count(Login_location) AS "logged_in"
SPLITROW _time AS _time PERIOD minute
SPLITROW name AS name SPLITROW ip AS ip
SORT 1000 _time ROWSUMMARY 0 COLSUMMARY 0 NUMCOLS 0 SHOWOTHER 1
| eval logged_out=0
| append
[
| pivot Minecraft_log_messages
Log_out count(Log_out) AS "logged_out"
SPLITROW _time AS _time PERIOD minute
SPLITROW name AS name
SORT 1000 _time ROWSUMMARY 0 COLSUMMARY 0 NUMCOLS 0 SHOWOTHER 1
| eval logged_in=0
]
| sort - _time
| streamstats current=f max(_time) as max_time last(_time) as last_time by name
| eventstats max(_time) as max_time by name
| eval status=if(logged_in=0,"Logged out Period", "Logged in Period")
| where (_time=max_time AND NOT isnull(coord_label)) OR status="Logged in Period"
| eval status=if(_time=max_time,"Currently logged in",status)
| eval last_time=if(_time=max_time,info_max_time,last_time)
| eval time=_time
| table name time last_time
| eval max_window=time+600
| eval condition="name=".name." AND _time>=".time." AND (_time<=". last_time." OR
_time<=".max_window.")"
| eval search=condition
| table search
| format
| rex field=search mode=sed "s/\"//g" ]
| timechart span=10m count by name
Questions?
Splunk Mobile App
Embedding Operational Intelligence
• Access dashboards and reports
• Annotate dashboards and share with others
• Receive push notifications
Native Mobile Experience
Not This…
Thank You
Bonus Command
cluster – Find Common and/or Rare Events
Examples
• Find the most common events
*
| cluster showcount=t t=0.1
| table cluster_count, _raw
| sort - cluster_count
• Select a field to cluster on
sourcetype=access*
| cluster field=bc_uri showcount=t
| table cluster_count bc_uri _raw
| sort -cluster_count
• Most or least common errors
index=_internal source=*splunkd.log* log_level!=info
| cluster showcount=t
| table cluster_count _raw
| sort -cluster_count
Editor's Notes

  • #2: Here is what you need for this presentation: Link to videos on box: <coming soon> You should have the following installed: 6.4 Overview OI Demo 3.2 – Note this is not on Enablement yet. Please request this from sluedtke@splunk.com. The enablement link will be placed here once available. NOTE: Configure your role to search the oidemo index by default, otherwise you will have to type "index=oidemo" for the examples later on. There is a lot to cover in this presentation! Try to go quickly and at a pretty high level. When you get through the presentation, judge the audience's interest and go deeper into whichever section suits them. For example, if they want to know more about Choropleths and polygons, spend some time there; if they want to go deeper on the search commands, talk through the extra examples.
  • #3: Splunk safe harbor statement.
  • #4: Today I’m going to show you some of the new features available in Splunk 6.4. For TCO & Performance Improvements we’ve created new options to reduce your storage footprint as well as a new event sampling feature to optimize query performance and help you answer questions faster. For Platform Security and Management we have added new single sign-on capabilities, new features to the HTTP Event Collector and finally new views and dashboards to the Distributed Management Console. Then for my favorite part, the new Interactive Visualizations. Not only did we double the amount of visualizations available in Splunk, but we’ve provided a way for developers, partners and the community to create their own and integrate with the Splunk interface natively. Lastly we will go through some of the most commonly used search commands and how they are used so you can become a Splunk Ninja in 6.4!
  • #5: Objective: We want to help you change from this…
  • #6: To this…
  • #7: Let’s start with TCO & Performance Improvements.
  • #8: *Limited functionality loss refers to not being able to use tstats on TSIDX-reduced data, because the full tsidx files are no longer present. Extra material: Q: How does it affect performance? Can I still search the data? A: You can access the data in all of the normal ways, and for many search and reporting activities there is little impact. For "dense" searches (searches whose results return most of the data for the time range searched), the performance impact will be minimal. For "sparse" or "needle in the haystack" searches (searches that return very few results), searches that typically return in seconds will now return in minutes. Note: This feature can be selectively applied to any index; the goal is to apply it to data that is less frequently accessed – data for which you are willing to sacrifice some performance in order to gain a very significant cost savings. Splunk specialists can help you set the right policies for the right data. Q: Do apps and Premium Solutions still work? A: Yes. Q: How do I control what data is minimized? Can I bring data back to the standard state? A: You set policy by data age and by the type of data (index); different data can have different time criteria for minimization, and you can return data to the original state if needed. Q: Why does your optimization data take up so much space? A: Even including the optimization data, Splunk compression techniques have already reduced the customer's storage requirements by over 50% during indexing. The optimization metadata (TSIDX – time-series index) is what enables the customer to ask any question of their data and handle any type of investigation or use case in real time. By keeping data in its original unstructured state, Splunk offers the flexibility to ask any question of the data; it structures the answer to each query on the fly rather than forcing a fixed data structure that limits the questions that can be asked. Q: Why is the savings range so large (40-80%)? A: The storage used by TSIDX varies with the nature and cardinality (uniqueness) of the data indexed, so the savings vary across data types. Repetitive data fields yield lower savings, while unique (high-cardinality) data yields higher savings because it requires more index entries to describe it; typical syslog data falls in the middle, at about 60-70%. We expect most customers to see an overall benefit of 60% or more.
  • #9: Platform Security & Management
  • #10: DMC: In 6.3 we re-worked the Distributed Management Console. In 6.4 we enhanced it even more, adding new views and monitoring capabilities such as: HTTP Event Collector Views - performance tracking for the HTTP Event Collector feature, including breakdowns by authorization token. TCP Inputs - a partner to the Forwarder performance views in DMC, tracking TCP queue health and other TCP input statistics. Deployment-Wide Search Statistics - identify top search users across a multi-search-head deployment, including frequent and long-running searches. Distributed Search View - a dashboard dedicated to tracking metrics for search in distributed deployments, including views for bundle replication performance and dispatch directory statistics. Resource Usage, I/O - in addition to useful data on CPU and memory consumption, now also see I/O bandwidth utilization for any Splunk host or across hosts. Index Performance, Multi-pipeline - updated views in the deployment-wide and instance-scoped Indexing Performance pages to accommodate multi-pipeline indexing. Threshold Control - fine-grained controls for visual thresholds for DMC displays containing CPU, Memory, Indexing Rate, Search Concurrency, and Up/Down Status. HTTP Event Collector: In 6.3 we added the HTTP Event Collector; now we've improved it by enabling unrestricted data for payloads (beyond JSON) and data indexing acknowledgements so customers can verify data was received. SAML: And finally, we've added additional single sign-on options for added flexibility.
  • #11: Platform Security & Management
  • #12: Release 6.4 delivers an array of new pre-built visualizations, a visualization developer framework, and an open library to make it simple for customers to access, develop, and share interactive visualizations. 15 new pre-built visualizations help customers analyze and interact with data sets commonly found in IT, security, and machine learning analysis. A new developer framework allows customers and partners to easily create or customize any visualization to suit their needs. Splunkbase now contains a growing library of visualizations provided by Splunk, our partners, and our community. This doubles the visualizations in Splunk today and creates an open environment for the unlimited creation and sharing of new visualizations. Once a visual is imported from Splunkbase it is treated the same as any native Splunk feature and is available for general use in the Visualizations dropdown.
  • #13: 15 new pre-built visualizations help customers analyze and interact with data sets commonly found in IT, security, and machine learning analysis. We surveyed our customers and the field to choose an initial set that would meet many common needs.
  • #14: The new Event Sampling feature makes it faster to characterize very large datasets and focus your investigations. It is an integrated option of Search, offering a dropdown menu to control sampling at 1 per 10, 100, 1,000, 10,000, etc. Performance scales accordingly – a 1-per-1,000 sample search runs 1,000x faster.
  • #15: Main algorithm used – Kalman filter. Algorithmic improvements: support bivariate time series by taking the covariance between the individual time series into account; predict multiple time series at the same time, treating individual series independently (i.e., without computing covariance); predict missing values in a time series and account for that during prediction via missing-value imputation methods (i.e., "No value was recorded, but it was most likely 5").
  • #16: Use Splunk Ninja App and Demo Instructions
  • #17: For more information, or to try out the features yourself, check out the Overview app, which explains each of the features and includes code samples and examples where applicable.
  • #18: <This section should take ~15 minutes> Search is the most powerful part of Splunk.
  • #19: The Splunk search language is very expressive and can perform a wide variety of tasks, ranging from filtering data, to munging, to reporting. The results can be used to answer questions, visualize results, or even be sent to a third-party application in whatever format it requires. There are over 135 documented search commands; however, most questions can be answered using just a handful.
  • #20: These are the five commands you should get very familiar with. If you know how to use these well, you will be able to solve most data questions that come your way. Let’s take a quick look at each of these.
  • #21: <Walk through the examples with a demo. Hidden slides are available as backup. NOTE: Each of the grey boxes is clickable. If you are running Splunk on port 8000 you won't have to type in the searches, which will save time.>
  • #23: sourcetype=access* | eval http_response = if(status == 200, "OK", "Error") | eventstats avg(bytes) AS avg_bytes by http_response | timechart latest(avg_bytes) avg(bytes)
  • #25: Note: Chart is just stats visualized. Timechart is just stats by _time visualized.
  • #26: sourcetype=access* | eval KB=bytes/1024 | stats sum(KB) AS "Sum of KB"
  • #27: sourcetype=access* | stats values(useragent) avg(bytes) max(bytes) by clientip
  • #28: sourcetype=access* | stats values(useragent) avg(bytes) max(bytes) by clientip
  • #29: Eventstats lets you add statistics about the entire result set and makes those statistics available as fields on each event. <Walk through the examples with a demo. Hidden slides are available as backup>
  • #30: Eventstats lets you add statistics about the entire result set and makes those statistics available as fields on each event. Let's use eventstats to create a timechart of the average bytes on top of the overall average. index=* sourcetype=access* | eventstats avg(bytes) AS avg_bytes | timechart latest(avg_bytes) avg(bytes)
  • #31: We can turn this into a moving average simply by adding “by date_hour” to calculate the average per hour instead of the overall average. index=* sourcetype=access* | eventstats avg(bytes) AS avg_bytes by date_hour | timechart latest(avg_bytes) avg(bytes)
  • #33: Streamstats calculates statistics for each event at the time the event is seen. For example, if I had an event with a temperature reading, I could use streamstats to create a new field giving the temperature difference between that event and one or more previous events. Similar to the delta command, but more powerful. In this example, I'm going to take the bytes field of my access logs and see how much total data is being transferred over time.
  • #34: To create a cumulative sum: sourcetype=access* | timechart sum(bytes) as bytes | streamstats sum(bytes) as cumulative_bytes | timechart max(cumulative_bytes)
  • #35: sourcetype=access* | reverse | streamstats sum(bytes) as bytes_total by status | timechart max(bytes_total) by status
  • #36: sourcetype=access* | timechart avg(bytes) as avg_bytes | streamstats avg(avg_bytes) AS moving_avg_bytes window=10 | timechart latest(moving_avg_bytes) latest(avg_bytes) Bonus: This could also be completed using the trendline command with the simple moving average (sma) parameter: sourcetype=access* | timechart avg(bytes) as avg_bytes | trendline sma10(avg_bytes) as moving_average_bytes | timechart latest(avg_bytes) latest(moving_average_bytes) Double Bonus: Cumulative sum by period sourcetype=access* | timechart span=15m sum(bytes) as cumulative_bytes by status | streamstats global=f sum(cumulative_bytes) as bytes_total
  • #37: A transaction is any group of related events that span time. It's quite useful for finding overall durations, for example how long it took a user to complete a transaction. This really shows the power of Splunk: if you are sending all your data to Splunk, then you have data from multiple subsystems (think database, web server, and app server), and you can see both the overall time and how long each subsystem is taking. Many customers use this to quickly pinpoint whether slowness is caused by the network, the database, or the app server.
  • #38: sourcetype=access* | transaction JSESSIONID
  • #39: sourcetype=access* | transaction JSESSIONID | stats min(duration) max(duration) avg(duration)
  • #40: NOTE: Many transactions can be re-created using stats. Transaction is easy, but stats is far more efficient and is a mappable command (more work is distributed to the indexers). sourcetype=access* | stats min(_time) AS earliest max(_time) AS latest by JSESSIONID | eval duration=latest-earliest | stats min(duration) max(duration) avg(duration)
  • #41: There is much more each of these commands can be used for. Check out answers.splunk.com and docs.splunk.com for many more examples.
  • #56: Android coming soon!
  • #57: Now go do this Fu in your own environment!
  • #58: But don’t just say you know the “Fu”…
  • #60: <If you have time, feel free to show one of your favorite commands or a neat use case of a command. The cluster command is provided here as an example > “There are over 135 splunk commands, the five you have just seen are incredibly powerful. Here is another to add to your arsenal.”
  • #61: You can use the cluster command to learn more about your data and to find common and/or rare events in your data. For example, if you are investigating an IT problem and you don't know specifically what to look for, use the cluster command to find anomalies. In this case, anomalous events are those that aren't grouped into big clusters or clusters that contain few events. Or, if you are searching for errors, use the cluster command to see approximately how many different types of errors there are and what types of errors are common in your data.
  • #62: Decrease the threshold of similarity and see the change in results: sourcetype=access* | cluster field=bc_uri showcount=t t=0.1 | table cluster_count bc_uri _raw | sort -cluster_count