Buzzwords 2014 / Overview / part2

BERLIN BUZZWORDS 2014

SELECTED TALKS OVERVIEW
tech talk @ ferret
Andrii Gakhov

19/06/2014

DEEP LEARNING
FOR HIGH PERFORMANCE TIME-SERIES DATABASES
by Ted Dunning

ABOUT THE AUTHOR
• Chief Application Architect at MapR Technologies

• Ph.D. in computing science from the University of Sheffield

• Committer on Mahout, Drill, Zookeeper …

• http://tdunning.blogspot.de/

• @ted_dunning

ANOMALY DETECTION
[Diagram: input signal x → online summarizer (t-digest) → 99.9%-ile threshold t → test x > t? → alert (!)]

• The t-digest algorithm was developed by Ted Dunning and is available in Apache Mahout

• With the t-digest algorithm one can accurately estimate quantiles for very large data sets with limited memory use
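
The thresholding loop on this slide can be sketched roughly as follows. This is a minimal sketch, not the t-digest itself: the stand-in summarizer keeps every sample in a sorted list and reads off an exact quantile, whereas t-digest approximates the quantile in bounded memory. The names `ExactQuantileSummarizer`, `detect_anomalies` and the `warmup` parameter are hypothetical and do not come from the talk.

```python
import bisect


class ExactQuantileSummarizer:
    """Stand-in for an online summarizer; stores all samples (NOT t-digest)."""

    def __init__(self):
        self._sorted = []

    def add(self, x):
        # Keep the sample list sorted so quantiles are a simple index lookup.
        bisect.insort(self._sorted, x)

    def quantile(self, q):
        if not self._sorted:
            raise ValueError("no samples yet")
        idx = min(len(self._sorted) - 1, int(q * len(self._sorted)))
        return self._sorted[idx]


def detect_anomalies(signal, q=0.999, warmup=100):
    """Return (index, value) pairs where x exceeds the running q-quantile t."""
    summarizer = ExactQuantileSummarizer()
    alerts = []
    for i, x in enumerate(signal):
        if i >= warmup:  # assumption: wait for some history before alerting
            t = summarizer.quantile(q)
            if x > t:
                alerts.append((i, x))
        summarizer.add(x)
    return alerts
```

The threshold t is taken from the samples seen so far, so the detector adapts as the signal's distribution drifts; swapping in a bounded-memory summarizer would only change the `ExactQuantileSummarizer` class.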

WHAT IS NORMAL?
• We need to have a model of what is normal

• Everything that doesn't fit the model is an anomaly

• For simple signals we can simply assume a normal distribution
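
For the simple case in the last bullet, one possible sketch is a running mean and standard deviation with a cut-off at k standard deviations. The class name `RunningNormal` and the default `k=3.0` are assumptions made for illustration; the slides do not specify a cut-off.

```python
import math


class RunningNormal:
    """Online mean/variance (Welford's method) as a model of 'normal'."""

    def __init__(self):
        self.n = 0
        self.mean = 0.0
        self._m2 = 0.0  # running sum of squared deviations from the mean

    def add(self, x):
        self.n += 1
        delta = x - self.mean
        self.mean += delta / self.n
        self._m2 += delta * (x - self.mean)

    def std(self):
        # Sample standard deviation; 0.0 until at least two samples are seen.
        return math.sqrt(self._m2 / (self.n - 1)) if self.n > 1 else 0.0

    def is_anomaly(self, x, k=3.0):
        # Assumption: anything further than k standard deviations is anomalous.
        s = self.std()
        return s > 0 and abs(x - self.mean) > k * s
```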

WINDOWS
• A set of windowed signals is a model of the original signal

• Clustering can find the prototypes

• The result is a dictionary of shapes

• New signals can be encoded by shifting, scaling and adding shapes from the dictionary
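
Building such a dictionary could look roughly like the sketch below: cut the signal into fixed-width windows and cluster them with a plain k-means, so that the centroids serve as the shapes. The window width, step, number of clusters and the function names are all hypothetical choices, and the talk's actual clustering method may differ.

```python
import random


def windows(signal, width, step):
    """Cut the signal into fixed-width windows, one every `step` samples."""
    return [signal[i:i + width] for i in range(0, len(signal) - width + 1, step)]


def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, iterations=20, seed=0):
    """Plain k-means; the returned centroids act as the dictionary of shapes."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    for _ in range(iterations):
        groups = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda j: sq_dist(p, centroids[j]))
            groups[nearest].append(p)
        for j, group in enumerate(groups):
            if group:  # an empty cluster keeps its previous centroid
                centroids[j] = [sum(col) / len(group) for col in zip(*group)]
    return centroids
```

For example, `kmeans(windows(signal, 32, 8), k=4)` would yield four prototype shapes of 32 samples each.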

MODEL ANOMALY DETECTION
[Diagram: input signal x → model → reconstruction x'; reconstruction error ∆ = x - x' → online summarizer → 99.9%-ile threshold t → test ∆ > t? → alert (!)]

COMPRESSION
Minimal Error
Maximum likelihood
Maximum compression
• Good anomaly detectors give good compression

• So, we are constructing an auto-encoder!

MODEL ANOMALY DETECTION
[Diagram: input signal x → encoder (using the shape dictionary) → reconstruction x'; reconstruction error ∆ = x - x' → online summarizer → 99.9%-ile threshold t → test ∆ > t? → alert (!)]

CLUSTERING
• Use windowing

• Find nearest cluster for each window

• Scale cluster to the right size

• Subtract from the original signal
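
The four steps above can be sketched as follows. This is a simplified illustration under stated assumptions: windows do not overlap, the scale factor is the least-squares fit of the shape to the window, and "nearest" is judged after scaling. The function names and the tiny example dictionary are hypothetical.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def encode_window(window, dictionary):
    """Return (index, scale) of the shape that best fits after scaling."""
    best = None
    for j, shape in enumerate(dictionary):
        energy = dot(shape, shape)
        # Least-squares scale factor for fitting this shape to the window.
        scale = dot(window, shape) / energy if energy > 0 else 0.0
        err = sum((w - scale * s) ** 2 for w, s in zip(window, shape))
        if best is None or err < best[0]:
            best = (err, j, scale)
    return best[1], best[2]


def reconstruction_error(signal, dictionary, width):
    """Residual x - x' computed over non-overlapping windows."""
    residual = []
    for start in range(0, len(signal) - width + 1, width):
        window = signal[start:start + width]
        j, scale = encode_window(window, dictionary)
        residual.extend(w - scale * s for w, s in zip(window, dictionary[j]))
    return residual


shapes = [[1.0, 2.0, 1.0], [1.0, -1.0, 1.0]]  # toy dictionary
signal = [2.0, 4.0, 2.0, 3.0, -3.0, 3.0, 0.0, 0.0, 5.0]
residual = reconstruction_error(signal, shapes, width=3)
```

In this toy example the first two windows are exact multiples of a dictionary shape and leave a zero residual, while the last window matches neither shape and leaves a large one; that residual is what the threshold test on ∆ would flag.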

OVERLAPPING NETWORKS
Time series input
Reconstructed time series

READ MORE?
• A New Look at Anomaly Detection

• This is the second book in the series Practical Machine Learning by Ted Dunning & Ellen Friedman

• FREE download from www.mapr.com
