BERLIN BUZZWORDS 2014
SELECTED TALKS OVERVIEW
tech talk @ ferret
Andrii Gakhov
19/06/2014
DEEP LEARNING
FOR HIGH PERFORMANCE TIME-SERIES DATABASES
by Ted Dunning
ABOUT THE AUTHOR
• Chief Application Architect at MapR Technologies
• Ph.D. in computing science from the University of Sheffield
• Committer on Mahout, Drill, Zookeeper …
• http://tdunning.blogspot.de/
• @ted_dunning
ANOMALY DETECTION
[Diagram: the input signal x feeds an online summarizer (t-digest) that estimates the 99.9th percentile, which becomes the threshold t; an alert (!) is raised when x > t]
• The t-digest algorithm was developed by Ted Dunning and is available in Apache Mahout
• With the t-digest algorithm one can accurately estimate quantiles for very large data sets with limited memory use
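The threshold scheme in this diagram can be sketched roughly as follows. This is only an illustration: the summarizer below is an exact sorted list used as a stand-in, not a t-digest, so it is not memory-bounded. The class name, the `warmup` parameter and the synthetic signal are assumptions made for the example.

```python
import bisect
import random


class ExactQuantileSummarizer:
    """Stand-in for an online summarizer (NOT a t-digest).

    It keeps every sample in sorted order, so it is exact but its memory
    use grows with the data, unlike a real t-digest.
    """

    def __init__(self):
        self._sorted = []

    def add(self, x):
        bisect.insort(self._sorted, x)

    def quantile(self, q):
        # Nearest-rank estimate of the q-th quantile (0 <= q <= 1).
        if not self._sorted:
            raise ValueError("no data yet")
        idx = min(len(self._sorted) - 1, int(q * len(self._sorted)))
        return self._sorted[idx]


def detect(signal, q=0.999, warmup=100):
    """Yield (index, value) for samples above the running q-quantile."""
    summarizer = ExactQuantileSummarizer()
    for i, x in enumerate(signal):
        # Compare against the threshold t learned from earlier samples only.
        if i >= warmup and x > summarizer.quantile(q):
            yield i, x
        summarizer.add(x)


random.seed(0)
signal = [random.gauss(0.0, 1.0) for _ in range(5000)]  # synthetic input
signal[4000] = 25.0  # injected spike
alerts = list(detect(signal))
```

Swapping the stand-in for a real t-digest implementation would keep the same `add` / `quantile` shape while bounding memory use.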
ISSUES WITH SIMPLE THRESHOLDS
LOOKS LIKE AN ANOMALY?
NOT SURE
WHAT IS NORMAL?
• We need to have a model of what is normal
• Everything that doesn’t fit the model is an anomaly
• For simple signals we can simply assume a normal distribution
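For the simple case in the last bullet, a minimal sketch could fit a mean and standard deviation to known-good samples and flag values that fall far from the mean. The cut-off of three standard deviations (`k=3.0`) and the sample data are assumptions for illustration, not values from the talk.

```python
import statistics


def fit_normal(samples):
    """Estimate the parameters of the 'normal' model from known-good data."""
    return statistics.mean(samples), statistics.stdev(samples)


def is_anomaly(x, mean, std, k=3.0):
    """Flag x if it lies more than k standard deviations from the mean."""
    return abs(x - mean) > k * std


# Hypothetical sensor readings taken during normal operation.
training = [9.8, 10.1, 10.0, 9.9, 10.2, 10.0, 9.7, 10.3]
mean, std = fit_normal(training)

ordinary = is_anomaly(10.1, mean, std)  # inside the 3-sigma band
outlier = is_anomaly(12.0, mean, std)   # far outside the band
```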
WINDOWS
• A set of windowed signals serves as a model of the original signal
• Clustering can find the prototypes
• The result is a dictionary of shapes
• New signals can be encoded by shifting, scaling and adding shapes from the dictionary
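The windowing and clustering steps above might look like the sketch below. The slides only say "clustering"; plain k-means is used here as an assumption, and the window size, step, number of clusters and the synthetic sine signal are all illustrative choices.

```python
import math
import random


def windows(signal, size, step):
    """Cut the signal into fixed-size windows taken every `step` samples."""
    return [signal[i:i + size] for i in range(0, len(signal) - size + 1, step)]


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length windows."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, iterations=20, seed=0):
    """Plain k-means; the returned centroids act as the dictionary of shapes."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    for _ in range(iterations):
        groups = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda j: sq_dist(p, centroids[j]))
            groups[nearest].append(p)
        for j, group in enumerate(groups):
            if group:  # keep the old centroid if a cluster went empty
                centroids[j] = [sum(col) / len(group) for col in zip(*group)]
    return centroids


# Synthetic periodic signal standing in for real data such as an EKG trace.
signal = [math.sin(2 * math.pi * t / 50) for t in range(2000)]
dictionary = kmeans(windows(signal, size=32, step=8), k=8)
```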
COMMON SHAPES (EKG)
RECONSTRUCTION
ANOMALY
MODEL ANOMALY DETECTION
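[Diagram: the model reconstructs the input signal x as x’; the reconstruction error ∆, computed from x - x’, feeds an online summarizer that estimates the 99.9th percentile as the threshold t; an alert (!) is raised when ∆ > t]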
COMPRESSION
Minimal Error
Maximum likelihood
Maximum compression
• Good anomaly detectors give good compression
• So, we are constructing an auto-encoder!
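One way to see why the three criteria on this slide coincide is the following worked relation. It assumes (this is not stated in the slides) that the residual x - x’ is Gaussian with a fixed variance σ² and that each window has n samples.

```latex
\begin{align}
-\log p(x \mid x') &= \frac{1}{2\sigma^2}\,\lVert x - x' \rVert^2
                      + \frac{n}{2}\log\!\left(2\pi\sigma^2\right) \\
L(x) &= -\log_2 p(x \mid x') \quad \text{(ideal code length in bits)}
\end{align}
```

Under that assumption the second term is a constant, so minimizing the squared reconstruction error, maximizing the likelihood, and minimizing the code length all select the same model.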
MODEL ANOMALY DETECTION
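[Diagram: an encoder backed by the shape dictionary reconstructs the input signal x as x’; the reconstruction error ∆, computed from x - x’, feeds an online summarizer that estimates the 99.9th percentile as the threshold t; an alert (!) is raised when ∆ > t]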
CLUSTERING
• Use windowing
• Find nearest cluster for each window
• Scale cluster to the right size
• Subtract from the original signal
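The four steps above can be sketched as follows. As an assumption made for this example, the "nearest" shape is chosen after scaling each candidate by its least-squares factor, windows do not overlap, and the tiny dictionary and signal are made up for illustration.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def encode_window(window, dictionary):
    """Return (index, scale, residual) for the best-fitting dictionary shape."""
    best = None
    for idx, shape in enumerate(dictionary):
        energy = dot(shape, shape)
        # Least-squares scale that brings the shape to the window's size.
        scale = dot(window, shape) / energy if energy else 0.0
        # Subtract the scaled shape from the original window.
        residual = [w - scale * s for w, s in zip(window, shape)]
        error = dot(residual, residual)
        if best is None or error < best[0]:
            best = (error, idx, scale, residual)
    _, idx, scale, residual = best
    return idx, scale, residual


def reconstruction_errors(signal, dictionary, size):
    """Root-mean-square residual of each non-overlapping window."""
    errors = []
    for start in range(0, len(signal) - size + 1, size):
        _, _, residual = encode_window(signal[start:start + size], dictionary)
        errors.append((dot(residual, residual) / size) ** 0.5)
    return errors


# Hypothetical two-shape dictionary and a signal whose last window is distorted.
dictionary = [[0.0, 1.0, 0.0, -1.0], [1.0, 1.0, 1.0, 1.0]]
signal = [0.0, 2.0, 0.0, -2.0, 3.0, 3.0, 3.0, 3.0, 0.0, 2.0, 5.0, -2.0]
errors = reconstruction_errors(signal, dictionary, size=4)
```

The first two windows are scaled copies of dictionary shapes and reconstruct with zero error, while the distorted third window leaves a large residual; that residual is what the detector would compare against its threshold.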
CLUSTERING AS NEURAL NETWORK
OVERLAPPING NETWORKS
[Figure: time series input and reconstructed time series]
READ MORE?
• A New Look At Anomaly Detection
• This is the second book in the series Practical Machine Learning by Ted Dunning & Ellen Friedman
• FREE download from www.mapr.com

Buzzwords 2014 / Overview / part2