Science of Anomaly Detection
Numenta Workshop
October 17, 2014
Scott Purdy
Engineering Manager
What is an anomaly?
Something that deviates from what is standard, normal, or
expected.
Types of Anomalies
Spatial (static) anomalies Temporal anomalies
Challenges with Temporal Anomalies
Tolerance to noise Continuous learning
• How anomaly detection fits into HTM theory
• How we do anomaly detection
– HTM Learning Algorithms
– Anomaly score processing
• Evaluating anomaly detection algorithms
Outline
What does anomaly detection have to do with
Hierarchical Temporal Memory?
Applications of HTM:
• Prediction
• Classification
• Anomaly Detection
No changes to the
algorithm
Anomaly Detection with HTM
HTM Algorithms
Encoder
SDR
Prediction
Raw anomaly score
Time average
Historical comparison
Anomaly likelihood
Data
How do we turn a data stream into anomaly scores?
Raw Anomaly Score
Raw anomaly score is the fraction of active columns
that were not predicted.
rawAnomalyScore =
At -(Pt-1 Ç At )
At
Pt = Predicted columns at time t
At = Active columns at time t
Load Balancer Example
0
0.5
1
1.5
2
2.5
3
3.5
Latency(s)
Latency
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
RawAnomalyScore
Raw Anomaly Score
Historical Comparison
Compute normal distribution over history
Compute probability for each point relative to the
distribution
m = xP(x)å s = E[(X -m)2
]
Anomaly Likelihood
0
0.2
0.4
0.6
0.8
1
1.2
Raw Anomaly Score Mean and Standard Deviation
mean
std dev
0
0.05
0.1
0.15
0.2
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Probability
Probability Distribution
Anomalies in Load Balancer Latency
Continuous Learning
Highly Predictable Data
Subtle Anomalies
Patterns that humans can’t see but are important
HTM Learning Algorithms Compared to Other Techniques
Anomaly Type Sudden In
predictable
data
In noisy
data
Human
can’t see
HTM Learning Algorithms Yes Yes Yes Yes
Thresholds Yes No No No
Various Statistical Yes Maybe Yes No
Time Series Analysis Yes Yes No No
Distance-based Yes Maybe No No
Supervised Methods N/A N/A N/A N/A
See “The Science of Anomaly Detection” White Paper at numenta.com
Benchmarking Streaming Anomaly Detection
• No training/test set
• No parameter tuning per data sample
• Need real data samples in addition to artificial
• We haven’t found any streaming anomaly detection
benchmarks so far
Numenta Anomaly Benchmark (NAB)
• Work in progress
• High velocity, streaming data
• Currently 21 real data samples and 11 artificial samples
• Hand-labeled, requiring multiple labelers to agree
• Open source, configurable
– Currently runs HTM Learning Algorithms and Etsy Skyline algorithms
• Follow progress at http://github.com/numenta/nab
• Please participate!
Next Steps
Read the white paper http://numenta.com/#technology
Scott Purdy spurdy@numenta.com
NAB http://github.com/numenta/nab
Algorithm code http://numenta.org
@numenta @scottmpurdy

More Related Content

PPTX
Anomaly Detection Using the CLA
PPTX
Evaluating Real-Time Anomaly Detection: The Numenta Anomaly Benchmark
PDF
Predictive Analytics with Numenta Machine Intelligence
PPTX
Applications of Hierarchical Temporal Memory (HTM)
PDF
Numenta Anomaly Benchmark - SF Data Science Meetup
PPTX
Streaming Analytics: It's Not the Same Game
PPTX
Getting Started with Numenta Technology
PPTX
Hierarchical Temporal Memory for Real-time Anomaly Detection
Anomaly Detection Using the CLA
Evaluating Real-Time Anomaly Detection: The Numenta Anomaly Benchmark
Predictive Analytics with Numenta Machine Intelligence
Applications of Hierarchical Temporal Memory (HTM)
Numenta Anomaly Benchmark - SF Data Science Meetup
Streaming Analytics: It's Not the Same Game
Getting Started with Numenta Technology
Hierarchical Temporal Memory for Real-time Anomaly Detection

What's hot (20)

PPTX
HTM & Apache Flink (2016-06-27)
PDF
Analytics for large-scale time series and event data
PPTX
Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...
PPTX
Subutai Ahmad, VP of Research, Numenta at MLconf SF - 11/13/15
PDF
Detecting Anomalies in Streaming Data
PDF
A Practical Guide to Anomaly Detection for DevOps
PDF
Performance engineering methodologies
PPTX
SplunkLive! Prelert Session - Extending Splunk with Machine Learning
PDF
Machine Learning for the Sensored IoT
PDF
Autoencoder Forest for Anomaly Detection from IoT Time Series
PPT
Sporar
PDF
Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016
PPTX
Measure to fail
PDF
Migrating to Prometheus: what we learned running it in production
PPT
Applications of ATPG
PPTX
REAL-TIME SIMULATION TECHNOLOGIES FOR POWER SYSTEMS DESIGN, TESTING, AND ANAL...
PDF
1025 track1 Malin
PPTX
Handling and Analyzing Large Telemetry Data Sets
PDF
Automatic test packet generation in network
PDF
Everybody Lies
HTM & Apache Flink (2016-06-27)
Analytics for large-scale time series and event data
Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...
Subutai Ahmad, VP of Research, Numenta at MLconf SF - 11/13/15
Detecting Anomalies in Streaming Data
A Practical Guide to Anomaly Detection for DevOps
Performance engineering methodologies
SplunkLive! Prelert Session - Extending Splunk with Machine Learning
Machine Learning for the Sensored IoT
Autoencoder Forest for Anomaly Detection from IoT Time Series
Sporar
Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016
Measure to fail
Migrating to Prometheus: what we learned running it in production
Applications of ATPG
REAL-TIME SIMULATION TECHNOLOGIES FOR POWER SYSTEMS DESIGN, TESTING, AND ANAL...
1025 track1 Malin
Handling and Analyzing Large Telemetry Data Sets
Automatic test packet generation in network
Everybody Lies
Ad

Viewers also liked (20)

PDF
Detecting Anomalies in Streaming Data
PPTX
Principles of Hierarchical Temporal Memory - Foundations of Machine Intelligence
PPTX
What the Brain says about Machine Intelligence
PPTX
Sparse Distributed Representations: Our Brain's Data Structure
PPTX
Beginner's Guide to NuPIC
PPTX
2014 Spring NuPIC Hackathon Kickoff
PPTX
A Whole New World [DEMO #4] (2014 Fall NuPIC Hackathon)
PPTX
2014 Fall NuPIC Hackathon Kickoff
PPTX
Temporal memory in racket
PDF
We'll Always Have Paris
PPTX
State of NuPIC
PPTX
Brains, Data, and Machine Intelligence (2014 04 14 London Meetup)
PPTX
Why Neurons have thousands of synapses? A model of sequence memory in the brain
PPTX
Real-Time Streaming Data Analysis with HTM
PDF
Internship_presentation
PPTX
HawkEye : A Real-time Anomaly Detection System
PPTX
"Building Anomaly Detection For Large Scale Analytics", Yonatan Ben Shimon, A...
PDF
Anomaly Detection at Scale
PPTX
Data Mining with Splunk
PPTX
Anomaly Detection - New York Machine Learning
Detecting Anomalies in Streaming Data
Principles of Hierarchical Temporal Memory - Foundations of Machine Intelligence
What the Brain says about Machine Intelligence
Sparse Distributed Representations: Our Brain's Data Structure
Beginner's Guide to NuPIC
2014 Spring NuPIC Hackathon Kickoff
A Whole New World [DEMO #4] (2014 Fall NuPIC Hackathon)
2014 Fall NuPIC Hackathon Kickoff
Temporal memory in racket
We'll Always Have Paris
State of NuPIC
Brains, Data, and Machine Intelligence (2014 04 14 London Meetup)
Why Neurons have thousands of synapses? A model of sequence memory in the brain
Real-Time Streaming Data Analysis with HTM
Internship_presentation
HawkEye : A Real-time Anomaly Detection System
"Building Anomaly Detection For Large Scale Analytics", Yonatan Ben Shimon, A...
Anomaly Detection at Scale
Data Mining with Splunk
Anomaly Detection - New York Machine Learning
Ad

Similar to Science of Anomaly Detection (20)

PPTX
Diagnosis Support by Machine Learning Using Posturography Data
PPTX
Random testing
PDF
Pattern recognition at scale anomaly detection in banking on stream data
PPTX
4Developers 2015: Measure to fail - Tomasz Kowalczewski
PPTX
Testing for cognitive bias in ai systems
PPTX
Anomaly Detection - Real World Scenarios, Approaches and Live Implementation
PDF
Bringing iot data to life, IoT Israel 2014
PDF
Strata 2014-tdunning-anomaly-detection-140211162923-phpapp01
PDF
SmartData Webinar: Applying Neocortical Research to Streaming Analytics
PDF
ATI Courses Professional Development Short Course Applied Measurement Engin...
PPTX
Using Machine Learning to Optimize DevOps Practices
PDF
The math behind big systems analysis.
PPTX
November 15th 2018 denver cu seminar (drew miller) ai robotics cryptocurrency...
PDF
Wastian, Brunmeir - Data Analyses in Industrial Applications: From Predictive...
PPTX
InfoSecurity Europe 2017 - On The Hunt for Advanced Attacks? C&C Channels are...
PDF
Data pipelines and anomaly detection
PPTX
Lie detector
PPTX
Anomalies and events keep us on our toes
PPTX
Av 738 - Adaptive Filtering Lecture 1 - Introduction
PPTX
Multimodal Learning Analytics
Diagnosis Support by Machine Learning Using Posturography Data
Random testing
Pattern recognition at scale anomaly detection in banking on stream data
4Developers 2015: Measure to fail - Tomasz Kowalczewski
Testing for cognitive bias in ai systems
Anomaly Detection - Real World Scenarios, Approaches and Live Implementation
Bringing iot data to life, IoT Israel 2014
Strata 2014-tdunning-anomaly-detection-140211162923-phpapp01
SmartData Webinar: Applying Neocortical Research to Streaming Analytics
ATI Courses Professional Development Short Course Applied Measurement Engin...
Using Machine Learning to Optimize DevOps Practices
The math behind big systems analysis.
November 15th 2018 denver cu seminar (drew miller) ai robotics cryptocurrency...
Wastian, Brunmeir - Data Analyses in Industrial Applications: From Predictive...
InfoSecurity Europe 2017 - On The Hunt for Advanced Attacks? C&C Channels are...
Data pipelines and anomaly detection
Lie detector
Anomalies and events keep us on our toes
Av 738 - Adaptive Filtering Lecture 1 - Introduction
Multimodal Learning Analytics

More from Numenta (20)

PDF
Deep learning at the edge: 100x Inference improvement on edge devices
PDF
Brains@Bay Meetup: A Primer on Neuromodulatory Systems - Srikanth Ramaswamy
PDF
Brains@Bay Meetup: How to Evolve Your Own Lab Rat - Thomas Miconi
PDF
Brains@Bay Meetup: The Increasing Role of Sensorimotor Experience in Artifici...
PDF
Brains@Bay Meetup: Open-ended Skill Acquisition in Humans and Machines: An Ev...
PDF
Brains@Bay Meetup: The Effect of Sensorimotor Learning on the Learned Represe...
PDF
SBMT 2021: Can Neuroscience Insights Transform AI? - Lawrence Spracklen
PDF
FPGA Conference 2021: Breaking the TOPS ceiling with sparse neural networks -...
PDF
BAAI Conference 2021: The Thousand Brains Theory - A Roadmap for Creating Mac...
PDF
Jeff Hawkins NAISys 2020: How the Brain Uses Reference Frames, Why AI Needs t...
PDF
OpenAI’s GPT 3 Language Model - guest Steve Omohundro
PDF
CVPR 2020 Workshop: Sparsity in the neocortex, and its implications for conti...
PDF
Sparsity In The Neocortex, And Its Implications For Machine Learning
PDF
The Thousand Brains Theory: A Framework for Understanding the Neocortex and B...
PPTX
Jeff Hawkins Human Brain Project Summit Keynote: "Location, Location, Locatio...
PPTX
Location, Location, Location - A Framework for Intelligence and Cortical Comp...
PPTX
Have We Missed Half of What the Neocortex Does? A New Predictive Framework ...
PPTX
Locations in the Neocortex: A Theory of Sensorimotor Prediction Using Cortica...
PPTX
The Predictive Neuron: How Active Dendrites Enable Spatiotemporal Computation...
PDF
The Biological Path Toward Strong AI by Matt Taylor (05/17/18)
Deep learning at the edge: 100x Inference improvement on edge devices
Brains@Bay Meetup: A Primer on Neuromodulatory Systems - Srikanth Ramaswamy
Brains@Bay Meetup: How to Evolve Your Own Lab Rat - Thomas Miconi
Brains@Bay Meetup: The Increasing Role of Sensorimotor Experience in Artifici...
Brains@Bay Meetup: Open-ended Skill Acquisition in Humans and Machines: An Ev...
Brains@Bay Meetup: The Effect of Sensorimotor Learning on the Learned Represe...
SBMT 2021: Can Neuroscience Insights Transform AI? - Lawrence Spracklen
FPGA Conference 2021: Breaking the TOPS ceiling with sparse neural networks -...
BAAI Conference 2021: The Thousand Brains Theory - A Roadmap for Creating Mac...
Jeff Hawkins NAISys 2020: How the Brain Uses Reference Frames, Why AI Needs t...
OpenAI’s GPT 3 Language Model - guest Steve Omohundro
CVPR 2020 Workshop: Sparsity in the neocortex, and its implications for conti...
Sparsity In The Neocortex, And Its Implications For Machine Learning
The Thousand Brains Theory: A Framework for Understanding the Neocortex and B...
Jeff Hawkins Human Brain Project Summit Keynote: "Location, Location, Locatio...
Location, Location, Location - A Framework for Intelligence and Cortical Comp...
Have We Missed Half of What the Neocortex Does? A New Predictive Framework ...
Locations in the Neocortex: A Theory of Sensorimotor Prediction Using Cortica...
The Predictive Neuron: How Active Dendrites Enable Spatiotemporal Computation...
The Biological Path Toward Strong AI by Matt Taylor (05/17/18)

Recently uploaded (20)

PDF
Global Data and Analytics Market Outlook Report
PPTX
Steganography Project Steganography Project .pptx
PPTX
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
PDF
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
PPT
statistic analysis for study - data collection
PDF
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
PDF
Introduction to the R Programming Language
PPT
DU, AIS, Big Data and Data Analytics.ppt
PPT
Image processing and pattern recognition 2.ppt
PPTX
Lesson-01intheselfoflifeofthekennyrogersoftheunderstandoftheunderstanded
PPTX
CYBER SECURITY the Next Warefare Tactics
PPTX
Topic 5 Presentation 5 Lesson 5 Corporate Fin
PPT
Predictive modeling basics in data cleaning process
PPTX
chrmotography.pptx food anaylysis techni
PPTX
sac 451hinhgsgshssjsjsjheegdggeegegdggddgeg.pptx
PPTX
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
PPTX
New ISO 27001_2022 standard and the changes
PDF
Microsoft Core Cloud Services powerpoint
PDF
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
PPTX
Phase1_final PPTuwhefoegfohwfoiehfoegg.pptx
Global Data and Analytics Market Outlook Report
Steganography Project Steganography Project .pptx
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
statistic analysis for study - data collection
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
Introduction to the R Programming Language
DU, AIS, Big Data and Data Analytics.ppt
Image processing and pattern recognition 2.ppt
Lesson-01intheselfoflifeofthekennyrogersoftheunderstandoftheunderstanded
CYBER SECURITY the Next Warefare Tactics
Topic 5 Presentation 5 Lesson 5 Corporate Fin
Predictive modeling basics in data cleaning process
chrmotography.pptx food anaylysis techni
sac 451hinhgsgshssjsjsjheegdggeegegdggddgeg.pptx
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
New ISO 27001_2022 standard and the changes
Microsoft Core Cloud Services powerpoint
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
Phase1_final PPTuwhefoegfohwfoiehfoegg.pptx

Science of Anomaly Detection

  • 1. Science of Anomaly Detection Numenta Workshop October 17, 2014 Scott Purdy Engineering Manager
  • 2. What is an anomaly? Something that deviates from what is standard, normal, or expected.
  • 3. Types of Anomalies Spatial (static) anomalies Temporal anomalies
  • 4. Challenges with Temporal Anomalies Tolerance to noise Continuous learning
  • 5. • How anomaly detection fits into HTM theory • How we do anomaly detection – HTM Learning Algorithms – Anomaly score processing • Evaluating anomaly detection algorithms Outline
  • 6. What does anomaly detection have to do with Hierarchical Temporal Memory? Applications of HTM: • Prediction • Classification • Anomaly Detection No changes to the algorithm
  • 7. Anomaly Detection with HTM HTM Algorithms Encoder SDR Prediction Raw anomaly score Time average Historical comparison Anomaly likelihood Data How do we turn a data stream into anomaly scores?
  • 8. Raw Anomaly Score Raw anomaly score is the fraction of active columns that were not predicted. rawAnomalyScore = At -(Pt-1 Ç At ) At Pt = Predicted columns at time t At = Active columns at time t
  • 10. Historical Comparison Compute normal distribution over history Compute probability for each point relative to the distribution m = xP(x)å s = E[(X -m)2 ]
  • 11. Anomaly Likelihood 0 0.2 0.4 0.6 0.8 1 1.2 Raw Anomaly Score Mean and Standard Deviation mean std dev 0 0.05 0.1 0.15 0.2 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Probability Probability Distribution
  • 12. Anomalies in Load Balancer Latency
  • 15. Subtle Anomalies Patterns that humans can’t see but are important
  • 16. HTM Learning Algorithms Compared to Other Techniques Anomaly Type Sudden In predictable data In noisy data Human can’t see HTM Learning Algorithms Yes Yes Yes Yes Thresholds Yes No No No Various Statistical Yes Maybe Yes No Time Series Analysis Yes Yes No No Distance-based Yes Maybe No No Supervised Methods N/A N/A N/A N/A See “The Science of Anomaly Detection” White Paper at numenta.com
  • 17. Benchmarking Streaming Anomaly Detection • No training/test set • No parameter tuning per data sample • Need real data samples in addition to artificial • We haven’t found any streaming anomaly detection benchmarks so far
  • 18. Numenta Anomaly Benchmark (NAB) • Work in progress • High velocity, streaming data • Currently 21 real data samples and 11 artificial samples • Hand-labeled, requiring multiple labelers to agree • Open source, configurable – Currently runs HTM Learning Algorithms and Etsy Skyline algorithms • Follow progress at http://github.com/numenta/nab • Please participate!
  • 19. Next Steps Read the white paper http://numenta.com/#technology Scott Purdy spurdy@numenta.com NAB http://github.com/numenta/nab Algorithm code http://numenta.org @numenta @scottmpurdy