SlideShare a Scribd company logo
Concept Drift: Monitoring Model Quality In Streaming ML Applications
model

quality
Concept Drift: Monitoring Model Quality In Streaming ML Applications
* same data generating distribution

(Some algorithms tolerate violation of this to a certain degree.)
training set
operation=*
core problem
core problem
stream
core problem
stream
population change
core problem
stream
sensor failure
core problem
stream
concept drift
core problem
stream
emerging 

concept
common solution
model batch
poor
classifiermargin
can active learning help?
color
size
classifiermargin
can active learning help?
color
size
classifiermargin
can active learning help?
color
size
classifiermargin
can active learning help?
color
size
classifiermargin
can active learning help?
color
size
classifiermargin
can active learning help?
color
size
a better solution
data
classifier
feature extraction
predictions
a better solution
data
classifier
feature extraction
predictions
monitoring
a better solution
data
classifier
feature extraction
predictions
monitoring
a better solution
data
classifier
feature extraction
predictions
monitoring
a better solution
data
classifier
feature extraction
predictions
monitoring
a better solution
data
classifier
labeling
-
feature extraction
predictions
monitoring
a better solution
data
classifier
labeling
-change detection
adaptation
feature extraction
predictions
monitor how?
supervised
unsupervised
statistical process control
sequential analysis
error distribution monitoring
clustering / novelty detection
feature distribution monitoring
model-dependent monitoring
adapt how?
explicit 

mechanisms
implicit 

mechanisms
windowing
weighting
sampling
pure methods
ensemble methods
which method?
monitor how?
supervised
unsupervised
statistical process control
sequential analysis
error distribution monitoring
clustering / novelty detection
feature distribution monitoring
model-dependent monitoring
ML theory: samples errors
statistical process control
• Drift Detection Method [DDM]
• # of errors is Binomial:
• alert:
statistical process control
• Early Drift Detection Method [EDDM]
• distance between errors 

better for gradual drift
• warn & start caching:
• alert and reset max:
• Drift Detection Method [DDM]
• # of errors is Binomial:
• alert:
monitor how?
supervised
unsupervised
statistical process control
sequential analysis
error distribution monitoring
clustering / novelty detection
feature distribution monitoring
model-dependent monitoring
sequential analysis
• Linear Four Rates [LFR]
• stationary data => constant contingency table
0 1
0 TN FN
1 FP TP
Predicted
True
sequential analysis
• Linear Four Rates [LFR]
• stationary data => constant contingency table
• calculate four rates
0 1
0 TN FN
1 FP TP
Predicted
True
sequential analysis
• Linear Four Rates [LFR]
• stationary data => constant contingency table
• calculate four rates
• incremental updates
0 1
0 TN FN
1 FP TP
Predicted
True
sequential analysis
• Linear Four Rates [LFR]
• stationary data => constant contingency table
• calculate four rates
• incremental updates
• test for change
• Monte Carlo sampling 

for significance level
• Bonferoni correction 

for correlated tests
• O(1)
• Better than (E)DDM 

for class imbalance
0 1
0 TN FN
1 FP TP
Predicted
True
monitor how?
supervised
unsupervised
statistical process control
sequential analysis
error distribution monitoring
clustering / novelty detection
feature distribution monitoring
model-dependent monitoring
error distribution monitoring
• ADaptive WINdowing [ADWIN]
• Consider all partitions of a window









• Drop the last element if any





• Efficient version O(log W)
• Data structure for windows ~ exponential histograms
• Drop last window rather than last element
w0 w1
prediction
errors
resampling
• Prediction loss over random permutations vs. ordered training data
• Parallel permutation test version available
• Still expensive
• Only method directly applicable to regression setting
• Side note: Even with finite training set, drift could be problematic if model is developed
naively.
monitor how?
supervised
unsupervised
statistical process control
sequential analysis
error distribution monitoring
clustering / novelty detection
feature distribution monitoring
model-dependent monitoring
clustering / novelty detection
• OLINDDA: K-means, periodically
merge unknown to known or flag
• MINAS: micro-clusters, incremental
stream clustering
• DETECTNOD: Discrete Cosine
Transform to estimate distances
efficiently
• Woo-ensemble: Treat outliers as
potential emerging class centroids
• ECSMiner: Store and use cluster
summary efficiently
• GC3: Grid based clustering
size
color
Curse of 

Dimensionality
clustering / novelty detection
size
color
• OLINDDA: K-means, periodically
merge unknown to known or flag
• MINAS: micro-clusters, incremental
stream clustering
• DETECTNOD: Discrete Cosine
Transform to estimate distances
efficiently
• Woo-ensemble: Treat outliers as
potential emerging class centroids
• ECSMiner: Store and use cluster
summary efficiently
• GC3: Grid based clustering
monitor how?
supervised
unsupervised
statistical process control
sequential analysis
error distribution monitoring
clustering / novelty detection
feature distribution monitoring
model-dependent monitoring
feature distribution monitoring
• Monitor individual features
• Many ways to compare:
• Pearson correlation [Change of Concept - CoC]
• Hellinger distance [HDDDM] ~ O(DB)
• PCA to reduce the number of features to track (top [PCA-1] or bottom [PCA-2] n%)
w0
w1
color
size
samples
monitor how?
supervised
unsupervised
statistical process control
sequential analysis
error distribution monitoring
clustering / novelty detection
feature distribution monitoring
model-dependent monitoring
model-dependent monitoring
• Not all changes matter
• Posterior probability estimate
• Use [A-distance] ~ generalized Kolmogorov-Simirnov distance
• designed to be less sensitive to irrelevant changes
L1-distance KS-distance A-distance
model-dependent monitoring
• [Margin] distribution
• rank statistic on density estimates for a 

binary representation of the data,
• compare average margins of a linear classifier 

induced by the 1-norm SVM
• based on the average zero-one or sigmoid error 

rate of an SVM classifier
• Generalized margin [MD3]:
• Embed base classifier in a 

Random Feature Bagged Ensemble
• Margin == high disagreement region of the ensemble
m
argin
“margin”
“margin”
adapt how?
explicit 

mechanisms
implicit 

mechanisms
windowing
weighting
sampling
pure methods
ensemble methods
explicit mechanisms for adaptation
W
stationary
W
drift
ADWIN
Drop the last sub-window 

if threshold is exceeded. = Adaptively shrink
window during drift.
explicit mechanisms for adaptation
* Adaptation goes through a similar refinement process.
JIT w
0
m
0
m
1
m
2
m
3
m
4
I
0
I
1
I
2
I
3
I
4
change detected *
w
1
w
2
w
3
w
4
adapt how?
explicit 

mechanisms
implicit 

mechanisms
windowing
weighting
sampling
pure methods
ensemble methods
explicit mechanisms for adaptation
Biased

Reservoir

Sampling
bias:
capacity:
overwrite / exchange
randomly w/ Prob{ %full }
or append
adapt how?
explicit 

mechanisms
implicit 

mechanisms
windowing
weighting
sampling
pure methods
ensemble methods
implicit mechanisms for adaptation
Ensemble Based Adaptation
ensemble 1 ensemble (N-1) ensemble N
train new member
implicit mechanisms for adaptation
retire / decay train new member
ensemble 1 ensemble (N-1) ensemble N
Ensemble Based Adaptation
implicit mechanisms for adaptation
retire / decay
recurring
train new member
ensemble 1 ensemble (N-1) ensemble N
Ensemble Based Adaptation
implicit mechanisms for adaptation
• Online NonStationary boosting [ONSboost]
• NonStationary Random Forests [NSRF]
• Dynamic Weighted Majority [DWM]
• Learn++ for NonStationary Environments [Learn++.NSE]
retire / decay
recurring
train new member
ensemble 1 ensemble (N-1) ensemble N
Ensemble Based Adaptation
which method?
Method Efficiency Pros Cons Notes
DDM/EDDM O(1) no data stored
label cost
false alarms sampling 

necessary 

in case of 

fast data,
microservices

architecture

ideal
LFR O(1) class imbalance OK label cost
ADWIN O(log W)
better change
localization
label cost
JIT O(log W) no labels required only for abrupt changes best localization
which method?
Method Efficiency Pros Cons Notes
ECSMiner / GC3 O(W
2
/ k)

O(G log C)
emerging concepts
clusterable 

drift only
use if emerging
concepts expected
HDDDM O(DB) no labels
not for population
drift or class
imbalance
better when combined
with PCA
A-distance O(log W) no labels
less false positives
compared to HDDDM
good choice for
unsupervised
Margin / MD3
Learning, detection,
adaptation bundled
reduced false alarms
must use feature
bagged ensembles
best choice but must
commit to using the
specific machine
learning algorithmsEnsemble methods recurring concepts large batches
references
https://gist.github.com/emrev12/0d75dc2d6c3e80012d10a82712b8ced0
Check out these resources:
Dean’s book
Webinars
etc.Fast Data Architectures 

for Streaming Applications
Getting Answers Now from Data Sets that Never End
By Dean Wampler, Ph. D., VP of Fast Data Engineering
60
LIGHTBEND.COM/LEARN
Serving Machine Learning Models
A Guide to Architecture, Stream Processing Engines, 

and Frameworks
By Boris Lublinsky, Fast Data Platform Architect
61
LIGHTBEND.COM/LEARN
lightbend.com/fast-data-platform
thank you!
emre.velipasaoglu@ .com

More Related Content

PDF
Handling concept drift in data stream mining
PPTX
Scikit-Learn Tutorial | Machine Learning With Scikit-Learn | Sklearn | Python...
PDF
Scalable Detection of Concept Drifts on Data Streams with Parallel Adaptive W...
PDF
Machine Learning Deep Learning AI and Data Science
PPTX
Automated Machine Learning (Auto ML)
PPT
Reinforcement Learning Q-Learning
PPTX
Data drift and machine learning
PPTX
Recommendation Systems
Handling concept drift in data stream mining
Scikit-Learn Tutorial | Machine Learning With Scikit-Learn | Sklearn | Python...
Scalable Detection of Concept Drifts on Data Streams with Parallel Adaptive W...
Machine Learning Deep Learning AI and Data Science
Automated Machine Learning (Auto ML)
Reinforcement Learning Q-Learning
Data drift and machine learning
Recommendation Systems

What's hot (20)

PDF
Lecture 1: What is Machine Learning?
PDF
Machine Learning Ml Overview Algorithms Use Cases And Applications
PDF
An introduction to computer vision with Hugging Face
PDF
Introduction to pattern recognition
PDF
The Evolution of AutoML
PDF
ML Drift - How to find issues before they become problems
PPT
Supervised and unsupervised learning
PDF
Explainability for Natural Language Processing
PDF
What is MLOps
PPTX
Microsoft Introduction to Automated Machine Learning
PDF
Monitoring AI with AI
PPTX
Deep neural networks
PPTX
Recurrent neural network
PPT
Machine Learning
PDF
Monitoring and observability
PDF
Introducing Neo4j
PDF
Kdd 2014 Tutorial - the recommender problem revisited
PPT
★Mean shift a_robust_approach_to_feature_space_analysis
PDF
From Idea to Execution: Spotify's Discover Weekly
PPTX
Introduction to continual learning
Lecture 1: What is Machine Learning?
Machine Learning Ml Overview Algorithms Use Cases And Applications
An introduction to computer vision with Hugging Face
Introduction to pattern recognition
The Evolution of AutoML
ML Drift - How to find issues before they become problems
Supervised and unsupervised learning
Explainability for Natural Language Processing
What is MLOps
Microsoft Introduction to Automated Machine Learning
Monitoring AI with AI
Deep neural networks
Recurrent neural network
Machine Learning
Monitoring and observability
Introducing Neo4j
Kdd 2014 Tutorial - the recommender problem revisited
★Mean shift a_robust_approach_to_feature_space_analysis
From Idea to Execution: Spotify's Discover Weekly
Introduction to continual learning
Ad

Similar to Concept Drift: Monitoring Model Quality In Streaming ML Applications (20)

PPT
NEURAL Network Design Training
PDF
Heuristic design of experiments w meta gradient search
PPT
Nbvtalkonfeatureselection
PPTX
Feature Selection Techniques for Software Fault Prediction (Summary)
PPTX
A Sparse-Coding Based Approach for Class-Specific Feature Selection
PPTX
Comparative Study of Machine Learning Algorithms for Sentiment Analysis with ...
PDF
Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...
PPTX
Dimensionality Reduction.pptx
PDF
featurers_Machinelearning___________.pdf
PDF
2023_ASPRS_LiDAR_Division_Update.pdf
PPTX
dimentionalityreduction-241109090040-5290a6cd.pptx
PDF
Mattar_PhD_Thesis
PPTX
Robust Filtering Schemes for Machine Learning Systems to Defend Adversarial A...
PDF
PDF
Dimentionality Reduction PCA Version 1.pdf
PPTX
Support Vector machine(SVM) and Random Forest
PDF
GLM & GBM in H2O
PPTX
crossvalidation.pptx
PPTX
Machine learning with scikitlearn
PDF
Unsupervised Data Augmentation for Consistency Training
NEURAL Network Design Training
Heuristic design of experiments w meta gradient search
Nbvtalkonfeatureselection
Feature Selection Techniques for Software Fault Prediction (Summary)
A Sparse-Coding Based Approach for Class-Specific Feature Selection
Comparative Study of Machine Learning Algorithms for Sentiment Analysis with ...
Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...
Dimensionality Reduction.pptx
featurers_Machinelearning___________.pdf
2023_ASPRS_LiDAR_Division_Update.pdf
dimentionalityreduction-241109090040-5290a6cd.pptx
Mattar_PhD_Thesis
Robust Filtering Schemes for Machine Learning Systems to Defend Adversarial A...
Dimentionality Reduction PCA Version 1.pdf
Support Vector machine(SVM) and Random Forest
GLM & GBM in H2O
crossvalidation.pptx
Machine learning with scikitlearn
Unsupervised Data Augmentation for Consistency Training
Ad

More from Lightbend (20)

PDF
IoT 'Megaservices' - High Throughput Microservices with Akka
PDF
How Akka Cluster Works: Actors Living in a Cluster
PDF
The Reactive Principles: Eight Tenets For Building Cloud Native Applications
PDF
Putting the 'I' in IoT - Building Digital Twins with Akka Microservices
PDF
Akka at Enterprise Scale: Performance Tuning Distributed Applications
PDF
Digital Transformation with Kubernetes, Containers, and Microservices
PDF
Detecting Real-Time Financial Fraud with Cloudflow on Kubernetes
PDF
Cloudstate - Towards Stateful Serverless
PDF
Digital Transformation from Monoliths to Microservices to Serverless and Beyond
PDF
Akka Anti-Patterns, Goodbye: Six Features of Akka 2.6
PPTX
Lessons From HPE: From Batch To Streaming For 20 Billion Sensors With Lightbe...
PDF
How to build streaming data pipelines with Akka Streams, Flink, and Spark usi...
PDF
Microservices, Kubernetes, and Application Modernization Done Right
PDF
Full Stack Reactive In Practice
PDF
Akka and Kubernetes: A Symbiotic Love Story
PPTX
Scala 3 Is Coming: Martin Odersky Shares What To Know
PDF
Migrating From Java EE To Cloud-Native Reactive Systems
PDF
Running Kafka On Kubernetes With Strimzi For Real-Time Streaming Applications
PDF
Designing Events-First Microservices For A Cloud Native World
PDF
Scala Security: Eliminate 200+ Code-Level Threats With Fortify SCA For Scala
IoT 'Megaservices' - High Throughput Microservices with Akka
How Akka Cluster Works: Actors Living in a Cluster
The Reactive Principles: Eight Tenets For Building Cloud Native Applications
Putting the 'I' in IoT - Building Digital Twins with Akka Microservices
Akka at Enterprise Scale: Performance Tuning Distributed Applications
Digital Transformation with Kubernetes, Containers, and Microservices
Detecting Real-Time Financial Fraud with Cloudflow on Kubernetes
Cloudstate - Towards Stateful Serverless
Digital Transformation from Monoliths to Microservices to Serverless and Beyond
Akka Anti-Patterns, Goodbye: Six Features of Akka 2.6
Lessons From HPE: From Batch To Streaming For 20 Billion Sensors With Lightbe...
How to build streaming data pipelines with Akka Streams, Flink, and Spark usi...
Microservices, Kubernetes, and Application Modernization Done Right
Full Stack Reactive In Practice
Akka and Kubernetes: A Symbiotic Love Story
Scala 3 Is Coming: Martin Odersky Shares What To Know
Migrating From Java EE To Cloud-Native Reactive Systems
Running Kafka On Kubernetes With Strimzi For Real-Time Streaming Applications
Designing Events-First Microservices For A Cloud Native World
Scala Security: Eliminate 200+ Code-Level Threats With Fortify SCA For Scala

Recently uploaded (20)

PDF
Odoo Companies in India – Driving Business Transformation.pdf
PPTX
history of c programming in notes for students .pptx
PDF
medical staffing services at VALiNTRY
PDF
Softaken Excel to vCard Converter Software.pdf
PDF
Navsoft: AI-Powered Business Solutions & Custom Software Development
PDF
System and Network Administraation Chapter 3
PDF
Design an Analysis of Algorithms I-SECS-1021-03
PPTX
Introduction to Artificial Intelligence
PPTX
Transform Your Business with a Software ERP System
PDF
Upgrade and Innovation Strategies for SAP ERP Customers
PDF
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
PDF
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
PDF
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
PPT
Introduction Database Management System for Course Database
PPTX
Embracing Complexity in Serverless! GOTO Serverless Bengaluru
PDF
Nekopoi APK 2025 free lastest update
PDF
Design an Analysis of Algorithms II-SECS-1021-03
PPTX
L1 - Introduction to python Backend.pptx
PPTX
VVF-Customer-Presentation2025-Ver1.9.pptx
Odoo Companies in India – Driving Business Transformation.pdf
history of c programming in notes for students .pptx
medical staffing services at VALiNTRY
Softaken Excel to vCard Converter Software.pdf
Navsoft: AI-Powered Business Solutions & Custom Software Development
System and Network Administraation Chapter 3
Design an Analysis of Algorithms I-SECS-1021-03
Introduction to Artificial Intelligence
Transform Your Business with a Software ERP System
Upgrade and Innovation Strategies for SAP ERP Customers
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
Introduction Database Management System for Course Database
Embracing Complexity in Serverless! GOTO Serverless Bengaluru
Nekopoi APK 2025 free lastest update
Design an Analysis of Algorithms II-SECS-1021-03
L1 - Introduction to python Backend.pptx
VVF-Customer-Presentation2025-Ver1.9.pptx

Concept Drift: Monitoring Model Quality In Streaming ML Applications