SlideShare a Scribd company logo
Machine Learning
The High Interest Credit Card of Technical Debt
The Market Intelligence Company of the Digital World
$65M
Funding
2007
Founded
6
Offices
300+
Employees
Market Intelligence Company
of the Digital World
The
Machine learning the high interest credit card of technical debt [PWL]
Learned | Estimated
Machine learning: The high interest credit card of technical debt
(2014)
Hidden technical debt in machine learning systems (2015)
D. Sculley, Gary Holt, Daniel Golovin, Eugene Davydov, Todd Phillips,
Dietmar Ebner, Vinay Chaudhary, Michael Young, Jean-Francois
Crespo, Dan Dennison
A Few Words About The Papers
Systems engineering papers
About Machine Learning systems
Give a lot of names to a lot of things (which we know is hard)
We found them in 2015 and liked them a lot
A Few Words About The Papers
What is ML and what is Technical Debt?
Sources of Technical Debt in ML systems
Mitigation
Today
Machine Learning
Train Predict
Data
Algorithm
Data
Data
Hyperparameters
Why Machine Learning?
Allows us to convert data to software
We often already have data
Some problems are hard or impossible to
solve otherwise
http://xkcd.com/1425/
A metaphor for the long term costs of moving quickly
Lack of testing, bad modularity, non-redundant systems, etc.
Somewhat similar to fiscal debt
There are good reasons to take it, but it needs to be serviced
Hidden technical debt - a special, evil, variant
Technical Debt
Boundary Erosion
Components, interfaces, all that jazz
Think MVC, microservices
Implicitly assumed in “good” systems
Makes components easy to:
- Test
- Change
- Reason about
- Monitor
Boundaries in Systems Engineering
Entanglement
ML System “Inputs”
Learning
settings
Hyperparams
Data prep
settings
Real world
inputs
?
Other systems
outputs
Issues
Change in distribution of any input influences
all outputs
Adding/Removing a feature changes the
model and output distribution
Any configuration parameter is just as
coupled
Retraining not reproducible
Changing Anything Changes Everything (CACE)
Model parts
Correction Cascades
Output Output Output
We sometimes use output from an existing
model as a feature to get a small correction
Easier than training a new model
Easier than teaching an existing model new
tricks
A
B
C
Correction Cascades
Output Output Output
Improvement
Degradation
Model improvements cause degradation
down the line
Corrections might lead to an “improvement
deadlock”
A
B
C
Outputs of ML systems include:
- Predictions
- Weights and other state
Data is easy to consume
In turn makes it hard to improve model
May create hidden feedback loops
Undeclared Consumers
Data Dependencies
Data Dependencies
Regular system
ComponentInput
Component Output
ComponentInput Output
Data Dependencies
Regular system
ComponentInput
Component Output
ComponentInput Output
Data dependency
Data Dependencies
Regular system
ComponentInput
ML System
Component Output
ComponentInput Output
Input Logs
Weights
Output
ML
Component
Trainer
PredictInput Output
Data dependency
Data Dependencies
Regular system
ComponentInput
ML System
Component Output
ComponentInput Output
Input Logs
Weights
Output
ML
Component
Trainer
PredictInput Output
Data dependency
Features for training can be outputs of other models
IDF tables, Word2Vec embeddings..
Logs, intermediate results, monitoring feeds..
But if they change schema?
Stop being updated?
Disappear?
Unstable Dependencies
Legacy features - Nobody maintains / wants to maintain them
Bundled features - Not sure which ones we need
Correlated features - May mask features with actual causality
Epsilon features - Improve the result by very little
Underutilized Dependencies
Software Issues
ML as Software
Actual machine learning is a lot more than modeling
Configuration
Data
Collection
Feature
Extraction
Data
Verification
Process Management
Resource
Management
Analysis Tools Serving
Infrastructure
Monitoring
Model
Glue code
Software issues
Pipeline jungles
Dead experimental paths
Abstraction Debt
Multiple languages, systems, packages
Need to configure/test/deploy:
- Hyper-parameters
- Schema (including semantics)
- Data dependencies
Hard to understand or visualize what changed
Configuration Debt
Interactions
Experience has shown that the external world is rarely stable
- Word2Vec for “Pokemon”
- Population of Sudan
- Gregorian dates of holidays
Makes monitoring essential.
Makes testing very hard.
Changes in The External World
A model sometimes influences its future training data
This is common in:
- Recommendation systems
- Ad placement
- Systems that affect the physical world
Especially hard if change is gradual and model updates infrequently
Direct Feedback Loops
Often happen when two different systems learn from each other’s
outputs
Classic example is algo-trading
But two independent content generation systems running on the
same page also qualify
Undeclared consumers can be a cause
Hidden Feedback Loops
..But wait
There’s More!
Data Testing
Reproducibility
Process Management
Cultural Debt
More!
Mitigation
How easily can an entirely new algorithmic approach be tested at full
scale?
What is the transitive closure of all data dependencies?
How precisely can the impact of a new change to the system be
measured?
Be Aware of Debt
Does improving one model or signal degrade others?
How quickly can new members of the team be brought up to speed?
Be Aware of Debt
Merge mature models into a single, well defined, well tested system
Prune experimental code paths
Make each feature count
Monitor
Map consumers
Test data
Paying per model
Configuration system - versioned, comprehensive, testable
Data dependency system - versioned, comprehensive, testable
Consolidate mature systems
Reproducibility is awesome
Pay off cultural debt
Paying for Systems
Other Questions?
We Are Hiring!
similarweb.com/corp/jobs

More Related Content

PDF
Tuning SQL for Oracle Exadata: The Good, The Bad, and The Ugly Tuning SQL fo...
PDF
Tanel Poder - Performance stories from Exadata Migrations
PDF
BIG2016- Lessons Learned from building real-life user-focused Big Data systems
PDF
Machine Learning for Q&A Sites: The Quora Example
PDF
Past present and future of Recommender Systems: an Industry Perspective
PDF
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
PDF
Staying Shallow & Lean in a Deep Learning World
PDF
Machine Learning Goes Production
Tuning SQL for Oracle Exadata: The Good, The Bad, and The Ugly Tuning SQL fo...
Tanel Poder - Performance stories from Exadata Migrations
BIG2016- Lessons Learned from building real-life user-focused Big Data systems
Machine Learning for Q&A Sites: The Quora Example
Past present and future of Recommender Systems: an Industry Perspective
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Staying Shallow & Lean in a Deep Learning World
Machine Learning Goes Production

Similar to Machine learning the high interest credit card of technical debt [PWL] (20)

PDF
Practical machine learning
PDF
Engineering Intelligent Systems using Machine Learning
PDF
Pitfalls of machine learning in production
PDF
Introduction to ML.pdf Supervised Learning, Unsupervised
PDF
Managing machine learning
PDF
DN18 | Technical Debt in Machine Learning | Jaroslaw Szymczak | OLX
PDF
Technical debt in ML | Jaroslaw Szymczak | DN18
PDF
Introduction to Machine Learning
PDF
Machine Learning: Past, Present and Future - by Tom Dietterich
PDF
10 more lessons learned from building Machine Learning systems - MLConf
PDF
Xavier Amatriain, VP of Engineering, Quora at MLconf SF - 11/13/15
 
PDF
10 more lessons learned from building Machine Learning systems
PPTX
Introduction to Machine Learning
PDF
Machine learning para tertulianos, by javier ramirez at teowaki
PPTX
introduction to machine learning
PDF
Week 1.pdf
PPTX
Ml - A shallow dive
PPTX
recent.pptx
PPTX
Data Science Crash Course
PDF
Choosing a Machine Learning technique to solve your need
Practical machine learning
Engineering Intelligent Systems using Machine Learning
Pitfalls of machine learning in production
Introduction to ML.pdf Supervised Learning, Unsupervised
Managing machine learning
DN18 | Technical Debt in Machine Learning | Jaroslaw Szymczak | OLX
Technical debt in ML | Jaroslaw Szymczak | DN18
Introduction to Machine Learning
Machine Learning: Past, Present and Future - by Tom Dietterich
10 more lessons learned from building Machine Learning systems - MLConf
Xavier Amatriain, VP of Engineering, Quora at MLconf SF - 11/13/15
 
10 more lessons learned from building Machine Learning systems
Introduction to Machine Learning
Machine learning para tertulianos, by javier ramirez at teowaki
introduction to machine learning
Week 1.pdf
Ml - A shallow dive
recent.pptx
Data Science Crash Course
Choosing a Machine Learning technique to solve your need
Ad

Recently uploaded (20)

PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
PPTX
L1 - Introduction to python Backend.pptx
PDF
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
PPTX
ai tools demonstartion for schools and inter college
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
PDF
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
PDF
How Creative Agencies Leverage Project Management Software.pdf
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 41
PDF
medical staffing services at VALiNTRY
PDF
Navsoft: AI-Powered Business Solutions & Custom Software Development
PDF
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
PPTX
CHAPTER 12 - CYBER SECURITY AND FUTURE SKILLS (1) (1).pptx
PPTX
VVF-Customer-Presentation2025-Ver1.9.pptx
PDF
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
PDF
Design an Analysis of Algorithms I-SECS-1021-03
PDF
Wondershare Filmora 15 Crack With Activation Key [2025
PDF
Flood Susceptibility Mapping Using Image-Based 2D-CNN Deep Learnin. Overview ...
PDF
PTS Company Brochure 2025 (1).pdf.......
PDF
Design an Analysis of Algorithms II-SECS-1021-03
PPTX
ManageIQ - Sprint 268 Review - Slide Deck
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
L1 - Introduction to python Backend.pptx
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
ai tools demonstartion for schools and inter college
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
How Creative Agencies Leverage Project Management Software.pdf
Internet Downloader Manager (IDM) Crack 6.42 Build 41
medical staffing services at VALiNTRY
Navsoft: AI-Powered Business Solutions & Custom Software Development
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
CHAPTER 12 - CYBER SECURITY AND FUTURE SKILLS (1) (1).pptx
VVF-Customer-Presentation2025-Ver1.9.pptx
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
Design an Analysis of Algorithms I-SECS-1021-03
Wondershare Filmora 15 Crack With Activation Key [2025
Flood Susceptibility Mapping Using Image-Based 2D-CNN Deep Learnin. Overview ...
PTS Company Brochure 2025 (1).pdf.......
Design an Analysis of Algorithms II-SECS-1021-03
ManageIQ - Sprint 268 Review - Slide Deck
Ad

Machine learning the high interest credit card of technical debt [PWL]