SlideShare a Scribd company logo
Why more than half of ML models don't make it to production
whoami
● Solutions Architect @ cnvrg.io
● = built by data scientists, for data scientists to help teams:
○ Get from data to models to production in the most efficient and fast way
○ Bridge science and engineering
○ Automate MLOps
○ Help teams streamline every element of their pipelines
Aaron
Schneider
aaron@cnvrg.io
LinkedIn: azschneider
def agenda(30 mins):
● Understanding the problems and symptoms
● Discussing the root causes
● Best practices for closing the gap
● Tools to streamline the production process
Deployment
ML.overview()
Data Processing Training MonitorDeployment
Deployment
Deployment
production.statistics()
https://venturebeat.com/2019/07/19/why-do-87-of-data-science-projects-never-make-it-into-production/
Algorithmia: 2020 state of enterprise machine learning
87%
of ML projects
don’t make it to
production
55%
of companies
haven’t deployed a
single model
production.statistics()
https://venturebeat.com/2019/07/19/why-do-87-of-data-science-projects-never-make-it-into-production/
Algorithmia: 2020 state of enterprise machine learning
50%
of models take 8-90 days
before production
13%
take 91-365 days
Investment in ML is
increasing
Model != in_production
● Good work being done by good data scientists
● Functional and accurate models being made
● But not being used
● If being used, often take a long time before implemented
● Undermines the whole process as ML not being used or no longer
accurate
● Enterprise problem impacting business results
● Need to accelerate time from research into production
production_issue(team_friction)
● Multiple teams:
○ Data Science Team (Development)
○ Engineering Team (DevOps)
● Data Science can build the model but engineering can’t (won’t?) put it
into production
● Causes:
○ Miscommunication
○ Developed in Python but implement in C/Java
○ Mismanagement
production_issue(iteration)
● You can’t reach desired accuracy through one grid search. Have to
continually improve the model
● Can get stuck in the grind towards the perfect model
● Management wants higher accuracy before production
● Like any software/agile method: Start with MVP
● Build a model and get it to production, THEN monitor and iterate
● Already puts you leaps and bounds ahead of competitors
Accuracy =
62%
Version 2
Accuracy =
60%
Version 1
Accuracy =
64%
Version 3
production_issue(containerizing)
● How do we even implement a model? What technology do we use?
● Model developed separately from product, how do we align?
● Containerization is a logical answer
● Tough to implement properly
● Links back to first issue we discussed
production_issue(scaling)
● Demand isn’t static
● Need complicated infrastructure to support the service you are
delivering
● Must be able to scale without interruption to service
● Kubernetes is great, but hard
● Need whole different set of skills to build the architecture for
autoscaling with k8
production_issue(executive_buy_in)
● Might have the best DS team in the world
● But if higher-ups push back, all a waste of time
● Need the management and executive on board with the vision
● Might be excited by the flashiness of ML/AI/Big Data but need to take
risks and believe in the DS teams capability
cnvrg.demo()
webinar.summary()
● There are many issues that keep models from production
● Very rarely are they DS issues, usually team friction and engineering
complexities
● It is the responsibility of the DevOps and Management teams to build a
structure that can minimize time to production
● This is a real metric that can have real business consequences, both
positively and negatively
● Low time to production = $$$
● High time to production = -$$$
Why more than half of ML models don't make it to production
Thanks!
https://cnvrg.io
info@cnvrg.io
+972506600186

More Related Content

PDF
MLOps for production-level machine learning
PDF
How to monitor your ML models in production with Kubernetes
PDF
Training Machine Learning models directly from GitHub with cnvrg.io MLOps
PDF
Build machine learning pipelines from research to production
PDF
What is MLOps
PDF
Bridging the gap in enterprise AI
PDF
MLOps Bridging the gap between Data Scientists and Ops.
PDF
MMT - Next Generation Machine Translation by Marcello Federico (Fondazione Br...
MLOps for production-level machine learning
How to monitor your ML models in production with Kubernetes
Training Machine Learning models directly from GitHub with cnvrg.io MLOps
Build machine learning pipelines from research to production
What is MLOps
Bridging the gap in enterprise AI
MLOps Bridging the gap between Data Scientists and Ops.
MMT - Next Generation Machine Translation by Marcello Federico (Fondazione Br...

What's hot (20)

PDF
Machine learning life cycle
PPTX
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge Datasets
PDF
Legacy code development and maintenance
PPTX
Machine Learning in Production with Dato Predictive Services
PDF
Data engineering zoomcamp introduction
PDF
Making machine learning model deployment boring - Big Data Expo 2019
PPTX
vodQA Pune (2019) - Design patterns in test automation
PDF
DN18 | Technical Debt in Machine Learning | Jaroslaw Szymczak | OLX
PPTX
John Robert: Making your machine learning model usable by others
PPTX
vodQA Pune (2019) - Insights into big data testing
PPTX
Is that a Time Machine? Some Design Patterns for Real World Machine Learning ...
PDF
Data Science Salon: Kaggle 1st Place in 30 minutes: Putting AutoML to Work wi...
PPTX
Freenome's Biological Machine Learning Platform
PDF
Webinar - Patient Readmission Risk
PPTX
MATLAB GUI Projects Research Ideas
PPTX
Matlab Based Projects Research guidance
PDF
Spark Summit EU 2017 - Preventing revenue leakage and monitoring distributed ...
PDF
MLSEV Virtual. ML Platformization and AutoML in the Enterprise
PDF
Sparklyr: Big Data enabler for R users
Machine learning life cycle
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge Datasets
Legacy code development and maintenance
Machine Learning in Production with Dato Predictive Services
Data engineering zoomcamp introduction
Making machine learning model deployment boring - Big Data Expo 2019
vodQA Pune (2019) - Design patterns in test automation
DN18 | Technical Debt in Machine Learning | Jaroslaw Szymczak | OLX
John Robert: Making your machine learning model usable by others
vodQA Pune (2019) - Insights into big data testing
Is that a Time Machine? Some Design Patterns for Real World Machine Learning ...
Data Science Salon: Kaggle 1st Place in 30 minutes: Putting AutoML to Work wi...
Freenome's Biological Machine Learning Platform
Webinar - Patient Readmission Risk
MATLAB GUI Projects Research Ideas
Matlab Based Projects Research guidance
Spark Summit EU 2017 - Preventing revenue leakage and monitoring distributed ...
MLSEV Virtual. ML Platformization and AutoML in the Enterprise
Sparklyr: Big Data enabler for R users
Ad

Similar to Why more than half of ML models don't make it to production (20)

PDF
Rsqrd AI: From R&D to ROI of AI
PDF
[DSC MENA 24] Abdelrahman_Ghallab_-_Data_Product_mgmt.pdf
PDF
Building Data Science into Organizations: Field Experience
PDF
Productionising Machine Learning Models
PDF
Machine learning systems for engineers
PDF
How Will Your ML Project Fail
PPTX
From Data Science to MLOps
PDF
Bridging the Gap: from Data Science to Production
PPTX
Why do the majority of Data Science projects never make it to production?
PDF
Ml ops intro session
PDF
Machine Learning Model Deployment: Strategy to Implementation
PDF
From Machine Learning Scientist to Full Stack Data Scientist: Lessons learned...
PPTX
Bridging the AI Gap: Building Stakeholder Support
PPTX
Machine Learning Models in Production
PPTX
Why do most machine learning projects never make it to production
PDF
Pm.ais ummit 180917 final
PDF
MLOps – Applying DevOps to Competitive Advantage
PPTX
MLOps - Getting Machine Learning Into Production
PDF
Tecton - The state of applied ML 2023.pdf
PDF
Using MLOps to Bring ML to Production/The Promise of MLOps
Rsqrd AI: From R&D to ROI of AI
[DSC MENA 24] Abdelrahman_Ghallab_-_Data_Product_mgmt.pdf
Building Data Science into Organizations: Field Experience
Productionising Machine Learning Models
Machine learning systems for engineers
How Will Your ML Project Fail
From Data Science to MLOps
Bridging the Gap: from Data Science to Production
Why do the majority of Data Science projects never make it to production?
Ml ops intro session
Machine Learning Model Deployment: Strategy to Implementation
From Machine Learning Scientist to Full Stack Data Scientist: Lessons learned...
Bridging the AI Gap: Building Stakeholder Support
Machine Learning Models in Production
Why do most machine learning projects never make it to production
Pm.ais ummit 180917 final
MLOps – Applying DevOps to Competitive Advantage
MLOps - Getting Machine Learning Into Production
Tecton - The state of applied ML 2023.pdf
Using MLOps to Bring ML to Production/The Promise of MLOps
Ad

More from cnvrg.io AI OS - Hands-on ML Workshops (8)

PDF
Webinar kubernetes and-spark
PDF
How to set up Kubernetes for all your machine learning workflows
PDF
CI/CD for Machine Learning
PDF
How to use continual learning in your ML models
PDF
How To Build Auto-Adaptive Machine Learning Models with Kubernetes
PDF
Continual learning with human in-the-loop
PDF
Scaling MLOps on NVIDIA DGX Systems
PDF
Deploy your machine learning models to production with Kubernetes
Webinar kubernetes and-spark
How to set up Kubernetes for all your machine learning workflows
CI/CD for Machine Learning
How to use continual learning in your ML models
How To Build Auto-Adaptive Machine Learning Models with Kubernetes
Continual learning with human in-the-loop
Scaling MLOps on NVIDIA DGX Systems
Deploy your machine learning models to production with Kubernetes

Recently uploaded (20)

PPTX
Database Infoormation System (DBIS).pptx
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PPT
Quality review (1)_presentation of this 21
PPTX
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
PDF
Business Analytics and business intelligence.pdf
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPTX
SAP 2 completion done . PRESENTATION.pptx
PDF
annual-report-2024-2025 original latest.
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
Computer network topology notes for revision
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Database Infoormation System (DBIS).pptx
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Business Ppt On Nestle.pptx huunnnhhgfvu
Quality review (1)_presentation of this 21
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
Business Analytics and business intelligence.pdf
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Introduction-to-Cloud-ComputingFinal.pptx
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
SAP 2 completion done . PRESENTATION.pptx
annual-report-2024-2025 original latest.
Clinical guidelines as a resource for EBP(1).pdf
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
.pdf is not working space design for the following data for the following dat...
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
oil_refinery_comprehensive_20250804084928 (1).pptx
Computer network topology notes for revision
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx

Why more than half of ML models don't make it to production