SlideShare a Scribd company logo
Simple. Precise.
Competent.
Reproducibility and
experiments management in
ML projects
D1 EXPO 2019, Alicante
Rozhkov Mikhail
Senior Data Scientist
Raiffeisen Bank Russia
Topics
• Difference of ML projects from IT
projects
• ML experiments management
• Agile ML
• ML reproducibility and why it’s
important?
• Approaches and tools
2
1. Different from IT projects
2. Longer dev cycle
3. Experiments driven
4. Not easy to test and validate
3
ML projects
IT projects development processes
4
Source: https://online.husson.edu/software-development-cycle/ Source: https://en.wikipedia.org/wiki/Software_development_process
ML project workflow is experiments driven
5
Problem
Statement
MVP
design
Get data
Prepare data
Train model
Evaluate
modelTest &
Integrate
Serve /
Predict
Monitor
1. Analyze &
Plan
2. Prototype
4. Monitor &
Maintain
3. Productionize
Inspired by Uber’s workflow of a machine learning project diagram. Scaling Machine Learning at Uber with Michelangelo https://eng.uber.com/scaling-michelangelo/
Solution
development
Experiment = code + dataset + outputs
6
Algorithm
Data
Hyperpara
meters
Evaluation
Measure
Model
ETL
tasks
test
dataset
train
dataset
evaluate
train
Experiment
config - artifacts
- pipelines
- code
- configs
ML project requires more factors to take into account
7
Software ML
Architecture design
tasks, UI/UX
integrations
+ nature and quality of data
Quality measures working code
+ model quality metrics
+ performance in production
Version control
code
environment
+ pipelines
+ datasets
+ models & artifacts
Testing code
+ data and features
+ model development methods
+ ML infrastructure
+ ML systems
Inspired by Dmitry Petrov, Ivan Shcheklein. Open source tools for machine learning model and dataset versioning.
1. Satisfy customer
2. Early fail
3. Fail safe
4. Frequent code updates
5. Constant changing requirements
6. Team = Business + DS/ML
7. Frequent team meetings/statuses
8. Measure of progress=working
code
9. Technical excellence and good
design
10. Reproducibility
8
Agile ML
Inspired by: Andrew Kelleher, Adam Kelleher. Machine Learning in Production:
Developing and Optimizing Data Science Workflows and Applications. 2019
ML reproducibility is a dimension of quality
9
What is Reproducibility?
● using the original methods applied to
the original data to produce the
original results [Gardner]
Why should you care?
● Trust
● Consistent Results
● Versioned History
● Team Performance
● Pain Less Production
Josh Gardner, Yuming Yang, Ryan S. Baker, Christopher Brooks. Enabling End-To-End Machine
Learning Replicability: A Case Study in Educational Data Mining
● Code, models and data version
control
● Automated pipelines
● Tests
● Control environment
● Experiments management
● Methodology and procedures
documentation
10
Reproducible ML
11
Use case
Onsite Recommendation System
12
Purpose: improve conversion rate on landing page
send online
user data
get promo
User History
DataCV prediction
model
Promo
recommendati
on model
Promo DB
{uid, cv_pred, promo_id}
Tracking project statuses and issues, documentation
13
Code
14
● Version control
● Re-usable .py modules
● Tests...
Source: https://www.bitbull.it/en/blog/how-git-flow-works/
Data and artifacts
15
● Version Control
● Store / share
● Access
Pipelines
16
● One button run
● End-to-end or selected steps
● Configs (i.e. random seeds)
load/
transform raw
data
evaluate
train
split train/test
prepare train
dataset
select best
model
prepare test
dataset
predict
Experiments Management
17
● Browse history
● Compare results
● Share results
● Methodology and
procedures
Data Model
pipelines
MetricsConfig
pipelines
pipelines
Data Model
pipelines
MetricsConfig
pipelines
pipelines
experiment X (dd.mm.2018)
experiment Y (dd.mm.2019)
Environment
18
● Libraries
● OS
● Hardware
Gronenschild EHBM, Habets P, Jacobs HIL, Mengelers R, Rozendaal N, et al. (2012) The Effects of FreeSurfer Version, Workstation Type, and Macintosh
Operating System Version on Anatomical Volume and Cortical Thickness Measurements. PLOS ONE 7(6): e38234. https://doi.org/10.1371/journal.pone.0038234
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0038234
Example: Effects of data processing conditions on the voxel volumes for a subsample of
(sub)cortical structures
Test ML
● Tests for data and features
● Tests for model development
● Tests for ML infrastructure
● Test for running ML systems
19
Conclusions
1. ML projects requires different
approach
2. Data and experiments are crucial
3. Agility is driven by fast
experimenting and reproducibility
4. Experiments are versioned,
browsable, comparable,
documented, reproducible
5. Reproducibility is a dimension of
quality and maturity
20
raiffeisen.ru
Simple. Precise.
Competent.
Thank you
21
Rozhkov Mikhail
Senior Data Scientist
Raiffeisen Bank Russia
mikhail.rozhkov@raiffeisen.ru

More Related Content

PDF
Technical debt in machine learning - Data Natives Berlin 2018
PDF
Machine Learning to moderate ads in real world classified's business
PDF
An online tool for requirements engineering, modeling and verification of dis...
PDF
MLOps for production-level machine learning
PPTX
Academic structure checker system
PDF
Building successful and secure products with AI and ML
PDF
Agile Data Science on Greenplum Using Airflow - Greenplum Summit 2019
PPTX
From Data Science to MLOps
Technical debt in machine learning - Data Natives Berlin 2018
Machine Learning to moderate ads in real world classified's business
An online tool for requirements engineering, modeling and verification of dis...
MLOps for production-level machine learning
Academic structure checker system
Building successful and secure products with AI and ML
Agile Data Science on Greenplum Using Airflow - Greenplum Summit 2019
From Data Science to MLOps

Similar to Reproducibility and experiments management in Machine Learning (20)

PDF
From Machine Learning Scientist to Full Stack Data Scientist: Lessons learned...
PDF
Bridging the Gap: from Data Science to Production
PDF
Start with version control and experiments management in machine learning
PDF
C2_W1---.pdf
PDF
AI projects - Lifecyle & Best Practices
PDF
StarWest 2019 - End to end testing: Stupid or Legit?
PDF
End to-end test automation at scale
PDF
Scaling Ride-Hailing with Machine Learning on MLflow
PPTX
Moving from BI to AI : For decision makers
PDF
Monitoring AI with AI
PDF
Data Summer Conf 2018, “Monitoring AI with AI (RUS)” — Stepan Pushkarev, CTO ...
PDF
Ml ops intro session
PDF
DevOps Days Rockies MLOps
PPTX
MLOps and Data Quality: Deploying Reliable ML Models in Production
PPTX
vodQA Pune (2019) - Testing AI,ML applications
PPTX
prod-dev-management.pptx
PPTX
CNCF-Istanbul-MLOps for Devops Engineers.pptx
PDF
Real world machine learning with Java for Fumankaitori.com
PDF
Slides-Артем Коваль-Cloud-Native MLOps Framework - DataFest 2021.pdf
PDF
Using Data Science to Build an End-to-End Recommendation System
From Machine Learning Scientist to Full Stack Data Scientist: Lessons learned...
Bridging the Gap: from Data Science to Production
Start with version control and experiments management in machine learning
C2_W1---.pdf
AI projects - Lifecyle & Best Practices
StarWest 2019 - End to end testing: Stupid or Legit?
End to-end test automation at scale
Scaling Ride-Hailing with Machine Learning on MLflow
Moving from BI to AI : For decision makers
Monitoring AI with AI
Data Summer Conf 2018, “Monitoring AI with AI (RUS)” — Stepan Pushkarev, CTO ...
Ml ops intro session
DevOps Days Rockies MLOps
MLOps and Data Quality: Deploying Reliable ML Models in Production
vodQA Pune (2019) - Testing AI,ML applications
prod-dev-management.pptx
CNCF-Istanbul-MLOps for Devops Engineers.pptx
Real world machine learning with Java for Fumankaitori.com
Slides-Артем Коваль-Cloud-Native MLOps Framework - DataFest 2021.pdf
Using Data Science to Build an End-to-End Recommendation System
Ad

More from Mikhail Rozhkov (15)

PDF
Школа Tech-In.RU: Cеминар 1. Основы работы с Ардуино (Аrduino) и Обзор hardwa...
PPTX
How to improve performance of team members? Consider competencies and context!
PPTX
Применение Arduino (Ардуино) в школе. Сообщество Tech-In.ru
PDF
Tech in.ru Опыт проведения семинаров по ардуино, электронике и робототехнике ...
PDF
Slides_Workplace context and its effect on individual competencies and perfor...
PDF
Study summary_Workplace context and its effect on individual competencies and...
PPSX
An initial framework of competency-based knowledge management
PDF
Отчет о конференции "Управление знаниями: практика" 2011
PDF
Роль знаний в организации
PDF
Влияние управления знаниями на конкурентоспособность организаций
PPSX
концепция поликтики уз в современном вузе
PPSX
организационно-управленческие семинары как инструмент управления знаниями
PPSX
Implementation of work-based learning approach in partnership of universities...
PPTX
Управление знаниями в университете
PDF
Интернет в образовании: путеводитель
Школа Tech-In.RU: Cеминар 1. Основы работы с Ардуино (Аrduino) и Обзор hardwa...
How to improve performance of team members? Consider competencies and context!
Применение Arduino (Ардуино) в школе. Сообщество Tech-In.ru
Tech in.ru Опыт проведения семинаров по ардуино, электронике и робототехнике ...
Slides_Workplace context and its effect on individual competencies and perfor...
Study summary_Workplace context and its effect on individual competencies and...
An initial framework of competency-based knowledge management
Отчет о конференции "Управление знаниями: практика" 2011
Роль знаний в организации
Влияние управления знаниями на конкурентоспособность организаций
концепция поликтики уз в современном вузе
организационно-управленческие семинары как инструмент управления знаниями
Implementation of work-based learning approach in partnership of universities...
Управление знаниями в университете
Интернет в образовании: путеводитель
Ad

Recently uploaded (20)

PPT
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn
PPTX
Computer network topology notes for revision
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
climate analysis of Dhaka ,Banglades.pptx
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPT
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PDF
Mega Projects Data Mega Projects Data
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
PDF
Launch Your Data Science Career in Kochi – 2025
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
Moving the Public Sector (Government) to a Digital Adoption
PPTX
Business Acumen Training GuidePresentation.pptx
PPTX
Database Infoormation System (DBIS).pptx
PPTX
IB Computer Science - Internal Assessment.pptx
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn
Computer network topology notes for revision
IBA_Chapter_11_Slides_Final_Accessible.pptx
STUDY DESIGN details- Lt Col Maksud (21).pptx
.pdf is not working space design for the following data for the following dat...
climate analysis of Dhaka ,Banglades.pptx
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Mega Projects Data Mega Projects Data
Introduction to Knowledge Engineering Part 1
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
Launch Your Data Science Career in Kochi – 2025
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Moving the Public Sector (Government) to a Digital Adoption
Business Acumen Training GuidePresentation.pptx
Database Infoormation System (DBIS).pptx
IB Computer Science - Internal Assessment.pptx
oil_refinery_comprehensive_20250804084928 (1).pptx
Acceptance and paychological effects of mandatory extra coach I classes.pptx

Reproducibility and experiments management in Machine Learning

  • 1. Simple. Precise. Competent. Reproducibility and experiments management in ML projects D1 EXPO 2019, Alicante Rozhkov Mikhail Senior Data Scientist Raiffeisen Bank Russia
  • 2. Topics • Difference of ML projects from IT projects • ML experiments management • Agile ML • ML reproducibility and why it’s important? • Approaches and tools 2
  • 3. 1. Different from IT projects 2. Longer dev cycle 3. Experiments driven 4. Not easy to test and validate 3 ML projects
  • 4. IT projects development processes 4 Source: https://online.husson.edu/software-development-cycle/ Source: https://en.wikipedia.org/wiki/Software_development_process
  • 5. ML project workflow is experiments driven 5 Problem Statement MVP design Get data Prepare data Train model Evaluate modelTest & Integrate Serve / Predict Monitor 1. Analyze & Plan 2. Prototype 4. Monitor & Maintain 3. Productionize Inspired by Uber’s workflow of a machine learning project diagram. Scaling Machine Learning at Uber with Michelangelo https://eng.uber.com/scaling-michelangelo/ Solution development
  • 6. Experiment = code + dataset + outputs 6 Algorithm Data Hyperpara meters Evaluation Measure Model ETL tasks test dataset train dataset evaluate train Experiment config - artifacts - pipelines - code - configs
  • 7. ML project requires more factors to take into account 7 Software ML Architecture design tasks, UI/UX integrations + nature and quality of data Quality measures working code + model quality metrics + performance in production Version control code environment + pipelines + datasets + models & artifacts Testing code + data and features + model development methods + ML infrastructure + ML systems Inspired by Dmitry Petrov, Ivan Shcheklein. Open source tools for machine learning model and dataset versioning.
  • 8. 1. Satisfy customer 2. Early fail 3. Fail safe 4. Frequent code updates 5. Constant changing requirements 6. Team = Business + DS/ML 7. Frequent team meetings/statuses 8. Measure of progress=working code 9. Technical excellence and good design 10. Reproducibility 8 Agile ML Inspired by: Andrew Kelleher, Adam Kelleher. Machine Learning in Production: Developing and Optimizing Data Science Workflows and Applications. 2019
  • 9. ML reproducibility is a dimension of quality 9 What is Reproducibility? ● using the original methods applied to the original data to produce the original results [Gardner] Why should you care? ● Trust ● Consistent Results ● Versioned History ● Team Performance ● Pain Less Production Josh Gardner, Yuming Yang, Ryan S. Baker, Christopher Brooks. Enabling End-To-End Machine Learning Replicability: A Case Study in Educational Data Mining
  • 10. ● Code, models and data version control ● Automated pipelines ● Tests ● Control environment ● Experiments management ● Methodology and procedures documentation 10 Reproducible ML
  • 12. Onsite Recommendation System 12 Purpose: improve conversion rate on landing page send online user data get promo User History DataCV prediction model Promo recommendati on model Promo DB {uid, cv_pred, promo_id}
  • 13. Tracking project statuses and issues, documentation 13
  • 14. Code 14 ● Version control ● Re-usable .py modules ● Tests... Source: https://www.bitbull.it/en/blog/how-git-flow-works/
  • 15. Data and artifacts 15 ● Version Control ● Store / share ● Access
  • 16. Pipelines 16 ● One button run ● End-to-end or selected steps ● Configs (i.e. random seeds) load/ transform raw data evaluate train split train/test prepare train dataset select best model prepare test dataset predict
  • 17. Experiments Management 17 ● Browse history ● Compare results ● Share results ● Methodology and procedures Data Model pipelines MetricsConfig pipelines pipelines Data Model pipelines MetricsConfig pipelines pipelines experiment X (dd.mm.2018) experiment Y (dd.mm.2019)
  • 18. Environment 18 ● Libraries ● OS ● Hardware Gronenschild EHBM, Habets P, Jacobs HIL, Mengelers R, Rozendaal N, et al. (2012) The Effects of FreeSurfer Version, Workstation Type, and Macintosh Operating System Version on Anatomical Volume and Cortical Thickness Measurements. PLOS ONE 7(6): e38234. https://doi.org/10.1371/journal.pone.0038234 https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0038234 Example: Effects of data processing conditions on the voxel volumes for a subsample of (sub)cortical structures
  • 19. Test ML ● Tests for data and features ● Tests for model development ● Tests for ML infrastructure ● Test for running ML systems 19
  • 20. Conclusions 1. ML projects requires different approach 2. Data and experiments are crucial 3. Agility is driven by fast experimenting and reproducibility 4. Experiments are versioned, browsable, comparable, documented, reproducible 5. Reproducibility is a dimension of quality and maturity 20
  • 21. raiffeisen.ru Simple. Precise. Competent. Thank you 21 Rozhkov Mikhail Senior Data Scientist Raiffeisen Bank Russia mikhail.rozhkov@raiffeisen.ru