SlideShare a Scribd company logo
Maintainability Challenges
in ML : A SLR
KARTHIK SHIVASHANKAR ANTONIO MARTINI
UNIVERSITY OF OSLO
DEPARTMENT OF INFORMATICS
Study Objective
Our study aims to identify and synthesise the maintainability
challenges in different stages of the ML workflow and understand
how these stages are interdependent and impact each other’s
maintainability.
Maintainability
Software maintainability means ”the ease with which a software
system or component can be modified to correct faults, improve
performance or other attributes and adapt to a changing
environment”
Method
We have a replication package with all the
details and metadata related to this SLR
study @
https://doi.org/10.5281/zenodo.6400559
Research Questions
(RQ1) What are the Data Engineering
Maintainability challenges?
(RQ2) What are the Model Engineering
Maintainability challenges?
(RQ3) What are the current maintainability
challenges when Building an ML system?
RQ1 Key
takeaways
•Data is messy, error-prone, and lacks transparency
and ownership.
•No guarantee that pre-processing can handle all
types of quality errors, bias and adversarial data.
•Most Data pipelines are tested in a trial and error
manner. It also changes and evolves, making it
difficult to validate and maintain it on an ongoing
basis.
Courtesy Randal Munroe of XKCD
RQ2 Key takeaways
•The entanglement in hyperparameters directly affects
the model performance and training pipeline.
•Stochastic nature of ML and rapidly changing input and
expected output create a moving target and make ML
testing an open challenge.
•Data seasonality and fluctuation in data collection may
lead to model staleness and degrading in performance
Image credits:
https://matthewmcateer.me/blog/machine-learning-technical-debt/
RQ3 Key takeaways
• In general, most cloud providers do not provide a common programming
model. They typically use either a black box or a complex runtime environment
to approach ML, leading to a tight coupling between the modelling and
infrastructure layers.
• Although AutoML alleviates some challenges by automating the model
selection and hyper-tuning, it is still hard to minimise expert intervention
easily with the current scene.
• Engineers spend significant effort developing ad hoc programs to connect
components from different software libraries, processing various forms of raw
input, and interfacing with external systems, leading to pipeline jungles and
glue codes in an MLOps-like set-up.
Credits: https://towardsdatascience.com/seven-signs-you-might-be-creating-ml-technical-debt-
1a96a840fd80
Interdependence of
ML challenges
ML has unique quality attributes concerns during
development, such as
•data-dependent behaviour,
•detecting and responding to drift over time,
•handling bias and quality issues,
•timely capture of ground truth for retraining of a model
to deliver a quality ML system
•And many more
Image credits:
https://matthewmcateer.me/blog/machine-learning-technical-debt/
Interdependence
of Maintainability
challenges in
different stages
If you try to use ML to give fashion advice, know that fashion changes over
time
CREDITS:
https://towardsdatascience.com/how-to-attack-machine-learning-evasion-poisoning-inference-trojans-backdoors-a7cb5832595c
https://medium.com/thelaunchpad/how-to-protect-your-machine-learning-product-from-time-adversaries-and-itself-ff07727d6712
ML systems are data-dependent and complex, making them susceptible to Data
and Concept Drift which leads to rapid obsolescence of input and expected
output parts
Credits: https://towardsdatascience.com/machine-learning-in-production-why-is-it-so-difficult-28ce74bfc732
Implication for developers
▪There is a lack of standard tools and method for provenance tracking, publishing of ML models
and their artefacts, tracking data transformations, querying and storing intermediate steps.
▪Many ML projects fail at the prototyping stage because setting up infrastructure for deployment
and maintenance requires integration and management of glue code, ad-hoc pipelines, and data
monitoring.
▪In collaborative or multi-organisational projects, monitoring processes are complex because
different teams have different metrics and requirements, especially in terms of governance and
regulations and also a lack of standards to communicate about ML issues and their quality
Implication for Researcher
•It is unclear even for experienced developers how to select between several data processing
steps and how they will affect the model’s performance.
•ML systems constantly adapt to new data, creating a moving target and posing a different set of
challenges to maintain unit and regression testing than traditional software.
•Need better validation algorithms and Monitoring techniques to identify key data and model
metrics over time.
Thank you
Questions

More Related Content

PPTX
Software engineering for machine learning.pptx
PDF
Introduction to MLOps_ CI_CD for Machine Learning Models.pdf
PPTX
Why do the majority of Data Science projects never make it to production?
PDF
MLOps Bridging the gap between Data Scientists and Ops.
PDF
MLOps – Applying DevOps to Competitive Advantage
PPTX
MLOps and Data Quality: Deploying Reliable ML Models in Production
PPTX
An intro to building an architecture repository meta model and modeling frame...
PDF
How to Build an MLOps Pipeline - SoluLab
Software engineering for machine learning.pptx
Introduction to MLOps_ CI_CD for Machine Learning Models.pdf
Why do the majority of Data Science projects never make it to production?
MLOps Bridging the gap between Data Scientists and Ops.
MLOps – Applying DevOps to Competitive Advantage
MLOps and Data Quality: Deploying Reliable ML Models in Production
An intro to building an architecture repository meta model and modeling frame...
How to Build an MLOps Pipeline - SoluLab

Similar to Maintainability Challenges inML:ASLR (20)

PDF
Productionizing Predictive Analytics using the Rendezvous Architecture - for ...
PPTX
Databricks for MLOps Presentation (AI/ML)
PDF
15 Popular Test Automation Frameworks and How to Choose One.pdf
PDF
Mykola Mykytenko: MLOps: your way from nonsense to valuable effect (approache...
PPTX
3-ISDM-Yu-IS Development Methods for.pptx
PDF
Accelerating Machine Learning as a Service with Automated Feature Engineering
PPTX
case_study_ml_22DCS085_Yash_Patel__.pptx
PDF
Robert Risch -DevOps for Machine Learning.pdf
PDF
Adm Workshop Program
PDF
Bridging the Gap: from Data Science to Production
PDF
The power of AI and ML in Testing .
PPTX
Transform Legacy Systems with Modern Development Expertise
PDF
Testing and Deployment - Full Stack Deep Learning
PDF
Unlocking MLOps Potential: Streamlining Machine Learning Lifecycle with Datab...
PPTX
Flexi dc: A flexible platform for database conversion by wael yahfooz and Sk ...
PDF
Integrating MLOps into Your Data Science Workflow.pdf
PPTX
Aditya Bhattacharya - Enterprise DL - Accelerating Deep Learning Solutions to...
PDF
Week 3 data journey and data storage
PDF
MLOps Virtual Event: Automating ML at Scale
PPT
Deploying ML Models using MLOps Pipelines.ppt
Productionizing Predictive Analytics using the Rendezvous Architecture - for ...
Databricks for MLOps Presentation (AI/ML)
15 Popular Test Automation Frameworks and How to Choose One.pdf
Mykola Mykytenko: MLOps: your way from nonsense to valuable effect (approache...
3-ISDM-Yu-IS Development Methods for.pptx
Accelerating Machine Learning as a Service with Automated Feature Engineering
case_study_ml_22DCS085_Yash_Patel__.pptx
Robert Risch -DevOps for Machine Learning.pdf
Adm Workshop Program
Bridging the Gap: from Data Science to Production
The power of AI and ML in Testing .
Transform Legacy Systems with Modern Development Expertise
Testing and Deployment - Full Stack Deep Learning
Unlocking MLOps Potential: Streamlining Machine Learning Lifecycle with Datab...
Flexi dc: A flexible platform for database conversion by wael yahfooz and Sk ...
Integrating MLOps into Your Data Science Workflow.pdf
Aditya Bhattacharya - Enterprise DL - Accelerating Deep Learning Solutions to...
Week 3 data journey and data storage
MLOps Virtual Event: Automating ML at Scale
Deploying ML Models using MLOps Pipelines.ppt
Ad

More from SEAA 2022 (18)

PDF
Risk and Engineering Knowledge Integration in Cyber-physical Production Syste...
PDF
Bad Smells in Industrial Automation: Sniffing out Feature Envy
PDF
Software Architecture Challenges in Process Automation - From Code Generation...
PDF
From Traditional to Digital: How software, data and AI are transforming the e...
PDF
Exploiting dynamic analysis for architectural smell detection: a preliminary ...
PDF
On the Role of Personality Traits in Implementation Tasks: A Preliminary Inve...
PDF
An Empirical Analysis of Microservices Systems Using Consumer-Driven Contract...
PDF
Have Java Production Methods Co-Evolved With Test Methods Properly?: A Fine-G...
PDF
A Preliminary Conceptualization and Analysis on Automated Static Analysis Too...
PPTX
An Evaluation of Effort-Aware Fine-Grained Just-in-Time Defect Prediction Met...
PDF
The Impact of Forced Working-From-Home on Code Technical Debt: An Industrial ...
PDF
Service Classification through Machine Learning: Aiding in the Efficient Ide...
PDF
Model-Driven Optimization: Generating Smart Mutation Operators for Multi-Obj...
PDF
An Industrial Experience Report about Challenges from Continuous Monitoring, ...
PDF
API Deprecation: A Systematic Mapping Study
PDF
MDEML_UMLsec4Edge Extending UMLsec to model data-protection-compliant edge co...
PDF
EMMM: A Unified Meta-Model for Tracking Machine Learning Experiments
PDF
Easing the Reuse of ML Solutions by Interactive Clustering-based Autotuning i...
Risk and Engineering Knowledge Integration in Cyber-physical Production Syste...
Bad Smells in Industrial Automation: Sniffing out Feature Envy
Software Architecture Challenges in Process Automation - From Code Generation...
From Traditional to Digital: How software, data and AI are transforming the e...
Exploiting dynamic analysis for architectural smell detection: a preliminary ...
On the Role of Personality Traits in Implementation Tasks: A Preliminary Inve...
An Empirical Analysis of Microservices Systems Using Consumer-Driven Contract...
Have Java Production Methods Co-Evolved With Test Methods Properly?: A Fine-G...
A Preliminary Conceptualization and Analysis on Automated Static Analysis Too...
An Evaluation of Effort-Aware Fine-Grained Just-in-Time Defect Prediction Met...
The Impact of Forced Working-From-Home on Code Technical Debt: An Industrial ...
Service Classification through Machine Learning: Aiding in the Efficient Ide...
Model-Driven Optimization: Generating Smart Mutation Operators for Multi-Obj...
An Industrial Experience Report about Challenges from Continuous Monitoring, ...
API Deprecation: A Systematic Mapping Study
MDEML_UMLsec4Edge Extending UMLsec to model data-protection-compliant edge co...
EMMM: A Unified Meta-Model for Tracking Machine Learning Experiments
Easing the Reuse of ML Solutions by Interactive Clustering-based Autotuning i...
Ad

Recently uploaded (20)

PPTX
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
PDF
bbec55_b34400a7914c42429908233dbd381773.pdf
DOCX
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
PDF
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
PPTX
GEN. BIO 1 - CELL TYPES & CELL MODIFICATIONS
PDF
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
PDF
diccionario toefl examen de ingles para principiante
PPTX
INTRODUCTION TO EVS | Concept of sustainability
PPTX
The KM-GBF monitoring framework – status & key messages.pptx
PPTX
2. Earth - The Living Planet earth and life
PPTX
famous lake in india and its disturibution and importance
PDF
An interstellar mission to test astrophysical black holes
PDF
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
PDF
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
PDF
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
PPTX
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
PDF
Sciences of Europe No 170 (2025)
PDF
HPLC-PPT.docx high performance liquid chromatography
PPT
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
PDF
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
bbec55_b34400a7914c42429908233dbd381773.pdf
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
GEN. BIO 1 - CELL TYPES & CELL MODIFICATIONS
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
diccionario toefl examen de ingles para principiante
INTRODUCTION TO EVS | Concept of sustainability
The KM-GBF monitoring framework – status & key messages.pptx
2. Earth - The Living Planet earth and life
famous lake in india and its disturibution and importance
An interstellar mission to test astrophysical black holes
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
Sciences of Europe No 170 (2025)
HPLC-PPT.docx high performance liquid chromatography
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf

Maintainability Challenges inML:ASLR

  • 1. Maintainability Challenges in ML : A SLR KARTHIK SHIVASHANKAR ANTONIO MARTINI UNIVERSITY OF OSLO DEPARTMENT OF INFORMATICS
  • 2. Study Objective Our study aims to identify and synthesise the maintainability challenges in different stages of the ML workflow and understand how these stages are interdependent and impact each other’s maintainability.
  • 3. Maintainability Software maintainability means ”the ease with which a software system or component can be modified to correct faults, improve performance or other attributes and adapt to a changing environment”
  • 4. Method We have a replication package with all the details and metadata related to this SLR study @ https://doi.org/10.5281/zenodo.6400559
  • 5. Research Questions (RQ1) What are the Data Engineering Maintainability challenges? (RQ2) What are the Model Engineering Maintainability challenges? (RQ3) What are the current maintainability challenges when Building an ML system?
  • 6. RQ1 Key takeaways •Data is messy, error-prone, and lacks transparency and ownership. •No guarantee that pre-processing can handle all types of quality errors, bias and adversarial data. •Most Data pipelines are tested in a trial and error manner. It also changes and evolves, making it difficult to validate and maintain it on an ongoing basis. Courtesy Randal Munroe of XKCD
  • 7. RQ2 Key takeaways •The entanglement in hyperparameters directly affects the model performance and training pipeline. •Stochastic nature of ML and rapidly changing input and expected output create a moving target and make ML testing an open challenge. •Data seasonality and fluctuation in data collection may lead to model staleness and degrading in performance Image credits: https://matthewmcateer.me/blog/machine-learning-technical-debt/
  • 8. RQ3 Key takeaways • In general, most cloud providers do not provide a common programming model. They typically use either a black box or a complex runtime environment to approach ML, leading to a tight coupling between the modelling and infrastructure layers. • Although AutoML alleviates some challenges by automating the model selection and hyper-tuning, it is still hard to minimise expert intervention easily with the current scene. • Engineers spend significant effort developing ad hoc programs to connect components from different software libraries, processing various forms of raw input, and interfacing with external systems, leading to pipeline jungles and glue codes in an MLOps-like set-up. Credits: https://towardsdatascience.com/seven-signs-you-might-be-creating-ml-technical-debt- 1a96a840fd80
  • 9. Interdependence of ML challenges ML has unique quality attributes concerns during development, such as •data-dependent behaviour, •detecting and responding to drift over time, •handling bias and quality issues, •timely capture of ground truth for retraining of a model to deliver a quality ML system •And many more Image credits: https://matthewmcateer.me/blog/machine-learning-technical-debt/
  • 11. If you try to use ML to give fashion advice, know that fashion changes over time
  • 14. Implication for developers ▪There is a lack of standard tools and method for provenance tracking, publishing of ML models and their artefacts, tracking data transformations, querying and storing intermediate steps. ▪Many ML projects fail at the prototyping stage because setting up infrastructure for deployment and maintenance requires integration and management of glue code, ad-hoc pipelines, and data monitoring. ▪In collaborative or multi-organisational projects, monitoring processes are complex because different teams have different metrics and requirements, especially in terms of governance and regulations and also a lack of standards to communicate about ML issues and their quality
  • 15. Implication for Researcher •It is unclear even for experienced developers how to select between several data processing steps and how they will affect the model’s performance. •ML systems constantly adapt to new data, creating a moving target and posing a different set of challenges to maintain unit and regression testing than traditional software. •Need better validation algorithms and Monitoring techniques to identify key data and model metrics over time.