SlideShare a Scribd company logo
11
Most read
19
Most read
21
Most read
An introduction to machine learning in biomedical research:
Key concepts, problems and applications
Francisco Azuaje, PhD.
Head of Bioinformatics and Modeling Research Group (BIOMOD)
Luxembourg Institute of Health (LIH)
Presentation for the ISCB Regional Student Group (RSG) - Luxembourg
7 November 2018
Today’s presentation:
• ML in biomedical research: concepts, approaches, applications
• A selection of recent, interesting examples
• Challenges
• ML software frameworks, and final remarks
Feel free to ask questions during or after the presentation
• Artificial Intelligence (AI), a field of computer science whose origins
can be dated back to the 1940s (Turing, 1950), aims to develop
computational systems with advanced analytical or predictive
capabilities.
What is Artificial Intelligence (AI) ?
• Machine Learning (ML) is the most successful branch of AI.
• “A computer program is said to learn from experience E with respect
to some class of tasks T and performance measure P, if its
performance at tasks in T, as measured by P, improves with
experience E” (Mitchell, 1997).
What is Machine Learning (ML)?
• ML is concerned with the development of programs with the capacity
to “learn” from diverse data sources. These programs can be used to
make predictions, help make decisions…etc.
Major learning “paradigms”:
• Supervised learning
• Unsupervised learning, semi-supervised learning
• Reinforcement learning
ML taps into recent and older research advances
• ML stands on the shoulders of probability theory, optimization algorithms, calculus, algebra…
• For example, Deep Learning (DL) builds on progress achieved in artificial neural networks over
decades. Key periods: 1980s and 2000s.
• DL is a sub-branch of ML: A diverse family of computational models consisting of many
(“deep”) data processing layers for automated feature extraction and pattern recognition in
large datasets.
• DL’s progress possible not only because of larger datasets and accelerated computing
capabilities, but also by developments in statistical learning theory, algorithms and open-
source software accumulated over the past four decades.
Major (supervised) learning approaches/techniques
• Generalizations of linear models for regression and classification
• Bayesian models: naïve, graphical models, probabilistic learning/programming
• Support vector machines (SVMs), kernel-based models
• Decision trees, random forests
• Gradient boosting machines (GBM)
• Neural networks, deep learning
• …..
(Topol, 2014, Cell)
(Eisenstein, 2015, Nature)
Biomedical research: larger and diverse datasets
High inter-individual variabilityDatasets change in time and space High intra-individual variability
(Hutter and Zenklusen, Cell, 2018)
Key example of “big data” in cancer research
Typical questions answered by ML using such and other datasets
“Fundamental” research “Applied” research
• What is the “behavior”,
“mechanism”…?
• Within a data layer, how are
samples or features related?
• How are different data layers
interrelated…?
• What if …?
• Why…?
• Risk assessment
• Diagnosis
• Prognosis
• Other clinical outcome prediction
• Prevention
• Drug targets
• Therapeutic strategies
Machine Learning
Examples of applications of ML in biomedicine
(Table adapted from Yu et al., Nat Bio Med Eng, 2018)
Koohy, 2018, F1000 Research
ML in biomedical research
Global usage of ML techniques Trends of ML techniques ( !(PCA & LRM) )
Trends of ML techniques
SVM
RF
DNN
Typical ML development cycle and applications
Supervised machine learning Unsupervised machine learning
Figures adapted from Yu et al., Nat Bio Med Eng, 2018
ML in biomedical research: Examples of model diversity and applications (1)
Large-scale phenotypic image analysis
Novelty/anomaly detection
Prediction of hard-to-discretize states
Image classification according to phenotypes
(e.g., here with Cell Profiler Analyst)
Smith et al., 2018, Cell Systems
ML in biomedical research: Examples of model diversity and applications (2)
Kermany et al., 2018, Cell
Medical Diagnoses with Transfer Image-Based Deep Learning
Retina images Retinal diseases
• DL system: Low classification errors, comparable to humans (on 1000
images)
• Strategy also successfully applied to analysis of chest X-ray images
• Potential “generalized” platform for image-based diagnoses (?)
ML in biomedical research: Examples of model diversity and applications (3)
Ambale-Venkatesh et al., Circ. Res., 2017
Random survival forests for predicting cardiovascular (CV) events
Variable importance for each of the 735 variables used in analysis
Variableimportanceis
measuredusingtheminimum
depthofthemaximalsubtree
• Accurate prediction of 6 CV outcomes (in
asymptomatic population).
• Subsets of predictive features for each
event.
• 12-year follow-up, multi-center, -ethnic,
wide age range.
Imaging, noninvasive
tests, questionnaires,
biomarker panels
Top-20 features
Key challenges of ML in biomedical research
Heterogeneity: Data, events, states,
within and between individuals…
Data not always “big”: relative lack of
labelled data, curse of dimensionality
Data: multi-layered, hierarchical
For same data type/layer: multiple
measurement platforms
Key challenges of ML in biomedical research (2)
Interpretability, understandability:
Global and local, novelty and consistency
with prior knowledge
Reproducibility:
Crucial requirement
“Gold standards”/”ground truth”:
Lack, limitations
Complexity of pattern recurrence,
regularities
ML software frameworks
• Different programing languages offer extensive libraries for
implementing ML: Python, R, Java, C++…
• Widely used frameworks: Scikit-learn, Keras, PyTorch,
TensorFlow, H2O
• Also “automated”, “driverless”…tools are available to get you
started.
Takeaways:
• Many ML challenges in BM research are shared by different
application domains, but this field poses its unique challenges.
• ML in BM research will continue advancing driven by: more data, new
expectations and emerging questions.
• Supervised learning, including e.g., deep learning, will meet many of
these needs, however: unbiased exploration, unsupervised learning,
hypothesis generation and interpretation are also crucial.
Thanks to:
Funding from:
Bioinformatics and Modelling Research
Group (BIOMOD)
Our research partners in Luxembourg and abroad

More Related Content

PDF
Machine Learning in Bioinformatics
PPTX
AI-powered Drug Discovery: Revolutionizing Precision Medicine
PDF
An Introduction to Machine Learning and Genomics
PDF
Machine learning in biology
PPTX
Ml in genomics
PPTX
AI in Bioinformatics
PDF
Machine Learning and Reasoning for Drug Discovery
PDF
Machine Learning in Bioinformatics
Machine Learning in Bioinformatics
AI-powered Drug Discovery: Revolutionizing Precision Medicine
An Introduction to Machine Learning and Genomics
Machine learning in biology
Ml in genomics
AI in Bioinformatics
Machine Learning and Reasoning for Drug Discovery
Machine Learning in Bioinformatics

What's hot (20)

PPT
Classification and prediction
PPTX
Data mining techniques unit III
PDF
Data Science - Part V - Decision Trees & Random Forests
PDF
Genetic Algorithms
PPT
Fp growth algorithm
PDF
Machine Learning and its Applications
PDF
Naive Bayes
PPT
Classification (ML).ppt
PDF
Optimization for Deep Learning
PPTX
Collaborative filtering
ODP
Machine Learning With Logistic Regression
PPTX
Evolutionary Algorithms
PDF
Support Vector Machines ( SVM )
PDF
Cluster Analysis for Dummies
PPTX
Supervised Machine Learning
PDF
Deep learning with Keras
PPT
Machine learning
PPTX
CART – Classification & Regression Trees
PPT
Machine Learning
PDF
Classification Based Machine Learning Algorithms
Classification and prediction
Data mining techniques unit III
Data Science - Part V - Decision Trees & Random Forests
Genetic Algorithms
Fp growth algorithm
Machine Learning and its Applications
Naive Bayes
Classification (ML).ppt
Optimization for Deep Learning
Collaborative filtering
Machine Learning With Logistic Regression
Evolutionary Algorithms
Support Vector Machines ( SVM )
Cluster Analysis for Dummies
Supervised Machine Learning
Deep learning with Keras
Machine learning
CART – Classification & Regression Trees
Machine Learning
Classification Based Machine Learning Algorithms
Ad

Similar to An introduction to machine learning in biomedical research: Key concepts, problems and applications (20)

PDF
Challenges and opportunities for machine learning in biomedical research
PDF
APPLICATIONS OF DEEP LEARNING AND MACHINE LEARNING IN HEALTHCARE DOMAIN – A L...
PDF
Deep learning for biomedical discovery and data mining I
PPTX
Statistics and ML 21Oct22 sel.pptx
PDF
Machine Learning in Medicine A Primer
PDF
Health Care Application using Machine Learning and Deep Learning
PPTX
Statistics and ML Paris 20sept22
PDF
ML and AI: a blessing and curse for statisticians and medical doctors
PDF
Lec.10 Dr Ahmed Elngar
PDF
Deep Learning in Medicine
PDF
Machine learning versus traditional statistical modeling and medical doctors
PPTX
Big Data & ML for Clinical Data
PPTX
AI in translational medicine webinar
PDF
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
PPTX
Application and Implementation of different deep learning
PDF
Health advances ai in diagnostic development
PDF
IRJET- Cancer Disease Prediction using Machine Learning over Big Data
PDF
Rage Against the Machine Learning - Young Researchers Event
PPTX
ML, biomedical data & trust
PPTX
Big Data and Artificial Intelligence
Challenges and opportunities for machine learning in biomedical research
APPLICATIONS OF DEEP LEARNING AND MACHINE LEARNING IN HEALTHCARE DOMAIN – A L...
Deep learning for biomedical discovery and data mining I
Statistics and ML 21Oct22 sel.pptx
Machine Learning in Medicine A Primer
Health Care Application using Machine Learning and Deep Learning
Statistics and ML Paris 20sept22
ML and AI: a blessing and curse for statisticians and medical doctors
Lec.10 Dr Ahmed Elngar
Deep Learning in Medicine
Machine learning versus traditional statistical modeling and medical doctors
Big Data & ML for Clinical Data
AI in translational medicine webinar
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
Application and Implementation of different deep learning
Health advances ai in diagnostic development
IRJET- Cancer Disease Prediction using Machine Learning over Big Data
Rage Against the Machine Learning - Young Researchers Event
ML, biomedical data & trust
Big Data and Artificial Intelligence
Ad

Recently uploaded (20)

PPTX
2. Earth - The Living Planet earth and life
PPTX
SCIENCE10 Q1 5 WK8 Evidence Supporting Plate Movement.pptx
PDF
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
PDF
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
PDF
MIRIDeepImagingSurvey(MIDIS)oftheHubbleUltraDeepField
PDF
HPLC-PPT.docx high performance liquid chromatography
PDF
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
PDF
An interstellar mission to test astrophysical black holes
PPTX
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
PPTX
Cell Membrane: Structure, Composition & Functions
PPTX
neck nodes and dissection types and lymph nodes levels
PPT
Chemical bonding and molecular structure
PDF
The scientific heritage No 166 (166) (2025)
PPTX
2. Earth - The Living Planet Module 2ELS
PPTX
The KM-GBF monitoring framework – status & key messages.pptx
PPTX
TOTAL hIP ARTHROPLASTY Presentation.pptx
PPT
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
PPTX
Introduction to Fisheries Biotechnology_Lesson 1.pptx
PPTX
microscope-Lecturecjchchchchcuvuvhc.pptx
PPTX
Derivatives of integument scales, beaks, horns,.pptx
2. Earth - The Living Planet earth and life
SCIENCE10 Q1 5 WK8 Evidence Supporting Plate Movement.pptx
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
MIRIDeepImagingSurvey(MIDIS)oftheHubbleUltraDeepField
HPLC-PPT.docx high performance liquid chromatography
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
An interstellar mission to test astrophysical black holes
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
Cell Membrane: Structure, Composition & Functions
neck nodes and dissection types and lymph nodes levels
Chemical bonding and molecular structure
The scientific heritage No 166 (166) (2025)
2. Earth - The Living Planet Module 2ELS
The KM-GBF monitoring framework – status & key messages.pptx
TOTAL hIP ARTHROPLASTY Presentation.pptx
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
Introduction to Fisheries Biotechnology_Lesson 1.pptx
microscope-Lecturecjchchchchcuvuvhc.pptx
Derivatives of integument scales, beaks, horns,.pptx

An introduction to machine learning in biomedical research: Key concepts, problems and applications

  • 1. An introduction to machine learning in biomedical research: Key concepts, problems and applications Francisco Azuaje, PhD. Head of Bioinformatics and Modeling Research Group (BIOMOD) Luxembourg Institute of Health (LIH) Presentation for the ISCB Regional Student Group (RSG) - Luxembourg 7 November 2018
  • 2. Today’s presentation: • ML in biomedical research: concepts, approaches, applications • A selection of recent, interesting examples • Challenges • ML software frameworks, and final remarks Feel free to ask questions during or after the presentation
  • 3. • Artificial Intelligence (AI), a field of computer science whose origins can be dated back to the 1940s (Turing, 1950), aims to develop computational systems with advanced analytical or predictive capabilities. What is Artificial Intelligence (AI) ? • Machine Learning (ML) is the most successful branch of AI.
  • 4. • “A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E” (Mitchell, 1997). What is Machine Learning (ML)? • ML is concerned with the development of programs with the capacity to “learn” from diverse data sources. These programs can be used to make predictions, help make decisions…etc.
  • 5. Major learning “paradigms”: • Supervised learning • Unsupervised learning, semi-supervised learning • Reinforcement learning
  • 6. ML taps into recent and older research advances • ML stands on the shoulders of probability theory, optimization algorithms, calculus, algebra… • For example, Deep Learning (DL) builds on progress achieved in artificial neural networks over decades. Key periods: 1980s and 2000s. • DL is a sub-branch of ML: A diverse family of computational models consisting of many (“deep”) data processing layers for automated feature extraction and pattern recognition in large datasets. • DL’s progress possible not only because of larger datasets and accelerated computing capabilities, but also by developments in statistical learning theory, algorithms and open- source software accumulated over the past four decades.
  • 7. Major (supervised) learning approaches/techniques • Generalizations of linear models for regression and classification • Bayesian models: naïve, graphical models, probabilistic learning/programming • Support vector machines (SVMs), kernel-based models • Decision trees, random forests • Gradient boosting machines (GBM) • Neural networks, deep learning • …..
  • 8. (Topol, 2014, Cell) (Eisenstein, 2015, Nature) Biomedical research: larger and diverse datasets High inter-individual variabilityDatasets change in time and space High intra-individual variability
  • 9. (Hutter and Zenklusen, Cell, 2018) Key example of “big data” in cancer research
  • 10. Typical questions answered by ML using such and other datasets “Fundamental” research “Applied” research • What is the “behavior”, “mechanism”…? • Within a data layer, how are samples or features related? • How are different data layers interrelated…? • What if …? • Why…? • Risk assessment • Diagnosis • Prognosis • Other clinical outcome prediction • Prevention • Drug targets • Therapeutic strategies Machine Learning
  • 11. Examples of applications of ML in biomedicine (Table adapted from Yu et al., Nat Bio Med Eng, 2018)
  • 12. Koohy, 2018, F1000 Research ML in biomedical research Global usage of ML techniques Trends of ML techniques ( !(PCA & LRM) ) Trends of ML techniques SVM RF DNN
  • 13. Typical ML development cycle and applications Supervised machine learning Unsupervised machine learning Figures adapted from Yu et al., Nat Bio Med Eng, 2018
  • 14. ML in biomedical research: Examples of model diversity and applications (1) Large-scale phenotypic image analysis Novelty/anomaly detection Prediction of hard-to-discretize states Image classification according to phenotypes (e.g., here with Cell Profiler Analyst) Smith et al., 2018, Cell Systems
  • 15. ML in biomedical research: Examples of model diversity and applications (2) Kermany et al., 2018, Cell Medical Diagnoses with Transfer Image-Based Deep Learning Retina images Retinal diseases • DL system: Low classification errors, comparable to humans (on 1000 images) • Strategy also successfully applied to analysis of chest X-ray images • Potential “generalized” platform for image-based diagnoses (?)
  • 16. ML in biomedical research: Examples of model diversity and applications (3) Ambale-Venkatesh et al., Circ. Res., 2017 Random survival forests for predicting cardiovascular (CV) events Variable importance for each of the 735 variables used in analysis Variableimportanceis measuredusingtheminimum depthofthemaximalsubtree • Accurate prediction of 6 CV outcomes (in asymptomatic population). • Subsets of predictive features for each event. • 12-year follow-up, multi-center, -ethnic, wide age range. Imaging, noninvasive tests, questionnaires, biomarker panels Top-20 features
  • 17. Key challenges of ML in biomedical research Heterogeneity: Data, events, states, within and between individuals… Data not always “big”: relative lack of labelled data, curse of dimensionality Data: multi-layered, hierarchical For same data type/layer: multiple measurement platforms
  • 18. Key challenges of ML in biomedical research (2) Interpretability, understandability: Global and local, novelty and consistency with prior knowledge Reproducibility: Crucial requirement “Gold standards”/”ground truth”: Lack, limitations Complexity of pattern recurrence, regularities
  • 19. ML software frameworks • Different programing languages offer extensive libraries for implementing ML: Python, R, Java, C++… • Widely used frameworks: Scikit-learn, Keras, PyTorch, TensorFlow, H2O • Also “automated”, “driverless”…tools are available to get you started.
  • 20. Takeaways: • Many ML challenges in BM research are shared by different application domains, but this field poses its unique challenges. • ML in BM research will continue advancing driven by: more data, new expectations and emerging questions. • Supervised learning, including e.g., deep learning, will meet many of these needs, however: unbiased exploration, unsupervised learning, hypothesis generation and interpretation are also crucial.
  • 21. Thanks to: Funding from: Bioinformatics and Modelling Research Group (BIOMOD) Our research partners in Luxembourg and abroad