Thoughts on
Machine Learning and Artificial Intelligence
Maarten van Smeden, PhD

Leiden University Medical Center, Netherlands

STRATOS Lorenz Meeting

21/09/2018
Interested reader perspective
• Statistician by training

• Limited experience applying machine learning techniques

• Three examples that I think are illustrative of ML/AI in medicine
as it is applied today

• Focus: prediction
Tech company business model
Apple Watch 4
FDA Approval
https://www.statnews.com/2018/09/13/heres-the-data-behind-the-new-apple-watch-ekg-app/?mc_cid=0fbfd65c13&mc_eid=75f1d5aea2
Impressive artificial intelligence
IBM Watson wins against two Jeopardy! champions in 2011
Reviewer #2
Less impressive artificial intelligence
Warning!
Statistical policing going on
Yesterday’s news
http://www.timvanderzee.com/the-wansink-dossier-an-overview/

Example 1: ML predicting mortality
• Caliber dataset (UK, EHR)

• N = 80,000 patients with pre-existing coronary artery disease

• Predict all-cause mortality (18,000 events, time horizon unclear)

• “used Cox models, random forests and elastic net regression”

• 586 candidate predictors vs 27 pre-selected variables

• Complete case / multiple imputation / missing indicator method

• Cox models: linear main effects only

• Split sample (1/3 test, 2/3 training)
One take
Linear regression is an example of
Machine Learning?
If so, what isn’t Machine Learning?
Perhaps more reasonable?
Beam & Kohane, JAMA, 2018
Example 2: lymph node metastases
• Researcher challenge competition

• Whole slide images of women diagnosed with breast cancer

• Training data: N = 270 (110 events); test data: N = 129 (49 events)

• 11 pathologists evaluating the test data

• 390 teams signed up for the competition

• 23 teams submitted 32 algorithms for evaluation
Example 2: lymph node metastases
• Unfair comparison between pathologists and deep learning (DL)

• Pathologists had no access to routinely available diagnostics

• AUC comparison: DL (continuous score) vs pathologists (5-point
scale)

• Promising algorithms overrepresented (390 teams -> 32
algorithms submitted)
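
The AUC point can be illustrated with a small simulation. The sketch below is not based on the challenge data: it assumes a hypothetical normally distributed score (sample size, prevalence and effect size are arbitrary choices) and compares the AUC of the continuous score with the AUC of the same score collapsed onto a 5-point scale. Ties are counted as 1/2, as in the Mann-Whitney statistic.

```python
import numpy as np

def auc(scores, labels):
    """AUC as the Mann-Whitney probability; tied scores count as 1/2."""
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=bool)
    # Average ranks (1-based): tied scores share the mean of their ranks.
    _, inverse, counts = np.unique(scores, return_inverse=True, return_counts=True)
    last_rank = np.cumsum(counts)                # highest rank in each tie group
    mean_rank = last_rank - (counts - 1) / 2.0
    ranks = mean_rank[inverse]
    n_pos = labels.sum()
    n_neg = labels.size - n_pos
    u = ranks[labels].sum() - n_pos * (n_pos + 1) / 2.0
    return u / (n_pos * n_neg)

rng = np.random.default_rng(2018)
n = 20_000                                       # arbitrary; large to limit noise
labels = rng.random(n) < 0.38                    # hypothetical prevalence
scores = rng.normal(loc=1.5 * labels, scale=1.0) # hypothetical continuous score

# Collapse the same score onto a 5-point ordinal scale (quintile cut points).
cuts = np.quantile(scores, [0.2, 0.4, 0.6, 0.8])
scale5 = np.digitize(scores, cuts) + 1           # values 1..5

auc_continuous = auc(scores, labels)
auc_scale5 = auc(scale5, labels)
print(f"AUC, continuous score: {auc_continuous:.3f}")
print(f"AUC, 5-point scale:    {auc_scale5:.3f}")
```

In a setup like this the 5-point version should come out lower purely because information is discarded, so part of an AUC gap between an algorithm and a human rater may reflect the response scale rather than diagnostic skill.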
Example 2: lymph node metastases
• No attention to risk prediction / calibration

• ML: attention to classification only, without probabilities

• Huge (often implicit) difference between traditional (risk)
prediction modeling in medicine and traditional ML

• Probably fine for Netflix recommendations; not so much for
real-life medical decision making
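
Why calibration deserves separate attention can be shown with a short sketch. Everything here is a hypothetical assumption (simulated linear predictor, sample size, the factor 2.5): predictions that are a monotone transformation of the true risk keep the same ranking of patients, and hence the same discrimination, yet can be badly miscalibrated.

```python
import numpy as np

def expit(x):
    """Inverse logit."""
    return 1.0 / (1.0 + np.exp(-x))

def calibration_table(pred, y, n_bins=10):
    """Mean predicted risk and observed event fraction per quantile bin."""
    edges = np.quantile(pred, np.linspace(0, 1, n_bins + 1)[1:-1])
    bins = np.digitize(pred, edges)              # bin index 0..n_bins-1
    mean_pred = np.array([pred[bins == b].mean() for b in range(n_bins)])
    obs_frac = np.array([y[bins == b].mean() for b in range(n_bins)])
    return mean_pred, obs_frac

def mean_abs_gap(pred, y):
    """Average |predicted - observed| over the bins (a crude summary)."""
    mean_pred, obs_frac = calibration_table(pred, y)
    return np.abs(mean_pred - obs_frac).mean()

rng = np.random.default_rng(1)
n = 50_000
lp = rng.normal(-1.0, 1.0, n)        # hypothetical linear predictor
true_risk = expit(lp)
y = rng.random(n) < true_risk        # outcomes drawn from the true risk

# Monotone transform of the true risk: same ranking, predictions too extreme.
overconfident = expit(2.5 * lp)

for name, pred in [("true risk", true_risk), ("overconfident", overconfident)]:
    print(f"{name:>13}: mean |predicted - observed| = {mean_abs_gap(pred, y):.3f}")
```

A classifier judged only on ranking or on a yes/no label would look the same in both cases, while a patient told "8% risk" instead of "27% risk" would not.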
Misuse of “risk”
Example 3: 5 types of diabetes
• Patients with newly diagnosed diabetes (N = 8980) 

• 6 continuous variables 

• K-means clustering (‘unsupervised learning’)
BS detection simulation
• Data generated from two independent multivariate normal (MVN) distributions with equal pairwise correlations of 0.3

• “Sunday morning simulations”, code: https://github.com/MvanSmeden/DiabetesClusters
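
A rough Python sketch of the same kind of check follows. It is not the code from the repository linked above: as a simplifying assumption it draws from a single MVN distribution (so there are no subgroups at all), reuses N = 8980 and 6 variables from the example, and uses a bare-bones Lloyd's algorithm written only for illustration.

```python
import numpy as np

def kmeans(x, k, rng, n_iter=200):
    """Plain Lloyd's algorithm; returns cluster labels and centers."""
    # Start from k distinct observations chosen at random.
    centers = x[rng.choice(len(x), size=k, replace=False)]
    for _ in range(n_iter):
        # Squared Euclidean distance of every point to every center.
        dist = ((x[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dist.argmin(axis=1)
        # Move each center to the mean of its points (keep it if empty).
        new_centers = np.array([
            x[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels, centers

rng = np.random.default_rng(970277)
n, p, k = 8980, 6, 5

# One homogeneous population: unit variances, all pairwise correlations 0.3.
cov = np.full((p, p), 0.3) + 0.7 * np.eye(p)
x = rng.multivariate_normal(np.zeros(p), cov, size=n)

labels, centers = kmeans(x, k, rng)
sizes = np.bincount(labels, minlength=k)
print("cluster sizes:", sizes)
```

The algorithm returns five groups because it was asked for five, not because the data contain five subtypes; the partition by itself is no evidence of distinct disease types.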
K-means clustering
“K-means finds a Voronoi partition, only if that partition coincides with a
"clustering" does it have a hope of actually doing clustering”

Max Little: https://twitter.com/MaxALittle/status/970277900871262213
Freak examples?
Probably?
Maybe?
What I observe is:
• Confusion and disagreement about what is and isn’t ML/AI 

• Analyses labeled “ML/AI” have a tendency to concentrate on
classification (exceptions exist, e.g. high-dimensional propensity
score approaches that are called “ML”)

• Analyses labeled “ML/AI” in medicine are surprisingly often
done by people not thoroughly trained in statistics

• Basic statistical principles are often forgotten or ignored (e.g.
improper scoring rules)
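
The scoring rule issue can be made concrete with a toy comparison. The sketch below rests on simulated, hypothetical risks: one set of predictions reports the true risk, the other pushes every prediction to 0.01 or 0.99. Classification accuracy at a 0.5 threshold cannot tell them apart, whereas the Brier score (a proper scoring rule) penalizes the overconfident predictions.

```python
import numpy as np

rng = np.random.default_rng(7)
n = 20_000
true_risk = rng.uniform(0.05, 0.95, n)           # hypothetical true risks
y = (rng.random(n) < true_risk).astype(float)    # observed 0/1 outcomes

honest = true_risk                               # reports the true risk
hardened = np.where(true_risk > 0.5, 0.99, 0.01) # same side of 0.5, extreme

def accuracy(pred, y, threshold=0.5):
    """Fraction classified correctly after thresholding the prediction."""
    return np.mean((pred > threshold) == (y == 1))

def brier(pred, y):
    """Mean squared difference between predicted risk and outcome."""
    return np.mean((pred - y) ** 2)

for name, pred in [("honest", honest), ("hardened", hardened)]:
    print(f"{name:>8}: accuracy = {accuracy(pred, y):.3f}, "
          f"Brier = {brier(pred, y):.3f}")
```

Both sets of predictions fall on the same side of the threshold for every patient, so any evaluation based on thresholded classifications alone would rate them as equally good.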
Concluding remarks (1)
• Just because an algorithm is novel or flexible doesn’t mean it is
any good, obviously

• Dismissing the potential value of novel “ML/AI” algorithms out of
hand doesn’t make sense

• We need more realistic simulations and many applications to
compare the traditional vs more novel / flexible algorithms

• The primary issue in medical applications seems to be with the
modelers, not so much with the models
Concluding remarks (2)
• Statisticians should be more involved in the application and
evaluation of novel / flexible algorithms, especially for risk
prediction

• Statisticians should be involved in studying performance of
novel / flexible algorithms (e.g. data hungriness) -> realistic
simulation studies

• Collaboration with computer scientists

• Computationally intensive -> may not be cheap

• Serious experimental design and reporting
Simulation is…
“…it is using simulation for multiplication that I find objectionable. Eight patients are
eight patients and so should remain.”
“All the impressive achievements of
deep learning amount to just curve
fitting”
Judea Pearl
