SlideShare a Scribd company logo
Reproducibility in machine
learning
1
Payam Barnaghi
Centre for Vision, Speech and Signal Processing (CVSSP), University of
Surrey
Care Technology & Research Centre, The UK Dementia Research Institute
2
3
4
Source: Odd Erik Gundersen, the Norwegian University of Science and Technology in
Trondheim,
Science Magazine
5
6
What is machine learning?
How can we make the results of machine
learning experiments more reproducible?
7
8
Leia: Yaté. Yaté. Yotó.
(SUBTITLE: “I have come for the bounty on this Wookiee.”)
C-3PO relays this message and Jabba says he’ll offer 25,000 for
Chewie.
Leia: Yotó. Yotó. (SUBTITLE: “50,000, no less.”)
C-3PO relays this message and Jabba asks why he should pay so
much.
Leia: Eí yóto.
The above isn’t subtitled, but Leia pulls out a bomb and activates it.
c-3po: Because he’s holding a thermal detonator!
Jabba is impressed by this and offers 35,000.
Leia: Yató cha.
The above isn’t subtitled, but Leia deactivates the bomb and puts it
away.
c-3po: He agrees.
Order is restored.
Adapted from: The Art of Language Invention: From Horse-Lords to Dark Elves, The Words Behind World-Building, by
9
Leia: Yaté. Yaté. Yotó.
(SUBTITLE: “I have come for the bounty on this Wookiee.”)
C-3PO relays this message and Jabba says he’ll offer 25,000 for
Chewie.
Leia: Yotó. Yotó. (SUBTITLE: “50,000, no less.”)
C-3PO relays this message and Jabba asks why he should pay so
much.
Leia: Eí yóto.
The above isn’t subtitled, but Leia pulls out a bomb and activates it.
c-3po: Because he’s holding a thermal detonator!
Jabba is impressed by this and offers 35,000.
Leia: Yató cha.
The above isn’t subtitled, but Leia deactivates the bomb and puts it
away.
c-3po: He agrees.
Order is restored.
Adapted from: The Art of Language Invention: From Horse-Lords to Dark Elves, The Words Behind World-Building, by
10
11Source:
https://www.omniglot.com/conscripts/tengwarhv.htm
However, don’t underestimate the human brain
− For example, in German:
−you have to distinguish between “the” and “a” articles
−and then each one has four case forms
−three genders
−and singular and plural forms
−and then the adjectives have to agree
−and then there are verbs!
−or, take this as an example:
12
Adapted from: The Art of Language Invention: From Horse-Lords to Dark Elves, The Words Behind World-Building, by
(biáng)
Peterson
13
/ (ˈpiːtəsən) /
Pear Peter/ Peter’s Son
Bear Meter
Ear Greta
… …
Payam
14
/ (ˈpæːjʌm) /
Pay + am or Pa +
yam
‫پیام‬
15
Source: https://www.originofalphabet.com/meow-is-just-another-name-for-
cat-2/
16
/ (ˈpiːtəsən) /
/ (ˈpiːtəsən) /
P
ˈp
e
i
…
…
n
n
17
“It is a capital mistake to
theorise before one has
data. Insensibly one begins
to twist facts to suit theories,
instead of theories to suit
facts.”
Arthur Conan Doyle, Sherlock
Holmes
18
19
20
21
22
The Machine Learning Reproducibility Checklist
(Version 1.2, Mar.27 2019)- Joelle Pineau
−For all models and algorithms presented, check
if you include:
−A clear description of the mathematical setting,
algorithm, and/or model.
−An analysis of the complexity (time, space, sample
size) of any algorithm.
−A link to a downloadable source code, with
specification of all dependencies, including external
libraries.
23
Reproduced from: www.cs.mcgill.ca/~jpineau/ReproducibilityChecklist.pdf
The Machine Learning Reproducibility Checklist
− For any theoretical claim, check if you include:
−A statement of the result.
−A clear explanation of any assumptions.
−A complete proof of the claim.
24
Reproduced from: www.cs.mcgill.ca/~jpineau/ReproducibilityChecklist.pdf
The Machine Learning Reproducibility Checklist
− For all figures and tables that present empirical results,
check if you include:
− A complete description of the data collection process, including
sample size.
− A link to a downloadable version of the dataset or simulation
environment.
− An explanation of any data that were excluded, description of
any pre-processing step.
− An explanation of how samples were allocated for training/
validation/ testing.
25
Reproduced from: www.cs.mcgill.ca/~jpineau/ReproducibilityChecklist.pdf
The Machine Learning Reproducibility Checklist
− The range of hyper-parameters considered, method to select the
best hyper-parameter configuration, and specification of all hyper-
parameters used to generate results.
− The exact number of evaluation runs.
− A description of how experiments were run.
− A clear definition of the specific measure or statistics used to report
results.
− Clearly defined error bars.
− A description of results with central tendency (e.g. mean) &
variation(e.g. stddev).
− A description of the computing infrastructure used.
26
Reproduced from: www.cs.mcgill.ca/~jpineau/ReproducibilityChecklist.pdf
27
https://arxiv.org/pdf/1810.03993.pdf
Reproducibility in ML
”The point of reproducibility isn’t to replicate the results
exactly. That would be nearly impossible given the
natural randomness in neural networks and variations in
hardware and code. Instead, the idea is to offer a road
map to reach the same conclusions as the original
research, especially when that involves deciding which
machine-learning system is best for a particular task.”
Jesse Dodge, Carnegie Mellon University
28
Reproduced from: https://www.wired.com/story/artificial-intelligence-confronts-
reproducibility-crisis/
29
Thank you.
30

More Related Content

PPT
Tree Top
PPTX
How to make deck to do trick's 1111.pptx
PPTX
B hive machine learning
PDF
Artificial Intelligence for Undergrads
PPT
Machine Learning ICS 273A
PPT
Machine Learning ICS 273A
PPTX
Google ai based personal assistant and i woring
PDF
Python and Machine Learning - BCN Python Meetup - 25th Sep 2014
Tree Top
How to make deck to do trick's 1111.pptx
B hive machine learning
Artificial Intelligence for Undergrads
Machine Learning ICS 273A
Machine Learning ICS 273A
Google ai based personal assistant and i woring
Python and Machine Learning - BCN Python Meetup - 25th Sep 2014

Similar to Reproducibility in machine learning (20)

PDF
Applied Machine Learning for Chemistry I (HSI2020)
PPT
2.17Mb ppt
PDF
Machine learning para tertulianos, by javier ramirez at teowaki
PPTX
Introduction of Machine Learning through case studies - Speaker: Thuong Dinh,...
PDF
Introduction to Applied Machine Learning for Data-Driven Science
PDF
Machine learning
PPT
MLlecture1.ppt
PPT
MLlecture1.ppt
PDF
Today is all about AI
PDF
01_introduction_ML.pdf
PDF
Introduction to ai
PDF
Natural Language Processing
PPTX
L1_intro.pptx
PPT
chapter1-introduction1.ppt
PDF
“Introducing Machine Learning and How to Teach Machines to See,” a Presentati...
PDF
01_introduction to machine learning algorithms and basics .pdf
PPTX
introduction to machine learning education.pptx
PPTX
Recent Advances in Natural Language Processing
PPTX
introduction to machine learning and ai.pptx
PPTX
Introduction to Machine Learning and AI.pptx
Applied Machine Learning for Chemistry I (HSI2020)
2.17Mb ppt
Machine learning para tertulianos, by javier ramirez at teowaki
Introduction of Machine Learning through case studies - Speaker: Thuong Dinh,...
Introduction to Applied Machine Learning for Data-Driven Science
Machine learning
MLlecture1.ppt
MLlecture1.ppt
Today is all about AI
01_introduction_ML.pdf
Introduction to ai
Natural Language Processing
L1_intro.pptx
chapter1-introduction1.ppt
“Introducing Machine Learning and How to Teach Machines to See,” a Presentati...
01_introduction to machine learning algorithms and basics .pdf
introduction to machine learning education.pptx
Recent Advances in Natural Language Processing
introduction to machine learning and ai.pptx
Introduction to Machine Learning and AI.pptx
Ad

More from PayamBarnaghi (20)

PPTX
Academic Research: A Survival Guide
PPTX
Search, Discovery and Analysis of Sensory Data Streams
PPTX
Internet Search: the past, present and the future
PPT
Scientific and Academic Research: A Survival Guide 
PPT
Lecture 8: IoT System Models and Applications
PPT
Lecture 7: Semantic Technologies and Interoperability
PPT
Lecture 6: IoT Data Processing
PPT
Lecture 5: Software platforms and services
PPT
Internet of Things for healthcare: data integration and security/privacy issu...
PPT
Scientific and Academic Research: A Survival Guide 
PPT
Semantic Technolgies for the Internet of Things
PPT
Internet of Things and Data Analytics for Smart Cities and eHealth
PPT
Spatial Data on the Web
PPT
IoT-Lite: A Lightweight Semantic Model for the Internet of Things
PPT
The Future is Cyber-Healthcare
PPT
Internet of Things: Concepts and Technologies
PPT
How to make cities "smarter"?
PPT
The Internet of Things: What's next?
PPT
Information Engineering in the Age of the Internet of Things
PPT
Smart Cities….Smart Future
Academic Research: A Survival Guide
Search, Discovery and Analysis of Sensory Data Streams
Internet Search: the past, present and the future
Scientific and Academic Research: A Survival Guide 
Lecture 8: IoT System Models and Applications
Lecture 7: Semantic Technologies and Interoperability
Lecture 6: IoT Data Processing
Lecture 5: Software platforms and services
Internet of Things for healthcare: data integration and security/privacy issu...
Scientific and Academic Research: A Survival Guide 
Semantic Technolgies for the Internet of Things
Internet of Things and Data Analytics for Smart Cities and eHealth
Spatial Data on the Web
IoT-Lite: A Lightweight Semantic Model for the Internet of Things
The Future is Cyber-Healthcare
Internet of Things: Concepts and Technologies
How to make cities "smarter"?
The Internet of Things: What's next?
Information Engineering in the Age of the Internet of Things
Smart Cities….Smart Future
Ad

Recently uploaded (20)

PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
Classroom Observation Tools for Teachers
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PDF
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
Microbial disease of the cardiovascular and lymphatic systems
PPTX
Week 4 Term 3 Study Techniques revisited.pptx
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
PPTX
Cell Structure & Organelles in detailed.
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PDF
Business Ethics Teaching Materials for college
PDF
Basic Mud Logging Guide for educational purpose
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
Complications of Minimal Access Surgery at WLH
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
Final Presentation General Medicine 03-08-2024.pptx
Classroom Observation Tools for Teachers
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
Module 4: Burden of Disease Tutorial Slides S2 2025
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
Microbial disease of the cardiovascular and lymphatic systems
Week 4 Term 3 Study Techniques revisited.pptx
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
Cell Structure & Organelles in detailed.
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
Business Ethics Teaching Materials for college
Basic Mud Logging Guide for educational purpose
Supply Chain Operations Speaking Notes -ICLT Program
Complications of Minimal Access Surgery at WLH
2.FourierTransform-ShortQuestionswithAnswers.pdf
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
O5-L3 Freight Transport Ops (International) V1.pdf

Reproducibility in machine learning

  • 1. Reproducibility in machine learning 1 Payam Barnaghi Centre for Vision, Speech and Signal Processing (CVSSP), University of Surrey Care Technology & Research Centre, The UK Dementia Research Institute
  • 2. 2
  • 3. 3
  • 4. 4 Source: Odd Erik Gundersen, the Norwegian University of Science and Technology in Trondheim, Science Magazine
  • 5. 5
  • 6. 6
  • 7. What is machine learning? How can we make the results of machine learning experiments more reproducible? 7
  • 8. 8 Leia: Yaté. Yaté. Yotó. (SUBTITLE: “I have come for the bounty on this Wookiee.”) C-3PO relays this message and Jabba says he’ll offer 25,000 for Chewie. Leia: Yotó. Yotó. (SUBTITLE: “50,000, no less.”) C-3PO relays this message and Jabba asks why he should pay so much. Leia: Eí yóto. The above isn’t subtitled, but Leia pulls out a bomb and activates it. c-3po: Because he’s holding a thermal detonator! Jabba is impressed by this and offers 35,000. Leia: Yató cha. The above isn’t subtitled, but Leia deactivates the bomb and puts it away. c-3po: He agrees. Order is restored. Adapted from: The Art of Language Invention: From Horse-Lords to Dark Elves, The Words Behind World-Building, by
  • 9. 9 Leia: Yaté. Yaté. Yotó. (SUBTITLE: “I have come for the bounty on this Wookiee.”) C-3PO relays this message and Jabba says he’ll offer 25,000 for Chewie. Leia: Yotó. Yotó. (SUBTITLE: “50,000, no less.”) C-3PO relays this message and Jabba asks why he should pay so much. Leia: Eí yóto. The above isn’t subtitled, but Leia pulls out a bomb and activates it. c-3po: Because he’s holding a thermal detonator! Jabba is impressed by this and offers 35,000. Leia: Yató cha. The above isn’t subtitled, but Leia deactivates the bomb and puts it away. c-3po: He agrees. Order is restored. Adapted from: The Art of Language Invention: From Horse-Lords to Dark Elves, The Words Behind World-Building, by
  • 10. 10
  • 12. However, don’t underestimate the human brain − For example, in German: −you have to distinguish between “the” and “a” articles −and then each one has four case forms −three genders −and singular and plural forms −and then the adjectives have to agree −and then there are verbs! −or, take this as an example: 12 Adapted from: The Art of Language Invention: From Horse-Lords to Dark Elves, The Words Behind World-Building, by (biáng)
  • 13. Peterson 13 / (ˈpiːtəsən) / Pear Peter/ Peter’s Son Bear Meter Ear Greta … …
  • 14. Payam 14 / (ˈpæːjʌm) / Pay + am or Pa + yam ‫پیام‬
  • 16. 16 / (ˈpiːtəsən) / / (ˈpiːtəsən) / P ˈp e i … … n n
  • 17. 17 “It is a capital mistake to theorise before one has data. Insensibly one begins to twist facts to suit theories, instead of theories to suit facts.” Arthur Conan Doyle, Sherlock Holmes
  • 18. 18
  • 19. 19
  • 20. 20
  • 21. 21
  • 22. 22
  • 23. The Machine Learning Reproducibility Checklist (Version 1.2, Mar.27 2019)- Joelle Pineau −For all models and algorithms presented, check if you include: −A clear description of the mathematical setting, algorithm, and/or model. −An analysis of the complexity (time, space, sample size) of any algorithm. −A link to a downloadable source code, with specification of all dependencies, including external libraries. 23 Reproduced from: www.cs.mcgill.ca/~jpineau/ReproducibilityChecklist.pdf
  • 24. The Machine Learning Reproducibility Checklist − For any theoretical claim, check if you include: −A statement of the result. −A clear explanation of any assumptions. −A complete proof of the claim. 24 Reproduced from: www.cs.mcgill.ca/~jpineau/ReproducibilityChecklist.pdf
  • 25. The Machine Learning Reproducibility Checklist − For all figures and tables that present empirical results, check if you include: − A complete description of the data collection process, including sample size. − A link to a downloadable version of the dataset or simulation environment. − An explanation of any data that were excluded, description of any pre-processing step. − An explanation of how samples were allocated for training/ validation/ testing. 25 Reproduced from: www.cs.mcgill.ca/~jpineau/ReproducibilityChecklist.pdf
  • 26. The Machine Learning Reproducibility Checklist − The range of hyper-parameters considered, method to select the best hyper-parameter configuration, and specification of all hyper- parameters used to generate results. − The exact number of evaluation runs. − A description of how experiments were run. − A clear definition of the specific measure or statistics used to report results. − Clearly defined error bars. − A description of results with central tendency (e.g. mean) & variation(e.g. stddev). − A description of the computing infrastructure used. 26 Reproduced from: www.cs.mcgill.ca/~jpineau/ReproducibilityChecklist.pdf
  • 28. Reproducibility in ML ”The point of reproducibility isn’t to replicate the results exactly. That would be nearly impossible given the natural randomness in neural networks and variations in hardware and code. Instead, the idea is to offer a road map to reach the same conclusions as the original research, especially when that involves deciding which machine-learning system is best for a particular task.” Jesse Dodge, Carnegie Mellon University 28 Reproduced from: https://www.wired.com/story/artificial-intelligence-confronts- reproducibility-crisis/
  • 29. 29