SlideShare a Scribd company logo
Machine Learning
Nicolas P. Rougier
July 2, 2018
Inria – Institute of Neurodegenerative Diseases – Bordeaux
What to learn?
Considering an input x (that can be an image, a text, a vector, a
sequence, a set of features, a scalar, etc.)
Classification
To find the category Cx of some input x
Regression
To find the value f(x) of some input x
Clustering
To divide all inputs {x} into distinct groups
Optimization
To find a sequence {ai} such as to optimize
∑
f(ai)
1
Classification
We know that (x1, y1) is blue, (x2, y2) is blue, (x3, y3) is red., etc.
What is the color of (x, y)?
2
Regression
We know that f(x1) = y1, f(x2) = y2, f(x3) = y3, etc.
x
What is the value of f(x)?
3
Clustering
We have (x1, y1), (x2, y2), (x3, y3), etc.
What group (x, y) belongs to?
4
Optimization
You chose C and got no reward, you chose F and got no reward, you
chose N and got no reward, etc.
A
B
C
D
E
F
G
H
I
J
K
L
M
N
O
P
What do you choose next?
5
How to learn?
Supervised learning
To learn a function that maps an input to an output based on
example input-output pairs
Reinforcement learning
To find a policy that maps states to action such as to maximize
long-term reward
Unsupervised learning
To infer a function that describes the structure of some (unlabeled)
data
6
Machine learning ̸= Human learning
To learn anything, at any time, using any method and a few samples.
Explicit & implicit learning, lifelong learning, imitation learning,
one-shot learning, declarative and procedural memory, short and
long term memory, working memory, forgetting, motivation, emotion,
etc.
7
The machine-learning pipeline
Data (60% of your time)
Analysis, acquisition, cleaning,
preparation, augmentation, balance
Model (30% of your time)
Hyperparameters, pre-training, training,
tweaking, testing
Deployment (10% of your time)
Optimization, dockerization, evaluation
8
Common pitfalls
Data
Small, unbalanced, biased, leakage, variance, etc
Model
Initialization, local minima, curse of dimensionality, etc
Deployment
Scalability, fragility, ethics, etc
9
How big is “big”?
Deep convolutional network (Hinton, 2010)
• 5 convolutional layers
• 650,000 units
• 60,000,000 parameters
• 1,200,000 train images
• 100,000 test images
• 1,000 classes
Latest OpenAI DOTA player (June 2018): 180 years worth of games, 256
GPUs and 128,000 CPU cores.
10
Do I need deep learning?
If all you have is a hammer, everything looks like a nail.
11
Conclusion
Lot of challenges ahead
• Learn with a few samples
• Lifelong learning
• Transfer learning
• Unbiased learning
• Robust learning
• Unsupervised learning
• Reinforcement learning
Lot of opportunities as well...
12

More Related Content

PDF
LR1. Summary Day 1
PPTX
Interpreting machine learning models
PDF
L3. Decision Trees
PDF
A Random Forest Approach To Skin Detection With R
PDF
An Introduction to Supervised Machine Learning and Pattern Classification: Th...
PDF
L13. Cluster Analysis
PPT
Active learning lecture
PDF
Building Random Forest at Scale
LR1. Summary Day 1
Interpreting machine learning models
L3. Decision Trees
A Random Forest Approach To Skin Detection With R
An Introduction to Supervised Machine Learning and Pattern Classification: Th...
L13. Cluster Analysis
Active learning lecture
Building Random Forest at Scale

What's hot (20)

PDF
Jan vitek distributedrandomforest_5-2-2013
PDF
CudaTree (GTC 2014)
PPTX
Session 06 machine learning.pptx
PDF
Random forest using apache mahout
PPTX
Machine Learning - Dummy Variable Conversion
PDF
Introduction to Data Analytics with R
PDF
Gradient Boosted Regression Trees in scikit-learn
PPTX
Decision trees
PDF
Overview of tree algorithms from decision tree to xgboost
PPTX
Improvement of id3 algorithm based on simplified information entropy and coor...
ODP
Machine Learning with Decision trees
PDF
CVPR2015 reading "Global refinement of random forest"
PDF
[系列活動] Machine Learning 機器學習課程
PPTX
Comparison Study of Decision Tree Ensembles for Regression
PPT
Chapter II.6 (Book Part VI) Learning
DOCX
Dystopian infogramchecklist
PPTX
Machine Learning
PDF
Introduction to Some Tree based Learning Method
PDF
Ml intro
PPTX
Self taught clustering
Jan vitek distributedrandomforest_5-2-2013
CudaTree (GTC 2014)
Session 06 machine learning.pptx
Random forest using apache mahout
Machine Learning - Dummy Variable Conversion
Introduction to Data Analytics with R
Gradient Boosted Regression Trees in scikit-learn
Decision trees
Overview of tree algorithms from decision tree to xgboost
Improvement of id3 algorithm based on simplified information entropy and coor...
Machine Learning with Decision trees
CVPR2015 reading "Global refinement of random forest"
[系列活動] Machine Learning 機器學習課程
Comparison Study of Decision Tree Ensembles for Regression
Chapter II.6 (Book Part VI) Learning
Dystopian infogramchecklist
Machine Learning
Introduction to Some Tree based Learning Method
Ml intro
Self taught clustering
Ad

Similar to Machine learning (20)

PDF
know Machine Learning Basic Concepts.pdf
PDF
Presentation-19.08.2024hvug7gugyvuvugugugugugug
PDF
Machine Learning ebook.pdf
PDF
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
PPT
Useful Techniques in Artificial Intelligence
PDF
Machine Learning : why we should know and how it works
PDF
When Classifier Selection meets Information Theory: A Unifying View
PDF
Gradient Boosted Regression Trees in Scikit Learn by Gilles Louppe & Peter Pr...
PPTX
Introduction to Machine Learning
PDF
Machine learning for_finance
PDF
IndabaX Ghana Poster.pdf
PDF
LR2. Summary Day 2
PPTX
Machine learning ppt unit one syllabuspptx
PPT
Machine Learning: Decision Trees Chapter 18.1-18.3
PPT
Machine Learning: Foundations Course Number 0368403401
PPTX
Classification: MNIST, training a Binary classifier, performance measure, mul...
PPT
Introduction to Machine Learning.
PDF
Deep Learning: concepts and use cases (October 2018)
PPT
Introduction
PDF
Chris Dyer - 2017 - Neural MT Workshop Invited Talk: The Neural Noisy Channel...
know Machine Learning Basic Concepts.pdf
Presentation-19.08.2024hvug7gugyvuvugugugugugug
Machine Learning ebook.pdf
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
Useful Techniques in Artificial Intelligence
Machine Learning : why we should know and how it works
When Classifier Selection meets Information Theory: A Unifying View
Gradient Boosted Regression Trees in Scikit Learn by Gilles Louppe & Peter Pr...
Introduction to Machine Learning
Machine learning for_finance
IndabaX Ghana Poster.pdf
LR2. Summary Day 2
Machine learning ppt unit one syllabuspptx
Machine Learning: Decision Trees Chapter 18.1-18.3
Machine Learning: Foundations Course Number 0368403401
Classification: MNIST, training a Binary classifier, performance measure, mul...
Introduction to Machine Learning.
Deep Learning: concepts and use cases (October 2018)
Introduction
Chris Dyer - 2017 - Neural MT Workshop Invited Talk: The Neural Noisy Channel...
Ad

More from Nicolas Rougier (11)

PDF
The Art of Braincrafting
PDF
Neural fields, a cognitive approach
PDF
SIGGRAPH 2018 - Digital typography
PDF
Modern OpenGL scientific visualization
PDF
Open science
PDF
On ne voit que ce que l’on regarde (French)
PDF
Cybercriminalité (French)
PDF
Where is my mind?
PDF
Scientific visualization
PDF
ReScience
PDF
One critic, two actors, one decision
The Art of Braincrafting
Neural fields, a cognitive approach
SIGGRAPH 2018 - Digital typography
Modern OpenGL scientific visualization
Open science
On ne voit que ce que l’on regarde (French)
Cybercriminalité (French)
Where is my mind?
Scientific visualization
ReScience
One critic, two actors, one decision

Recently uploaded (20)

PDF
Nante Industrial Plug Factory: Engineering Quality for Modern Power Applications
PDF
Deliverable file - Regulatory guideline analysis.pdf
PDF
SIMNET Inc – 2023’s Most Trusted IT Services & Solution Provider
PPTX
Sales & Distribution Management , LOGISTICS, Distribution, Sales Managers
PDF
Module 2 - Modern Supervison Challenges - Student Resource.pdf
PDF
Charisse Litchman: A Maverick Making Neurological Care More Accessible
PPTX
sales presentation، Training Overview.pptx
PDF
How to Get Funding for Your Trucking Business
PDF
Digital Marketing & E-commerce Certificate Glossary.pdf.................
PDF
Technical Architecture - Chainsys dataZap
PDF
NEW - FEES STRUCTURES (01-july-2024).pdf
PDF
NISM Series V-A MFD Workbook v December 2024.khhhjtgvwevoypdnew one must use ...
PDF
Family Law: The Role of Communication in Mediation (www.kiu.ac.ug)
PPTX
TRAINNING, DEVELOPMENT AND APPRAISAL.pptx
PDF
ANALYZING THE OPPORTUNITIES OF DIGITAL MARKETING IN BANGLADESH TO PROVIDE AN ...
PPTX
2025 Product Deck V1.0.pptxCATALOGTCLCIA
PDF
Module 3 - Functions of the Supervisor - Part 1 - Student Resource (1).pdf
PDF
IFRS Notes in your pocket for study all the time
PDF
kom-180-proposal-for-a-directive-amending-directive-2014-45-eu-and-directive-...
PDF
How to Get Approval for Business Funding
Nante Industrial Plug Factory: Engineering Quality for Modern Power Applications
Deliverable file - Regulatory guideline analysis.pdf
SIMNET Inc – 2023’s Most Trusted IT Services & Solution Provider
Sales & Distribution Management , LOGISTICS, Distribution, Sales Managers
Module 2 - Modern Supervison Challenges - Student Resource.pdf
Charisse Litchman: A Maverick Making Neurological Care More Accessible
sales presentation، Training Overview.pptx
How to Get Funding for Your Trucking Business
Digital Marketing & E-commerce Certificate Glossary.pdf.................
Technical Architecture - Chainsys dataZap
NEW - FEES STRUCTURES (01-july-2024).pdf
NISM Series V-A MFD Workbook v December 2024.khhhjtgvwevoypdnew one must use ...
Family Law: The Role of Communication in Mediation (www.kiu.ac.ug)
TRAINNING, DEVELOPMENT AND APPRAISAL.pptx
ANALYZING THE OPPORTUNITIES OF DIGITAL MARKETING IN BANGLADESH TO PROVIDE AN ...
2025 Product Deck V1.0.pptxCATALOGTCLCIA
Module 3 - Functions of the Supervisor - Part 1 - Student Resource (1).pdf
IFRS Notes in your pocket for study all the time
kom-180-proposal-for-a-directive-amending-directive-2014-45-eu-and-directive-...
How to Get Approval for Business Funding

Machine learning

  • 1. Machine Learning Nicolas P. Rougier July 2, 2018 Inria – Institute of Neurodegenerative Diseases – Bordeaux
  • 2. What to learn? Considering an input x (that can be an image, a text, a vector, a sequence, a set of features, a scalar, etc.) Classification To find the category Cx of some input x Regression To find the value f(x) of some input x Clustering To divide all inputs {x} into distinct groups Optimization To find a sequence {ai} such as to optimize ∑ f(ai) 1
  • 3. Classification We know that (x1, y1) is blue, (x2, y2) is blue, (x3, y3) is red., etc. What is the color of (x, y)? 2
  • 4. Regression We know that f(x1) = y1, f(x2) = y2, f(x3) = y3, etc. x What is the value of f(x)? 3
  • 5. Clustering We have (x1, y1), (x2, y2), (x3, y3), etc. What group (x, y) belongs to? 4
  • 6. Optimization You chose C and got no reward, you chose F and got no reward, you chose N and got no reward, etc. A B C D E F G H I J K L M N O P What do you choose next? 5
  • 7. How to learn? Supervised learning To learn a function that maps an input to an output based on example input-output pairs Reinforcement learning To find a policy that maps states to action such as to maximize long-term reward Unsupervised learning To infer a function that describes the structure of some (unlabeled) data 6
  • 8. Machine learning ̸= Human learning To learn anything, at any time, using any method and a few samples. Explicit & implicit learning, lifelong learning, imitation learning, one-shot learning, declarative and procedural memory, short and long term memory, working memory, forgetting, motivation, emotion, etc. 7
  • 9. The machine-learning pipeline Data (60% of your time) Analysis, acquisition, cleaning, preparation, augmentation, balance Model (30% of your time) Hyperparameters, pre-training, training, tweaking, testing Deployment (10% of your time) Optimization, dockerization, evaluation 8
  • 10. Common pitfalls Data Small, unbalanced, biased, leakage, variance, etc Model Initialization, local minima, curse of dimensionality, etc Deployment Scalability, fragility, ethics, etc 9
  • 11. How big is “big”? Deep convolutional network (Hinton, 2010) • 5 convolutional layers • 650,000 units • 60,000,000 parameters • 1,200,000 train images • 100,000 test images • 1,000 classes Latest OpenAI DOTA player (June 2018): 180 years worth of games, 256 GPUs and 128,000 CPU cores. 10
  • 12. Do I need deep learning? If all you have is a hammer, everything looks like a nail. 11
  • 13. Conclusion Lot of challenges ahead • Learn with a few samples • Lifelong learning • Transfer learning • Unbiased learning • Robust learning • Unsupervised learning • Reinforcement learning Lot of opportunities as well... 12