SlideShare a Scribd company logo
State of the Art in Machine Learning
Poul Petersen
BigML
2
What is ML?
“a field of study that gives computers the
ability to learn without being explicitly
programmed”
Professor Arthur Samuel, 1959
•What “ability to learn” do computers have?
•What does “explicitly programmed” mean?
Square Feet
Price
Sacramento Real Estate Prices by sq_ft
3
Ability to Learn
4
Ability to Learn
• Provide computer with examples of the relationship between
square footage X and price Y.
• The computer learns the equation of a line F() which fits the
examples.
• The computer can now predict the price of any home
knowing only the square footage: F( x ) = y
• This model is known as Linear Regression. There are other
types of models.
• You may have noticed there was some points that did not fit.
This is important!
5
Learning Problems (fit)
Under-fitting Over-fitting
• Model does not fit well enough
• Does not capture the underlying trend
of the data
• Change algorithm or features
• Model fits too well does not “generalize”
• Captures the noise or outliers of the
data
• Change algorithm or filter outliers
6
Learning Problems (missing)
• Missing values at training/prediction time
• Some algorithms can handle missing values, some no
• Missing data is sometimes important
• Replace missing values
• Predict missing values
7
Learning Problems (missing)
Missing@
Decision
Trees
KNN
Logistic
Regression
Naive
Bayes
Neural
Networks
Training Yes No No Yes Yes*
Prediction Yes No No Yes No
8
Not Explicitly Programmed
Control System
“Customers go on vacation… We need a program that
predicts if the light will come on when the control
system flips the switch.”
9
Not Explicitly Programmed
@iLoveRuby
Switch Light?
on TRUE
off FALSE
• @iLoveRuby reasoned the rules by experience
• Then programmed the rules explicitly
10
Not Explicitly Programmed
Switch Light?
on TRUE
off FALSE
• The ML System reasoned the rules from the data and
created a Model
• Functionally the Model is the same as @iLoveRuby’s
explicit model
@iLoveML
question
answer
ML System Model
11
Not Explicitly Programmed
• how long has the bulb has been in service
• reliability of brand
• rated power: higher power shorter life
• duty cycle
• room conditions: temperature, humidity
Goal is to predict if the bulb will come on

but the switch is not the important variable:
Even worse: None of these conditions is absolute.
12
Not Explicitly Programmed
brand power age duty temp humidity FAIL?
koala 45 338 1 16 0.03 FALSE
otter 15 140 1 27 0.27 FALSE
koala 15 315 1 19 0.37 TRUE
otter 45 338 1 29 0.27 TRUE
koala 45 211 1 23 0.85 TRUE
otter 15 328 1 17 0.56 FALSE
koala 15 318 2 22 0.45 TRUE
koala 15 273 1 27 0.18 FALSE
koala 45 102 1 21 0.48 FALSE
koala 15 110 2 15 0.99 TRUE
otter 45 355 2 15 0.01 FALSE
otter 15 69 1 24 0.70 FALSE
koala 15 69 1 24 0.70 FALSE
koala 15 337 2 27 0.83 TRUE
13
Not Explicitly Programmed
@iLoveRuby
• Multi-dimensional data is much harder to find rules
• Explicit program requires modification
brand power age duty temp humidity FAIL?
koala 45 338 1 16 0.03 FALSE
otter 15 140 1 27 0.27 FALSE
koala 15 315 1 19 0.37 TRUE
otter 45 338 1 29 0.27 TRUE
koala 45 211 1 23 0.85 TRUE
otter 15 328 1 17 0.56 FALSE
koala 15 318 2 22 0.45 TRUE
koala 15 273 1 27 0.18 FALSE
koala 45 102 1 21 0.48 FALSE
koala 15 110 2 15 0.99 TRUE
otter 45 355 2 15 0.01 FALSE
otter 15 69 1 24 0.70 FALSE
koala 15 69 1 24 0.70 FALSE
koala 15 337 2 27 0.83 TRUE
@iLoveML ML System Model
14
Not Explicitly Programmed
• ML System easily re-trains on new data
question
answer
brand power age duty temp humidity FAIL?
koala 45 338 1 16 0.03 FALSE
otter 15 140 1 27 0.27 FALSE
koala 15 315 1 19 0.37 TRUE
otter 45 338 1 29 0.27 TRUE
koala 45 211 1 23 0.85 TRUE
otter 15 328 1 17 0.56 FALSE
koala 15 318 2 22 0.45 TRUE
koala 15 273 1 27 0.18 FALSE
koala 45 102 1 21 0.48 FALSE
koala 15 110 2 15 0.99 TRUE
otter 45 355 2 15 0.01 FALSE
otter 15 69 1 24 0.70 FALSE
koala 15 69 1 24 0.70 FALSE
koala 15 337 2 27 0.83 TRUE
@iLoveML ML System Model
15
Terminology
input
prediction
brand power age duty temp humidity FAIL?
koala 45 338 1 16 0.03 FALSE
otter 15 140 1 27 0.27 FALSE
koala 15 315 1 19 0.37 TRUE
otter 45 338 1 29 0.27 TRUE
koala 45 211 1 23 0.85 TRUE
otter 15 328 1 17 0.56 FALSE
koala 15 318 2 22 0.45 TRUE
koala 15 273 1 27 0.18 FALSE
koala 45 102 1 21 0.48 FALSE
koala 15 110 2 15 0.99 TRUE
otter 45 355 2 15 0.01 FALSE
otter 15 69 1 24 0.70 FALSE
koala 15 69 1 24 0.70 FALSE
koala 15 337 2 27 0.83 TRUE
datasource
training
“putting into
production”
16
Supervised Learning
brand power age duty temp humidity FAIL?
koala 45 338 1 16 0.03 FALSE
otter 15 140 1 27 0.27 FALSE
koala 15 315 1 19 0.37 TRUE
otter 45 338 1 29 0.27 TRUE
koala 45 211 1 23 0.85 TRUE
otter 15 328 1 17 0.56 FALSE
koala 15 318 2 22 0.45 TRUE
features
instances
label
Labeled Data
17
Supervised Learning
animal state … proximity action
tiger hungry … close run
elephant happy … far take picture
Classification
animal state … proximity min_kmh
tiger hungry … close 70
hippo angry … far 10
Regression
label
animal state … proximity action1 action2
tiger hungry … close run look untasty
elephant happy … far take picture call friends
Multi-Label Classification
18
Unsupervised Learning
date customer account auth class zip amount
Mon Bob 3421 pin clothes 46140 135
Tue Bob 3421 sign food 46140 401
Tue Alice 2456 pin food 12222 234
Wed Sally 6788 pin gas 26339 94
Wed Bob 3421 pin tech 21350 2459
Wed Bob 3421 pin gas 46140 83
The Sally 6788 sign food 26339 51
features
instances
Unlabeled Data
19
Unsupervised Learning
date customer account auth class zip amount
Mon Bob 3421 pin clothes 46140 135
Tue Bob 3421 sign food 46140 401
Tue Alice 2456 pin food 12222 234
Wed Sally 6788 pin gas 26339 94
Wed Bob 3421 pin tech 21350 2459
Wed Bob 3421 pin gas 46140 83
The Sally 6788 sign food 26339 51
Clustering
date customer account auth class zip amount
Mon Bob 3421 pin clothes 46140 135
Tue Bob 3421 sign food 46140 401
Tue Alice 2456 pin food 12222 234
Wed Sally 6788 pin gas 26339 94
Wed Bob 3421 pin tech 21350 2459
Wed Bob 3421 pin gas 46140 83
The Sally 6788 sign food 26339 51
Anomaly Detection
similar
unusual
20
Semi-Supervised Learning
brand power age duty temp humidity FAIL?
koala 45 338 1 16 0.03 FALSE
otter 15 140 1 27 0.27
koala 15 315 1 19 0.37
otter 45 338 1 29 0.27 TRUE
koala 45 211 1 23 0.85 TRUE
otter 15 328 1 17 0.56
koala 15 318 2 22 0.45 TRUE
label
Labeled and Unlabeled Data
}infer
21
Data Types
numeric
1 2 3
1, 2.0, 3, -5.4 categoricaltrue, yes, red, mammal categoricalcategorical
A B C
DATE-TIME2013-09-25 10:02
DATE-TIME
YEAR
MONTH
DAY-OF-MONTH
YYYY-MM-DD
DAY-OF-WEEK
HOUR
MINUTE
YYYY-MM-DD
YYYY-MM-DD
M-T-W-T-F-S-D
HH:MM:SS
HH:MM:SS
2013
September
25
Wednesday
10
02
text
Be not afraid of greatness:
some are born great, some
achieve greatness, and
some have greatness
thrust upon 'em.
text
“great”
“afraid”
“born”
“some”
appears 2 times
appears 1 time
appears 1 time
appears 2 times
22
Text Analysis
Be not afraid of greatness:
some are born great, some
achieve greatness, and
some have greatness
thrust upon 'em.
22
Text Analysis
Be not afraid of greatness:
some are born great, some
achieve greatness, and
some have greatness
thrust upon 'em.
great: appears 4 times
23
Text Analysis
great afraid born achieve
4 1 1 1
… … … …
Be not afraid of greatness:
some are born great, some achieve
greatness, and some have greatness
thrust upon ‘em.
Model
The token “great” 

does not occur
The token “afraid” 

occurs more than once
24
Topic Modeling
25
Brief History of ML
1950 1960 1970 1980 1990 2000 2010
Perceptron
Neural
Networks
Ensembles
Support Vector Machines
Boosting
Interpretability
Rosenblatt, 1957
Quinlan, 1979 (ID3),
Minsky, 1969
Vapnik, 1963 Corina & Vapnik, 1995
Schapire, 1989 (Boosting)
Schapire, 1995 (Adaboost)
Breiman, 2001 (Random Forests)
Breiman, 1994 (Bagging)
Deep Learning
Hinton, 2006Fukushima, 1989 (ANN)
Breiman, 1984 (CART)
2020
+
-
Decision Trees
26
Why ML Now?
• Decreasing cost of data
• Abundant computing power, especially cloud
• Machine Learning APIs
• Abundance of APIs + internet to combine easily
27
Composability
28
The Stages of a ML App
State the problem
Data Wrangling
Feature Engineering
Learning
Deploying
Predicting
Measuring Impact

More Related Content

PPT
Pipelining & All Hazards Solution
PPTX
Planning in AI(Partial order planning)
PDF
Project presentation on Phone Book
PPT
Heuristic Search Techniques {Artificial Intelligence}
PPTX
Dendral
PDF
Emotion detection using cnn.pptx
PPTX
The world of loss function
PPTX
Double DES & Triple DES
Pipelining & All Hazards Solution
Planning in AI(Partial order planning)
Project presentation on Phone Book
Heuristic Search Techniques {Artificial Intelligence}
Dendral
Emotion detection using cnn.pptx
The world of loss function
Double DES & Triple DES

What's hot (20)

PDF
Ddb 1.6-design issues
PDF
Data base management system LAB MANUAL KCS 551.pdf
PPTX
Cache coherence problem and its solutions
PPTX
PROCEDURAL AND DECLARATIVE KNOWLEDGE IN AI & ML (1).pptx
PDF
CRYPTOGRAPHY AND NETWORK SECURITY
PPTX
Operating system memory management
PPT
Ll(1) Parser in Compilers
PPTX
RSA Algorithm
PDF
Computer graphics lab report with code in cpp
PPT
Game Playing in Artificial Intelligence
PPTX
Lecture 16 memory bounded search
PPTX
Artificial Intelligence Searching Techniques
PDF
Sensor Cloud
PDF
HPPS: Heart Problem Prediction System using Machine Learning
PPTX
Deep Learning Explained
PPTX
Critical section problem in operating system.
PDF
Smart attendance system using facial recognition
PDF
Birthday Paradox explained
PPTX
Feistel cipher
PPTX
Secure Hash Algorithm (SHA-512)
Ddb 1.6-design issues
Data base management system LAB MANUAL KCS 551.pdf
Cache coherence problem and its solutions
PROCEDURAL AND DECLARATIVE KNOWLEDGE IN AI & ML (1).pptx
CRYPTOGRAPHY AND NETWORK SECURITY
Operating system memory management
Ll(1) Parser in Compilers
RSA Algorithm
Computer graphics lab report with code in cpp
Game Playing in Artificial Intelligence
Lecture 16 memory bounded search
Artificial Intelligence Searching Techniques
Sensor Cloud
HPPS: Heart Problem Prediction System using Machine Learning
Deep Learning Explained
Critical section problem in operating system.
Smart attendance system using facial recognition
Birthday Paradox explained
Feistel cipher
Secure Hash Algorithm (SHA-512)
Ad

Viewers also liked (20)

PDF
L5. Data Transformation and Feature Engineering
PPTX
Feature Engineering
PDF
Introduction to Machine Learning and Deep Learning
PPTX
Introduction to Machine Learning
PPTX
Introduction to Machine Learning
PDF
A field guide the machine learning zoo
PPT
Introduction to Machine Learning* Prof. D. Spears
PPT
introducción a Machine Learning
PDF
Machine Learning - Where to Next?, May 2015
PPTX
Ai history to-m-learning
PDF
An explanation of machine learning for business
PDF
Machine learning Lecture 1
PDF
BSSML16 L7. Feature Engineering
PDF
A brief history of machine learning
PPTX
Make Sense Out of Data with Feature Engineering
PDF
Machine Learning in R
PDF
Microservices, containers, and machine learning
PPTX
Machine learning workshop @DYP Pune
PDF
機器學習速遊
PDF
Machine Learning
L5. Data Transformation and Feature Engineering
Feature Engineering
Introduction to Machine Learning and Deep Learning
Introduction to Machine Learning
Introduction to Machine Learning
A field guide the machine learning zoo
Introduction to Machine Learning* Prof. D. Spears
introducción a Machine Learning
Machine Learning - Where to Next?, May 2015
Ai history to-m-learning
An explanation of machine learning for business
Machine learning Lecture 1
BSSML16 L7. Feature Engineering
A brief history of machine learning
Make Sense Out of Data with Feature Engineering
Machine Learning in R
Microservices, containers, and machine learning
Machine learning workshop @DYP Pune
機器學習速遊
Machine Learning
Ad

Similar to L1. State of the Art in Machine Learning (7)

PDF
MLSEV. Models, Evaluations and Ensembles
PDF
DutchMLSchool. Models, Evaluations, and Ensembles
PDF
2013 11-06 lsr-dublin_m_hausenblas_solr as recommendation engine
PDF
VSSML17 L3. Clusters and Anomaly Detection
PDF
Deep Learning and Design Thinking
PPT
23T1W3 Mathematics ppt on Number Sequence.ppt
PDF
Finding needles in haystacks with deep neural networks
MLSEV. Models, Evaluations and Ensembles
DutchMLSchool. Models, Evaluations, and Ensembles
2013 11-06 lsr-dublin_m_hausenblas_solr as recommendation engine
VSSML17 L3. Clusters and Anomaly Detection
Deep Learning and Design Thinking
23T1W3 Mathematics ppt on Number Sequence.ppt
Finding needles in haystacks with deep neural networks

More from Machine Learning Valencia (14)

PPTX
From Turing To Humanoid Robots - Ramón López de Mántaras
PPTX
Artificial Intelligence Progress - Tom Dietterich
PDF
LR2. Summary Day 2
PDF
L15. Machine Learning - Black Art
PDF
L14. Anomaly Detection
PDF
L13. Cluster Analysis
PDF
L9. Real World Machine Learning - Cooking Predictions
PDF
L11. The Future of Machine Learning
PDF
L7. A developers’ overview of the world of predictive APIs
PDF
LR1. Summary Day 1
PDF
L6. Unbalanced Datasets
PDF
L4. Ensembles of Decision Trees
PDF
L3. Decision Trees
PDF
L2. Evaluating Machine Learning Algorithms I
From Turing To Humanoid Robots - Ramón López de Mántaras
Artificial Intelligence Progress - Tom Dietterich
LR2. Summary Day 2
L15. Machine Learning - Black Art
L14. Anomaly Detection
L13. Cluster Analysis
L9. Real World Machine Learning - Cooking Predictions
L11. The Future of Machine Learning
L7. A developers’ overview of the world of predictive APIs
LR1. Summary Day 1
L6. Unbalanced Datasets
L4. Ensembles of Decision Trees
L3. Decision Trees
L2. Evaluating Machine Learning Algorithms I

Recently uploaded (20)

PPTX
Introduction to Knowledge Engineering Part 1
PPTX
Computer network topology notes for revision
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PDF
[EN] Industrial Machine Downtime Prediction
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPT
Reliability_Chapter_ presentation 1221.5784
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PDF
Business Analytics and business intelligence.pdf
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PDF
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
PDF
Mega Projects Data Mega Projects Data
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PDF
Lecture1 pattern recognition............
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PPTX
IB Computer Science - Internal Assessment.pptx
Introduction to Knowledge Engineering Part 1
Computer network topology notes for revision
STUDY DESIGN details- Lt Col Maksud (21).pptx
[EN] Industrial Machine Downtime Prediction
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Reliability_Chapter_ presentation 1221.5784
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Business Analytics and business intelligence.pdf
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
Mega Projects Data Mega Projects Data
IBA_Chapter_11_Slides_Final_Accessible.pptx
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Optimise Shopper Experiences with a Strong Data Estate.pdf
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Lecture1 pattern recognition............
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
IB Computer Science - Internal Assessment.pptx

L1. State of the Art in Machine Learning

  • 1. State of the Art in Machine Learning Poul Petersen BigML
  • 2. 2 What is ML? “a field of study that gives computers the ability to learn without being explicitly programmed” Professor Arthur Samuel, 1959 •What “ability to learn” do computers have? •What does “explicitly programmed” mean?
  • 3. Square Feet Price Sacramento Real Estate Prices by sq_ft 3 Ability to Learn
  • 4. 4 Ability to Learn • Provide computer with examples of the relationship between square footage X and price Y. • The computer learns the equation of a line F() which fits the examples. • The computer can now predict the price of any home knowing only the square footage: F( x ) = y • This model is known as Linear Regression. There are other types of models. • You may have noticed there was some points that did not fit. This is important!
  • 5. 5 Learning Problems (fit) Under-fitting Over-fitting • Model does not fit well enough • Does not capture the underlying trend of the data • Change algorithm or features • Model fits too well does not “generalize” • Captures the noise or outliers of the data • Change algorithm or filter outliers
  • 6. 6 Learning Problems (missing) • Missing values at training/prediction time • Some algorithms can handle missing values, some no • Missing data is sometimes important • Replace missing values • Predict missing values
  • 8. 8 Not Explicitly Programmed Control System “Customers go on vacation… We need a program that predicts if the light will come on when the control system flips the switch.”
  • 9. 9 Not Explicitly Programmed @iLoveRuby Switch Light? on TRUE off FALSE • @iLoveRuby reasoned the rules by experience • Then programmed the rules explicitly
  • 10. 10 Not Explicitly Programmed Switch Light? on TRUE off FALSE • The ML System reasoned the rules from the data and created a Model • Functionally the Model is the same as @iLoveRuby’s explicit model @iLoveML question answer ML System Model
  • 11. 11 Not Explicitly Programmed • how long has the bulb has been in service • reliability of brand • rated power: higher power shorter life • duty cycle • room conditions: temperature, humidity Goal is to predict if the bulb will come on but the switch is not the important variable: Even worse: None of these conditions is absolute.
  • 12. 12 Not Explicitly Programmed brand power age duty temp humidity FAIL? koala 45 338 1 16 0.03 FALSE otter 15 140 1 27 0.27 FALSE koala 15 315 1 19 0.37 TRUE otter 45 338 1 29 0.27 TRUE koala 45 211 1 23 0.85 TRUE otter 15 328 1 17 0.56 FALSE koala 15 318 2 22 0.45 TRUE koala 15 273 1 27 0.18 FALSE koala 45 102 1 21 0.48 FALSE koala 15 110 2 15 0.99 TRUE otter 45 355 2 15 0.01 FALSE otter 15 69 1 24 0.70 FALSE koala 15 69 1 24 0.70 FALSE koala 15 337 2 27 0.83 TRUE
  • 13. 13 Not Explicitly Programmed @iLoveRuby • Multi-dimensional data is much harder to find rules • Explicit program requires modification brand power age duty temp humidity FAIL? koala 45 338 1 16 0.03 FALSE otter 15 140 1 27 0.27 FALSE koala 15 315 1 19 0.37 TRUE otter 45 338 1 29 0.27 TRUE koala 45 211 1 23 0.85 TRUE otter 15 328 1 17 0.56 FALSE koala 15 318 2 22 0.45 TRUE koala 15 273 1 27 0.18 FALSE koala 45 102 1 21 0.48 FALSE koala 15 110 2 15 0.99 TRUE otter 45 355 2 15 0.01 FALSE otter 15 69 1 24 0.70 FALSE koala 15 69 1 24 0.70 FALSE koala 15 337 2 27 0.83 TRUE
  • 14. @iLoveML ML System Model 14 Not Explicitly Programmed • ML System easily re-trains on new data question answer brand power age duty temp humidity FAIL? koala 45 338 1 16 0.03 FALSE otter 15 140 1 27 0.27 FALSE koala 15 315 1 19 0.37 TRUE otter 45 338 1 29 0.27 TRUE koala 45 211 1 23 0.85 TRUE otter 15 328 1 17 0.56 FALSE koala 15 318 2 22 0.45 TRUE koala 15 273 1 27 0.18 FALSE koala 45 102 1 21 0.48 FALSE koala 15 110 2 15 0.99 TRUE otter 45 355 2 15 0.01 FALSE otter 15 69 1 24 0.70 FALSE koala 15 69 1 24 0.70 FALSE koala 15 337 2 27 0.83 TRUE
  • 15. @iLoveML ML System Model 15 Terminology input prediction brand power age duty temp humidity FAIL? koala 45 338 1 16 0.03 FALSE otter 15 140 1 27 0.27 FALSE koala 15 315 1 19 0.37 TRUE otter 45 338 1 29 0.27 TRUE koala 45 211 1 23 0.85 TRUE otter 15 328 1 17 0.56 FALSE koala 15 318 2 22 0.45 TRUE koala 15 273 1 27 0.18 FALSE koala 45 102 1 21 0.48 FALSE koala 15 110 2 15 0.99 TRUE otter 45 355 2 15 0.01 FALSE otter 15 69 1 24 0.70 FALSE koala 15 69 1 24 0.70 FALSE koala 15 337 2 27 0.83 TRUE datasource training “putting into production”
  • 16. 16 Supervised Learning brand power age duty temp humidity FAIL? koala 45 338 1 16 0.03 FALSE otter 15 140 1 27 0.27 FALSE koala 15 315 1 19 0.37 TRUE otter 45 338 1 29 0.27 TRUE koala 45 211 1 23 0.85 TRUE otter 15 328 1 17 0.56 FALSE koala 15 318 2 22 0.45 TRUE features instances label Labeled Data
  • 17. 17 Supervised Learning animal state … proximity action tiger hungry … close run elephant happy … far take picture Classification animal state … proximity min_kmh tiger hungry … close 70 hippo angry … far 10 Regression label animal state … proximity action1 action2 tiger hungry … close run look untasty elephant happy … far take picture call friends Multi-Label Classification
  • 18. 18 Unsupervised Learning date customer account auth class zip amount Mon Bob 3421 pin clothes 46140 135 Tue Bob 3421 sign food 46140 401 Tue Alice 2456 pin food 12222 234 Wed Sally 6788 pin gas 26339 94 Wed Bob 3421 pin tech 21350 2459 Wed Bob 3421 pin gas 46140 83 The Sally 6788 sign food 26339 51 features instances Unlabeled Data
  • 19. 19 Unsupervised Learning date customer account auth class zip amount Mon Bob 3421 pin clothes 46140 135 Tue Bob 3421 sign food 46140 401 Tue Alice 2456 pin food 12222 234 Wed Sally 6788 pin gas 26339 94 Wed Bob 3421 pin tech 21350 2459 Wed Bob 3421 pin gas 46140 83 The Sally 6788 sign food 26339 51 Clustering date customer account auth class zip amount Mon Bob 3421 pin clothes 46140 135 Tue Bob 3421 sign food 46140 401 Tue Alice 2456 pin food 12222 234 Wed Sally 6788 pin gas 26339 94 Wed Bob 3421 pin tech 21350 2459 Wed Bob 3421 pin gas 46140 83 The Sally 6788 sign food 26339 51 Anomaly Detection similar unusual
  • 20. 20 Semi-Supervised Learning brand power age duty temp humidity FAIL? koala 45 338 1 16 0.03 FALSE otter 15 140 1 27 0.27 koala 15 315 1 19 0.37 otter 45 338 1 29 0.27 TRUE koala 45 211 1 23 0.85 TRUE otter 15 328 1 17 0.56 koala 15 318 2 22 0.45 TRUE label Labeled and Unlabeled Data }infer
  • 21. 21 Data Types numeric 1 2 3 1, 2.0, 3, -5.4 categoricaltrue, yes, red, mammal categoricalcategorical A B C DATE-TIME2013-09-25 10:02 DATE-TIME YEAR MONTH DAY-OF-MONTH YYYY-MM-DD DAY-OF-WEEK HOUR MINUTE YYYY-MM-DD YYYY-MM-DD M-T-W-T-F-S-D HH:MM:SS HH:MM:SS 2013 September 25 Wednesday 10 02 text Be not afraid of greatness: some are born great, some achieve greatness, and some have greatness thrust upon 'em. text “great” “afraid” “born” “some” appears 2 times appears 1 time appears 1 time appears 2 times
  • 22. 22 Text Analysis Be not afraid of greatness: some are born great, some achieve greatness, and some have greatness thrust upon 'em.
  • 23. 22 Text Analysis Be not afraid of greatness: some are born great, some achieve greatness, and some have greatness thrust upon 'em. great: appears 4 times
  • 24. 23 Text Analysis great afraid born achieve 4 1 1 1 … … … … Be not afraid of greatness: some are born great, some achieve greatness, and some have greatness thrust upon ‘em. Model The token “great” does not occur The token “afraid” occurs more than once
  • 26. 25 Brief History of ML 1950 1960 1970 1980 1990 2000 2010 Perceptron Neural Networks Ensembles Support Vector Machines Boosting Interpretability Rosenblatt, 1957 Quinlan, 1979 (ID3), Minsky, 1969 Vapnik, 1963 Corina & Vapnik, 1995 Schapire, 1989 (Boosting) Schapire, 1995 (Adaboost) Breiman, 2001 (Random Forests) Breiman, 1994 (Bagging) Deep Learning Hinton, 2006Fukushima, 1989 (ANN) Breiman, 1984 (CART) 2020 + - Decision Trees
  • 27. 26 Why ML Now? • Decreasing cost of data • Abundant computing power, especially cloud • Machine Learning APIs • Abundance of APIs + internet to combine easily
  • 29. 28 The Stages of a ML App State the problem Data Wrangling Feature Engineering Learning Deploying Predicting Measuring Impact