BigML Education
Supervised vs Unsupervised
July 2017
BigML Education Program - Supervised versus Unsupervised
Supervised Learning
1. Training data provides “examples” and “outcomes”
2. The machine learns to predict the outcome of new data
based on the past examples
SQFT    BEDS  BATHS  PRICE
3.125   5     3      530.000 $
2.100   4     2      460.000 $
1.200   3     1,5    250.000 $
3.950   6     4      ???
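As a rough illustration of "learning from past examples", the sketch below fits a straight line to the three priced homes and uses it to fill in the missing price. It reads the dots in the slide as thousands separators and "1,5" as one and a half baths, which is an assumption, and it uses only SQFT as a predictor. This is a plain least-squares line chosen for brevity, not the model type used in the slides.

```python
# Training rows from the slide: (sqft, beds, baths) -> price.
# Assumption: "3.125" means 3125 sq ft and "1,5" means 1.5 baths.
training = [
    ((3125, 5, 3.0), 530_000),
    ((2100, 4, 2.0), 460_000),
    ((1200, 3, 1.5), 250_000),
]
new_home = (3950, 6, 4.0)  # the row whose price is "???"


def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


# Use only SQFT as the predictor to keep the sketch small.
sqft = [features[0] for features, _ in training]
price = [p for _, p in training]
intercept, slope = fit_line(sqft, price)

predicted_price = intercept + slope * new_home[0]
print(f"Predicted price for {new_home[0]} sq ft: {predicted_price:,.0f} $")
```

With only three examples the estimate is very rough; the point is that the known prices are what make the prediction possible.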
Supervised Learning
[Figure: the example table annotated with “Label” and “Training”]
Supervised Learning
Example Question                                          Training Data
• Will this customer default on a loan?                   Previous loans that were paid or defaulted
• How many customers will apply for a loan next month?    Previous months of loan applications
• How much is this home worth?                            Previous home sales
• Is this cancer malignant?                               Previous stats of benign / malignant cancers
Supervised Learning
• Training data has one feature that is the “outcome”
  • Sometimes referred to as the “label” or “objective”
• Goal is to build a model which can predict the outcome
  • If categorical: model is a “classification”
  • If numeric: model is a “regression”
• Because the data has a known value, the model can be evaluated
  • Split the data into a training set and a test set
  • Model the training set / predict the test set
  • Compare the predictions to the known values
• Algorithms
  • Model / Ensemble
  • Logistic Regression
  • Time Series
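The evaluation steps listed above (split, model, predict, compare) can be sketched in a few lines. Everything here is illustrative: the loan data is made up, the field name `debt_to_income_percent` is hypothetical, and the 1-nearest-neighbour classifier is a stand-in chosen for brevity rather than one of the algorithms named in the slide.

```python
import random

# Hypothetical labelled data: (debt_to_income_percent, defaulted).
# The rule that generates the labels is invented for this illustration.
rows = [(pct, pct > 50) for pct in range(5, 100, 5)]  # 19 rows

# 1. Split the data into a training set and a test set.
rng = random.Random(42)  # fixed seed so the split is repeatable
shuffled = rows[:]
rng.shuffle(shuffled)
cut = int(len(shuffled) * 0.8)
train, test = shuffled[:cut], shuffled[cut:]


# 2. "Model" the training set: a 1-nearest-neighbour classifier.
def predict(pct):
    nearest = min(train, key=lambda row: abs(row[0] - pct))
    return nearest[1]


# 3. Predict the test set and compare with the known outcomes.
predictions = [predict(pct) for pct, _ in test]
correct = sum(pred == actual for pred, (_, actual) in zip(predictions, test))
accuracy = correct / len(test)
print(f"accuracy on {len(test)} held-out rows: {accuracy:.2f}")
```

The test rows are never shown to the model while it is built, so the accuracy measures how well it handles data it has not seen.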
Unsupervised Learning
1. Training data provides “examples” - no specific “outcome”
2. The machine tries to find “interesting” patterns in the data
date  customer  account  auth  class    zip    amount
Mon   Bob       3421     pin   clothes  46140  135
Tue   Bob       3421     sign  food     46140  401
Tue   Alice     2456     pin   food     12222  234
Wed   Sally     6788     pin   gas      26339  94
Wed   Bob       3421     pin   tech     21350  2459
Wed   Bob       3421     pin   gas      46140  83
Thu   Sally     6788     sign  food     26339  51
Unsupervised Learning
Clustering: find rows of the transaction table that are similar
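To make "similar" concrete, here is a minimal sketch that groups the seven transactions by purchase amount using a one-dimensional k-means. Clustering on the amount column alone, the choice of three clusters, and the deterministic starting centres are all simplifying assumptions; a real clustering would normally use several fields.

```python
# Purchase amounts from the transaction table, in row order.
amounts = [135, 401, 234, 94, 2459, 83, 51]


def kmeans_1d(values, k, max_iter=100):
    """Tiny k-means for numbers on a line (expects k >= 2)."""
    ordered = sorted(values)
    # Deterministic start: spread the initial centres across the sorted values.
    centres = [float(ordered[i * (len(ordered) - 1) // (k - 1)]) for i in range(k)]
    clusters = []
    for _ in range(max_iter):
        # Assignment step: each value joins its nearest centre.
        clusters = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda i: abs(v - centres[i]))
            clusters[nearest].append(v)
        # Update step: move each centre to the mean of its members.
        new_centres = [
            sum(c) / len(c) if c else centres[i] for i, c in enumerate(clusters)
        ]
        if new_centres == centres:  # nothing moved, so we are done
            break
        centres = new_centres
    return clusters


clusters = kmeans_1d(amounts, k=3)
for members in clusters:
    print(sorted(members))
```

No outcome column is involved: the groups come only from how close the amounts are to each other.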
Unsupervised Learning
Anomaly Detection: find rows of the transaction table that are unusual
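One simple way to flag an "unusual" row is to measure how far each amount lies from the median, scaled by the median absolute deviation (a modified z-score). This is a stand-in chosen for brevity, not the anomaly detector used in the slides, and it looks only at the amount column; the 3.5 cut-off is a commonly used rule of thumb and is an assumption here.

```python
from statistics import median

# Rows from the transaction table:
# (date, customer, account, auth, class, zip, amount)
ROWS = [
    ("Mon", "Bob", 3421, "pin", "clothes", 46140, 135),
    ("Tue", "Bob", 3421, "sign", "food", 46140, 401),
    ("Tue", "Alice", 2456, "pin", "food", 12222, 234),
    ("Wed", "Sally", 6788, "pin", "gas", 26339, 94),
    ("Wed", "Bob", 3421, "pin", "tech", 21350, 2459),
    ("Wed", "Bob", 3421, "pin", "gas", 46140, 83),
    ("Thu", "Sally", 6788, "sign", "food", 26339, 51),
]

amounts = [row[-1] for row in ROWS]
centre = median(amounts)
# Median absolute deviation: a spread measure that outliers barely affect.
mad = median([abs(a - centre) for a in amounts])


def modified_z(amount):
    return 0.6745 * abs(amount - centre) / mad


THRESHOLD = 3.5  # assumed cut-off
flagged = [row for row in ROWS if modified_z(row[-1]) > THRESHOLD]
for row in flagged:
    print("unusual:", row)
```

On this table the only row above the threshold is Bob's 2459 tech purchase, which also happens to be the only one of his transactions with a different zip code.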
Unsupervised Learning
Association Discovery: find values in the transaction table that occur together, for example:
• {customer = Bob, account = 3421} with zip = 46140
• {class = gas} with amount < 100
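The sketch below checks how often the highlighted values really occur together by computing support and confidence for two candidate rules over the transaction table. Pairing {customer = Bob, account = 3421} with zip = 46140 and {class = gas} with amount < 100 is one reading of the slide and should be treated as an assumption; the helper name `rule_stats` is hypothetical.

```python
FIELDS = ("date", "customer", "account", "auth", "class", "zip", "amount")
ROWS = [
    ("Mon", "Bob", 3421, "pin", "clothes", 46140, 135),
    ("Tue", "Bob", 3421, "sign", "food", 46140, 401),
    ("Tue", "Alice", 2456, "pin", "food", 12222, 234),
    ("Wed", "Sally", 6788, "pin", "gas", 26339, 94),
    ("Wed", "Bob", 3421, "pin", "tech", 21350, 2459),
    ("Wed", "Bob", 3421, "pin", "gas", 46140, 83),
    ("Thu", "Sally", 6788, "sign", "food", 26339, 51),
]
transactions = [dict(zip(FIELDS, row)) for row in ROWS]


def rule_stats(antecedent, consequent):
    """Return (support, confidence) for the rule antecedent -> consequent."""
    with_antecedent = [t for t in transactions if antecedent(t)]
    with_both = [t for t in with_antecedent if consequent(t)]
    support = len(with_both) / len(transactions)
    confidence = len(with_both) / len(with_antecedent)
    return support, confidence


# Assumed rule 1: {customer = Bob, account = 3421} -> zip = 46140
bob_support, bob_confidence = rule_stats(
    lambda t: t["customer"] == "Bob" and t["account"] == 3421,
    lambda t: t["zip"] == 46140,
)

# Assumed rule 2: {class = gas} -> amount < 100
gas_support, gas_confidence = rule_stats(
    lambda t: t["class"] == "gas",
    lambda t: t["amount"] < 100,
)

print(f"Bob -> zip 46140: support {bob_support:.2f}, confidence {bob_confidence:.2f}")
print(f"gas -> amount < 100: support {gas_support:.2f}, confidence {gas_confidence:.2f}")
```

Support is the share of all rows where both sides hold; confidence is the share of rows matching the left side that also match the right side.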
Unsupervised Learning
Example Question                           Training Data
• Is this transaction unusual?             Previous transactions
• Are the products purchased together?     Examples of previous purchases
• Are these customers similar?             Customer profiles
Unsupervised Learning
• Training data has only examples and no specific “outcome”
  • This is common - labels are typically expensive
• Goal is to perform discovery, find patterns, etc.
  • Tends to be more difficult
• Algorithms
  • Clusters
  • Anomaly Detection
  • Association Discovery
  • Topic Models
• Because the data has no “outcome”, the results cannot be evaluated against known values
  • Each discovery method has its own quality measures
