SlideShare a Scribd company logo
© 2020 Minitab, LLC.
Machine Learning with Multiple Regression
© 2020 Minitab, LLC.
• 25+ years of
experience
• Minitab Trainer
• Statistical Consultant
• Minitab Software
Designer
• Master's in Statistics
Meet the Presenter:
Cheryl Pammer
Senior Advisory Statistician
© 2020 Minitab, LLC.
Learning Objectives
 The Language of Machine
Learning
 Multiple Regression in a
Machine Learning
Environment
 Validation and Cross
Validation
4
© 2020 Minitab, LLC.
So What is Machine Learning?
Machine learning helps to
analyze the patterns in the
collected data.
5
© 2020 Minitab, LLC.
WAIT… WHAT?
6
Introduction to Data Science
UC Irvine
Data Mining and Applications
Stanford
© 2020 Minitab, LLC.
Machine Learning
7
MACHINE
LEARNING
UNSUPERVISED LEARNING
SUPERVISED LEARNING
Group and interpret
data based on
input data only
(X’s).
Develop predictive
models using input
and output data
(X’s and Y’s).
CLUSTERING
CLASSIFICATION
REGRESSION
© 2020 Minitab, LLC.
Basic Supervised Machine Learning Algorithms
►Continuous Y: Regression, CART Regression Trees
►Categorical Y: Logistic Regression, CART Classification Trees
8
© 2020 Minitab, LLC.
What Has a Machine Learned?
9
Training Data Test Data
?Data is split into a training set and a test set:
►Training (or learn) = creates model
►Test = assesses model performance
© 2020 Minitab, LLC.
Why is Model Testing (Validation) Important?
►Assessing model with the same data used to fit model leads to
overfitting.
►Overfit models do not predict well.
10
© 2020 Minitab, LLC.
Bias-Variance Trade-Off
11
Model with High Bias Model with High Variance
© 2020 Minitab, LLC.
Validation
12
Validation helps find the best balance between too simple (high bias)
and too complex (high variance)
© 2020 Minitab, LLC.
Methods of Validation
►Leave-one-out validation
►Validation with a test set
►K-fold cross validation
13
© 2020 Minitab, LLC.
Example: Smart Water System
►Manufacturer of smart water
meters investigates what causes
bias in usage readings.
►Data from various test
sequences provides insight into
contributors of biased readings.
14
© 2020 Minitab, LLC.
Multiple Regression
 Examines relationship between continuous response
(Y) and multiple predictors (X).
 Relationship can be expressed as an equation:
Y = β0 + β1X1 + β2X2 + … βnXn + ε
15
© 2020 Minitab, LLC.
Leave-One-Out Validation
1. Fit model using every row except one.
2. Compute error for held-out observation.
3. Repeat for every row of data.
16
© 2020 Minitab, LLC.
Leave-One-Out Validation
4. Calculate sum of squares of errors from held-out rows. (Predicted
Residual Sum of Squares)
5. Select model using summary statistics based on PRESS.
17
𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃 = � 𝑦𝑦𝑖𝑖 − �𝑦𝑦−𝑖𝑖
2
𝑅𝑅2
(𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝) = 1 −
𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃
𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆
© 2020 Minitab, LLC.
Validation With a Test Set
1. Hold out random
X% of data when
fitting model.
18
© 2020 Minitab, LLC.
Validation With a Test Set
2. Compute error for held-out data.
19
𝑇𝑇𝑇𝑇𝑇𝑇𝑇𝑇 𝑆𝑆 =
∑ 𝑦𝑦𝑖𝑖 − �𝑦𝑦𝑖𝑖
2
𝑛𝑛𝑇𝑇𝑇𝑇𝑇𝑇𝑇𝑇
Held-out observation
Prediction from model
from training data
© 2020 Minitab, LLC.
Validation With a Test Set
3. Select model using summary statistics based on test error.
20
𝑇𝑇𝑇𝑇𝑇𝑇𝑇𝑇 𝑅𝑅2 = 1 −
𝑆𝑆𝑆𝑆𝑆𝑆(𝑇𝑇𝑇𝑇𝑇𝑇𝑇𝑇)
𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆(𝑇𝑇𝑇𝑇𝑇𝑇𝑇𝑇)
𝑆𝑆𝑆𝑆𝑆𝑆 𝑇𝑇𝑇𝑇𝑇𝑇𝑇𝑇 = � 𝑦𝑦𝑖𝑖 − �𝑦𝑦𝑖𝑖
2
© 2020 Minitab, LLC.
Important Effects
21
© 2020 Minitab, LLC.
Model Comparison
22
“Optimal” Model Simple Model
© 2020 Minitab, LLC.
Regression Assumptions
►No distinct patterns
►Assumptions are met
 Normality
 Equal Variance
 Independence
23
© 2020 Minitab, LLC.
Model Visualizations
24
© 2020 Minitab, LLC.
Key Result
Use regression equation to predict bias in readings and calibrate
sensors.
25
© 2020 Minitab, LLC.
Example: Amusement Park Wait Times
►An amusement park
needs a model to predict
ride wait times for each
attraction.
►Predictors: park
attendance, ticket sales,
ride intensity rating, time
of day, whether it is a
weekend, percentage of
sun and temperature.
26
© 2020 Minitab, LLC.
K-Fold Cross Validation
1. Split data into k random folds.
27
Fold 1
Fold 2
Fold 3
Fold 4
Fold 5
© 2020 Minitab, LLC.
K-Fold Cross Validation
2. Fit model using every fold except one.
3. Compute test error using held-out fold.
4. Repeat k times, holding out each fold once.
28
K-fold 𝑆𝑆 =
∑ ∑ 𝑦𝑦𝑖𝑖− �𝑦𝑦𝑖𝑖
2
𝑛𝑛
Held-out observation Prediction from model
from remaining folds
© 2020 Minitab, LLC.
K-Fold Cross Validation
5. Select model using summary statistics based on k-fold test error.
29
© 2020 Minitab, LLC.
When Are the Highest Wait Times?
30
© 2020 Minitab, LLC.
Key Results
31
© 2020 Minitab, LLC.
Take-aways
►The language of machine learning
►Multiple regression in a machine learning environment
►Proven validation techniques for model selection
Cheryl Pammer
cpammer@minitab.com
32
© 2020 Minitab, LLC.
Upcoming Webinars and Virtual Events
• Machine Learning with Binary Logistic Regression
Time: Tuesday 30 June, 12PM AEST (10AM HKT / 2PM NZST)
• Machine Learning with Classification & Regression Trees
(CART® )
Time: Wednesday 15 July, 12PM AEST (10AM HKT / 2PM NZST)
See all the details and sign up at:
https://info.minitab.com/resources/webinars/webinar-wednesdays
© 2020 Minitab, LLC.
Upcoming Webinars and Virtual Events
• Online/Virtual Training
Minitab is now offering virtual training taught by
Minitab experts – perfect for remote/home workers.
Visit www.minitab.com/training/training for more info.
• Talk to Minitab
Complimentary resources to help you deal quickly with today's challenges and changing environment.
Visit www.minitab.com and click on the Talk to Minitab button and a Minitab representative will be in touch!
© 2020 Minitab, LLC.
Our Approach: More Than Business Analytics… Solutions Analytics
Software
Services
Training
Learn first-hand by attending public or
customized trainings in your facilities
according to your requirements.
Statistical
Consulting
Personalized help with statistical
challenges from collecting the right data
to interpreting analysis more.
Support
Assistance with installation,
implementation, version updates
and license management.
Master statistics and
Minitab anywhere
with online training
Machine learning and
predictive analytics
software
Start, track, manage
and execute
improvement projects
with real-time
dashboards
Powerful statistical
software everyone
can use.
Data Analysis Predictive Modeling Visual Business Tools Project Oversight
Visual tools to
process and product
excellence.
Online Training
Solutions analytics is our integrated approach to providing software and services that enable organizations to make better decisions that drive business excellence.

More Related Content

PDF
Machine Learning with Binary Logistic Regression - APAC
PDF
Machine Learning with Classification & Regression Trees - APAC
PDF
Maximize Efficiency with Minitab Workspace and Minitab Statistical Software -...
PDF
Performing at your best turning words into numbers and numbers into data driv...
PDF
Tips & Tricks for CART (Classification and Regression Trees) in Minitab Stati...
PPTX
Visualizations that make an impact - see what s new in minitab statistical s...
PDF
Minitab Preview Training: Introduction to t-Tests for Manufacturing
PDF
Unleashing the Power of Python Using the New Minitab/Python Integration Modul...
Machine Learning with Binary Logistic Regression - APAC
Machine Learning with Classification & Regression Trees - APAC
Maximize Efficiency with Minitab Workspace and Minitab Statistical Software -...
Performing at your best turning words into numbers and numbers into data driv...
Tips & Tricks for CART (Classification and Regression Trees) in Minitab Stati...
Visualizations that make an impact - see what s new in minitab statistical s...
Minitab Preview Training: Introduction to t-Tests for Manufacturing
Unleashing the Power of Python Using the New Minitab/Python Integration Modul...

What's hot (16)

PDF
Modern Toolkit for Process Excellence
PDF
Powerful Statistical Tools in the Pharmaceutical and Medical Devices Industry...
PDF
Critical Checks for Pharmaceuticals and Healthcare: Validating Your Data Inte...
PDF
Boost Your Data Expertise - What's New in Minitab 19.2020.1
PDF
Using minitab for Superior Quality in Food Manufacturing
PDF
Statistical solutions to help you with 5 FDA medical devices stages
PDF
Using Minitab for Superior Quality in Medical Device Manufacturing
PDF
Boost Your Data Expertise with the Latest Release of Minitab Statistical Soft...
PDF
Companion by Minitab - Seeing the unknown identifying risk and quantifying pr...
PDF
Minitab - Fast-track Your Data Analysis with Basic Macros
PDF
Minitab Need to Knows Series 1 - How to Manage Data in Minitab
PDF
The New Toolkit for Continuous Improvement webinar presentation slides
PDF
Minitab Tutorial for Beginners | What is Minitab? | Minitab Training for Stat...
PDF
Minitab webinar from unicorns to racehorses - presentation slides
PDF
Presentation: Ad-Click Prediction, A Data-Intensive Problem
PDF
Predictive Analytics in Manufacturing
Modern Toolkit for Process Excellence
Powerful Statistical Tools in the Pharmaceutical and Medical Devices Industry...
Critical Checks for Pharmaceuticals and Healthcare: Validating Your Data Inte...
Boost Your Data Expertise - What's New in Minitab 19.2020.1
Using minitab for Superior Quality in Food Manufacturing
Statistical solutions to help you with 5 FDA medical devices stages
Using Minitab for Superior Quality in Medical Device Manufacturing
Boost Your Data Expertise with the Latest Release of Minitab Statistical Soft...
Companion by Minitab - Seeing the unknown identifying risk and quantifying pr...
Minitab - Fast-track Your Data Analysis with Basic Macros
Minitab Need to Knows Series 1 - How to Manage Data in Minitab
The New Toolkit for Continuous Improvement webinar presentation slides
Minitab Tutorial for Beginners | What is Minitab? | Minitab Training for Stat...
Minitab webinar from unicorns to racehorses - presentation slides
Presentation: Ad-Click Prediction, A Data-Intensive Problem
Predictive Analytics in Manufacturing
Ad

Similar to Machine Learning with Multiple Regression - APAC (20)

PDF
Gaining a Competitive Advantage using Analytics to Optimize your Digital Mark...
PDF
Datascience101presentation4
PDF
Machinelearning: The next step in manufacturing performance
PPT
5995243مقايسس النزعة المركزية و التشت.ppt
PPTX
Developing Web-scale Machine Learning at LinkedIn - From Soup to Nuts
PDF
Minitab webinar presentation See the unknown with monte carlo simulation
PDF
Constraint Programming - An Alternative Approach to Heuristics in Scheduling
PDF
Ds for finance day 3
PDF
Production model lifecycle management 2016 09
PDF
Machine Learning driven Quantum Optimization for Marketing
PPTX
Forecasting at Scale with Marcello Tomasini
PPTX
Deepak-Computational Advertising-The LinkedIn Way
PDF
Modern tool kit for process excellence, gracias a Minitab Inc.
PPT
Credit risk with neural networks bankruptcy prediction machine learning
PDF
Python tutorial for ML
PDF
Machine Learning as a Daily Work for a Programmer- Volodymyr Vorobiov
PDF
Session 6.pdf
PDF
Session 6.pdf
PDF
Synthetic VIX Data Generation Using ML Techniques
PDF
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...
Gaining a Competitive Advantage using Analytics to Optimize your Digital Mark...
Datascience101presentation4
Machinelearning: The next step in manufacturing performance
5995243مقايسس النزعة المركزية و التشت.ppt
Developing Web-scale Machine Learning at LinkedIn - From Soup to Nuts
Minitab webinar presentation See the unknown with monte carlo simulation
Constraint Programming - An Alternative Approach to Heuristics in Scheduling
Ds for finance day 3
Production model lifecycle management 2016 09
Machine Learning driven Quantum Optimization for Marketing
Forecasting at Scale with Marcello Tomasini
Deepak-Computational Advertising-The LinkedIn Way
Modern tool kit for process excellence, gracias a Minitab Inc.
Credit risk with neural networks bankruptcy prediction machine learning
Python tutorial for ML
Machine Learning as a Daily Work for a Programmer- Volodymyr Vorobiov
Session 6.pdf
Session 6.pdf
Synthetic VIX Data Generation Using ML Techniques
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...
Ad

More from Minitab, LLC (17)

PDF
L'art de la visualisation pour une meilleure compréhension des données
PDF
Pilotez le développement de vos produits et de vos procédés avec Minitab et M...
PDF
Introducing Graph Builder: Visualizations Built to Move You Forward
PDF
Les solutions Minitab pour développer vos produits selon les réglementations ...
PDF
Concrétisez votre transformation digitale avec Minitab et Minitab Connect
PDF
En route vers l'excellence avec les solutions Minitab
PDF
Meet Minitab Engage Your End-to-End Improvement Solution From Idea Generation...
PDF
La puissance du machine learning et des algorithmes cart au service des métiers
PPTX
Strukturierte problemloesung mit datenunterstuetzung
PDF
Visualizaciones que crean impacto: Vea las novedades de Minitab Statistical ...
PDF
Einführung in den Minitab Workspace_Visuelle Toolkit zur Verbesserung Ihrer A...
PDF
Discover Minitab Workspace - The Ultimate Visual Toolkit to Elevate Your Work...
PDF
Melhore seu conhecimento sobre analise de dados com a versao mais recente do ...
PDF
Pilotez, structurez et cartographiez vos processus avec minitab workspace
PDF
Praesentation - Identifizieren und eliminieren sie ihre analytischen schwachp...
PDF
Nouveautes Minitab-Integration Python, Arbres de decision, Validation de mode...
PDF
Real Life Lessons From an Operational Excellence Expert
L'art de la visualisation pour une meilleure compréhension des données
Pilotez le développement de vos produits et de vos procédés avec Minitab et M...
Introducing Graph Builder: Visualizations Built to Move You Forward
Les solutions Minitab pour développer vos produits selon les réglementations ...
Concrétisez votre transformation digitale avec Minitab et Minitab Connect
En route vers l'excellence avec les solutions Minitab
Meet Minitab Engage Your End-to-End Improvement Solution From Idea Generation...
La puissance du machine learning et des algorithmes cart au service des métiers
Strukturierte problemloesung mit datenunterstuetzung
Visualizaciones que crean impacto: Vea las novedades de Minitab Statistical ...
Einführung in den Minitab Workspace_Visuelle Toolkit zur Verbesserung Ihrer A...
Discover Minitab Workspace - The Ultimate Visual Toolkit to Elevate Your Work...
Melhore seu conhecimento sobre analise de dados com a versao mais recente do ...
Pilotez, structurez et cartographiez vos processus avec minitab workspace
Praesentation - Identifizieren und eliminieren sie ihre analytischen schwachp...
Nouveautes Minitab-Integration Python, Arbres de decision, Validation de mode...
Real Life Lessons From an Operational Excellence Expert

Recently uploaded (20)

PPTX
history of c programming in notes for students .pptx
PDF
wealthsignaloriginal-com-DS-text-... (1).pdf
PDF
17 Powerful Integrations Your Next-Gen MLM Software Needs
PDF
Tally Prime Crack Download New Version 5.1 [2025] (License Key Free
PDF
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
DOCX
Greta — No-Code AI for Building Full-Stack Web & Mobile Apps
PDF
CCleaner Pro 6.38.11537 Crack Final Latest Version 2025
PDF
Complete Guide to Website Development in Malaysia for SMEs
PDF
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
PDF
Odoo Companies in India – Driving Business Transformation.pdf
PDF
Nekopoi APK 2025 free lastest update
PDF
Website Design Services for Small Businesses.pdf
PPTX
CHAPTER 2 - PM Management and IT Context
PPTX
Operating system designcfffgfgggggggvggggggggg
PPTX
Oracle Fusion HCM Cloud Demo for Beginners
PPTX
WiFi Honeypot Detecscfddssdffsedfseztor.pptx
PPTX
Embracing Complexity in Serverless! GOTO Serverless Bengaluru
PDF
Salesforce Agentforce AI Implementation.pdf
PPTX
Log360_SIEM_Solutions Overview PPT_Feb 2020.pptx
PDF
Design an Analysis of Algorithms II-SECS-1021-03
history of c programming in notes for students .pptx
wealthsignaloriginal-com-DS-text-... (1).pdf
17 Powerful Integrations Your Next-Gen MLM Software Needs
Tally Prime Crack Download New Version 5.1 [2025] (License Key Free
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
Greta — No-Code AI for Building Full-Stack Web & Mobile Apps
CCleaner Pro 6.38.11537 Crack Final Latest Version 2025
Complete Guide to Website Development in Malaysia for SMEs
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
Odoo Companies in India – Driving Business Transformation.pdf
Nekopoi APK 2025 free lastest update
Website Design Services for Small Businesses.pdf
CHAPTER 2 - PM Management and IT Context
Operating system designcfffgfgggggggvggggggggg
Oracle Fusion HCM Cloud Demo for Beginners
WiFi Honeypot Detecscfddssdffsedfseztor.pptx
Embracing Complexity in Serverless! GOTO Serverless Bengaluru
Salesforce Agentforce AI Implementation.pdf
Log360_SIEM_Solutions Overview PPT_Feb 2020.pptx
Design an Analysis of Algorithms II-SECS-1021-03

Machine Learning with Multiple Regression - APAC

  • 1. © 2020 Minitab, LLC. Machine Learning with Multiple Regression
  • 2. © 2020 Minitab, LLC. • 25+ years of experience • Minitab Trainer • Statistical Consultant • Minitab Software Designer • Master's in Statistics Meet the Presenter: Cheryl Pammer Senior Advisory Statistician
  • 3. © 2020 Minitab, LLC. Learning Objectives  The Language of Machine Learning  Multiple Regression in a Machine Learning Environment  Validation and Cross Validation 4
  • 4. © 2020 Minitab, LLC. So What is Machine Learning? Machine learning helps to analyze the patterns in the collected data. 5
  • 5. © 2020 Minitab, LLC. WAIT… WHAT? 6 Introduction to Data Science UC Irvine Data Mining and Applications Stanford
  • 6. © 2020 Minitab, LLC. Machine Learning 7 MACHINE LEARNING UNSUPERVISED LEARNING SUPERVISED LEARNING Group and interpret data based on input data only (X’s). Develop predictive models using input and output data (X’s and Y’s). CLUSTERING CLASSIFICATION REGRESSION
  • 7. © 2020 Minitab, LLC. Basic Supervised Machine Learning Algorithms ►Continuous Y: Regression, CART Regression Trees ►Categorical Y: Logistic Regression, CART Classification Trees 8
  • 8. © 2020 Minitab, LLC. What Has a Machine Learned? 9 Training Data Test Data ?Data is split into a training set and a test set: ►Training (or learn) = creates model ►Test = assesses model performance
  • 9. © 2020 Minitab, LLC. Why is Model Testing (Validation) Important? ►Assessing model with the same data used to fit model leads to overfitting. ►Overfit models do not predict well. 10
  • 10. © 2020 Minitab, LLC. Bias-Variance Trade-Off 11 Model with High Bias Model with High Variance
  • 11. © 2020 Minitab, LLC. Validation 12 Validation helps find the best balance between too simple (high bias) and too complex (high variance)
  • 12. © 2020 Minitab, LLC. Methods of Validation ►Leave-one-out validation ►Validation with a test set ►K-fold cross validation 13
  • 13. © 2020 Minitab, LLC. Example: Smart Water System ►Manufacturer of smart water meters investigates what causes bias in usage readings. ►Data from various test sequences provides insight into contributors of biased readings. 14
  • 14. © 2020 Minitab, LLC. Multiple Regression  Examines relationship between continuous response (Y) and multiple predictors (X).  Relationship can be expressed as an equation: Y = β0 + β1X1 + β2X2 + … βnXn + ε 15
  • 15. © 2020 Minitab, LLC. Leave-One-Out Validation 1. Fit model using every row except one. 2. Compute error for held-out observation. 3. Repeat for every row of data. 16
  • 16. © 2020 Minitab, LLC. Leave-One-Out Validation 4. Calculate sum of squares of errors from held-out rows. (Predicted Residual Sum of Squares) 5. Select model using summary statistics based on PRESS. 17 𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃 = � 𝑦𝑦𝑖𝑖 − �𝑦𝑦−𝑖𝑖 2 𝑅𝑅2 (𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝) = 1 − 𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃 𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆
  • 17. © 2020 Minitab, LLC. Validation With a Test Set 1. Hold out random X% of data when fitting model. 18
  • 18. © 2020 Minitab, LLC. Validation With a Test Set 2. Compute error for held-out data. 19 𝑇𝑇𝑇𝑇𝑇𝑇𝑇𝑇 𝑆𝑆 = ∑ 𝑦𝑦𝑖𝑖 − �𝑦𝑦𝑖𝑖 2 𝑛𝑛𝑇𝑇𝑇𝑇𝑇𝑇𝑇𝑇 Held-out observation Prediction from model from training data
  • 19. © 2020 Minitab, LLC. Validation With a Test Set 3. Select model using summary statistics based on test error. 20 𝑇𝑇𝑇𝑇𝑇𝑇𝑇𝑇 𝑅𝑅2 = 1 − 𝑆𝑆𝑆𝑆𝑆𝑆(𝑇𝑇𝑇𝑇𝑇𝑇𝑇𝑇) 𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆(𝑇𝑇𝑇𝑇𝑇𝑇𝑇𝑇) 𝑆𝑆𝑆𝑆𝑆𝑆 𝑇𝑇𝑇𝑇𝑇𝑇𝑇𝑇 = � 𝑦𝑦𝑖𝑖 − �𝑦𝑦𝑖𝑖 2
  • 20. © 2020 Minitab, LLC. Important Effects 21
  • 21. © 2020 Minitab, LLC. Model Comparison 22 “Optimal” Model Simple Model
  • 22. © 2020 Minitab, LLC. Regression Assumptions ►No distinct patterns ►Assumptions are met  Normality  Equal Variance  Independence 23
  • 23. © 2020 Minitab, LLC. Model Visualizations 24
  • 24. © 2020 Minitab, LLC. Key Result Use regression equation to predict bias in readings and calibrate sensors. 25
  • 25. © 2020 Minitab, LLC. Example: Amusement Park Wait Times ►An amusement park needs a model to predict ride wait times for each attraction. ►Predictors: park attendance, ticket sales, ride intensity rating, time of day, whether it is a weekend, percentage of sun and temperature. 26
  • 26. © 2020 Minitab, LLC. K-Fold Cross Validation 1. Split data into k random folds. 27 Fold 1 Fold 2 Fold 3 Fold 4 Fold 5
  • 27. © 2020 Minitab, LLC. K-Fold Cross Validation 2. Fit model using every fold except one. 3. Compute test error using held-out fold. 4. Repeat k times, holding out each fold once. 28 K-fold 𝑆𝑆 = ∑ ∑ 𝑦𝑦𝑖𝑖− �𝑦𝑦𝑖𝑖 2 𝑛𝑛 Held-out observation Prediction from model from remaining folds
  • 28. © 2020 Minitab, LLC. K-Fold Cross Validation 5. Select model using summary statistics based on k-fold test error. 29
  • 29. © 2020 Minitab, LLC. When Are the Highest Wait Times? 30
  • 30. © 2020 Minitab, LLC. Key Results 31
  • 31. © 2020 Minitab, LLC. Take-aways ►The language of machine learning ►Multiple regression in a machine learning environment ►Proven validation techniques for model selection Cheryl Pammer cpammer@minitab.com 32
  • 32. © 2020 Minitab, LLC. Upcoming Webinars and Virtual Events • Machine Learning with Binary Logistic Regression Time: Tuesday 30 June, 12PM AEST (10AM HKT / 2PM NZST) • Machine Learning with Classification & Regression Trees (CART® ) Time: Wednesday 15 July, 12PM AEST (10AM HKT / 2PM NZST) See all the details and sign up at: https://info.minitab.com/resources/webinars/webinar-wednesdays
  • 33. © 2020 Minitab, LLC. Upcoming Webinars and Virtual Events • Online/Virtual Training Minitab is now offering virtual training taught by Minitab experts – perfect for remote/home workers. Visit www.minitab.com/training/training for more info. • Talk to Minitab Complimentary resources to help you deal quickly with today's challenges and changing environment. Visit www.minitab.com and click on the Talk to Minitab button and a Minitab representative will be in touch!
  • 34. © 2020 Minitab, LLC. Our Approach: More Than Business Analytics… Solutions Analytics Software Services Training Learn first-hand by attending public or customized trainings in your facilities according to your requirements. Statistical Consulting Personalized help with statistical challenges from collecting the right data to interpreting analysis more. Support Assistance with installation, implementation, version updates and license management. Master statistics and Minitab anywhere with online training Machine learning and predictive analytics software Start, track, manage and execute improvement projects with real-time dashboards Powerful statistical software everyone can use. Data Analysis Predictive Modeling Visual Business Tools Project Oversight Visual tools to process and product excellence. Online Training Solutions analytics is our integrated approach to providing software and services that enable organizations to make better decisions that drive business excellence.