SlideShare a Scribd company logo
Ensemble Modeling
Step Up Analytics
July 16, 2017
Ganesh S Step Up Analytics 1 / 12
Road Map
Introduction
Ensemble models and possible drawback/s of single specific
model
How ensemble models works and example
Frequently used ensemble methods and mathematics
Bagging and Bagging Algorithm
Bagging ensembles using R
Comparison of result
Continue with. . .
Ganesh S Step Up Analytics 2 / 12
Introduction
Many of you might studied and practiced different
classification as well regression algorithms.
Also, many a time modeler uses a model at a time.
Ever wondered what would happen if we could combine more
than one classification model?
Whether resulting combo might more accurate or less variant?
Will answer these questions shortly
Ganesh S Step Up Analytics 3 / 12
Ensemble models and possible drawback/s of single
specific model
Ensembles are the answers to these questions
It is the process of running two or more related but different
machine learning models and then synthesizing the results into
single predictive or machine learning model
It can have biases
Presence of high variability
Outright inaccuracies
Ganesh S Step Up Analytics 4 / 12
How ensemble models works and example of ensemble
Producing a distribution called a simple ML model on the
subset of original data
Combining the distribution in one aggregated model
Random Forest
It is the group of multiple decision trees which built on
different sample data,evaluates different factors and/or weight
common variables differently.
Ganesh S Step Up Analytics 5 / 12
How ensembles works
Figure: Working of Ensembles
Ganesh S Step Up Analytics 6 / 12
Frequently used ensemble methods and mathematics
Bagging
Boosting
Distance between predated (y) and actual (y) should be less.
(y āˆ’ y) = Bias + Variance + Noise
Bias - The average distance between predictions.
Variance - Variability in the predictions.
Noise - Lower bound on the prediction error that the predictor
can achieve.
If we want to minimize(y āˆ’ y) we have to minimize above
three.
Ganesh S Step Up Analytics 7 / 12
Bagging and Bagging Algorithm
Bagging stands for Bootstrapped Aggregation
Bagging is the way to decrease variance of your prediction by
generating additional training data from the original data with
different combination and replications
Bagging Algorithm
1. Samples(with replacement) are repeatedly taken from the data
set, so that each record has an equal record has an equal
probability of being selected, and each sample is of the same
size as the original training data set. These are bootstrapped
samples.
2. Train the model and record the predictions for each sample.
3. Bagging ensembles will be defined as the class with most votes
or the average of prediction made.
Ganesh S Step Up Analytics 8 / 12
Bagging Ensembles using R
Small case study using R, How ensemble bagging works!
Data Source is UCI data repository - Car Evaluation Data Set
Regression models is used
Bagging
Bagging in R
Ganesh S Step Up Analytics 9 / 12
Results of Bagging
Figure: Working of Ensembles
Ganesh S Step Up Analytics 10 / 12
Continue with...
Boosting and Boosting in R
Bagging and Boosting case study in python
Bagging-Boosting comparison
Famous GBM(Gradient Boosting Method)
GBM in R as well in Python with case study
Ganesh S Step Up Analytics 11 / 12
Thank You !!!
Ganesh S Step Up Analytics 12 / 12

More Related Content

PPTX
Ensemble methods
PPTX
Ensemble learning Techniques
PPTX
boosting algorithm
PPTX
Ensemble methods
PPTX
Machine Learning - Ensemble Methods
PPT
ensemble learning
Ā 
PPTX
(Machine Learning) Ensemble learning
PDF
Understanding Bagging and Boosting
Ensemble methods
Ensemble learning Techniques
boosting algorithm
Ensemble methods
Machine Learning - Ensemble Methods
ensemble learning
Ā 
(Machine Learning) Ensemble learning
Understanding Bagging and Boosting

What's hot (20)

PPTX
Gradient Boosting
PPTX
Boosting Approach to Solving Machine Learning Problems
PPTX
Ensemble learning
PPT
Support Vector machine
PDF
Understanding random forests
PPT
Support Vector Machines
PPTX
Gradient Boosted trees
PDF
Support Vector Machines ( SVM )
PPTX
Linear Regression Analysis | Linear Regression in Python | Machine Learning A...
PPTX
Ensemble learning
PPTX
Random forest
PDF
Big data Clustering Algorithms And Strategies
PDF
Feature Engineering - Getting most out of data for predictive models - TDC 2017
PPTX
Ensemble Learning and Random Forests
PDF
Xgboost
PPTX
Machine learning with ADA Boost
PDF
Decision theory
PPTX
Ensemble Method (Bagging Boosting)
PPTX
PPTX
Curse of dimensionality
Gradient Boosting
Boosting Approach to Solving Machine Learning Problems
Ensemble learning
Support Vector machine
Understanding random forests
Support Vector Machines
Gradient Boosted trees
Support Vector Machines ( SVM )
Linear Regression Analysis | Linear Regression in Python | Machine Learning A...
Ensemble learning
Random forest
Big data Clustering Algorithms And Strategies
Feature Engineering - Getting most out of data for predictive models - TDC 2017
Ensemble Learning and Random Forests
Xgboost
Machine learning with ADA Boost
Decision theory
Ensemble Method (Bagging Boosting)
Curse of dimensionality
Ad

Similar to Ensemble modeling and Machine Learning (20)

PPTX
Ensemble Models in machine learning.pptx
PPTX
Bag the model with bagging
PDF
BaggingBoosting.pdf
PPTX
Bagging.pptx
PDF
Ensemble Learning Notes for students of CS
PPTX
Unit V -Multiple Learners.pptx for artificial intelligence
PPTX
Unit V -Multiple Learners in artificial intelligence and machine learning
PPT
Ensemble Learning Featuring the Netflix Prize Competition and ...
Ā 
PPTX
introduction to machine learning and ensemble methods
PPT
Ensemble Learning in Machine Learning.ppt
PPT
Download It
Ā 
PDF
Complete picture of Ensemble-Learning, boosting, bagging
PDF
Supervised Learning Ensemble Techniques Machine Learning
PDF
4_2_Ensemble models and gradient boosting2.pdf
PDF
Module 6: Ensemble Algorithms
PPTX
Bagging Vs Boosting In Machine Learning | Ensemble Learning In Machine Learni...
PDF
4 2 ensemble models and grad boost part 1
PPTX
Bagging_and_Boosting.pptx
PDF
4_2_Ensemble models and grad boost part 1.pdf
PPTX
Predictive analytics
Ensemble Models in machine learning.pptx
Bag the model with bagging
BaggingBoosting.pdf
Bagging.pptx
Ensemble Learning Notes for students of CS
Unit V -Multiple Learners.pptx for artificial intelligence
Unit V -Multiple Learners in artificial intelligence and machine learning
Ensemble Learning Featuring the Netflix Prize Competition and ...
Ā 
introduction to machine learning and ensemble methods
Ensemble Learning in Machine Learning.ppt
Download It
Ā 
Complete picture of Ensemble-Learning, boosting, bagging
Supervised Learning Ensemble Techniques Machine Learning
4_2_Ensemble models and gradient boosting2.pdf
Module 6: Ensemble Algorithms
Bagging Vs Boosting In Machine Learning | Ensemble Learning In Machine Learni...
4 2 ensemble models and grad boost part 1
Bagging_and_Boosting.pptx
4_2_Ensemble models and grad boost part 1.pdf
Predictive analytics
Ad

Recently uploaded (20)

PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PDF
Fluorescence-microscope_Botany_detailed content
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPT
Quality review (1)_presentation of this 21
PPTX
Introduction to machine learning and Linear Models
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
Business Acumen Training GuidePresentation.pptx
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Introduction-to-Cloud-ComputingFinal.pptx
Fluorescence-microscope_Botany_detailed content
Data_Analytics_and_PowerBI_Presentation.pptx
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
Introduction to Knowledge Engineering Part 1
oil_refinery_comprehensive_20250804084928 (1).pptx
Clinical guidelines as a resource for EBP(1).pdf
Quality review (1)_presentation of this 21
Introduction to machine learning and Linear Models
.pdf is not working space design for the following data for the following dat...
Business Acumen Training GuidePresentation.pptx
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
Acceptance and paychological effects of mandatory extra coach I classes.pptx
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Miokarditis (Inflamasi pada Otot Jantung)
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...

Ensemble modeling and Machine Learning

  • 1. Ensemble Modeling Step Up Analytics July 16, 2017 Ganesh S Step Up Analytics 1 / 12
  • 2. Road Map Introduction Ensemble models and possible drawback/s of single specific model How ensemble models works and example Frequently used ensemble methods and mathematics Bagging and Bagging Algorithm Bagging ensembles using R Comparison of result Continue with. . . Ganesh S Step Up Analytics 2 / 12
  • 3. Introduction Many of you might studied and practiced different classification as well regression algorithms. Also, many a time modeler uses a model at a time. Ever wondered what would happen if we could combine more than one classification model? Whether resulting combo might more accurate or less variant? Will answer these questions shortly Ganesh S Step Up Analytics 3 / 12
  • 4. Ensemble models and possible drawback/s of single specific model Ensembles are the answers to these questions It is the process of running two or more related but different machine learning models and then synthesizing the results into single predictive or machine learning model It can have biases Presence of high variability Outright inaccuracies Ganesh S Step Up Analytics 4 / 12
  • 5. How ensemble models works and example of ensemble Producing a distribution called a simple ML model on the subset of original data Combining the distribution in one aggregated model Random Forest It is the group of multiple decision trees which built on different sample data,evaluates different factors and/or weight common variables differently. Ganesh S Step Up Analytics 5 / 12
  • 6. How ensembles works Figure: Working of Ensembles Ganesh S Step Up Analytics 6 / 12
  • 7. Frequently used ensemble methods and mathematics Bagging Boosting Distance between predated (y) and actual (y) should be less. (y āˆ’ y) = Bias + Variance + Noise Bias - The average distance between predictions. Variance - Variability in the predictions. Noise - Lower bound on the prediction error that the predictor can achieve. If we want to minimize(y āˆ’ y) we have to minimize above three. Ganesh S Step Up Analytics 7 / 12
  • 8. Bagging and Bagging Algorithm Bagging stands for Bootstrapped Aggregation Bagging is the way to decrease variance of your prediction by generating additional training data from the original data with different combination and replications Bagging Algorithm 1. Samples(with replacement) are repeatedly taken from the data set, so that each record has an equal record has an equal probability of being selected, and each sample is of the same size as the original training data set. These are bootstrapped samples. 2. Train the model and record the predictions for each sample. 3. Bagging ensembles will be defined as the class with most votes or the average of prediction made. Ganesh S Step Up Analytics 8 / 12
  • 9. Bagging Ensembles using R Small case study using R, How ensemble bagging works! Data Source is UCI data repository - Car Evaluation Data Set Regression models is used Bagging Bagging in R Ganesh S Step Up Analytics 9 / 12
  • 10. Results of Bagging Figure: Working of Ensembles Ganesh S Step Up Analytics 10 / 12
  • 11. Continue with... Boosting and Boosting in R Bagging and Boosting case study in python Bagging-Boosting comparison Famous GBM(Gradient Boosting Method) GBM in R as well in Python with case study Ganesh S Step Up Analytics 11 / 12
  • 12. Thank You !!! Ganesh S Step Up Analytics 12 / 12