Ensemble modeling and Machine Learning

Ensemble Modeling
Step Up Analytics
July 16, 2017
Ganesh S Step Up Analytics 1 / 12

Road Map
Introduction
Ensemble models and possible drawback/s of single speciﬁc
model
How ensemble models works and example
Frequently used ensemble methods and mathematics
Bagging and Bagging Algorithm
Bagging ensembles using R
Comparison of result
Continue with. . .

Introduction
Many of you might studied and practiced different
classification as well regression algorithms.
Also, many a time modeler uses a model at a time.
Ever wondered what would happen if we could combine more
than one classification model?
Whether resulting combo might more accurate or less variant?
Will answer these questions shortly

Ensemble models and possible drawback/s of single
speciﬁc model
Ensembles are the answers to these questions
It is the process of running two or more related but diﬀerent
machine learning models and then synthesizing the results into
single predictive or machine learning model
It can have biases
Presence of high variability
Outright inaccuracies

How ensemble models works and example of ensemble
Producing a distribution called a simple ML model on the
subset of original data
Combining the distribution in one aggregated model
Random Forest
It is the group of multiple decision trees which built on
different sample data,evaluates different factors and/or weight
common variables differently.

How ensembles works
Figure: Working of Ensembles

Frequently used ensemble methods and mathematics
Bagging
Boosting
Distance between predated (y) and actual (y) should be less.
(y − y) = Bias + Variance + Noise
Bias - The average distance between predictions.
Variance - Variability in the predictions.
Noise - Lower bound on the prediction error that the predictor
can achieve.
If we want to minimize(y − y) we have to minimize above
three.

Bagging and Bagging Algorithm
Bagging stands for Bootstrapped Aggregation
Bagging is the way to decrease variance of your prediction by
generating additional training data from the original data with
diﬀerent combination and replications
Bagging Algorithm
1. Samples(with replacement) are repeatedly taken from the data
set, so that each record has an equal record has an equal
probability of being selected, and each sample is of the same
size as the original training data set. These are bootstrapped
samples.
2. Train the model and record the predictions for each sample.
3. Bagging ensembles will be deﬁned as the class with most votes
or the average of prediction made.

Bagging Ensembles using R
Small case study using R, How ensemble bagging works!
Data Source is UCI data repository - Car Evaluation Data Set
Regression models is used
Bagging
Bagging in R

Results of Bagging
Figure: Working of Ensembles

Continue with...
Boosting and Boosting in R
Bagging and Boosting case study in python
Bagging-Boosting comparison
Famous GBM(Gradient Boosting Method)
GBM in R as well in Python with case study

Thank You !!!

Ensemble modeling and Machine Learning

More Related Content

What's hot (20)

Similar to Ensemble modeling and Machine Learning (20)

Recently uploaded (20)

Ensemble modeling and Machine Learning