SlideShare a Scribd company logo
TreeNet Tree
Ensembles and CART
  Decision Trees: A
Winning Combination
                                                                             October 2012
                                                                          Mikhail Golovnya
                                                                           Salford Systems

CART® software is a trademark of California Statistical Software, Inc. and is licensed exclusively to Salford Systems.
TreeNet® software is a trademark of Salford Systems
Course Outline
• CART decision tree pros/cons
• TreeNet stochastic gradient boosting: a promising
  way to overcome the shortcomings of a single tree
• Introducing TreeNet, a powerful modern ensemble
  of boosted trees
    o   Methodology
    o   Reporting
    o   Interpretability
    o   Post-processing
    o   Interaction detection
• Advantages of using both CART and TreeNet
    o Contribution from CART
    o Contribution from TreeNet



 © Salford Systems 2012
Demonstration Dataset
108,376 bank customers (commercial and individual)
with 6,564 in bad standing over the past two years
Goal: identify customers in bad standing using the
following predictors
Revolving utilization of credit
Age of the primary account holder
Debt ratio of the primary account holder
Monthly income
Number of open credit lines
Number of mortgages
Number of dependents

 © Salford Systems 2012
CART Advantages
1. Relatively fast
2. All types of variables
    1.    Numeric, binary, categorical, missing values

3. Invariant under monotone transformations
    1.    Variable scales are irrelevant
    2.    Immunity to outliers
    3.    Most variables can be used “as is”

4. Resistance to many irrelevant variables
5. Few tunable parameters “off-the-shelf” procedure
6. Interpretable model representation



 © Salford Systems 2012
CART Disadvantages
1. Trade-off: accuracy vs. interpretability
2. Piecewise-constant model
    1.    Big errors near region boundaries
    2.    Impossible to detect fine differences within the segment

3. Instability => high variance
    1.    Small data change => big model change (especially for large trees)

4. Data fragmentation – splitting
5. High interaction order model, unreasonably
   complicated way to represent simple additive
   dependencies



 © Salford Systems 2012
TreeNet Tree Ensembles
• Complements CART advantages, while
  dramatically increasing accuracy

       Tree 1                  Tree 2                    Tree 3


                         +                        +




  First tree grown           2nd tree grown on        3rd tree grown on
     on original              residuals from            residuals from
        target.              first. Predictions       model consisting
    Intentionally            made to improve           of first two trees
   “weak” model                   first tree


© Salford Systems 2012
TreeNet Overcomes
         CART’s Shortcomings
Piecewise-Constant         CART                           TreeNet
Model                      Big errors near region         Fine predictions, nearly
                           boundaries, coarse             emulating smooth
                           predictions                    continuous response
                                                          surface
Instability and Variance   CART                           TreeNet
                           Small data changes             Stable models due to
                           induce big model changes       averaging of individual
                           (especially for large trees)   tree responses
Data Fragmentation         CART                           TreeNet
                           Relatively few predictors      Each tree works with the
                           make it into the model         entire data – many
                                                          opportunities for
                                                          variables to enter
High Interaction Order     CART                           TreeNet
Model                      Always enforced                Allows precise control
  © Salford Systems 2012                                  over the interactions
TreeNet and CART
 A Winning Combination



© Salford Systems 2012

More Related Content

PPTX
TreeNet Overview - Updated October 2012
PPTX
TreeNet Tree Ensembles & CART Decision Trees: A Winning Combination
PPTX
Decision Tree - C4.5&CART
PDF
Lecture 4 Decision Trees (2): Entropy, Information Gain, Gain Ratio
PPTX
Improved Predictions in Structure Based Drug Design Using Cart and Bayesian M...
PPTX
Introduction to Text Mining and Semantics
PPTX
Text mining tutorial
PDF
Boston p camp introducing pmf v3.3 - Steve Wells at ProductCamp Boston, Ap...
TreeNet Overview - Updated October 2012
TreeNet Tree Ensembles & CART Decision Trees: A Winning Combination
Decision Tree - C4.5&CART
Lecture 4 Decision Trees (2): Entropy, Information Gain, Gain Ratio
Improved Predictions in Structure Based Drug Design Using Cart and Bayesian M...
Introduction to Text Mining and Semantics
Text mining tutorial
Boston p camp introducing pmf v3.3 - Steve Wells at ProductCamp Boston, Ap...

Viewers also liked (13)

PPTX
医用画像情報イントロダクション Ver.1 0_20160726
PDF
Smokeless Tobacco and Oral Cancer
PDF
FAQ How do I find My Ideal Virtual Assistant
PDF
94 1006-1-pb
PDF
Recommendation Letter - Xiuting
PDF
PEDIDO DE PROVIDÊNCIA 814
PPT
Ms word1
PPT
日本
DOCX
Mapas conceptuales de proyectos .....
PPTX
8ink 기획서V1 0 김수현,유지은
PDF
8i standby
PDF
Mobile marketing e geolocalização: um mundo de possibilidades
DOCX
Entonar
医用画像情報イントロダクション Ver.1 0_20160726
Smokeless Tobacco and Oral Cancer
FAQ How do I find My Ideal Virtual Assistant
94 1006-1-pb
Recommendation Letter - Xiuting
PEDIDO DE PROVIDÊNCIA 814
Ms word1
日本
Mapas conceptuales de proyectos .....
8ink 기획서V1 0 김수현,유지은
8i standby
Mobile marketing e geolocalização: um mundo de possibilidades
Entonar
Ad

Similar to TreeNet Tree Ensembles and CART Decision Trees: A Winning Combination (20)

PPTX
Some of the new features in SPM 7
PDF
Introduction to Random Forest
PDF
Distributed Logistic Model Trees
PDF
Deep neural networks and tabular data
PPTX
18 Simple CART
PPTX
Decision tree-for ai models-explination for beginner.pptx
PPT
The Use Of Decision Trees For Adaptive Item
PPTX
Hadoop & Greenplum: Why Do Such a Thing?
PPTX
Scaling metagenome assembly
PPTX
An Introduction to Random Forest and linear regression algorithms
PDF
The return of big iron?
PDF
Data Mining Module 3 Business Analtics..pdf
PPT
decisiontrees.ppt
PPT
decisiontrees (3).ppt
PPT
decisiontrees.ppt
PDF
Data Science - Part V - Decision Trees & Random Forests
PPTX
10 best practices in operational analytics
PPTX
Morse-Smale Regression
PPT
Machine Learning M1A.ppt for supervise and unsupervise learning
PDF
Subdivision of large uniform stands lacking natural bounding features
Some of the new features in SPM 7
Introduction to Random Forest
Distributed Logistic Model Trees
Deep neural networks and tabular data
18 Simple CART
Decision tree-for ai models-explination for beginner.pptx
The Use Of Decision Trees For Adaptive Item
Hadoop & Greenplum: Why Do Such a Thing?
Scaling metagenome assembly
An Introduction to Random Forest and linear regression algorithms
The return of big iron?
Data Mining Module 3 Business Analtics..pdf
decisiontrees.ppt
decisiontrees (3).ppt
decisiontrees.ppt
Data Science - Part V - Decision Trees & Random Forests
10 best practices in operational analytics
Morse-Smale Regression
Machine Learning M1A.ppt for supervise and unsupervise learning
Subdivision of large uniform stands lacking natural bounding features
Ad

More from Salford Systems (20)

PDF
Datascience101presentation4
PPTX
Improve Your Regression with CART and RandomForests
PPTX
Churn Modeling-For-Mobile-Telecommunications
PPT
The Do's and Don'ts of Data Mining
PPTX
Introduction to Random Forests by Dr. Adele Cutler
PPTX
9 Data Mining Challenges From Data Scientists Like You
PPTX
Statistically Significant Quotes To Remember
PPTX
Using CART For Beginners with A Teclo Example Dataset
PPT
CART Classification and Regression Trees Experienced User Guide
PPTX
Evolution of regression ols to gps to mars
PPTX
Data Mining for Higher Education
PDF
Comparison of statistical methods commonly used in predictive modeling
PDF
Molecular data mining tool advances in hiv
PDF
SPM v7.0 Feature Matrix
PDF
SPM User's Guide: Introducing MARS
PPT
Hybrid cart logit model 1998
PPTX
Session Logs Tutorial for SPM
PPT
Paradigm shifts in wildlife and biodiversity management through machine learning
PPT
Global Modeling of Biodiversity and Climate Change
PPTX
Predicting Hospital Readmission Using TreeNet
Datascience101presentation4
Improve Your Regression with CART and RandomForests
Churn Modeling-For-Mobile-Telecommunications
The Do's and Don'ts of Data Mining
Introduction to Random Forests by Dr. Adele Cutler
9 Data Mining Challenges From Data Scientists Like You
Statistically Significant Quotes To Remember
Using CART For Beginners with A Teclo Example Dataset
CART Classification and Regression Trees Experienced User Guide
Evolution of regression ols to gps to mars
Data Mining for Higher Education
Comparison of statistical methods commonly used in predictive modeling
Molecular data mining tool advances in hiv
SPM v7.0 Feature Matrix
SPM User's Guide: Introducing MARS
Hybrid cart logit model 1998
Session Logs Tutorial for SPM
Paradigm shifts in wildlife and biodiversity management through machine learning
Global Modeling of Biodiversity and Climate Change
Predicting Hospital Readmission Using TreeNet

TreeNet Tree Ensembles and CART Decision Trees: A Winning Combination

  • 1. TreeNet Tree Ensembles and CART Decision Trees: A Winning Combination October 2012 Mikhail Golovnya Salford Systems CART® software is a trademark of California Statistical Software, Inc. and is licensed exclusively to Salford Systems. TreeNet® software is a trademark of Salford Systems
  • 2. Course Outline • CART decision tree pros/cons • TreeNet stochastic gradient boosting: a promising way to overcome the shortcomings of a single tree • Introducing TreeNet, a powerful modern ensemble of boosted trees o Methodology o Reporting o Interpretability o Post-processing o Interaction detection • Advantages of using both CART and TreeNet o Contribution from CART o Contribution from TreeNet © Salford Systems 2012
  • 3. Demonstration Dataset 108,376 bank customers (commercial and individual) with 6,564 in bad standing over the past two years Goal: identify customers in bad standing using the following predictors Revolving utilization of credit Age of the primary account holder Debt ratio of the primary account holder Monthly income Number of open credit lines Number of mortgages Number of dependents © Salford Systems 2012
  • 4. CART Advantages 1. Relatively fast 2. All types of variables 1. Numeric, binary, categorical, missing values 3. Invariant under monotone transformations 1. Variable scales are irrelevant 2. Immunity to outliers 3. Most variables can be used “as is” 4. Resistance to many irrelevant variables 5. Few tunable parameters “off-the-shelf” procedure 6. Interpretable model representation © Salford Systems 2012
  • 5. CART Disadvantages 1. Trade-off: accuracy vs. interpretability 2. Piecewise-constant model 1. Big errors near region boundaries 2. Impossible to detect fine differences within the segment 3. Instability => high variance 1. Small data change => big model change (especially for large trees) 4. Data fragmentation – splitting 5. High interaction order model, unreasonably complicated way to represent simple additive dependencies © Salford Systems 2012
  • 6. TreeNet Tree Ensembles • Complements CART advantages, while dramatically increasing accuracy Tree 1 Tree 2 Tree 3 + + First tree grown 2nd tree grown on 3rd tree grown on on original residuals from residuals from target. first. Predictions model consisting Intentionally made to improve of first two trees “weak” model first tree © Salford Systems 2012
  • 7. TreeNet Overcomes CART’s Shortcomings Piecewise-Constant CART TreeNet Model Big errors near region Fine predictions, nearly boundaries, coarse emulating smooth predictions continuous response surface Instability and Variance CART TreeNet Small data changes Stable models due to induce big model changes averaging of individual (especially for large trees) tree responses Data Fragmentation CART TreeNet Relatively few predictors Each tree works with the make it into the model entire data – many opportunities for variables to enter High Interaction Order CART TreeNet Model Always enforced Allows precise control © Salford Systems 2012 over the interactions
  • 8. TreeNet and CART A Winning Combination © Salford Systems 2012