SlideShare a Scribd company logo
Data Mining By Example – Building
Predictive Model Using Microsoft
Decision Trees
by Shaoli Lu
Microsoft Decision Trees
• Developed by Microsoft research team, the
Microsoft Decision Trees algorithm is a hybrid
decision tree algorithm that supports
classification and regression
Goal
• To predict a prospect’s likelihood of
purchasing a bike
Prerequisite
• An SQL Server instance created (2005 or above)
• SQL Server Analysis Service (SSAS) –
Multidimensional Feature Installed
(this is used to host and browse the mining structures; cube is not required for data mining!)
• AdventureWorksDW database attached
(download from CodePlex - tailor to the SQL Server version you have)
• Visual Studio 2010 or above with SQL Server
Data Tools (SSDT) installed
My Demo Setup
• Visual Studio 2010
• SQL Server 2012
Create Data Mining Project
• Name the project as DM Decision Trees
(DM = Data Mining)
Data mining by example - building predictive model using microsoft decision trees
Create Data Source and Impersonation
Data mining by example - building predictive model using microsoft decision trees
Create Data Source View
Data mining by example - building predictive model using microsoft decision trees
Create Mining Structure
• Choose Microsoft Decision Trees model
• Select Data Source View
• Choose training data
• Select Input/Predict parameters
• Set content types
• Set Holdout percentage
• Name the mining structure and model
Data mining by example - building predictive model using microsoft decision trees
Data mining by example - building predictive model using microsoft decision trees
Data mining by example - building predictive model using microsoft decision trees
Data mining by example - building predictive model using microsoft decision trees
Data mining by example - building predictive model using microsoft decision trees
Data mining by example - building predictive model using microsoft decision trees
Data mining by example - building predictive model using microsoft decision trees
Data mining by example - building predictive model using microsoft decision trees
Data mining by example - building predictive model using microsoft decision trees
Data mining by example - building predictive model using microsoft decision trees
Data mining by example - building predictive model using microsoft decision trees
Deploy the mining structure and
model
Data mining by example - building predictive model using microsoft decision trees
Process the mining model
• This is also called “training the model”
Data mining by example - building predictive model using microsoft decision trees
Mining Model Viewer
• Identify dominant attributes
• Left is associative with more important
attributes
• Rich visualization is good for data exploration
as well
Data mining by example - building predictive model using microsoft decision trees
Mining Model Accuracy Chart
• This is called “Testing the Model” using the
Holdout data
• Lift chart
• Profit chart
Data mining by example - building predictive model using microsoft decision trees
Data mining by example - building predictive model using microsoft decision trees
Data mining by example - building predictive model using microsoft decision trees
Mining Model Prediction
• Singleton query
• Mass prediction
Data mining by example - building predictive model using microsoft decision trees
Data mining by example - building predictive model using microsoft decision trees
Data mining by example - building predictive model using microsoft decision trees
Data mining by example - building predictive model using microsoft decision trees
Data mining by example - building predictive model using microsoft decision trees
Data mining by example - building predictive model using microsoft decision trees
Browse mining model on SQL Server
• Decision trees
• Dependency network
Data mining by example - building predictive model using microsoft decision trees
Data mining by example - building predictive model using microsoft decision trees
Data mining by example - building predictive model using microsoft decision trees
Summary
• Microsoft Decision Trees is a powerful data
mining model, yet it is easy to build, train and use
• Can perform both Singleton (e.g. embed in an
app) and Mass Predictions (e.g. targeted
marketing)
• Holdout data can be used to test trained model
• Rich visualizations such as Lift/Profit Charts and
Dependency Network can facilitate analysis and
data exploration
• Relational database can be used for data mining;
cube is not required
The End

More Related Content

PPTX
Data mining by example forecasting and cross prediction using microsoft time ...
PDF
Microsoft SQL Server Analysis Services (SSAS) - A Practical Introduction
PDF
Developing with SQL Server Analysis Services 201310
PPTX
Developing ssas cube
PDF
Intoduction to sql 2012 Tabular Modeling
PPTX
Building 1 million predictions per second using SQL-R
PDF
A Gentle Introduction to Microsoft SSAS
PPTX
SSAS Tabular model importance and uses
Data mining by example forecasting and cross prediction using microsoft time ...
Microsoft SQL Server Analysis Services (SSAS) - A Practical Introduction
Developing with SQL Server Analysis Services 201310
Developing ssas cube
Intoduction to sql 2012 Tabular Modeling
Building 1 million predictions per second using SQL-R
A Gentle Introduction to Microsoft SSAS
SSAS Tabular model importance and uses

What's hot (20)

PPTX
Cloud architectural patterns and Microsoft Azure tools
PPTX
Modern ETL: Azure Data Factory, Data Lake, and SQL Database
PPTX
SQL PASS 2017 - Building one million predictions per second using SQL Server ...
PPTX
Geek Sync | Taking Your First Steps to the Cloud—Building a Hybrid Model
PDF
Azure SQL Data Warehouse
PDF
Feature store Overview St. Louis Big Data IDEA Meetup aug 2020
PPTX
Microsoft Azure Data Factory Hands-On Lab Overview Slides
PPTX
Azure Data Factory Data Flow Preview December 2019
PPTX
R in Power BI
PDF
Azure data analytics platform - A reference architecture
PPTX
Machine Learning on the Microsoft Stack
PDF
ADF Mapping Data Flow Private Preview Migration
PPTX
Tech-Spark: Azure SQL Databases
PPTX
Azure data factory
PPTX
Microsoft Azure Data Factory Data Flow Scenarios
PDF
Azure Data Factory v2
PPTX
Analytics in the Cloud
PPTX
Spark Summit West 2017: Real-Time Image Recognition with MemSQL and Spark
PPTX
Intro to Azure Data Factory v1
PPTX
Understanding performance bottlenecks using performance dashboard
Cloud architectural patterns and Microsoft Azure tools
Modern ETL: Azure Data Factory, Data Lake, and SQL Database
SQL PASS 2017 - Building one million predictions per second using SQL Server ...
Geek Sync | Taking Your First Steps to the Cloud—Building a Hybrid Model
Azure SQL Data Warehouse
Feature store Overview St. Louis Big Data IDEA Meetup aug 2020
Microsoft Azure Data Factory Hands-On Lab Overview Slides
Azure Data Factory Data Flow Preview December 2019
R in Power BI
Azure data analytics platform - A reference architecture
Machine Learning on the Microsoft Stack
ADF Mapping Data Flow Private Preview Migration
Tech-Spark: Azure SQL Databases
Azure data factory
Microsoft Azure Data Factory Data Flow Scenarios
Azure Data Factory v2
Analytics in the Cloud
Spark Summit West 2017: Real-Time Image Recognition with MemSQL and Spark
Intro to Azure Data Factory v1
Understanding performance bottlenecks using performance dashboard
Ad

Similar to Data mining by example - building predictive model using microsoft decision trees (20)

PPT
Real-world BISM in SQL Server 2012 SSAS
PPTX
Data mining tools for excel and sql server
PPSX
New features of sql server 2016 bi features
PPTX
Mine craft:
PDF
SSAS Design & Incremental Processing - PASSMN May 2010
PDF
Ds03 data analysis
PDF
Machine Learning Operations Cababilities
PPTX
Kicktag - About Kicktag & Cosmos 2014
PPTX
Msbi 2012 online training
PDF
Micro strategy 7i
PDF
Building Data Warehouse in SQL Server
PDF
Trenowanie i wdrażanie modeli uczenia maszynowego z wykorzystaniem Google Clo...
PDF
Designing a modern data warehouse in azure
PDF
Designing a modern data warehouse in azure
PPTX
Azure machine learning tech mela
PDF
Predicting Flights with Azure Databricks
PPTX
Msbi 2008 online training
PDF
Alex mang patterns for scalability in microsoft azure application
PDF
SSAS, MDX , Cube understanding, Browsing and Tools information
PPTX
SQL Server 2019 Master Data Service
Real-world BISM in SQL Server 2012 SSAS
Data mining tools for excel and sql server
New features of sql server 2016 bi features
Mine craft:
SSAS Design & Incremental Processing - PASSMN May 2010
Ds03 data analysis
Machine Learning Operations Cababilities
Kicktag - About Kicktag & Cosmos 2014
Msbi 2012 online training
Micro strategy 7i
Building Data Warehouse in SQL Server
Trenowanie i wdrażanie modeli uczenia maszynowego z wykorzystaniem Google Clo...
Designing a modern data warehouse in azure
Designing a modern data warehouse in azure
Azure machine learning tech mela
Predicting Flights with Azure Databricks
Msbi 2008 online training
Alex mang patterns for scalability in microsoft azure application
SSAS, MDX , Cube understanding, Browsing and Tools information
SQL Server 2019 Master Data Service
Ad

Recently uploaded (20)

PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
Computer network topology notes for revision
PPTX
Database Infoormation System (DBIS).pptx
PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
1_Introduction to advance data techniques.pptx
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PDF
Mega Projects Data Mega Projects Data
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PDF
Business Analytics and business intelligence.pdf
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Computer network topology notes for revision
Database Infoormation System (DBIS).pptx
climate analysis of Dhaka ,Banglades.pptx
STUDY DESIGN details- Lt Col Maksud (21).pptx
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Supervised vs unsupervised machine learning algorithms
1_Introduction to advance data techniques.pptx
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
Mega Projects Data Mega Projects Data
Business Ppt On Nestle.pptx huunnnhhgfvu
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
Data_Analytics_and_PowerBI_Presentation.pptx
Business Analytics and business intelligence.pdf
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
IBA_Chapter_11_Slides_Final_Accessible.pptx
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
Galatica Smart Energy Infrastructure Startup Pitch Deck

Data mining by example - building predictive model using microsoft decision trees

  • 1. Data Mining By Example – Building Predictive Model Using Microsoft Decision Trees by Shaoli Lu
  • 2. Microsoft Decision Trees • Developed by Microsoft research team, the Microsoft Decision Trees algorithm is a hybrid decision tree algorithm that supports classification and regression
  • 3. Goal • To predict a prospect’s likelihood of purchasing a bike
  • 4. Prerequisite • An SQL Server instance created (2005 or above) • SQL Server Analysis Service (SSAS) – Multidimensional Feature Installed (this is used to host and browse the mining structures; cube is not required for data mining!) • AdventureWorksDW database attached (download from CodePlex - tailor to the SQL Server version you have) • Visual Studio 2010 or above with SQL Server Data Tools (SSDT) installed
  • 5. My Demo Setup • Visual Studio 2010 • SQL Server 2012
  • 6. Create Data Mining Project • Name the project as DM Decision Trees (DM = Data Mining)
  • 8. Create Data Source and Impersonation
  • 12. Create Mining Structure • Choose Microsoft Decision Trees model • Select Data Source View • Choose training data • Select Input/Predict parameters • Set content types • Set Holdout percentage • Name the mining structure and model
  • 24. Deploy the mining structure and model
  • 26. Process the mining model • This is also called “training the model”
  • 28. Mining Model Viewer • Identify dominant attributes • Left is associative with more important attributes • Rich visualization is good for data exploration as well
  • 30. Mining Model Accuracy Chart • This is called “Testing the Model” using the Holdout data • Lift chart • Profit chart
  • 34. Mining Model Prediction • Singleton query • Mass prediction
  • 41. Browse mining model on SQL Server • Decision trees • Dependency network
  • 45. Summary • Microsoft Decision Trees is a powerful data mining model, yet it is easy to build, train and use • Can perform both Singleton (e.g. embed in an app) and Mass Predictions (e.g. targeted marketing) • Holdout data can be used to test trained model • Rich visualizations such as Lift/Profit Charts and Dependency Network can facilitate analysis and data exploration • Relational database can be used for data mining; cube is not required