SlideShare a Scribd company logo
Automated Data-Mining
ALL SME
Anybody Success
www.7segments.co
m
About me
+ (421) 918 666
238
BANKS
• BCR ERSTE BANK (RO)
• TATRABANKA
• PSS
TELCO & RETAIL
• T-MOBILE
• VODAFONE (CZ)
• SOS ELECTRONIC
• EXISPORT
UTILITIES & OTHER
• SPP
• PIXEL FEDERATION
ABOUT PREDICTIVE ANALYTICS
INTRODUCTION
CRISP DM Methodology
Business Understanding
• Set business goal and ask question
– We need to grow sales in SME segment
• What SME are interested in our offer?
• Transform in into task for data-mining
– Divide SME portfolio into 2 group
• Will accept offer
• Will not accept offer
Data-mining starts by asking the right question
Data understanding
ALL SME
ACCEPTED OFFER X
Goal: Target less customers and achieve the same results
All customers Success in last 30 days
Data understanding
Imagine a predictive model splitting customers into segments:
green, orange, red
ALL SME
Target just 20% of customers to get 80% of max profit.
10
%
All customers Success in last 30 days
Best customers
10 000 (20%)
Average customers
20 000 (40%)
Worst customers
20 000 (40%)
50 000 customers
5 000 accepted offer
4000 (80%)
750 (15%)
250 (5%)
100%
20%40%40%
60%
Rule1
Rule2
Data preparation
• In order to build good model you need
relevant data
Customer Age Location Purchases Target
1 10 East 120 Yes
2 5 East 42 No
3 24 West 23 Yes
4 2 West 50 Yes
5 1 West 19 No
More attributes available = better chance for good prediction
Modeling:
Split buyers and non buyers
Logistic RegressionLinear Regression
Decision Tree Neural Networks
Model Evaluation
• Train model on
historical data
• Test on unseen data
– 50%:50%
– different history
Model vs. bežný výber
40%
25%
14%
8%
4% 3% 2% 2% 1% 1%
0%
5%
10%
15%
20%
25%
30%
35%
40%
45%
10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
% celkového počtu klientov
%targetvskupine
model
random
Model vs. random selection on history
% of total SME%buyersingroup
• Good models moves buyers to the first groups
• Weak model is like random: buyers are everywhere
Deployment
• Target campaign to the best customers
• Usual process:
– Import predictions to CRM
– Select top customers
– Evaluate campaign after a XY days
HOW PREDICTIONS WORK
IN 7SEGMENTS
DEMO
Usually it looks like this :-)
IBM SPSS Modeler 16
Smart software can simplify process
Step of methodology
Level of automation
in our solution
Business understanding Requires user input
Data understanding Automated & supervised
Data preparation Automated & supervised
Modeling Automated & supervised
Evaluation Automated & supervised
Deployment Semi-Automated
Use wizard for task definition
or
Select a pre-defined task
Let user to check results
and decide about actions
Who is eligible for prediction?
For all customer who bought some “Shoes” in last 3y
What would you like to predict?
Identify those likely to:
make another purchase above 5€ in next 300 days
Data understanding
Check default response rate & predictors
Training decision tree
Model Evaluation
and Profitability Analysis
Deployment
Use this rules in your next campaign:
Summary
• Non-coders can understand and manage
• Works well on any data
• Useful: instant deployment into campaigns
delivers immediate value
UNDER COVER
INSIGHTS
Under Cover: Data
• We use available customer data for prediction
• We generate predictors from customer events
Customer Time Event Amount Product
1 Monday Purchase 5 Shirt
1 Tuesday Purchase 10 Shoes
1 Thursday Purchase 20 Ball
1 Saturday Purchase 5 Pen
1 Sunday Purchase 2 Pin
Count of (event.purchase) = 5
Count of (event.purchase) in (last 3 days) = 2
Sum (event.purchase.amount) = 43
Sum (event.purchase.amount) in (last 3 days) = 7
Last / First / Most frequent event.purchase.product = Pin/Shirt/-
….
For every event
and all its properties
Under Cover: Training & validation
• Automatic building of train, test and actual dataset
– train is from different time period then test
– actual are the most recent data
• Rules for attribute transformations
– Discretization, coarse classing, remove noise, replace
missing value
• Variation of CHAID decision tree
– It’s almost non-parametric
– ChiSq test works fine on unbalanced data compared to
GINI in CARTs
– Generate heuristics for stopping criteria
Under Cover: Deployment
• Real-time scoring
– Calculate only final predictors what’s easy task for
single customer
– Apply rules-set to get prediction
– Keep scoring for post-evaluation
Under Cover: Near Future
• Apply various algorithms
– For better visualization & insights
• Log. Regression produces visual scorecard with points
– To reduce risk of over fitting or low AUC
• Generate less and better predictors
– [Average] [profit] in [category=value] in the [last/first]
N [periods]
• Identify right time to retrain models
(Almost) Full List of Features
Campaigns
• Automated & Planned
• Scenario Designer
• Email Designer & Gateway
• SMS Gateway
• Survey Designer
• Social Pusher (FB, Twitter, LinkedIn)
CRM
• Customer List
• Custom Attributes
• Events
• Data Management
• Import Wizard
• Aggregator and Derivator
Analytics
• Trends
• Funnels
• Descriptive Statistics
• Session Length
• Custom Report Designer
• Dashbdoard Designer
Optimization
• Segmentations
• Predictions
• Product Recommendation Engine
• Market Basket Analysis
• Real-time Customer Scoring
API and SDK for JavaScript, PHP, Python, Flash, Unity (mobile), Linux shell, …
Thank you for attention!
Call Jozo Kovac (+421) 918 666 238
and schedule live demo!
www.7segments.co
m
+ (421) 918 666
238

More Related Content

PDF
ARM yourself with Exponea - 7 tips for growth
PPTX
6 Things to Make Analytics Work
PPTX
Mastering customer journey with reactivation
PPTX
Customer Retention: The Key To Growth
PPTX
Reduce Churn and Improve Customer Loyalty
PPTX
Adestra Breakfast Seminar: Reaching your true (email) potential
PPTX
Jim Uchneat
PPTX
Driving Higher Growth and Profit, the Trilogy: Solution Selling, Pricing Exce...
ARM yourself with Exponea - 7 tips for growth
6 Things to Make Analytics Work
Mastering customer journey with reactivation
Customer Retention: The Key To Growth
Reduce Churn and Improve Customer Loyalty
Adestra Breakfast Seminar: Reaching your true (email) potential
Jim Uchneat
Driving Higher Growth and Profit, the Trilogy: Solution Selling, Pricing Exce...

What's hot (20)

PDF
Transitioning to a Subscription-based Business Model
PDF
Multi-Touch Attribution Models: Improve Your Advertising ROI
PDF
Bootstrapping a Customer Success Program
PDF
Empowering Your Renewal Teams (ServiceSource and IDC Webinar, March 18, 2013)
PDF
Higher Revenue with Renewals for Sales VPs
PPTX
Customer Journey Optimization by Funnel Ventures
PPTX
Churn Analysis
PDF
5 common mistakes with sales incentive systems: Forgetting the management in ...
PPT
Londens Consulting BI Suite
PDF
Customer Satisfaction
PDF
Segmentation Methods for Management Consultants & Business Analysts
PPT
Chapter Ii Crm
PDF
Webinar: 5 steps to maximize recurring revenue
PPTX
Performance marketing presentation 2016 final (Martech Europe 2016)
PDF
Omni-channel Retail: Fashioning a New Economics
PPTX
Bit120 m01 l05 - customer relationship management
PDF
Building Customer Success Into Your Product: The Inevitable Direction for Saa...
PPTX
Retail AI at work: How leading retailers are using AI to optimize pricing and...
PPSX
Amir yarkoni In House SEM decision support-online-marketing
Transitioning to a Subscription-based Business Model
Multi-Touch Attribution Models: Improve Your Advertising ROI
Bootstrapping a Customer Success Program
Empowering Your Renewal Teams (ServiceSource and IDC Webinar, March 18, 2013)
Higher Revenue with Renewals for Sales VPs
Customer Journey Optimization by Funnel Ventures
Churn Analysis
5 common mistakes with sales incentive systems: Forgetting the management in ...
Londens Consulting BI Suite
Customer Satisfaction
Segmentation Methods for Management Consultants & Business Analysts
Chapter Ii Crm
Webinar: 5 steps to maximize recurring revenue
Performance marketing presentation 2016 final (Martech Europe 2016)
Omni-channel Retail: Fashioning a New Economics
Bit120 m01 l05 - customer relationship management
Building Customer Success Into Your Product: The Inevitable Direction for Saa...
Retail AI at work: How leading retailers are using AI to optimize pricing and...
Amir yarkoni In House SEM decision support-online-marketing
Ad

Similar to Automated Data Mining for Everyone (20)

PPTX
Smart Driver Alert: Predictive Fatigue Detection Technology
PDF
Predictive Analytics Demystified
PPTX
"Ecommerce Customer Segmentation & Prediction: Enhancing Business Strategies ...
PPTX
Customer Analytics
PDF
Machine learning for customer classification
PPTX
Customer analytics
PDF
Predictive Analytics in Practice - Breakfast Club 11th May 2017
PPTX
Day 1 (Lecture 2): Business Analytics
PPTX
Best practices machine learning final
PPSX
Data Refinement: The missing link between data collection and decisions
PPTX
Presentation Title
PPTX
Liubomyr Bregman: Modelling Customer Behaviour 
PDF
IBM Transforming Customer Relationships Through Predictive Analytics
PDF
Best Practices for Big Data Analytics with Machine Learning by Datameer
PPTX
E-Commerce Customer Segmentation and Prediction: Unlocking Insights for Smart...
PDF
TDWI Solution Summit San Diego 2014 Advanced Analytics at Macys.com
PDF
A novel approach to optimizing customer profiles in relation to business metrics
PPTX
E-Commerce Customer Segmentation and Behavior Prediction: A Data-Driven Strategy
PDF
Mirek Cerny (Futurelytics) - Find Hidden Potential in Your Customer Base
PDF
Mirek Černý - Get more from your existing customers - Marketing Festival Brn...
Smart Driver Alert: Predictive Fatigue Detection Technology
Predictive Analytics Demystified
"Ecommerce Customer Segmentation & Prediction: Enhancing Business Strategies ...
Customer Analytics
Machine learning for customer classification
Customer analytics
Predictive Analytics in Practice - Breakfast Club 11th May 2017
Day 1 (Lecture 2): Business Analytics
Best practices machine learning final
Data Refinement: The missing link between data collection and decisions
Presentation Title
Liubomyr Bregman: Modelling Customer Behaviour 
IBM Transforming Customer Relationships Through Predictive Analytics
Best Practices for Big Data Analytics with Machine Learning by Datameer
E-Commerce Customer Segmentation and Prediction: Unlocking Insights for Smart...
TDWI Solution Summit San Diego 2014 Advanced Analytics at Macys.com
A novel approach to optimizing customer profiles in relation to business metrics
E-Commerce Customer Segmentation and Behavior Prediction: A Data-Driven Strategy
Mirek Cerny (Futurelytics) - Find Hidden Potential in Your Customer Base
Mirek Černý - Get more from your existing customers - Marketing Festival Brn...
Ad

Recently uploaded (20)

PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
Cloud computing and distributed systems.
PPTX
Spectroscopy.pptx food analysis technology
PDF
Empathic Computing: Creating Shared Understanding
PDF
Approach and Philosophy of On baking technology
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
Understanding_Digital_Forensics_Presentation.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Review of recent advances in non-invasive hemoglobin estimation
Building Integrated photovoltaic BIPV_UPV.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Diabetes mellitus diagnosis method based random forest with bat algorithm
MYSQL Presentation for SQL database connectivity
sap open course for s4hana steps from ECC to s4
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Encapsulation_ Review paper, used for researhc scholars
NewMind AI Weekly Chronicles - August'25 Week I
Chapter 3 Spatial Domain Image Processing.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
Cloud computing and distributed systems.
Spectroscopy.pptx food analysis technology
Empathic Computing: Creating Shared Understanding
Approach and Philosophy of On baking technology
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Understanding_Digital_Forensics_Presentation.pptx

Automated Data Mining for Everyone

  • 2. www.7segments.co m About me + (421) 918 666 238 BANKS • BCR ERSTE BANK (RO) • TATRABANKA • PSS TELCO & RETAIL • T-MOBILE • VODAFONE (CZ) • SOS ELECTRONIC • EXISPORT UTILITIES & OTHER • SPP • PIXEL FEDERATION
  • 5. Business Understanding • Set business goal and ask question – We need to grow sales in SME segment • What SME are interested in our offer? • Transform in into task for data-mining – Divide SME portfolio into 2 group • Will accept offer • Will not accept offer Data-mining starts by asking the right question
  • 6. Data understanding ALL SME ACCEPTED OFFER X Goal: Target less customers and achieve the same results All customers Success in last 30 days
  • 7. Data understanding Imagine a predictive model splitting customers into segments: green, orange, red ALL SME Target just 20% of customers to get 80% of max profit. 10 % All customers Success in last 30 days Best customers 10 000 (20%) Average customers 20 000 (40%) Worst customers 20 000 (40%) 50 000 customers 5 000 accepted offer 4000 (80%) 750 (15%) 250 (5%) 100% 20%40%40% 60% Rule1 Rule2
  • 8. Data preparation • In order to build good model you need relevant data Customer Age Location Purchases Target 1 10 East 120 Yes 2 5 East 42 No 3 24 West 23 Yes 4 2 West 50 Yes 5 1 West 19 No More attributes available = better chance for good prediction
  • 9. Modeling: Split buyers and non buyers Logistic RegressionLinear Regression Decision Tree Neural Networks
  • 10. Model Evaluation • Train model on historical data • Test on unseen data – 50%:50% – different history Model vs. bežný výber 40% 25% 14% 8% 4% 3% 2% 2% 1% 1% 0% 5% 10% 15% 20% 25% 30% 35% 40% 45% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% % celkového počtu klientov %targetvskupine model random Model vs. random selection on history % of total SME%buyersingroup • Good models moves buyers to the first groups • Weak model is like random: buyers are everywhere
  • 11. Deployment • Target campaign to the best customers • Usual process: – Import predictions to CRM – Select top customers – Evaluate campaign after a XY days
  • 12. HOW PREDICTIONS WORK IN 7SEGMENTS DEMO
  • 13. Usually it looks like this :-) IBM SPSS Modeler 16
  • 14. Smart software can simplify process Step of methodology Level of automation in our solution Business understanding Requires user input Data understanding Automated & supervised Data preparation Automated & supervised Modeling Automated & supervised Evaluation Automated & supervised Deployment Semi-Automated Use wizard for task definition or Select a pre-defined task Let user to check results and decide about actions
  • 15. Who is eligible for prediction? For all customer who bought some “Shoes” in last 3y
  • 16. What would you like to predict? Identify those likely to: make another purchase above 5€ in next 300 days
  • 17. Data understanding Check default response rate & predictors
  • 20. Deployment Use this rules in your next campaign:
  • 21. Summary • Non-coders can understand and manage • Works well on any data • Useful: instant deployment into campaigns delivers immediate value
  • 23. Under Cover: Data • We use available customer data for prediction • We generate predictors from customer events Customer Time Event Amount Product 1 Monday Purchase 5 Shirt 1 Tuesday Purchase 10 Shoes 1 Thursday Purchase 20 Ball 1 Saturday Purchase 5 Pen 1 Sunday Purchase 2 Pin Count of (event.purchase) = 5 Count of (event.purchase) in (last 3 days) = 2 Sum (event.purchase.amount) = 43 Sum (event.purchase.amount) in (last 3 days) = 7 Last / First / Most frequent event.purchase.product = Pin/Shirt/- …. For every event and all its properties
  • 24. Under Cover: Training & validation • Automatic building of train, test and actual dataset – train is from different time period then test – actual are the most recent data • Rules for attribute transformations – Discretization, coarse classing, remove noise, replace missing value • Variation of CHAID decision tree – It’s almost non-parametric – ChiSq test works fine on unbalanced data compared to GINI in CARTs – Generate heuristics for stopping criteria
  • 25. Under Cover: Deployment • Real-time scoring – Calculate only final predictors what’s easy task for single customer – Apply rules-set to get prediction – Keep scoring for post-evaluation
  • 26. Under Cover: Near Future • Apply various algorithms – For better visualization & insights • Log. Regression produces visual scorecard with points – To reduce risk of over fitting or low AUC • Generate less and better predictors – [Average] [profit] in [category=value] in the [last/first] N [periods] • Identify right time to retrain models
  • 27. (Almost) Full List of Features Campaigns • Automated & Planned • Scenario Designer • Email Designer & Gateway • SMS Gateway • Survey Designer • Social Pusher (FB, Twitter, LinkedIn) CRM • Customer List • Custom Attributes • Events • Data Management • Import Wizard • Aggregator and Derivator Analytics • Trends • Funnels • Descriptive Statistics • Session Length • Custom Report Designer • Dashbdoard Designer Optimization • Segmentations • Predictions • Product Recommendation Engine • Market Basket Analysis • Real-time Customer Scoring API and SDK for JavaScript, PHP, Python, Flash, Unity (mobile), Linux shell, …
  • 28. Thank you for attention! Call Jozo Kovac (+421) 918 666 238 and schedule live demo! www.7segments.co m + (421) 918 666 238