SlideShare a Scribd company logo
Customer Activation
Activity Predictive Model
Customer Activation
Focus on Equity
Objectives
1. To predict activity levels of each customer in the near future( (Current Model: 90 days)
2. To profile customer activity over time (i.e., activity states with durations)
3. To determine the recommendations to activate customers
Problem Dimensions
• People
– Who are the people likely to be inactive in the next month?
• Activity State
– What are the different states in customer life cycle?
– What is the customer behaviour in a particular state?
• State Duration
– How long would the customer will be in particular state?
– What will be the transition time for a particular customer?
• Recommendation
– What strategy will be effective to prohibit inactivity of a particular customer?
– What strategy can bring customer back from inactive state to active state?
Analysis Process
Distributions:
Inactive period
behaviour, life
cycle of
customer
Comparative
views:
First time
inactive vs.
current inactive,
inactive vs.
active customer
life cycle
ETL Merge Filter Visualize
Storage:
ACMIIL (Trades)
Data
Formats:
Dates,
categories,
numeric value
ranges, etc.
File Formats:
Comma, Tilde,
or Tab
delimited
Customer
types:
Individual vs.
Institutions,
etc.
Transaction
types:
Buying/Selling,
First time
inactive,
current
inactive
Identifiers:
Client Code,
CommonClient
Code
Timeline:
Daily, Monthly
Aggregates:
Counts, Sums
of EQ buy,
Sums of EQ
sell
Activity Modelling - Outline
Trades
Data
• Summary
• Discovery
Model
• Setup
• Application
• Code
• Results
• Setup - Next Steps
• Application – Next Steps
Future Work
Data: Summary
Statistical measures (e.g., mean) errors
– Units field has negative values
– Too large or small values
Text data:
Mis-matches
Numerical data:
Unreal ranges
Numerical data:
Spurious values
DQ Issues
Sizing for technology
• ~7M EQ and ~1M DER trades per year
• ~100k trading customers currently on
platform, and 1/3rd transacted in the last 6
months
Analysis caution
• Data distributions highly skewed,
e.g., few high amount Txs by one or
two individuals
Data: Discovery
Inactivity count
All clients
inactive at least
once for greater
than 91 days
Inactivity Count
Insights
All Clients have been inactive at least once
Frequency
Data Particulars
Data Duration 2012 Apr - 2015 Sep
Each Row Client-month
Client Category Individual and HUF
# of Rows 366421
# of Columns 31
# of Unique Clients 49444
Data: Discovery
Average Inactive Duration
Average inactivity duration (days)
Frequency
300 days
Insights
Histogram of Average inactivity duration gives maximum frequency at 300 days
Data Particulars
Data Duration 2012 Apr - 2015 Sep
Each Row Client-month
Client Category Individual and HUF
# of Rows 366421
# of Columns 31
# of Unique Clients 49444
Data: Discovery
First Time Inactive vs. Currently Inactive
First time
inactive
Currently
inactive
Vintage (yrs) Vintage (yrs)
Frequency
Frequency
5 yrs 7 yrs
Insights
Current inactive customers are a mix of first time inactive and other periods making it harder to study current
inactivity alone => It brings about the need to study each activity level or state separately
Data Particulars
Data Duration 2012 Apr - 2015 Sep
Each Row Client-month
Client Category Individual and HUF
# of Rows 366421
# of Columns 31
# of Unique Clients 49444
SumAmt.(sold)SumAmt.(Bought)
Data: Discovery
Random customer 1: currently inactive(Tx Amount)
Trend Curve
Trend Curve
Data: Discovery
Random customer 1: currently Inactive (Tx Count)
TxCount(sold)TxCount(bought)
Trend Curve
Trend Curve
Data: Discovery
Random customer 2: currently active(Tx Amount)SumAmt.(sold)SumAmt.(Bought)
Trend Curve
Trend Curve
Data: Discovery
Random customer 2: currently active (Tx Count)
TxCount(sold)TxCount(bought)
Trend Curve
Trend Curve
Data: Discovery Insights
• All clients have been inactive (> 91 days inactivity) at least once
• The most-likely inactivity duration is ~300 days, i.e., if customer becomes
inactive => there is a high chance of a long inactivity period
• Customer behaviour is different before various inactive states
• Each inactive state (i.e., first time or second time, etc.) need to be
modelled separately
• There are different trend curves in a customer’s life cycle that each of
customers follow
• The trend curves may be grouped together into a finite set of
representative trend curves
• All the above may be modelled using a State-space approach
• A simple binary approximation is the Logistic regression model
Test Data
Three Year Trade Data
60% Used for Training
Model
20% Used for Validating
Model
20% Used for Testing
Model
Total Available
Data
Training Data
Validation
Data
Time
Acc
Opening
Date
1 1
First Time
inactive Inactive
1
Active Period
Inactive Period
Inactivity: Defined as 0 transactions in consecutive
91 days
Hypothesis: Customer’s state can be predicted using
transactions data
Logistic Regression Model
 To find predictive variables
 To predict next state of the
customer
0 0 0
0
Data Set Creation
Model: Setup
Summary
after
training the
model
Model
Validation
Model Test
Model: Code View
Model: Application
0 0 1 0 0 0 1
0 0 0 0 1 0 1
Actual States
Predicted States
Inactive
State miss
Active
State miss
Actual
Predicted
Positive
Positive
Negative
Negative
a b
c d
a - True Positive
b - False Negative
c - False Positive
d - True Negative
𝐻 𝑎 =
𝑑
𝑁0
𝑀 𝑎 =
𝑏
𝑁0
𝐻𝑖 =
𝑎
𝑁1
𝑀𝑖 =
𝑐
𝑁1
𝐻 𝑎 - Active state hit rate
𝑀 𝑎- Active state miss rate
𝐻𝑖 - Inactive state hit rate
𝑀𝑖 - Inactive state miss rate
Model: Results
𝑀 𝑎= 0.01%
0
5000
10000
15000
20000
25000
30000
35000
Correct Predicted
Active State
Wrong Predicted
Active State
0
5000
10000
15000
20000
Correct Predicted
Inctive State
Wrong Prdicted
Inctive State
𝐻𝑖 = 84.5%
Threshold = 0.25
0
5000
10000
15000
20000
25000
30000
35000
Correct Predicted
Active State
Wrong Predicted
Active State
𝑀 𝑎= 60.8%
0
5000
10000
15000
20000
25000
Correct Predicted Inctive
State
Wrong Prdicted Inctive
State
𝐻𝑖 = 93.1%
Threshold = 0.35
0
10000
20000
30000
40000
50000
60000
Correct Predicted
Active State
Wrong Predicted
Active State
0
5000
10000
15000
20000
25000
Correct Predicted
Inctive State
Wrong Prdicted
Inctive State
𝑀 𝑎= 40.3%
𝐻𝑖 = 0.0009%
Threshold = 0.50
𝑀 𝑎- Active state miss rate
𝐻𝑖 - Inactive state hit rate
a
a
a
c
c
c
d
d db
b
b
Model: Application (next steps)
Multi-period
Hypothesis:
- Error rates can be decreased by taking into account multiple periods for predictions
0 0 1 0 0 0 1
0 0 0 0 1 0 1
Actual States
Predicted States
Model
predicts 1
Check customer’s
transaction in next 30
days
If Tx = 0
Model output is 0 Model output is 1
TrueFalse
1
Active Period
Inactive Period
0
Future…
State-space Model
active
inactive closed
On-
boarded
Technical Model: State-space Model
• In the applied model we have taken only two states 0 for active and 1 for inactive
• Between these active and inactive state a customer can transit into many different states as shown in the
state space model above
• By applying state space model the complete life cycle of a customer
i. Previous state
ii. Next state
iii. Time he will be in a particular state
iv. Behaviour of customer in a particular state
v. Behaviour of customer just before transition,
vi. Behaviour of customer before going off-board, etc., will be profiled
Discussions and Questions
Back-up Slides
Model: Discovery
Predictive Variables
Model: Setup (next steps)
Customer Sampling
For the current model, Training, validation and Testing dataset has been created by sampling on the basis of
rows, where each row is a particular customer and aggregated transaction amounts on monthly basis.
We can create Training, validation and Testing dataset by sampling as per customer basis.

More Related Content

PPTX
Mutual fund Redemption and Cross Sell Analytics
PPTX
Offer Recommendation methodology for Vito's Mobile App
PPTX
Real-time Market Basket Analysis for Retail with Hadoop
PPTX
Predictive analytics in marketing
PDF
How do insurers convert data to value
PPTX
demand estimation by market research
PPTX
Market demand analysis
PDF
Customer churn prediction in banking
Mutual fund Redemption and Cross Sell Analytics
Offer Recommendation methodology for Vito's Mobile App
Real-time Market Basket Analysis for Retail with Hadoop
Predictive analytics in marketing
How do insurers convert data to value
demand estimation by market research
Market demand analysis
Customer churn prediction in banking

What's hot (16)

PDF
Data Mining Problems in Retail
PPTX
An intelligent approach to demand forecasting
PPT
Demand forcasting
PPSX
Demand Forcasting
PPTX
Demand forecasting
PPTX
Demand forecasting
PPTX
Demand forecasting.
PDF
Machine Learning - Algorithms and simple business cases
PDF
Demand forecasting case study
PDF
I.liiv gaining shopper_insights_using_market_basket_analysis
PPTX
04 demand forecasting
ODP
PDF
Tarun Panchaboni - Resume
PPTX
Demand forecasting
PPT
Demand forecasting
PPTX
Demand mgt in scm
Data Mining Problems in Retail
An intelligent approach to demand forecasting
Demand forcasting
Demand Forcasting
Demand forecasting
Demand forecasting
Demand forecasting.
Machine Learning - Algorithms and simple business cases
Demand forecasting case study
I.liiv gaining shopper_insights_using_market_basket_analysis
04 demand forecasting
Tarun Panchaboni - Resume
Demand forecasting
Demand forecasting
Demand mgt in scm
Ad

Viewers also liked (6)

PPTX
Mutual fund Redemption and Cross Sell Analytics
PPTX
Offer recommendation methodology
DOC
Project on mutual funds is the better investments plan
DOCX
comparative Analysis of mutual fund
DOC
A project report on comparative study of mutual funds in india
PDF
Project on Mutual Funds
Mutual fund Redemption and Cross Sell Analytics
Offer recommendation methodology
Project on mutual funds is the better investments plan
comparative Analysis of mutual fund
A project report on comparative study of mutual funds in india
Project on Mutual Funds
Ad

Similar to Customer activation Predictive model (20)

PPTX
Prediction of customer propensity to churn - Telecom Industry
PDF
De-Mystefying Predictive Analytics
PPTX
Case study for DWDM
PPTX
Day 1 (Lecture 2): Business Analytics
PDF
Project crm submission sonali
PDF
Prediction to lower cost to serve
PPTX
Application of predictive analytics
PPTX
Cross-Sell Home Loans Model for Liability Customers in a Bank.pptx
PPTX
Capstone_Customer_Churn_Prediction_Capstone_proejct.pptx
PPTX
Predictive modeling for lifecycle marketing
PPTX
Insurance Churn Prediction Data Analysis Project
PPT
How to apply CRM using data mining techniques.
PPTX
Maximizing Retention with Minimal Effort
PPTX
Purchase Prediction for Insurance Company
PDF
Exploring the Data science Process
PDF
Techathon Idea Paper
PPTX
Bank Customer Churn Prediction- Saurav Singh.pptx
PPTX
Liubomyr Bregman: Modelling Customer Behaviour 
PDF
Customer Analytics Best Practice
PDF
Predictive analytics. overview of skills and opportunities
Prediction of customer propensity to churn - Telecom Industry
De-Mystefying Predictive Analytics
Case study for DWDM
Day 1 (Lecture 2): Business Analytics
Project crm submission sonali
Prediction to lower cost to serve
Application of predictive analytics
Cross-Sell Home Loans Model for Liability Customers in a Bank.pptx
Capstone_Customer_Churn_Prediction_Capstone_proejct.pptx
Predictive modeling for lifecycle marketing
Insurance Churn Prediction Data Analysis Project
How to apply CRM using data mining techniques.
Maximizing Retention with Minimal Effort
Purchase Prediction for Insurance Company
Exploring the Data science Process
Techathon Idea Paper
Bank Customer Churn Prediction- Saurav Singh.pptx
Liubomyr Bregman: Modelling Customer Behaviour 
Customer Analytics Best Practice
Predictive analytics. overview of skills and opportunities

Recently uploaded (20)

PDF
Business Analytics and business intelligence.pdf
PPTX
Managing Community Partner Relationships
PDF
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
PDF
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
PPTX
Pilar Kemerdekaan dan Identi Bangsa.pptx
PDF
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
PPTX
A Complete Guide to Streamlining Business Processes
PDF
[EN] Industrial Machine Downtime Prediction
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
DOCX
Factor Analysis Word Document Presentation
PPTX
Leprosy and NLEP programme community medicine
PDF
Introduction to the R Programming Language
PDF
Transcultural that can help you someday.
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
PPT
lectureusjsjdhdsjjshdshshddhdhddhhd1.ppt
PPTX
Introduction to Inferential Statistics.pptx
PDF
Microsoft Core Cloud Services powerpoint
PDF
Navigating the Thai Supplements Landscape.pdf
PDF
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
PDF
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
Business Analytics and business intelligence.pdf
Managing Community Partner Relationships
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
Pilar Kemerdekaan dan Identi Bangsa.pptx
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
A Complete Guide to Streamlining Business Processes
[EN] Industrial Machine Downtime Prediction
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Factor Analysis Word Document Presentation
Leprosy and NLEP programme community medicine
Introduction to the R Programming Language
Transcultural that can help you someday.
Optimise Shopper Experiences with a Strong Data Estate.pdf
lectureusjsjdhdsjjshdshshddhdhddhhd1.ppt
Introduction to Inferential Statistics.pptx
Microsoft Core Cloud Services powerpoint
Navigating the Thai Supplements Landscape.pdf
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305

Customer activation Predictive model

  • 2. Customer Activation Focus on Equity Objectives 1. To predict activity levels of each customer in the near future( (Current Model: 90 days) 2. To profile customer activity over time (i.e., activity states with durations) 3. To determine the recommendations to activate customers Problem Dimensions • People – Who are the people likely to be inactive in the next month? • Activity State – What are the different states in customer life cycle? – What is the customer behaviour in a particular state? • State Duration – How long would the customer will be in particular state? – What will be the transition time for a particular customer? • Recommendation – What strategy will be effective to prohibit inactivity of a particular customer? – What strategy can bring customer back from inactive state to active state?
  • 3. Analysis Process Distributions: Inactive period behaviour, life cycle of customer Comparative views: First time inactive vs. current inactive, inactive vs. active customer life cycle ETL Merge Filter Visualize Storage: ACMIIL (Trades) Data Formats: Dates, categories, numeric value ranges, etc. File Formats: Comma, Tilde, or Tab delimited Customer types: Individual vs. Institutions, etc. Transaction types: Buying/Selling, First time inactive, current inactive Identifiers: Client Code, CommonClient Code Timeline: Daily, Monthly Aggregates: Counts, Sums of EQ buy, Sums of EQ sell
  • 4. Activity Modelling - Outline Trades Data • Summary • Discovery Model • Setup • Application • Code • Results • Setup - Next Steps • Application – Next Steps Future Work
  • 5. Data: Summary Statistical measures (e.g., mean) errors – Units field has negative values – Too large or small values Text data: Mis-matches Numerical data: Unreal ranges Numerical data: Spurious values DQ Issues Sizing for technology • ~7M EQ and ~1M DER trades per year • ~100k trading customers currently on platform, and 1/3rd transacted in the last 6 months Analysis caution • Data distributions highly skewed, e.g., few high amount Txs by one or two individuals
  • 6. Data: Discovery Inactivity count All clients inactive at least once for greater than 91 days Inactivity Count Insights All Clients have been inactive at least once Frequency Data Particulars Data Duration 2012 Apr - 2015 Sep Each Row Client-month Client Category Individual and HUF # of Rows 366421 # of Columns 31 # of Unique Clients 49444
  • 7. Data: Discovery Average Inactive Duration Average inactivity duration (days) Frequency 300 days Insights Histogram of Average inactivity duration gives maximum frequency at 300 days Data Particulars Data Duration 2012 Apr - 2015 Sep Each Row Client-month Client Category Individual and HUF # of Rows 366421 # of Columns 31 # of Unique Clients 49444
  • 8. Data: Discovery First Time Inactive vs. Currently Inactive First time inactive Currently inactive Vintage (yrs) Vintage (yrs) Frequency Frequency 5 yrs 7 yrs Insights Current inactive customers are a mix of first time inactive and other periods making it harder to study current inactivity alone => It brings about the need to study each activity level or state separately Data Particulars Data Duration 2012 Apr - 2015 Sep Each Row Client-month Client Category Individual and HUF # of Rows 366421 # of Columns 31 # of Unique Clients 49444
  • 9. SumAmt.(sold)SumAmt.(Bought) Data: Discovery Random customer 1: currently inactive(Tx Amount) Trend Curve Trend Curve
  • 10. Data: Discovery Random customer 1: currently Inactive (Tx Count) TxCount(sold)TxCount(bought) Trend Curve Trend Curve
  • 11. Data: Discovery Random customer 2: currently active(Tx Amount)SumAmt.(sold)SumAmt.(Bought) Trend Curve Trend Curve
  • 12. Data: Discovery Random customer 2: currently active (Tx Count) TxCount(sold)TxCount(bought) Trend Curve Trend Curve
  • 13. Data: Discovery Insights • All clients have been inactive (> 91 days inactivity) at least once • The most-likely inactivity duration is ~300 days, i.e., if customer becomes inactive => there is a high chance of a long inactivity period • Customer behaviour is different before various inactive states • Each inactive state (i.e., first time or second time, etc.) need to be modelled separately • There are different trend curves in a customer’s life cycle that each of customers follow • The trend curves may be grouped together into a finite set of representative trend curves • All the above may be modelled using a State-space approach • A simple binary approximation is the Logistic regression model
  • 14. Test Data Three Year Trade Data 60% Used for Training Model 20% Used for Validating Model 20% Used for Testing Model Total Available Data Training Data Validation Data Time Acc Opening Date 1 1 First Time inactive Inactive 1 Active Period Inactive Period Inactivity: Defined as 0 transactions in consecutive 91 days Hypothesis: Customer’s state can be predicted using transactions data Logistic Regression Model  To find predictive variables  To predict next state of the customer 0 0 0 0 Data Set Creation Model: Setup
  • 16. Model: Application 0 0 1 0 0 0 1 0 0 0 0 1 0 1 Actual States Predicted States Inactive State miss Active State miss Actual Predicted Positive Positive Negative Negative a b c d a - True Positive b - False Negative c - False Positive d - True Negative 𝐻 𝑎 = 𝑑 𝑁0 𝑀 𝑎 = 𝑏 𝑁0 𝐻𝑖 = 𝑎 𝑁1 𝑀𝑖 = 𝑐 𝑁1 𝐻 𝑎 - Active state hit rate 𝑀 𝑎- Active state miss rate 𝐻𝑖 - Inactive state hit rate 𝑀𝑖 - Inactive state miss rate
  • 17. Model: Results 𝑀 𝑎= 0.01% 0 5000 10000 15000 20000 25000 30000 35000 Correct Predicted Active State Wrong Predicted Active State 0 5000 10000 15000 20000 Correct Predicted Inctive State Wrong Prdicted Inctive State 𝐻𝑖 = 84.5% Threshold = 0.25 0 5000 10000 15000 20000 25000 30000 35000 Correct Predicted Active State Wrong Predicted Active State 𝑀 𝑎= 60.8% 0 5000 10000 15000 20000 25000 Correct Predicted Inctive State Wrong Prdicted Inctive State 𝐻𝑖 = 93.1% Threshold = 0.35 0 10000 20000 30000 40000 50000 60000 Correct Predicted Active State Wrong Predicted Active State 0 5000 10000 15000 20000 25000 Correct Predicted Inctive State Wrong Prdicted Inctive State 𝑀 𝑎= 40.3% 𝐻𝑖 = 0.0009% Threshold = 0.50 𝑀 𝑎- Active state miss rate 𝐻𝑖 - Inactive state hit rate a a a c c c d d db b b
  • 18. Model: Application (next steps) Multi-period Hypothesis: - Error rates can be decreased by taking into account multiple periods for predictions 0 0 1 0 0 0 1 0 0 0 0 1 0 1 Actual States Predicted States Model predicts 1 Check customer’s transaction in next 30 days If Tx = 0 Model output is 0 Model output is 1 TrueFalse 1 Active Period Inactive Period 0
  • 20. active inactive closed On- boarded Technical Model: State-space Model • In the applied model we have taken only two states 0 for active and 1 for inactive • Between these active and inactive state a customer can transit into many different states as shown in the state space model above • By applying state space model the complete life cycle of a customer i. Previous state ii. Next state iii. Time he will be in a particular state iv. Behaviour of customer in a particular state v. Behaviour of customer just before transition, vi. Behaviour of customer before going off-board, etc., will be profiled
  • 24. Model: Setup (next steps) Customer Sampling For the current model, Training, validation and Testing dataset has been created by sampling on the basis of rows, where each row is a particular customer and aggregated transaction amounts on monthly basis. We can create Training, validation and Testing dataset by sampling as per customer basis.