SlideShare a Scribd company logo
Maximizing
Retention with
Minimal Effort
Vijay Kumar Devalla
Business Overview
• Focus: Utilizing detailed customer data to track account status—
active or closed.
• Impact: Customer churn leads to direct revenue loss, increased acquisition
costs, and reduced profitability.
• Significance: Addressing account closures (churn) is a crucial challenge for
sustained growth and competitiveness in the financial industry.
• Approach: Strategically leveraging insights to proactively manage churn,
strengthen relationships, and optimize revenue.
Dataset • Our dataset contains
– Binary target variable
– Numerical and Categorial feature variables
01
CreditScor
e
Data
Choice
02
Geography
03
Age
04
Tenure
05
Balanc
e
06
NumOfProduct
s
07
HasCrCard
08
IsActiveMembe
r
09
EstimatedSalar
y
01
RowNumber
02
CustomerID
03
Surname
04
Gender
This data set contains 10,000 rows data.
Why certain variables are removed?
Data Choice
• RowNumber: This is likely a sequential identifier for each row in your dataset. It does
not carry any meaningful information about the customer's behavior or characteristics
that would influence their decision to leave the bank.
• CustomerID: Similar to RowNumber, CustomerID is a unique identifier for each
customer.
• Surname: The surname of a customer is another variable that does not have a direct
impact on their banking behavior or their decision to stay with or leave the bank.
• Gender: Based on bias and ethical considerations, the inclusion of gender as a
variable can potentially lead to biased outcomes. Moreover, our goal is to understand
and predict churn based on financial behaviors and product usage, gender might not
be relevant to this specific analysis.
Data
Preparation
O v e r S a m p l i n g
• Observed that target
variable has unequal
distribution of 0s and 1s
• Used Synthetic Minority
Over-sampling Technique
(SMOTE) to balance the
distribution of target
variable
L a b e l E n c o d i n g
• Categorical values need to be
encoded into numerical
values.
• Created dummies for
‘Geography’
• Changed ‘Balance’ and
‘Estimated Salary’ columns
from string to numerical
values
N u l l H a n d l i n g
• To prevent inaccurate model
training, we need to address
the missing values.
• Used mode to fill in the
missing data
C o l u m n D r o p p i n g
• Remove unwanted columns to
build an efficient model.
• These columns have minimal
to negligible effect on the
target variable.
Modeling
Overview Models Implemented:
We implemented a diverse set of models,
including:
Decision Tree, Logistic Regression,
SVM, Neural Networks, Random Forest,
Ensemble Models, Gradient Boosting,
XGBoost, and K-NN.
Each model brings a unique approach to
tackling the challenge of predicting bank
churn.
F-1 score:
Model Performance Comparison:
Precision:
Precision Comparison:
Recall:
Recall Comparison:
Analysis of Specific Models
● K-Means Clustering:
K-Means Clustering, an unsupervised algorithm, employs metrics like
Silhouette Score (0.21) and Inertia (34999.87) for evaluation. The moderate
Silhouette Score suggests some separation between clusters, urging
careful examination for meaningful insights amid potential overlap. Inertia
signifies cluster compactness, with ongoing exploration involving
adjustments to cluster numbers to enhance distinct and meaningful
groupings in our project.
● Logistic Regression Attribute Selection:
In the realm of Logistic Regression attribute selection, the focus is not
solely on traditional metrics. Instead, we emphasize the importance of
feature selection and the impact of variables on the model. A careful
analysis of coefficients and statistical significance guides us in
understanding the relevance of each attribute in predicting customer
churn.
Winning Model:
Gradient Boosting Hyper-parameter Tuning
The standout model is Gradient Boosting with Hyperparameter
Tuning, as evidenced by its strong performance metrics in
precision, recall, and F1-score for both class 0 and class 1.
This noteworthy achievement suggests that the Gradient Boosting
model minimizes false predictions effectively for both categories,
reinforcing its reliability and accuracy in our predictive modeling
context.
Model
Evaluation
Evaluation:
Lift/Gain Curve
By reaching out to 40% of the customers
who belong to 4 deciles, we will be able to
find out 85% of the customers who are
more likely to exit.
A Lift of 2, for example, corresponds with
there being twice the number of
customers exited compared with the
number you’d expect by contacting the
same number of customers at random.
So, we may have only contacted 40% of
the customers, but we may have reached
85% of the customers who are likely to
exit in the customer base.
Decile Scores CustomerCountCummulativeCustomers CummCustomer%CustomersLikelytoexit CummCustomersLikelytoexit CummCustomersLikelytoexit% Lift
9 0.8769-0.9986 335 335 10.01% 319 319 33.83% 3.38
8 0.5864-0.8763 334 669 20.00% 248 567 60.13% 3.01
7 0.3277-0.5862 335 1004 30.01% 146 713 75.61% 2.52
6 0.1974-0.3275 334 1338 40.00% 83 796 84.41% 2.11
5 0.1299-0.1971 334 1672 49.99% 61 857 90.88% 1.82
4 0.089-0.1299 335 2007 60.00% 32 889 94.27% 1.57
3 0.0636-0.0889 334 2341 69.99% 23 912 96.71% 1.38
2 0.0448-0.0636 335 2676 80.00% 17 929 98.52% 1.23
1 0.0302-0.0447 334 3010 89.99% 8 937 99.36% 1.10
0 0.0053-0.0301 335 3345 100.00% 6 943 100.00% 1.00
Evaluation:
Business Case Development Steps for Project
Improvement.
Define the business problem:
- High customer churn rates impacting revenue and profitability.
Current State:
- Describe the current situation: Current churn rates, associated costs, and the impact on customer satisfaction.
- Highlight the need for a proactive approach to customer retention.
Expected Improvements:
- Quantify the expected improvements:
- Reduce churn rate by X%.
- Increase customer retention by X months.
- Enhance customer satisfaction and loyalty.
ROI Projection:
- Calculate ROI by considering:
- Cost of implementing the churn modeling solution.
- Projected revenue increase from retained customers.
- Potential cost savings in marketing efforts targeting the right customers.
Key Metrics for Evaluation:
- Identify key performance indicators (KPIs) for evaluating the success of the churn modeling solution:
- Precision, recall, and F1 score for model accuracy.
- Reduction in customer churn rate.
- Increase in customer lifetime value (CLV).
Evaluation:
Suggesting Improvements
Feature Importance Analysis:
- Conduct a feature importance analysis to understand which factors contribute
most to churn predictions. Use this information to suggest improvements in
customer retention strategies.
Explore Advanced Techniques:
- Consider exploring more advanced modeling techniques (e.g., ensemble
methods, deep learning) to potentially enhance the predictive capabilities of the
model.
Continuous Monitoring and Iteration:
- Emphasize the importance of continuous model monitoring and iteration.
Churn patterns may change over time, and the model should be adapted
accordingly.
Customer Segmentation:
- Explore customer segmentation to tailor retention strategies for different
customer groups. This can enhance the precision of targeted interventions.
Deployment
strategies
Model Deployment
● For less time-sensitive use cases, deploy the model in a batch
processing pipeline that periodically evaluates churn risk for all
customers.
● Implement an alerting system that notifies relevant stakeholders
when a high-risk customer is predicted, allowing for proactive
retention strategies.
● Establish a feedback loop where model predictions and outcomes
are monitored, and the model is periodically retrained for
continuous improvement.
● Develop a monitoring dashboard that provides insights into model
performance, prediction accuracy, and customer churn trends.
● Ensure the model's predictions are interpretable, especially if
customer-facing staff will use them, to build trust in the model's
recommendations.
CREDITS: This presentation template was created by
Slidesgo, and includes icons by Flaticon, and
infographics & images by Freepik
Thanks

More Related Content

PPTX
Bank Customer Churn Prediction- Saurav Singh.pptx
PPTX
Data mining and analysis of customer churn dataset
PDF
Churn analysis
PDF
Techathon Idea Paper
PPTX
Capstone_Customer_Churn_Prediction_Capstone_proejct.pptx
PPTX
Customer_Churn_prediction.pptx
PPTX
Customer_Churn_prediction.pptx
PDF
Customer 360 brochure
Bank Customer Churn Prediction- Saurav Singh.pptx
Data mining and analysis of customer churn dataset
Churn analysis
Techathon Idea Paper
Capstone_Customer_Churn_Prediction_Capstone_proejct.pptx
Customer_Churn_prediction.pptx
Customer_Churn_prediction.pptx
Customer 360 brochure

Similar to Maximizing Retention with Minimal Effort (20)

PPTX
Fintech is money for paltforms learning baout bank churn
PPTX
Prediction of customer propensity to churn - Telecom Industry
PPTX
Telecom Churn Prediction Presentation
PPTX
ai ml presentation.pptx ON SUBSCRIPTION BASED INDUSTRY
PPTX
Day 1 (Lecture 2): Business Analytics
PDF
IRJET - Customer Churn Analysis in Telecom Industry
PPTX
Insurance Churn Prediction Data Analysis Project
PDF
1055 track3 soules
PDF
QWE Inc Report_Group 2
PPT
MIS637_Final_Project_Rahul_Bhatia
PDF
Project crm submission sonali
PPTX
Bank churn with Data Science
PPTX
Kratos iim kozhikode
PPTX
Telecom Churn Analysis
PDF
OurBank Customer Retention Strategy Q1 2021
PDF
A Proposed Churn Prediction Model
PDF
Churn Analytics - Techwave .pdf
PDF
Automated Feature Selection and Churn Prediction using Deep Learning Models
PDF
1030 track3 boire
PDF
RESEARCH ON INTEGRATED LEARNING ALGORITHM MODEL OF BANK CUSTOMER CHURN PREDIC...
Fintech is money for paltforms learning baout bank churn
Prediction of customer propensity to churn - Telecom Industry
Telecom Churn Prediction Presentation
ai ml presentation.pptx ON SUBSCRIPTION BASED INDUSTRY
Day 1 (Lecture 2): Business Analytics
IRJET - Customer Churn Analysis in Telecom Industry
Insurance Churn Prediction Data Analysis Project
1055 track3 soules
QWE Inc Report_Group 2
MIS637_Final_Project_Rahul_Bhatia
Project crm submission sonali
Bank churn with Data Science
Kratos iim kozhikode
Telecom Churn Analysis
OurBank Customer Retention Strategy Q1 2021
A Proposed Churn Prediction Model
Churn Analytics - Techwave .pdf
Automated Feature Selection and Churn Prediction using Deep Learning Models
1030 track3 boire
RESEARCH ON INTEGRATED LEARNING ALGORITHM MODEL OF BANK CUSTOMER CHURN PREDIC...
Ad

Recently uploaded (20)

PDF
annual-report-2024-2025 original latest.
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PDF
Lecture1 pattern recognition............
PDF
Business Analytics and business intelligence.pdf
PPT
Quality review (1)_presentation of this 21
PDF
Clinical guidelines as a resource for EBP(1).pdf
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPTX
Computer network topology notes for revision
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
annual-report-2024-2025 original latest.
Introduction to Knowledge Engineering Part 1
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
IBA_Chapter_11_Slides_Final_Accessible.pptx
Business Ppt On Nestle.pptx huunnnhhgfvu
Lecture1 pattern recognition............
Business Analytics and business intelligence.pdf
Quality review (1)_presentation of this 21
Clinical guidelines as a resource for EBP(1).pdf
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
Qualitative Qantitative and Mixed Methods.pptx
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
Computer network topology notes for revision
oil_refinery_comprehensive_20250804084928 (1).pptx
climate analysis of Dhaka ,Banglades.pptx
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Ad

Maximizing Retention with Minimal Effort

  • 2. Business Overview • Focus: Utilizing detailed customer data to track account status— active or closed. • Impact: Customer churn leads to direct revenue loss, increased acquisition costs, and reduced profitability. • Significance: Addressing account closures (churn) is a crucial challenge for sustained growth and competitiveness in the financial industry. • Approach: Strategically leveraging insights to proactively manage churn, strengthen relationships, and optimize revenue.
  • 3. Dataset • Our dataset contains – Binary target variable – Numerical and Categorial feature variables
  • 5. Data Choice • RowNumber: This is likely a sequential identifier for each row in your dataset. It does not carry any meaningful information about the customer's behavior or characteristics that would influence their decision to leave the bank. • CustomerID: Similar to RowNumber, CustomerID is a unique identifier for each customer. • Surname: The surname of a customer is another variable that does not have a direct impact on their banking behavior or their decision to stay with or leave the bank. • Gender: Based on bias and ethical considerations, the inclusion of gender as a variable can potentially lead to biased outcomes. Moreover, our goal is to understand and predict churn based on financial behaviors and product usage, gender might not be relevant to this specific analysis.
  • 7. O v e r S a m p l i n g • Observed that target variable has unequal distribution of 0s and 1s • Used Synthetic Minority Over-sampling Technique (SMOTE) to balance the distribution of target variable L a b e l E n c o d i n g • Categorical values need to be encoded into numerical values. • Created dummies for ‘Geography’ • Changed ‘Balance’ and ‘Estimated Salary’ columns from string to numerical values
  • 8. N u l l H a n d l i n g • To prevent inaccurate model training, we need to address the missing values. • Used mode to fill in the missing data C o l u m n D r o p p i n g • Remove unwanted columns to build an efficient model. • These columns have minimal to negligible effect on the target variable.
  • 9. Modeling Overview Models Implemented: We implemented a diverse set of models, including: Decision Tree, Logistic Regression, SVM, Neural Networks, Random Forest, Ensemble Models, Gradient Boosting, XGBoost, and K-NN. Each model brings a unique approach to tackling the challenge of predicting bank churn.
  • 13. Analysis of Specific Models ● K-Means Clustering: K-Means Clustering, an unsupervised algorithm, employs metrics like Silhouette Score (0.21) and Inertia (34999.87) for evaluation. The moderate Silhouette Score suggests some separation between clusters, urging careful examination for meaningful insights amid potential overlap. Inertia signifies cluster compactness, with ongoing exploration involving adjustments to cluster numbers to enhance distinct and meaningful groupings in our project. ● Logistic Regression Attribute Selection: In the realm of Logistic Regression attribute selection, the focus is not solely on traditional metrics. Instead, we emphasize the importance of feature selection and the impact of variables on the model. A careful analysis of coefficients and statistical significance guides us in understanding the relevance of each attribute in predicting customer churn.
  • 14. Winning Model: Gradient Boosting Hyper-parameter Tuning The standout model is Gradient Boosting with Hyperparameter Tuning, as evidenced by its strong performance metrics in precision, recall, and F1-score for both class 0 and class 1. This noteworthy achievement suggests that the Gradient Boosting model minimizes false predictions effectively for both categories, reinforcing its reliability and accuracy in our predictive modeling context.
  • 16. Evaluation: Lift/Gain Curve By reaching out to 40% of the customers who belong to 4 deciles, we will be able to find out 85% of the customers who are more likely to exit. A Lift of 2, for example, corresponds with there being twice the number of customers exited compared with the number you’d expect by contacting the same number of customers at random. So, we may have only contacted 40% of the customers, but we may have reached 85% of the customers who are likely to exit in the customer base. Decile Scores CustomerCountCummulativeCustomers CummCustomer%CustomersLikelytoexit CummCustomersLikelytoexit CummCustomersLikelytoexit% Lift 9 0.8769-0.9986 335 335 10.01% 319 319 33.83% 3.38 8 0.5864-0.8763 334 669 20.00% 248 567 60.13% 3.01 7 0.3277-0.5862 335 1004 30.01% 146 713 75.61% 2.52 6 0.1974-0.3275 334 1338 40.00% 83 796 84.41% 2.11 5 0.1299-0.1971 334 1672 49.99% 61 857 90.88% 1.82 4 0.089-0.1299 335 2007 60.00% 32 889 94.27% 1.57 3 0.0636-0.0889 334 2341 69.99% 23 912 96.71% 1.38 2 0.0448-0.0636 335 2676 80.00% 17 929 98.52% 1.23 1 0.0302-0.0447 334 3010 89.99% 8 937 99.36% 1.10 0 0.0053-0.0301 335 3345 100.00% 6 943 100.00% 1.00
  • 17. Evaluation: Business Case Development Steps for Project Improvement. Define the business problem: - High customer churn rates impacting revenue and profitability. Current State: - Describe the current situation: Current churn rates, associated costs, and the impact on customer satisfaction. - Highlight the need for a proactive approach to customer retention. Expected Improvements: - Quantify the expected improvements: - Reduce churn rate by X%. - Increase customer retention by X months. - Enhance customer satisfaction and loyalty. ROI Projection: - Calculate ROI by considering: - Cost of implementing the churn modeling solution. - Projected revenue increase from retained customers. - Potential cost savings in marketing efforts targeting the right customers. Key Metrics for Evaluation: - Identify key performance indicators (KPIs) for evaluating the success of the churn modeling solution: - Precision, recall, and F1 score for model accuracy. - Reduction in customer churn rate. - Increase in customer lifetime value (CLV).
  • 18. Evaluation: Suggesting Improvements Feature Importance Analysis: - Conduct a feature importance analysis to understand which factors contribute most to churn predictions. Use this information to suggest improvements in customer retention strategies. Explore Advanced Techniques: - Consider exploring more advanced modeling techniques (e.g., ensemble methods, deep learning) to potentially enhance the predictive capabilities of the model. Continuous Monitoring and Iteration: - Emphasize the importance of continuous model monitoring and iteration. Churn patterns may change over time, and the model should be adapted accordingly. Customer Segmentation: - Explore customer segmentation to tailor retention strategies for different customer groups. This can enhance the precision of targeted interventions.
  • 20. Model Deployment ● For less time-sensitive use cases, deploy the model in a batch processing pipeline that periodically evaluates churn risk for all customers. ● Implement an alerting system that notifies relevant stakeholders when a high-risk customer is predicted, allowing for proactive retention strategies. ● Establish a feedback loop where model predictions and outcomes are monitored, and the model is periodically retrained for continuous improvement. ● Develop a monitoring dashboard that provides insights into model performance, prediction accuracy, and customer churn trends. ● Ensure the model's predictions are interpretable, especially if customer-facing staff will use them, to build trust in the model's recommendations.
  • 21. CREDITS: This presentation template was created by Slidesgo, and includes icons by Flaticon, and infographics & images by Freepik Thanks