SlideShare a Scribd company logo
Construction of a robust
prediction model to forecast the
likelihood of a credit card holder
to experience payment defaults in
upcoming months.
Xi global resources
Group of company
March 28, 2024
contents
Introduction
01
Resources
02
Methodology
03
Result and discussion
04
Conclusion
06
Recommendations
05
Overview
 Card payments are essential in digital commerce.
 Card payments integral in digital commerce.
 Show consumer preference and retailer confidence.
 eWallets offer convenient alternative, affirm status.
 Installment options rise, nearly half retailers accept.
 Bank transfers, direct debits less common.
 Longer processing times, lower consumer demand.
Overview cont.'s
 a payment card is one of the best options for
obtaining cash and every year the traditional
cash in the wallet is being displaced more
and more by “plastic money”.
 Number of issued payment cards in 2009–
Q2 2021 (in units) with adjusted lineartrend
and forecast (including 90% confidence
interval)
(2) (PDF) Development of the Payment Cards Market in Poland in the Era of the Covid-19 Pandemic. Available from:
https://www.researchgate.net/publication/361429793_Development_of_the_Payment_Cards_Market_in_Poland_in_the_Era_of_the_Covid-19_Pandemic [accessed Mar 30 2024].
Introduction
Research Motivation and Significant of study
 Build a logistic regression model for classification of customer with a default
payment next month from those without it
 Accurate prediction vital for informed decision-making.
 Payment process affects company finances
significantly.
 Impact on customer relationships and credit
risk.
Problem Statement
Resources
Question addressed
 The only resource provided is the dataset. This dataset contains information on default payments,
demographic factors, credit data, history of payment, and bill statements of credit card clients in
Taiwan from April 2005 to September 2005. The dataset contains 25 variables such as:
 Given historical data and demographic information, can a predictive model effectively estimate
default payment with a high degree of accuracy?
Dataset Information
Resources
Content
There are 25 variables:
1. ID: ID of each client
2. LIMIT_BAL: Amount of given credit in NT dollars (includes individual and family/supplementary credit
3. SEX: Gender (1=male, 2=female)
4. EDUCATION: (1=graduate school, 2=university, 3=high school, 4=others, 5=unknown, 6=unknown)
5. MARRIAGE: Marital status (1=married, 2=single, 3=others)
6. AGE: Age in years
7. PAY_0 to PAY_6 (6 features) Repayment status from September to April, 2005: (-1=pay duly, 1=payment
delay for one month, 2=payment delay for two months, … 8=payment delay for eight months, 9=payment
delay for nine months and above)
8. BILL_AMT1 to BILL_AMT6: Amount of bill statement from April to September, 2005 (NT dollar)
9. PAY_AMT1 to PAY_AMT6: Amount of previous payment from April to September, 2005 (NT dollar)
10. default.payment.next.month: Default payment (1=yes, 0=no)
Dataset Information
Methods
1. Process of assigning labels: Labeling data
2. Preliminary analysis
Exploratory analysis:
Checking the data structure
Detecting missing values
Detecting outliers: boxplot
3. Visualization: A bar chart was used for qualitative variables, while boxplot and density plot was used for the
quantitative or continuous variables.
Methods
 Relationship: Correlation matrix to investigates variable relationships.
 Data partitioning: Dataset split into 75:25 training and test to prevents over fitting.
 Model trained: the model was trained with all features
 Feature selection: Backward selection process was applied to remove insignification feature with a stop
condition set at alpha level of 0.05
 Model evaluation: The model accuracy level was estimated.
Structure of the dataset
Figure 1: Variable contained in the dataset displaying the total number of observation by the
variable type(integer or numeric)
Structure of the dataset
Figure 2: Variable contained in the dataset displaying the percentage of values present(or if
there is any missing values) by total number of observation.
Preliminary Analysis: Exploratory data analysis
Figure 3: Distribution of default payment next month
Preliminary Analysis: Exploratory data analysis
Figure 3: Distribution of gender by default payment next month
Preliminary Analysis: Exploratory data analysis
Figure 4: Distribution of marriage by default payment next month
Preliminary Analysis: Exploratory data analysis
Figure 5: Distribution of education by default payment next month
Preliminary Analysis: Exploratory data analysis
Figure 6: Distribution of repayment status in September, 2005 by
default payment next month
Preliminary Analysis: Exploratory data analysis
Figure 7: Distribution of repayment status in August, 2005 by default payment
next month
Preliminary Analysis: Exploratory data analysis
Figure 8: Distribution of repayment status in July, 2005 by default
payment next month
Preliminary Analysis: Exploratory data analysis
Figure 9: Distribution of repayment status in June, 2005 by default
payment next month
Preliminary Analysis: Exploratory data analysis
Figure 10: Distribution of repayment status in May, 2005 by default
payment next month
Preliminary Analysis: Exploratory data analysis
Figure 11: Distribution of repayment status in April, 2005 by default
payment next month
Preliminary Analysis: Exploratory data analysis
Figure 12: Distribution of age by default payment next month
Preliminary Analysis: Exploratory data analysis
Figure 12: Distribution of amount of given credit bill
by default payment next month
Model evaluation
Figure 31:Confusion matrix showing the counts of true positive (TP), true
negative (TN), false positive (FP), and false negative (FN) predictions
made by the model on a dataset
Accura
cy
AUC
Trained
Model
80.84 % 0.73
Retrained
Model
80.91 % 0.72
Recommendation
 Significant Features: The bill amounts (e.g., BILL_AMT1, BILL_AMT3) and previous payment amounts (e.g.,
PAY_AMT1, PAY_AMT2) play a significant role in predicting default offering.
 Payment Status important: It is crucial for the business to closely monitor customers' payment behavior,
especially when there are signs of payment delays or defaults.
 Customer Segmentation: Utilize the insights from the model to segment customers based on their risk profiles.
 Customer Assistance Program: Implement customer assistance programs or financial counseling services to
support customers experiencing financial difficulties.
Conclusion
 In conclusion, the model provide valuable insights into the
factors influencing default payment next month in the dataset.
 The analysis highlights the significance of payment status, age,
bill amounts, and previous payments in predicting default.
 By closely monitoring these factors and adapting strategies
accordingly, businesses can better manage default risks and
improve their financial stability.
THANK YOU!

More Related Content

PPTX
Construction of a robust prediction model to forecast the likelihood of a cre...
PDF
Default of Credit Card Payments
PPTX
Default payment prediction system
PPTX
OPIM 5604 predictive modeling presentation group7
PDF
Project 01 - Data Exploration and Reporting
Construction of a robust prediction model to forecast the likelihood of a cre...
Default of Credit Card Payments
Default payment prediction system
OPIM 5604 predictive modeling presentation group7
Project 01 - Data Exploration and Reporting

Similar to prediction of default payment next month using a logistic approach (20)

PPTX
Pds assignment 2 presentation
PDF
Machine Learning Project - Default credit card clients
PDF
Taiwanese Credit Card Client Fraud detection
PPTX
Credit defaulter analysis
PPTX
Loan default prediction with machine language
PPTX
Credit Risk Evaluation Model
PPTX
Analyzing loan data project-data analysis.pptx
PDF
Phase 1 of Predicting Payment default on Vehicle Loan EMI
PDF
Loan Analysis Predicting Defaulters
PPTX
Introduction to predictive modeling v1
PDF
Predicting Credit Card Defaults using Machine Learning Algorithms
PPTX
LOAN PREDICTION BASED ON CUSTOMER BEHAVIOR.pptx
PPTX
Credit_Card_Defaulter_Prediction_Presentation.pptx
PPTX
Exploratory Data Analysis For Credit Risk Assesment
PPTX
Mining Credit Card Defults
PPTX
ai it hw mst prac[1] - Read-Offnly.pptx
PPTX
PDF
Phase 2 of Predicting Payment default on Vehicle Loan EMI
PPTX
Credit card client default visualization
PPTX
Credit EDA Assignment (Tanvi Pradhan)
Pds assignment 2 presentation
Machine Learning Project - Default credit card clients
Taiwanese Credit Card Client Fraud detection
Credit defaulter analysis
Loan default prediction with machine language
Credit Risk Evaluation Model
Analyzing loan data project-data analysis.pptx
Phase 1 of Predicting Payment default on Vehicle Loan EMI
Loan Analysis Predicting Defaulters
Introduction to predictive modeling v1
Predicting Credit Card Defaults using Machine Learning Algorithms
LOAN PREDICTION BASED ON CUSTOMER BEHAVIOR.pptx
Credit_Card_Defaulter_Prediction_Presentation.pptx
Exploratory Data Analysis For Credit Risk Assesment
Mining Credit Card Defults
ai it hw mst prac[1] - Read-Offnly.pptx
Phase 2 of Predicting Payment default on Vehicle Loan EMI
Credit card client default visualization
Credit EDA Assignment (Tanvi Pradhan)
Ad

Recently uploaded (20)

PPTX
Topic 5 Presentation 5 Lesson 5 Corporate Fin
PDF
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
PPTX
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
PPTX
Introduction to Inferential Statistics.pptx
PPTX
New ISO 27001_2022 standard and the changes
PDF
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPTX
CYBER SECURITY the Next Warefare Tactics
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPT
ISS -ESG Data flows What is ESG and HowHow
PDF
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
PPTX
Leprosy and NLEP programme community medicine
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
PPTX
Pilar Kemerdekaan dan Identi Bangsa.pptx
PPTX
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
PDF
Introduction to Data Science and Data Analysis
PPTX
SAP 2 completion done . PRESENTATION.pptx
PDF
[EN] Industrial Machine Downtime Prediction
Topic 5 Presentation 5 Lesson 5 Corporate Fin
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
Introduction to Inferential Statistics.pptx
New ISO 27001_2022 standard and the changes
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
Qualitative Qantitative and Mixed Methods.pptx
CYBER SECURITY the Next Warefare Tactics
Acceptance and paychological effects of mandatory extra coach I classes.pptx
ISS -ESG Data flows What is ESG and HowHow
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
Leprosy and NLEP programme community medicine
STERILIZATION AND DISINFECTION-1.ppthhhbx
Optimise Shopper Experiences with a Strong Data Estate.pdf
Pilar Kemerdekaan dan Identi Bangsa.pptx
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
Introduction to Data Science and Data Analysis
SAP 2 completion done . PRESENTATION.pptx
[EN] Industrial Machine Downtime Prediction
Ad

prediction of default payment next month using a logistic approach

  • 1. Construction of a robust prediction model to forecast the likelihood of a credit card holder to experience payment defaults in upcoming months. Xi global resources Group of company March 28, 2024
  • 3. Overview  Card payments are essential in digital commerce.  Card payments integral in digital commerce.  Show consumer preference and retailer confidence.  eWallets offer convenient alternative, affirm status.  Installment options rise, nearly half retailers accept.  Bank transfers, direct debits less common.  Longer processing times, lower consumer demand.
  • 4. Overview cont.'s  a payment card is one of the best options for obtaining cash and every year the traditional cash in the wallet is being displaced more and more by “plastic money”.  Number of issued payment cards in 2009– Q2 2021 (in units) with adjusted lineartrend and forecast (including 90% confidence interval) (2) (PDF) Development of the Payment Cards Market in Poland in the Era of the Covid-19 Pandemic. Available from: https://www.researchgate.net/publication/361429793_Development_of_the_Payment_Cards_Market_in_Poland_in_the_Era_of_the_Covid-19_Pandemic [accessed Mar 30 2024].
  • 5. Introduction Research Motivation and Significant of study  Build a logistic regression model for classification of customer with a default payment next month from those without it  Accurate prediction vital for informed decision-making.  Payment process affects company finances significantly.  Impact on customer relationships and credit risk. Problem Statement
  • 6. Resources Question addressed  The only resource provided is the dataset. This dataset contains information on default payments, demographic factors, credit data, history of payment, and bill statements of credit card clients in Taiwan from April 2005 to September 2005. The dataset contains 25 variables such as:  Given historical data and demographic information, can a predictive model effectively estimate default payment with a high degree of accuracy? Dataset Information
  • 7. Resources Content There are 25 variables: 1. ID: ID of each client 2. LIMIT_BAL: Amount of given credit in NT dollars (includes individual and family/supplementary credit 3. SEX: Gender (1=male, 2=female) 4. EDUCATION: (1=graduate school, 2=university, 3=high school, 4=others, 5=unknown, 6=unknown) 5. MARRIAGE: Marital status (1=married, 2=single, 3=others) 6. AGE: Age in years 7. PAY_0 to PAY_6 (6 features) Repayment status from September to April, 2005: (-1=pay duly, 1=payment delay for one month, 2=payment delay for two months, … 8=payment delay for eight months, 9=payment delay for nine months and above) 8. BILL_AMT1 to BILL_AMT6: Amount of bill statement from April to September, 2005 (NT dollar) 9. PAY_AMT1 to PAY_AMT6: Amount of previous payment from April to September, 2005 (NT dollar) 10. default.payment.next.month: Default payment (1=yes, 0=no) Dataset Information
  • 8. Methods 1. Process of assigning labels: Labeling data 2. Preliminary analysis Exploratory analysis: Checking the data structure Detecting missing values Detecting outliers: boxplot 3. Visualization: A bar chart was used for qualitative variables, while boxplot and density plot was used for the quantitative or continuous variables.
  • 9. Methods  Relationship: Correlation matrix to investigates variable relationships.  Data partitioning: Dataset split into 75:25 training and test to prevents over fitting.  Model trained: the model was trained with all features  Feature selection: Backward selection process was applied to remove insignification feature with a stop condition set at alpha level of 0.05  Model evaluation: The model accuracy level was estimated.
  • 10. Structure of the dataset Figure 1: Variable contained in the dataset displaying the total number of observation by the variable type(integer or numeric)
  • 11. Structure of the dataset Figure 2: Variable contained in the dataset displaying the percentage of values present(or if there is any missing values) by total number of observation.
  • 12. Preliminary Analysis: Exploratory data analysis Figure 3: Distribution of default payment next month
  • 13. Preliminary Analysis: Exploratory data analysis Figure 3: Distribution of gender by default payment next month
  • 14. Preliminary Analysis: Exploratory data analysis Figure 4: Distribution of marriage by default payment next month
  • 15. Preliminary Analysis: Exploratory data analysis Figure 5: Distribution of education by default payment next month
  • 16. Preliminary Analysis: Exploratory data analysis Figure 6: Distribution of repayment status in September, 2005 by default payment next month
  • 17. Preliminary Analysis: Exploratory data analysis Figure 7: Distribution of repayment status in August, 2005 by default payment next month
  • 18. Preliminary Analysis: Exploratory data analysis Figure 8: Distribution of repayment status in July, 2005 by default payment next month
  • 19. Preliminary Analysis: Exploratory data analysis Figure 9: Distribution of repayment status in June, 2005 by default payment next month
  • 20. Preliminary Analysis: Exploratory data analysis Figure 10: Distribution of repayment status in May, 2005 by default payment next month
  • 21. Preliminary Analysis: Exploratory data analysis Figure 11: Distribution of repayment status in April, 2005 by default payment next month
  • 22. Preliminary Analysis: Exploratory data analysis Figure 12: Distribution of age by default payment next month
  • 23. Preliminary Analysis: Exploratory data analysis Figure 12: Distribution of amount of given credit bill by default payment next month
  • 24. Model evaluation Figure 31:Confusion matrix showing the counts of true positive (TP), true negative (TN), false positive (FP), and false negative (FN) predictions made by the model on a dataset Accura cy AUC Trained Model 80.84 % 0.73 Retrained Model 80.91 % 0.72
  • 25. Recommendation  Significant Features: The bill amounts (e.g., BILL_AMT1, BILL_AMT3) and previous payment amounts (e.g., PAY_AMT1, PAY_AMT2) play a significant role in predicting default offering.  Payment Status important: It is crucial for the business to closely monitor customers' payment behavior, especially when there are signs of payment delays or defaults.  Customer Segmentation: Utilize the insights from the model to segment customers based on their risk profiles.  Customer Assistance Program: Implement customer assistance programs or financial counseling services to support customers experiencing financial difficulties.
  • 26. Conclusion  In conclusion, the model provide valuable insights into the factors influencing default payment next month in the dataset.  The analysis highlights the significance of payment status, age, bill amounts, and previous payments in predicting default.  By closely monitoring these factors and adapting strategies accordingly, businesses can better manage default risks and improve their financial stability.