SlideShare a Scribd company logo
Use case : Machine Learning and AI
in banking and fnance
Dr Ahmed Rebai
Assistant Professor
Of Data Science
Esprit School of Engineering
29 December 2018
Dr Lotf Ncib
Assistant Professor
Of applied mathematics
Esprit School of Engineering
1
2
3
4
5
6
Table of
Contents
Introduction
Retrospectives
STB Bank Use Case Presentation
Data science methodology
Project’s steps
Conclusion
2
Selecting DS methodology – Available data
Business understanding- Data's Phases– Modeling-Evaluation-Deployment-Feedback
Ahmed Rebai-Lotf cib
Introduction
3
Business
Banking and Finance solutions
Credit ranking system...
Data
Varity – Volume – Digitalization
Business Intelligence
Dashboarding – Intelligent visualization
Data science
Exploitation-Meaning-Prediction
Ahmed Rebai-Lotf cib
4
public &
semipublic
sector
(52.5%)
foreign actors
(11.3%)
private
sector
(36.2%)
Financing of
Industrial
companies
18% of local
Market part
Investment
banking
Founded in
1958
In Tunis
bank's capital
participation
CEO Monsieur Samir Saied
Various instruments
Ahmed Rebai-Lotf cib
CRISP Methodology
5
Business Understanding
1st
Phase
Data Understanding
2nd
Phase
Data preparation
3rd
Phase
Deployment
Final phase
Evaluation
5th
phase
Modeling
4th
phase
CRISP
Methodolog
y
Ahmed Rebai-Lotf cib
1
2
3
4
The Master Plan
6
The new Data Science Methodology – (IBM vision 2018)
How can you use data to answer the question?
 Analytic Approach
What data do you need to answer? 
 Data requirements
Where is the data coming from and how will you get it?
 Data Collection
Can you get constructive feedback into answering the question? 
 Feedback
Ahmed Rebai-Lotf cib
Wanted Data
7
Structured
MySQL
Oracle
MSSQL
Semi-structured
CSV
JSON
XML
MongoDB
Unstructured
JPEG
PDF
MP3
Ahmed Rebai-Lotf cib
Tools that we will use
8
ETL + Reporting
Pentaho Data Integration
Dbeaver – PHPMyAdmin => MySQL database
Studio3T => MongoDB database
Power BI
Linux
Data Science
Python (numpy, pandas, matplotlib, sklearn,
tensorfow, keras, pytorch, textblob, senpy, nltk,...)
Google Cloud
Microsoft Azur
Amazon WebServices
Ahmed Rebai-Lotf cib
Business Understanding
9
Fraud Detection
Customers Sentiment
analysisAI & ML are used to identify sentiments in textual
data: in social media comments, news articles .
Risk Management
Operational
efficiency: i
ML and Graph theory can detect pattern
towards fraudulent operartions
(see Panama papers case HSBC Bank)
ML can predict risk arising out of banking exposures.
Risk could be either credit risk or fraud risk from
transactions or specifc customers.
A simple use-case is to convert hand-written forms into
machine readable data. This helps in reducing costs
signifi-cantly as most banking processes require lot of
paperwork.
Ahmed Rebai-Lotf cib
Analytic Approach - P1
10
 Semi-structured data contains :
 Clients’ information
 “Agences bancaires” ’ information
 DABs’ information
 Transactions’ information
 Find relation between clients and DAB in Transactions data.
 Week relationship between “Agences bancaires” and Transactions.
How can you use data to answer the question?
Develop a datawarehouse with this available data and try to centralize
the information in order to have a clear idea in Modeling phase
Ahmed Rebai-Lotf cib
Data Understanding
11
Reporting
Ahmed Rebai-Lotf cib
1
2
3
4
5
Modeling – P2
12
Type of model : Supervised method
Algorithm: ARMA, ARIMA , SARIMA , SARIMAX,
Implementation : Python
Robustness & Evaluation = Stochasticity evaluation , Rsquared and
Accuracy AIC
Detection of Trend , Seasonality + residuals evolutions
Users 'number
forecasting
Ahmed Rebai-Lotf cib
1
2
3
4
5
Modeling – P3
13
Type of model : Unsupervised method
Algorithm: CAH , KMEANS, Dbscan
Implementation : Python: sklean, Tensorfow
Robustness & Evaluation = silhouette score
Providing the clusters of users and then using them for group
charact-erization
Users’ profling
Ahmed Rebai-Lotf cib
1
2
3
4
5
Modeling – P4
14
Type of model : Supervised method
Algorithm: LDA , Logistic regression
Implementation : Python
Optimization & selecting model = GREEDY Wilks
Setting a score for each Reward / Loyalty based on the number of
transactions
Reward/Loyalty Scoring
Ahmed Rebai-Lotf cib
1
2
3
4
5
Modeling – P5
15
Type of model : Supervised method
Algorithm : NLP , Stemming , lemmatization
Implementation : Python
Robustness & Evaluation = MDT , IDF
Detect word weights that attract users
Knowledge text discovery
Ahmed Rebai-Lotf cib
1
2
3
4
5
Modeling – P6
16
Type of model : Supervised method
Algorithm: collaborative fltering , Turicreate , CF
Implementation : Python
Robustness & Evaluations = RMSE , NDCG , Mean Reciprocal Rank
Recommend a fnancial product (specifc category) in a specifc period
, in a specifc region Recommend a user for a loyalty ofer.
Recommender system
Ahmed Rebai-Lotf cib
1
2
3
4
5
Modeling – P7
17
Type of model : Supervised method
Algorithm : Decision Tree , Random Forest
Implementation : Python
Robustness & Evaluation = Roc Curve , Accuracy
Detect the conditions to take a ofer or not
 Need external tracking data of users in the web application:
Page views , clicks…
Boosting with Random
Forest
Ahmed Rebai-Lotf cib
1
2
3
4
5
Modeling – P8
18
Type of model : Unsupervised method, Graph theory, discrete
mathematics.
Algorithm : Clustering, Community detection, Outliers detection
Implementation : Python
Robustness & Evaluation = Roc Curve , Accuracy
Detect suspicious operations
Fraud Detection
Ahmed Rebai-Lotf cib
Deployment
19
With Juputer-lab or External Web site
Ahmed Rebai-Lotf cib
Feedback
20
Can you get constructive feedback into answering the question? 
Ahmed Rebai-Lotf cib
“In God we trust, all others must bring
data.”
W. Edwards
Derming
Ahmed Rebai-Lotf cib
Thank you for your
attention
29 December 2018
Ahmed Rebai-Lotf cib

More Related Content

PDF
Market Abuse Detection
PDF
Market Surveillance
PDF
Bayesian Portfolio Allocation
PDF
Ml conference slides boston june 2019
PDF
Qwafafew meeting 4
PDF
Qwafafew meeting - July 2020 - Jarrod Wilcox
PDF
QCon conference 2019
PPTX
A Sneak Peek into Artificial Intelligence Based HFT Trading Strategies
Market Abuse Detection
Market Surveillance
Bayesian Portfolio Allocation
Ml conference slides boston june 2019
Qwafafew meeting 4
Qwafafew meeting - July 2020 - Jarrod Wilcox
QCon conference 2019
A Sneak Peek into Artificial Intelligence Based HFT Trading Strategies

Similar to Use case stb (20)

PDF
Machine Learning for Finance Master Class
PPTX
Next Generation Fraud Solutions using Neo4j
PDF
Neo4j GraphTalk Copenhagen - Next Generation Solutions using Neo4j
PPTX
Business Intelligence
PPTX
IEGROUP Sundip Gorai v3
DOCX
Resume_Partha_Data Consultant_23_July_2016
PDF
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...
PDF
Financial Markets Data & Analytics Led Transformation
PDF
Robotic Process Automation (RPA) Webinar - By Matrix-IFS
PDF
Ml master class cfa poland
PPTX
20150118 s snet analytics vca
PDF
ML and AI in Finance: Master Class
PDF
Qu for India - QuantUniversity FundRaiser
PPT
Tier2 Technical Contracting
PDF
apidays LIVE Hong Kong - Fast Track the Open Banking Ecosystem with Platform ...
PPT
Detecting Opportunities and Threats with Complex Event Processing: Case St...
PDF
PPTX
Insurance digital transformation - key challenges
PPTX
Project PPT sem 2.pptx
PPTX
Lean dqm reporting
Machine Learning for Finance Master Class
Next Generation Fraud Solutions using Neo4j
Neo4j GraphTalk Copenhagen - Next Generation Solutions using Neo4j
Business Intelligence
IEGROUP Sundip Gorai v3
Resume_Partha_Data Consultant_23_July_2016
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...
Financial Markets Data & Analytics Led Transformation
Robotic Process Automation (RPA) Webinar - By Matrix-IFS
Ml master class cfa poland
20150118 s snet analytics vca
ML and AI in Finance: Master Class
Qu for India - QuantUniversity FundRaiser
Tier2 Technical Contracting
apidays LIVE Hong Kong - Fast Track the Open Banking Ecosystem with Platform ...
Detecting Opportunities and Threats with Complex Event Processing: Case St...
Insurance digital transformation - key challenges
Project PPT sem 2.pptx
Lean dqm reporting
Ad

More from Ncib Lotfi (10)

PDF
Auto eda
PDF
Introduction to Data Science
PDF
Introduction: Intelligence Artificielle, Machine Learning et Deep Learning
PDF
Resume
PDF
Rapport stage
PDF
Cheat sheets for AI
PDF
ARTIFICIAL INTELLIGENCE & MACHINE LEARNING CAREER GUIDE
PDF
Optimisation
PDF
Regression
PDF
Decision trees
Auto eda
Introduction to Data Science
Introduction: Intelligence Artificielle, Machine Learning et Deep Learning
Resume
Rapport stage
Cheat sheets for AI
ARTIFICIAL INTELLIGENCE & MACHINE LEARNING CAREER GUIDE
Optimisation
Regression
Decision trees
Ad

Recently uploaded (20)

PDF
7a Lifetime Expected Income Breakeven Comparison between SPIAs and Managed Po...
PDF
Statistics for Management and Economics Keller 10th Edition by Gerald Keller ...
PPT
features and equilibrium under MONOPOLY 17.11.20.ppt
PDF
Principal of magaement is good fundamentals in economics
PDF
GVCParticipation_Automation_Climate_India
PDF
THE EFFECT OF FOREIGN AID ON ECONOMIC GROWTH IN ETHIOPIA
PDF
4a Probability-of-Failure-Based Decision Rules to Manage Sequence Risk in Ret...
PPT
Fundamentals of Financial Management Chapter 3
PPT
KPMG FA Benefits Report_FINAL_Jan 27_2010.ppt
PDF
2a A Dynamic and Adaptive Approach to Distribution Planning and Monitoring JF...
PDF
The Right Social Media Strategy Can Transform Your Business
PDF
CLIMATE CHANGE AS A THREAT MULTIPLIER: ASSESSING ITS IMPACT ON RESOURCE SCARC...
PDF
2012_The dark side of valuation a jedi guide to valuing difficult to value co...
DOCX
BUSINESS PERFORMANCE SITUATION AND PERFORMANCE EVALUATION OF FELIX HOTEL IN H...
PPTX
Q1 PE AND HEALTH 5 WEEK 5 DAY 1 powerpoint template
PDF
DTC TRADIND CLUB MAKE YOUR TRADING BETTER
PPTX
lesson in englishhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh
PDF
The Role of Islamic Faith, Ethics, Culture, and values in promoting fairness ...
PPTX
INDIAN FINANCIAL SYSTEM (Financial institutions, Financial Markets & Services)
PDF
1a In Search of the Numbers ssrn 1488130 Oct 2009.pdf
7a Lifetime Expected Income Breakeven Comparison between SPIAs and Managed Po...
Statistics for Management and Economics Keller 10th Edition by Gerald Keller ...
features and equilibrium under MONOPOLY 17.11.20.ppt
Principal of magaement is good fundamentals in economics
GVCParticipation_Automation_Climate_India
THE EFFECT OF FOREIGN AID ON ECONOMIC GROWTH IN ETHIOPIA
4a Probability-of-Failure-Based Decision Rules to Manage Sequence Risk in Ret...
Fundamentals of Financial Management Chapter 3
KPMG FA Benefits Report_FINAL_Jan 27_2010.ppt
2a A Dynamic and Adaptive Approach to Distribution Planning and Monitoring JF...
The Right Social Media Strategy Can Transform Your Business
CLIMATE CHANGE AS A THREAT MULTIPLIER: ASSESSING ITS IMPACT ON RESOURCE SCARC...
2012_The dark side of valuation a jedi guide to valuing difficult to value co...
BUSINESS PERFORMANCE SITUATION AND PERFORMANCE EVALUATION OF FELIX HOTEL IN H...
Q1 PE AND HEALTH 5 WEEK 5 DAY 1 powerpoint template
DTC TRADIND CLUB MAKE YOUR TRADING BETTER
lesson in englishhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh
The Role of Islamic Faith, Ethics, Culture, and values in promoting fairness ...
INDIAN FINANCIAL SYSTEM (Financial institutions, Financial Markets & Services)
1a In Search of the Numbers ssrn 1488130 Oct 2009.pdf

Use case stb

  • 1. Use case : Machine Learning and AI in banking and fnance Dr Ahmed Rebai Assistant Professor Of Data Science Esprit School of Engineering 29 December 2018 Dr Lotf Ncib Assistant Professor Of applied mathematics Esprit School of Engineering
  • 2. 1 2 3 4 5 6 Table of Contents Introduction Retrospectives STB Bank Use Case Presentation Data science methodology Project’s steps Conclusion 2 Selecting DS methodology – Available data Business understanding- Data's Phases– Modeling-Evaluation-Deployment-Feedback Ahmed Rebai-Lotf cib
  • 3. Introduction 3 Business Banking and Finance solutions Credit ranking system... Data Varity – Volume – Digitalization Business Intelligence Dashboarding – Intelligent visualization Data science Exploitation-Meaning-Prediction Ahmed Rebai-Lotf cib
  • 4. 4 public & semipublic sector (52.5%) foreign actors (11.3%) private sector (36.2%) Financing of Industrial companies 18% of local Market part Investment banking Founded in 1958 In Tunis bank's capital participation CEO Monsieur Samir Saied Various instruments Ahmed Rebai-Lotf cib
  • 5. CRISP Methodology 5 Business Understanding 1st Phase Data Understanding 2nd Phase Data preparation 3rd Phase Deployment Final phase Evaluation 5th phase Modeling 4th phase CRISP Methodolog y Ahmed Rebai-Lotf cib
  • 6. 1 2 3 4 The Master Plan 6 The new Data Science Methodology – (IBM vision 2018) How can you use data to answer the question?  Analytic Approach What data do you need to answer?   Data requirements Where is the data coming from and how will you get it?  Data Collection Can you get constructive feedback into answering the question?   Feedback Ahmed Rebai-Lotf cib
  • 8. Tools that we will use 8 ETL + Reporting Pentaho Data Integration Dbeaver – PHPMyAdmin => MySQL database Studio3T => MongoDB database Power BI Linux Data Science Python (numpy, pandas, matplotlib, sklearn, tensorfow, keras, pytorch, textblob, senpy, nltk,...) Google Cloud Microsoft Azur Amazon WebServices Ahmed Rebai-Lotf cib
  • 9. Business Understanding 9 Fraud Detection Customers Sentiment analysisAI & ML are used to identify sentiments in textual data: in social media comments, news articles . Risk Management Operational efficiency: i ML and Graph theory can detect pattern towards fraudulent operartions (see Panama papers case HSBC Bank) ML can predict risk arising out of banking exposures. Risk could be either credit risk or fraud risk from transactions or specifc customers. A simple use-case is to convert hand-written forms into machine readable data. This helps in reducing costs signifi-cantly as most banking processes require lot of paperwork. Ahmed Rebai-Lotf cib
  • 10. Analytic Approach - P1 10  Semi-structured data contains :  Clients’ information  “Agences bancaires” ’ information  DABs’ information  Transactions’ information  Find relation between clients and DAB in Transactions data.  Week relationship between “Agences bancaires” and Transactions. How can you use data to answer the question? Develop a datawarehouse with this available data and try to centralize the information in order to have a clear idea in Modeling phase Ahmed Rebai-Lotf cib
  • 12. 1 2 3 4 5 Modeling – P2 12 Type of model : Supervised method Algorithm: ARMA, ARIMA , SARIMA , SARIMAX, Implementation : Python Robustness & Evaluation = Stochasticity evaluation , Rsquared and Accuracy AIC Detection of Trend , Seasonality + residuals evolutions Users 'number forecasting Ahmed Rebai-Lotf cib
  • 13. 1 2 3 4 5 Modeling – P3 13 Type of model : Unsupervised method Algorithm: CAH , KMEANS, Dbscan Implementation : Python: sklean, Tensorfow Robustness & Evaluation = silhouette score Providing the clusters of users and then using them for group charact-erization Users’ profling Ahmed Rebai-Lotf cib
  • 14. 1 2 3 4 5 Modeling – P4 14 Type of model : Supervised method Algorithm: LDA , Logistic regression Implementation : Python Optimization & selecting model = GREEDY Wilks Setting a score for each Reward / Loyalty based on the number of transactions Reward/Loyalty Scoring Ahmed Rebai-Lotf cib
  • 15. 1 2 3 4 5 Modeling – P5 15 Type of model : Supervised method Algorithm : NLP , Stemming , lemmatization Implementation : Python Robustness & Evaluation = MDT , IDF Detect word weights that attract users Knowledge text discovery Ahmed Rebai-Lotf cib
  • 16. 1 2 3 4 5 Modeling – P6 16 Type of model : Supervised method Algorithm: collaborative fltering , Turicreate , CF Implementation : Python Robustness & Evaluations = RMSE , NDCG , Mean Reciprocal Rank Recommend a fnancial product (specifc category) in a specifc period , in a specifc region Recommend a user for a loyalty ofer. Recommender system Ahmed Rebai-Lotf cib
  • 17. 1 2 3 4 5 Modeling – P7 17 Type of model : Supervised method Algorithm : Decision Tree , Random Forest Implementation : Python Robustness & Evaluation = Roc Curve , Accuracy Detect the conditions to take a ofer or not  Need external tracking data of users in the web application: Page views , clicks… Boosting with Random Forest Ahmed Rebai-Lotf cib
  • 18. 1 2 3 4 5 Modeling – P8 18 Type of model : Unsupervised method, Graph theory, discrete mathematics. Algorithm : Clustering, Community detection, Outliers detection Implementation : Python Robustness & Evaluation = Roc Curve , Accuracy Detect suspicious operations Fraud Detection Ahmed Rebai-Lotf cib
  • 19. Deployment 19 With Juputer-lab or External Web site Ahmed Rebai-Lotf cib
  • 20. Feedback 20 Can you get constructive feedback into answering the question?  Ahmed Rebai-Lotf cib
  • 21. “In God we trust, all others must bring data.” W. Edwards Derming Ahmed Rebai-Lotf cib
  • 22. Thank you for your attention 29 December 2018 Ahmed Rebai-Lotf cib