BigData in Banking
Challenges and Solutions
Arshavsky Andzhey
Director, Big Data dept., SberBank
Avarshavsky.sbt@sberbank.ru
andzhey@mac.com
2015
Innovations as killers: stages in the destruction of the traditional banking system
① Internet & social networks
Control and choice
② Screens and smartphones
Anywhere, anytime
③ Mobile wallet
No more cash and plastic cards
④ Accounts without banks
No bank accounts
⑤ BigData
Cross-system personalization and targeting
*Brett King, Bank 3.0
BIGDATA as the evolution of approaches to using data
Information as a competitive differentiator
Information as an innovation enabler
Information as a strategic asset
Information for business analysis
Data for business
“Day-to-day operations”
“Data warehousing”
“Information in business context”
“Business innovations based on information”
“Adaptive business strategy”
Axes: the value of information for business; maturity of information usage methods
+ INTERNET AND OPEN DATA
Information challenges in large banks (XL)
Data is the most valuable asset in all XL banks
Few know how to apply data to solve even today's challenges
Few know how to leverage internet, external, or open data sources to understand clients better and attract new customers
The key challenge with data analysis
By developing a Big Data infrastructure that addresses data pre-processing and attribution through an intelligent data processing framework, the company will be able to cut labor costs, reducing the data preparation work needed to develop business applications by up to 70%.
Gartner estimates that 70% of the time spent on analytical projects goes to locating, cleaning, and integrating data, mainly because of the following problems:
• Data is hard to locate because it is scattered across disparate business applications and systems
• To be suitable for analysis, data requires re-engineering and reformatting
• Acquiring data for analysis in a specified format places a huge burden on the teams that own the source systems. Often the same data is requested or purchased by several departments and business units, which creates additional work and chaos
• Processes for regular data exchange need to be set up
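The reformatting problem above can be illustrated with a minimal sketch. It assumes two hypothetical source systems (`CRM_ROWS` and `CARD_ROWS`) that export the same client data with different field names, date formats, and number formats; none of these names or formats come from the slides.

```python
from datetime import datetime

# Hypothetical extracts: the same client record in two source formats.
CRM_ROWS = [{"client_id": " 001 ", "opened": "2015-03-01", "balance": "1 200,50"}]
CARD_ROWS = [{"ClientID": "1", "OpenDate": "01.03.2015", "Balance": "1200.50"}]

# Hypothetical mappings from the common field names to each system's names.
CRM_MAPPING = {"client_id": "client_id", "opened": "opened", "balance": "balance"}
CARD_MAPPING = {"client_id": "ClientID", "opened": "OpenDate", "balance": "Balance"}


def parse_amount(text):
    """Accept '1 200,50' or '1200.50' and return a float."""
    return float(text.replace(" ", "").replace(",", "."))


def parse_date(text):
    """Try each known date format in turn."""
    for fmt in ("%Y-%m-%d", "%d.%m.%Y"):
        try:
            return datetime.strptime(text, fmt).date()
        except ValueError:
            continue
    raise ValueError(f"unrecognized date: {text!r}")


def normalize(row, mapping):
    """Convert one source row into the common format."""
    return {
        "client_id": int(row[mapping["client_id"]].strip()),
        "opened": parse_date(row[mapping["opened"]]),
        "balance": parse_amount(row[mapping["balance"]]),
    }
```

Once both extracts pass through `normalize`, the two rows compare equal, so the same record no longer has to be prepared separately for each requesting department.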
Data and analytics tools as a shared resource
Client
Product
Transactions
Location
….
Instruments
RISKS Dept.
RETAIL Dept.
OPERATIONS Dept.
SECURITY Dept.
CORPORATE CLIENTS Dept.
HR
BIGDATA is less about data size and more about the opportunity to work with many different data types, formats, and applications with powerful analytic capabilities.
Sources of business growth and execution excellence
Client
ACQUISITION
RETENTION
SALES
PRIMARY
SECONDARY
LOANS
RISKS
DEBTS
ANTI-FRAUD
INTERNAL
EXTERNAL
HR
PROCESS OPTIMIZATION
Data Factory concept
The Big Data Factory should enable uniform data processing across all platforms, functions, and customers. The goal is to build an easily changeable, easy-to-use operating model for data processing with the required level of trust for both traditional and non-traditional data sources.
Tasks: Information trust
Traditional and non-traditional data sources
• Information delivery
• Information integration (cleaning, transformation, mapping, improvement)
• Information search
• Access to information
• Hypothesis exploration
• Model training and information analysis
• Backup / cleanup / restore
• Administration
• Lifecycle management
• Data quality
• Reference data
• Record linkage and the resolution of contradictions
• Classification
• Reporting
• Internet data
• Data virtualization
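One of the tasks listed above, record linkage and the resolution of contradictions, can be sketched as follows. This is a simplified illustration, not the method used in the presentation: it assumes records are linked by an exact match on a normalized name plus birth date, and that contradictions are resolved by letting the most recently updated non-empty value win. The field names and sample records are hypothetical.

```python
from collections import defaultdict

# Hypothetical client records held by different departments.
RECORDS = [
    {"source": "retail", "name": "Ivan  Petrov", "birth_date": "1980-05-01",
     "phone": "111", "updated": "2014-01-10"},
    {"source": "cards", "name": "ivan petrov", "birth_date": "1980-05-01",
     "phone": "222", "updated": "2015-02-01"},
    {"source": "risk", "name": "Anna Smirnova", "birth_date": "1975-09-12",
     "phone": "", "updated": "2015-01-01"},
]


def link_key(record):
    """Linkage key: lower-cased name with collapsed spaces, plus birth date."""
    name = " ".join(record["name"].lower().split())
    return (name, record["birth_date"])


def link_and_resolve(records):
    """Group records by key and merge each group into one record."""
    groups = defaultdict(list)
    for rec in records:
        groups[link_key(rec)].append(rec)

    merged = []
    for group in groups.values():
        # ISO dates sort correctly as strings; oldest first.
        ordered = sorted(group, key=lambda r: r["updated"])
        golden = {}
        for rec in ordered:
            for field, value in rec.items():
                if value not in (None, ""):
                    golden[field] = value  # later records overwrite earlier ones
        golden["sources"] = sorted({r["source"] for r in group})
        merged.append(golden)
    return merged
```

Calling `link_and_resolve(RECORDS)` yields two merged clients; the two spellings of the same name collapse into one record that keeps the newer phone number. Real linkage usually needs fuzzy matching and stricter survivorship rules than this.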
Big Data Competence Center
BIGDATA PLATFORM HIGH-LEVEL CONCEPT
Data Factory scenarios
Subject-matter experts across the Bank's business areas need access to the organization's data for research, sampling, annotation, and modelling
• Data scientists work on new models
• Marketing is looking for data for new campaigns
• Security services are looking for data to drill into a suspicious transaction
• The retail unit wants to make the best offer to the client
• ……..
Daily activity
• The need for ad hoc access to diverse data
• Support for analysis and decision making
• Using subject-matter experts' terminology when accessing data
The aim is to make access to data as easy as working with spreadsheets, with the ability to scale to huge volumes and a huge variety of information types, while protecting sensitive information and optimizing IT storage systems.
Data-to-profit process
TASK FORMALIZATION
DATA PREPARATION
DATA EXPLORATION
ADDITIONAL INDICATORS
ANALYTICS & MODELING
MODEL VALIDATION
MODEL PRODUCTIZATION
EFFICIENCY MONITORING
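The stages above can be read as a pipeline in which each step consumes the output of the previous one. The sketch below is a toy illustration under stated assumptions: the data, the `debt_ratio` indicator, and the threshold "model" are all hypothetical, and validation reuses the training rows only to keep the example short (a real project would use a hold-out set).

```python
# Hypothetical client rows; one is incomplete on purpose.
ROWS = [
    {"income": 100.0, "debt": 20.0, "defaulted": False},
    {"income": 50.0, "debt": 40.0, "defaulted": True},
    {"income": None, "debt": 10.0, "defaulted": False},
    {"income": 80.0, "debt": 72.0, "defaulted": True},
]


def prepare(rows):
    """Data preparation: drop rows with missing values."""
    return [r for r in rows if r["income"] is not None and r["debt"] is not None]


def add_indicators(rows):
    """Additional indicators: derive a debt-to-income ratio."""
    return [dict(r, debt_ratio=r["debt"] / r["income"]) for r in rows]


def fit_threshold(rows):
    """Modeling: midpoint between the mean ratios of good and bad clients."""
    good = [r["debt_ratio"] for r in rows if not r["defaulted"]]
    bad = [r["debt_ratio"] for r in rows if r["defaulted"]]
    return (sum(good) / len(good) + sum(bad) / len(bad)) / 2


def validate(rows, threshold):
    """Model validation: share of rows where the rule matches the outcome."""
    hits = sum((r["debt_ratio"] > threshold) == r["defaulted"] for r in rows)
    return hits / len(rows)


def run_pipeline(rows):
    """Run the stages in order and report what monitoring would track."""
    prepared = prepare(rows)
    enriched = add_indicators(prepared)
    threshold = fit_threshold(enriched)
    accuracy = validate(enriched, threshold)
    return {"rows_used": len(enriched), "threshold": threshold, "accuracy": accuracy}
```

Keeping each stage as a separate function makes it possible to rerun or replace one step, for example a new indicator, without touching the others.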
Possible architecture
HDFS, raw data
Data exchange
Data preparation, processing, and analytical layer
Analytical views
Ad-hoc analytics
Development factory
Streaming
Big Data applications. Integration.
Marts
API
BI & BIGDATA
Traditional BI:
• Based on DWH
• Precision is crucial
• Flat data scheme
• Long time to market
• High-end hardware
Big Data:
• Based on Hadoop and Spark
• Any precision
• Complex and variable data schemes
• Ad-hoc analytics
• Short time to market
• New data sources
• Low cost
Both approaches are valid
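The contrast between a flat data scheme and complex, variable data schemes can be shown with a small sketch. It assumes hypothetical JSON events whose fields differ from record to record, and flattens them into one table whose columns are discovered at read time; the event fields and the dotted column naming are illustrative choices, not part of the presentation.

```python
import json

# Hypothetical raw events with a variable, partly nested structure.
RAW_EVENTS = [
    '{"client": 1, "channel": "mobile", "geo": {"city": "Moscow"}}',
    '{"client": 2, "channel": "atm", "amount": 500}',
]


def flatten(obj, prefix=""):
    """Turn nested dictionaries into a single level with dotted keys."""
    flat = {}
    for key, value in obj.items():
        name = f"{prefix}{key}"
        if isinstance(value, dict):
            flat.update(flatten(value, prefix=f"{name}."))
        else:
            flat[name] = value
    return flat


def to_flat_table(lines):
    """Parse JSON lines and build a flat table; missing fields become None."""
    rows = [flatten(json.loads(line)) for line in lines]
    columns = sorted({col for row in rows for col in row})
    return columns, [[row.get(col) for col in columns] for row in rows]
```

Here the column set is derived from the data rather than fixed in advance, which is the property that lets new data sources be added without redesigning a warehouse schema first.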
It is not expensive: OPEN SOURCE does work
Low cost
No vendor lock-in
Community support
APPLICATION LAYER
Spark
Hadoop
SQL
NoSQL DB
Thanks and good luck!
