SlideShare a Scribd company logo
A journey to Big Data
Main Challenges, Solutions and Benefits
Corporate & Investment Banking October 2018
Who We Are?
2
Santander Highlights
Total assets (EUR trillion)
Gross cutormer loans (EUR billion excluding reverse repos)
Customer deposits + mutual funds (EUR billion excluding repos)
Branches
2017 Attributable profit (EUR million)
H1’18 Attributable profit (EUR million)
Market capitalisation (EUR billion; 26-06-18)
People (headcount)
Customers (millions)
Shareholders (millions)
Communities (million people helped in 2017)
1.43
863
895
13,482
6,619
3,752
74
200,961
140
4.2
2.1
Corporate & Investment Banking Business
3
Few contracts – Big profits
Big Clients
Details makes the difference
Strong Competence
Adapt to client systems
Ad hoc integration
Integration layer becomes huge
Lot of middleware
Big Client
Big Client
Big Client
Big Client
Big Client
Big Client
Big Client
Big Client
Big Client
Big Client
Legacy Systems Challenges
4
Regulation
Strong Regulation
We cannot simply integrate systems in a row.
We need to consolidate data for regulators
Systems
Lot of Systems
There are more than 1000 different
applications installed on Santander
Corporate & Investment Banking
Auditors
Auditors are Welcome
We need our systems ready to be audited
at any time.
Presence in Many Countries
Same pattern is repeated on each country
Countries
Big Data. Data Organization
5
Raw Data
Data is stored ‘as is’ without any
modification.
It is required all data to have its
own metadata
Landing
Data Ontologies
Data is grouped according to
functional criteria (ontologies).
Data is consolidated to
eliminate duplicates
Common
Business Views
Is where applications access
and process the data.
It is not a copy but a view of the
common layer
Business
Applications
Finland Architecture. A Thousand Lakes
6
API S3
Landing
(RAW)
Common
(Harmonized) Business (Consolidated & Views)
CDO
BATCH CLUSTER A
worker worker worker worker
Tools
BATCH CLUSTER B
worker worker worker worker
Tools
OTHER
BATCH CLUSTER C
worker worker worker worker
Tools
Finland Architecture. On Demand Evolution
7
API S3
Landing
(RAW)
Common
(Harmonized) Business (Consolidated & Views)
CDO
ON DEMAND INSTANTIATION
worker worker worker
Datalake
Distribution
Tools worker worker worker
Datalake
Distribution
Tools
PROCESING AND SAVING DATA
worker worker worker
Datalake
Distribution
Tools
FREE RESOURCES
Online Cluster Architecture
8
New
Applications
Legacy
Applications
Online Cache
Business
Rules
Visualization
Spring Data Flow
Manual Orchestation Automatic Choreography
REST API
Online Cluster. Design Pattern
9
Source
Stream
Online Cache
Business
Rules
AVRO
AVRO
IDX
AVRO
IDX
AVRO
IDX
AVRO
AVRO
Schema
Registry
AVRO
AVRO
IDX
Complete Finland Architecture
10
API S3
Landing
(RAW)
Common
(Harmonized) Business (Consolidated & Views)
CDO
ON PREMISE BATCH CLUSTER
worker worker worker
Datalake
Distribution
Tools
ON DEMAND BATCH CLUSTER
Online Cache
Business
Rules
Visualization
Spring Data Flow
Manual Orchestation Automatic Choreography
REST API
ONLINE CLUSTER
worker worker worker worker
Tools
Datalake
Distribution
Our purpose is to help people
and business prosper.
Our culture is based on believing
that everything we do should be:
Thank You.

More Related Content

PDF
Climbing the AI Ladder
PPTX
Munich Re: Driving a Big Data Transformation
PPTX
Benefits of Transferring Real-Time Data to Hadoop at Scale
PDF
Postgres Vision 2018: Data as the New Oil
 
PPTX
ING's Customer-Centric Data Journey from Community Idea to Private Cloud Depl...
PPTX
Data Science at Speed. At Scale.
PPTX
Oil and gas big data edition
PDF
Postgres Vision 2018: How to Consume your Database Platform On-premises
 
Climbing the AI Ladder
Munich Re: Driving a Big Data Transformation
Benefits of Transferring Real-Time Data to Hadoop at Scale
Postgres Vision 2018: Data as the New Oil
 
ING's Customer-Centric Data Journey from Community Idea to Private Cloud Depl...
Data Science at Speed. At Scale.
Oil and gas big data edition
Postgres Vision 2018: How to Consume your Database Platform On-premises
 

What's hot (20)

PPTX
Fighting Financial Crime with Artificial Intelligence
PDF
Postgres Vision 2018: Making Modern an Old Legacy System
 
PDF
Postgres Vision 2018: The Pragmatic Cloud
 
PPTX
Artificial Intelligence and Analytic Ops to Continuously Improve Business Out...
PDF
IBM+Hortonworks = Transformation of the Big Data Landscape
PDF
Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...
PPTX
Klarna Tech Talk - Mind the Data!
PPTX
Understanding Your Crown Jewels: Finding, Organizing, and Profiling Sensitive...
PDF
Open Source Data Management for Industry 4.0
PPTX
Securing and governing a multi-tenant data lake within the financial industry
PDF
Flash session -goldengate--lht1053-lon
PPTX
Data Offload for the Chief Data Officer – how to move data onto Hadoop withou...
PPTX
Postgres Vision 2018: Taking Postgres Everywhere
 
PPTX
Streamline Data Governance with Egeria: The Industry's First Open Metadata St...
PDF
Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...
PDF
Accelerate Return on Data
PDF
Cloud Adoption, Risks and Rewards Infographic
PDF
Machine Learning Everywhere
PPTX
Adapting to the exponential development of technology
PDF
Making Enterprise Big Data Small with Ease
Fighting Financial Crime with Artificial Intelligence
Postgres Vision 2018: Making Modern an Old Legacy System
 
Postgres Vision 2018: The Pragmatic Cloud
 
Artificial Intelligence and Analytic Ops to Continuously Improve Business Out...
IBM+Hortonworks = Transformation of the Big Data Landscape
Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...
Klarna Tech Talk - Mind the Data!
Understanding Your Crown Jewels: Finding, Organizing, and Profiling Sensitive...
Open Source Data Management for Industry 4.0
Securing and governing a multi-tenant data lake within the financial industry
Flash session -goldengate--lht1053-lon
Data Offload for the Chief Data Officer – how to move data onto Hadoop withou...
Postgres Vision 2018: Taking Postgres Everywhere
 
Streamline Data Governance with Egeria: The Industry's First Open Metadata St...
Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...
Accelerate Return on Data
Cloud Adoption, Risks and Rewards Infographic
Machine Learning Everywhere
Adapting to the exponential development of technology
Making Enterprise Big Data Small with Ease
Ad

Similar to Journey to Big Data: Main Issues, Solutions, Benefits (20)

PDF
Big Data in Banking (White paper)
PDF
Creating a Modern Data Architecture for Digital Transformation
PPTX
BigData in Banking
PDF
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
PPTX
Big Data in Financial Services
PDF
Big Data Paris - A Modern Enterprise Architecture
PDF
Big Data for Product Managers
PDF
Big data appliances for BI on Cloud
PDF
Big Data in Banking. Infographic
PDF
OpenSistemas Corporate Presentation
PPTX
Deutsche Telekom on Big Data
PDF
Why Big Data is Really about Small Data
PDF
Big Data & Analytics perspectives in Banking
PDF
Story of Bigdata and its Applications in Financial Institutions
PPTX
Big data in Private Banking
PDF
Big Data LDN 2018: DATA SCIENCE AT ING
PPTX
basic of data science and big data......
PPTX
Big data4businessusers
PDF
Where does Fast Data Strategy Fit within IT Projects
PDF
Big data for product managers
Big Data in Banking (White paper)
Creating a Modern Data Architecture for Digital Transformation
BigData in Banking
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Big Data in Financial Services
Big Data Paris - A Modern Enterprise Architecture
Big Data for Product Managers
Big data appliances for BI on Cloud
Big Data in Banking. Infographic
OpenSistemas Corporate Presentation
Deutsche Telekom on Big Data
Why Big Data is Really about Small Data
Big Data & Analytics perspectives in Banking
Story of Bigdata and its Applications in Financial Institutions
Big data in Private Banking
Big Data LDN 2018: DATA SCIENCE AT ING
basic of data science and big data......
Big data4businessusers
Where does Fast Data Strategy Fit within IT Projects
Big data for product managers
Ad

More from DataWorks Summit (20)

PPTX
Data Science Crash Course
PPTX
Floating on a RAFT: HBase Durability with Apache Ratis
PPTX
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
PDF
HBase Tales From the Trenches - Short stories about most common HBase operati...
PPTX
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
PPTX
Managing the Dewey Decimal System
PPTX
Practical NoSQL: Accumulo's dirlist Example
PPTX
HBase Global Indexing to support large-scale data ingestion at Uber
PPTX
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
PPTX
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
PPTX
Supporting Apache HBase : Troubleshooting and Supportability Improvements
PPTX
Security Framework for Multitenant Architecture
PDF
Presto: Optimizing Performance of SQL-on-Anything Engine
PPTX
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
PPTX
Extending Twitter's Data Platform to Google Cloud
PPTX
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
PPTX
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
PPTX
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
PDF
Computer Vision: Coming to a Store Near You
PPTX
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Data Science Crash Course
Floating on a RAFT: HBase Durability with Apache Ratis
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
HBase Tales From the Trenches - Short stories about most common HBase operati...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Managing the Dewey Decimal System
Practical NoSQL: Accumulo's dirlist Example
HBase Global Indexing to support large-scale data ingestion at Uber
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Security Framework for Multitenant Architecture
Presto: Optimizing Performance of SQL-on-Anything Engine
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Extending Twitter's Data Platform to Google Cloud
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Computer Vision: Coming to a Store Near You
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark

Recently uploaded (20)

PDF
cuic standard and advanced reporting.pdf
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Encapsulation theory and applications.pdf
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Empathic Computing: Creating Shared Understanding
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPTX
A Presentation on Artificial Intelligence
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
Big Data Technologies - Introduction.pptx
PDF
Modernizing your data center with Dell and AMD
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
cuic standard and advanced reporting.pdf
Review of recent advances in non-invasive hemoglobin estimation
Chapter 3 Spatial Domain Image Processing.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
Encapsulation theory and applications.pdf
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Encapsulation_ Review paper, used for researhc scholars
Empathic Computing: Creating Shared Understanding
Mobile App Security Testing_ A Comprehensive Guide.pdf
A Presentation on Artificial Intelligence
MYSQL Presentation for SQL database connectivity
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Per capita expenditure prediction using model stacking based on satellite ima...
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
NewMind AI Weekly Chronicles - August'25 Week I
Big Data Technologies - Introduction.pptx
Modernizing your data center with Dell and AMD
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf

Journey to Big Data: Main Issues, Solutions, Benefits

  • 1. A journey to Big Data Main Challenges, Solutions and Benefits Corporate & Investment Banking October 2018
  • 2. Who We Are? 2 Santander Highlights Total assets (EUR trillion) Gross cutormer loans (EUR billion excluding reverse repos) Customer deposits + mutual funds (EUR billion excluding repos) Branches 2017 Attributable profit (EUR million) H1’18 Attributable profit (EUR million) Market capitalisation (EUR billion; 26-06-18) People (headcount) Customers (millions) Shareholders (millions) Communities (million people helped in 2017) 1.43 863 895 13,482 6,619 3,752 74 200,961 140 4.2 2.1
  • 3. Corporate & Investment Banking Business 3 Few contracts – Big profits Big Clients Details makes the difference Strong Competence Adapt to client systems Ad hoc integration Integration layer becomes huge Lot of middleware Big Client Big Client Big Client Big Client Big Client Big Client Big Client Big Client Big Client Big Client
  • 4. Legacy Systems Challenges 4 Regulation Strong Regulation We cannot simply integrate systems in a row. We need to consolidate data for regulators Systems Lot of Systems There are more than 1000 different applications installed on Santander Corporate & Investment Banking Auditors Auditors are Welcome We need our systems ready to be audited at any time. Presence in Many Countries Same pattern is repeated on each country Countries
  • 5. Big Data. Data Organization 5 Raw Data Data is stored ‘as is’ without any modification. It is required all data to have its own metadata Landing Data Ontologies Data is grouped according to functional criteria (ontologies). Data is consolidated to eliminate duplicates Common Business Views Is where applications access and process the data. It is not a copy but a view of the common layer Business Applications
  • 6. Finland Architecture. A Thousand Lakes 6 API S3 Landing (RAW) Common (Harmonized) Business (Consolidated & Views) CDO BATCH CLUSTER A worker worker worker worker Tools BATCH CLUSTER B worker worker worker worker Tools OTHER BATCH CLUSTER C worker worker worker worker Tools
  • 7. Finland Architecture. On Demand Evolution 7 API S3 Landing (RAW) Common (Harmonized) Business (Consolidated & Views) CDO ON DEMAND INSTANTIATION worker worker worker Datalake Distribution Tools worker worker worker Datalake Distribution Tools PROCESING AND SAVING DATA worker worker worker Datalake Distribution Tools FREE RESOURCES
  • 8. Online Cluster Architecture 8 New Applications Legacy Applications Online Cache Business Rules Visualization Spring Data Flow Manual Orchestation Automatic Choreography REST API
  • 9. Online Cluster. Design Pattern 9 Source Stream Online Cache Business Rules AVRO AVRO IDX AVRO IDX AVRO IDX AVRO AVRO Schema Registry AVRO AVRO IDX
  • 10. Complete Finland Architecture 10 API S3 Landing (RAW) Common (Harmonized) Business (Consolidated & Views) CDO ON PREMISE BATCH CLUSTER worker worker worker Datalake Distribution Tools ON DEMAND BATCH CLUSTER Online Cache Business Rules Visualization Spring Data Flow Manual Orchestation Automatic Choreography REST API ONLINE CLUSTER worker worker worker worker Tools Datalake Distribution
  • 11. Our purpose is to help people and business prosper. Our culture is based on believing that everything we do should be: Thank You.