Data Lineage
Series: Foundational Strategies for Trust in Big Data – Part 3
Housekeeping
Webcast Audio
• Today’s webcast audio is streamed through your computer speakers.
• If you need technical assistance with the web interface or audio, please reach out to us using the Q&A box.
Questions Welcome
• Submit your questions at any time during the presentation using the Q&A box.
• We will answer them during our Q&A session following the presentation.
Recording and slides
• This webcast is being recorded. You will receive an email following the webcast with a link to download both the recording and the slides.
Andy Reid
Director, Product Marketing
Arianna Valentini
Product Marketing Manager
What You Will Learn Today
• Review of the ingredients of successful Big Data
• What is the cost of lost data governance
• Overcoming data lineage challenges
• How one company is using DI + DQ for lineage that
fuels their anti-money laundering requirements
• What you can do in the next 90 days to take action on
data lineage
• Wrap up with Q&A
Ingredients of Successful Big Data
1. Clear Business Case
2. Extract Data
3. Understand Data
4. Trace Lineage
Data Governance
64% of IT executives have trouble finding and cleaning the right data for strategic data projects
Sierra Venture, 2020
90% of executives are concerned about how misused data can impact corporate reputation
PwC, 22nd Annual Global CEO Survey, 2019
Only 2% of firms consider themselves fully CCPA compliant today
International Association of Privacy Professionals, October 2019
The Cost of Lost
Governance
GDPR Fines 2019: 27
$462,635,765
Source: https://alpin.io/blog/gdpr-fines-list/, December 15, 2019
The importance of data quality
and integration in the enterprise:
• Compliance
• Decision making
• Customer centricity
• Brand reputation
• Risk Mitigation
Goals and Challenges of
Data Governance
GOALS
• Regulatory compliance
• Understand data context,
meaning
• Accuracy, completeness,
consistency, relevancy,
timeliness, validity of data
CHALLENGES
• Multi-platform, data
volume and complexity
• Diversity and consistency of
sources
• Compliance demands:
broader, deeper & evolving
Regulation Pressures Continue to Grow
Broader and deeper compliance & regulation
Volume and complexity of data are growing
May 2018 Jan 2020
Data Governance Requires a Multi-Faceted Approach
Quality Security Lineage
Why is Data Lineage
Important for Data
Governance?
• See linkages to external data sources and
targets
• Gain insight into the flow of data across
the enterprise
• Trace usage and assess the impact of
changes across the data lifecycle
• Diagnose problems faster
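One way to picture these points is to treat lineage as a directed graph of datasets. The sketch below is only an illustration under that assumption: the class name `LineageGraph` and every dataset name are hypothetical and do not come from any product mentioned in this webcast. Walking the graph upstream finds the original sources of a dataset, and walking it downstream shows what a change would affect.

```python
from collections import defaultdict, deque


class LineageGraph:
    """Directed graph: an edge source -> target means target is derived from source."""

    def __init__(self):
        self.downstream = defaultdict(set)
        self.upstream = defaultdict(set)

    def add_flow(self, source, target):
        """Record that data flows from source into target."""
        self.downstream[source].add(target)
        self.upstream[target].add(source)

    def _reach(self, start, edges):
        """Breadth-first walk returning every node reachable from start."""
        seen = set()
        queue = deque([start])
        while queue:
            node = queue.popleft()
            for nxt in edges.get(node, ()):
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append(nxt)
        return seen

    def origins(self, dataset):
        """Upstream datasets that have no upstream of their own (original sources)."""
        return {n for n in self._reach(dataset, self.upstream)
                if not self.upstream.get(n)}

    def impact(self, dataset):
        """Every downstream dataset that a change to dataset could affect."""
        return self._reach(dataset, self.downstream)


# Hypothetical flows, for illustration only.
graph = LineageGraph()
graph.add_flow("mainframe.accounts", "staging.accounts")
graph.add_flow("crm.customers", "staging.customers")
graph.add_flow("staging.accounts", "lake.customer_360")
graph.add_flow("staging.customers", "lake.customer_360")
graph.add_flow("lake.customer_360", "reports.aml_alerts")

print(sorted(graph.origins("reports.aml_alerts")))
print(sorted(graph.impact("staging.accounts")))
```

Keeping both edge directions makes the two questions (where did this come from, and what depends on it) equally cheap to answer.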
Challenges to effective Data Lineage
• Growing regulatory requirements
• Rising data volumes, sources, and variety
• Increasing data lineage complexity
• Transitioning to new cloud deployments
Growing Regulations
• Track data from access to integration to ensure sensitive
data is being used in a compliant way
• Regardless of the data source (mainframe, IBM i, or cloud),
establish a process for lineage analysis
• See the flow of any piece of data through a job
• Consider how next-gen projects such as Machine
Learning might affect your data lineage processes
• Do you have what is needed for audits?
Data needs to meet quality levels but also be traced to original source
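As a rough illustration of seeing a piece of data flow through a job, the sketch below keeps the original source and the list of applied steps alongside each value. It is a minimal sketch, not how any particular integration tool works: `TracedValue`, the step names, and the source label `mainframe:CUSTMAST.NAME` are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class TracedValue:
    """A value that remembers where it came from and what was done to it."""
    value: object
    source: str                                  # original source of the value
    steps: tuple = field(default_factory=tuple)  # names of transformations applied

    def apply(self, step_name, fn):
        """Return a new TracedValue with fn applied and the step recorded."""
        return TracedValue(fn(self.value), self.source, self.steps + (step_name,))


# Hypothetical job: clean up a customer name read from a mainframe file.
raw = TracedValue("  smith, JOHN ", source="mainframe:CUSTMAST.NAME")
clean = raw.apply("trim", str.strip).apply("title_case", str.title)

print(clean.value)   # the cleaned value
print(clean.source)  # still points at the original source
print(clean.steps)   # the audit trail of steps in order
```

Because each step returns a new object, the raw value and its empty step list stay available for audit next to the cleaned result.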
Rising Data Volumes, Sources, and Variety
• Consider how you will address data lineage for a growing
expanse of data
• Do the integration solutions you use today create data
lineage challenges for source data?
• Ex. Mainframe data to a cloud data warehouse
• Establish data lineage processes that can cover requirements
for both batch and real-time data delivery
• Don't forget data quality!
Regardless of complexities, continuous trusted data delivery is a must
Increasing Data Lineage Complexity
• Consider whether you have auditability and transparency in your current
data lineage processes
• Need full insight into the flow of data across the enterprise
• Is there a clear link to external data sources and targets?
• As data moves through its life cycle, can you clearly trace usage
and assess the impact of changes?
As your environment complexity grows, you must have a data
lineage map to follow data throughout the enterprise
Remember Data Lineage is also Multi-Faceted
Business Technical
The Reality is…
Cloud is Here
46% of IT professionals have said that
cloud or hybrid-cloud computing
was part of their 2019 initiatives
Data Trends for 2019, Syncsort 2019
84% of organizations have a multi-cloud strategy
State of the Cloud 2019, Flexera
Transitioning to New Cloud Deployments
• When moving from source to cloud target, you need to pass
source-to-cluster data lineage information on
• Understand how a hybrid, multi-cloud, or full cloud deployment can
affect your data governance scalability
• Ask: How will this affect my current data lineage process?
• Consider which elements of your current DI/DQ strategy
need to adapt
Cloud deployments need to satisfy governance and compliance needs
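To make "passing lineage information on" concrete, here is a hedged sketch in which each record travels to the cloud target inside an envelope that names its source and carries a checksum of the original payload. The envelope layout, the function names `with_lineage` and `verify`, and the sample record are assumptions made for illustration; real integration tools will have their own formats.

```python
import hashlib
import json


def _digest(record):
    """SHA-256 of the record serialized with sorted keys, so key order does not matter."""
    payload = json.dumps(record, sort_keys=True)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def with_lineage(record, source_system, job_name):
    """Wrap a record with lineage metadata before sending it to the target."""
    return {
        "data": record,
        "lineage": {
            "source_system": source_system,
            "job": job_name,
            "sha256": _digest(record),
        },
    }


def verify(envelope):
    """Check on the target side that the data still matches the source checksum."""
    return _digest(envelope["data"]) == envelope["lineage"]["sha256"]


# Hypothetical record and job name, for illustration only.
envelope = with_lineage(
    {"account_id": 42, "balance": "100.00"},
    source_system="mainframe",
    job_name="nightly_load",
)
print(verify(envelope))
```

The checksum lets the target confirm that a record arrived unmodified, and the metadata answers the audit question of which system and job produced it.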
Global Bank
Building an AML process with DI + DQ
Goal
Meet AML transaction monitoring
and Financial Conduct Authority
(FCA) compliance
Challenges
• Data volume too large and too
widely scattered to analyze
• Disparate data sources –
Mainframe, RDBMS, Cloud,
etc.
• Maximize the value/ROI of the
data lake
Requirements
• Consolidated and clean data
• End-to-end data lineage
• Secure integrations
• Unmodified mainframe data
for archive/backup
Global Bank
Results: Data Integration Driving Improved CX
Solution
• Connect CDC
• Connect for Big Data
• Trillium for Big Data
Benefits Achieved
• High performance AML
results
• Faster time to value
• Data lake is trusted source
• Data feeding critical
machine learning-based
fraud detection
What’s Next
• Expanding to additional
Customer Engagement
solutions and applications
Looking at the Next 90 Days…
• Determine if you have an understanding of your
organizational data
• Consider how you use data lineage to support
governance today
• How will you use business lineage AND technical
lineage to ensure governance?
Questions?
Foundational Strategies for Trust in Big Data Part 3: Data Lineage
