SlideShare a Scribd company logo
593 Managing Enterprise Data Quality Using SAP Information Steward
Managing Enterprise Data Quality
using SAP Information Steward
Vinny Ahuja, Cheryl Johnson
Intel Corporation
SESSION CODE: BI593
Disclaimer
This presentation is for informational purposes only. INTEL MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN
THIS SUMMARY.
Software and workloads used in performance tests may have been optimized for performance only on Intel
microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer
systems, components, software, operations and functions. Any change to any of those factors may cause the results to
vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated
purchases, including the performance of that product when combined with other products.
For more complete information about performance and benchmark results, visit www.intel.com/benchmarks
Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.
For a list of Intel trademarks, go to http://legal.intel.com/Trademarks/NamesDb.htm]
* Other names and brands may be claimed as the property of others.
Copyright © 2015, Intel Corporation. All rights reserved.
 Data Quality(DQ) challenges within information
pipeline for Business Intelligence (BI)
 Provide visibility into DQ issues within a
heterogeneous landscape
 Role of Information Steward in addressing DQ
 Share implementation experience
 Use DQ tool as a regression test tool
Learning Points
About Intel
 Data quality starts with systems of record
 Data movement can introduce data quality issues
 Don’t wait for customer to find data quality issues
 Build instrumentation in pipeline to monitor quality
Business Intelligence Information Pipeline
Source
Systems Operational
Data Store (ODS)
Extract Transform
& Load (ETL)
Enterprise
DataWarehouse
EDW
Data marts BI Platforms
 Accuracy
 Data was entered or derived correctly as measured by a
physical assessment
 Completeness
 Data is not missing
 Consistency
 Data that should be the same in various systems is, in fact,
the same
 Timeliness
 Data is available for use when the business requires it
 Validity
 Data conforms to business rules(constraints)
Key Data Characteristics (Dimensions)
 Lack of ownership/accountability
 Incomplete or no checks during data entry
 Heterogeneous platforms
 Purchased and homegrown applications
 Limited or no documentation
 Limited resources
 Run vs grow the business
 Mergers & acquisitions
What Makes Managing Data Quality Hard?
Managing Data Quality
Processes to Assess, Define, Monitor and Improve Data Quality
Discover &
Understand
Data
Define
Deploy
Monitor &
Remediate
•Data Ownership, Roles &
Responsibilities
•Data specifications
•Data quality requirements
•Workflows with R&R for
accountability to resolve data quality
issues
•Analyze Monitor results
•Execute workflows to fix DQ
issues
•Assess/Profile Data
•Assess Risks and Impact
•Catalog Data Assets
•Governance processes
•Operational processes
•DQ Audit and Monitors
Analyst, Data Steward, Product Data
Manager (PdM)
Analyst, Data Steward, PdM
Enterprise
Data
Analyst, Data Steward, PdM
Data Steward, Analyst, Developer
DQ Management Capability Stack
Data Sources (ERP, MDM, CRM, DW, Data Marts)
Data Access Layer
Data Profiler Rules Engine
Audit Results Repository
Reporting
Analysis
Events
Notifications
MetadataRepository
Workflow
Engine
Analyst, Data Steward, Product Data
Manager (PdM)
Data Steward,
DQ Management with Information Steward
Source
Systems ODS
ETL EDW Data marts
BI Platforms
• Data Validation Rules
• Data Profiles Setup
• DQ Scorecards
• DQ Monitor Tolerances
• Tasks and Notifications
• Accuracy
• Completeness
• Consistency
• Integrity
• Validity
DQ Metrics Repository
Data Profiles Data Fallouts DQ Metrics & Scorecards
 Data security
 Standards
 Systems landscape
 Roles & responsibilities
 Development lifecycle (migration)
 Dashboards, alerts and notifications
 Production support
 Training
 Upgrades
Rolling Out the DQ Management Tool
 Need to protect data from unauthorized use
 Master Data, Sales, Procurement
 Projects organized by subject areas (Data
Taxonomy) or business process/function
 Separate projects for business users and operations
 Data stewards approve access to data
Data Organization and Security
• Naming standards:
• Connections: SAPCRM_ChnlMgmt
• Views: VW0012_ADRC join Channel_Mgmt.v_addr
• Rules: LR0012 SAP ADRC PK not in EDW addr
• Tasks: CHNL RT0012 VW0012 ADRC compare addr
• Emphasis on names and detailed descriptions that
resonate with business or operations users
• Documentation within the tool
Naming Standards
Systems Landscape
PF DEV BM* PRDQA
PF DEV QA BM PROD
PF DEV QA BM PROD
Pathfinding Development Test Benchmark Production
*If Necessary
Data Sources
Biz or IT
Developer
Support
Analyst
Support
Analyst
 Separation of duties in support of audit requirements
 Those who write monitors cannot deploy in production
 Monitors developed and tested using non-production
systems
 Migrations to production though manual, handled by
separate role (support analyst)
 Support analyst and tool administrator separate roles
and individuals
 Changes migrate from non-production to production
 Changes directly in production on exception basis
Roles & Responsibilities
Initial
Engagement
•Meet with prospects to understand requirements
•Assess if tool is the right fit for requirements
•Tool limitations can be a show stopper for some scenarios
Development
•Assign a mentor to project team, share best practices, standards
•Document requirements for data, views, rules, bindings, schedules, thresholds
•Build DQ Monitor – data sources, views, rules, bindings, tasks, dashboards
Deployment
•Conduct a design review (quality assurance check)
•Migrate to production (support analyst), configure notifications
•Schedule tasks per schedule (support analyst)
Improvements
•Project team makes changes and tests in development and test environments
•Project team requests changes to be migrated to production
•Support analyst migrates changes to production
Development Methodology
3 - 4
Weeks
1 - 3
Days*
* Green Period
 Need for history of records with DQ issues
 Required custom solution to report historical records
Getting to Historical DQ Issue Records
 Information pipeline is comprised of multiple platforms
 One or more platforms get software/hardware upgrade
at a minimum once per year
 Each platform upgrade requires end-to-end testing of
information pipeline
 Was the flow of data complete and consistent after upgrade?
Reality of Platform Upgrades
Source
Systems Operational
Data Store (ODS)
Extract Transform
& Load (ETL)
Enterprise
DataWarehouse
EDW
Data marts BI Platforms
 DQ monitors validate data is complete and consistent
within and across data repositories
 DQ monitors are early detectors of data issues for
critical business processes
 An upgrade of a platform requires regression testing;
a DQ monitor can do that job
 Validate data is complete and consistent between source
and target repository after platform upgrade
 Eliminate need to maintain test data sets, test scripts for
individual platforms
 Test parts of pipeline or the entire pipeline using existing
DQ monitors
DQ Monitors – Regression Test Suite
 Trust in the data has significantly improved and more
focus can be directed to value added activities
 In one scenario improved DQ from 73% to 93% in 1
quarter
 Enabled streamlining for metrics process
 Reduction in recovery time and activities during data
excursions
 Monitors take out the guesswork on what the issue is and
what the resolution needs to be
 Recover in 2-4 hours, instead of 2-3 days
Return On Investment
 BI information pipeline is a good place to start
with DQ Monitors
 Showcase value to those in business responsible
for data
 Design for data security, separation of roles and
enforce standards
 Use data profiling to troubleshoot data issue
within data pipeline, especially in production
 Use DQ monitors as regression test suite for the
information pipeline
Best Practices
 Monitors with no ownership for action, is waste of
resources
 Having metrics helps get the necessary focus on
data quality
 Partner with those championing data governance
to derive value from IT investments
 A major data crisis will get the right attention, grab the
moment
 Monitors are valuable during platform upgrades
 Set development lifecycle expectations early in
the engagement
Key Learnings
STAY INFORMED
Follow the ASUGNews team:
Tom Wailgum: @twailgum
Chris Kanaracus: @chriskanaracus
Craig Powers: @Powers_ASUG
THANK YOU FOR PARTICIPATING
Please provide feedback on this session by completing
a short survey via the event mobile application.
SESSION CODE: BI593
For ongoing education on this area of focus,
visit www.ASUG.com

More Related Content

PDF
Sap information steward
PPTX
Master data management (mdm) & plm in context of enterprise product management
PDF
Mdm for materials –positive impact of data quality improvement
PDF
Leveraging Information Steward
PDF
Estuate EDM Checklist
PDF
Creating Your Data Governance Dashboard
PPTX
Customer-Centric Data Management for Better Customer Experiences
PPT
Lean Master Data Management
Sap information steward
Master data management (mdm) & plm in context of enterprise product management
Mdm for materials –positive impact of data quality improvement
Leveraging Information Steward
Estuate EDM Checklist
Creating Your Data Governance Dashboard
Customer-Centric Data Management for Better Customer Experiences
Lean Master Data Management

What's hot (20)

PPT
09 mdm tool comaprison
PPT
MDM Strategy & Roadmap
PDF
CET DQ Tool Selection - Executive
PDF
3 Keys To Successful Master Data Management - Final Presentation
PPTX
Dimension of quality in Cloud Database Services
PDF
Data Quality
PPTX
Tips & tricks to drive effective Master Data Management & ERP harmonization
PDF
Faster Data Processing for healthcare system
DOCX
07. Analytics & Reporting Requirements Template
DOCX
EIM Tutorial
PDF
MDM for product data with Talend
PPT
Establishing a Strategy for Data Quality
PDF
Master Your Data. Master Your Business
PDF
Unlocking Success in the 3 Stages of Master Data Management
PPT
IBM InfoSphere Optim Solutions - Highlights
PDF
Data Quality Services in SQL Server 2012
PDF
White Paper - Data Warehouse Project Management
PDF
525 ibm optim
PDF
Credit Suisse, Reference Data Management on a Global Scale
09 mdm tool comaprison
MDM Strategy & Roadmap
CET DQ Tool Selection - Executive
3 Keys To Successful Master Data Management - Final Presentation
Dimension of quality in Cloud Database Services
Data Quality
Tips & tricks to drive effective Master Data Management & ERP harmonization
Faster Data Processing for healthcare system
07. Analytics & Reporting Requirements Template
EIM Tutorial
MDM for product data with Talend
Establishing a Strategy for Data Quality
Master Your Data. Master Your Business
Unlocking Success in the 3 Stages of Master Data Management
IBM InfoSphere Optim Solutions - Highlights
Data Quality Services in SQL Server 2012
White Paper - Data Warehouse Project Management
525 ibm optim
Credit Suisse, Reference Data Management on a Global Scale
Ad

Viewers also liked (20)

PPT
Radsok Presentation Ipe
PDF
Amphenol LTW Industrial Ethernet M12 to RJ45 Adaptor
PPTX
E M C Ionix Overview 2010
PDF
Global Ethernet Network
DOC
Suman Resume
PPTX
Predstavitev izobraževanja odraslih na Kosovem, dr. Rame Likaj, Konferenca Gr...
PDF
Amphenol_Backplane_Systems_UHD_NAFI_BROFinalWeb_1Apr2014
PPT
ŠTIKY ČESKÉHO BYZNYSU
PDF
InnerWireless Distributed Antenna Brochure
PDF
Job offer tpil commercial manager-surabaya
PPTX
VSE JE OK! Dr. Rok Stritar, Ekonomska fakulteta v Ljubljani, MQ konferenca, 1...
PPT
China mobile communication antenna industry report, 2011
PPTX
Keynote Presentation - The Power of Storytelling with Andrew Griffiths
PPTX
Ect english company profile v3.0
DOCX
Introduction - A STUDY ON EMPLOYEE ENGAGEMENT IN FCI OEN CONNECTORS, MULAMTHU...
PPTX
Lessons Learned: Implementing VoLTE Roaming APAC
PDF
David Fei - Session 1: The Global Airport Cities Report: The Latest Project N...
PDF
History of Fiber Optics
DOCX
Chapters(1)A STUDY ON EMPLOYEE ENGAGEMENT IN FCI OEN CONNECTORS, MULAMTHURUTH...
PDF
Digital Omnichannel Customer Acquisition
Radsok Presentation Ipe
Amphenol LTW Industrial Ethernet M12 to RJ45 Adaptor
E M C Ionix Overview 2010
Global Ethernet Network
Suman Resume
Predstavitev izobraževanja odraslih na Kosovem, dr. Rame Likaj, Konferenca Gr...
Amphenol_Backplane_Systems_UHD_NAFI_BROFinalWeb_1Apr2014
ŠTIKY ČESKÉHO BYZNYSU
InnerWireless Distributed Antenna Brochure
Job offer tpil commercial manager-surabaya
VSE JE OK! Dr. Rok Stritar, Ekonomska fakulteta v Ljubljani, MQ konferenca, 1...
China mobile communication antenna industry report, 2011
Keynote Presentation - The Power of Storytelling with Andrew Griffiths
Ect english company profile v3.0
Introduction - A STUDY ON EMPLOYEE ENGAGEMENT IN FCI OEN CONNECTORS, MULAMTHU...
Lessons Learned: Implementing VoLTE Roaming APAC
David Fei - Session 1: The Global Airport Cities Report: The Latest Project N...
History of Fiber Optics
Chapters(1)A STUDY ON EMPLOYEE ENGAGEMENT IN FCI OEN CONNECTORS, MULAMTHURUTH...
Digital Omnichannel Customer Acquisition
Ad

Similar to 593 Managing Enterprise Data Quality Using SAP Information Steward (20)

PDF
Data Quality in Data Warehouse and Business Intelligence Environments - Disc...
PPT
Building a Data Quality Program from Scratch
PPTX
Transform Your Downstream Cloud Analytics with Data Quality 
PDF
Data Quality at the Speed of Work
PDF
Getting Data Quality Right
PDF
Mastering your data with ca e rwin dm 09082010
PPTX
Data Quality Challenges & Solution Approaches in Yahoo!’s Massive Data
PPT
Artificial Intelligence Expert Session Webinar
 
PPTX
BI: How Can Your High-Performance BI System Meet Expectations When You Feed I...
PPTX
Fuel your Data-Driven Ambitions with Data Governance
PPTX
Reimagining Data Quality: Key Modern Considerations
PDF
Data Profiling: The First Step to Big Data Quality
PDF
Applying Data Quality Best Practices at Big Data Scale
PDF
OAUG 05-2009-MDM-1683-A Fiteni CPA, CMA
PDF
Foundational Strategies for Trust in Big Data Part 2: Understanding Your Data
PPT
Chapter 4 Organizational Aspects of Data Management.ppt
DOCX
Data quality management system
PPTX
From DQ to DG
PDF
Building Rules for Data Governance
PPT
Lecture 23
Data Quality in Data Warehouse and Business Intelligence Environments - Disc...
Building a Data Quality Program from Scratch
Transform Your Downstream Cloud Analytics with Data Quality 
Data Quality at the Speed of Work
Getting Data Quality Right
Mastering your data with ca e rwin dm 09082010
Data Quality Challenges & Solution Approaches in Yahoo!’s Massive Data
Artificial Intelligence Expert Session Webinar
 
BI: How Can Your High-Performance BI System Meet Expectations When You Feed I...
Fuel your Data-Driven Ambitions with Data Governance
Reimagining Data Quality: Key Modern Considerations
Data Profiling: The First Step to Big Data Quality
Applying Data Quality Best Practices at Big Data Scale
OAUG 05-2009-MDM-1683-A Fiteni CPA, CMA
Foundational Strategies for Trust in Big Data Part 2: Understanding Your Data
Chapter 4 Organizational Aspects of Data Management.ppt
Data quality management system
From DQ to DG
Building Rules for Data Governance
Lecture 23

593 Managing Enterprise Data Quality Using SAP Information Steward

  • 2. Managing Enterprise Data Quality using SAP Information Steward Vinny Ahuja, Cheryl Johnson Intel Corporation SESSION CODE: BI593
  • 3. Disclaimer This presentation is for informational purposes only. INTEL MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS SUMMARY. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information about performance and benchmark results, visit www.intel.com/benchmarks Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries. For a list of Intel trademarks, go to http://legal.intel.com/Trademarks/NamesDb.htm] * Other names and brands may be claimed as the property of others. Copyright © 2015, Intel Corporation. All rights reserved.
  • 4.  Data Quality(DQ) challenges within information pipeline for Business Intelligence (BI)  Provide visibility into DQ issues within a heterogeneous landscape  Role of Information Steward in addressing DQ  Share implementation experience  Use DQ tool as a regression test tool Learning Points
  • 6.  Data quality starts with systems of record  Data movement can introduce data quality issues  Don’t wait for customer to find data quality issues  Build instrumentation in pipeline to monitor quality Business Intelligence Information Pipeline Source Systems Operational Data Store (ODS) Extract Transform & Load (ETL) Enterprise DataWarehouse EDW Data marts BI Platforms
  • 7.  Accuracy  Data was entered or derived correctly as measured by a physical assessment  Completeness  Data is not missing  Consistency  Data that should be the same in various systems is, in fact, the same  Timeliness  Data is available for use when the business requires it  Validity  Data conforms to business rules(constraints) Key Data Characteristics (Dimensions)
  • 8.  Lack of ownership/accountability  Incomplete or no checks during data entry  Heterogeneous platforms  Purchased and homegrown applications  Limited or no documentation  Limited resources  Run vs grow the business  Mergers & acquisitions What Makes Managing Data Quality Hard?
  • 9. Managing Data Quality Processes to Assess, Define, Monitor and Improve Data Quality Discover & Understand Data Define Deploy Monitor & Remediate •Data Ownership, Roles & Responsibilities •Data specifications •Data quality requirements •Workflows with R&R for accountability to resolve data quality issues •Analyze Monitor results •Execute workflows to fix DQ issues •Assess/Profile Data •Assess Risks and Impact •Catalog Data Assets •Governance processes •Operational processes •DQ Audit and Monitors Analyst, Data Steward, Product Data Manager (PdM) Analyst, Data Steward, PdM Enterprise Data Analyst, Data Steward, PdM Data Steward, Analyst, Developer
  • 10. DQ Management Capability Stack Data Sources (ERP, MDM, CRM, DW, Data Marts) Data Access Layer Data Profiler Rules Engine Audit Results Repository Reporting Analysis Events Notifications MetadataRepository Workflow Engine Analyst, Data Steward, Product Data Manager (PdM) Data Steward,
  • 11. DQ Management with Information Steward Source Systems ODS ETL EDW Data marts BI Platforms • Data Validation Rules • Data Profiles Setup • DQ Scorecards • DQ Monitor Tolerances • Tasks and Notifications • Accuracy • Completeness • Consistency • Integrity • Validity DQ Metrics Repository Data Profiles Data Fallouts DQ Metrics & Scorecards
  • 12.  Data security  Standards  Systems landscape  Roles & responsibilities  Development lifecycle (migration)  Dashboards, alerts and notifications  Production support  Training  Upgrades Rolling Out the DQ Management Tool
  • 13.  Need to protect data from unauthorized use  Master Data, Sales, Procurement  Projects organized by subject areas (Data Taxonomy) or business process/function  Separate projects for business users and operations  Data stewards approve access to data Data Organization and Security
  • 14. • Naming standards: • Connections: SAPCRM_ChnlMgmt • Views: VW0012_ADRC join Channel_Mgmt.v_addr • Rules: LR0012 SAP ADRC PK not in EDW addr • Tasks: CHNL RT0012 VW0012 ADRC compare addr • Emphasis on names and detailed descriptions that resonate with business or operations users • Documentation within the tool Naming Standards
  • 15. Systems Landscape PF DEV BM* PRDQA PF DEV QA BM PROD PF DEV QA BM PROD Pathfinding Development Test Benchmark Production *If Necessary Data Sources Biz or IT Developer Support Analyst Support Analyst
  • 16.  Separation of duties in support of audit requirements  Those who write monitors cannot deploy in production  Monitors developed and tested using non-production systems  Migrations to production though manual, handled by separate role (support analyst)  Support analyst and tool administrator separate roles and individuals  Changes migrate from non-production to production  Changes directly in production on exception basis Roles & Responsibilities
  • 17. Initial Engagement •Meet with prospects to understand requirements •Assess if tool is the right fit for requirements •Tool limitations can be a show stopper for some scenarios Development •Assign a mentor to project team, share best practices, standards •Document requirements for data, views, rules, bindings, schedules, thresholds •Build DQ Monitor – data sources, views, rules, bindings, tasks, dashboards Deployment •Conduct a design review (quality assurance check) •Migrate to production (support analyst), configure notifications •Schedule tasks per schedule (support analyst) Improvements •Project team makes changes and tests in development and test environments •Project team requests changes to be migrated to production •Support analyst migrates changes to production Development Methodology 3 - 4 Weeks 1 - 3 Days* * Green Period
  • 18.  Need for history of records with DQ issues  Required custom solution to report historical records Getting to Historical DQ Issue Records
  • 19.  Information pipeline is comprised of multiple platforms  One or more platforms get software/hardware upgrade at a minimum once per year  Each platform upgrade requires end-to-end testing of information pipeline  Was the flow of data complete and consistent after upgrade? Reality of Platform Upgrades Source Systems Operational Data Store (ODS) Extract Transform & Load (ETL) Enterprise DataWarehouse EDW Data marts BI Platforms
  • 20.  DQ monitors validate data is complete and consistent within and across data repositories  DQ monitors are early detectors of data issues for critical business processes  An upgrade of a platform requires regression testing; a DQ monitor can do that job  Validate data is complete and consistent between source and target repository after platform upgrade  Eliminate need to maintain test data sets, test scripts for individual platforms  Test parts of pipeline or the entire pipeline using existing DQ monitors DQ Monitors – Regression Test Suite
  • 21.  Trust in the data has significantly improved and more focus can be directed to value added activities  In one scenario improved DQ from 73% to 93% in 1 quarter  Enabled streamlining for metrics process  Reduction in recovery time and activities during data excursions  Monitors take out the guesswork on what the issue is and what the resolution needs to be  Recover in 2-4 hours, instead of 2-3 days Return On Investment
  • 22.  BI information pipeline is a good place to start with DQ Monitors  Showcase value to those in business responsible for data  Design for data security, separation of roles and enforce standards  Use data profiling to troubleshoot data issue within data pipeline, especially in production  Use DQ monitors as regression test suite for the information pipeline Best Practices
  • 23.  Monitors with no ownership for action, is waste of resources  Having metrics helps get the necessary focus on data quality  Partner with those championing data governance to derive value from IT investments  A major data crisis will get the right attention, grab the moment  Monitors are valuable during platform upgrades  Set development lifecycle expectations early in the engagement Key Learnings
  • 24. STAY INFORMED Follow the ASUGNews team: Tom Wailgum: @twailgum Chris Kanaracus: @chriskanaracus Craig Powers: @Powers_ASUG
  • 25. THANK YOU FOR PARTICIPATING Please provide feedback on this session by completing a short survey via the event mobile application. SESSION CODE: BI593 For ongoing education on this area of focus, visit www.ASUG.com