SlideShare a Scribd company logo
5
Most read
6
Most read
16
Most read
Data Quality Architecture
 Phase 1 – Account Verification

          Art Nicewick
Project Scope
• Define a Architectural flow diagram that
  provides for basis for data governance, data
  quality and impact analysis

• Create a framework to report on
  inconsistencies in data (Initial emphasis on
  Accounts)
FISMA
• The architecture provides a foundation for verifying
  that Accounts are deleted after the employees leave
  the Gallery
• The Exceptions Facility, Provides the ability for a
  application administrator to request that an Non-AD
  account be left on file
   – Audit trails
   – Non Standard accounts (e.g. TDP as Custodian)
   – CIO can approvedeny and give timelines for resolutions
• Focus of first phase of the initiative
Why Consistency Reports
• Common Practice (Asset Inventory, …)
• Ensures that data is corrected in the correct
  manner
• Re-validates automated processes
• Some changes need to be informed to system
  manager (e.g. They should know if someone
  has a new last name)
• Links into existing manual pratices
General Data Quality Process
1. Identify data stores (Based on priority)
2. Identify authoritative data
3. Identify Interfaces  replicated  redundant
   data
4. Identify consistency analysis process
5. Correct and continuous monitoring
Identify data stores
• 1.1. Create list of all know data applications
   – Define the name of the data application
   – Define the contacts related to the application
      • TDP Contact
      • Application Administrator
   – Categorize the application
Identify data stores
• 1.2. Link data into data flow representation
  for a visual analysis on enterprise data flows
Identify authoritative data
• 2.1. Review Application data to determine
  – What type of data is supported
  – Is data authoritative
Identify Interfaces  replicated 
                 redundant data
• 2.1. Review Application data to determine
   –   Where the data is sent
   –   Where the data is received from
   –   Data Quality
   –   Note: Source assumed by reverse lookup of target definitions
Identify Interfaces  replicated 
           redundant data
• Diagram linkages between data stores for
  visual review and impact analysis
Identify consistency analysis process
            Review participating data sources and
            determine how to define consistency




* At this point only “SQL” methods are used.
Correct and continuous monitoring

• Inconsistencies are periodically sent to end users for “correction” or
  “exceptions”
• Valid exceptions may be
    – “Supervisor Accounts Outside Active Directory” (e.g. TMSAdmin)
    – Ex-Employees with data attached to userid
    – Contractor or testing userid
Correct and continuous monitoring
• Users can review and update exceptions
  online
Correct and continuous monitoring
• Administrators can create schedules and Email
  recipients
Correct and continuous monitoring
• Email can be sent to
  as many people as
  desired and as
  frequently (or
  infrequently) as
  desired.
Target Data
•   Userids (First Phase and Proof of concept)
•   Object Data
•   Location data
•   Employee Names and Titles
•   Other ..
Challenges
•   Object data (Portfolio)
•   Non-SQL Data (Filemaker)
•   Secure Data (Tradewin)
•   Desktop Data (Excel)
•   Offsite data (FMS)
•   Other …

More Related Content

PDF
Building a Data Strategy – Practical Steps for Aligning with Business Goals
PDF
Data Quality Best Practices
PPTX
How to Build & Sustain a Data Governance Operating Model
PDF
Data Quality Best Practices
PDF
Data Governance Takes a Village (So Why is Everyone Hiding?)
PPT
Data Quality
PDF
DAS Slides: Data Governance - Combining Data Management with Organizational ...
PDF
Best Practices in Metadata Management
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Data Quality Best Practices
How to Build & Sustain a Data Governance Operating Model
Data Quality Best Practices
Data Governance Takes a Village (So Why is Everyone Hiding?)
Data Quality
DAS Slides: Data Governance - Combining Data Management with Organizational ...
Best Practices in Metadata Management

What's hot (20)

PDF
Glossaries, Dictionaries, and Catalogs Result in Data Governance
PDF
Introduction to Data Governance
PDF
Data Governance Powerpoint Presentation Slides
PDF
You Need a Data Catalog. Do You Know Why?
PDF
Data Governance Best Practices, Assessments, and Roadmaps
PPTX
Data Governance Best Practices
PDF
Improving Data Literacy Around Data Architecture
PDF
Data at the Speed of Business with Data Mastering and Governance
PDF
Data Architecture Best Practices for Advanced Analytics
PDF
Implementing Effective Data Governance
PDF
Data Governance Best Practices
PDF
Data Governance
PDF
8 Steps to Creating a Data Strategy
PDF
Ibm data governance framework
PPTX
Chapter 6: Data Operations Management
PDF
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
PDF
Data Management vs. Data Governance Program
PDF
Master Data Management - Aligning Data, Process, and Governance
PPTX
Data Quality Management: Cleaner Data, Better Reporting
PDF
Reference master data management
Glossaries, Dictionaries, and Catalogs Result in Data Governance
Introduction to Data Governance
Data Governance Powerpoint Presentation Slides
You Need a Data Catalog. Do You Know Why?
Data Governance Best Practices, Assessments, and Roadmaps
Data Governance Best Practices
Improving Data Literacy Around Data Architecture
Data at the Speed of Business with Data Mastering and Governance
Data Architecture Best Practices for Advanced Analytics
Implementing Effective Data Governance
Data Governance Best Practices
Data Governance
8 Steps to Creating a Data Strategy
Ibm data governance framework
Chapter 6: Data Operations Management
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
Data Management vs. Data Governance Program
Master Data Management - Aligning Data, Process, and Governance
Data Quality Management: Cleaner Data, Better Reporting
Reference master data management
Ad

Similar to Data quality architecture (20)

PDF
5 Steps To Master Data Management
DOC
Etl And Data Test Guidelines For Large Applications
PPTX
Migrating data: How to reduce risk
PPT
AMP Next Steps
PDF
Data Quality - Are We There Yet?
PDF
Data Quality in Data Warehouse and Business Intelligence Environments - Disc...
PPT
Pragmatics Driven Issues in Data and Process Integrity in Enterprises
PPTX
Data Migration and MDM - DMM5
PPTX
Data Governance Overview - Doreen Christian
PDF
Migration Services
PPT
Building a Data Quality Program from Scratch
PPT
Dcom be-en-data-assessment-approach
PDF
BDW16 London - Scott Krueger, skyscanner - Does More Data Mean Better Decisio...
PDF
A Roadmap to Data Migration Success
PPTX
Data Governance Goal Framework Alignment.pptx
PPTX
Enterprise Data Management - Audit and Evolve_Workshop2.pptx
PDF
Saksham Sarode - Building Effective test Data Management in Distributed Envir...
PPTX
Neoaug 2013 critical success factors for data quality management-chain-sys-co...
PPT
Data Integrity : A Basic Concept of data recording and analysis
PDF
Leveraging Data in Financial Services to Meet Regulatory Requirements and Cre...
5 Steps To Master Data Management
Etl And Data Test Guidelines For Large Applications
Migrating data: How to reduce risk
AMP Next Steps
Data Quality - Are We There Yet?
Data Quality in Data Warehouse and Business Intelligence Environments - Disc...
Pragmatics Driven Issues in Data and Process Integrity in Enterprises
Data Migration and MDM - DMM5
Data Governance Overview - Doreen Christian
Migration Services
Building a Data Quality Program from Scratch
Dcom be-en-data-assessment-approach
BDW16 London - Scott Krueger, skyscanner - Does More Data Mean Better Decisio...
A Roadmap to Data Migration Success
Data Governance Goal Framework Alignment.pptx
Enterprise Data Management - Audit and Evolve_Workshop2.pptx
Saksham Sarode - Building Effective test Data Management in Distributed Envir...
Neoaug 2013 critical success factors for data quality management-chain-sys-co...
Data Integrity : A Basic Concept of data recording and analysis
Leveraging Data in Financial Services to Meet Regulatory Requirements and Cre...
Ad

More from anicewick (6)

PPT
Semantic web2
PPT
Defining conservation taxonomy
PPT
Creating an RAD Authoratative Data Environment
PPT
FISMA Compliance
PPTX
User Interface Patterns and Nuxeo
PPTX
Understanding Document Managment Systems and Nuxeo
Semantic web2
Defining conservation taxonomy
Creating an RAD Authoratative Data Environment
FISMA Compliance
User Interface Patterns and Nuxeo
Understanding Document Managment Systems and Nuxeo

Recently uploaded (20)

PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
Big Data Technologies - Introduction.pptx
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
cuic standard and advanced reporting.pdf
PDF
KodekX | Application Modernization Development
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Programs and apps: productivity, graphics, security and other tools
Big Data Technologies - Introduction.pptx
MIND Revenue Release Quarter 2 2025 Press Release
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
cuic standard and advanced reporting.pdf
KodekX | Application Modernization Development
“AI and Expert System Decision Support & Business Intelligence Systems”
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Chapter 3 Spatial Domain Image Processing.pdf
Network Security Unit 5.pdf for BCA BBA.
Review of recent advances in non-invasive hemoglobin estimation
Reach Out and Touch Someone: Haptics and Empathic Computing
Dropbox Q2 2025 Financial Results & Investor Presentation
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Advanced methodologies resolving dimensionality complications for autism neur...
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
The Rise and Fall of 3GPP – Time for a Sabbatical?
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx

Data quality architecture

  • 1. Data Quality Architecture Phase 1 – Account Verification Art Nicewick
  • 2. Project Scope • Define a Architectural flow diagram that provides for basis for data governance, data quality and impact analysis • Create a framework to report on inconsistencies in data (Initial emphasis on Accounts)
  • 3. FISMA • The architecture provides a foundation for verifying that Accounts are deleted after the employees leave the Gallery • The Exceptions Facility, Provides the ability for a application administrator to request that an Non-AD account be left on file – Audit trails – Non Standard accounts (e.g. TDP as Custodian) – CIO can approvedeny and give timelines for resolutions • Focus of first phase of the initiative
  • 4. Why Consistency Reports • Common Practice (Asset Inventory, …) • Ensures that data is corrected in the correct manner • Re-validates automated processes • Some changes need to be informed to system manager (e.g. They should know if someone has a new last name) • Links into existing manual pratices
  • 5. General Data Quality Process 1. Identify data stores (Based on priority) 2. Identify authoritative data 3. Identify Interfaces replicated redundant data 4. Identify consistency analysis process 5. Correct and continuous monitoring
  • 6. Identify data stores • 1.1. Create list of all know data applications – Define the name of the data application – Define the contacts related to the application • TDP Contact • Application Administrator – Categorize the application
  • 7. Identify data stores • 1.2. Link data into data flow representation for a visual analysis on enterprise data flows
  • 8. Identify authoritative data • 2.1. Review Application data to determine – What type of data is supported – Is data authoritative
  • 9. Identify Interfaces replicated redundant data • 2.1. Review Application data to determine – Where the data is sent – Where the data is received from – Data Quality – Note: Source assumed by reverse lookup of target definitions
  • 10. Identify Interfaces replicated redundant data • Diagram linkages between data stores for visual review and impact analysis
  • 11. Identify consistency analysis process Review participating data sources and determine how to define consistency * At this point only “SQL” methods are used.
  • 12. Correct and continuous monitoring • Inconsistencies are periodically sent to end users for “correction” or “exceptions” • Valid exceptions may be – “Supervisor Accounts Outside Active Directory” (e.g. TMSAdmin) – Ex-Employees with data attached to userid – Contractor or testing userid
  • 13. Correct and continuous monitoring • Users can review and update exceptions online
  • 14. Correct and continuous monitoring • Administrators can create schedules and Email recipients
  • 15. Correct and continuous monitoring • Email can be sent to as many people as desired and as frequently (or infrequently) as desired.
  • 16. Target Data • Userids (First Phase and Proof of concept) • Object Data • Location data • Employee Names and Titles • Other ..
  • 17. Challenges • Object data (Portfolio) • Non-SQL Data (Filemaker) • Secure Data (Tradewin) • Desktop Data (Excel) • Offsite data (FMS) • Other …