SlideShare a Scribd company logo
Previously known as
Think Big. Move Fast.
Template designed by
brought to you by
SolidQ
• Born in 2002 in USA and Spain
• Established in 2007 in Italy
• More than 1000 customers and more than 200 consultants worldwide
• Dedicated to Data Management on the Microsoft Platform
• Books Authors, Conference Speakers, SQL Server MVPs and Regional Directors
• www.solidq.com
Davide Mauri
• 18 Years of experience on the SQL Server Platform
• Specialized in Data Solution Architecture, Database Design, Performance
Tuning, Business Intelligence
• Microsoft SQL Server MVP
• President of UGISS (Italian SQL Server UG)
• Mentor @ SolidQ
• Video, Book & Article Author
• Regular Speaker @ SQL Server events
• Projects, Consulting, Mentoring & Training
Data Quality
The BIG problem
• What’s the key asset of a company?
• Data that leads to Information and then to Knowledge
• With the mass adoption of Business Intelligence / Analytics problems with
Data Quality arises and become evident
• Wrong, incomplete or incoherent data leads to wrong decisions
• Managers cannot “trust” native data
• Data needs to be reworked a lot in order to be usable
• As per my experience, almost the 50% of the time spent developing a BI solution is use just to solve
Data Quality problems.
The BIG problem
• A Gartner research states that
«Organizations estimated they are losing an average of $8.2 million annually
as a result of data quality issues”
• 22% report estimated losses for $20 million
• 4% report estimated losses for $100 million
Data Quality Concepts
Data Quality Issue Sample Data Problem
Standard Are data elements consistently defined and
understood ?
Gender code = M, F, U in one system
and Gender code = 0, 1, 2 in another
system
Complete Is all necessary data present ? 20% of customers’ last name is blank,
50% of zip-codes are 99999
Accurate Does the data accurately represent reality or a
verifiable source?
A Supplier is listed as ‘Active’ but went
out of business six years ago
Valid Do data values fall within acceptable ranges? Salary values should be between
60,000-120,000
Unique Data appears several times Both John Ryan and Jack Ryan appear
in the system – are they the same
person?
Master Data Management
• A solution to the problem is offered by Master Data Management (MDM)
• Is a Discipline and a Process supported by Technology
• MDM aim to discover and define non-transactional lists of data, with the goal
of compiling maintainable master lists, that will become the reference data.
Master Data Management
• What are Master Data?
• Master Data: “Slowly changing reference data shared across system”
• Master Data != Transactional Data
• Master Data != Metadata
• Reference Data:
• Products, Customers, Suppliers, Geography, ecc.
• The Dimensions of a Data Warehouse 
Master Data Services
• Introduced with SQL Server 2008 R2
• With SQL Server 2012 al lots of improvements
• Web Interface improved *a lot*
• Silverlight based
• Integrated with Excel 2007 and after
• Killer Application!
• Installed with SQL Server 2012 but must be configured prior usage
• Needs IIS, WCF and so on…
• No Changes in 2014
Master Data Services
• Allow the management and the definition of Master Data
• “Model” Definition
• Entities, Attributes, Hiearchies, ecc…
• Business Rules
• Data Stewardship
• Through Excel Addin or the Web Portal
• Integration
• Batch and/or WCF Service
What is Master Data Management?
ERP CRM
Warehouse
MGMT
Invoicing
System
BI
Master Data Hub
External
System
Integration
Web Service Data Hub
Data Steward
Master Data Services
Data Quality Services
• A new Service introduced with SQL Server 2012
• Enable the verification of «new» data against a established Knowledge Base
• Has its own client
• Must be installed after SQL Server 2012 installation
• «Data Quality Service Installer»
• Three dedicated SQL Server database (DQS prefix)
Data Quality Services
• Help to
• Define a Knowledge Base
• Through «Knowledge Discovery» and «Domain Management»
• Perform Data Cleasing & Data Matching (De-Duplication)
• Integrated with
• Integration Services
• Master Data Services (via Excel Addin)
Data Cleasing
• Master (Reference) Data is needed for Data Cleansing
• Can be provided directly from our data (Customer Names, for example)
• Can be supplied by third party companies: Azure Market Place
• *Very* nice feature.
Data Quality Services
Identity Mapping & De-Duplication
• DQS is not the only solution for Identity Mapping & De-Duplication
• MDS has some built-in functions
• SSIS Fuzzy Lookup is great for this
• Great Performance & Results!
De-Duplication with Integration Services
Conclusions
• Bad Data Quality means Bad Business
• Start working on Data Quality ASAP
• Define a business process to achieve Data quality
• MDS and DQS will help to support it
• Integration with existing application via
• Batch
• SOA
• High Data Quality will be a must have!
Link
• Free Data Quality eBook
http://bit.ly/mMJgKv
Link
• MDS Homepage
http://msdn.microsoft.com/en-us/sqlserver/ff943581.aspx
• 3rd Party Client Application
http://profisee.com/
• Data Quality & Data Science
http://www.solidq.com/consulting/
Previously known as
Think Big. Move Fast.

More Related Content

PDF
IT + Line of Business - Driving Faster, Deeper Insights Together
PDF
Data Integration Trends Businesses Should Watch for in 2021
PPTX
Best Practices to Deliver BI Solutions
PPTX
Data Quality Management
PDF
DAMA June 2017 User Group presentation on ECM
PDF
Hybrid Analytics in Healthcare: Leveraging Power BI and Office 365 to Make Sm...
PDF
Data management trends
PDF
DI&A Slides: Data Insights and Analytics Frameworks
IT + Line of Business - Driving Faster, Deeper Insights Together
Data Integration Trends Businesses Should Watch for in 2021
Best Practices to Deliver BI Solutions
Data Quality Management
DAMA June 2017 User Group presentation on ECM
Hybrid Analytics in Healthcare: Leveraging Power BI and Office 365 to Make Sm...
Data management trends
DI&A Slides: Data Insights and Analytics Frameworks

What's hot (20)

PPT
RWDG Slides: Apply Data Governance to Agile Efforts
PDF
Webinar: Data Quality, Data Engineering, and Data Science
PDF
Enterprise Architecture vs. Data Architecture
PPTX
Talent Base Case: Funster - Product MDM case
PDF
Slides: Enterprise Architecture vs. Data Architecture
PDF
Case Manager for Content Management - A Customer's Perspective
PDF
These Are The Data You Are Looking For
PPTX
Pitfalls and pro-tips for effective and transparent Business Intelligence too...
PPTX
7 steps for guides how to build a successful data strategy
PDF
Salesforce Master Data Management Webinar
PPTX
Toad Business Intelligence Suite
PDF
Metadata Governance for Vocabularies, Dictionaries, and Data
PDF
How Can You Calculate the Cost of Your Data?
PDF
Effective BI Portal Design Patterns to Drive High User Engagement
PDF
CDO Webinar: Coordinating Your Data Strategies – When Data Management Worlds ...
PDF
The Shifting Landscape of Data Integration
PPTX
Keys to Formulating an Effective Data Management Strategy in the Age of Data
PDF
Real-World Data Governance: Comparing World Class Solutions in Data Governanc...
PDF
RWDG Slides: Governing Data Governance and Master Metadata
PPTX
How to Streamline Complex, Data-Intensive SAP Materials and Product Data Proc...
RWDG Slides: Apply Data Governance to Agile Efforts
Webinar: Data Quality, Data Engineering, and Data Science
Enterprise Architecture vs. Data Architecture
Talent Base Case: Funster - Product MDM case
Slides: Enterprise Architecture vs. Data Architecture
Case Manager for Content Management - A Customer's Perspective
These Are The Data You Are Looking For
Pitfalls and pro-tips for effective and transparent Business Intelligence too...
7 steps for guides how to build a successful data strategy
Salesforce Master Data Management Webinar
Toad Business Intelligence Suite
Metadata Governance for Vocabularies, Dictionaries, and Data
How Can You Calculate the Cost of Your Data?
Effective BI Portal Design Patterns to Drive High User Engagement
CDO Webinar: Coordinating Your Data Strategies – When Data Management Worlds ...
The Shifting Landscape of Data Integration
Keys to Formulating an Effective Data Management Strategy in the Age of Data
Real-World Data Governance: Comparing World Class Solutions in Data Governanc...
RWDG Slides: Governing Data Governance and Master Metadata
How to Streamline Complex, Data-Intensive SAP Materials and Product Data Proc...
Ad

Viewers also liked (18)

PDF
Ds05 power bi
PDF
Ag03 agile culture - dnc14 handouts
PPTX
Fe02 ria con breeze e knockout
PPTX
Sys04 share point-yammer_social_collaboration
PPTX
Fe04 angular js-101
PPTX
Fr01 asp.net web api reloaded
PPTX
Mob04 best practices for windows phone ui design
PPTX
Sys02 best way to create a share point app
PDF
Mob02 windows phone 8.1 app development
PDF
Ag01 agile foundation - dnc14 handouts
PPTX
Cert02 70-410
PDF
Be01 web devclientvsserver
PPTX
Win05 accesso ai dati in win 8
PPTX
Unity3 d uitools
PPTX
Sys01 creare applicazioni virtuali con microsoft application virtualization...
PDF
Cert05 70-487 - developing microsoft azure and web services
PPTX
Mob03 what's new in windows phone
PPTX
Gam03 facciamo volare il nosro drone
Ds05 power bi
Ag03 agile culture - dnc14 handouts
Fe02 ria con breeze e knockout
Sys04 share point-yammer_social_collaboration
Fe04 angular js-101
Fr01 asp.net web api reloaded
Mob04 best practices for windows phone ui design
Sys02 best way to create a share point app
Mob02 windows phone 8.1 app development
Ag01 agile foundation - dnc14 handouts
Cert02 70-410
Be01 web devclientvsserver
Win05 accesso ai dati in win 8
Unity3 d uitools
Sys01 creare applicazioni virtuali con microsoft application virtualization...
Cert05 70-487 - developing microsoft azure and web services
Mob03 what's new in windows phone
Gam03 facciamo volare il nosro drone
Ad

Similar to Ds04 data quality (20)

PDF
DQS & MDS in SQL Server 2016
PPTX
Introduction to Master Data Services in SQL Server 2012
PPTX
Enterprise Information Management (EIM) in SQL Server 2012
PDF
Master Data Management's Place in the Data Governance Landscape
 
PPTX
Training_534231.pptx
PPTX
MDS & SQL 2012
PPTX
SQL Server 2019 Master Data Service
PDF
IT6701 Information Management - Unit III
DOCX
Introduction to master data services
PPTX
Optimizing Solution Value – Dynamic Data Quality, Governance, and MDM
PDF
OAUG 05-2009-MDM-1683-A Fiteni CPA, CMA
PPT
14178090.ppt
PPTX
Introduction to Microsoft’s Master Data Services (MDS)
PPTX
Master Data Management.pptx
PPT
Bad customer data?
PPTX
SQL 2012 Enterprise Information Management with DQS and MDS by Karan Gulati
PDF
Data Quality Success Stories
PPTX
IT6701-Information Management Unit 3
PPT
Data quality and bi
PDF
SQLSaturday #188 - Enterprise Information Management
DQS & MDS in SQL Server 2016
Introduction to Master Data Services in SQL Server 2012
Enterprise Information Management (EIM) in SQL Server 2012
Master Data Management's Place in the Data Governance Landscape
 
Training_534231.pptx
MDS & SQL 2012
SQL Server 2019 Master Data Service
IT6701 Information Management - Unit III
Introduction to master data services
Optimizing Solution Value – Dynamic Data Quality, Governance, and MDM
OAUG 05-2009-MDM-1683-A Fiteni CPA, CMA
14178090.ppt
Introduction to Microsoft’s Master Data Services (MDS)
Master Data Management.pptx
Bad customer data?
SQL 2012 Enterprise Information Management with DQS and MDS by Karan Gulati
Data Quality Success Stories
IT6701-Information Management Unit 3
Data quality and bi
SQLSaturday #188 - Enterprise Information Management

More from DotNetCampus (20)

PDF
ARCHITETTURA DI UN'APPLICAZIONE SCALABILE
PPTX
MICROSOFT E IL MONDO IOT
PPTX
70-485: ADVANCED OF DEVELOPING WINDOWS STORE APPS USING C#
PDF
70-534: ARCHITECTING MICROSOFT AZURE SOLUTIONS
PDF
70-483: PROGRAMMING IN C#
PPTX
DSTORIE DALLA TRINCEA: TEAM FOUNDATION SERVER IN CASI LIMITE E NON SOLO...
PPTX
TUTTO SU VISUAL STUDIO ALM 2015
PPTX
CONTINUOUS INTEGRATION CON SQL SERVER
PPTX
PREDICT THE FUTURE , MACHINE LEARNING & BIG DATA
PPTX
DESKTOP AND CLIENT VIRTUALIZATION: NEW WORKSTYLES WITH MICROSOFT VDI
PPTX
FROM ON-PREMISE TO THE HYBRID CLOUD WITH MICROSOFT AZURE
PPTX
SHAREPOINT 2016 - WHAT'S NEW
PPTX
COSTRUISCI IL TUO DEVICE
PPTX
SVILUPPARE PER MICROSOFT BAND
PPTX
INTERFACCE GRAFICHE CON UNITY3D 4.6: IL GIOCO NON BASTA!
PPTX
WINDOWS PHONE APPS IN C++
PPTX
AZURE NOTIFICATION HUB
PPTX
SFRUTTARE I MICROSOFT AZURE MOBILE SERVICES CON XAMARIN.FORMS
PPTX
INTRO TO XAMARIN
PPTX
UNIVERSAL APP IN TUTTE LE SALSE: PHONE, TABLET, PC, XBOX E IOT
ARCHITETTURA DI UN'APPLICAZIONE SCALABILE
MICROSOFT E IL MONDO IOT
70-485: ADVANCED OF DEVELOPING WINDOWS STORE APPS USING C#
70-534: ARCHITECTING MICROSOFT AZURE SOLUTIONS
70-483: PROGRAMMING IN C#
DSTORIE DALLA TRINCEA: TEAM FOUNDATION SERVER IN CASI LIMITE E NON SOLO...
TUTTO SU VISUAL STUDIO ALM 2015
CONTINUOUS INTEGRATION CON SQL SERVER
PREDICT THE FUTURE , MACHINE LEARNING & BIG DATA
DESKTOP AND CLIENT VIRTUALIZATION: NEW WORKSTYLES WITH MICROSOFT VDI
FROM ON-PREMISE TO THE HYBRID CLOUD WITH MICROSOFT AZURE
SHAREPOINT 2016 - WHAT'S NEW
COSTRUISCI IL TUO DEVICE
SVILUPPARE PER MICROSOFT BAND
INTERFACCE GRAFICHE CON UNITY3D 4.6: IL GIOCO NON BASTA!
WINDOWS PHONE APPS IN C++
AZURE NOTIFICATION HUB
SFRUTTARE I MICROSOFT AZURE MOBILE SERVICES CON XAMARIN.FORMS
INTRO TO XAMARIN
UNIVERSAL APP IN TUTTE LE SALSE: PHONE, TABLET, PC, XBOX E IOT

Recently uploaded (20)

PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
sap open course for s4hana steps from ECC to s4
PPTX
Cloud computing and distributed systems.
PDF
Electronic commerce courselecture one. Pdf
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Approach and Philosophy of On baking technology
PPT
Teaching material agriculture food technology
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Reach Out and Touch Someone: Haptics and Empathic Computing
Mobile App Security Testing_ A Comprehensive Guide.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
Diabetes mellitus diagnosis method based random forest with bat algorithm
sap open course for s4hana steps from ECC to s4
Cloud computing and distributed systems.
Electronic commerce courselecture one. Pdf
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
The Rise and Fall of 3GPP – Time for a Sabbatical?
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Unlocking AI with Model Context Protocol (MCP)
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Advanced methodologies resolving dimensionality complications for autism neur...
Programs and apps: productivity, graphics, security and other tools
Digital-Transformation-Roadmap-for-Companies.pptx
Approach and Philosophy of On baking technology
Teaching material agriculture food technology
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx

Ds04 data quality

  • 1. Previously known as Think Big. Move Fast.
  • 3. SolidQ • Born in 2002 in USA and Spain • Established in 2007 in Italy • More than 1000 customers and more than 200 consultants worldwide • Dedicated to Data Management on the Microsoft Platform • Books Authors, Conference Speakers, SQL Server MVPs and Regional Directors • www.solidq.com
  • 4. Davide Mauri • 18 Years of experience on the SQL Server Platform • Specialized in Data Solution Architecture, Database Design, Performance Tuning, Business Intelligence • Microsoft SQL Server MVP • President of UGISS (Italian SQL Server UG) • Mentor @ SolidQ • Video, Book & Article Author • Regular Speaker @ SQL Server events • Projects, Consulting, Mentoring & Training
  • 6. The BIG problem • What’s the key asset of a company? • Data that leads to Information and then to Knowledge • With the mass adoption of Business Intelligence / Analytics problems with Data Quality arises and become evident • Wrong, incomplete or incoherent data leads to wrong decisions • Managers cannot “trust” native data • Data needs to be reworked a lot in order to be usable • As per my experience, almost the 50% of the time spent developing a BI solution is use just to solve Data Quality problems.
  • 7. The BIG problem • A Gartner research states that «Organizations estimated they are losing an average of $8.2 million annually as a result of data quality issues” • 22% report estimated losses for $20 million • 4% report estimated losses for $100 million
  • 8. Data Quality Concepts Data Quality Issue Sample Data Problem Standard Are data elements consistently defined and understood ? Gender code = M, F, U in one system and Gender code = 0, 1, 2 in another system Complete Is all necessary data present ? 20% of customers’ last name is blank, 50% of zip-codes are 99999 Accurate Does the data accurately represent reality or a verifiable source? A Supplier is listed as ‘Active’ but went out of business six years ago Valid Do data values fall within acceptable ranges? Salary values should be between 60,000-120,000 Unique Data appears several times Both John Ryan and Jack Ryan appear in the system – are they the same person?
  • 9. Master Data Management • A solution to the problem is offered by Master Data Management (MDM) • Is a Discipline and a Process supported by Technology • MDM aim to discover and define non-transactional lists of data, with the goal of compiling maintainable master lists, that will become the reference data.
  • 10. Master Data Management • What are Master Data? • Master Data: “Slowly changing reference data shared across system” • Master Data != Transactional Data • Master Data != Metadata • Reference Data: • Products, Customers, Suppliers, Geography, ecc. • The Dimensions of a Data Warehouse 
  • 11. Master Data Services • Introduced with SQL Server 2008 R2 • With SQL Server 2012 al lots of improvements • Web Interface improved *a lot* • Silverlight based • Integrated with Excel 2007 and after • Killer Application! • Installed with SQL Server 2012 but must be configured prior usage • Needs IIS, WCF and so on… • No Changes in 2014
  • 12. Master Data Services • Allow the management and the definition of Master Data • “Model” Definition • Entities, Attributes, Hiearchies, ecc… • Business Rules • Data Stewardship • Through Excel Addin or the Web Portal • Integration • Batch and/or WCF Service
  • 13. What is Master Data Management? ERP CRM Warehouse MGMT Invoicing System BI Master Data Hub External System Integration Web Service Data Hub Data Steward
  • 15. Data Quality Services • A new Service introduced with SQL Server 2012 • Enable the verification of «new» data against a established Knowledge Base • Has its own client • Must be installed after SQL Server 2012 installation • «Data Quality Service Installer» • Three dedicated SQL Server database (DQS prefix)
  • 16. Data Quality Services • Help to • Define a Knowledge Base • Through «Knowledge Discovery» and «Domain Management» • Perform Data Cleasing & Data Matching (De-Duplication) • Integrated with • Integration Services • Master Data Services (via Excel Addin)
  • 17. Data Cleasing • Master (Reference) Data is needed for Data Cleansing • Can be provided directly from our data (Customer Names, for example) • Can be supplied by third party companies: Azure Market Place • *Very* nice feature.
  • 19. Identity Mapping & De-Duplication • DQS is not the only solution for Identity Mapping & De-Duplication • MDS has some built-in functions • SSIS Fuzzy Lookup is great for this • Great Performance & Results!
  • 21. Conclusions • Bad Data Quality means Bad Business • Start working on Data Quality ASAP • Define a business process to achieve Data quality • MDS and DQS will help to support it • Integration with existing application via • Batch • SOA • High Data Quality will be a must have!
  • 22. Link • Free Data Quality eBook http://bit.ly/mMJgKv
  • 23. Link • MDS Homepage http://msdn.microsoft.com/en-us/sqlserver/ff943581.aspx • 3rd Party Client Application http://profisee.com/ • Data Quality & Data Science http://www.solidq.com/consulting/
  • 24. Previously known as Think Big. Move Fast.