SlideShare a Scribd company logo
Solving
DATA ‘RELIABILITY’ & ‘VALIDITY’
Problems with Analytics
BY
N ILANGO
Assistant Professor, Department of MBA
Sri Ramakrishna College of Arts & Science, CBE
Data Reliability & Data Validity
Solving Data Reliability & Data Validity issues ensures:
…… Business Decisions are based on………..
Accurate,
Consistent &
Meaningful …….. Data
Understanding the Concepts
 Data Reliability
Refers to consistency of data over time & across systems
If same input produces different outputs, data is not reliable
Eg: A customer’s name is spelled differently in two databases.
 Data Validity
Refers to accuracy & appropriateness of data for specific purpose
If data doesn't measure what it's supposed to, it is not valid
Eg: A survey asking for age but receiving zip codes instead
Solving Data ‘Reliability’ Problems with Analytics
A. Inconsistent Data Across Systems
Solution: Use data profiling & data integration tools
Method:
Analyze records across systems (e.g., CRM vs. ERP) to identify
mismatches
Use tools like ETL (Extract, Transform, Load) to harmonize data
formats
Eg:
Customer order data shows different totals in Sales & Finance
systems
Run reconciliation analytics to match transactions line-by-line &
flag inconsistencies
Solving Data ‘Reliability’ Problems with Analytics
B. Duplicates or Redundant Entries
Solution: Use de-duplication algorithms & fuzzy matching
Method:
Use rules like “Same phone number + similar name” to identify
likely duplicates
Analytics software (e.g., Power BI, Tableau Prep, Talend) can
visualize clusters
Eg:
Two entries: “Mohan Babu” & “M. Babu” with same phone
number
Flag & merge records using BI dashboard
Solving Data ‘Reliability’ Problems with Analytics
C. Time-based Inconsistencies
Solution: Perform time-series analysis
Method:
Look for unexpected data spikes, gaps or drops using trend lines
Eg:
Daily website traffic drops to zero suddenly — analytics helps
detect missing data imports
Solving Data ‘Validity’ Problems with Analytics
I. Incorrect Data Types or Ranges
Solution: Apply validation rules & data audits
Method:
Check if “age” is within realistic bounds (e.g., 18–99)
Use dashboards to highlight values outside normal ranges
Eg:
If a customer’s age is recorded as 543, then, Analytics
dashboard flags it as a red alert
Solving Data ‘Validity’ Problems with Analytics
II. Missing or Null Values
Solution: Use null analysis & imputation models
How:
Highlight % of missing data in each field
Use predictive models to fill gaps (e.g., estimate missing income
based on location & job)
Eg:
If, 30% of "Annual Income" values are left as blank, then,
Analytics fills in values using regression models
Solving Data ‘Validity’ Problems with Analytics
III. Out-of-context Responses
Solution: Use text analytics & semantic validation
Method:
If a survey asks for a city & the response is "Great!", NLP can
detect irrelevance
Eg:
Text analytics tags invalid responses & routes them for cleanup
or exclusion
END

More Related Content

PPTX
Lecture 1b - Diagnostic Analytics Intro and Purpose ver 1.0 (1).pptx
PPTX
Lecture 1b - Diagnostic Analytics Intro and Purpose ver 1.0.pptx
PPTX
Lecture 1b - Diagnostic Analytics Intro and Purpose ver 1.0 (1).pptx
PPTX
Lecture 1b - Diagnostic Analytics Intro and Purpose ver 1.0.pptx
PDF
Data Analytics Trainning in Mohali & Chandigarh
PPTX
Managing uncertainty in data - Presentation at Data Science Northeast Netherl...
PDF
Tips --Break Down the Barriers to Better Data Analytics
PPTX
Introduction to data science
Lecture 1b - Diagnostic Analytics Intro and Purpose ver 1.0 (1).pptx
Lecture 1b - Diagnostic Analytics Intro and Purpose ver 1.0.pptx
Lecture 1b - Diagnostic Analytics Intro and Purpose ver 1.0 (1).pptx
Lecture 1b - Diagnostic Analytics Intro and Purpose ver 1.0.pptx
Data Analytics Trainning in Mohali & Chandigarh
Managing uncertainty in data - Presentation at Data Science Northeast Netherl...
Tips --Break Down the Barriers to Better Data Analytics
Introduction to data science

Similar to SOLVING DATA RELIABILITY AND VALIDITY PROBLEMS WITH ANALYTICS (20)

PPTX
Health Informatics- Module 3-Chapter 3.pptx
PPTX
MADHU namaste to you too much to me and I am
PDF
Remediating Data Quality by using Data cleaning
PDF
Data quality
PDF
Data quality
PPTX
Data analytics
PPTX
NTEN Your Analytics doesn't have to be dramatic to be useful
PDF
Address common data analytics challenges with effective solutions.
PPTX
Gartner's Data Analytics Maturity Model.pptx
PDF
5 Common Data Science Challenges and Effective Solutions.pdf
PDF
Foundational Strategies for Trust in Big Data Part 2: Understanding Your Data
PPT
Data Preprocessing
PPTX
A predictive analytics primer
PPTX
Transform Your Downstream Cloud Analytics with Data Quality 
PDF
Data Profiling: The First Step to Big Data Quality
PPTX
Audit: Breaking Down Barriers to Increase the Use of Data Analytics
PDF
Data Analytics course in kerala .zoople technologies
PDF
Unveiling the Power of Data Analytics Transforming Insights into Action.pdf
PDF
lec1.pdf
PPTX
Introduction to Business Analytics Part 1
Health Informatics- Module 3-Chapter 3.pptx
MADHU namaste to you too much to me and I am
Remediating Data Quality by using Data cleaning
Data quality
Data quality
Data analytics
NTEN Your Analytics doesn't have to be dramatic to be useful
Address common data analytics challenges with effective solutions.
Gartner's Data Analytics Maturity Model.pptx
5 Common Data Science Challenges and Effective Solutions.pdf
Foundational Strategies for Trust in Big Data Part 2: Understanding Your Data
Data Preprocessing
A predictive analytics primer
Transform Your Downstream Cloud Analytics with Data Quality 
Data Profiling: The First Step to Big Data Quality
Audit: Breaking Down Barriers to Increase the Use of Data Analytics
Data Analytics course in kerala .zoople technologies
Unveiling the Power of Data Analytics Transforming Insights into Action.pdf
lec1.pdf
Introduction to Business Analytics Part 1
Ad

More from ilangonatarajan2 (19)

PPTX
Interpretation of Various TYPES OF BUDGETS
PPTX
MEANING, NEED, OBJECTIVES, CLASSIFICATION, ELEMENTS AND METHODS OF COST and M...
PPTX
MEANING, USES, TYPES AND EXAMPLES OF DIFFERENT PROFITABILITY Ratios
PPTX
Adjusted ‘Profit & Loss’ Statement in Financial Accounting
PPTX
CODIFICATION and GROUPING of Accounts in Financial Accounting
PPTX
NEED OF BUSINESS INTELLIGENCE AT ALL LEVELS
PPTX
BENEFITS OF USING ANALYTICAL INFORMATION FOR DECISION SUPPORT
PPTX
DATA SCIENCE IN SCIENCE, TECHNOLOGY, ENGINEERING AND MATHEMATICS
PDF
DEFINITION, MEANING, USES, TYPES, ERRORS & TESTS OF MEASUREMENT IN RESEARCH M...
PDF
USE, REAL TIME APPLICATION, SECTOR WISE APPLICATION, PROS & CONS OF TECHNOLOG...
PDF
MEANING, USES, APPLICATION & PURPOSE OF DIFFERENT TYPES OF VARIABLES IN RESEARCH
PDF
OBJECTIVES AND BENEFITS OF HOLDING INVENTORY
PDF
POTENTIAL CREDIT POLICY INCLUDING CREDIT ANALYSIS
PDF
KEY ROLE OF FINTECH IN CASH MANAGEMENT & CHALLENGES IN ADOPTING FINTECH
PDF
IMPACT OF DIGITAL TRANSFORMATION ON WORKING CAPITAL
PDF
Meaning, Need, Benefits & Steps to Implement - SRTATEGIC RISK ANALYTICS
PDF
Meaning, Need & Key Components of Country Risk
PDF
Meaning, Uses, Applications, Benefits & Examples of Banking Fraud Analytics
PDF
MEANING, TYPES AND IMPORTANCE OF FINANCIAL ANALYTICS
Interpretation of Various TYPES OF BUDGETS
MEANING, NEED, OBJECTIVES, CLASSIFICATION, ELEMENTS AND METHODS OF COST and M...
MEANING, USES, TYPES AND EXAMPLES OF DIFFERENT PROFITABILITY Ratios
Adjusted ‘Profit & Loss’ Statement in Financial Accounting
CODIFICATION and GROUPING of Accounts in Financial Accounting
NEED OF BUSINESS INTELLIGENCE AT ALL LEVELS
BENEFITS OF USING ANALYTICAL INFORMATION FOR DECISION SUPPORT
DATA SCIENCE IN SCIENCE, TECHNOLOGY, ENGINEERING AND MATHEMATICS
DEFINITION, MEANING, USES, TYPES, ERRORS & TESTS OF MEASUREMENT IN RESEARCH M...
USE, REAL TIME APPLICATION, SECTOR WISE APPLICATION, PROS & CONS OF TECHNOLOG...
MEANING, USES, APPLICATION & PURPOSE OF DIFFERENT TYPES OF VARIABLES IN RESEARCH
OBJECTIVES AND BENEFITS OF HOLDING INVENTORY
POTENTIAL CREDIT POLICY INCLUDING CREDIT ANALYSIS
KEY ROLE OF FINTECH IN CASH MANAGEMENT & CHALLENGES IN ADOPTING FINTECH
IMPACT OF DIGITAL TRANSFORMATION ON WORKING CAPITAL
Meaning, Need, Benefits & Steps to Implement - SRTATEGIC RISK ANALYTICS
Meaning, Need & Key Components of Country Risk
Meaning, Uses, Applications, Benefits & Examples of Banking Fraud Analytics
MEANING, TYPES AND IMPORTANCE OF FINANCIAL ANALYTICS
Ad

Recently uploaded (20)

PPTX
New ISO 27001_2022 standard and the changes
PPTX
sac 451hinhgsgshssjsjsjheegdggeegegdggddgeg.pptx
PDF
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
DOCX
Factor Analysis Word Document Presentation
PPTX
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
PPTX
A Complete Guide to Streamlining Business Processes
PDF
Introduction to the R Programming Language
PPTX
Introduction to Inferential Statistics.pptx
PDF
annual-report-2024-2025 original latest.
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPT
Predictive modeling basics in data cleaning process
PDF
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
PDF
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
PDF
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
PPTX
CYBER SECURITY the Next Warefare Tactics
PDF
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PDF
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
PDF
Global Data and Analytics Market Outlook Report
PPTX
Qualitative Qantitative and Mixed Methods.pptx
New ISO 27001_2022 standard and the changes
sac 451hinhgsgshssjsjsjheegdggeegegdggddgeg.pptx
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
Factor Analysis Word Document Presentation
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
A Complete Guide to Streamlining Business Processes
Introduction to the R Programming Language
Introduction to Inferential Statistics.pptx
annual-report-2024-2025 original latest.
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Predictive modeling basics in data cleaning process
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
CYBER SECURITY the Next Warefare Tactics
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
IBA_Chapter_11_Slides_Final_Accessible.pptx
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
Global Data and Analytics Market Outlook Report
Qualitative Qantitative and Mixed Methods.pptx

SOLVING DATA RELIABILITY AND VALIDITY PROBLEMS WITH ANALYTICS

  • 1. Solving DATA ‘RELIABILITY’ & ‘VALIDITY’ Problems with Analytics BY N ILANGO Assistant Professor, Department of MBA Sri Ramakrishna College of Arts & Science, CBE
  • 2. Data Reliability & Data Validity Solving Data Reliability & Data Validity issues ensures: …… Business Decisions are based on……….. Accurate, Consistent & Meaningful …….. Data Understanding the Concepts  Data Reliability Refers to consistency of data over time & across systems If same input produces different outputs, data is not reliable Eg: A customer’s name is spelled differently in two databases.  Data Validity Refers to accuracy & appropriateness of data for specific purpose If data doesn't measure what it's supposed to, it is not valid Eg: A survey asking for age but receiving zip codes instead
  • 3. Solving Data ‘Reliability’ Problems with Analytics A. Inconsistent Data Across Systems Solution: Use data profiling & data integration tools Method: Analyze records across systems (e.g., CRM vs. ERP) to identify mismatches Use tools like ETL (Extract, Transform, Load) to harmonize data formats Eg: Customer order data shows different totals in Sales & Finance systems Run reconciliation analytics to match transactions line-by-line & flag inconsistencies
  • 4. Solving Data ‘Reliability’ Problems with Analytics B. Duplicates or Redundant Entries Solution: Use de-duplication algorithms & fuzzy matching Method: Use rules like “Same phone number + similar name” to identify likely duplicates Analytics software (e.g., Power BI, Tableau Prep, Talend) can visualize clusters Eg: Two entries: “Mohan Babu” & “M. Babu” with same phone number Flag & merge records using BI dashboard
  • 5. Solving Data ‘Reliability’ Problems with Analytics C. Time-based Inconsistencies Solution: Perform time-series analysis Method: Look for unexpected data spikes, gaps or drops using trend lines Eg: Daily website traffic drops to zero suddenly — analytics helps detect missing data imports
  • 6. Solving Data ‘Validity’ Problems with Analytics I. Incorrect Data Types or Ranges Solution: Apply validation rules & data audits Method: Check if “age” is within realistic bounds (e.g., 18–99) Use dashboards to highlight values outside normal ranges Eg: If a customer’s age is recorded as 543, then, Analytics dashboard flags it as a red alert
  • 7. Solving Data ‘Validity’ Problems with Analytics II. Missing or Null Values Solution: Use null analysis & imputation models How: Highlight % of missing data in each field Use predictive models to fill gaps (e.g., estimate missing income based on location & job) Eg: If, 30% of "Annual Income" values are left as blank, then, Analytics fills in values using regression models
  • 8. Solving Data ‘Validity’ Problems with Analytics III. Out-of-context Responses Solution: Use text analytics & semantic validation Method: If a survey asks for a city & the response is "Great!", NLP can detect irrelevance Eg: Text analytics tags invalid responses & routes them for cleanup or exclusion
  • 9. END