SlideShare a Scribd company logo
www.flydata.com
Cognitive Biases in
Data Science
Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com
Introduction
Copyright © 2014 FlyData Inc. All rights reserved.
● We often think of “data” as objective information
● In reality, data can be just as subjective as the
people who record it!
● In scientific fields especially…
○ empirical methods are used to observe nature
○ data should always be collected and
interpreted impartially
www.flydata.com
Introduction
Copyright © 2014 FlyData Inc. All rights reserved.
● Cognitive biases are an obstacle when trying to
interpret information
○ Can easily skew results
○ They are innate tendencies
● Here are 4 major biases that are known to have
considerable effects on research and science:
www.flydata.com
#1 Confirmation
Bias
Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com
Confirmation Bias
Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com
● Confirmation bias is the tendency to process
information in a way that confirms one’s
preconceptions or hypotheses.
○ Actively seek out and assign more value to
data that confirms our own hypotheses...
○ And ignore/understate evidence that could
mean otherwise!
Confirmation Bias
Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com
● You may have “good” preconceptions from an
educated intuition or previous experiences…
● But it’s not like that in many cases!
○ Can directly affect the results of a study
or analysis!
#2 Observation
Bias
Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com
Observation Bias
Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com
● The tendency to look in places where it is
expected to produce good results, or where it is
very convenient to observe
○ Easy accessibility/availability doesn’t mean
it’s the most important!
● The most available and known data source
may often be a good one…
○ But no data analysis is complete without a
complete picture of your data.
● Data science is about producing actionable
insights
○ If only the wrong things are being observed
and measured, you produce false insights!
Observation Bias
Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com
● To be an efficient researcher, perhaps it’s
best to frequently ask yourself these
questions:
○ “Am I measuring the right things?”
○ “Are there better sources from which to
get data from?”
#3 Funding Bias
Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com
Funding Bias
Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com
● Unconscious tendency to skew models, data,
or interpretations of data in a way that favors
the objectives of a financial sponsor or
employer.
○ Sometimes called sponsorship bias
● Any scientist/researcher should keep this in
mind
○ Unknowingly making a business decision
with flawed data will ultimately damage
sponsor!
○ Will damage your career
○ ..and it’s just bad science!
Example
Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com
● In the 1990’s, the tobacco industry funded a
number of research studies on the effects of
tobacco and smoking cigarettes
● After investigation, industry sponsors and
research centers were found to
○ Present findings in a misleading way
○ Withhold certain findings about the
relationships between smoking and
cancer
● This is a prime example of a funding bias.
#4 Sampling Bias
Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com
Sampling Bias
Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com
● In experimentation, we take a sample, which
should be representative of a whole population
○ Achieved by statistical techniques and well-
designed randomization
○ What happens if proper randomization isn’t
achieved?
● It’s not uncommon for researchers to have a
sampling bias
○ Selection of groups or data for
experimentation is unintentionally not
representative of the population
Sampling Bias
Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com
● No matter how big/diverse the sample is..
○ Always a possibility of inconsistency in
data/sample collection
● This bias also ties in with the other 3 biases!
○ If any of those biases affects the way in
which you collect samples, then you’re
also experiencing a sampling bias!
www.flydata.com www.flydata.com
Check us out!
-> http://flydata.com
sales@flydata.com
Toll Free: 1-855-427-9787
http://flydata.com
We are an official data integration
partner of Amazon Redshift

More Related Content

PDF
Achieving Analytical Excellence
PDF
10 commandments in rdm funder compliancy
PDF
Step Up Your Survey Research - Dawn of the Data Age Lecture Series
DOC
Unit 1 research methods worksheet for year 10
PPTX
Market research presentation
PPTX
5 ways innovative brands leverage social data insights (1)
PDF
EEDL_JUL23_Webinar_FINAL.pdf
Achieving Analytical Excellence
10 commandments in rdm funder compliancy
Step Up Your Survey Research - Dawn of the Data Age Lecture Series
Unit 1 research methods worksheet for year 10
Market research presentation
5 ways innovative brands leverage social data insights (1)
EEDL_JUL23_Webinar_FINAL.pdf

Similar to Cognitive Biases in Data Science (20)

PPT
Module Three (part 2)
PPTX
Dean R Berry Loss of Privacy: Necessary Evil or Unwanted Invasion Student Pro...
PDF
Turning Data into Infographics: An Interactive Workshop for Problem Solvers
PPTX
Unit 3: Research Techniques - LO3
PDF
Data science and ethics in fundraising
PPTX
Machine Learning - Startup weekend UCSB 2018
PDF
The Collaborative Design Process and Decision Engineering
PPT
Research Methods - an overview
PDF
Stop overthinking your Technology - Just Do it!
PPTX
Theory research
PDF
OSMC 2024 | A Guide to Responsible Data Collection In Open Source by Alex Biehl
PPT
Educational Technology (5)
PDF
Standards for a blue ocean
PDF
Research For PR And Advocacy
PPTX
Research Theory Pro-Forma-2 (1).pptx
PDF
Is it Fact or Fake News?
PDF
In:Confidence 2019 - Tools for privacy-aware data analysis
PDF
Get your own black van in the front yard series electronic dysoning- be the ...
DOC
Unit 1 research methods worksheet for year 10
PDF
Sharing Qualitative Data - Challenges and Opportunities
Module Three (part 2)
Dean R Berry Loss of Privacy: Necessary Evil or Unwanted Invasion Student Pro...
Turning Data into Infographics: An Interactive Workshop for Problem Solvers
Unit 3: Research Techniques - LO3
Data science and ethics in fundraising
Machine Learning - Startup weekend UCSB 2018
The Collaborative Design Process and Decision Engineering
Research Methods - an overview
Stop overthinking your Technology - Just Do it!
Theory research
OSMC 2024 | A Guide to Responsible Data Collection In Open Source by Alex Biehl
Educational Technology (5)
Standards for a blue ocean
Research For PR And Advocacy
Research Theory Pro-Forma-2 (1).pptx
Is it Fact or Fake News?
In:Confidence 2019 - Tools for privacy-aware data analysis
Get your own black van in the front yard series electronic dysoning- be the ...
Unit 1 research methods worksheet for year 10
Sharing Qualitative Data - Challenges and Opportunities
Ad

More from FlyData Inc. (11)

PPTX
What is Change Data Capture (CDC) and Why is it Important?
PPTX
What's So Unique About a Columnar Database?
PPTX
Three Things to Consider When Making Investments in Your Big Data Infrastructure
PPTX
How to Extract Data from Amazon Redshift
PPTX
Amazon Redshift - Create an Amazon Redshift Cluster
PPTX
The Internet of Things
PPTX
Create an Amazon Redshift Cluster with FlyData!
PPTX
Near Real-Time Data Analysis With FlyData
PPTX
FlyData Autoload: 事例集
PPTX
Scalability of Amazon Redshift Data Loading and Query Speed
PPT
Amazon Redshift ベンチマーク Hadoop + Hiveと比較
What is Change Data Capture (CDC) and Why is it Important?
What's So Unique About a Columnar Database?
Three Things to Consider When Making Investments in Your Big Data Infrastructure
How to Extract Data from Amazon Redshift
Amazon Redshift - Create an Amazon Redshift Cluster
The Internet of Things
Create an Amazon Redshift Cluster with FlyData!
Near Real-Time Data Analysis With FlyData
FlyData Autoload: 事例集
Scalability of Amazon Redshift Data Loading and Query Speed
Amazon Redshift ベンチマーク Hadoop + Hiveと比較
Ad

Recently uploaded (20)

PDF
Designing Intelligence for the Shop Floor.pdf
PDF
Design an Analysis of Algorithms II-SECS-1021-03
PDF
Tally Prime Crack Download New Version 5.1 [2025] (License Key Free
PPTX
Monitoring Stack: Grafana, Loki & Promtail
DOCX
Greta — No-Code AI for Building Full-Stack Web & Mobile Apps
PDF
Salesforce Agentforce AI Implementation.pdf
PDF
Digital Systems & Binary Numbers (comprehensive )
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
PDF
Complete Guide to Website Development in Malaysia for SMEs
PPTX
Log360_SIEM_Solutions Overview PPT_Feb 2020.pptx
PPTX
CHAPTER 2 - PM Management and IT Context
PPTX
history of c programming in notes for students .pptx
PDF
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
PDF
iTop VPN Free 5.6.0.5262 Crack latest version 2025
PDF
Download FL Studio Crack Latest version 2025 ?
PDF
How to Make Money in the Metaverse_ Top Strategies for Beginners.pdf
PDF
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
PPTX
Computer Software and OS of computer science of grade 11.pptx
PDF
Product Update: Alluxio AI 3.7 Now with Sub-Millisecond Latency
PPTX
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
Designing Intelligence for the Shop Floor.pdf
Design an Analysis of Algorithms II-SECS-1021-03
Tally Prime Crack Download New Version 5.1 [2025] (License Key Free
Monitoring Stack: Grafana, Loki & Promtail
Greta — No-Code AI for Building Full-Stack Web & Mobile Apps
Salesforce Agentforce AI Implementation.pdf
Digital Systems & Binary Numbers (comprehensive )
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
Complete Guide to Website Development in Malaysia for SMEs
Log360_SIEM_Solutions Overview PPT_Feb 2020.pptx
CHAPTER 2 - PM Management and IT Context
history of c programming in notes for students .pptx
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
iTop VPN Free 5.6.0.5262 Crack latest version 2025
Download FL Studio Crack Latest version 2025 ?
How to Make Money in the Metaverse_ Top Strategies for Beginners.pdf
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
Computer Software and OS of computer science of grade 11.pptx
Product Update: Alluxio AI 3.7 Now with Sub-Millisecond Latency
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx

Cognitive Biases in Data Science

  • 1. www.flydata.com Cognitive Biases in Data Science Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com
  • 2. Introduction Copyright © 2014 FlyData Inc. All rights reserved. ● We often think of “data” as objective information ● In reality, data can be just as subjective as the people who record it! ● In scientific fields especially… ○ empirical methods are used to observe nature ○ data should always be collected and interpreted impartially www.flydata.com
  • 3. Introduction Copyright © 2014 FlyData Inc. All rights reserved. ● Cognitive biases are an obstacle when trying to interpret information ○ Can easily skew results ○ They are innate tendencies ● Here are 4 major biases that are known to have considerable effects on research and science: www.flydata.com
  • 4. #1 Confirmation Bias Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com
  • 5. Confirmation Bias Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com ● Confirmation bias is the tendency to process information in a way that confirms one’s preconceptions or hypotheses. ○ Actively seek out and assign more value to data that confirms our own hypotheses... ○ And ignore/understate evidence that could mean otherwise!
  • 6. Confirmation Bias Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com ● You may have “good” preconceptions from an educated intuition or previous experiences… ● But it’s not like that in many cases! ○ Can directly affect the results of a study or analysis!
  • 7. #2 Observation Bias Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com
  • 8. Observation Bias Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com ● The tendency to look in places where it is expected to produce good results, or where it is very convenient to observe ○ Easy accessibility/availability doesn’t mean it’s the most important! ● The most available and known data source may often be a good one… ○ But no data analysis is complete without a complete picture of your data. ● Data science is about producing actionable insights ○ If only the wrong things are being observed and measured, you produce false insights!
  • 9. Observation Bias Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com ● To be an efficient researcher, perhaps it’s best to frequently ask yourself these questions: ○ “Am I measuring the right things?” ○ “Are there better sources from which to get data from?”
  • 10. #3 Funding Bias Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com
  • 11. Funding Bias Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com ● Unconscious tendency to skew models, data, or interpretations of data in a way that favors the objectives of a financial sponsor or employer. ○ Sometimes called sponsorship bias ● Any scientist/researcher should keep this in mind ○ Unknowingly making a business decision with flawed data will ultimately damage sponsor! ○ Will damage your career ○ ..and it’s just bad science!
  • 12. Example Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com ● In the 1990’s, the tobacco industry funded a number of research studies on the effects of tobacco and smoking cigarettes ● After investigation, industry sponsors and research centers were found to ○ Present findings in a misleading way ○ Withhold certain findings about the relationships between smoking and cancer ● This is a prime example of a funding bias.
  • 13. #4 Sampling Bias Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com
  • 14. Sampling Bias Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com ● In experimentation, we take a sample, which should be representative of a whole population ○ Achieved by statistical techniques and well- designed randomization ○ What happens if proper randomization isn’t achieved? ● It’s not uncommon for researchers to have a sampling bias ○ Selection of groups or data for experimentation is unintentionally not representative of the population
  • 15. Sampling Bias Copyright © 2014 FlyData Inc. All rights reserved. www.flydata.com ● No matter how big/diverse the sample is.. ○ Always a possibility of inconsistency in data/sample collection ● This bias also ties in with the other 3 biases! ○ If any of those biases affects the way in which you collect samples, then you’re also experiencing a sampling bias!
  • 16. www.flydata.com www.flydata.com Check us out! -> http://flydata.com sales@flydata.com Toll Free: 1-855-427-9787 http://flydata.com We are an official data integration partner of Amazon Redshift