SlideShare a Scribd company logo
Data and Data Collection
Quantitative – Numbers, tests, counting,
measuring
Fundamentally--2 types of data
Qualitative – Words, images,
observations, conversations, photographs
Data Collection Techniques
Observations,
Tests,
Surveys,
Document analysis
(the research literature)
Quantitative Methods
Experiment: Research situation with at
least one independent variable, which is
manipulated by the researcher
Independent Variable: The variable in the
study under consideration. The cause for
the outcome for the study.
Dependent Variable: The variable being
affected by the independent variable.
The effect of the study
y = f(x)
Which is which here?
Key Factors for High Quality
Experimental Design
Data should not be contaminated by poor
measurement or errors in procedure.
Eliminate confounding variables from study or
minimize effects on variables.
Representativeness: Does your sample
represent the population you are studying?
Must use random sample techniques.
What Makes a Good
Quantitative Research Design?
4 Key Elements
1. Freedom from Bias
2. Freedom from Confounding
3. Control of Extraneous Variables
4. Statistical Precision to Test Hypothesis
Bias: When observations favor some
individuals in the population over others.
Confounding: When the effects of two
or more variables cannot be separated.
Extraneous Variables: Any variable that
has an effect on the dependent variable.
Need to identify and minimize these variables.
e.g., Erosion potential as a function of clay content.
rainfall intensity, vegetation & duration would be
considered extraneous variables.
Precision versus accuracy
"Precise" means sharply defined or
measured.
"Accurate" means truthful or correct.
Accurate
Not precise
Neither accurate
nor precise
Not accurate
But precise
Both Accurate
and Precise
Interpreting Results of
Experiments
Goal of research is to draw conclusions.
What did the study mean?
What, if any, is the cause and effect of
the outcome?
Introduction to Sampling
Sampling is the problem of accurately
acquiring the necessary data in order to
form a representative view of the
problem.
This is much more difficult to do than is
generally realized.
Overall Methodology:
* State the objectives of the survey
* Define the target population
* Define the data to be collected
* Define the variables to be determined
* Define the required precision & accuracy
* Define the measurement `instrument'
* Define the sample size & sampling method,
then select the sample
Sampling
Distributions:
When you form a sample you often show
it by a plotted distribution known as a
histogram .
A histogram is the distribution of
frequency of occurrence of a certain
variable within a specified range.
NOT A BAR GRAPH WHICH LOOKS VERY SIMILAR
Data and Types of Data (Statistics and Probability)
Data and Types of Data (Statistics and Probability)
Interpreting quantitative
findings
Descriptive Statistics : Mean, median,
mode, frequencies
Error analyses
Mean
• In science the term mean is really the
arithmetic mean
• Given by the equation
• X = 1
/n xi
n
i=1
Or more simply put, the sum of values divided by the
number of values summed
Median
• Consider the set
• 1, 1, 2, 2, 3, 6, 7, 11, 11, 13, 14, 16, 19
– In this case there are 13 values so the median is
the middle value, or (n+1) / 2
– (13+1) /2 = 7
• Consider the set
• 1, 1, 2, 2, 3, 6, 7, 11, 11, 13, 14, 16
– In the second case, the mean of the two middle
values is the median or (n+1) /2
(12 + 1) / 2 = 6.5 ~ (6+7) / 2 = 6.5
Or more simply put the mid value separating all
values in the upper 1/2 of the values from those
in the lower half of the values
Mode
The most frequent value in a data set
• Consider the set
• 1, 1, 1, 1, 2, 2, 3, 6, 11, 11, 11, 13, 14, 16, 19
– In this case the mode is 1 because it is the most
common value
• There may be cases where there are more than
one mode as in this case
• Consider the set
• 1, 1, 1, 1, 2, 2, 3, 6, 11, 11, 11, 11, 13, 14, 16, 19
– In this case there are two modes (bimodal) : 1 and 11
because both occur 4 times in the data set.

More Related Content

PPT
Data collection
PPT
Data and Data Collection in Data Science.ppt
PPT
UNIT I -Data and Data Collection1.ppt
PPT
UNIT I -Data and Data Collection1.ppt
PPTX
INTRODUCTION OF STATISTICS FINAL YEAR VIII SEM
PPTX
CABT Math 8 measures of central tendency and dispersion
PPTX
Planning-Data-Analysis-Using-Statistics_20241016_063349_0000.pptx
PPT
RM chapter-4 (3).ppt measurements and descriptive
Data collection
Data and Data Collection in Data Science.ppt
UNIT I -Data and Data Collection1.ppt
UNIT I -Data and Data Collection1.ppt
INTRODUCTION OF STATISTICS FINAL YEAR VIII SEM
CABT Math 8 measures of central tendency and dispersion
Planning-Data-Analysis-Using-Statistics_20241016_063349_0000.pptx
RM chapter-4 (3).ppt measurements and descriptive

Similar to Data and Types of Data (Statistics and Probability) (20)

PPTX
UNIT III Central tendency measure of dispersion.pptx
PPT
Intro to Biostat. ppt
PPTX
BIOSTATISTICS AND RESEARCH METHODOLOGY
PPTX
Descriptive Statistics: Mean, Median Mode and Standard Deviation.
PPTX
Business Statistics for Managers with SPSS[1].pptx
PPT
data_management_review_descriptive_statistics.ppt
PPTX
Introduction to Educational statistics and measurement
PPTX
Stat and prob a recap
PDF
3. measures of central tendency
PPTX
STATISTICS.pptx for the scholars and students
PPTX
Machine learning pre requisite
PPT
Introduction to Biostatistics_20_4_17.ppt
PPTX
scope and need of biostatics
PDF
4.-Science-of-Statistics-Part-1 0(1).pdf
PPTX
Data analysis
PPTX
biostatistics
PPTX
Measures in Statistics. kjc.pptx
PPTX
Measures of central tendency and dispersion
PPT
Chapter 11 Psrm
PPT
UNIT III Central tendency measure of dispersion.pptx
Intro to Biostat. ppt
BIOSTATISTICS AND RESEARCH METHODOLOGY
Descriptive Statistics: Mean, Median Mode and Standard Deviation.
Business Statistics for Managers with SPSS[1].pptx
data_management_review_descriptive_statistics.ppt
Introduction to Educational statistics and measurement
Stat and prob a recap
3. measures of central tendency
STATISTICS.pptx for the scholars and students
Machine learning pre requisite
Introduction to Biostatistics_20_4_17.ppt
scope and need of biostatics
4.-Science-of-Statistics-Part-1 0(1).pdf
Data analysis
biostatistics
Measures in Statistics. kjc.pptx
Measures of central tendency and dispersion
Chapter 11 Psrm
Ad

More from RomualdoDayrit1 (17)

PPTX
Division of Polynomials Using 2 Long Division and Synthetic Division Method
PPTX
How to overcome SINS thru Gospels from the Bible
PPTX
Q3_W1_Exploring_Random_Variables.Powerpoint
PPTX
CAL-11-8848-202402011402129972 Calculuss6028.pptx
PPT
Integers and its application in life situations
PPTX
TYPE OF FACTORING (SPECIAL PRODUCTS AND EVERYTHING)
PPT
Graphs and properties of Tangent and Cotangent
PPTX
GenMath-Q2-W9-categorical-syllogism.pptx
PPTX
graphs of tangent and cotangent function
PPT
Special topics about stocks and bonds using algebra
PPTX
atomic structure...pptx
PPTX
Human gene therapy.pptx
PPT
DEMO.ppt
PPT
stat group mean.ppt
PPTX
Interlanguage.pptx
PPTX
COMMUNITY ECOLOGY.pptx
PPTX
COVID -19 PANDEMIC + CHARMAINE NUGUID.pptx
Division of Polynomials Using 2 Long Division and Synthetic Division Method
How to overcome SINS thru Gospels from the Bible
Q3_W1_Exploring_Random_Variables.Powerpoint
CAL-11-8848-202402011402129972 Calculuss6028.pptx
Integers and its application in life situations
TYPE OF FACTORING (SPECIAL PRODUCTS AND EVERYTHING)
Graphs and properties of Tangent and Cotangent
GenMath-Q2-W9-categorical-syllogism.pptx
graphs of tangent and cotangent function
Special topics about stocks and bonds using algebra
atomic structure...pptx
Human gene therapy.pptx
DEMO.ppt
stat group mean.ppt
Interlanguage.pptx
COMMUNITY ECOLOGY.pptx
COVID -19 PANDEMIC + CHARMAINE NUGUID.pptx
Ad

Recently uploaded (20)

PPTX
CME 2 Acute Chest Pain preentation for education
PPT
Copy-Histopathology Practical by CMDA ESUTH CHAPTER(0) - Copy.ppt
PPTX
ACID BASE management, base deficit correction
PPTX
SKIN Anatomy and physiology and associated diseases
PPTX
POLYCYSTIC OVARIAN SYNDROME.pptx by Dr( med) Charles Amoateng
DOC
Adobe Premiere Pro CC Crack With Serial Key Full Free Download 2025
PPTX
surgery guide for USMLE step 2-part 1.pptx
PPT
STD NOTES INTRODUCTION TO COMMUNITY HEALT STRATEGY.ppt
PPTX
post stroke aphasia rehabilitation physician
PDF
شيت_عطا_0000000000000000000000000000.pdf
PPTX
Important Obstetric Emergency that must be recognised
PPT
OPIOID ANALGESICS AND THEIR IMPLICATIONS
PPT
ASRH Presentation for students and teachers 2770633.ppt
PDF
Human Health And Disease hggyutgghg .pdf
PPT
1b - INTRODUCTION TO EPIDEMIOLOGY (comm med).ppt
PPTX
neonatal infection(7392992y282939y5.pptx
PDF
Medical Evidence in the Criminal Justice Delivery System in.pdf
PPTX
NEET PG 2025 Pharmacology Recall | Real Exam Questions from 3rd August with D...
PPTX
DENTAL CARIES FOR DENTISTRY STUDENT.pptx
PPTX
ca esophagus molecula biology detailaed molecular biology of tumors of esophagus
CME 2 Acute Chest Pain preentation for education
Copy-Histopathology Practical by CMDA ESUTH CHAPTER(0) - Copy.ppt
ACID BASE management, base deficit correction
SKIN Anatomy and physiology and associated diseases
POLYCYSTIC OVARIAN SYNDROME.pptx by Dr( med) Charles Amoateng
Adobe Premiere Pro CC Crack With Serial Key Full Free Download 2025
surgery guide for USMLE step 2-part 1.pptx
STD NOTES INTRODUCTION TO COMMUNITY HEALT STRATEGY.ppt
post stroke aphasia rehabilitation physician
شيت_عطا_0000000000000000000000000000.pdf
Important Obstetric Emergency that must be recognised
OPIOID ANALGESICS AND THEIR IMPLICATIONS
ASRH Presentation for students and teachers 2770633.ppt
Human Health And Disease hggyutgghg .pdf
1b - INTRODUCTION TO EPIDEMIOLOGY (comm med).ppt
neonatal infection(7392992y282939y5.pptx
Medical Evidence in the Criminal Justice Delivery System in.pdf
NEET PG 2025 Pharmacology Recall | Real Exam Questions from 3rd August with D...
DENTAL CARIES FOR DENTISTRY STUDENT.pptx
ca esophagus molecula biology detailaed molecular biology of tumors of esophagus

Data and Types of Data (Statistics and Probability)

  • 1. Data and Data Collection Quantitative – Numbers, tests, counting, measuring Fundamentally--2 types of data Qualitative – Words, images, observations, conversations, photographs
  • 3. Quantitative Methods Experiment: Research situation with at least one independent variable, which is manipulated by the researcher
  • 4. Independent Variable: The variable in the study under consideration. The cause for the outcome for the study. Dependent Variable: The variable being affected by the independent variable. The effect of the study y = f(x) Which is which here?
  • 5. Key Factors for High Quality Experimental Design Data should not be contaminated by poor measurement or errors in procedure. Eliminate confounding variables from study or minimize effects on variables. Representativeness: Does your sample represent the population you are studying? Must use random sample techniques.
  • 6. What Makes a Good Quantitative Research Design? 4 Key Elements 1. Freedom from Bias 2. Freedom from Confounding 3. Control of Extraneous Variables 4. Statistical Precision to Test Hypothesis
  • 7. Bias: When observations favor some individuals in the population over others. Confounding: When the effects of two or more variables cannot be separated. Extraneous Variables: Any variable that has an effect on the dependent variable. Need to identify and minimize these variables. e.g., Erosion potential as a function of clay content. rainfall intensity, vegetation & duration would be considered extraneous variables.
  • 8. Precision versus accuracy "Precise" means sharply defined or measured. "Accurate" means truthful or correct.
  • 9. Accurate Not precise Neither accurate nor precise Not accurate But precise Both Accurate and Precise
  • 10. Interpreting Results of Experiments Goal of research is to draw conclusions. What did the study mean? What, if any, is the cause and effect of the outcome?
  • 11. Introduction to Sampling Sampling is the problem of accurately acquiring the necessary data in order to form a representative view of the problem. This is much more difficult to do than is generally realized.
  • 12. Overall Methodology: * State the objectives of the survey * Define the target population * Define the data to be collected * Define the variables to be determined * Define the required precision & accuracy * Define the measurement `instrument' * Define the sample size & sampling method, then select the sample
  • 13. Sampling Distributions: When you form a sample you often show it by a plotted distribution known as a histogram . A histogram is the distribution of frequency of occurrence of a certain variable within a specified range. NOT A BAR GRAPH WHICH LOOKS VERY SIMILAR
  • 16. Interpreting quantitative findings Descriptive Statistics : Mean, median, mode, frequencies Error analyses
  • 17. Mean • In science the term mean is really the arithmetic mean • Given by the equation • X = 1 /n xi n i=1 Or more simply put, the sum of values divided by the number of values summed
  • 18. Median • Consider the set • 1, 1, 2, 2, 3, 6, 7, 11, 11, 13, 14, 16, 19 – In this case there are 13 values so the median is the middle value, or (n+1) / 2 – (13+1) /2 = 7 • Consider the set • 1, 1, 2, 2, 3, 6, 7, 11, 11, 13, 14, 16 – In the second case, the mean of the two middle values is the median or (n+1) /2 (12 + 1) / 2 = 6.5 ~ (6+7) / 2 = 6.5 Or more simply put the mid value separating all values in the upper 1/2 of the values from those in the lower half of the values
  • 19. Mode The most frequent value in a data set • Consider the set • 1, 1, 1, 1, 2, 2, 3, 6, 11, 11, 11, 13, 14, 16, 19 – In this case the mode is 1 because it is the most common value • There may be cases where there are more than one mode as in this case • Consider the set • 1, 1, 1, 1, 2, 2, 3, 6, 11, 11, 11, 11, 13, 14, 16, 19 – In this case there are two modes (bimodal) : 1 and 11 because both occur 4 times in the data set.