SlideShare a Scribd company logo
5
Most read
6
Most read
INTRODUCTION TO
STATISTICS & PROBABILITY
Chapter 3: Producing Data
(Part 3)
Dr. Nahid Sultana
1
Chapter 3: Producing Data
Introduction
3.1 Design of Experiments
3.2 Sampling Design
3.3 Toward Statistical Inference
3.4 Ethics
2
3.3 Toward Statistical Inference
3
 Parameters and Statistics
 Sampling Variability
 Sampling Distribution
 Bias and Variability
 Sampling from Large Populations
4
Parameters and Statistics
Using samples to talk about populations
A parameter is a number that describes some characteristic of the population.
In statistical practice, the value of a parameter is not known because we cannot
examine the entire population.
Name Symbol Example
Mean µ In a nationwide test, what is the average score?
Proportion p What proportion of people choose chocolate as their favorite ice cream
flavor?
Name Symbol Example
Sample Mean Sample mean of 100 test scores
Sample
Proportion
Sample proportion of 100 people who choose chocolate
as their favorite ice cream flavor?
x
We answer such questions by studying a sample….
A statistic is a number that describes some characteristic of a sample. The
value of a statistic can be computed directly from the sample data.
p

5
Parameters and Statistics
Examples:
 Proportion of all students who attended the last home football game.
Parameter, p
 Proportion of registered voters who voted in November.
Parameter, p
 Mean height of a sample of NBA basketball players.
Statistics,
 Mean SAT of entering freshmen
Parameter, µ
 Proportion of people who prefer Coke over Pepsi in a sample of mall
shoppers
Statistics,
 Mean number of pepperoni slices on a 12̎ pizza from a sample of a
certain brand of pepperoni pizzas.
Statistics, x
x
6
Statistical Estimation
 The process of statistical inference involves using information
from a sample to draw conclusions about a wider population.
 Your estimate of the population is only as good as your sampling
design.
 Work hard to eliminate biases.
 Your sample is only an estimate—and if you randomly sampled
again you would probably get a somewhat different result.
 Bigger sample is better.
7
Sampling Variability
 Each time we take a random sample from a population, we are
likely to get a different set of individuals and calculate a different
statistic. This is called sampling variability.
 We ask, “What would happen if we took many samples?”
 Take a large number of samples from the same population.
 Calculate the sample mean/proportion for each sample.
 Make a histogram of these values.
 Examine the distribution displayed in the histogram for shape,
center, and spread, as well as outliers or other deviations.
8
Sampling Variability (Cont…)
 The sampling distribution of a statistic is the distribution of that
statistic for samples of a given size n taken from the same
population.
The variability of a statistic is described by the spread of its
sampling distribution. This spread depends on the sampling design
and the sample size n, with larger sample sizes leading to lower
variability.
9
The results of many SRSs have a regular pattern. Here, we draw 1000 SRSs
of size 100 from the same population. The population proportion is p = 0.60.
The histogram shows the distribution of the 1000 sample proportions.
The distribution of sample proportions for 1000 SRSs of size 2500 drawn
from the same population as in first figure. The two histograms have the same
scale. The statistic from the larger sample is less variable.
10
Both bias and variability describe what happens when we take many
shots at the target.
Bias concerns the center of the sampling
distribution. A statistic used to estimate a
parameter is unbiased if the mean of its
sampling distribution is equal to the true
value of the parameter being estimated.
The variability of a statistic is described
by the spread of its sampling distribution.
This spread is determined by the sampling
design and the sample size n. Statistics
from larger probability samples have
smaller spreads.10
Bias and Variability
11
A good sampling scheme must have both small bias and small variability.
To reduce bias, use random sampling.
To reduce variability of a statistic from an SRS, use a larger sample.
Managing Bias and Variability
POPULATION SIZE DOESN’T MATTER
The variability of a statistic from a random sample does not depend
on the size of the population, as long as the population is at least
100 times larger than the sample.
12
3.4 Ethics
 Institutional Review Boards
 Informed Consent
 Confidentiality
 Clinical Trials
 Behavioral and Social Science Experiments
13
Institutional Review Boards
 The organization that carries out the study must have an
institutional review board that reviews all planned studies in
advance in order to protect the subjects from possible harm.
 The institutional review board:
 reviews the plan of study
 can require changes
 reviews the consent form
 monitors progress at least once a year
14
Informed Consent
 All subjects must give their informed consent before data are
collected.
 Subjects must be informed in advance about the nature of a study
and any risk of harm it might bring.
 Subjects must then consent in writing.
 Who can’t give informed consent?
 prison inmates
 very young children
 people with mental disorders
15
Confidentiality
 All individual data must be kept confidential. Only statistical
summaries may be made public.
 Confidentiality is not the same as anonymity. Anonymity means
that subjects are anonymous—their names are not known even to
the director of the study. Anonymity prevents follow-ups to
improve non-response or inform subjects of results.
 Any breach of confidentiality is a serious violation of data ethics.
 The best practice is to separate the identity of the subjects from
the rest of the data immediately!
16
Clinical Trials
 Clinical trials study the effectiveness of medical treatments on actual
patients—these treatments can harm as well as heal.
 Points for a discussion:
 Randomized comparative experiments are the only way to
see the true effects of new treatments.
 Most benefits of clinical trials go to future patients. We must
balance future benefits against present risks.
17
Behavioral and Social Science
Experiments
 Many behavioral experiments rely on hiding the true purpose of the
study.
 Subjects would change their behavior if told in advance what
investigators were looking for.
 The “Ethical Principles” of the American Psychological Association
require consent unless a study only observes behavior in a public
space.

More Related Content

PDF
Chapter 5 part1- The Sampling Distribution of a Sample Mean
PPTX
Sampling Distribution
PPTX
Business statistic ii
PPT
The sampling distribution
PDF
Chapter 6 part1- Introduction to Inference-Estimating with Confidence (Introd...
PDF
Sampling Distribution and Simulation in R
PDF
Normal and standard normal distribution
PDF
Determination and Analysis of Sample size
Chapter 5 part1- The Sampling Distribution of a Sample Mean
Sampling Distribution
Business statistic ii
The sampling distribution
Chapter 6 part1- Introduction to Inference-Estimating with Confidence (Introd...
Sampling Distribution and Simulation in R
Normal and standard normal distribution
Determination and Analysis of Sample size

What's hot (20)

PPT
Sampling in Statistical Inference
PPT
Sampling 1231243290208505 1
PPTX
Estimating a Population Proportion
PPT
Powerpoint sampling distribution
PDF
A Lecture on Sample Size and Statistical Inference for Health Researchers
PPTX
Sampling and sampling distributions
PPTX
Statistical inference concept, procedure of hypothesis testing
PPTX
Sampling and statistical inference
PPTX
Chap06 sampling and sampling distributions
PPTX
Sampling distribution
PPTX
Theory of estimation
PPTX
Statistics - Basics
PPTX
Inferential Statistics
DOCX
Estimation in statistics
PPTX
Sample Size Determination
PPT
statistical estimation
PPTX
Statistical Estimation
PPT
Telesidang 4 bab_8_9_10stst
PPTX
Standard error
Sampling in Statistical Inference
Sampling 1231243290208505 1
Estimating a Population Proportion
Powerpoint sampling distribution
A Lecture on Sample Size and Statistical Inference for Health Researchers
Sampling and sampling distributions
Statistical inference concept, procedure of hypothesis testing
Sampling and statistical inference
Chap06 sampling and sampling distributions
Sampling distribution
Theory of estimation
Statistics - Basics
Inferential Statistics
Estimation in statistics
Sample Size Determination
statistical estimation
Statistical Estimation
Telesidang 4 bab_8_9_10stst
Standard error
Ad

Viewers also liked (20)

PDF
Chapter-4: More on Direct Proof and Proof by Contrapositive
PDF
Chapter 2 part1-Scatterplots
PDF
Displaying Distributions with Graphs
PPT
1.1 to 1.3
PPT
Портрет слова группа 1
PDF
Expecting Parents Guide to Birth Defects ebook
PDF
Citrus College Sample Work
PDF
AGORA enables security companies to sell innovative remote services
PPTX
10 b klass
DOC
REPORT OF OYO SEMO[1]
PPT
proekti
PDF
Receiving your State Pension abroad
PPT
Snowmen from POland
PDF
2016: A good year to invest in Spanish property?
PPTX
GIDS 2016 Understanding and Building No SQLs
DOCX
Report submitted to (1)
PPTX
Health literacy
PPTX
Ang aking pananaw sa pamilya
PDF
DMDL EditorXとToad Editorの紹介
PPTX
Analysing problems creatively final
Chapter-4: More on Direct Proof and Proof by Contrapositive
Chapter 2 part1-Scatterplots
Displaying Distributions with Graphs
1.1 to 1.3
Портрет слова группа 1
Expecting Parents Guide to Birth Defects ebook
Citrus College Sample Work
AGORA enables security companies to sell innovative remote services
10 b klass
REPORT OF OYO SEMO[1]
proekti
Receiving your State Pension abroad
Snowmen from POland
2016: A good year to invest in Spanish property?
GIDS 2016 Understanding and Building No SQLs
Report submitted to (1)
Health literacy
Ang aking pananaw sa pamilya
DMDL EditorXとToad Editorの紹介
Analysing problems creatively final
Ad

Similar to Chapter 3 part3-Toward Statistical Inference (20)

PPTX
5_lectureslides.pptx
PPTX
Sampling Variability And The Precision Of A Sample by Dr Sindhu Almas copy.pptx
PPTX
Basic concept of statistics
PPTX
Research techniques; samling and ethics elt
PDF
Sample Size Determination.23.11.2021.pdf
PDF
Biostaticstics, Application of Biostaticstics
PPTX
Sampling for research
PDF
HEALTHCARE RESEARCH METHODS: Primary Studies: Selecting a Sample Population a...
PPTX
Unit 6 sampling techniques
PPTX
CO 3. Hypothesis Testing which is basicl
PPT
Soni_Biostatistics.ppt
PPTX
1.1 statistical and critical thinking
PPT
Sample size
PPT
Sampling and Types of sampling are given along with questions.ppt
PPT
CH 3 Sampling (3).pptx.ppt
PPTX
Basic of Statistical Inference Part-I
DOCX
Sampling
PPT
Sample size
PPT
sampling methods
5_lectureslides.pptx
Sampling Variability And The Precision Of A Sample by Dr Sindhu Almas copy.pptx
Basic concept of statistics
Research techniques; samling and ethics elt
Sample Size Determination.23.11.2021.pdf
Biostaticstics, Application of Biostaticstics
Sampling for research
HEALTHCARE RESEARCH METHODS: Primary Studies: Selecting a Sample Population a...
Unit 6 sampling techniques
CO 3. Hypothesis Testing which is basicl
Soni_Biostatistics.ppt
1.1 statistical and critical thinking
Sample size
Sampling and Types of sampling are given along with questions.ppt
CH 3 Sampling (3).pptx.ppt
Basic of Statistical Inference Part-I
Sampling
Sample size
sampling methods

More from nszakir (15)

PDF
Chapter-3: DIRECT PROOF AND PROOF BY CONTRAPOSITIVE
PDF
Chapter 2: Relations
PDF
Chapter 7 : Inference for Distributions(The t Distributions, One-Sample t Con...
PDF
Chapter 6 part2-Introduction to Inference-Tests of Significance, Stating Hyp...
PDF
Chapter 5 part2- Sampling Distributions for Counts and Proportions (Binomial ...
PDF
Chapter 4 part4- General Probability Rules
PDF
Chapter 4 part3- Means and Variances of Random Variables
PDF
Chapter 4 part2- Random Variables
PDF
Chapter 4 part1-Probability Model
PDF
Chapter 3 part2- Sampling Design
PDF
Chapter 3 part1-Design of Experiments
PDF
Chapter 2 part2-Correlation
PDF
Chapter 2 part3-Least-Squares Regression
PDF
Density Curves and Normal Distributions
PDF
Describing Distributions with Numbers
Chapter-3: DIRECT PROOF AND PROOF BY CONTRAPOSITIVE
Chapter 2: Relations
Chapter 7 : Inference for Distributions(The t Distributions, One-Sample t Con...
Chapter 6 part2-Introduction to Inference-Tests of Significance, Stating Hyp...
Chapter 5 part2- Sampling Distributions for Counts and Proportions (Binomial ...
Chapter 4 part4- General Probability Rules
Chapter 4 part3- Means and Variances of Random Variables
Chapter 4 part2- Random Variables
Chapter 4 part1-Probability Model
Chapter 3 part2- Sampling Design
Chapter 3 part1-Design of Experiments
Chapter 2 part2-Correlation
Chapter 2 part3-Least-Squares Regression
Density Curves and Normal Distributions
Describing Distributions with Numbers

Recently uploaded (20)

PPTX
Institutional Correction lecture only . . .
PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
PDF
01-Introduction-to-Information-Management.pdf
PDF
TR - Agricultural Crops Production NC III.pdf
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PPTX
human mycosis Human fungal infections are called human mycosis..pptx
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PPTX
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
Pre independence Education in Inndia.pdf
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PPTX
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PPTX
master seminar digital applications in india
PDF
Microbial disease of the cardiovascular and lymphatic systems
PDF
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
PDF
Basic Mud Logging Guide for educational purpose
PDF
Classroom Observation Tools for Teachers
Institutional Correction lecture only . . .
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
01-Introduction-to-Information-Management.pdf
TR - Agricultural Crops Production NC III.pdf
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
human mycosis Human fungal infections are called human mycosis..pptx
2.FourierTransform-ShortQuestionswithAnswers.pdf
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
Pre independence Education in Inndia.pdf
Final Presentation General Medicine 03-08-2024.pptx
Module 4: Burden of Disease Tutorial Slides S2 2025
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
master seminar digital applications in india
Microbial disease of the cardiovascular and lymphatic systems
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
Basic Mud Logging Guide for educational purpose
Classroom Observation Tools for Teachers

Chapter 3 part3-Toward Statistical Inference

  • 1. INTRODUCTION TO STATISTICS & PROBABILITY Chapter 3: Producing Data (Part 3) Dr. Nahid Sultana 1
  • 2. Chapter 3: Producing Data Introduction 3.1 Design of Experiments 3.2 Sampling Design 3.3 Toward Statistical Inference 3.4 Ethics 2
  • 3. 3.3 Toward Statistical Inference 3  Parameters and Statistics  Sampling Variability  Sampling Distribution  Bias and Variability  Sampling from Large Populations
  • 4. 4 Parameters and Statistics Using samples to talk about populations A parameter is a number that describes some characteristic of the population. In statistical practice, the value of a parameter is not known because we cannot examine the entire population. Name Symbol Example Mean µ In a nationwide test, what is the average score? Proportion p What proportion of people choose chocolate as their favorite ice cream flavor? Name Symbol Example Sample Mean Sample mean of 100 test scores Sample Proportion Sample proportion of 100 people who choose chocolate as their favorite ice cream flavor? x We answer such questions by studying a sample…. A statistic is a number that describes some characteristic of a sample. The value of a statistic can be computed directly from the sample data. p 
  • 5. 5 Parameters and Statistics Examples:  Proportion of all students who attended the last home football game. Parameter, p  Proportion of registered voters who voted in November. Parameter, p  Mean height of a sample of NBA basketball players. Statistics,  Mean SAT of entering freshmen Parameter, µ  Proportion of people who prefer Coke over Pepsi in a sample of mall shoppers Statistics,  Mean number of pepperoni slices on a 12̎ pizza from a sample of a certain brand of pepperoni pizzas. Statistics, x x
  • 6. 6 Statistical Estimation  The process of statistical inference involves using information from a sample to draw conclusions about a wider population.  Your estimate of the population is only as good as your sampling design.  Work hard to eliminate biases.  Your sample is only an estimate—and if you randomly sampled again you would probably get a somewhat different result.  Bigger sample is better.
  • 7. 7 Sampling Variability  Each time we take a random sample from a population, we are likely to get a different set of individuals and calculate a different statistic. This is called sampling variability.  We ask, “What would happen if we took many samples?”  Take a large number of samples from the same population.  Calculate the sample mean/proportion for each sample.  Make a histogram of these values.  Examine the distribution displayed in the histogram for shape, center, and spread, as well as outliers or other deviations.
  • 8. 8 Sampling Variability (Cont…)  The sampling distribution of a statistic is the distribution of that statistic for samples of a given size n taken from the same population. The variability of a statistic is described by the spread of its sampling distribution. This spread depends on the sampling design and the sample size n, with larger sample sizes leading to lower variability.
  • 9. 9 The results of many SRSs have a regular pattern. Here, we draw 1000 SRSs of size 100 from the same population. The population proportion is p = 0.60. The histogram shows the distribution of the 1000 sample proportions. The distribution of sample proportions for 1000 SRSs of size 2500 drawn from the same population as in first figure. The two histograms have the same scale. The statistic from the larger sample is less variable.
  • 10. 10 Both bias and variability describe what happens when we take many shots at the target. Bias concerns the center of the sampling distribution. A statistic used to estimate a parameter is unbiased if the mean of its sampling distribution is equal to the true value of the parameter being estimated. The variability of a statistic is described by the spread of its sampling distribution. This spread is determined by the sampling design and the sample size n. Statistics from larger probability samples have smaller spreads.10 Bias and Variability
  • 11. 11 A good sampling scheme must have both small bias and small variability. To reduce bias, use random sampling. To reduce variability of a statistic from an SRS, use a larger sample. Managing Bias and Variability POPULATION SIZE DOESN’T MATTER The variability of a statistic from a random sample does not depend on the size of the population, as long as the population is at least 100 times larger than the sample.
  • 12. 12 3.4 Ethics  Institutional Review Boards  Informed Consent  Confidentiality  Clinical Trials  Behavioral and Social Science Experiments
  • 13. 13 Institutional Review Boards  The organization that carries out the study must have an institutional review board that reviews all planned studies in advance in order to protect the subjects from possible harm.  The institutional review board:  reviews the plan of study  can require changes  reviews the consent form  monitors progress at least once a year
  • 14. 14 Informed Consent  All subjects must give their informed consent before data are collected.  Subjects must be informed in advance about the nature of a study and any risk of harm it might bring.  Subjects must then consent in writing.  Who can’t give informed consent?  prison inmates  very young children  people with mental disorders
  • 15. 15 Confidentiality  All individual data must be kept confidential. Only statistical summaries may be made public.  Confidentiality is not the same as anonymity. Anonymity means that subjects are anonymous—their names are not known even to the director of the study. Anonymity prevents follow-ups to improve non-response or inform subjects of results.  Any breach of confidentiality is a serious violation of data ethics.  The best practice is to separate the identity of the subjects from the rest of the data immediately!
  • 16. 16 Clinical Trials  Clinical trials study the effectiveness of medical treatments on actual patients—these treatments can harm as well as heal.  Points for a discussion:  Randomized comparative experiments are the only way to see the true effects of new treatments.  Most benefits of clinical trials go to future patients. We must balance future benefits against present risks.
  • 17. 17 Behavioral and Social Science Experiments  Many behavioral experiments rely on hiding the true purpose of the study.  Subjects would change their behavior if told in advance what investigators were looking for.  The “Ethical Principles” of the American Psychological Association require consent unless a study only observes behavior in a public space.