SlideShare a Scribd company logo
1
Chapter 8: Introduction to
Hypothesis Testing
2
Hypothesis Testing
• The general goal of a hypothesis test is to
rule out chance (sampling error) as a
plausible explanation for the results from a
research study.
• Hypothesis testing is a technique to help
determine whether a specific treatment
has an effect on the individuals in a
population.
3
Hypothesis Testing
The hypothesis test is used to evaluate the
results from a research study in which
1. A sample is selected from the
population.
2. The treatment is administered to
the sample.
3. After treatment, the individuals in
the sample are measured.
chapter8.pptnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
5
Hypothesis Testing (cont.)
• If the individuals in the sample are
noticeably different from the individuals in
the original population, we have evidence
that the treatment has an effect.
• However, it is also possible that the
difference between the sample and the
population is simply sampling error
chapter8.pptnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
7
Hypothesis Testing (cont.)
• The purpose of the hypothesis test is to decide
between two explanations:
1. The difference between the sample and
the population can be explained by
sampling error (there does not appear to be a
treatment effect)
2. The difference between the sample and
the population is too large to be
explained by sampling error (there does
appear to be a treatment effect).
chapter8.pptnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
9
The Null Hypothesis, the Alpha Level, the
Critical Region, and the Test Statistic
• The following four steps outline the
process of hypothesis testing and
introduce some of the new terminology:
10
Step 1
State the hypotheses and select an α
level. The null hypothesis, H0, always
states that the treatment has no effect (no
change, no difference). According to the
null hypothesis, the population mean after
treatment is the same is it was before
treatment. The α level establishes a
criterion, or "cut-off", for making a decision
about the null hypothesis. The alpha level
also determines the risk of a Type I error.
chapter8.pptnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
12
Step 2
Locate the critical region. The critical
region consists of outcomes that are very
unlikely to occur if the null hypothesis is
true. That is, the critical region is defined
by sample means that are almost
impossible to obtain if the treatment has
no effect. The phrase “almost impossible”
means that these samples have a
probability (p) that is less than the alpha
level.
chapter8.pptnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
14
Step 3
Compute the test statistic. The test
statistic (in this chapter a z-score) forms a
ratio comparing the obtained difference
between the sample mean and the
hypothesized population mean versus the
amount of difference we would expect
without any treatment effect (the standard
error).
15
Step 4
A large value for the test statistic shows that the
obtained mean difference is more than would be
expected if there is no treatment effect. If it is
large enough to be in the critical region, we
conclude that the difference is significant or
that the treatment has a significant effect. In this
case we reject the null hypothesis. If the mean
difference is relatively small, then the test
statistic will have a low value. In this case, we
conclude that the evidence from the sample is
not sufficient, and the decision is fail to reject the
null hypothesis.
chapter8.pptnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
17
Errors in Hypothesis Tests
• Just because the sample mean (following
treatment) is different from the original
population mean does not necessarily
indicate that the treatment has caused a
change.
• You should recall that there usually is
some discrepancy between a sample
mean and the population mean simply as
a result of sampling error.
18
Errors in Hypothesis Tests (cont.)
• Because the hypothesis test relies on
sample data, and because sample data
are not completely reliable, there is always
the risk that misleading data will cause the
hypothesis test to reach a wrong
conclusion.
• Two types of error are possible.
19
Type I Errors
• A Type I error occurs when the sample data appear to
show a treatment effect when, in fact, there is none.
• In this case the researcher will reject the null hypothesis
and falsely conclude that the treatment has an effect.
• Type I errors are caused by unusual, unrepresentative
samples. Just by chance the researcher selects an
extreme sample with the result that the sample falls in
the critical region even though the treatment has no
effect.
• The hypothesis test is structured so that Type I errors
are very unlikely; specifically, the probability of a Type I
error is equal to the alpha level.
20
Type II Errors
• A Type II error occurs when the sample does
not appear to have been affected by the
treatment when, in fact, the treatment does have
an effect.
• In this case, the researcher will fail to reject the
null hypothesis and falsely conclude that the
treatment does not have an effect.
• Type II errors are commonly the result of a very
small treatment effect. Although the treatment
does have an effect, it is not large enough to
show up in the research study.
chapter8.pptnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
22
Directional Tests
• When a research study predicts a specific
direction for the treatment effect (increase
or decrease), it is possible to incorporate
the directional prediction into the
hypothesis test.
• The result is called a directional test or a
one-tailed test. A directional test includes
the directional prediction in the statement
of the hypotheses and in the location of
the critical region.
23
Directional Tests (cont.)
• For example, if the original population has a
mean of μ = 80 and the treatment is predicted to
increase the scores, then the null hypothesis
would state that after treatment:
H0: μ < 80 (there is no increase)
• In this case, the entire critical region would be
located in the right-hand tail of the distribution
because large values for M would demonstrate
that there is an increase and would tend to reject
the null hypothesis.
24
Measuring Effect Size
• A hypothesis test evaluates the statistical
significance of the results from a research study.
• That is, the test determines whether or not it is
likely that the obtained sample mean occurred
without any contribution from a treatment effect.
• The hypothesis test is influenced not only by the
size of the treatment effect but also by the size of
the sample.
• Thus, even a very small effect can be significant
if it is observed in a very large sample.
25
Measuring Effect Size
• Because a significant effect does not necessarily
mean a large effect, it is recommended that the
hypothesis test be accompanied by a measure
of the effect size.
• We use Cohen=s d as a standardized measure
of effect size.
• Much like a z-score, Cohen=s d measures the
size of the mean difference in terms of the
standard deviation.
chapter8.pptnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
27
Power of a Hypothesis Test
• The power of a hypothesis test is defined
is the probability that the test will reject the
null hypothesis when the treatment does
have an effect.
• The power of a test depends on a variety
of factors including the size of the
treatment effect and the size of the
sample.
chapter8.pptnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn

More Related Content

PPT
RM U3 MGR Hypothesis Testing.ppt
PPT
Statistics - Chapter-08.ppt it is related to
PPT
Statistics - Chapter-08.ppt related to testing of hypothesis
PPT
Statistics Statistical Testing Chapter 8 .ppt
PPT
Statistics - Z test and Hypothesis Testing
PPTX
Introduction to Hypothesis Testing
PDF
Chapter 8 statistics for the sciences 10
PPTX
Hypothesis testing123444443335566677768888887.pptx
RM U3 MGR Hypothesis Testing.ppt
Statistics - Chapter-08.ppt it is related to
Statistics - Chapter-08.ppt related to testing of hypothesis
Statistics Statistical Testing Chapter 8 .ppt
Statistics - Z test and Hypothesis Testing
Introduction to Hypothesis Testing
Chapter 8 statistics for the sciences 10
Hypothesis testing123444443335566677768888887.pptx

Similar to chapter8.pptnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn (20)

PPTX
Chapter 18 Hypothesis testing (1).pptx
DOCX
Hypothesis Testing Definitions A statistical hypothesi.docx
PDF
Hypothesis testing
PDF
Hypothesis statistics12345678910111213.pdf
PPTX
Hypothesis testing123456789101121314151617.pptx
PPT
Hypothesis testing
PPTX
312320.pptx
PPTX
Basics of Hypothesis Testing
PPTX
hypothesis testing
PDF
HypothesisTesting_HANDOUT.pdf
PPTX
Testing Of Hypothesis
PPTX
Introduction-to-Hypothesis-Testing Explained in detail
PPTX
Hypothesis Test _One-sample t-test, Z-test, Proportion Z-test
PPTX
Hypothesis .pptx
PDF
hypothesis_testing-ch9-39-14402.pdf
PPTX
hypothesis testing
PPT
Formulating Hypotheses
PPTX
hypothesis.pptx
PPTX
Hyphotheses testing 6
PPTX
99c417_Lecture 9 & 10 - Hypothesis Testing.pptx
Chapter 18 Hypothesis testing (1).pptx
Hypothesis Testing Definitions A statistical hypothesi.docx
Hypothesis testing
Hypothesis statistics12345678910111213.pdf
Hypothesis testing123456789101121314151617.pptx
Hypothesis testing
312320.pptx
Basics of Hypothesis Testing
hypothesis testing
HypothesisTesting_HANDOUT.pdf
Testing Of Hypothesis
Introduction-to-Hypothesis-Testing Explained in detail
Hypothesis Test _One-sample t-test, Z-test, Proportion Z-test
Hypothesis .pptx
hypothesis_testing-ch9-39-14402.pdf
hypothesis testing
Formulating Hypotheses
hypothesis.pptx
Hyphotheses testing 6
99c417_Lecture 9 & 10 - Hypothesis Testing.pptx
Ad

More from RAtna29 (20)

PPT
RedBlackTrees_2.pptNNNNNNNNNNNNNNNNNNNNNN
PPT
6Sorting.pptBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBB
PPTX
statisticsforsupportslides.pptxnnnnnnnnnnnnnnnnnn
PPT
Gerstman_PP09.pptvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
PDF
MLT_KCS055 (Unit-2 Notes).pdfNNNNNNNNNNNNNNNN
PPTX
red black tree.pptxMMMMMMMMMMMMMMMMMMMMMMMMMM
PPTX
Unit 5 m way tree.pptxMMMMMMMMMMMMMMMMMMM
PPTX
TF_IDF_PMI_Jurafsky.pptxnnnnnnnnnnnnnnnn
PPTX
13-DependencyParsing.pptxnnnnnnnnnnnnnnnnnnn
PPT
pos-tagging.pptbbbbbbbbbbbbbbbbbbbbnnnnnnnnnn
PPT
lecture_15.pptffffffffffffffffffffffffff
PPT
6640200.pptNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
PPT
Chapter 4.pptmmmmmmmmmmmmmmmmmmmmmmmmmmmmm
PPT
cse220lec4.pptnnnnnnnnnnnnnnnnnnnnnnnnnnn
PPT
slp05.pptnnnnnnnnnnnnnnnnnnnnnnnnnnnnmmmmmmmmm
PPTX
lecture14-distributed-reprennnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnsentations.pptx
PPTX
lecture2-intro-boolean.pptbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbx
PPT
lecture10-efficient-scoring.ppmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmt
PPT
lecture3-indexconstruction.pptnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
PPTX
lecture13-DTrees-textcat.pptxnnnnnnnnnnnnnnnnnnnnnn
RedBlackTrees_2.pptNNNNNNNNNNNNNNNNNNNNNN
6Sorting.pptBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBB
statisticsforsupportslides.pptxnnnnnnnnnnnnnnnnnn
Gerstman_PP09.pptvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
MLT_KCS055 (Unit-2 Notes).pdfNNNNNNNNNNNNNNNN
red black tree.pptxMMMMMMMMMMMMMMMMMMMMMMMMMM
Unit 5 m way tree.pptxMMMMMMMMMMMMMMMMMMM
TF_IDF_PMI_Jurafsky.pptxnnnnnnnnnnnnnnnn
13-DependencyParsing.pptxnnnnnnnnnnnnnnnnnnn
pos-tagging.pptbbbbbbbbbbbbbbbbbbbbnnnnnnnnnn
lecture_15.pptffffffffffffffffffffffffff
6640200.pptNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
Chapter 4.pptmmmmmmmmmmmmmmmmmmmmmmmmmmmmm
cse220lec4.pptnnnnnnnnnnnnnnnnnnnnnnnnnnn
slp05.pptnnnnnnnnnnnnnnnnnnnnnnnnnnnnmmmmmmmmm
lecture14-distributed-reprennnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnsentations.pptx
lecture2-intro-boolean.pptbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbx
lecture10-efficient-scoring.ppmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmt
lecture3-indexconstruction.pptnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
lecture13-DTrees-textcat.pptxnnnnnnnnnnnnnnnnnnnnnn
Ad

Recently uploaded (20)

PPT
A5_DistSysCh1.ppt_INTRODUCTION TO DISTRIBUTED SYSTEMS
PDF
SMART SIGNAL TIMING FOR URBAN INTERSECTIONS USING REAL-TIME VEHICLE DETECTI...
PDF
Level 2 – IBM Data and AI Fundamentals (1)_v1.1.PDF
PPT
introduction to datamining and warehousing
PDF
null (2) bgfbg bfgb bfgb fbfg bfbgf b.pdf
PDF
UNIT no 1 INTRODUCTION TO DBMS NOTES.pdf
PDF
COURSE DESCRIPTOR OF SURVEYING R24 SYLLABUS
PPTX
Nature of X-rays, X- Ray Equipment, Fluoroscopy
PDF
BIO-INSPIRED HORMONAL MODULATION AND ADAPTIVE ORCHESTRATION IN S-AI-GPT
PPTX
Current and future trends in Computer Vision.pptx
PDF
Artificial Superintelligence (ASI) Alliance Vision Paper.pdf
PDF
III.4.1.2_The_Space_Environment.p pdffdf
PPTX
Information Storage and Retrieval Techniques Unit III
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PDF
Abrasive, erosive and cavitation wear.pdf
PPTX
UNIT 4 Total Quality Management .pptx
PPT
Total quality management ppt for engineering students
PDF
737-MAX_SRG.pdf student reference guides
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PDF
Analyzing Impact of Pakistan Economic Corridor on Import and Export in Pakist...
A5_DistSysCh1.ppt_INTRODUCTION TO DISTRIBUTED SYSTEMS
SMART SIGNAL TIMING FOR URBAN INTERSECTIONS USING REAL-TIME VEHICLE DETECTI...
Level 2 – IBM Data and AI Fundamentals (1)_v1.1.PDF
introduction to datamining and warehousing
null (2) bgfbg bfgb bfgb fbfg bfbgf b.pdf
UNIT no 1 INTRODUCTION TO DBMS NOTES.pdf
COURSE DESCRIPTOR OF SURVEYING R24 SYLLABUS
Nature of X-rays, X- Ray Equipment, Fluoroscopy
BIO-INSPIRED HORMONAL MODULATION AND ADAPTIVE ORCHESTRATION IN S-AI-GPT
Current and future trends in Computer Vision.pptx
Artificial Superintelligence (ASI) Alliance Vision Paper.pdf
III.4.1.2_The_Space_Environment.p pdffdf
Information Storage and Retrieval Techniques Unit III
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
Abrasive, erosive and cavitation wear.pdf
UNIT 4 Total Quality Management .pptx
Total quality management ppt for engineering students
737-MAX_SRG.pdf student reference guides
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
Analyzing Impact of Pakistan Economic Corridor on Import and Export in Pakist...

chapter8.pptnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn

  • 1. 1 Chapter 8: Introduction to Hypothesis Testing
  • 2. 2 Hypothesis Testing • The general goal of a hypothesis test is to rule out chance (sampling error) as a plausible explanation for the results from a research study. • Hypothesis testing is a technique to help determine whether a specific treatment has an effect on the individuals in a population.
  • 3. 3 Hypothesis Testing The hypothesis test is used to evaluate the results from a research study in which 1. A sample is selected from the population. 2. The treatment is administered to the sample. 3. After treatment, the individuals in the sample are measured.
  • 5. 5 Hypothesis Testing (cont.) • If the individuals in the sample are noticeably different from the individuals in the original population, we have evidence that the treatment has an effect. • However, it is also possible that the difference between the sample and the population is simply sampling error
  • 7. 7 Hypothesis Testing (cont.) • The purpose of the hypothesis test is to decide between two explanations: 1. The difference between the sample and the population can be explained by sampling error (there does not appear to be a treatment effect) 2. The difference between the sample and the population is too large to be explained by sampling error (there does appear to be a treatment effect).
  • 9. 9 The Null Hypothesis, the Alpha Level, the Critical Region, and the Test Statistic • The following four steps outline the process of hypothesis testing and introduce some of the new terminology:
  • 10. 10 Step 1 State the hypotheses and select an α level. The null hypothesis, H0, always states that the treatment has no effect (no change, no difference). According to the null hypothesis, the population mean after treatment is the same is it was before treatment. The α level establishes a criterion, or "cut-off", for making a decision about the null hypothesis. The alpha level also determines the risk of a Type I error.
  • 12. 12 Step 2 Locate the critical region. The critical region consists of outcomes that are very unlikely to occur if the null hypothesis is true. That is, the critical region is defined by sample means that are almost impossible to obtain if the treatment has no effect. The phrase “almost impossible” means that these samples have a probability (p) that is less than the alpha level.
  • 14. 14 Step 3 Compute the test statistic. The test statistic (in this chapter a z-score) forms a ratio comparing the obtained difference between the sample mean and the hypothesized population mean versus the amount of difference we would expect without any treatment effect (the standard error).
  • 15. 15 Step 4 A large value for the test statistic shows that the obtained mean difference is more than would be expected if there is no treatment effect. If it is large enough to be in the critical region, we conclude that the difference is significant or that the treatment has a significant effect. In this case we reject the null hypothesis. If the mean difference is relatively small, then the test statistic will have a low value. In this case, we conclude that the evidence from the sample is not sufficient, and the decision is fail to reject the null hypothesis.
  • 17. 17 Errors in Hypothesis Tests • Just because the sample mean (following treatment) is different from the original population mean does not necessarily indicate that the treatment has caused a change. • You should recall that there usually is some discrepancy between a sample mean and the population mean simply as a result of sampling error.
  • 18. 18 Errors in Hypothesis Tests (cont.) • Because the hypothesis test relies on sample data, and because sample data are not completely reliable, there is always the risk that misleading data will cause the hypothesis test to reach a wrong conclusion. • Two types of error are possible.
  • 19. 19 Type I Errors • A Type I error occurs when the sample data appear to show a treatment effect when, in fact, there is none. • In this case the researcher will reject the null hypothesis and falsely conclude that the treatment has an effect. • Type I errors are caused by unusual, unrepresentative samples. Just by chance the researcher selects an extreme sample with the result that the sample falls in the critical region even though the treatment has no effect. • The hypothesis test is structured so that Type I errors are very unlikely; specifically, the probability of a Type I error is equal to the alpha level.
  • 20. 20 Type II Errors • A Type II error occurs when the sample does not appear to have been affected by the treatment when, in fact, the treatment does have an effect. • In this case, the researcher will fail to reject the null hypothesis and falsely conclude that the treatment does not have an effect. • Type II errors are commonly the result of a very small treatment effect. Although the treatment does have an effect, it is not large enough to show up in the research study.
  • 22. 22 Directional Tests • When a research study predicts a specific direction for the treatment effect (increase or decrease), it is possible to incorporate the directional prediction into the hypothesis test. • The result is called a directional test or a one-tailed test. A directional test includes the directional prediction in the statement of the hypotheses and in the location of the critical region.
  • 23. 23 Directional Tests (cont.) • For example, if the original population has a mean of μ = 80 and the treatment is predicted to increase the scores, then the null hypothesis would state that after treatment: H0: μ < 80 (there is no increase) • In this case, the entire critical region would be located in the right-hand tail of the distribution because large values for M would demonstrate that there is an increase and would tend to reject the null hypothesis.
  • 24. 24 Measuring Effect Size • A hypothesis test evaluates the statistical significance of the results from a research study. • That is, the test determines whether or not it is likely that the obtained sample mean occurred without any contribution from a treatment effect. • The hypothesis test is influenced not only by the size of the treatment effect but also by the size of the sample. • Thus, even a very small effect can be significant if it is observed in a very large sample.
  • 25. 25 Measuring Effect Size • Because a significant effect does not necessarily mean a large effect, it is recommended that the hypothesis test be accompanied by a measure of the effect size. • We use Cohen=s d as a standardized measure of effect size. • Much like a z-score, Cohen=s d measures the size of the mean difference in terms of the standard deviation.
  • 27. 27 Power of a Hypothesis Test • The power of a hypothesis test is defined is the probability that the test will reject the null hypothesis when the treatment does have an effect. • The power of a test depends on a variety of factors including the size of the treatment effect and the size of the sample.

Editor's Notes

  • #4: Figure 8.1 The basic experimental situation for hypothesis testing. It is assumed that the parameter μ is known for the population before treatment. The purpose of the experiment is to determine whether or not the treatment has an effect on the population mean.
  • #6: Figure 8.2 From the point of view of the hypothesis test, the entire population receives the treatment and then a sample is selected from the treated population. In the actual research study, a sample is selected from the original population and the treatment is administered to the sample. From either perspective, the result is a treated sample that represents the treated population.
  • #8: Figure 8.3 The set of potential samples is divided into those that are likely to be obtained and those that are very unlikely to be obtained if the null hypothesis is true.
  • #11: Figure 8.5 The locations of the critical region boundaries for three different levels of significance: α = .05, α = .01, and α = .001.
  • #13: Figure 8.4 The critical region (very unlikely outcomes) for α = .05.
  • #16: Figure 8.6 The structure of a research study to determine whether prenatal alcohol affects birth weight. A sample is selected from the original population and is given alcohol. The question is what would happen if the entire population were given alcohol. The treated sample provides information about the unkonwn treated population.
  • #26: Figure 8.11 The appearance of a 15-point treatment effect in two different situations. In part (a), the standard deviation is σ = 100 and the 15-point effect is relatively small. In part (b), the standard deviation is σ = 15 and the 15-point effect is relatively large. Cohen’s d uses the standard deviation to help measure effect size.
  • #28: Figure 8.12 A demonstration of measuring power for a hypothesis test. The left-hand side shows the distribution of sample means that would occur if the null hypothesis is true. The critical region is defined for this distribution. The right-hand side shows the distribution of sample means that would be obtained if there were an 8-point treatment effect. Notice that if there is an 8-point effect, essentially all of the sample means would be in the critical region. Thus, the probability of rejecting H0 (the power of the test) would be nearly 100% for an 8-point treatment effect.