SlideShare a Scribd company logo
One-Way ANOVA
Introduction to Analysis of Variance
(ANOVA)
What is ANOVA?
 ANOVA is short for ANalysis Of VAriance
 Used with 3 or more groups to test for MEAN
DIFFS.
 E.g., caffeine study with 3 groups:
 No caffeine
 Mild dose
 Jolt group
 Level is value, kind or amount of IV
 Treatment Group is people who get specific
treatment or level of IV
 Treatment Effect is size of difference in means
Rationale for ANOVA (1)
 We have at least 3 means to test, e.g., H0: 1 = 2
= 3.
 Could take them 2 at a time, but really want to test
all 3 (or more) at once.
 Instead of using a mean difference, we can use the
variance of the group means about the grand mean
over all groups.
 Logic is just the same as for the t-test. Compare the
observed variance among means (observed
difference in means in the t-test) to what we would
expect to get by chance.
Rationale for ANOVA (2)
Suppose we drew 3 samples from the same population.
Our results might look like this:
10
0
-10
-20
4
3
2
1
0
10
0
-10
-20 10
0
-10
-20
Ra w Sc o re s (X)
10
0
-10
-20
Three Samples fromthe Same Population
M ean 1
M ean 2
M ean 3
Standard Dev Group 3
Note that the
means from the 3
groups are not
exactly the same,
but they are
close, so the
variance among
means will be
small.
Rationale for ANOVA (3)
Suppose we sample people from 3 different populations.
Our results might look like this:
20
10
0
-10
-20
4
3
2
1
0
Three Samples from3 DiffferentPopulations
20
10
0
-10
-20
Three Samples from3 DiffferentPopulations
20
10
0
-10
-20
Three Samples from3 DiffferentPopulations
20
10
0
-10
-20
Raw Scores (X)
Three Samples from3 DiffferentPopulations
Mean 1
Mean 2
Mean 3
SD Group 1
Note that the
sample means are
far away from one
another, so the
variance among
means will be
large.
Rationale for ANOVA (4)
Suppose we complete a study and find the following
results (either graph). How would we know or decide
whether there is a real effect or not?
10
0
-10
-20
4
3
2
1
0
10
0
-10
-20 10
0
-10
-20
Ra w Sc o re s (X)
10
0
-10
-20
Three Samples fromthe Same Population
M ean 1
M ean 2
M ean 3
Standard Dev Group 3
20
10
0
-10
-20
4
3
2
1
0
Three Samples from3 DiffferentPopulations
20
10
0
-10
-20
Three Samples from3 DiffferentPopulations
20
10
0
-10
-20
Three Samples from3 DiffferentPopulations
20
10
0
-10
-20
Raw Scores (X)
Three Samples from3 DiffferentPopulations
Mean 1
Mean 2
Mean 3
SD Group 1
To decide, we can compare our observed variance in
means to what we would expect to get on the basis
of chance given no true difference in means.
Review
 When would we use a t-test versus 1-way
ANOVA?
 In ANOVA, what happens to the variance in
means (between cells) if the treatment effect
is large?
Rationale for ANOVA
We can break the total variance in a study into
meaningful pieces that correspond to treatment effects
and error. That’s why we call this Analysis of Variance.
Definitions of Terms Used in ANOVA:
G
X The Grand Mean, taken over all observations.
A
X
1
A
X
The mean of any level of a treatment.
The mean of a specific level (1 in this case)
of a treatment.
i
X The observation or raw data for the ith person.
The ANOVA Model
A treatment effect is the difference between the overall,
grand mean, and the mean of a cell (treatment level).
G
A X
X
Effect
IV 

Error is the difference between a score and a cell
(treatment level) mean.
A
i X
X
Error 

The ANOVA Model:
)
(
)
( A
i
G
A
G
i X
X
X
X
X
X 




An individual’s
score is The grand
mean
+
A treatment
or IV effect + Error
The ANOVA Model
)
(
)
( A
i
G
A
G
i X
X
X
X
X
X 




The grand
mean
A treatment
or IV effect
Error
40
30
20
10
0
Frequency
ANOVA Data by Treatment Level
ANOVA Data by Treatment Level
ANOVA Data by Treatment Level
Grand Mean
Treatment Mean
Error
IV Effect
The graph shows the
terms in the equation.
There are three cells or
levels in this study.
The IV effect and error
for the highest scoring
cell is shown.
ANOVA Calculations
Sums of squares (squared deviations from the mean)
tell the story of variance. The simple ANOVA designs
have 3 sums of squares.
 
 2
)
( G
i
tot X
X
SS
The total sum of squares comes from the
distance of all the scores from the grand
mean. This is the total; it’s all you have.
 
 2
)
( A
i
W X
X
SS The within-group or within-cell sum of
squares comes from the distance of the
observations to the cell means. This
indicates error.
 
 2
)
( G
A
A
B X
X
N
SS The between-cells or between-groups
sum of squares tells of the distance of
the cell means from the grand mean.
This indicates IV effects.
W
B
TOT SS
SS
SS 

Computational Example: Caffeine on
Test Scores
G1: Control G2: Mild G3: Jolt
Test Scores
75=79-4 80=84-4 70=74-4
77=79-2 82=84-2 72=74-2
79=79+0 84=84+0 74=74+0
81=79+2 86=84+2 76=74+2
83=79+4 88=84+4 78=74+4
Means
79 84 74
SDs (N-1)
3.16 3.16 3.16
G1 75 79 16
Control 77 79 4
M=79 79 79 0
SD=3.16 81 79 4
83 79 16
G2 80 79 1
M=84 82 79 9
SD=3.16 84 79 25
86 79 49
88 79 81
G3 70 79 81
M=74 72 79 49
SD=3.16 74 79 25
76 79 9
78 79 1
Sum 370
G
X
i
X 2
)
( G
i X
X 
Total
Sum of
Squares
 
 2
)
( G
i
tot X
X
SS
In the total sum of squares, we are finding the
squared distance from the Grand Mean. If we took
the average, we would have a variance.
 
 2
)
( G
i
tot X
X
SS
Scores on the Dependent Variable by Group
0.5
0.4
0.3
0.1
0.0
Relative
Frequency
Low High
Grand Mean
G1 75 79 16
Control 77 79 4
M=79 79 79 0
SD=3.16 81 79 4
83 79 16
G2 80 84 16
M=84 82 84 4
SD=3.16 84 84 0
86 84 4
88 84 16
G3 70 74 16
M=74 72 74 4
SD=3.16 74 74 0
76 74 4
78 74 16
Sum 120
Within
Sum of
Squares
i
X A
X 2
)
( A
i X
X 
 
 2
)
( A
i
W X
X
SS
Within sum of squares refers to the variance within
cells. That is, the difference between scores and their
cell means. SSW estimates error.
Scores on the Dependent Variable by Group
0.5
0.4
0.3
0.1
0.0
Relative
Frequency
Low High
Cell or Treatment Mean
 
 2
)
( A
i
W X
X
SS
G1 79 79 0
Control 79 79 0
M=79 79 79 0
SD=3.16 79 79 0
79 79 0
G2 84 79 25
M=84 84 79 25
SD=3.16 84 79 25
84 79 25
84 79 25
G3 74 79 25
M=74 74 79 25
SD=3.16 74 79 25
74 79 25
74 79 25
Sum 250
Between
Sum of
Squares
 
 2
)
( G
A
A
B X
X
N
SS
A
X G
X 2
)
( G
A X
X 
The between sum of squares relates the Cell Means to
the Grand Mean. This is related to the variance of the
means.
Scores on the Dependent Variable by Group
0.5
0.4
0.3
0.1
0.0
Relative
Frequency
Low High
Cell Mean
Grand Mean
Cell Mean
Cell Mean
 
 2
)
( G
A
A
B X
X
N
SS
ANOVA Source Table (1)
Source SS df MS F
Between
Groups
250 k-1=2 SS/df
250/2=
125
=MSB
F =
MSB/MSW
= 125/10
=12.5
Within
Groups
120 N-k=
15-3=12
120/12 =
10 =
MSW
Total 370 N-1=14
ANOVA Source Table (2)
 df – Degrees of freedom. Divide the sum of
squares by degrees of freedom to get
 MS, Mean Squares, which are population
variance estimates.
 F is the ratio of two mean squares. F is
another distribution like z and t. There are
tables of F used for significance testing.
The F Distribution
F Table – Critical Values
Numerator df: dfB
dfW 1 2 3 4 5
5 5%
1%
6.61
16.3
5.79
13.3
5.41
12.1
5.19
11.4
5.05
11.0
10 5%
1%
4.96
10.0
4.10
7.56
3.71
6.55
3.48
5.99
3.33
5.64
12 5%
1%
4.75
9.33
3.89
6.94
3.49
5.95
3.26
5.41
3.11
5.06
14 5%
1%
4.60
8.86
3.74
6.51
3.34
5.56
3.11
5.04
2.96
4.70
Review
 What are critical values of a statistics (e.g.,
critical values of F)?
 What are degrees of freedom?
 What are mean squares?
 What does MSW tell us?
Review 6 Steps
1. Set alpha (.05).
2. State Null &
Alternative
H0:
H1: not all  are =.
3. Calculate test statistic:
F=12.5
4. Determine critical value
F.05(2,12) = 3.89
5. Decision rule: If test
statistic > critical value,
reject H0.
6. Decision: Test is
significant (12.5>3.89).
Means in population are
different.
3
2
1 

 

Post Hoc Tests
 If the t-test is significant, you have a
difference in population means.
 If the F-test is significant, you have a
difference in population means. But you
don’t know where.
 With 3 means, could be A=B>C or A>B>C or
A>B=C.
 We need a test to tell which means are
different. Lots available, we will use 1.
Tukey HSD (1)
HSD means honestly significant difference.
A
W
N
MS
q
HSD 
 
 is the Type I error rate (.05).

q Is a value from a table of the studentized range
statistic based on alpha, dfW (12 in our example)
and k, the number of groups (3 in our example).
W
MS Is the mean square within groups (10).
A
N Is the number of people in each group (5).
33
.
5
5
10
77
.
3
05
. 

HSD
From table
MSW
A
N
Result for our example.
Use with equal sample size per cell.
Tukey HSD (2)
To see which means are significantly different, we
compare the observed differences among our means to
the critical value of the Tukey test.
The differences are:
1-2 is 79-84 = -5 (say 5 to be positive).
1-3 is 79-74 = 5
2-3 is 84-74 = 10. Because 10 is larger than 5.33, this result
is significant (2 is different than 3). The other differences
are not significant. Review 6 steps.
Review
 What is a post hoc test? What is its use?
 Describe the HSD test. What does HSD
stand for?
Test
 Another name for mean square is
_________.
1. standard deviation
2. sum of squares
3. treatment level
4. variance
Test
When do we use post hoc tests?
 a. after a significant overall F test
 b. after a nonsignificant overall F test
 c. in place of an overall F test
 d. when we want to determine the impact of
different factors

More Related Content

PPT
PPTX
Challenges for drug development jsr slides aug 2013
PPTX
Clinical trials designs
PPT
Regulations, guidelines & ethics in clinical research
PPT
Biostatistics in Clinical Research
PPT
Analysis of variance
PPTX
Strategies for Navigating ICH E9(R1) Webinar slides
PPTX
Non Parametric Tests
Challenges for drug development jsr slides aug 2013
Clinical trials designs
Regulations, guidelines & ethics in clinical research
Biostatistics in Clinical Research
Analysis of variance
Strategies for Navigating ICH E9(R1) Webinar slides
Non Parametric Tests

What's hot (20)

PPTX
Student t-test
PPTX
non parametric statistics
PPT
Anova lecture
PPTX
PPTX
International Classification of Disease
PDF
RWD/RWE for Rare Disease Drugs Webinar
PPT
One Sample T Test
PPT
Study design
PPTX
Research methodoloy
PPT
pharmacoepidemiology.ppt
PPTX
Methods of Randomization
PPTX
Qol
PPTX
PHARMACOEPIDEMIOLOGY
PPTX
Drug Development Process
PPT
T test statistics
PPTX
Key Concepts of Clinical Research & Clinical Trial
PPT
Stat5 the t test
PPTX
Clinical trials its types and designs
PPTX
Pharmacoeconomics
Student t-test
non parametric statistics
Anova lecture
International Classification of Disease
RWD/RWE for Rare Disease Drugs Webinar
One Sample T Test
Study design
Research methodoloy
pharmacoepidemiology.ppt
Methods of Randomization
Qol
PHARMACOEPIDEMIOLOGY
Drug Development Process
T test statistics
Key Concepts of Clinical Research & Clinical Trial
Stat5 the t test
Clinical trials its types and designs
Pharmacoeconomics
Ad

Similar to ANOVA.ppt (20)

PPTX
A study on the ANOVA ANALYSIS OF VARIANCE.pptx
PPTX
ANOVA Analysis of Varianace Biostatistics
PPTX
Analysis of variance
PPTX
ANOVA BIOstat short explaination .pptx
PPT
Ashish Dadheech PRESENT TOPIC NURSING RESEARCH
PPT
Anova by Hazilah Mohd Amin
PPTX
Tugasan kumpulan anova
PPT
Q3W3_ANOVA_SC.pptQ3W3_ANOVA_SC.pptQ3W3_ANOVA_SC.ppt
PPT
Anova.ppt
PPTX
One-Way ANOVA: Conceptual Foundations
PPTX
Analysis of variance (ANOVA)
PPTX
Full Lecture Presentation on ANOVA
PPT
ANOVA.ppt
PPT
lecture12.ppt
PPT
lecture12.ppt
PPTX
PDF
anovappt-141025002857-conversion-gate01 (1).pdf
PPT
oneway.ppt
PPTX
PDF
Anova one way sem 1 20142015 dk
A study on the ANOVA ANALYSIS OF VARIANCE.pptx
ANOVA Analysis of Varianace Biostatistics
Analysis of variance
ANOVA BIOstat short explaination .pptx
Ashish Dadheech PRESENT TOPIC NURSING RESEARCH
Anova by Hazilah Mohd Amin
Tugasan kumpulan anova
Q3W3_ANOVA_SC.pptQ3W3_ANOVA_SC.pptQ3W3_ANOVA_SC.ppt
Anova.ppt
One-Way ANOVA: Conceptual Foundations
Analysis of variance (ANOVA)
Full Lecture Presentation on ANOVA
ANOVA.ppt
lecture12.ppt
lecture12.ppt
anovappt-141025002857-conversion-gate01 (1).pdf
oneway.ppt
Anova one way sem 1 20142015 dk
Ad

More from sadiakhan783184 (6)

PPT
Theories of Envoirnmnetal Psy.ppt
PPTX
Frequency distributions.pptx
PPTX
LINEAR REGRESSION ANALYSIS.pptx
PPT
STATISTICAL MEASURES.ppt
PPT
HYPOTHESIS TESTING.ppt
PPT
CORRELATION.ppt
Theories of Envoirnmnetal Psy.ppt
Frequency distributions.pptx
LINEAR REGRESSION ANALYSIS.pptx
STATISTICAL MEASURES.ppt
HYPOTHESIS TESTING.ppt
CORRELATION.ppt

Recently uploaded (20)

PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PDF
Complications of Minimal Access Surgery at WLH
PPTX
Cell Types and Its function , kingdom of life
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PPTX
Pharma ospi slides which help in ospi learning
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PDF
O7-L3 Supply Chain Operations - ICLT Program
PPTX
Week 4 Term 3 Study Techniques revisited.pptx
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
102 student loan defaulters named and shamed – Is someone you know on the list?
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
Complications of Minimal Access Surgery at WLH
Cell Types and Its function , kingdom of life
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
Pharma ospi slides which help in ospi learning
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
O7-L3 Supply Chain Operations - ICLT Program
Week 4 Term 3 Study Techniques revisited.pptx
FourierSeries-QuestionsWithAnswers(Part-A).pdf
Renaissance Architecture: A Journey from Faith to Humanism
Supply Chain Operations Speaking Notes -ICLT Program
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
Final Presentation General Medicine 03-08-2024.pptx
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx

ANOVA.ppt

  • 1. One-Way ANOVA Introduction to Analysis of Variance (ANOVA)
  • 2. What is ANOVA?  ANOVA is short for ANalysis Of VAriance  Used with 3 or more groups to test for MEAN DIFFS.  E.g., caffeine study with 3 groups:  No caffeine  Mild dose  Jolt group  Level is value, kind or amount of IV  Treatment Group is people who get specific treatment or level of IV  Treatment Effect is size of difference in means
  • 3. Rationale for ANOVA (1)  We have at least 3 means to test, e.g., H0: 1 = 2 = 3.  Could take them 2 at a time, but really want to test all 3 (or more) at once.  Instead of using a mean difference, we can use the variance of the group means about the grand mean over all groups.  Logic is just the same as for the t-test. Compare the observed variance among means (observed difference in means in the t-test) to what we would expect to get by chance.
  • 4. Rationale for ANOVA (2) Suppose we drew 3 samples from the same population. Our results might look like this: 10 0 -10 -20 4 3 2 1 0 10 0 -10 -20 10 0 -10 -20 Ra w Sc o re s (X) 10 0 -10 -20 Three Samples fromthe Same Population M ean 1 M ean 2 M ean 3 Standard Dev Group 3 Note that the means from the 3 groups are not exactly the same, but they are close, so the variance among means will be small.
  • 5. Rationale for ANOVA (3) Suppose we sample people from 3 different populations. Our results might look like this: 20 10 0 -10 -20 4 3 2 1 0 Three Samples from3 DiffferentPopulations 20 10 0 -10 -20 Three Samples from3 DiffferentPopulations 20 10 0 -10 -20 Three Samples from3 DiffferentPopulations 20 10 0 -10 -20 Raw Scores (X) Three Samples from3 DiffferentPopulations Mean 1 Mean 2 Mean 3 SD Group 1 Note that the sample means are far away from one another, so the variance among means will be large.
  • 6. Rationale for ANOVA (4) Suppose we complete a study and find the following results (either graph). How would we know or decide whether there is a real effect or not? 10 0 -10 -20 4 3 2 1 0 10 0 -10 -20 10 0 -10 -20 Ra w Sc o re s (X) 10 0 -10 -20 Three Samples fromthe Same Population M ean 1 M ean 2 M ean 3 Standard Dev Group 3 20 10 0 -10 -20 4 3 2 1 0 Three Samples from3 DiffferentPopulations 20 10 0 -10 -20 Three Samples from3 DiffferentPopulations 20 10 0 -10 -20 Three Samples from3 DiffferentPopulations 20 10 0 -10 -20 Raw Scores (X) Three Samples from3 DiffferentPopulations Mean 1 Mean 2 Mean 3 SD Group 1 To decide, we can compare our observed variance in means to what we would expect to get on the basis of chance given no true difference in means.
  • 7. Review  When would we use a t-test versus 1-way ANOVA?  In ANOVA, what happens to the variance in means (between cells) if the treatment effect is large?
  • 8. Rationale for ANOVA We can break the total variance in a study into meaningful pieces that correspond to treatment effects and error. That’s why we call this Analysis of Variance. Definitions of Terms Used in ANOVA: G X The Grand Mean, taken over all observations. A X 1 A X The mean of any level of a treatment. The mean of a specific level (1 in this case) of a treatment. i X The observation or raw data for the ith person.
  • 9. The ANOVA Model A treatment effect is the difference between the overall, grand mean, and the mean of a cell (treatment level). G A X X Effect IV   Error is the difference between a score and a cell (treatment level) mean. A i X X Error   The ANOVA Model: ) ( ) ( A i G A G i X X X X X X      An individual’s score is The grand mean + A treatment or IV effect + Error
  • 10. The ANOVA Model ) ( ) ( A i G A G i X X X X X X      The grand mean A treatment or IV effect Error 40 30 20 10 0 Frequency ANOVA Data by Treatment Level ANOVA Data by Treatment Level ANOVA Data by Treatment Level Grand Mean Treatment Mean Error IV Effect The graph shows the terms in the equation. There are three cells or levels in this study. The IV effect and error for the highest scoring cell is shown.
  • 11. ANOVA Calculations Sums of squares (squared deviations from the mean) tell the story of variance. The simple ANOVA designs have 3 sums of squares.    2 ) ( G i tot X X SS The total sum of squares comes from the distance of all the scores from the grand mean. This is the total; it’s all you have.    2 ) ( A i W X X SS The within-group or within-cell sum of squares comes from the distance of the observations to the cell means. This indicates error.    2 ) ( G A A B X X N SS The between-cells or between-groups sum of squares tells of the distance of the cell means from the grand mean. This indicates IV effects. W B TOT SS SS SS  
  • 12. Computational Example: Caffeine on Test Scores G1: Control G2: Mild G3: Jolt Test Scores 75=79-4 80=84-4 70=74-4 77=79-2 82=84-2 72=74-2 79=79+0 84=84+0 74=74+0 81=79+2 86=84+2 76=74+2 83=79+4 88=84+4 78=74+4 Means 79 84 74 SDs (N-1) 3.16 3.16 3.16
  • 13. G1 75 79 16 Control 77 79 4 M=79 79 79 0 SD=3.16 81 79 4 83 79 16 G2 80 79 1 M=84 82 79 9 SD=3.16 84 79 25 86 79 49 88 79 81 G3 70 79 81 M=74 72 79 49 SD=3.16 74 79 25 76 79 9 78 79 1 Sum 370 G X i X 2 ) ( G i X X  Total Sum of Squares    2 ) ( G i tot X X SS
  • 14. In the total sum of squares, we are finding the squared distance from the Grand Mean. If we took the average, we would have a variance.    2 ) ( G i tot X X SS Scores on the Dependent Variable by Group 0.5 0.4 0.3 0.1 0.0 Relative Frequency Low High Grand Mean
  • 15. G1 75 79 16 Control 77 79 4 M=79 79 79 0 SD=3.16 81 79 4 83 79 16 G2 80 84 16 M=84 82 84 4 SD=3.16 84 84 0 86 84 4 88 84 16 G3 70 74 16 M=74 72 74 4 SD=3.16 74 74 0 76 74 4 78 74 16 Sum 120 Within Sum of Squares i X A X 2 ) ( A i X X     2 ) ( A i W X X SS
  • 16. Within sum of squares refers to the variance within cells. That is, the difference between scores and their cell means. SSW estimates error. Scores on the Dependent Variable by Group 0.5 0.4 0.3 0.1 0.0 Relative Frequency Low High Cell or Treatment Mean    2 ) ( A i W X X SS
  • 17. G1 79 79 0 Control 79 79 0 M=79 79 79 0 SD=3.16 79 79 0 79 79 0 G2 84 79 25 M=84 84 79 25 SD=3.16 84 79 25 84 79 25 84 79 25 G3 74 79 25 M=74 74 79 25 SD=3.16 74 79 25 74 79 25 74 79 25 Sum 250 Between Sum of Squares    2 ) ( G A A B X X N SS A X G X 2 ) ( G A X X 
  • 18. The between sum of squares relates the Cell Means to the Grand Mean. This is related to the variance of the means. Scores on the Dependent Variable by Group 0.5 0.4 0.3 0.1 0.0 Relative Frequency Low High Cell Mean Grand Mean Cell Mean Cell Mean    2 ) ( G A A B X X N SS
  • 19. ANOVA Source Table (1) Source SS df MS F Between Groups 250 k-1=2 SS/df 250/2= 125 =MSB F = MSB/MSW = 125/10 =12.5 Within Groups 120 N-k= 15-3=12 120/12 = 10 = MSW Total 370 N-1=14
  • 20. ANOVA Source Table (2)  df – Degrees of freedom. Divide the sum of squares by degrees of freedom to get  MS, Mean Squares, which are population variance estimates.  F is the ratio of two mean squares. F is another distribution like z and t. There are tables of F used for significance testing.
  • 22. F Table – Critical Values Numerator df: dfB dfW 1 2 3 4 5 5 5% 1% 6.61 16.3 5.79 13.3 5.41 12.1 5.19 11.4 5.05 11.0 10 5% 1% 4.96 10.0 4.10 7.56 3.71 6.55 3.48 5.99 3.33 5.64 12 5% 1% 4.75 9.33 3.89 6.94 3.49 5.95 3.26 5.41 3.11 5.06 14 5% 1% 4.60 8.86 3.74 6.51 3.34 5.56 3.11 5.04 2.96 4.70
  • 23. Review  What are critical values of a statistics (e.g., critical values of F)?  What are degrees of freedom?  What are mean squares?  What does MSW tell us?
  • 24. Review 6 Steps 1. Set alpha (.05). 2. State Null & Alternative H0: H1: not all  are =. 3. Calculate test statistic: F=12.5 4. Determine critical value F.05(2,12) = 3.89 5. Decision rule: If test statistic > critical value, reject H0. 6. Decision: Test is significant (12.5>3.89). Means in population are different. 3 2 1     
  • 25. Post Hoc Tests  If the t-test is significant, you have a difference in population means.  If the F-test is significant, you have a difference in population means. But you don’t know where.  With 3 means, could be A=B>C or A>B>C or A>B=C.  We need a test to tell which means are different. Lots available, we will use 1.
  • 26. Tukey HSD (1) HSD means honestly significant difference. A W N MS q HSD     is the Type I error rate (.05).  q Is a value from a table of the studentized range statistic based on alpha, dfW (12 in our example) and k, the number of groups (3 in our example). W MS Is the mean square within groups (10). A N Is the number of people in each group (5). 33 . 5 5 10 77 . 3 05 .   HSD From table MSW A N Result for our example. Use with equal sample size per cell.
  • 27. Tukey HSD (2) To see which means are significantly different, we compare the observed differences among our means to the critical value of the Tukey test. The differences are: 1-2 is 79-84 = -5 (say 5 to be positive). 1-3 is 79-74 = 5 2-3 is 84-74 = 10. Because 10 is larger than 5.33, this result is significant (2 is different than 3). The other differences are not significant. Review 6 steps.
  • 28. Review  What is a post hoc test? What is its use?  Describe the HSD test. What does HSD stand for?
  • 29. Test  Another name for mean square is _________. 1. standard deviation 2. sum of squares 3. treatment level 4. variance
  • 30. Test When do we use post hoc tests?  a. after a significant overall F test  b. after a nonsignificant overall F test  c. in place of an overall F test  d. when we want to determine the impact of different factors