SlideShare a Scribd company logo
Statistical tests to observe the statistical
significance of qualitative variables (Chi-
square, Fisher’s exact & Mac Nemar’s chi-
square tests)
Types of Categorical Data
Types of Categorical Data
Types of Analysis for
Types of Analysis for
Categorical Data
Categorical Data
23-Statistical_tests_(chi-square,Fishers___&Macnemars))(UG1435-36).ppt
Choosing the appropriate
Statistical test
 Based on the three aspects of the data
 Types of variables
 Number of groups being
compared &
 Sample size
Statistical test (cont.)
Chi-square test:
Study variable: Qualitative(Categorical )
Outcome variable: Qualitative(Categorical)
Comparison: two or more proportions
Sample size: >30
Expected frequency: > 5
Chi-square test
Purpose
Purpose
To find out whether the association between two
To find out whether the association between two
categorical variables are statistically significant
categorical variables are statistically significant
Null Hypothesis
Null Hypothesis
There is no association between two variables
There is no association between two variables
 
( o - e ) 2
e
X
X 2
=
=
Figure for Each Cell
reject
reject H
Ho
o if
if 
2
2
>
> 
2
2
.
.
,
,df
df
where df = (
where df = (r
r-1)(
-1)(c
c-1)
-1)

2
2
= ∑
= ∑
(
(O
O -
- E
E)
)2
2
E
E
3.
3. E is the expected frequency
E is the expected frequency
^
^
E
E =
=
^
^
(total of all cells)
(total of all cells)
total of row in
total of row in
which the cell lies
which the cell lies
total of column in
total of column in
which the cell lies
which the cell lies
•
•
1.
1. The summation is over all cells of the contingency
The summation is over all cells of the contingency
table consisting of r rows and c columns
table consisting of r rows and c columns
2.
2. O is the observed frequency
O is the observed frequency
4.
4. The degrees of freedom are df = (r-1)(c-1)
The degrees of freedom are df = (r-1)(c-1)
Requirements
 Prior to using the chi square test,
there are certain requirements that
must be met.
 The data must be in the form of
frequencies counted in each of a set of
categories. Percentages cannot be used.
 The total number observed must be
exceed 20.
Requirements
 The expected frequency under the H0
hypothesis in any one fraction must not
normally be less than 5.
 All the observations must be
independent of each other. In other
words, one observation must not have
an influence upon another observation.
APPLICATION OF CHI-SQUARE TEST
 TESTING INDEPENDCNE (or
ASSOCATION)
 TESTING FOR HOMOGENEITY
 TESTING OF GOODNESS-OF-FIT
Chi-square test
 Objective : Smoking is a risk factor for MI
 Null Hypothesis: Smoking does not
cause MI
D (MI)
D (MI) No D( No MI)
No D( No MI) Total
Total
Smokers
Smokers 29
29 21
21 50
50
Non-smokers
Non-smokers 16
16 34
34 50
50
Total
Total 45
45 55
55 100
100
E
O
29
E
O
21
E
O
16
E
O
34
MI Non-MI
Smoker
Non-Smoker
Chi-Square test
E
O
29
E
O
21
E
O
16
E
O
34
MI Non-MI
Smoker
Non-smoker
50
50
55
45 100
Chi-square test
E
O
29
E
O
21
E
O
16
E
O
34
MI Non-MI
Smoker
Non-smoker
50
50
55
45 100
50 X 45
100
22.5 =
22.5
Chi-square test
E
O
29
E
O
21
E
O
16
E
O
34
MI No MI
smoker
Non
smoker
50
50
55
45 100
22.5 27.5
22.5 27.5
Chi-square test
 Degrees of Freedom
df = (r-1) (c-1)
= (2-1) (2-1) =1
 Critical Value (Table A.6) = 3.84
 X2
= 6.84
 Calculated value(6.84) is greater than critical (table)
value (3.84) at 0.05 level with 1 d.f.f
 Hence we reject our Ho and conclude that there is
highly statistically significant association between
smoking and MI.
Chi-Square
Association between Diabetes and Heart
Disease?
 Background:
Contradictory opinions:
 1. A diabetic’s risk of dying after a first heart attack is the same as that of
someone without diabetes. There is no association between diabetes and
heart disease.
vs.
 2. Diabetes takes a heavy toll on the body and diabetes patients often
suffer heart attacks and strokes or die from cardiovascular complications
at a much younger age.
 So we use hypothesis test based on the latest data to see what’s the right
conclusion.
 There are a total of 5167 patients, among which 1131 patients are non-
diabetics and 4036 are diabetics. Among the non-diabetic patients, 42%
of them had their blood pressure properly controlled (therefore it’s 475 of
1131). While among the diabetic patients only 20% of them had the blood
pressure controlled (therefore it’s 807 of 4036).
Association between Diabetes and Heart
Disease?
 Data
Controlled Uncontrolled Total
Non-diabetes 475 656 1131
Diabetes 807 3229 4036
Total 1282 3885 5167
Association between Diabetes and Heart
Disease?
Data:
Diabetes: 1=Not have diabetes, 2=Have Diabetes
Control: 1=Controlled, 2=Uncontrolled
DIABETES * CONTROL Crosstabulation
Count
475 656 1131
807 3229 4036
1282 3885 5167
1.00
2.00
DIABETES
Total
1.00 2.00
CONTROL
Total
Association between Diabetes and Heart
Disease?
Hypothesis test:
H0: There is no association between diabetes
and heart disease. (or) Diabetes and heart
disease are independent.
vs
HA: There is an association between diabetes
and heart disease. (or) Diabetes and heart
disease are dependent.
--- Assume a significance level of 0.05
Association between Diabetes and Heart
Disease?
---The computer gives us a Chi-Square Statistic
of 229.268
---The computer gives us a p-value of .000
(<0.0001)
--- Because our p-value is less than alpha, we
would reject the null hypothesis.
--- There is sufficient evidence to conclude that
there is an association between diabetes and
heart disease.
Age
Age
Gender
Gender <30
<30 30-45
30-45 >45
>45 Total
Total
Male
Male 60 (60)
60 (60) 20 (30)
20 (30) 40 (30)
40 (30) 120
120
Female
Female 40 (40)
40 (40) 30 (20)
30 (20) 10 (20)
10 (20) 80
80
Total
Total 100
100 50
50 50
50 200
200
Chi- square test
Find out whether the gender is equally
distributed among each age group
Test for Homogeneity (Similarity)
To test similarity between frequency distribution or group. It is
used in assessing the similarity between non-responders and
responders in any survey
Age (yrs)
Age (yrs) Responders
Responders Non-responders
Non-responders Total
Total
<20
<20 76 (82)
76 (82) 20 (14)
20 (14) 96
96
20 – 29
20 – 29 288 (289)
288 (289) 50 (49)
50 (49) 338
338
30-39
30-39 312 (310)
312 (310) 51 (53)
51 (53) 363
363
40-49
40-49 187 (185)
187 (185) 30 (32)
30 (32) 217
217
>50
>50 77 (73)
77 (73) 9 (13)
9 (13) 86
86
Total
Total 940
940 160
160 1100
1100
Fisher’s exact test:
Study variable: Qualitative(Categorical)
Outcome variable: Qualitative(Categorical)
Comparison: two proportions
Sample size: < 30
Example
Example
 The following data relate to suicidal feelings in
samples of psychotic and neurotic patients:
Example
Example
 The following data compare malocclusion
of teeth with method of feeding infants.
Normal teeth Malocclusion
Breast fed 4 (a) 16 (b)
Bottle fed 1 (c) 21 (d)
Fisher’s Exact Test:
Fisher’s Exact Test:
 The method of Yates's correction was useful
when manual calculations were done. Now
different types of statistical packages are
available. Therefore, it is better to use
Fisher's exact test rather than Yates's
correction as it gives exact result.
1 2 1 2
! ! ! !
'
! ! ! ! !
R R C C
Fisher s ExactTest
n a b c d

23-Statistical_tests_(chi-square,Fishers___&Macnemars))(UG1435-36).ppt
23-Statistical_tests_(chi-square,Fishers___&Macnemars))(UG1435-36).ppt
23-Statistical_tests_(chi-square,Fishers___&Macnemars))(UG1435-36).ppt
23-Statistical_tests_(chi-square,Fishers___&Macnemars))(UG1435-36).ppt
What to do when we have a
paired samples and both the
exposure and outcome
variables are qualitative
variables (Binary).
Macnemar’s test: (for paired samples)
Study variable: Qualitative (categorical)
Outcome variable: Qualitative(categorical)
Comparison: two proportions
Sample size: Any
Problem
 A researcher has done a matched case-
control study of endometrial cancer
(cases) and exposure to conjugated
estrogens (exposed).
 In the study cases were individually
matched 1:1 to a non-cancer hospital-
based control, based on age, race, date
of admission, and hospital.
McNemar’s test
McNemar’s test
Situation:
Situation:
Two paired binary variables that
Two paired binary variables that
form a particular type of 2 x 2
form a particular type of 2 x 2
table
table
e.g. matched case-control study or
e.g. matched case-control study or
cross-over trial
cross-over trial
Cases Controls Total
Exposed 55 19 74
Not exposed 128 164 292
Total 183 183 366
Data
 can’t use a chi-squared test - observations
are not independent - they’re paired.
 we must present the 2 x 2 table differently
 each cell should contain a count of the
number of pairs with certain criteria, with
the columns and rows respectively referring
to each of the subjects in the matched pair
 the information in the standard 2 x 2 table
used for unmatched studies is insufficient
because it doesn’t say who is in which pair -
ignoring the matching
Controls
Cases Exposed Not exposed Total
Exposed 12 43 55
Not exposed 7 121 128
Total 19 164 183
Data
Controls
Cases Exposed Not exposed Total
Exposed e f e+f
Not exposed g h g+h
Total e+g f+h n
We construct a matched 2 x 2 table:
The odds ratio is: f/g
The test is:
Formula
g
f
1)
-
g
f
( 2
2




Compare this to the 2
distribution on 1 df
5
.
24
50
1225
7
3
4
1)
-
7
-
43
( 2
2





P <0.001, Odds Ratio = 43/7 = 6.1
p1 - p2 = (55/183) – (19/183) = 0.197 (20%)
s.e.(p1 - p2) = 0.036
95% CI: 0.12 to 0.27 (or 12% to 27%)
 Degrees of Freedom
df = (r-1) (c-1)
= (2-1) (2-1) =1
 Critical Value (Table A.6) = 3.84
 X2
= 25.92
 Calculated value(25.92) is greater than critical
(table) value (3.84) at 0.05 level with 1 d.f.f
 Hence we reject our Ho and conclude that
there is highly statistically significant
association between Endometrial cancer and
Estrogens.
23-Statistical_tests_(chi-square,Fishers___&Macnemars))(UG1435-36).ppt
In Conclusion !
When both the study variables and outcome
variables are categorical (Qualitative):
Apply
(i) Chi square test
(ii) Fisher’s exact test (Small samples)
(iii) Mac nemar’s test ( for paired samples)

More Related Content

PPT
Statistical tests for categorical data(2020)88.ppt
PPT
Chi-square, Yates, Fisher & McNemar
PPTX
Company Induction process and Onboarding
PDF
7. Chi square test.pdf pharmaceutical biostatistics
PPTX
Biostatistics- lesson is chapter-10.pptx
PPT
THE CHI-SQUARE TEST IN DEPTH AND WITH EXAMPLES
PDF
PPTX
Chi square(hospital admin)
Statistical tests for categorical data(2020)88.ppt
Chi-square, Yates, Fisher & McNemar
Company Induction process and Onboarding
7. Chi square test.pdf pharmaceutical biostatistics
Biostatistics- lesson is chapter-10.pptx
THE CHI-SQUARE TEST IN DEPTH AND WITH EXAMPLES
Chi square(hospital admin)

Similar to 23-Statistical_tests_(chi-square,Fishers___&Macnemars))(UG1435-36).ppt (20)

PPTX
Lecture 12 Chi-Square.pptx
PPTX
Lecture chi squire. For Postgraduate and Undergraduate
PPTX
NON-PARAMETRIC TESTS.pptx
PDF
Chisquare Test of Association.pdf in biostatistics
PPTX
Chi square test
PPTX
Non parametric tests
PDF
nonparametrictestresearchmethodology.pdf
PPTX
Chi square(hospital admin) A
PPTX
UNIT 5.pptx
PDF
Parametric & Non-Parametric tests SPSS WORKSHOPpdf
PPT
Chi-Square Presentation - Nikki.ppt
PDF
Chi square test
PPTX
Chi sqyre test
PPTX
Inferential statistics nominal data
PPTX
Chi square test
PDF
Non Parametric Test by Vikramjit Singh
PDF
Categorical data analysis
PPTX
chai square test 1 chai square test 1 chai square test 1
PPTX
Summary of statistical tools used in spss
Lecture 12 Chi-Square.pptx
Lecture chi squire. For Postgraduate and Undergraduate
NON-PARAMETRIC TESTS.pptx
Chisquare Test of Association.pdf in biostatistics
Chi square test
Non parametric tests
nonparametrictestresearchmethodology.pdf
Chi square(hospital admin) A
UNIT 5.pptx
Parametric & Non-Parametric tests SPSS WORKSHOPpdf
Chi-Square Presentation - Nikki.ppt
Chi square test
Chi sqyre test
Inferential statistics nominal data
Chi square test
Non Parametric Test by Vikramjit Singh
Categorical data analysis
chai square test 1 chai square test 1 chai square test 1
Summary of statistical tools used in spss
Ad

More from kamranhanif3111 (7)

PPT
Alterations of Pulmonary respiratory disorders
PPTX
Respiratory- disease disorders for the patients ot RTI
PPT
5-_Introduction_to_Communicable_Disease.ppt
PPT
Principles of Communicable Diseases Epidemiology.ppt
PPTX
Lesson_1_Part_2_Epidemiology_Lifestyle_Summary.pptx
PPTX
Lesson_1_Introduction_to_Epidemiology_Summary.pptx
PPTX
Pollution_Presentation for the measirement of the
Alterations of Pulmonary respiratory disorders
Respiratory- disease disorders for the patients ot RTI
5-_Introduction_to_Communicable_Disease.ppt
Principles of Communicable Diseases Epidemiology.ppt
Lesson_1_Part_2_Epidemiology_Lifestyle_Summary.pptx
Lesson_1_Introduction_to_Epidemiology_Summary.pptx
Pollution_Presentation for the measirement of the
Ad

Recently uploaded (20)

PPT
Quality review (1)_presentation of this 21
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PDF
.pdf is not working space design for the following data for the following dat...
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
Business Acumen Training GuidePresentation.pptx
PDF
Introduction to Business Data Analytics.
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPT
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PDF
Lecture1 pattern recognition............
PDF
Clinical guidelines as a resource for EBP(1).pdf
PDF
Launch Your Data Science Career in Kochi – 2025
Quality review (1)_presentation of this 21
Acceptance and paychological effects of mandatory extra coach I classes.pptx
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
.pdf is not working space design for the following data for the following dat...
Galatica Smart Energy Infrastructure Startup Pitch Deck
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
climate analysis of Dhaka ,Banglades.pptx
oil_refinery_comprehensive_20250804084928 (1).pptx
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Business Acumen Training GuidePresentation.pptx
Introduction to Business Data Analytics.
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
Lecture1 pattern recognition............
Clinical guidelines as a resource for EBP(1).pdf
Launch Your Data Science Career in Kochi – 2025

23-Statistical_tests_(chi-square,Fishers___&Macnemars))(UG1435-36).ppt

  • 1. Statistical tests to observe the statistical significance of qualitative variables (Chi- square, Fisher’s exact & Mac Nemar’s chi- square tests)
  • 2. Types of Categorical Data Types of Categorical Data
  • 3. Types of Analysis for Types of Analysis for Categorical Data Categorical Data
  • 5. Choosing the appropriate Statistical test  Based on the three aspects of the data  Types of variables  Number of groups being compared &  Sample size
  • 6. Statistical test (cont.) Chi-square test: Study variable: Qualitative(Categorical ) Outcome variable: Qualitative(Categorical) Comparison: two or more proportions Sample size: >30 Expected frequency: > 5
  • 7. Chi-square test Purpose Purpose To find out whether the association between two To find out whether the association between two categorical variables are statistically significant categorical variables are statistically significant Null Hypothesis Null Hypothesis There is no association between two variables There is no association between two variables
  • 8.   ( o - e ) 2 e X X 2 = = Figure for Each Cell
  • 9. reject reject H Ho o if if  2 2 > >  2 2 . . , ,df df where df = ( where df = (r r-1)( -1)(c c-1) -1)  2 2 = ∑ = ∑ ( (O O - - E E) )2 2 E E 3. 3. E is the expected frequency E is the expected frequency ^ ^ E E = = ^ ^ (total of all cells) (total of all cells) total of row in total of row in which the cell lies which the cell lies total of column in total of column in which the cell lies which the cell lies • • 1. 1. The summation is over all cells of the contingency The summation is over all cells of the contingency table consisting of r rows and c columns table consisting of r rows and c columns 2. 2. O is the observed frequency O is the observed frequency 4. 4. The degrees of freedom are df = (r-1)(c-1) The degrees of freedom are df = (r-1)(c-1)
  • 10. Requirements  Prior to using the chi square test, there are certain requirements that must be met.  The data must be in the form of frequencies counted in each of a set of categories. Percentages cannot be used.  The total number observed must be exceed 20.
  • 11. Requirements  The expected frequency under the H0 hypothesis in any one fraction must not normally be less than 5.  All the observations must be independent of each other. In other words, one observation must not have an influence upon another observation.
  • 12. APPLICATION OF CHI-SQUARE TEST  TESTING INDEPENDCNE (or ASSOCATION)  TESTING FOR HOMOGENEITY  TESTING OF GOODNESS-OF-FIT
  • 13. Chi-square test  Objective : Smoking is a risk factor for MI  Null Hypothesis: Smoking does not cause MI D (MI) D (MI) No D( No MI) No D( No MI) Total Total Smokers Smokers 29 29 21 21 50 50 Non-smokers Non-smokers 16 16 34 34 50 50 Total Total 45 45 55 55 100 100
  • 18.  Degrees of Freedom df = (r-1) (c-1) = (2-1) (2-1) =1  Critical Value (Table A.6) = 3.84  X2 = 6.84  Calculated value(6.84) is greater than critical (table) value (3.84) at 0.05 level with 1 d.f.f  Hence we reject our Ho and conclude that there is highly statistically significant association between smoking and MI. Chi-Square
  • 19. Association between Diabetes and Heart Disease?  Background: Contradictory opinions:  1. A diabetic’s risk of dying after a first heart attack is the same as that of someone without diabetes. There is no association between diabetes and heart disease. vs.  2. Diabetes takes a heavy toll on the body and diabetes patients often suffer heart attacks and strokes or die from cardiovascular complications at a much younger age.  So we use hypothesis test based on the latest data to see what’s the right conclusion.  There are a total of 5167 patients, among which 1131 patients are non- diabetics and 4036 are diabetics. Among the non-diabetic patients, 42% of them had their blood pressure properly controlled (therefore it’s 475 of 1131). While among the diabetic patients only 20% of them had the blood pressure controlled (therefore it’s 807 of 4036).
  • 20. Association between Diabetes and Heart Disease?  Data Controlled Uncontrolled Total Non-diabetes 475 656 1131 Diabetes 807 3229 4036 Total 1282 3885 5167
  • 21. Association between Diabetes and Heart Disease? Data: Diabetes: 1=Not have diabetes, 2=Have Diabetes Control: 1=Controlled, 2=Uncontrolled DIABETES * CONTROL Crosstabulation Count 475 656 1131 807 3229 4036 1282 3885 5167 1.00 2.00 DIABETES Total 1.00 2.00 CONTROL Total
  • 22. Association between Diabetes and Heart Disease? Hypothesis test: H0: There is no association between diabetes and heart disease. (or) Diabetes and heart disease are independent. vs HA: There is an association between diabetes and heart disease. (or) Diabetes and heart disease are dependent. --- Assume a significance level of 0.05
  • 23. Association between Diabetes and Heart Disease? ---The computer gives us a Chi-Square Statistic of 229.268 ---The computer gives us a p-value of .000 (<0.0001) --- Because our p-value is less than alpha, we would reject the null hypothesis. --- There is sufficient evidence to conclude that there is an association between diabetes and heart disease.
  • 24. Age Age Gender Gender <30 <30 30-45 30-45 >45 >45 Total Total Male Male 60 (60) 60 (60) 20 (30) 20 (30) 40 (30) 40 (30) 120 120 Female Female 40 (40) 40 (40) 30 (20) 30 (20) 10 (20) 10 (20) 80 80 Total Total 100 100 50 50 50 50 200 200 Chi- square test Find out whether the gender is equally distributed among each age group
  • 25. Test for Homogeneity (Similarity) To test similarity between frequency distribution or group. It is used in assessing the similarity between non-responders and responders in any survey Age (yrs) Age (yrs) Responders Responders Non-responders Non-responders Total Total <20 <20 76 (82) 76 (82) 20 (14) 20 (14) 96 96 20 – 29 20 – 29 288 (289) 288 (289) 50 (49) 50 (49) 338 338 30-39 30-39 312 (310) 312 (310) 51 (53) 51 (53) 363 363 40-49 40-49 187 (185) 187 (185) 30 (32) 30 (32) 217 217 >50 >50 77 (73) 77 (73) 9 (13) 9 (13) 86 86 Total Total 940 940 160 160 1100 1100
  • 26. Fisher’s exact test: Study variable: Qualitative(Categorical) Outcome variable: Qualitative(Categorical) Comparison: two proportions Sample size: < 30
  • 27. Example Example  The following data relate to suicidal feelings in samples of psychotic and neurotic patients:
  • 28. Example Example  The following data compare malocclusion of teeth with method of feeding infants. Normal teeth Malocclusion Breast fed 4 (a) 16 (b) Bottle fed 1 (c) 21 (d)
  • 29. Fisher’s Exact Test: Fisher’s Exact Test:  The method of Yates's correction was useful when manual calculations were done. Now different types of statistical packages are available. Therefore, it is better to use Fisher's exact test rather than Yates's correction as it gives exact result. 1 2 1 2 ! ! ! ! ' ! ! ! ! ! R R C C Fisher s ExactTest n a b c d 
  • 34. What to do when we have a paired samples and both the exposure and outcome variables are qualitative variables (Binary).
  • 35. Macnemar’s test: (for paired samples) Study variable: Qualitative (categorical) Outcome variable: Qualitative(categorical) Comparison: two proportions Sample size: Any
  • 36. Problem  A researcher has done a matched case- control study of endometrial cancer (cases) and exposure to conjugated estrogens (exposed).  In the study cases were individually matched 1:1 to a non-cancer hospital- based control, based on age, race, date of admission, and hospital.
  • 37. McNemar’s test McNemar’s test Situation: Situation: Two paired binary variables that Two paired binary variables that form a particular type of 2 x 2 form a particular type of 2 x 2 table table e.g. matched case-control study or e.g. matched case-control study or cross-over trial cross-over trial
  • 38. Cases Controls Total Exposed 55 19 74 Not exposed 128 164 292 Total 183 183 366 Data
  • 39.  can’t use a chi-squared test - observations are not independent - they’re paired.  we must present the 2 x 2 table differently  each cell should contain a count of the number of pairs with certain criteria, with the columns and rows respectively referring to each of the subjects in the matched pair  the information in the standard 2 x 2 table used for unmatched studies is insufficient because it doesn’t say who is in which pair - ignoring the matching
  • 40. Controls Cases Exposed Not exposed Total Exposed 12 43 55 Not exposed 7 121 128 Total 19 164 183 Data
  • 41. Controls Cases Exposed Not exposed Total Exposed e f e+f Not exposed g h g+h Total e+g f+h n We construct a matched 2 x 2 table:
  • 42. The odds ratio is: f/g The test is: Formula g f 1) - g f ( 2 2     Compare this to the 2 distribution on 1 df
  • 43. 5 . 24 50 1225 7 3 4 1) - 7 - 43 ( 2 2      P <0.001, Odds Ratio = 43/7 = 6.1 p1 - p2 = (55/183) – (19/183) = 0.197 (20%) s.e.(p1 - p2) = 0.036 95% CI: 0.12 to 0.27 (or 12% to 27%)
  • 44.  Degrees of Freedom df = (r-1) (c-1) = (2-1) (2-1) =1  Critical Value (Table A.6) = 3.84  X2 = 25.92  Calculated value(25.92) is greater than critical (table) value (3.84) at 0.05 level with 1 d.f.f  Hence we reject our Ho and conclude that there is highly statistically significant association between Endometrial cancer and Estrogens.
  • 46. In Conclusion ! When both the study variables and outcome variables are categorical (Qualitative): Apply (i) Chi square test (ii) Fisher’s exact test (Small samples) (iii) Mac nemar’s test ( for paired samples)