SlideShare a Scribd company logo
ANALYSIS OF
CONTINUOUS
VARIABLES
(ANOVA test & Correlation)
Dr Lipilekha Patnaik
Professor, Community Medicine
Institute of Medical Sciences & SUM Hospital
Siksha ‘O’Anusandhan deemed to be University
Bhubaneswar, Odisha, India
Email: drlipilekha@yahoo.co.in
Session Objectives
• ANOVA test
• Correlation
2
Which parametric test to use
3
Outcome/
dependent
variable
Exposure/
independent
variable
Continuous
variable
Normally
Distributed
2 related group
..
One group
…..
2 independent
group
Continuous
variable
Normal dist.
Categorical
variable
> 2 independent
group
One sample ..
t test
Paired sample
t test
Unpaired
sample t test
ANOVA Test
…..
Pearson
correlation test
• If more than two groups of data
• Estimating the difference in a quantitative/ continuous parameter
between more than 2 independent groups.
- ANOVA TEST
How many groups and between whom we are comparing?
4
If means are significantly different:
• The independent variable has an effect on the
dependent variable
• Compare the blood sugar of Heavy Smokers, mild
smokers and Non-smokers.
• Independent groups,>2 groups
• Quantitative/Continuousvariable
5
When to choose ANOVA?
• Criterion 1: Comparison between groups
Eg: Compare the birth weight of children born to mothers in different BMI
groups (Under weight, Normal, Over weight/Obese)
• Criterion 2: More than 2 groups
Eg.- Comparison between 3 BMI groups
• Criterion 3: The groups are independent
Eg.- Subjects can only belong to either one of the BMI groups i.e.:
(Under weight, Normal, Over weight/Obese)
6
Cont.
• Criterion 4: Dependent variable is continuous/quantitative
(in other words one should be able to compute the mean of the
measured variable)
eg.: The variable to be compared (birth weight) measured in grams is a
continuous variable
• Criterion 5: The data should follow normal distribution in each group
of the sampled population.
Eg: Birth weight data follows normal distribution in Under weight,
Normal, Over weight/Obese
7
Additional assumption for ANOVA
• The population variances should be equal
Eg: The amount of variation of birth weight in Under weight, Normal,
Over weight/Obese.
8
9
10
Post-hoc tests
• Further analysis to understand how the groups differ.
• If the F-test is significant, you have a difference in population
means. But you don’t know where.
• We need a test to tell which means are different.
• Bonferroni/ Tukey HSD should be done.
11
12
• Used to predict the association between two continuous variables.
• Eg. Correlation between systolic blood pressure and cholesterol
levels
• We estimate correlation coefficient (Pearson Product Moment
Correlation coefficient).
Correlation analysis
13
• The sample correlation coefficient, denoted by r
• This quantifies the direction and strength of correlation.
• Direction may be
• Positive:Positivechangein one producespositivechangein the other
• Negative: Positivechange in one producesnegativechangein the other
• Ranges between +1 and -1
Correlation coefficient
14
r value Nature of correlation
(Negative correlation)
r value Nature of correlation
(Positivecorrelation)
-1 Absolute correlation +1 Absolute correlation
-0.9 to -1 Very high correlation +0.9 to +1 Very high correlation
-0.7 to -0.9 High correlation +0.7 to +0.9 High correlation
-0.5 to -0.7 Moderate correlation +0.5 to +0.7 Moderate correlation
-0.3 to -0.5 Low correlation +0.3 to +0.5 Low correlation
0 to -0.3 Negligible correlation 0 to +0.3 Negligible correlation
Degree of correlation
15
Magnitude of ‘r’ determines the strength of association
Usually scatter plot is used to determine if any relation exists.
r value‘0’- No correlation
Correlation table
16
17
18
19
Scatter plot
The pattern of data is indicative of the type of relationship between
two variables:
Øpositive relationship
Ønegative relationship
Øno relationship
20
Scatter plot
• No Correlation (r = 0)
• Random or circular assortment of dots
• Positive Correlation (r > 0)
• ellipse leaning to right
• Age of children and height
• Age and SBP
• Negative Correlation (r < 0)
• ellipse learning to left
• Depression & Self-esteem
• Hours of studying & test errors
21
22
23
24
25
26
• Failure to consider a third variable related to both and responsible for
the results of correlation can be omitted (Confounders).
• Can not establish causation.
• Non-linear relationship, though may exist, may not become visible in
correlation analysis.
Limitations of correlation
27
Take Home Messages
• Difference in a quantitative/ continuous parameter between 2
independent groups -Unpaired T-test/ Independent samples T –test
• Difference in a quantitative/ continuous parameter between paired
data from one sample - Paired T-test
• Difference in a quantitative/ continuous parameter between more than
2 groups – ANOVA
• Association between two continuous variables – Correlation
• Predict the value of one variable corresponding to a given value of
other variable - Regression
28
29

More Related Content

PPTX
Nonparametric tests
PPTX
Parametric tests
PPT
Correlation - Biostatistics
PPTX
Measures of relationships
PPTX
Parametric versus non parametric test
PPT
DIstinguish between Parametric vs nonparametric test
PPTX
Non Parametric Tests
PDF
Repeated Measures ANOVA
Nonparametric tests
Parametric tests
Correlation - Biostatistics
Measures of relationships
Parametric versus non parametric test
DIstinguish between Parametric vs nonparametric test
Non Parametric Tests
Repeated Measures ANOVA

What's hot (20)

PPTX
Student t-test
PPT
The Kruskal-Wallis H Test
PPT
PPTX
The mann whitney u test
PPTX
Parametric Test
PPTX
Wilcoxon signed rank test
PPTX
Parametric tests
PDF
Parametric and non parametric test in biostatistics
PPTX
Non-Parametric Tests
PPTX
What is a paired samples t test
PDF
Analysis of Variance (ANOVA)
PPTX
PPT
Testing of hypothesis
PPTX
Statistical tests of significance and Student`s T-Test
PPTX
PPTX
Sign test
PPTX
Sample and sample size
PPTX
Parametric Statistical tests
Student t-test
The Kruskal-Wallis H Test
The mann whitney u test
Parametric Test
Wilcoxon signed rank test
Parametric tests
Parametric and non parametric test in biostatistics
Non-Parametric Tests
What is a paired samples t test
Analysis of Variance (ANOVA)
Testing of hypothesis
Statistical tests of significance and Student`s T-Test
Sign test
Sample and sample size
Parametric Statistical tests
Ad

Similar to ANOVA test and correlation (20)

PDF
PPTX
Correlation analysis
PPTX
Introduction to Educational statistics and measurement Unit 2
PPTX
Correlation analysis
PPTX
Predictive analytics using 'R' Programming
PPTX
Biostatistics Lecture on Correlation.pptx
PPT
Simple linear regressionn and Correlation
PPT
Correlation Research_Arslan Sheikh_PhD Scholar
PPT
Coefficient of Correlation Pearsons .ppt
PPTX
Data processing
PPTX
Correlation.pptx
PDF
P G STAT 531 Lecture 9 Correlation
PPTX
4. correlations
PPTX
Correlation research
PPT
QUANTITATIVE DATA ANALYSIS for powerful research.ppt
PPT
Linear Correlation
PPTX
Correlation Analysis in the statistics.pptx
PPT
Ebd1 lecture 8 2010
PPTX
Methods of computation of correction stat.
Correlation analysis
Introduction to Educational statistics and measurement Unit 2
Correlation analysis
Predictive analytics using 'R' Programming
Biostatistics Lecture on Correlation.pptx
Simple linear regressionn and Correlation
Correlation Research_Arslan Sheikh_PhD Scholar
Coefficient of Correlation Pearsons .ppt
Data processing
Correlation.pptx
P G STAT 531 Lecture 9 Correlation
4. correlations
Correlation research
QUANTITATIVE DATA ANALYSIS for powerful research.ppt
Linear Correlation
Correlation Analysis in the statistics.pptx
Ebd1 lecture 8 2010
Methods of computation of correction stat.
Ad

More from Dr Lipilekha Patnaik (20)

PDF
Health Programmes in India.pdf
PDF
Concept of public health.pdf
PDF
Demographic profile of india
PDF
Indicators of health
PDF
Immunization
PDF
Epidemic investigation
PDF
Study designs
PDF
Cross sectional study
PDF
Descriptive epidemiology
PDF
Rate, ratio, proportion
PDF
Introduction to epidemiology
PDF
Health planning in india
PDF
12th five year plan and NITI ayog
PDF
National programme for prevention and control of cancer, diabetes, CVDs and s...
PDF
Universal immunization programme
PDF
Health education
PDF
National AIDS Control Programme
PDF
PDF
Nutrition programmes in india
PDF
Normality tests
Health Programmes in India.pdf
Concept of public health.pdf
Demographic profile of india
Indicators of health
Immunization
Epidemic investigation
Study designs
Cross sectional study
Descriptive epidemiology
Rate, ratio, proportion
Introduction to epidemiology
Health planning in india
12th five year plan and NITI ayog
National programme for prevention and control of cancer, diabetes, CVDs and s...
Universal immunization programme
Health education
National AIDS Control Programme
Nutrition programmes in india
Normality tests

Recently uploaded (20)

PPT
OPIOID ANALGESICS AND THEIR IMPLICATIONS
PPT
Breast Cancer management for medicsl student.ppt
PPTX
15.MENINGITIS AND ENCEPHALITIS-elias.pptx
PPT
Copy-Histopathology Practical by CMDA ESUTH CHAPTER(0) - Copy.ppt
PPTX
Stimulation Protocols for IUI | Dr. Laxmi Shrikhande
PPTX
Imaging of parasitic D. Case Discussions.pptx
PPTX
CEREBROVASCULAR DISORDER.POWERPOINT PRESENTATIONx
PPTX
Respiratory drugs, drugs acting on the respi system
PPT
STD NOTES INTRODUCTION TO COMMUNITY HEALT STRATEGY.ppt
PPTX
SKIN Anatomy and physiology and associated diseases
PPTX
Transforming Regulatory Affairs with ChatGPT-5.pptx
PPT
Obstructive sleep apnea in orthodontics treatment
PPTX
neonatal infection(7392992y282939y5.pptx
PPTX
anaemia in PGJKKKKKKKKKKKKKKKKHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH...
PPT
1b - INTRODUCTION TO EPIDEMIOLOGY (comm med).ppt
PPTX
Fundamentals of human energy transfer .pptx
PDF
شيت_عطا_0000000000000000000000000000.pdf
PPTX
Neuropathic pain.ppt treatment managment
PPT
ASRH Presentation for students and teachers 2770633.ppt
PPTX
History and examination of abdomen, & pelvis .pptx
OPIOID ANALGESICS AND THEIR IMPLICATIONS
Breast Cancer management for medicsl student.ppt
15.MENINGITIS AND ENCEPHALITIS-elias.pptx
Copy-Histopathology Practical by CMDA ESUTH CHAPTER(0) - Copy.ppt
Stimulation Protocols for IUI | Dr. Laxmi Shrikhande
Imaging of parasitic D. Case Discussions.pptx
CEREBROVASCULAR DISORDER.POWERPOINT PRESENTATIONx
Respiratory drugs, drugs acting on the respi system
STD NOTES INTRODUCTION TO COMMUNITY HEALT STRATEGY.ppt
SKIN Anatomy and physiology and associated diseases
Transforming Regulatory Affairs with ChatGPT-5.pptx
Obstructive sleep apnea in orthodontics treatment
neonatal infection(7392992y282939y5.pptx
anaemia in PGJKKKKKKKKKKKKKKKKHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH...
1b - INTRODUCTION TO EPIDEMIOLOGY (comm med).ppt
Fundamentals of human energy transfer .pptx
شيت_عطا_0000000000000000000000000000.pdf
Neuropathic pain.ppt treatment managment
ASRH Presentation for students and teachers 2770633.ppt
History and examination of abdomen, & pelvis .pptx

ANOVA test and correlation

  • 1. ANALYSIS OF CONTINUOUS VARIABLES (ANOVA test & Correlation) Dr Lipilekha Patnaik Professor, Community Medicine Institute of Medical Sciences & SUM Hospital Siksha ‘O’Anusandhan deemed to be University Bhubaneswar, Odisha, India Email: drlipilekha@yahoo.co.in
  • 2. Session Objectives • ANOVA test • Correlation 2
  • 3. Which parametric test to use 3 Outcome/ dependent variable Exposure/ independent variable Continuous variable Normally Distributed 2 related group .. One group ….. 2 independent group Continuous variable Normal dist. Categorical variable > 2 independent group One sample .. t test Paired sample t test Unpaired sample t test ANOVA Test ….. Pearson correlation test
  • 4. • If more than two groups of data • Estimating the difference in a quantitative/ continuous parameter between more than 2 independent groups. - ANOVA TEST How many groups and between whom we are comparing? 4
  • 5. If means are significantly different: • The independent variable has an effect on the dependent variable • Compare the blood sugar of Heavy Smokers, mild smokers and Non-smokers. • Independent groups,>2 groups • Quantitative/Continuousvariable 5
  • 6. When to choose ANOVA? • Criterion 1: Comparison between groups Eg: Compare the birth weight of children born to mothers in different BMI groups (Under weight, Normal, Over weight/Obese) • Criterion 2: More than 2 groups Eg.- Comparison between 3 BMI groups • Criterion 3: The groups are independent Eg.- Subjects can only belong to either one of the BMI groups i.e.: (Under weight, Normal, Over weight/Obese) 6
  • 7. Cont. • Criterion 4: Dependent variable is continuous/quantitative (in other words one should be able to compute the mean of the measured variable) eg.: The variable to be compared (birth weight) measured in grams is a continuous variable • Criterion 5: The data should follow normal distribution in each group of the sampled population. Eg: Birth weight data follows normal distribution in Under weight, Normal, Over weight/Obese 7
  • 8. Additional assumption for ANOVA • The population variances should be equal Eg: The amount of variation of birth weight in Under weight, Normal, Over weight/Obese. 8
  • 9. 9
  • 10. 10
  • 11. Post-hoc tests • Further analysis to understand how the groups differ. • If the F-test is significant, you have a difference in population means. But you don’t know where. • We need a test to tell which means are different. • Bonferroni/ Tukey HSD should be done. 11
  • 12. 12
  • 13. • Used to predict the association between two continuous variables. • Eg. Correlation between systolic blood pressure and cholesterol levels • We estimate correlation coefficient (Pearson Product Moment Correlation coefficient). Correlation analysis 13
  • 14. • The sample correlation coefficient, denoted by r • This quantifies the direction and strength of correlation. • Direction may be • Positive:Positivechangein one producespositivechangein the other • Negative: Positivechange in one producesnegativechangein the other • Ranges between +1 and -1 Correlation coefficient 14
  • 15. r value Nature of correlation (Negative correlation) r value Nature of correlation (Positivecorrelation) -1 Absolute correlation +1 Absolute correlation -0.9 to -1 Very high correlation +0.9 to +1 Very high correlation -0.7 to -0.9 High correlation +0.7 to +0.9 High correlation -0.5 to -0.7 Moderate correlation +0.5 to +0.7 Moderate correlation -0.3 to -0.5 Low correlation +0.3 to +0.5 Low correlation 0 to -0.3 Negligible correlation 0 to +0.3 Negligible correlation Degree of correlation 15 Magnitude of ‘r’ determines the strength of association Usually scatter plot is used to determine if any relation exists. r value‘0’- No correlation
  • 17. 17
  • 18. 18
  • 19. 19
  • 20. Scatter plot The pattern of data is indicative of the type of relationship between two variables: Øpositive relationship Ønegative relationship Øno relationship 20
  • 21. Scatter plot • No Correlation (r = 0) • Random or circular assortment of dots • Positive Correlation (r > 0) • ellipse leaning to right • Age of children and height • Age and SBP • Negative Correlation (r < 0) • ellipse learning to left • Depression & Self-esteem • Hours of studying & test errors 21
  • 22. 22
  • 23. 23
  • 24. 24
  • 25. 25
  • 26. 26
  • 27. • Failure to consider a third variable related to both and responsible for the results of correlation can be omitted (Confounders). • Can not establish causation. • Non-linear relationship, though may exist, may not become visible in correlation analysis. Limitations of correlation 27
  • 28. Take Home Messages • Difference in a quantitative/ continuous parameter between 2 independent groups -Unpaired T-test/ Independent samples T –test • Difference in a quantitative/ continuous parameter between paired data from one sample - Paired T-test • Difference in a quantitative/ continuous parameter between more than 2 groups – ANOVA • Association between two continuous variables – Correlation • Predict the value of one variable corresponding to a given value of other variable - Regression 28
  • 29. 29