SlideShare a Scribd company logo
3
Most read
15
Most read
22
Most read
Correlation(Pearson & spearman) 
&Linear Regression
Correlation 
 Semantically, Correlation means Co-together and 
Relation. 
 Statistical correlation is a statistical technique which 
tells us if two variables are related.
PEARSON CORRELATION 
 measures the degree of linear association between 
two interval scaled variables analysis of the 
relationship between two quantitative outcomes, 
e.g., height and weight
Assumption under Pearson’s 
Correlation Coefficient 
 Assumption 1: The correlation coefficient r assumes 
that the two variables measured 
form a bivariate normal distribution population. 
 Assumption 2: The correlation coefficient r measures 
only linear associations: how nearly the data 
falls on a straight line. It is not a good summary of 
the association if the scatterplot has a nonlinear 
(curved) pattern.
Assumptions Contd. 
 Assumption 3: The correlation coefficient r is not a 
good summary of association if the data are 
heteroscedastic.(when random variables have the 
same finite variance. It is also known as homogenity 
of variance) 
 Assumption 4: The correlation coefficient r is not a 
good summary of association if the data have 
outliers.
Correlation and regression
Strength of Relationship 
 r lies between -1 and 1. Values near 0 means no (linear) 
correlation and values near ± 1 means very strong 
correlation. 

Correlation and regression
Interpretation of the value of r
Coefficient of Determination 
 Pearson's r can be squared , r 2, to derive a 
coefficient of determination. 
 Coefficient of determination - the portion of 
variability in one of the variables that can be 
accounted for by variability in the second 
variable
Example 
 Pearson's r can be squared , r 2 
 If r=0.5, then r2=0.25 If r=0.7 then r2=0.49 
 Thus while r=0.5 versus 0.7 might not look so 
different in terms of strength, r2 tells us that r=0.7 
accounts for about twice the variability relative to 
r=0.5
Spearman’s Rank Correlation 
 Spearman's rank correlation coefficient 
or Spearman's rho, is a measure of statistical 
dependence between two variables.
Interpretation 
 The sign of the Spearman correlation indicates the 
direction of association between X (the independent 
variable) and Y (the dependent variable). If Y tends to 
increase when X increases, the Spearman correlation 
coefficient is positive. If Y tends to decrease when X 
increases, the Spearman correlation coefficient is 
negative. A Spearman correlation of zero indicates 
that there is no tendency for Y to either increase or 
decrease when X increases
Repeated Ranks 
 If there is more 
than one item with 
the same value , 
then they are 
given a common 
rank which is 
average of their 
respective ranks 
as shown in the 
table.
 The raw data in the 
table below is used to 
calculate the 
correlation between 
the IQ of an with the 
number of hours 
spent in front 
of TV per week. 
Example
Example Contd.
Regression 
 One variable is a direct cause of the other or if the 
value of one variable is changed, then as a direct 
consequence, the other variable also change or if the 
main purpose of the analysis is prediction of one 
variable from the other 

Regression 
 Regression: the dependence of dependent variable Y 
on the independent variable X. 
 Relationship is summarized by a regression equation. 
y = a + bx 
 A=intercept at y axis 
 B=regression coefficient
The Least Squares Method 
 The line of regression is the line which gives the best 
estimate to the value of one variable for any specific 
value of the other variable. Thus the line of regression 
is the line of “best fit” and is Obtained by the principle 
of least squares. 
 This principle consists in minimizing the sum of the 
squares of the deviations of the actual values of y 
from their estimate values given by the line of best fit
Formulas to be used
Example 
 Fit a least square line to the following data
Solution
THANK YOU.

More Related Content

PDF
Correlation Analysis
PPTX
Correlation and regression analysis
PPT
PPTX
Correlation and Regression
PPTX
Correlation analysis
PPT
Measures of Central Tendency - Biostatstics
PPTX
Correlation and Regression
PDF
Simple linear regression
Correlation Analysis
Correlation and regression analysis
Correlation and Regression
Correlation analysis
Measures of Central Tendency - Biostatstics
Correlation and Regression
Simple linear regression

What's hot (20)

PPTX
Correlation and Regression ppt
PDF
Introduction to correlation and regression analysis
PPT
Chi square mahmoud
PPT
Regression analysis
PPT
Regression analysis
PPT
Correlation and regression
PPT
Probability concept and Probability distribution
PPTX
Regression analysis
PPTX
STATISTIC ESTIMATION
PPTX
Point and Interval Estimation
PDF
Correlation and Regression
PPTX
PPT
Regression
PPTX
Regression analysis
PPTX
Regression analysis.
PPTX
Multiple Linear Regression
PPTX
Multiple Linear Regression
PPTX
Multiple linear regression
PPT
Simple linear regression
PPT
Ch4 Confidence Interval
Correlation and Regression ppt
Introduction to correlation and regression analysis
Chi square mahmoud
Regression analysis
Regression analysis
Correlation and regression
Probability concept and Probability distribution
Regression analysis
STATISTIC ESTIMATION
Point and Interval Estimation
Correlation and Regression
Regression
Regression analysis
Regression analysis.
Multiple Linear Regression
Multiple Linear Regression
Multiple linear regression
Simple linear regression
Ch4 Confidence Interval
Ad

Viewers also liked (10)

PDF
Correlation and Simple Regression
PPS
Correlation and regression
PPT
regression and correlation
PPTX
Chi square test final
PPT
Chi Square Worked Example
PPTX
Contingency Table Test, M. Asad Hayat, UET Taxila
PPT
Correlation
PPT
Chi – square test
PPT
Regression analysis ppt
PPTX
Chi square test
Correlation and Simple Regression
Correlation and regression
regression and correlation
Chi square test final
Chi Square Worked Example
Contingency Table Test, M. Asad Hayat, UET Taxila
Correlation
Chi – square test
Regression analysis ppt
Chi square test
Ad

Similar to Correlation and regression (20)

PPT
correlation.ppt
PPT
Correlation
PDF
correlationppt-111222215110-phpapp02.pdf
PPTX
Correlation and regression impt
PDF
Correlation.pdf
PPTX
CORRELATION ( srm1) - Copy.pptx
PPTX
Simple correlation & Regression analysis
PPTX
Correlation analysis in Biostatistics .pptx
PPT
5 regressionand correlation
PDF
9. parametric regression
PDF
Correlation Analysis for MSc in Development Finance .pdf
PPTX
Dr Amita Marwha -correlation coeeficient and partial.pptx
PPTX
PPT Correlation.pptx
PPT
Hph7310week2winter2009narr
PPT
2-20-04.ppthjjbnjjjhhhhhhhhhhhhhhhhhhhhhhhh
PPTX
Correlation and regression
PPT
2-20-04.ppt
PPTX
Dependance Technique, Regression & Correlation
PPT
12943625.ppt
PPTX
Correlation
correlation.ppt
Correlation
correlationppt-111222215110-phpapp02.pdf
Correlation and regression impt
Correlation.pdf
CORRELATION ( srm1) - Copy.pptx
Simple correlation & Regression analysis
Correlation analysis in Biostatistics .pptx
5 regressionand correlation
9. parametric regression
Correlation Analysis for MSc in Development Finance .pdf
Dr Amita Marwha -correlation coeeficient and partial.pptx
PPT Correlation.pptx
Hph7310week2winter2009narr
2-20-04.ppthjjbnjjjhhhhhhhhhhhhhhhhhhhhhhhh
Correlation and regression
2-20-04.ppt
Dependance Technique, Regression & Correlation
12943625.ppt
Correlation

Recently uploaded (20)

PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PPTX
Cell Structure & Organelles in detailed.
PDF
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PPTX
Cardiovascular Pharmacology for pharmacy students.pptx
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PDF
Introduction-to-Social-Work-by-Leonora-Serafeca-De-Guzman-Group-2.pdf
PPTX
GDM (1) (1).pptx small presentation for students
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
Business Ethics Teaching Materials for college
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PDF
Microbial disease of the cardiovascular and lymphatic systems
PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PPTX
Open Quiz Monsoon Mind Game Final Set.pptx
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PPTX
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
Renaissance Architecture: A Journey from Faith to Humanism
Cell Structure & Organelles in detailed.
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
Pharmacology of Heart Failure /Pharmacotherapy of CHF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
Cardiovascular Pharmacology for pharmacy students.pptx
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
Introduction-to-Social-Work-by-Leonora-Serafeca-De-Guzman-Group-2.pdf
GDM (1) (1).pptx small presentation for students
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
Business Ethics Teaching Materials for college
FourierSeries-QuestionsWithAnswers(Part-A).pdf
Microbial disease of the cardiovascular and lymphatic systems
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
Open Quiz Monsoon Mind Game Final Set.pptx
Microbial diseases, their pathogenesis and prophylaxis
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES

Correlation and regression

  • 1. Correlation(Pearson & spearman) &Linear Regression
  • 2. Correlation  Semantically, Correlation means Co-together and Relation.  Statistical correlation is a statistical technique which tells us if two variables are related.
  • 3. PEARSON CORRELATION  measures the degree of linear association between two interval scaled variables analysis of the relationship between two quantitative outcomes, e.g., height and weight
  • 4. Assumption under Pearson’s Correlation Coefficient  Assumption 1: The correlation coefficient r assumes that the two variables measured form a bivariate normal distribution population.  Assumption 2: The correlation coefficient r measures only linear associations: how nearly the data falls on a straight line. It is not a good summary of the association if the scatterplot has a nonlinear (curved) pattern.
  • 5. Assumptions Contd.  Assumption 3: The correlation coefficient r is not a good summary of association if the data are heteroscedastic.(when random variables have the same finite variance. It is also known as homogenity of variance)  Assumption 4: The correlation coefficient r is not a good summary of association if the data have outliers.
  • 7. Strength of Relationship  r lies between -1 and 1. Values near 0 means no (linear) correlation and values near ± 1 means very strong correlation. 
  • 10. Coefficient of Determination  Pearson's r can be squared , r 2, to derive a coefficient of determination.  Coefficient of determination - the portion of variability in one of the variables that can be accounted for by variability in the second variable
  • 11. Example  Pearson's r can be squared , r 2  If r=0.5, then r2=0.25 If r=0.7 then r2=0.49  Thus while r=0.5 versus 0.7 might not look so different in terms of strength, r2 tells us that r=0.7 accounts for about twice the variability relative to r=0.5
  • 12. Spearman’s Rank Correlation  Spearman's rank correlation coefficient or Spearman's rho, is a measure of statistical dependence between two variables.
  • 13. Interpretation  The sign of the Spearman correlation indicates the direction of association between X (the independent variable) and Y (the dependent variable). If Y tends to increase when X increases, the Spearman correlation coefficient is positive. If Y tends to decrease when X increases, the Spearman correlation coefficient is negative. A Spearman correlation of zero indicates that there is no tendency for Y to either increase or decrease when X increases
  • 14. Repeated Ranks  If there is more than one item with the same value , then they are given a common rank which is average of their respective ranks as shown in the table.
  • 15.  The raw data in the table below is used to calculate the correlation between the IQ of an with the number of hours spent in front of TV per week. Example
  • 17. Regression  One variable is a direct cause of the other or if the value of one variable is changed, then as a direct consequence, the other variable also change or if the main purpose of the analysis is prediction of one variable from the other 
  • 18. Regression  Regression: the dependence of dependent variable Y on the independent variable X.  Relationship is summarized by a regression equation. y = a + bx  A=intercept at y axis  B=regression coefficient
  • 19. The Least Squares Method  The line of regression is the line which gives the best estimate to the value of one variable for any specific value of the other variable. Thus the line of regression is the line of “best fit” and is Obtained by the principle of least squares.  This principle consists in minimizing the sum of the squares of the deviations of the actual values of y from their estimate values given by the line of best fit
  • 21. Example  Fit a least square line to the following data