SlideShare a Scribd company logo
Correlation and Regression 
10/13/14 1
Product Moment Correlation 
• The product moment correlation, r, summarizes the 
strength of association between two metric (interval or 
ratio scaled) variables, say X and Y. 
• It is an index used to determine whether a linear or 
straight-line relationship exists between X and Y. 
• As it was originally proposed by Karl Pearson, it is 
also known as the Pearson correlation coefficient. 
It is also referred to as simple correlation, bivariate 
correlation, or merely the correlation coefficient. 
• A correlation is a single number that describes the 
degree of relationship between two variables. (http:// 
www.socialresearchmethods.net/kb/statcorr.php ) 
10/13/14 2
SPSS 
• Is there any correlation/association 
between attitude toward the city and 
duration of residence? 
H0 : There is no association/ correlation 
between duration of residence and attitude 
toward the city 
H1 : There is an association/ correlation 
between between duration of residence 
and attitude toward the city 
10/13/14 3
Tab le 17.1 
Respondent No Attitude Toward 
the City 
Duration of 
Residence 
Importance 
Attached to 
Weather 
1 6 10 3 
2 9 12 11 
3 8 12 4 
4 3 4 1 
5 10 12 11 
6 4 6 1 
7 5 8 7 
8 2 2 4 
9 11 18 8 
10 9 9 10 
11 10 17 8 
12 2 2 5 
10/13/14 4
Descriptive Statistics 
Mean Std. Deviation N 
6,58 3,32 12 
9,33 5,26 12 
Attitude toward the city 
Duration of residence 
Correlations 
1,000 ,936** 
10/13/14 5 
, ,000 
120,917 179,667 
10,992 16,333 
12 12 
,936** 1,000 
,000 , 
179,667 304,667 
16,333 27,697 
12 12 
Pearson Correlation 
Sig. (2-tailed) 
Sum of Squares and 
Cross-products 
Covariance 
N 
Pearson Correlation 
Sig. (2-tailed) 
Sum of Squares and 
Cross-products 
Covariance 
N 
Attitude toward the city 
Duration of residence 
Attitude 
toward the city 
Duration of 
residence 
Correlation is significant at the 0.01 **. level (2-tailed).
Correlation Coefficient 
n 
S (Xi – X) (Yi – Y) 
i=1 
r = = COVxy / SxSy 
n n 
S (Xi – X)2 S (Yi – Y)2 
i=1 i=1 
10/13/14 6
Y X1 X2 
r = (179.6668) / [(304/6669)(120.9168) = 0.9361 
The respondent’s duration of residence in the city 
is strongly associated with attitude toward the city 
10/13/14 7
city 
the toward Attitude N = 2 1 1 1 1 1 3 1 1 
2 4 6 8 9 10 12 17 18 10/13/14 8 
Duration of residence 
12 
10 
8 
6 
4 
2 
0
• Analyze > correlate > bivariate 
– Options : - means and standar deviations 
- cross product deviations and covariances 
• Graphs > box plot > simple > define >ok 
10/13/14 9
Regression Analysis 
• Regression analysis is a statistical procedure for 
analyzing associative relationship between a 
metric dependent variable and one or more 
metric independent variables. 
• Independent variable affect dependent variable 
10/13/14 10
Regression Analysis can be used in 
the following ways: 
1. Determine whether the independent variables 
explain a significant variation in the dependent 
variable: whether a relationship exists. 
2. Determine how much of the variation in the 
dependent variable can be explained by the 
independent variables: strength of the relationship. 
3. Determine the structure or form of the relationship: 
the mathematical equation relating the independent 
and dependent variables. 
4. Predict the values of the dependent variables. 
5. Control for other independent variables when 
evaluating the contributions of specific variable or 
set of variables. 
10/13/14 11
Bivariate Regression 
• Bivariate regression is a procedure for deriving a 
mathematical relationship, in the form of an 
equation, between a single metric dependent 
variable and a single metric independent 
variable. 
• Bivariate regression example in marketing: 
– Can variation in sales be explained in terms of 
variation in advertising expenditures? What is the 
structure and form of this relationship, can it be 
modeled mathematically by an equation describing a 
straight line. 
10/13/14 12
Multiple Regression 
The general form of the multiple regression model 
is as follows: 
Y = b0 + b1X1 + b2X2 + b3X3+ . . . + bkXk + e 
which is estimated by the following equation: 
Y 
= a + b1X1 + b2X2 + b3X3+ . . . + bXkk 
As before, the coefficient a represents the intercept, 
but the b's are now the partial regression coefficients. 
10/13/14 13
Statistics Associated with 
Multiple Regression 
• Adjusted R2. R2, coefficient of multiple determination, is 
adjusted for the number of independent variables and the 
sample size to account for the diminishing returns. After 
the first few variables, the additional independent 
variables do not make much contribution. 
• Coefficient of multiple determination. The strength of 
association in multiple regression is measured by the 
square of the multiple correlation coefficient, R2, which is 
also called the coefficient of multiple determination. 
• F test. The F test is used to test the null hypothesis that 
the coefficient of multiple determination in the population, 
R2 
pop, is zero. This is equivalent to testing the null 
hypothesis. The test statistic has an F distribution with k 
and (n - k - 1) degrees of freedom. 
10/13/14 14
Model Summary 
R R Square 
Adjusted 
R Square 
Std. Error of 
the Estimate 
Change Statistics 
R Square 
Change F Change df1 df2 Sig. F Change 
,975a ,951 ,942 ,762 ,951 105,826 2 11 ,000 
Model 
1 
Predictors: (Constant), a. PRICE, QUALITY 
ANOVAb 
Sum of 
Squares df Mean Square F Sig. 
122,831 2 61,415 105,826 ,000a 
6,384 11 ,580 
129,214 13 
Regression 
Residual 
Total 
Model 
1 
a. Predictors: (Constant), PRICE, QUALITY 
b. Dependent Variable: PREFEREN 
Coefficientsa 
Unstandardized 
Coefficients 
B Std. Error 
Standardized 
Coefficients 
Beta 
t Sig. 
,535 ,471 1,136 ,280 
,976 ,097 ,798 10,096 ,000 
,251 ,071 ,278 3,522 ,005 
(Constant) 
QUALITY 
PRICE 
Model 
1 
a. Dependent Variable: PREFEREN 
10/13/14 15

More Related Content

PPTX
Multiple Linear Regression
PPTX
Statistical inference
PPTX
Correlation analysis
PPT
Multiple regression presentation
PPTX
Regression vs correlation and causation
PPTX
Binomial distribution
PPT
Simple Linier Regression
Multiple Linear Regression
Statistical inference
Correlation analysis
Multiple regression presentation
Regression vs correlation and causation
Binomial distribution
Simple Linier Regression

What's hot (20)

PPTX
Statistics-Correlation and Regression Analysis
PDF
Simple linear regression
PPTX
Goodness of fit (ppt)
PPTX
Type i and type ii errors
PPTX
Simple Linear Regression: Step-By-Step
PPTX
Point estimation
PPT
DIstinguish between Parametric vs nonparametric test
PPTX
Logistic regression
PPTX
Binomial probability distributions ppt
PPTX
Goodness Of Fit Test
PPT
Skewness & Kurtosis
PDF
Probability Distributions
PPTX
Regression analysis
PPT
Linear regression
PDF
Multiple Correlation - Thiyagu
PDF
Multiple regression
PPTX
Spearman’s rank correlation
PPTX
Factor analysis
PDF
Regression Analysis
PPTX
Karl pearson's coefficient of correlation (1)
Statistics-Correlation and Regression Analysis
Simple linear regression
Goodness of fit (ppt)
Type i and type ii errors
Simple Linear Regression: Step-By-Step
Point estimation
DIstinguish between Parametric vs nonparametric test
Logistic regression
Binomial probability distributions ppt
Goodness Of Fit Test
Skewness & Kurtosis
Probability Distributions
Regression analysis
Linear regression
Multiple Correlation - Thiyagu
Multiple regression
Spearman’s rank correlation
Factor analysis
Regression Analysis
Karl pearson's coefficient of correlation (1)
Ad

Viewers also liked (8)

PPT
Correlation and Regression
PPTX
Correlation and regression
PPTX
Correlation & Regression
PPTX
Correlation and regression
PPT
Correlation
PDF
Correlation and Simple Regression
PPT
Correlation analysis ppt
PPS
Correlation and regression
Correlation and Regression
Correlation and regression
Correlation & Regression
Correlation and regression
Correlation
Correlation and Simple Regression
Correlation analysis ppt
Correlation and regression
Ad

Similar to Correlation and regression (20)

PPT
Chap013.ppt
PPTX
manecohuhuhuhubasicEstimation-1.pptx
PPTX
Correlation and Regression Analysis.pptx
PDF
PPT
Demand Estimation
PPTX
1094_9961_492123.pptx b jkjkjmkm ,n,,mnnmn,mmnmn,mm,
PPTX
SSP PRESENTATION COMPLETE ( ADVANCE ) .pptx
PPT
Chap013.ppt
PPTX
Regression
DOCX
Chapter 15Multiple Regression and Model BuildingCo
PDF
Applied statistics lecture_6
PPTX
Lesson 3 Exploring the Regression Analysis.pptx
PPT
Multiple Regression.ppt
PPTX
Regression-SIMPLE LINEAR (1).psssssssssptx
PPT
Chapter 10
PPT
Chapter 10
PDF
Assessing relative importance using rsp scoring to generate
PDF
Assessing Relative Importance using RSP Scoring to Generate VIF
PPTX
Unit-III Correlation and Regression.pptx
PPT
Regression Analysis.ppt
Chap013.ppt
manecohuhuhuhubasicEstimation-1.pptx
Correlation and Regression Analysis.pptx
Demand Estimation
1094_9961_492123.pptx b jkjkjmkm ,n,,mnnmn,mmnmn,mm,
SSP PRESENTATION COMPLETE ( ADVANCE ) .pptx
Chap013.ppt
Regression
Chapter 15Multiple Regression and Model BuildingCo
Applied statistics lecture_6
Lesson 3 Exploring the Regression Analysis.pptx
Multiple Regression.ppt
Regression-SIMPLE LINEAR (1).psssssssssptx
Chapter 10
Chapter 10
Assessing relative importance using rsp scoring to generate
Assessing Relative Importance using RSP Scoring to Generate VIF
Unit-III Correlation and Regression.pptx
Regression Analysis.ppt

Recently uploaded (20)

PPTX
Computer network topology notes for revision
PPTX
Supervised vs unsupervised machine learning algorithms
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PDF
annual-report-2024-2025 original latest.
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
Introduction to Knowledge Engineering Part 1
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PDF
Foundation of Data Science unit number two notes
PPTX
Introduction to machine learning and Linear Models
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PPT
ISS -ESG Data flows What is ESG and HowHow
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Computer network topology notes for revision
Supervised vs unsupervised machine learning algorithms
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Acceptance and paychological effects of mandatory extra coach I classes.pptx
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
annual-report-2024-2025 original latest.
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
IBA_Chapter_11_Slides_Final_Accessible.pptx
oil_refinery_comprehensive_20250804084928 (1).pptx
Introduction to Knowledge Engineering Part 1
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
Foundation of Data Science unit number two notes
Introduction to machine learning and Linear Models
Business Ppt On Nestle.pptx huunnnhhgfvu
ISS -ESG Data flows What is ESG and HowHow
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx

Correlation and regression

  • 2. Product Moment Correlation • The product moment correlation, r, summarizes the strength of association between two metric (interval or ratio scaled) variables, say X and Y. • It is an index used to determine whether a linear or straight-line relationship exists between X and Y. • As it was originally proposed by Karl Pearson, it is also known as the Pearson correlation coefficient. It is also referred to as simple correlation, bivariate correlation, or merely the correlation coefficient. • A correlation is a single number that describes the degree of relationship between two variables. (http:// www.socialresearchmethods.net/kb/statcorr.php ) 10/13/14 2
  • 3. SPSS • Is there any correlation/association between attitude toward the city and duration of residence? H0 : There is no association/ correlation between duration of residence and attitude toward the city H1 : There is an association/ correlation between between duration of residence and attitude toward the city 10/13/14 3
  • 4. Tab le 17.1 Respondent No Attitude Toward the City Duration of Residence Importance Attached to Weather 1 6 10 3 2 9 12 11 3 8 12 4 4 3 4 1 5 10 12 11 6 4 6 1 7 5 8 7 8 2 2 4 9 11 18 8 10 9 9 10 11 10 17 8 12 2 2 5 10/13/14 4
  • 5. Descriptive Statistics Mean Std. Deviation N 6,58 3,32 12 9,33 5,26 12 Attitude toward the city Duration of residence Correlations 1,000 ,936** 10/13/14 5 , ,000 120,917 179,667 10,992 16,333 12 12 ,936** 1,000 ,000 , 179,667 304,667 16,333 27,697 12 12 Pearson Correlation Sig. (2-tailed) Sum of Squares and Cross-products Covariance N Pearson Correlation Sig. (2-tailed) Sum of Squares and Cross-products Covariance N Attitude toward the city Duration of residence Attitude toward the city Duration of residence Correlation is significant at the 0.01 **. level (2-tailed).
  • 6. Correlation Coefficient n S (Xi – X) (Yi – Y) i=1 r = = COVxy / SxSy n n S (Xi – X)2 S (Yi – Y)2 i=1 i=1 10/13/14 6
  • 7. Y X1 X2 r = (179.6668) / [(304/6669)(120.9168) = 0.9361 The respondent’s duration of residence in the city is strongly associated with attitude toward the city 10/13/14 7
  • 8. city the toward Attitude N = 2 1 1 1 1 1 3 1 1 2 4 6 8 9 10 12 17 18 10/13/14 8 Duration of residence 12 10 8 6 4 2 0
  • 9. • Analyze > correlate > bivariate – Options : - means and standar deviations - cross product deviations and covariances • Graphs > box plot > simple > define >ok 10/13/14 9
  • 10. Regression Analysis • Regression analysis is a statistical procedure for analyzing associative relationship between a metric dependent variable and one or more metric independent variables. • Independent variable affect dependent variable 10/13/14 10
  • 11. Regression Analysis can be used in the following ways: 1. Determine whether the independent variables explain a significant variation in the dependent variable: whether a relationship exists. 2. Determine how much of the variation in the dependent variable can be explained by the independent variables: strength of the relationship. 3. Determine the structure or form of the relationship: the mathematical equation relating the independent and dependent variables. 4. Predict the values of the dependent variables. 5. Control for other independent variables when evaluating the contributions of specific variable or set of variables. 10/13/14 11
  • 12. Bivariate Regression • Bivariate regression is a procedure for deriving a mathematical relationship, in the form of an equation, between a single metric dependent variable and a single metric independent variable. • Bivariate regression example in marketing: – Can variation in sales be explained in terms of variation in advertising expenditures? What is the structure and form of this relationship, can it be modeled mathematically by an equation describing a straight line. 10/13/14 12
  • 13. Multiple Regression The general form of the multiple regression model is as follows: Y = b0 + b1X1 + b2X2 + b3X3+ . . . + bkXk + e which is estimated by the following equation: Y = a + b1X1 + b2X2 + b3X3+ . . . + bXkk As before, the coefficient a represents the intercept, but the b's are now the partial regression coefficients. 10/13/14 13
  • 14. Statistics Associated with Multiple Regression • Adjusted R2. R2, coefficient of multiple determination, is adjusted for the number of independent variables and the sample size to account for the diminishing returns. After the first few variables, the additional independent variables do not make much contribution. • Coefficient of multiple determination. The strength of association in multiple regression is measured by the square of the multiple correlation coefficient, R2, which is also called the coefficient of multiple determination. • F test. The F test is used to test the null hypothesis that the coefficient of multiple determination in the population, R2 pop, is zero. This is equivalent to testing the null hypothesis. The test statistic has an F distribution with k and (n - k - 1) degrees of freedom. 10/13/14 14
  • 15. Model Summary R R Square Adjusted R Square Std. Error of the Estimate Change Statistics R Square Change F Change df1 df2 Sig. F Change ,975a ,951 ,942 ,762 ,951 105,826 2 11 ,000 Model 1 Predictors: (Constant), a. PRICE, QUALITY ANOVAb Sum of Squares df Mean Square F Sig. 122,831 2 61,415 105,826 ,000a 6,384 11 ,580 129,214 13 Regression Residual Total Model 1 a. Predictors: (Constant), PRICE, QUALITY b. Dependent Variable: PREFEREN Coefficientsa Unstandardized Coefficients B Std. Error Standardized Coefficients Beta t Sig. ,535 ,471 1,136 ,280 ,976 ,097 ,798 10,096 ,000 ,251 ,071 ,278 3,522 ,005 (Constant) QUALITY PRICE Model 1 a. Dependent Variable: PREFEREN 10/13/14 15