SlideShare a Scribd company logo
Correlation and Regrretion
Submitted to :
Prof. Tushar Mangukiya
Prof. Sneha Patel
Submitted by :
Desani Parth - 140763106002
 Correlation
 Bivariate Distribution
 Scatter Diagram
 Coefficient of
Correlation
 Method of Calculation
 Example
 Regression
 Regression Line
 Regression
Coefficients
 Rank Correlation
Coefficients
 Example
Correlation and Regrretion
 when a distribution has two variables then it is called
bivariate.
 Example: we consider the raw data collected from census
considering only two variables, age and sex (26, M), (18,
F), (48, F),(32, M), ..
 The first data shows age and second data shows sex of
persons in each bracket. Thus it is the case of bivariate
distribution. The bivariate raw data is usually presented in
the form of a table called “the bivariate frequency table”.
 The simplest way to represent bivariate data in a
diagram is called scatter diagram.
 In this diagram plot the corresponding pairs of
numerical values(x , y) of the two series in the xy-
plane.
 It is known as Dot Diagram. If the line goes upward
& this upward movement is from left to right it
will show positive correlation.
 If the line moves downward & this movement is
from left to right, it will show negative correlation.
 The degree of slope will give the degree of
correlation.
 If the points are scattered widely , it will show
absence of correlation.
0
1
2
3
4
5
6
7
8
9
0 0.1 0.2 0.3 0.4 0.5 0.6
Y-Values
y values
Negative
0
2
4
6
8
10
12
0 0.1 0.2 0.3 0.4 0.5 0.6
Y-Values
Column1
Positive
0
0.2
0.4
0.6
0.8
1
1.2
1.4
1.6
1.8
2
0 0.2 0.4 0.6 0.8 1 1.2
Y-Values
y-value
y-value2
Absence
As no numerical measure is possible with scatter diagram,
Karl Pearson derived a mathematical formula to measure the degree
of correlation between two variables, known as Karl Pearson's
coefficient of correlation. It is also known as “ product moment
method.”
For a bivariate distribution (x, y) the coefficient of correlation
is denoted by and is defined asrxy
 
 yx
xy
yx
r
,cov

 yx
n
XY
 

22
YX
XY
a) Direct method:
     


 



  
  
 

2222
22
yyxx
yxyx
r
r
iiii
iiii
xy
xy
NN
n
YX
XY
A sample of 6 children was selected, data about their age in years and
weight in kilograms was recorded as shown in the following table . It
is required to find the correlation between age and weight.
Weight (Kg)Age (years)serial No
1271
862
1283
1054
1165
1396
Y2X2xy
Weight
(Kg)
(y)
Age
(years)
(x)
Serial
n.
14449841271
643648862
14464961283
10025501054
12136661165
169811171396
∑y2=
742
∑x2=
291
∑xy= 461
∑y=
66
∑x=
41
Total
r = 0.759……..........strong direct correlation
     


 


  
  
2222
yyxx
yxyx
r
iiii
iiii
xy
NN
n
 
        22xy
667426.412916
66414616
r 


b) Step deviation method:
The above method is simple, but becomes tedious if the mean
of the variable is not integers. This can be overcome by the use of
assumed mean. Let A and B be the assumed mean of the variables x
and y respectively.
Let,
 
 B
A
yd
xd
iy
ix


       
  



2222
dddd
dddd
r
yyxx
yxyx
xy
NN
N
E. R.I. R.Student
1011051
1031042
1001023
981014
951005
96996
104987
92968
97939
949210
Find the coefficients of correlation between Intelligence Ratio (I.R.) and
Emotional Ration (E.R.) from the following data :
Let X be intelligence ration
And Y be emotional ratio.
We construct the following table.
Taking
A = 100
B = 100
dx = X – A = X – 100
dy = Y – B = Y - 100
X Y dx = X - 100 dx² dy = Y - 100 dy² dxdy
105 101 5 25 1 1 5
104 103 4 16 3 9 12
102 100 2 4 0 0 0
101 98 1 1 -2 4 -2
100 95 0 0 -5 25 0
99 96 -1 1 -4 16 4
98 104 -2 4 4 16 -8
96 92 -4 16 -8 64 32
93 97 -7 49 -3 9 21
92 94 -8 64 -6 36 48
- - ∑dx = -10
∑dx²
= 180
∑dy = -20
∑dy² =
180
∑dxdy =
112
 The correlation coefficient between I. R. and
E. R. is given by,
     


 


  
  
2222
yyxx
yxyx
r
iiii
iiii
xy
NN
n
   22
)20()180(10)10()180(10
)20)(10()112(10


rxy
   4001800)100(1800
2001120


rxy
   14001700
920
rxy
5963.0
83.1542
920
42.37*23.41
920
rxy
c) For grouped data:
  














 


  
  
2222
dfdfdfdf
dfdfddf
r
ii
N
ii
N
ii
N
yyxx
yixyx
xy
Correlation and Regrretion
Let be the given observations and line to
be fitted
……(1)
Using the method of least squares, we can estimate the values of a
and b. That is by using from (1)
But,
    yxyxyx nn
,,
2211

bxay 
  
 


bxaxy
xbany
      2
  xxbxxayyxx
  0 xx
The line of best fit becomes.
 xxyy
x
y
xyr 




y
x
xyr
  
  222
 x
n
XY
X
XY
xx
yyxx
b




 



i. When
or
ii. When , where A and B are assumed values.
yYxX yx ii
 ,



  22
,
X
XY
Y
XY
bb xyxy
    
  
 
  





 2222
,
xxn
yxxyn
yyn
yxxyn
bb yxxy
BA ydxd iyix
 ,
    
  
 
  





 22
2
,
2
dd
dddd
b
dd
dddd
b
xx
yxyx
yx
yy
yxyx
xy
n
n
n
n
iii. For frequency distribution :
    
  
 
  





 2222
,
dd
dddd
b
dd
dddd
b
xx
yxyx
yx
yy
yxyx
xy
ffn
ffn
ffn
ffN
 It is a non-parametric measure of correlation.
 This procedure makes use of the two sets of ranks that may be
assigned to the sample values of x and Y.
 Spearman Rank correlation coefficient could be computed in the
following cases:
 Both variables are quantitative.
 Both variables are qualitative ordinal.
 One variable is quantitative and the other is qualitative ordinal.
1)n(n
(di)6
1r 2
2
s



 In a study of the relationship between level education and income the
following data was obtained. Find the relationship between them and
comment.
Income
(Y)
level education
(X)
sample
numbers
25Preparatory.A
10Primary.B
8University.C
10secondaryD
15secondaryE
50illiterateF
60University.G
di2diRank
Y
Rank
X(Y)(X)
423525PreparatoryA
0.250.55.5610Primary.B
30.25-5.571.58University.C
4-25.53.510secondaryD
0.25-0.543.515secondaryE
2552750illiterateF
0.250.511.560university.G
∑ di2=64
1.0
)48(7
646
1 

sr
Correlation and Regrretion

More Related Content

PPTX
Presentation on Regression Analysis
PPTX
PEARSON PRODUCT MOMENT CORRELATION COEFFICIENT
PDF
Linear Regression Ordinary Least Squares Distributed Calculation Example
PPTX
Non Linear Equation
PPT
Chapter5
PPTX
Method of least square
PPTX
Principle of Least Square, its Properties, Regression line and standard error...
PPTX
METHOD OF LEAST SQURE
Presentation on Regression Analysis
PEARSON PRODUCT MOMENT CORRELATION COEFFICIENT
Linear Regression Ordinary Least Squares Distributed Calculation Example
Non Linear Equation
Chapter5
Method of least square
Principle of Least Square, its Properties, Regression line and standard error...
METHOD OF LEAST SQURE

What's hot (20)

PDF
Chap5 correlation
PPT
Ch07 2
DOCX
5 regression
PPT
Chapter7
PPTX
REGRESSION ANALYSIS
PPT
1563 matrix algebra
PDF
Econometrics assignment help
PPT
Direct variation-ppt
PPTX
Relaxation method
PDF
Applied numerical methods lec8
PPTX
Least Squares
PPTX
Curve fitting
PPT
Matrix algebra
PDF
Basic concepts of curve fittings
DOC
Ordinary least squares linear regression
PDF
metod linearne regresije
PDF
On the Seidel’s Method, a Stronger Contraction Fixed Point Iterative Method o...
PPTX
PPT
Direct Variation
PPTX
Grade 9 homework questions on 2.4 and 2.5
Chap5 correlation
Ch07 2
5 regression
Chapter7
REGRESSION ANALYSIS
1563 matrix algebra
Econometrics assignment help
Direct variation-ppt
Relaxation method
Applied numerical methods lec8
Least Squares
Curve fitting
Matrix algebra
Basic concepts of curve fittings
Ordinary least squares linear regression
metod linearne regresije
On the Seidel’s Method, a Stronger Contraction Fixed Point Iterative Method o...
Direct Variation
Grade 9 homework questions on 2.4 and 2.5
Ad

Similar to Correlation and Regrretion (20)

PPTX
Regression and correlation in statistics
PPT
Correlation by Neeraj Bhandari ( Surkhet.Nepal )
PPTX
Regression
PPTX
Multivariate Methods Assignment Help
PPT
Chapter13
PPTX
4_Correlation and and Regression (1).pptx
PDF
application of correlation and recurssion
PDF
Nonparametric approach to multiple regression
DOCX
Statistics
PPT
Correlation and Regression
PDF
Curve Fitting in Numerical Methods Regression
PPTX
Simple Linear Regression
DOCX
Unit 5 Correlation
PPTX
Regression and corelation (Biostatistics)
PPT
"Understanding Correlation and Regression: Key Concepts for Data Analysis"
PPSX
Simple linear regression
PPT
Corr-and-Regress (1).ppt
PPT
Corr-and-Regress.ppt
PPT
Cr-and-Regress.ppt
PPT
Correlation & Regression for Statistics Social Science
Regression and correlation in statistics
Correlation by Neeraj Bhandari ( Surkhet.Nepal )
Regression
Multivariate Methods Assignment Help
Chapter13
4_Correlation and and Regression (1).pptx
application of correlation and recurssion
Nonparametric approach to multiple regression
Statistics
Correlation and Regression
Curve Fitting in Numerical Methods Regression
Simple Linear Regression
Unit 5 Correlation
Regression and corelation (Biostatistics)
"Understanding Correlation and Regression: Key Concepts for Data Analysis"
Simple linear regression
Corr-and-Regress (1).ppt
Corr-and-Regress.ppt
Cr-and-Regress.ppt
Correlation & Regression for Statistics Social Science
Ad

More from Parth Desani (20)

PPTX
Pile foundation
PPT
Self purification
PPTX
Role of various agencies in wrm
PPTX
Recycling of waste water & Solid Waste
PPT
Air Traffic Control
PPTX
Open Channel Flow
PPTX
Noise pollution
PPT
Non Destructive Test
PPTX
Natural phenomenon
PPTX
Management information system
PPTX
Flood Prevention by Gabion Basket Wall
PPTX
Gabion Basket Wall for Flood Prevention
PPTX
Earthquake engineering
PPTX
Earthquake engineering presentation
PPTX
Drilling and blasting
PPTX
Drainage system
PPTX
Drag line excavator
PPTX
Development plan (Basic Concept)
PPTX
Deep excavation
PPTX
CPM - Critical Path Method
Pile foundation
Self purification
Role of various agencies in wrm
Recycling of waste water & Solid Waste
Air Traffic Control
Open Channel Flow
Noise pollution
Non Destructive Test
Natural phenomenon
Management information system
Flood Prevention by Gabion Basket Wall
Gabion Basket Wall for Flood Prevention
Earthquake engineering
Earthquake engineering presentation
Drilling and blasting
Drainage system
Drag line excavator
Development plan (Basic Concept)
Deep excavation
CPM - Critical Path Method

Recently uploaded (20)

PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PDF
Mega Projects Data Mega Projects Data
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
IB Computer Science - Internal Assessment.pptx
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PPTX
SAP 2 completion done . PRESENTATION.pptx
PPTX
Introduction to machine learning and Linear Models
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PPTX
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PDF
annual-report-2024-2025 original latest.
PDF
Clinical guidelines as a resource for EBP(1).pdf
climate analysis of Dhaka ,Banglades.pptx
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
Data_Analytics_and_PowerBI_Presentation.pptx
STUDY DESIGN details- Lt Col Maksud (21).pptx
Mega Projects Data Mega Projects Data
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
oil_refinery_comprehensive_20250804084928 (1).pptx
IB Computer Science - Internal Assessment.pptx
Galatica Smart Energy Infrastructure Startup Pitch Deck
Miokarditis (Inflamasi pada Otot Jantung)
SAP 2 completion done . PRESENTATION.pptx
Introduction to machine learning and Linear Models
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
Business Ppt On Nestle.pptx huunnnhhgfvu
annual-report-2024-2025 original latest.
Clinical guidelines as a resource for EBP(1).pdf

Correlation and Regrretion

  • 2. Submitted to : Prof. Tushar Mangukiya Prof. Sneha Patel Submitted by : Desani Parth - 140763106002
  • 3.  Correlation  Bivariate Distribution  Scatter Diagram  Coefficient of Correlation  Method of Calculation  Example  Regression  Regression Line  Regression Coefficients  Rank Correlation Coefficients  Example
  • 5.  when a distribution has two variables then it is called bivariate.  Example: we consider the raw data collected from census considering only two variables, age and sex (26, M), (18, F), (48, F),(32, M), ..  The first data shows age and second data shows sex of persons in each bracket. Thus it is the case of bivariate distribution. The bivariate raw data is usually presented in the form of a table called “the bivariate frequency table”.
  • 6.  The simplest way to represent bivariate data in a diagram is called scatter diagram.  In this diagram plot the corresponding pairs of numerical values(x , y) of the two series in the xy- plane.  It is known as Dot Diagram. If the line goes upward & this upward movement is from left to right it will show positive correlation.  If the line moves downward & this movement is from left to right, it will show negative correlation.  The degree of slope will give the degree of correlation.  If the points are scattered widely , it will show absence of correlation.
  • 7. 0 1 2 3 4 5 6 7 8 9 0 0.1 0.2 0.3 0.4 0.5 0.6 Y-Values y values Negative
  • 8. 0 2 4 6 8 10 12 0 0.1 0.2 0.3 0.4 0.5 0.6 Y-Values Column1 Positive
  • 9. 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 0 0.2 0.4 0.6 0.8 1 1.2 Y-Values y-value y-value2 Absence
  • 10. As no numerical measure is possible with scatter diagram, Karl Pearson derived a mathematical formula to measure the degree of correlation between two variables, known as Karl Pearson's coefficient of correlation. It is also known as “ product moment method.” For a bivariate distribution (x, y) the coefficient of correlation is denoted by and is defined asrxy    yx xy yx r ,cov   yx n XY    22 YX XY
  • 11. a) Direct method:                       2222 22 yyxx yxyx r r iiii iiii xy xy NN n YX XY
  • 12. A sample of 6 children was selected, data about their age in years and weight in kilograms was recorded as shown in the following table . It is required to find the correlation between age and weight. Weight (Kg)Age (years)serial No 1271 862 1283 1054 1165 1396
  • 14. r = 0.759……..........strong direct correlation                   2222 yyxx yxyx r iiii iiii xy NN n           22xy 667426.412916 66414616 r   
  • 15. b) Step deviation method: The above method is simple, but becomes tedious if the mean of the variable is not integers. This can be overcome by the use of assumed mean. Let A and B be the assumed mean of the variables x and y respectively. Let,    B A yd xd iy ix                 2222 dddd dddd r yyxx yxyx xy NN N
  • 16. E. R.I. R.Student 1011051 1031042 1001023 981014 951005 96996 104987 92968 97939 949210 Find the coefficients of correlation between Intelligence Ratio (I.R.) and Emotional Ration (E.R.) from the following data :
  • 17. Let X be intelligence ration And Y be emotional ratio. We construct the following table. Taking A = 100 B = 100 dx = X – A = X – 100 dy = Y – B = Y - 100
  • 18. X Y dx = X - 100 dx² dy = Y - 100 dy² dxdy 105 101 5 25 1 1 5 104 103 4 16 3 9 12 102 100 2 4 0 0 0 101 98 1 1 -2 4 -2 100 95 0 0 -5 25 0 99 96 -1 1 -4 16 4 98 104 -2 4 4 16 -8 96 92 -4 16 -8 64 32 93 97 -7 49 -3 9 21 92 94 -8 64 -6 36 48 - - ∑dx = -10 ∑dx² = 180 ∑dy = -20 ∑dy² = 180 ∑dxdy = 112
  • 19.  The correlation coefficient between I. R. and E. R. is given by,                   2222 yyxx yxyx r iiii iiii xy NN n    22 )20()180(10)10()180(10 )20)(10()112(10   rxy    4001800)100(1800 2001120   rxy    14001700 920 rxy 5963.0 83.1542 920 42.37*23.41 920 rxy
  • 20. c) For grouped data:                            2222 dfdfdfdf dfdfddf r ii N ii N ii N yyxx yixyx xy
  • 22. Let be the given observations and line to be fitted ……(1) Using the method of least squares, we can estimate the values of a and b. That is by using from (1) But,     yxyxyx nn ,, 2211  bxay         bxaxy xbany       2   xxbxxayyxx   0 xx
  • 23. The line of best fit becomes.  xxyy x y xyr      y x xyr      222  x n XY X XY xx yyxx b         
  • 24. i. When or ii. When , where A and B are assumed values. yYxX yx ii  ,      22 , X XY Y XY bb xyxy                    2222 , xxn yxxyn yyn yxxyn bb yxxy BA ydxd iyix  ,                    22 2 , 2 dd dddd b dd dddd b xx yxyx yx yy yxyx xy n n n n
  • 25. iii. For frequency distribution :                    2222 , dd dddd b dd dddd b xx yxyx yx yy yxyx xy ffn ffn ffn ffN
  • 26.  It is a non-parametric measure of correlation.  This procedure makes use of the two sets of ranks that may be assigned to the sample values of x and Y.  Spearman Rank correlation coefficient could be computed in the following cases:  Both variables are quantitative.  Both variables are qualitative ordinal.  One variable is quantitative and the other is qualitative ordinal. 1)n(n (di)6 1r 2 2 s   
  • 27.  In a study of the relationship between level education and income the following data was obtained. Find the relationship between them and comment. Income (Y) level education (X) sample numbers 25Preparatory.A 10Primary.B 8University.C 10secondaryD 15secondaryE 50illiterateF 60University.G