SlideShare a Scribd company logo
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 1
Simple Linear
Regression
Chapter 12
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 2
Correlation vs. Regression
 A scatter plot can be used to show the
relationship between two variables
 Correlation analysis is used to measure the
strength of the association (linear relationship)
between two variables
 Correlation is only concerned with strength of the
relationship
 No causal effect is implied with correlation
DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 3
12.1 Regression Models
Y
X
Y
X
Y
Y
X
X
Linear relationships Curvilinear relationships
DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 4
Types of Relationships
Y
X
Y
X
Y
Y
X
X
Strong relationships Weak relationships
(continued)
DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 5
Types of Relationships
Y
X
Y
X
No relationship
DCOVA
(continued)
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 6
12.2 Introduction to
Regression Analysis
 Regression analysis is used to:
 Predict the value of a dependent variable based on
the value of at least one independent variable
 Explain the impact of changes in an independent
variable on the dependent variable
 Dependent variable: the variable we wish to
predict or explain
 Independent variable: the variable used to
predict or explain the
dependent variable
DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 7
Simple Linear Regression Model
 Only one independent variable, X
 Relationship between X and Y is described by
a linear function
 Changes in Y are assumed to be related to
changes in X
DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 8
i
i
1
0
i ε
X
β
β
Y 


Linear component
Simple Linear Regression Model
Population
Y intercept
Population
Slope
Coefficient
Random
Error
term
Dependent
Variable
Independent
Variable
Random Error
component
DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 9
Random Error
for this Xi value
Y
X
Observed Value
of Y for Xi
Predicted Value
of Y for Xi
i
i
1
0
i ε
X
β
β
Y 


Xi
Slope = β1
Intercept = β0
εi
Simple Linear Regression
Model DCOVA
(continued)
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 10
Simple Linear Regression Equation
(Prediction Line)
i
1
0
i X
b
b
Ŷ 

The simple linear regression equation provides an
estimate of the population regression line
Estimate of
the regression
intercept
Estimate of the
regression slope
Estimated
(or predicted)
Y value for
observation i
Value of X for
observation i
DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 11
The Least Squares Method
b0 and b1 are obtained by finding the values that
minimize the sum of the squared differences
between Y and Y :
2
i
1
0
i
2
i
i ))
X
b
(b
(Y
min
)
Ŷ
(Y
min 


 

DCOVA

Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 12
Finding the Least Squares
Equation
 The coefficients b0 and b1, and other
regression results in this chapter, will be
found using Excel or Minitab
Formulas are shown in the text for those
who are interested
DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 13
Interpretation of the
Slope and the Intercept
 b0 is the estimated mean value of Y when
the value of X is zero
 b1 is the estimated change in the mean
value of Y as a result of a one-unit increase
in X
DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 14
Simple Linear Regression
Example
 A real estate agent wishes to examine the
relationship between the selling price of a home
and its size (measured in square feet)
 A random sample of 10 houses is selected
 Dependent variable (Y) = house price in $1000s
 Independent variable (X) = square feet
DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 15
Simple Linear Regression
Example: Data
House Price in $1000s
(Y)
Square Feet
(X)
245 1400
312 1600
279 1700
308 1875
199 1100
219 1550
405 2350
324 2450
319 1425
255 1700
DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 16
0
50
100
150
200
250
300
350
400
450
0 500 1000 1500 2000 2500 3000
House
Price
($1000s)
Square Feet
Simple Linear Regression Example:
Scatter Plot
House price model: Scatter Plot
DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 17
Simple Linear Regression Example:
Using Excel Data Analysis Function
1. Choose Data 2. Choose Data Analysis
3. Choose Regression
DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 18
Simple Linear Regression Example:
Using Excel Data Analysis Function
Enter Y range and X range and desired options
DCOVA
(continued)
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 19
Simple Linear Regression
Example: Using PHStat
Add-Ins: PHStat: Regression: Simple Linear Regression
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 20
Simple Linear Regression Example:
Excel Output
Regression Statistics
Multiple R 0.76211
R Square 0.58082
Adjusted R Square 0.52842
Standard Error 41.33032
Observations 10
ANOVA
df SS MS F Significance F
Regression 1 18934.9348 18934.9348 11.0848 0.01039
Residual 8 13665.5652 1708.1957
Total 9 32600.5000
Coefficients Standard Error t Stat P-value Lower 95% Upper 95%
Intercept 98.24833 58.03348 1.69296 0.12892 -35.57720 232.07386
Square Feet 0.10977 0.03297 3.32938 0.01039 0.03374 0.18580
The regression equation is:
feet)
(square
0.10977
98.24833
price
house 

DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 21
Simple Linear Regression Example:
Minitab Output
The regression equation is
Price = 98.2 + 0.110 Square Feet
Predictor Coef SE Coef T P
Constant 98.25 58.03 1.69 0.129
Square Feet 0.10977 0.03297 3.33 0.010
S = 41.3303 R-Sq = 58.1% R-Sq(adj) = 52.8%
Analysis of Variance
Source DF SS MS F P
Regression 1 18935 18935 11.08 0.010
Residual Error 8 13666 1708
Total 9 32600
The regression
equation is:
house price = 98.24833 +
0.10977 (square feet)
DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 22
0
50
100
150
200
250
300
350
400
450
0 500 1000 1500 2000 2500 3000
Square Feet
House
Price
($1000s)
Simple Linear Regression Example:
Graphical Representation
House price model: Scatter Plot and Prediction Line
feet)
(square
0.10977
98.24833
price
house 

Slope
= 0.10977
Intercept
= 98.248
DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 23
Simple Linear Regression
Example: Interpretation of bo
 b0 is the estimated mean value of Y when the
value of X is zero (if X = 0 is in the range of
observed X values)
 Because a house cannot have a square footage
of 0, b0 has no practical application
feet)
(square
0.10977
98.24833
price
house 

DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 24
Simple Linear Regression
Example: Interpreting b1
 b1 estimates the change in the mean value
of Y as a result of a one-unit increase in X
 Here, b1 = 0.10977 tells us that the mean value of a
house increases by .10977($1000) = $109.77, on
average, for each additional one square foot of size
feet)
(square
0.10977
98.24833
price
house 

DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 25
317.85
0)
0.1098(200
98.25
(sq.ft.)
0.1098
98.25
price
house





Predict the price for a house
with 2000 square feet:
The predicted price for a house with 2000
square feet is 317.85($1,000s) = $317,850
Simple Linear Regression
Example: Making Predictions
DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 26
0
50
100
150
200
250
300
350
400
450
0 500 1000 1500 2000 2500 3000
Square Feet
House
Price
($1000s)
Simple Linear Regression
Example: Making Predictions
 When using a regression model for prediction,
only predict within the relevant range of data
Relevant range for
interpolation
Do not try to
extrapolate
beyond the range
of observed X’s
DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 27
12.4 Assumptions of Regression
L.I.N.E
 Linearity
 The relationship between X and Y is linear
 Independence of Errors
 Error values are statistically independent
 Particularly important when data are collected over a period
of time
 Normality of Error
 Error values are normally distributed for any given value of X
 Equal Variance (also called homoscedasticity)
 The probability distribution of the errors has constant
variance
DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 28
12.7 Inferences About the Slope
 The standard error of the regression slope
coefficient (b1) is estimated by
 


2
i
YX
YX
b
)
X
(X
S
SSX
S
S 1
where:
= Estimate of the standard error of the slope
= Standard error of the estimate
1
b
S
2
n
SSE
SYX


DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 29
Inferences About the Slope:
t Test
 t test for a population slope
 Is there a linear relationship between X and Y?
 Null and alternative hypotheses

H0: β1 = 0 (no linear relationship)

H1: β1 ≠ 0 (linear relationship does exist)
 Test statistic
1
b
1
1
STAT
S
β
b
t


2
n
d.f. 

where:
b1 = regression slope
coefficient
β1 = hypothesized slope
Sb1 = standard
error of the slope
DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 30
Inferences About the Slope:
t Test Example
House Price
in $1000s
(y)
Square Feet
(x)
245 1400
312 1600
279 1700
308 1875
199 1100
219 1550
405 2350
324 2450
319 1425
255 1700
Estimated Regression Equation:
The slope of this model is 0.1098
Is there a relationship between the
square footage of the house and its
sales price?
DCOVA
house price = 98.25 + 0.1098 (sq. ft.)
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 31
Inferences About the Slope:
t Test Example
1
b
S
H0: β1 = 0
H1: β1 ≠ 0
From Excel output:
Coefficients Standard Error t Stat P-value
Intercept 98.24833 58.03348 1.69296 0.12892
Square Feet 0.10977 0.03297 3.32938 0.01039
1
b
S
b1
32938
3
03297
0
0
10977
0
S
β
b
t
1
b
1
1
STAT
.
.
.





Predictor Coef SE Coef T P
Constant 98.25 58.03 1.69 0.129
Square Feet 0.10977 0.03297 3.33 0.010
From Minitab output:
b1
DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 32
Inferences About the Slope:
t Test Example
H0: β1 = 0
H1: β1 ≠ 0
Test Statistic: tSTAT = 3.329
There is sufficient evidence
that square footage affects
house price
Decision: Reject H0
Reject H0
Reject H0
/2=.025
-tα/2
Do not reject H0
0
tα/2
/2=.025
-2.3060 2.3060 3.329
d.f. = 10- 2 = 8
DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 33
Inferences About the Slope:
t Test Example
H0: β1 = 0
H1: β1 ≠ 0
From Excel output:
Coefficients Standard Error t Stat P-value
Intercept 98.24833 58.03348 1.69296 0.12892
Square Feet 0.10977 0.03297 3.32938 0.01039
p-value
There is sufficient evidence that
square footage affects house price.
Decision: Reject H0, since p-value < α
Predictor Coef SE Coef T P
Constant 98.25 58.03 1.69 0.129
Square Feet 0.10977 0.03297 3.33 0.010
From Minitab output:
DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 34
Confidence Interval Estimate
for the Slope
Confidence Interval Estimate of the Slope:
Excel Printout for House Prices:
At 95% level of confidence, the confidence interval for
the slope is (0.0337, 0.1858)
1
b
2
/
1 S
b α
t

Coefficients Standard Error t Stat P-value Lower 95% Upper 95%
Intercept 98.24833 58.03348 1.69296 0.12892 -35.57720 232.07386
Square Feet 0.10977 0.03297 3.32938 0.01039 0.03374 0.18580
d.f. = n - 2
DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 35
Since the units of the house price variable is
$1000s, we are 95% confident that the average
impact on sales price is between $33.74 and
$185.80 per square foot of house size
Coefficients Standard Error t Stat P-value Lower 95% Upper 95%
Intercept 98.24833 58.03348 1.69296 0.12892 -35.57720 232.07386
Square Feet 0.10977 0.03297 3.32938 0.01039 0.03374 0.18580
This 95% confidence interval does not include 0.
Conclusion: There is a significant relationship between
house price and square feet at the .05 level of significance
Confidence Interval Estimate
for the Slope
DCOVA
(continued)
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 36
Confidence Interval Estimate for
the Slope from Minitab
DCOVA
Predictor Coef SE Coef T P
Constant 98.25 58.03 1.69 0.129
Square Feet 0.10977 0.03297 3.33 0.010
Minitab does not automatically calculate a confidence
interval for the slope but provides the quantities necessary
to use the confidence interval formula.
1
b
2
/
1 S
b α
t

(continued)
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 37
t Test for a Correlation Coefficient
 Hypotheses
H0: ρ = 0 (no correlation between X and Y)
H1: ρ ≠ 0 (correlation exists)

Test statistic
(with n – 2 degrees of freedom)
2
n
r
1
ρ
-
r
t
2
STAT



0
b
if
r
r
0
b
if
r
r
where
1
2
1
2






DCOVA
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 38
t-test For A Correlation Coefficient
Is there evidence of a linear relationship
between square feet and house price at
the .05 level of significance?
H0: ρ = 0 (No correlation)
H1: ρ ≠ 0 (correlation exists)
 =.05 , df = 10 - 2 = 8
3.329
2
10
.762
1
0
.762
2
n
r
1
ρ
r
t
2
2
STAT 








DCOVA
(continued)
Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 39
t-test For A Correlation Coefficient
Conclusion:
There is
evidence of a
linear association
at the 5% level of
significance
Decision:
Reject H0
Reject H0
Reject H0
/2=.025
-tα/2
Do not reject H0
0
tα/2
/2=.025
-2.3060 2.3060
3.329
d.f. = 10-2 = 8
3.329
2
10
.762
1
0
.762
2
n
r
1
ρ
r
t
2
2
STAT 








DCOVA
(continued)

More Related Content

DOCX
Balagtasan
PPTX
Tempo & music
PPTX
PPT
Pang abay
PPTX
ANGLES-LESSON- ACUTE, RIGHT, AND OBTUSE ANGLES
DOCX
"Pagpapahalaga sa pamilya ng isang anak"
PPTX
Aralin 28 Ang Kababaihan sa Kasalukuyang Asya
PPTX
Pamilyar at Di-Pamilyar na Salita
Balagtasan
Tempo & music
Pang abay
ANGLES-LESSON- ACUTE, RIGHT, AND OBTUSE ANGLES
"Pagpapahalaga sa pamilya ng isang anak"
Aralin 28 Ang Kababaihan sa Kasalukuyang Asya
Pamilyar at Di-Pamilyar na Salita

What's hot (20)

PPTX
Alpabetong filipino
DOC
AP 7 Lesson no. 34-B: Uri ng Pamahalaan sa Indonesia
PPTX
Kabanata 3 pag iimbak
DOCX
Senator 2016
PPTX
Parable of Lazarus and the Rich Man
PPSX
Sayaw Pilipinas
PPTX
QUIZ BEE
PPTX
LESSON 1 READING AND WRITING WHOLE NUMBERS.pptx
PPTX
Science Class grade 9.pptx
PPTX
Epiko (Prinsipe Bantugan)
PPTX
Class IV -States of Matter
PPTX
Aspektong Nagaganap o Imperpektibo
PPTX
PPTX
Anyo ng globalisasyon at pagharap sa hamon
PPT
Adam And Eve
PPTX
PPTX
Kontemporaryong dagli
PPTX
Pag-aalsa ni Lakandula Araling Panlipunan
PPTX
Karahasan sa Paaralan
PPTX
Love for country
Alpabetong filipino
AP 7 Lesson no. 34-B: Uri ng Pamahalaan sa Indonesia
Kabanata 3 pag iimbak
Senator 2016
Parable of Lazarus and the Rich Man
Sayaw Pilipinas
QUIZ BEE
LESSON 1 READING AND WRITING WHOLE NUMBERS.pptx
Science Class grade 9.pptx
Epiko (Prinsipe Bantugan)
Class IV -States of Matter
Aspektong Nagaganap o Imperpektibo
Anyo ng globalisasyon at pagharap sa hamon
Adam And Eve
Kontemporaryong dagli
Pag-aalsa ni Lakandula Araling Panlipunan
Karahasan sa Paaralan
Love for country
Ad

Similar to Simple Linear Regression for business statistics (20)

PPT
Linear Regression statistical analysis s
PPT
Linear Reqression_used in statistics and engineering
PDF
metod linearne regresije
PDF
simple regression-1.pdf
PPT
Newbold_chap12.ppt
PPT
Simple linear regression - regression analysis.ppt
PPTX
Sessions 18 19- Regression- SLR MLR.pptx
PPT
Regression analysis
PDF
regressionanalysis-110723130213-phpapp02.pdf
PDF
Module 3 Course Slides Lesson 1 McGill University
PPTX
Regression analysis in Research Methodology.pptx
PPT
linear Regression, multiple Regression and Annova
PPTX
Chap11 simple regression
PDF
An Overview of Simple Linear Regression
PPTX
Evans_Analytics3e_ppt_08_Accessible.pptx
PDF
1. Regression_V1.pdf
PPTX
Intro to econometrics
PDF
Concept of Regression in Research Methodology.pdf
PPTX
Regression-SIMPLE LINEAR (1).psssssssssptx
PPTX
chap12.pptx
Linear Regression statistical analysis s
Linear Reqression_used in statistics and engineering
metod linearne regresije
simple regression-1.pdf
Newbold_chap12.ppt
Simple linear regression - regression analysis.ppt
Sessions 18 19- Regression- SLR MLR.pptx
Regression analysis
regressionanalysis-110723130213-phpapp02.pdf
Module 3 Course Slides Lesson 1 McGill University
Regression analysis in Research Methodology.pptx
linear Regression, multiple Regression and Annova
Chap11 simple regression
An Overview of Simple Linear Regression
Evans_Analytics3e_ppt_08_Accessible.pptx
1. Regression_V1.pdf
Intro to econometrics
Concept of Regression in Research Methodology.pdf
Regression-SIMPLE LINEAR (1).psssssssssptx
chap12.pptx
Ad

Recently uploaded (20)

PDF
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
PPTX
Internet of Things (IOT) - A guide to understanding
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PPT
Mechanical Engineering MATERIALS Selection
PPTX
UNIT 4 Total Quality Management .pptx
PPTX
additive manufacturing of ss316l using mig welding
PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PPTX
web development for engineering and engineering
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PPTX
Foundation to blockchain - A guide to Blockchain Tech
PPTX
Strings in CPP - Strings in C++ are sequences of characters used to store and...
PDF
PPT on Performance Review to get promotions
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
DOCX
573137875-Attendance-Management-System-original
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PDF
composite construction of structures.pdf
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
Internet of Things (IOT) - A guide to understanding
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
Mechanical Engineering MATERIALS Selection
UNIT 4 Total Quality Management .pptx
additive manufacturing of ss316l using mig welding
CYBER-CRIMES AND SECURITY A guide to understanding
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
web development for engineering and engineering
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
Foundation to blockchain - A guide to Blockchain Tech
Strings in CPP - Strings in C++ are sequences of characters used to store and...
PPT on Performance Review to get promotions
Embodied AI: Ushering in the Next Era of Intelligent Systems
573137875-Attendance-Management-System-original
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
composite construction of structures.pdf

Simple Linear Regression for business statistics

  • 1. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 1 Simple Linear Regression Chapter 12
  • 2. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 2 Correlation vs. Regression  A scatter plot can be used to show the relationship between two variables  Correlation analysis is used to measure the strength of the association (linear relationship) between two variables  Correlation is only concerned with strength of the relationship  No causal effect is implied with correlation DCOVA
  • 3. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 3 12.1 Regression Models Y X Y X Y Y X X Linear relationships Curvilinear relationships DCOVA
  • 4. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 4 Types of Relationships Y X Y X Y Y X X Strong relationships Weak relationships (continued) DCOVA
  • 5. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 5 Types of Relationships Y X Y X No relationship DCOVA (continued)
  • 6. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 6 12.2 Introduction to Regression Analysis  Regression analysis is used to:  Predict the value of a dependent variable based on the value of at least one independent variable  Explain the impact of changes in an independent variable on the dependent variable  Dependent variable: the variable we wish to predict or explain  Independent variable: the variable used to predict or explain the dependent variable DCOVA
  • 7. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 7 Simple Linear Regression Model  Only one independent variable, X  Relationship between X and Y is described by a linear function  Changes in Y are assumed to be related to changes in X DCOVA
  • 8. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 8 i i 1 0 i ε X β β Y    Linear component Simple Linear Regression Model Population Y intercept Population Slope Coefficient Random Error term Dependent Variable Independent Variable Random Error component DCOVA
  • 9. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 9 Random Error for this Xi value Y X Observed Value of Y for Xi Predicted Value of Y for Xi i i 1 0 i ε X β β Y    Xi Slope = β1 Intercept = β0 εi Simple Linear Regression Model DCOVA (continued)
  • 10. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 10 Simple Linear Regression Equation (Prediction Line) i 1 0 i X b b Ŷ   The simple linear regression equation provides an estimate of the population regression line Estimate of the regression intercept Estimate of the regression slope Estimated (or predicted) Y value for observation i Value of X for observation i DCOVA
  • 11. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 11 The Least Squares Method b0 and b1 are obtained by finding the values that minimize the sum of the squared differences between Y and Y : 2 i 1 0 i 2 i i )) X b (b (Y min ) Ŷ (Y min       DCOVA 
  • 12. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 12 Finding the Least Squares Equation  The coefficients b0 and b1, and other regression results in this chapter, will be found using Excel or Minitab Formulas are shown in the text for those who are interested DCOVA
  • 13. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 13 Interpretation of the Slope and the Intercept  b0 is the estimated mean value of Y when the value of X is zero  b1 is the estimated change in the mean value of Y as a result of a one-unit increase in X DCOVA
  • 14. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 14 Simple Linear Regression Example  A real estate agent wishes to examine the relationship between the selling price of a home and its size (measured in square feet)  A random sample of 10 houses is selected  Dependent variable (Y) = house price in $1000s  Independent variable (X) = square feet DCOVA
  • 15. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 15 Simple Linear Regression Example: Data House Price in $1000s (Y) Square Feet (X) 245 1400 312 1600 279 1700 308 1875 199 1100 219 1550 405 2350 324 2450 319 1425 255 1700 DCOVA
  • 16. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 16 0 50 100 150 200 250 300 350 400 450 0 500 1000 1500 2000 2500 3000 House Price ($1000s) Square Feet Simple Linear Regression Example: Scatter Plot House price model: Scatter Plot DCOVA
  • 17. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 17 Simple Linear Regression Example: Using Excel Data Analysis Function 1. Choose Data 2. Choose Data Analysis 3. Choose Regression DCOVA
  • 18. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 18 Simple Linear Regression Example: Using Excel Data Analysis Function Enter Y range and X range and desired options DCOVA (continued)
  • 19. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 19 Simple Linear Regression Example: Using PHStat Add-Ins: PHStat: Regression: Simple Linear Regression
  • 20. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 20 Simple Linear Regression Example: Excel Output Regression Statistics Multiple R 0.76211 R Square 0.58082 Adjusted R Square 0.52842 Standard Error 41.33032 Observations 10 ANOVA df SS MS F Significance F Regression 1 18934.9348 18934.9348 11.0848 0.01039 Residual 8 13665.5652 1708.1957 Total 9 32600.5000 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 98.24833 58.03348 1.69296 0.12892 -35.57720 232.07386 Square Feet 0.10977 0.03297 3.32938 0.01039 0.03374 0.18580 The regression equation is: feet) (square 0.10977 98.24833 price house   DCOVA
  • 21. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 21 Simple Linear Regression Example: Minitab Output The regression equation is Price = 98.2 + 0.110 Square Feet Predictor Coef SE Coef T P Constant 98.25 58.03 1.69 0.129 Square Feet 0.10977 0.03297 3.33 0.010 S = 41.3303 R-Sq = 58.1% R-Sq(adj) = 52.8% Analysis of Variance Source DF SS MS F P Regression 1 18935 18935 11.08 0.010 Residual Error 8 13666 1708 Total 9 32600 The regression equation is: house price = 98.24833 + 0.10977 (square feet) DCOVA
  • 22. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 22 0 50 100 150 200 250 300 350 400 450 0 500 1000 1500 2000 2500 3000 Square Feet House Price ($1000s) Simple Linear Regression Example: Graphical Representation House price model: Scatter Plot and Prediction Line feet) (square 0.10977 98.24833 price house   Slope = 0.10977 Intercept = 98.248 DCOVA
  • 23. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 23 Simple Linear Regression Example: Interpretation of bo  b0 is the estimated mean value of Y when the value of X is zero (if X = 0 is in the range of observed X values)  Because a house cannot have a square footage of 0, b0 has no practical application feet) (square 0.10977 98.24833 price house   DCOVA
  • 24. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 24 Simple Linear Regression Example: Interpreting b1  b1 estimates the change in the mean value of Y as a result of a one-unit increase in X  Here, b1 = 0.10977 tells us that the mean value of a house increases by .10977($1000) = $109.77, on average, for each additional one square foot of size feet) (square 0.10977 98.24833 price house   DCOVA
  • 25. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 25 317.85 0) 0.1098(200 98.25 (sq.ft.) 0.1098 98.25 price house      Predict the price for a house with 2000 square feet: The predicted price for a house with 2000 square feet is 317.85($1,000s) = $317,850 Simple Linear Regression Example: Making Predictions DCOVA
  • 26. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 26 0 50 100 150 200 250 300 350 400 450 0 500 1000 1500 2000 2500 3000 Square Feet House Price ($1000s) Simple Linear Regression Example: Making Predictions  When using a regression model for prediction, only predict within the relevant range of data Relevant range for interpolation Do not try to extrapolate beyond the range of observed X’s DCOVA
  • 27. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 27 12.4 Assumptions of Regression L.I.N.E  Linearity  The relationship between X and Y is linear  Independence of Errors  Error values are statistically independent  Particularly important when data are collected over a period of time  Normality of Error  Error values are normally distributed for any given value of X  Equal Variance (also called homoscedasticity)  The probability distribution of the errors has constant variance DCOVA
  • 28. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 28 12.7 Inferences About the Slope  The standard error of the regression slope coefficient (b1) is estimated by     2 i YX YX b ) X (X S SSX S S 1 where: = Estimate of the standard error of the slope = Standard error of the estimate 1 b S 2 n SSE SYX   DCOVA
  • 29. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 29 Inferences About the Slope: t Test  t test for a population slope  Is there a linear relationship between X and Y?  Null and alternative hypotheses  H0: β1 = 0 (no linear relationship)  H1: β1 ≠ 0 (linear relationship does exist)  Test statistic 1 b 1 1 STAT S β b t   2 n d.f.   where: b1 = regression slope coefficient β1 = hypothesized slope Sb1 = standard error of the slope DCOVA
  • 30. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 30 Inferences About the Slope: t Test Example House Price in $1000s (y) Square Feet (x) 245 1400 312 1600 279 1700 308 1875 199 1100 219 1550 405 2350 324 2450 319 1425 255 1700 Estimated Regression Equation: The slope of this model is 0.1098 Is there a relationship between the square footage of the house and its sales price? DCOVA house price = 98.25 + 0.1098 (sq. ft.)
  • 31. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 31 Inferences About the Slope: t Test Example 1 b S H0: β1 = 0 H1: β1 ≠ 0 From Excel output: Coefficients Standard Error t Stat P-value Intercept 98.24833 58.03348 1.69296 0.12892 Square Feet 0.10977 0.03297 3.32938 0.01039 1 b S b1 32938 3 03297 0 0 10977 0 S β b t 1 b 1 1 STAT . . .      Predictor Coef SE Coef T P Constant 98.25 58.03 1.69 0.129 Square Feet 0.10977 0.03297 3.33 0.010 From Minitab output: b1 DCOVA
  • 32. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 32 Inferences About the Slope: t Test Example H0: β1 = 0 H1: β1 ≠ 0 Test Statistic: tSTAT = 3.329 There is sufficient evidence that square footage affects house price Decision: Reject H0 Reject H0 Reject H0 /2=.025 -tα/2 Do not reject H0 0 tα/2 /2=.025 -2.3060 2.3060 3.329 d.f. = 10- 2 = 8 DCOVA
  • 33. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 33 Inferences About the Slope: t Test Example H0: β1 = 0 H1: β1 ≠ 0 From Excel output: Coefficients Standard Error t Stat P-value Intercept 98.24833 58.03348 1.69296 0.12892 Square Feet 0.10977 0.03297 3.32938 0.01039 p-value There is sufficient evidence that square footage affects house price. Decision: Reject H0, since p-value < α Predictor Coef SE Coef T P Constant 98.25 58.03 1.69 0.129 Square Feet 0.10977 0.03297 3.33 0.010 From Minitab output: DCOVA
  • 34. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 34 Confidence Interval Estimate for the Slope Confidence Interval Estimate of the Slope: Excel Printout for House Prices: At 95% level of confidence, the confidence interval for the slope is (0.0337, 0.1858) 1 b 2 / 1 S b α t  Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 98.24833 58.03348 1.69296 0.12892 -35.57720 232.07386 Square Feet 0.10977 0.03297 3.32938 0.01039 0.03374 0.18580 d.f. = n - 2 DCOVA
  • 35. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 35 Since the units of the house price variable is $1000s, we are 95% confident that the average impact on sales price is between $33.74 and $185.80 per square foot of house size Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 98.24833 58.03348 1.69296 0.12892 -35.57720 232.07386 Square Feet 0.10977 0.03297 3.32938 0.01039 0.03374 0.18580 This 95% confidence interval does not include 0. Conclusion: There is a significant relationship between house price and square feet at the .05 level of significance Confidence Interval Estimate for the Slope DCOVA (continued)
  • 36. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 36 Confidence Interval Estimate for the Slope from Minitab DCOVA Predictor Coef SE Coef T P Constant 98.25 58.03 1.69 0.129 Square Feet 0.10977 0.03297 3.33 0.010 Minitab does not automatically calculate a confidence interval for the slope but provides the quantities necessary to use the confidence interval formula. 1 b 2 / 1 S b α t  (continued)
  • 37. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 37 t Test for a Correlation Coefficient  Hypotheses H0: ρ = 0 (no correlation between X and Y) H1: ρ ≠ 0 (correlation exists)  Test statistic (with n – 2 degrees of freedom) 2 n r 1 ρ - r t 2 STAT    0 b if r r 0 b if r r where 1 2 1 2       DCOVA
  • 38. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 38 t-test For A Correlation Coefficient Is there evidence of a linear relationship between square feet and house price at the .05 level of significance? H0: ρ = 0 (No correlation) H1: ρ ≠ 0 (correlation exists)  =.05 , df = 10 - 2 = 8 3.329 2 10 .762 1 0 .762 2 n r 1 ρ r t 2 2 STAT          DCOVA (continued)
  • 39. Copyright © 2016 Pearson Education, Ltd. Chapter 12, Slide 39 t-test For A Correlation Coefficient Conclusion: There is evidence of a linear association at the 5% level of significance Decision: Reject H0 Reject H0 Reject H0 /2=.025 -tα/2 Do not reject H0 0 tα/2 /2=.025 -2.3060 2.3060 3.329 d.f. = 10-2 = 8 3.329 2 10 .762 1 0 .762 2 n r 1 ρ r t 2 2 STAT          DCOVA (continued)