SlideShare a Scribd company logo
Simple linear regression
It is statistical method that allows us to summarize and study relationships between two continuous (quantitative) variables:
• One variable, denoted x, is regarded as the predictor, explanatory, or independent variable.
• The other variable, denoted y, is regarded as the response, outcome, or dependent variable.
• We will examine the relationship between quantitative variables x and y via a mathematical equation.
• The model has a deterministic and a statistical components
House
Cost
Most lots sell
for $25,000
Building a house costs about
$75 per square foot.
House cost = 25000 + 75(Size)
House cost = 25000 + 75(Size)
House size
House
Cost
Most lots sell
for $25,000
+ e
Since cost behave unpredictably,
we add a random component.
House size
Simple linear regression
• The simplest deterministic mathematical relationship between two variables x and y is a linear
relationship: y = β0 + β1x. (True regression line)
• The objective is to develop an equivalent linear probabilistic model.
• If the two (random) variables are probabilistically related, then for a fixed value of x, there is
uncertainty in the value of the second variable.
• So, we assume y = β0 + β1x + ε, where ε is a random variable.
Simple linear regression
• The points (x1, y1), …, (xn, yn) resulting from n independent observations will then be scattered
about the true regression line:





 x
y 1
0
Simple linear regression
Estimating Model parameters:
• The values of β0, β1 and ε will almost never be known to an investigator.
• Instead, sample data consists of n observed pairs (x1, y1), … , (xn, yn), from which
the model parameters and the true regression line itself can be estimated.
• Where Yi = β0 +β1xi + εi for i = 1, 2, … , n and the n deviations ε1, ε2,…, εn are
independent r.v.’ s.
• Aim is to find the Best Fit Line: the sum of the squared vertical distances
(deviations) from the observed points to that line is as small as it can be.
Simple linear regression
The sum of squared vertical deviations from the points (x1, y1), … , (xn, yn), to the
line is then
  
 
1
0 1
2
2 2
( 1)
xy
xx
i i
xy i i
i
xx i x
SS
b
SS
b y b x
x y
SS x y
n
x
SS x n s
n

 
 
   
 



The point estimates of β0 and β1, denoted by b1
and b0, are called the least squares estimates –
they are those values that minimize using
partial derivatives.
The predicted values are obtained using:
0 1
ŷ b b x
 
Simple linear regression
Simple linear regression
Simple linear regression
Simple linear regression
Simple linear regression
Linear regression, while a powerful tool, has certain limitations that should be considered:
• Linearity: Assumes a linear relationship between the dependent and independent variables. If the relationship is non-
linear, the model may not accurately capture the underlying pattern.
• Independence: Assumes that the errors are independent of each other. If there is autocorrelation in the errors, the
model's estimates may be biased and inefficient.
• Homoscedasticity: Assumes that the variance of the errors is constant across all levels of the independent variable. If
the variance is not constant (heteroscedasticity), the model's estimates may be inefficient.
• Normality: Assumes that the errors are normally distributed. If the errors are not normally distributed, the model's
inferences may be invalid.
• Sensitivity to Outliers: Linear regression can be sensitive to outliers, which can have a significant impact on the
model's estimates. Outliers can distort the relationship between the variables and lead to biased results.
• Limited Flexibility: Linear regression can only model linear relationships. If the relationship between the variables is
complex or non-linear, linear regression may not be able to adequately capture the pattern.

More Related Content

PPTX
Detail Study of the concept of Regression model.pptx
PPTX
Linear regression.pptx
PPTX
REGRESSION METasdfghjklmjhgftrHODS1.pptx
PPTX
Introduction to Regression . pptx
PPTX
Regression refers to the statistical technique of modeling
PPTX
MachineLearning_Unit-II.pptxScrum.pptxAgile Model.pptxAgile Model.pptxAgile M...
PPTX
ML-UNIT-IV complete notes download here
PDF
MachineLearning_Unit-II.FHDGFHJKpptx.pdf
Detail Study of the concept of Regression model.pptx
Linear regression.pptx
REGRESSION METasdfghjklmjhgftrHODS1.pptx
Introduction to Regression . pptx
Regression refers to the statistical technique of modeling
MachineLearning_Unit-II.pptxScrum.pptxAgile Model.pptxAgile Model.pptxAgile M...
ML-UNIT-IV complete notes download here
MachineLearning_Unit-II.FHDGFHJKpptx.pdf

Similar to Unit4- Lecture1.pptx simple linear regression (20)

PPTX
Regression Analysis.pptx
PPTX
Regression Analysis Techniques.pptx
PPTX
Sessions 18 19- Regression- SLR MLR.pptx
PPTX
Buoi 1.2-EMBS6Regre, Simple Lineae Regression,
PPTX
Lecture 8 Linear and Multiple Regression (1).pptx
PPTX
regression analysis presentation slides.
PPTX
Linear regression aims to find the "best-fit" linear line
PPT
Simple Linear Regression.pptSimple Linear Regression.ppt
PPTX
SET-02_SOCS_ESE-DEC23__B.Tech%20(CSE-H+NH)-AIML_5_CSAI3001_Neural%20Networks.pdf
PPTX
Linear Regression final-1.pptx thbejnnej
PPTX
simple and multiple linear Regression. (1).pptx
PPTX
Linear Regression | Machine Learning | Data Science
PDF
Lecture 1.pdf
PDF
The normal presentation about linear regression in machine learning
PDF
Simple Linear Regression detail explanation.pdf
PDF
Machine-Learning-Unit2Machine Learning-Machine Learning--Regression.pdf
PPTX
Simple Linear Regression explanation.pptx
PPTX
Basics of Regression analysis
PDF
Module 5.pdf Machine Learning Types and examples
Regression Analysis.pptx
Regression Analysis Techniques.pptx
Sessions 18 19- Regression- SLR MLR.pptx
Buoi 1.2-EMBS6Regre, Simple Lineae Regression,
Lecture 8 Linear and Multiple Regression (1).pptx
regression analysis presentation slides.
Linear regression aims to find the "best-fit" linear line
Simple Linear Regression.pptSimple Linear Regression.ppt
SET-02_SOCS_ESE-DEC23__B.Tech%20(CSE-H+NH)-AIML_5_CSAI3001_Neural%20Networks.pdf
Linear Regression final-1.pptx thbejnnej
simple and multiple linear Regression. (1).pptx
Linear Regression | Machine Learning | Data Science
Lecture 1.pdf
The normal presentation about linear regression in machine learning
Simple Linear Regression detail explanation.pdf
Machine-Learning-Unit2Machine Learning-Machine Learning--Regression.pdf
Simple Linear Regression explanation.pptx
Basics of Regression analysis
Module 5.pdf Machine Learning Types and examples
Ad

Recently uploaded (20)

PDF
Human-AI Collaboration: Balancing Agentic AI and Autonomy in Hybrid Systems
PDF
BIO-INSPIRED ARCHITECTURE FOR PARSIMONIOUS CONVERSATIONAL INTELLIGENCE : THE ...
PDF
Soil Improvement Techniques Note - Rabbi
PPTX
Fundamentals of safety and accident prevention -final (1).pptx
PDF
EXPLORING LEARNING ENGAGEMENT FACTORS INFLUENCING BEHAVIORAL, COGNITIVE, AND ...
PPTX
Artificial Intelligence
PDF
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PDF
PREDICTION OF DIABETES FROM ELECTRONIC HEALTH RECORDS
PPTX
Fundamentals of Mechanical Engineering.pptx
PPTX
6ME3A-Unit-II-Sensors and Actuators_Handouts.pptx
PDF
Exploratory_Data_Analysis_Fundamentals.pdf
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PPT
introduction to datamining and warehousing
PDF
Abrasive, erosive and cavitation wear.pdf
PDF
UNIT no 1 INTRODUCTION TO DBMS NOTES.pdf
PDF
Categorization of Factors Affecting Classification Algorithms Selection
PDF
SMART SIGNAL TIMING FOR URBAN INTERSECTIONS USING REAL-TIME VEHICLE DETECTI...
PDF
Artificial Superintelligence (ASI) Alliance Vision Paper.pdf
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
Human-AI Collaboration: Balancing Agentic AI and Autonomy in Hybrid Systems
BIO-INSPIRED ARCHITECTURE FOR PARSIMONIOUS CONVERSATIONAL INTELLIGENCE : THE ...
Soil Improvement Techniques Note - Rabbi
Fundamentals of safety and accident prevention -final (1).pptx
EXPLORING LEARNING ENGAGEMENT FACTORS INFLUENCING BEHAVIORAL, COGNITIVE, AND ...
Artificial Intelligence
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
R24 SURVEYING LAB MANUAL for civil enggi
PREDICTION OF DIABETES FROM ELECTRONIC HEALTH RECORDS
Fundamentals of Mechanical Engineering.pptx
6ME3A-Unit-II-Sensors and Actuators_Handouts.pptx
Exploratory_Data_Analysis_Fundamentals.pdf
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
introduction to datamining and warehousing
Abrasive, erosive and cavitation wear.pdf
UNIT no 1 INTRODUCTION TO DBMS NOTES.pdf
Categorization of Factors Affecting Classification Algorithms Selection
SMART SIGNAL TIMING FOR URBAN INTERSECTIONS USING REAL-TIME VEHICLE DETECTI...
Artificial Superintelligence (ASI) Alliance Vision Paper.pdf
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
Ad

Unit4- Lecture1.pptx simple linear regression

  • 1. Simple linear regression It is statistical method that allows us to summarize and study relationships between two continuous (quantitative) variables: • One variable, denoted x, is regarded as the predictor, explanatory, or independent variable. • The other variable, denoted y, is regarded as the response, outcome, or dependent variable. • We will examine the relationship between quantitative variables x and y via a mathematical equation. • The model has a deterministic and a statistical components House Cost Most lots sell for $25,000 Building a house costs about $75 per square foot. House cost = 25000 + 75(Size) House cost = 25000 + 75(Size) House size House Cost Most lots sell for $25,000 + e Since cost behave unpredictably, we add a random component. House size
  • 2. Simple linear regression • The simplest deterministic mathematical relationship between two variables x and y is a linear relationship: y = β0 + β1x. (True regression line) • The objective is to develop an equivalent linear probabilistic model. • If the two (random) variables are probabilistically related, then for a fixed value of x, there is uncertainty in the value of the second variable. • So, we assume y = β0 + β1x + ε, where ε is a random variable.
  • 3. Simple linear regression • The points (x1, y1), …, (xn, yn) resulting from n independent observations will then be scattered about the true regression line:       x y 1 0
  • 4. Simple linear regression Estimating Model parameters: • The values of β0, β1 and ε will almost never be known to an investigator. • Instead, sample data consists of n observed pairs (x1, y1), … , (xn, yn), from which the model parameters and the true regression line itself can be estimated. • Where Yi = β0 +β1xi + εi for i = 1, 2, … , n and the n deviations ε1, ε2,…, εn are independent r.v.’ s. • Aim is to find the Best Fit Line: the sum of the squared vertical distances (deviations) from the observed points to that line is as small as it can be.
  • 5. Simple linear regression The sum of squared vertical deviations from the points (x1, y1), … , (xn, yn), to the line is then      1 0 1 2 2 2 ( 1) xy xx i i xy i i i xx i x SS b SS b y b x x y SS x y n x SS x n s n               The point estimates of β0 and β1, denoted by b1 and b0, are called the least squares estimates – they are those values that minimize using partial derivatives. The predicted values are obtained using: 0 1 ŷ b b x  
  • 10. Simple linear regression Linear regression, while a powerful tool, has certain limitations that should be considered: • Linearity: Assumes a linear relationship between the dependent and independent variables. If the relationship is non- linear, the model may not accurately capture the underlying pattern. • Independence: Assumes that the errors are independent of each other. If there is autocorrelation in the errors, the model's estimates may be biased and inefficient. • Homoscedasticity: Assumes that the variance of the errors is constant across all levels of the independent variable. If the variance is not constant (heteroscedasticity), the model's estimates may be inefficient. • Normality: Assumes that the errors are normally distributed. If the errors are not normally distributed, the model's inferences may be invalid. • Sensitivity to Outliers: Linear regression can be sensitive to outliers, which can have a significant impact on the model's estimates. Outliers can distort the relationship between the variables and lead to biased results. • Limited Flexibility: Linear regression can only model linear relationships. If the relationship between the variables is complex or non-linear, linear regression may not be able to adequately capture the pattern.