SlideShare a Scribd company logo
SIMPLE LINEAR
REGRESSION
By –
Shubham Bhardwaj
Brief History
• This term was introduced by Francis Galton
• It was introduced in the year 1877
• It was first started with the study of height of parents and the height of their
children.
• In the beginning it is only regarded as biological phenomenon.
• Later it was extended by Udny Yule and Karl Pearson.
INTRODUCTION
• The word regression means going back or returning.
• It is the study of effect of one variable over another variable.
• Also regression can be called as a statistical tool with the help of which we
can estimate the unknown values of one variable with the known values of
another variable.
• In modern world it has very important role in the field of Machine Learning.
Regression model
Yi =β1+ β2Xi+Ui
β1 : Intercept Parameters or
β2 : Slope coefficient regression coefficients
Yi : Dependent/Explained/Regressed/Endogenous variable
Xi : Independent/Explanatory/Regressor/Exogenous variable
Ui : Disturbance term or error term
Difference between correlation and regression
Correlation
• Used to determine the strength of
linear relationship between two
variables.
• No difference between dependent
and explanatory variables.
Regression
• Used to predict average value of
one variable on the basis of the
fixed values of other variables.
• Dependent variable is random and
explanatory variable is fixed.
Dependent variable : The variable whose value is influenced or predicted.
Explanatory variable : The variable which influences the values or is used for
prediction.
Estimate of parameters using OLS method
Scatter Plot Diagram
Scatter plots (also called scatter graphs) are
similar to line graphs. A line graph uses a
line on an X-Y axis to plot a continuous
function, while a scatter plot uses dots to
represent individual pieces of data. In
statistics, these plots are useful to see if
two variables are related to each other.
For example, a scatter chart can suggest
a linear relationship (i.e. a straight line).
Line of best fit
A line of best fit (or "trend" line) is a straight line that best represents
the data on a scatter plot.
This line may pass through some of the points, none of the points, or all
of the points.
Also the sum of squares of the deviations of the actual value of Y from
their estimated value is minimum.
Where Do we use regression analysis ?
• Where we want to indicates the significant relationships between dependent
variable and independent variable. As for example - income and expenditure.
• To make predictions and forecasting.
• Where we can find dependency among different variables relating to a
phenomenon. For example - We want to estimate growth in sales of a company
based on current economic conditions. You have the recent company data
which indicates that the growth in sales is around two and a half times the growth
in the economy. Using this insight, we can predict future sales of the company
based on current & past information.
R-Square
• It is a measure of “Goodness of Fit” for the regression model.
• It is the ratio of Explained Sum of Square (ESS) to the Total sum of Square (TSS).
• It tells us how much capable the regressors are in explaining the variability of the
dependent variable in the model.
• Its value can never be negative.
• Its value lie between 0 to 1
• It is denoted by R2 .
Thank you !

More Related Content

PDF
Simple linear regression
PPTX
Simple Linear Regression
PPT
Simple linear regression
PPTX
Simple linear regression analysis
PPTX
Binomial probability distributions
PPTX
Regression
PPTX
Regression
PPTX
Gamma, Expoential, Poisson And Chi Squared Distributions
Simple linear regression
Simple Linear Regression
Simple linear regression
Simple linear regression analysis
Binomial probability distributions
Regression
Regression
Gamma, Expoential, Poisson And Chi Squared Distributions

What's hot (20)

PPTX
Statistics-Regression analysis
PPTX
Regression analysis
PPTX
Standard normal distribution
PPTX
Regression analysis
PPTX
BINOMIAL ,POISSON AND NORMAL DISTRIBUTION.pptx
PPTX
Presentation On Regression
PPTX
Normal as Approximation to Binomial
PDF
Linear regression theory
PDF
Multiple linear regression
PDF
Correlation and Simple Regression
PPTX
Regression ppt
PPTX
Statistical inference: Estimation
PPTX
PPTX
Hypothesis testing , T test , chi square test, z test
PPT
Regression analysis
PPTX
Inferential Statistics
PPT
Regression analysis
PPTX
INTRODUCTION TO BIO STATISTICS
PDF
Probability - I
Statistics-Regression analysis
Regression analysis
Standard normal distribution
Regression analysis
BINOMIAL ,POISSON AND NORMAL DISTRIBUTION.pptx
Presentation On Regression
Normal as Approximation to Binomial
Linear regression theory
Multiple linear regression
Correlation and Simple Regression
Regression ppt
Statistical inference: Estimation
Hypothesis testing , T test , chi square test, z test
Regression analysis
Inferential Statistics
Regression analysis
INTRODUCTION TO BIO STATISTICS
Probability - I
Ad

Similar to Simple linear regression (20)

PPTX
ML4 Regression.pptx
PPTX
Regression analysis
PDF
Correlation and Regression.pdf
PPTX
Module 4- Correlation & Regression Analysis.pptx
PPTX
STATISTICAL REGRESSION MODELS
PDF
Simple regressionand correlation (2).pdf
PPTX
how to select the appropriate method for our study of interest
PPTX
DciupewdncupiercnuiperhcCORRELATION-ANALYSIS.pptx
PPTX
Correlation and Regression Analysis.pptx
DOCX
How regression analysis can be applied in business
PPTX
Module - 2 correlation and regression.pptx
PPTX
Module - 2 correlation and regression.pptx
PDF
Artificial Intelligence (Unit - 8).pdf
PPTX
Chapter 2 Simple Linear Regression Model.pptx
PPTX
Regression
PPTX
Linear regression aims to find the "best-fit" linear line
PPTX
MIKKAH MAE C. MANGAYAN- FS103.inferential-statisticspptx
PPTX
Regression Analysis
PPTX
Regression of research methodlogyyy.pptx
DOC
Covariance and correlation
ML4 Regression.pptx
Regression analysis
Correlation and Regression.pdf
Module 4- Correlation & Regression Analysis.pptx
STATISTICAL REGRESSION MODELS
Simple regressionand correlation (2).pdf
how to select the appropriate method for our study of interest
DciupewdncupiercnuiperhcCORRELATION-ANALYSIS.pptx
Correlation and Regression Analysis.pptx
How regression analysis can be applied in business
Module - 2 correlation and regression.pptx
Module - 2 correlation and regression.pptx
Artificial Intelligence (Unit - 8).pdf
Chapter 2 Simple Linear Regression Model.pptx
Regression
Linear regression aims to find the "best-fit" linear line
MIKKAH MAE C. MANGAYAN- FS103.inferential-statisticspptx
Regression Analysis
Regression of research methodlogyyy.pptx
Covariance and correlation
Ad

Recently uploaded (20)

PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
PPTX
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
PPTX
A Complete Guide to Streamlining Business Processes
PPTX
Topic 5 Presentation 5 Lesson 5 Corporate Fin
PPTX
Pilar Kemerdekaan dan Identi Bangsa.pptx
PDF
Introduction to Data Science and Data Analysis
PDF
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
PDF
[EN] Industrial Machine Downtime Prediction
PPTX
modul_python (1).pptx for professional and student
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPTX
New ISO 27001_2022 standard and the changes
PDF
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
PPTX
Introduction to Inferential Statistics.pptx
PPTX
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
PDF
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
STERILIZATION AND DISINFECTION-1.ppthhhbx
IBA_Chapter_11_Slides_Final_Accessible.pptx
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
A Complete Guide to Streamlining Business Processes
Topic 5 Presentation 5 Lesson 5 Corporate Fin
Pilar Kemerdekaan dan Identi Bangsa.pptx
Introduction to Data Science and Data Analysis
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
[EN] Industrial Machine Downtime Prediction
modul_python (1).pptx for professional and student
Qualitative Qantitative and Mixed Methods.pptx
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
New ISO 27001_2022 standard and the changes
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
Introduction to Inferential Statistics.pptx
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
Acceptance and paychological effects of mandatory extra coach I classes.pptx

Simple linear regression

  • 2. Brief History • This term was introduced by Francis Galton • It was introduced in the year 1877 • It was first started with the study of height of parents and the height of their children. • In the beginning it is only regarded as biological phenomenon. • Later it was extended by Udny Yule and Karl Pearson.
  • 3. INTRODUCTION • The word regression means going back or returning. • It is the study of effect of one variable over another variable. • Also regression can be called as a statistical tool with the help of which we can estimate the unknown values of one variable with the known values of another variable. • In modern world it has very important role in the field of Machine Learning.
  • 4. Regression model Yi =β1+ β2Xi+Ui β1 : Intercept Parameters or β2 : Slope coefficient regression coefficients Yi : Dependent/Explained/Regressed/Endogenous variable Xi : Independent/Explanatory/Regressor/Exogenous variable Ui : Disturbance term or error term
  • 5. Difference between correlation and regression Correlation • Used to determine the strength of linear relationship between two variables. • No difference between dependent and explanatory variables. Regression • Used to predict average value of one variable on the basis of the fixed values of other variables. • Dependent variable is random and explanatory variable is fixed.
  • 6. Dependent variable : The variable whose value is influenced or predicted. Explanatory variable : The variable which influences the values or is used for prediction.
  • 7. Estimate of parameters using OLS method
  • 8. Scatter Plot Diagram Scatter plots (also called scatter graphs) are similar to line graphs. A line graph uses a line on an X-Y axis to plot a continuous function, while a scatter plot uses dots to represent individual pieces of data. In statistics, these plots are useful to see if two variables are related to each other. For example, a scatter chart can suggest a linear relationship (i.e. a straight line).
  • 9. Line of best fit A line of best fit (or "trend" line) is a straight line that best represents the data on a scatter plot. This line may pass through some of the points, none of the points, or all of the points. Also the sum of squares of the deviations of the actual value of Y from their estimated value is minimum.
  • 10. Where Do we use regression analysis ? • Where we want to indicates the significant relationships between dependent variable and independent variable. As for example - income and expenditure. • To make predictions and forecasting. • Where we can find dependency among different variables relating to a phenomenon. For example - We want to estimate growth in sales of a company based on current economic conditions. You have the recent company data which indicates that the growth in sales is around two and a half times the growth in the economy. Using this insight, we can predict future sales of the company based on current & past information.
  • 11. R-Square • It is a measure of “Goodness of Fit” for the regression model. • It is the ratio of Explained Sum of Square (ESS) to the Total sum of Square (TSS). • It tells us how much capable the regressors are in explaining the variability of the dependent variable in the model. • Its value can never be negative. • Its value lie between 0 to 1 • It is denoted by R2 .