SlideShare a Scribd company logo
REGRESSION
Meaning
• In statistics, regression analysis is a statistical process
for estimating the relationships among variables.
• Regression analysis:There are several types of
regression:
– Linear regression
– Simple linear regression
– Logistic regression
– Nonlinear regression
– Nonparametric regression
– Robust regression
– Stepwise regression
Meaning: regression
Noun
1.the act of going back to a previous place or state;
return or reversion.
2.retrogradation; retrogression.
3.Biology. reversion to an earlier or less advanced
state or form or to a common or general type.
4.Psychoanalysis. the reversion to a chronologically
earlier or less
adapted pattern of behaviour and feeling.
5.a subsidence of a disease or its manifestations:
a regression of symptoms.
INTRODUCTION
• Many engineering and scientific problems
– concerned with determining a relationship between a
set of variables.
• In a chemical process, might be interested in
– the relationship between
• the output of the process, the temperature at which it
occurs, and
• the amount of catalyst employed.
• Knowledge of such a relationship would enable us
– to predict the output for various values of
temperature and amount of catalyst.
INTRODUCTION
• Situation-
– there is a single response variable Y , also called
the dependent variable
– depends on the value of a set of input, also called
independent, variables x1, . . . , xr
• The simplest type of relationship these
varibles is a linear relationship
INTRODUCTION
• If this was the linear relationship between Y and
the xi, i = 1, . . . , r, then it would be possible
– once the βi were learned
– to exactly predict the response for any set of input
values.
• In practice, such precision is almost never
attainable
• The most that one can expect is that Equation
would be valid subject to random error
INTRODUCTION
Introduction
Regression
• Suppose that the responses Yi corresponding to the input
values xi, i = 1, . . . , n are to be observed and used to
estimate α and β in a simple linear regression model.
• To determine estimators of α and β we reason as follows:
– If A is the estimator of α and B of β, then the estimator of the
response corresponding to the input variable xi would be A + Bxi.
• Since the actual response is Yi ,
– the squared difference is (Yi − A − Bxi)2, and so if A and B are the
estimators of α and β,
– then the sum of the squared differences between the estimated
responses
– the actual response values—call it SS — is given by
Regression
Regression
Regression
Regression
Regression
Joke on Regression
DISTRIBUTION OF THE ESTIMATORS
• To specify the distribution of the estimators A
and B, it is necessary to make additional
assumptions about the random errors aside
from just assuming that their mean is 0.
• The usual approach is to assume that the
random errors are independent normal
random variables having mean 0 and variance
σ2.
DISTRIBUTION OF THE ESTIMATORS
DISTRIBUTION OF THE ESTIMATORS
DISTRIBUTION OF THE ESTIMATORS
DISTRIBUTION OF THE ESTIMATORS
DISTRIBUTION OF THE ESTIMATORS
DISTRIBUTION OF THE ESTIMATORS
• Remarks
DISTRIBUTION OF THE ESTIMATORS
DISTRIBUTION OF THE ESTIMATORS
DISTRIBUTION OF THE ESTIMATORS
• Notation:
DISTRIBUTION OF THE ESTIMATORS
DISTRIBUTION OF THE ESTIMATORS
EXAMPLE
• The following data relate
– X: the moisture of a wet mix of a certain product
– Y: the density of the finished product
• Fit a linear curve to these data. Also determine SSR.
EXAMPLE
EXAMPLE
STATISTICAL INFERENCES ABOUT THE
REGRESSION PARAMETERS
• Inferences Concerning β
Inferences Concerning β
Inferences Concerning β
Inferences Concerning β
EXAMPLE
• An individual claims that the fuel consumption of
his automobile does not depend on how fast the
car is driven.
• To test the plausibility of this hypothesis, the car
was tested at various speeds between 45 and 70
miles per hour. The miles per gallon attained at
each of these speeds was determined, with the
following data resulting is given.
• Do these data refute the claim that the mileage
per gallon of gas is unaffected by the speed
at which the car is being driven?
EXAMPLE
EXAMPLE
Inferences Concerning α
Regression
Regression
Regression
Regression
Regression
Summary of Distributional Results
Summary of Distributional Results
THE COEFFICIENT OF DETERMINATION AND THE
SAMPLE CORRELATION COEFFICIENT
THE COEFFICIENT OF DETERMINATION AND THE
SAMPLE CORRELATION COEFFICIENT
THE COEFFICIENT OF DETERMINATION AND THE
SAMPLE CORRELATION COEFFICIENT
ANALYSIS OF RESIDUALS: ASSESSING
THE MODEL
The figure shows that, as indicated both by its scatter diagram and the random
nature of its standardized residuals, appears to fit the straight-line model quite
well.
The figure of the residual plot shows a discernible pattern, in that the residuals
appear to be first decreasing and then increasing as the input level increases. This
often means that higher-order (than just linear) terms are needed to describe the
relationship between the input and response. Indeed, this is also indicated by the
scatter diagram in this case.
The standardized residual plot shows a pattern, in that the absolute value
of the residuals, and thus their squares, appear to be increasing, as the
input level increases. This often indicates that the variance of the response
is not constant but, rather, increases with the input level.
TRANSFORMING TO LINEARITY
• The mean response is not a linear function
• In such cases, if the form of the relationship
can be determined it is sometimes
possible, by a change of variables, to
transform it into a linear form.
• For instance, in certain applications it is known that
W(t), the amplitude of a signal a time t after its
origination, is approximately related to t by the
functional form
TRANSFORMING TO LINEARITY
EXAMPLE
• The following table gives the percentages of a chemical that
were used up when an experiment was run at various
temperatures (in degrees celsius).
• Use it to estimate the percentage of the chemical that would
be used up if the experiment were to be run at 350 degrees.
EXAMPLE
• Let P(x) be the percentage of the chemical that is used up when the
experiment is run at 10x degrees.
• Even though a plot of P(x) looks roughly linear, we can improve upon the
fit by considering a nonlinear relationship between x and P(x).
• Specifically, let us consider a relationship of the form : 1 − P(x) ≈ c(1 − d)x
EXAMPLE
EXAMPLE
POLYNOMIAL REGRESSION
EXAMPLE
• Fit a polynomial to the following data.
EXAMPLE
EXAMPLE
MULTIPLE LINEAR REGRESSION
MULTIPLE LINEAR REGRESSION
MULTIPLE LINEAR REGRESSION
MULTIPLE LINEAR REGRESSION
Regression

More Related Content

PPTX
Parametric Statistical tests
PPT
multiple regression
PPTX
Multivariate analysis
PPTX
Regression analysis
PPTX
Multivariate analysis - Multiple regression analysis
PPTX
Anova ppt
PPTX
Regression Analysis
PPT
correlation and regression
Parametric Statistical tests
multiple regression
Multivariate analysis
Regression analysis
Multivariate analysis - Multiple regression analysis
Anova ppt
Regression Analysis
correlation and regression

What's hot (20)

PPTX
Regression analysis.
PPTX
Correlation and regression analysis
PPTX
Correlation
PDF
Multiple linear regression
PDF
Statistical Distributions
PPTX
Regression Analysis
PPTX
Correlation analysis
PPTX
Lecture 6. univariate and bivariate analysis
PPTX
Normal distribution
PDF
Multiple Correlation - Thiyagu
PPTX
Non Linear Equation
PPTX
Karl pearson's correlation
PPTX
Multivariate data analysis
PPTX
The Binomial, Poisson, and Normal Distributions
PPTX
Linear regression
PPTX
Statistical analysis
PPT
Multiple regression presentation
PPTX
Anova; analysis of variance
PPTX
Parametric Test
Regression analysis.
Correlation and regression analysis
Correlation
Multiple linear regression
Statistical Distributions
Regression Analysis
Correlation analysis
Lecture 6. univariate and bivariate analysis
Normal distribution
Multiple Correlation - Thiyagu
Non Linear Equation
Karl pearson's correlation
Multivariate data analysis
The Binomial, Poisson, and Normal Distributions
Linear regression
Statistical analysis
Multiple regression presentation
Anova; analysis of variance
Parametric Test
Ad

Viewers also liked (20)

PPT
Simple linear regression (final)
PPT
Regression analysis ppt
PPTX
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
PDF
Chapter 16
PPTX
The power of RapidMiner, showing the direct marketing demo
PPTX
Midterm
PDF
Statisticsfor businessproject solution
DOCX
Financialmodeling
PPTX
5 random variables
PPT
Qam formulas
PPTX
Maths scert text book model, chapter 7,statistics
PPTX
Arrays & functions in php
PPTX
Regression
PPT
09 numerical integration
PPTX
ForecastIT 2. Linear Regression & Model Statistics
PDF
Regression: A skin-deep dive
PDF
Probability mass functions and probability density functions
PPT
07 interpolation
PDF
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
PDF
C2.1 intro
Simple linear regression (final)
Regression analysis ppt
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
Chapter 16
The power of RapidMiner, showing the direct marketing demo
Midterm
Statisticsfor businessproject solution
Financialmodeling
5 random variables
Qam formulas
Maths scert text book model, chapter 7,statistics
Arrays & functions in php
Regression
09 numerical integration
ForecastIT 2. Linear Regression & Model Statistics
Regression: A skin-deep dive
Probability mass functions and probability density functions
07 interpolation
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
C2.1 intro
Ad

Similar to Regression (20)

PPTX
Regression-Sheldon Ross from Chapter 9-year2024
PPT
Simple Linear Regression.pptSimple Linear Regression.ppt
PDF
9. parametric regression
PPT
Chapter13
PPTX
Regression -Linear.pptx
PPT
regression analysis .ppt
PPT
15.Simple Linear Regression of case study-530 (2).ppt
PPTX
Regression analysis in R
PPTX
Unit-III Correlation and Regression.pptx
PPTX
Regression analysis
PPTX
regression.pptx
PDF
Lecture 1.pdf
PDF
Regression analysis
PDF
Business statistics-ii-aarhus-bss
PDF
need help with stats 301 assignment help
PPT
Simple lin regress_inference
PPTX
Chapter 12
PPTX
01_SLR_final (1).pptx
PDF
P G STAT 531 Lecture 10 Regression
PPTX
Regression Analysis.pptx
Regression-Sheldon Ross from Chapter 9-year2024
Simple Linear Regression.pptSimple Linear Regression.ppt
9. parametric regression
Chapter13
Regression -Linear.pptx
regression analysis .ppt
15.Simple Linear Regression of case study-530 (2).ppt
Regression analysis in R
Unit-III Correlation and Regression.pptx
Regression analysis
regression.pptx
Lecture 1.pdf
Regression analysis
Business statistics-ii-aarhus-bss
need help with stats 301 assignment help
Simple lin regress_inference
Chapter 12
01_SLR_final (1).pptx
P G STAT 531 Lecture 10 Regression
Regression Analysis.pptx

More from RAVI PRASAD K.J. (16)

PPTX
Interval estimates
PPTX
Application Of Definite Integral
PPTX
Optimisation-two and three variables
PPTX
Optimisation- Nonlinear Problems
PPTX
Big-M method
PPTX
Transport problem
PPTX
Arc length
PPTX
Applications of Definite Intergral
PPTX
Partial derivatives
PPTX
Differentiation and Linearization
PPTX
Hypothesis testing
PPTX
Hypothesis testing
PDF
Parameter estimation
PDF
Economic interpretations of Linear Programming Problem
PDF
Duality in Linear Programming Problem
PDF
Post-optimal analysis of LPP
Interval estimates
Application Of Definite Integral
Optimisation-two and three variables
Optimisation- Nonlinear Problems
Big-M method
Transport problem
Arc length
Applications of Definite Intergral
Partial derivatives
Differentiation and Linearization
Hypothesis testing
Hypothesis testing
Parameter estimation
Economic interpretations of Linear Programming Problem
Duality in Linear Programming Problem
Post-optimal analysis of LPP

Recently uploaded (20)

PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PPTX
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PPTX
OOP with Java - Java Introduction (Basics)
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PPT
Mechanical Engineering MATERIALS Selection
PPTX
Strings in CPP - Strings in C++ are sequences of characters used to store and...
DOCX
573137875-Attendance-Management-System-original
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PPTX
Internet of Things (IOT) - A guide to understanding
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PDF
Digital Logic Computer Design lecture notes
PPTX
additive manufacturing of ss316l using mig welding
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PPTX
CH1 Production IntroductoryConcepts.pptx
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PPTX
web development for engineering and engineering
PPTX
Lesson 3_Tessellation.pptx finite Mathematics
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
OOP with Java - Java Introduction (Basics)
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
Embodied AI: Ushering in the Next Era of Intelligent Systems
Mechanical Engineering MATERIALS Selection
Strings in CPP - Strings in C++ are sequences of characters used to store and...
573137875-Attendance-Management-System-original
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
UNIT-1 - COAL BASED THERMAL POWER PLANTS
Internet of Things (IOT) - A guide to understanding
Model Code of Practice - Construction Work - 21102022 .pdf
Digital Logic Computer Design lecture notes
additive manufacturing of ss316l using mig welding
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
CH1 Production IntroductoryConcepts.pptx
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
web development for engineering and engineering
Lesson 3_Tessellation.pptx finite Mathematics

Regression

  • 2. Meaning • In statistics, regression analysis is a statistical process for estimating the relationships among variables. • Regression analysis:There are several types of regression: – Linear regression – Simple linear regression – Logistic regression – Nonlinear regression – Nonparametric regression – Robust regression – Stepwise regression
  • 3. Meaning: regression Noun 1.the act of going back to a previous place or state; return or reversion. 2.retrogradation; retrogression. 3.Biology. reversion to an earlier or less advanced state or form or to a common or general type. 4.Psychoanalysis. the reversion to a chronologically earlier or less adapted pattern of behaviour and feeling. 5.a subsidence of a disease or its manifestations: a regression of symptoms.
  • 4. INTRODUCTION • Many engineering and scientific problems – concerned with determining a relationship between a set of variables. • In a chemical process, might be interested in – the relationship between • the output of the process, the temperature at which it occurs, and • the amount of catalyst employed. • Knowledge of such a relationship would enable us – to predict the output for various values of temperature and amount of catalyst.
  • 5. INTRODUCTION • Situation- – there is a single response variable Y , also called the dependent variable – depends on the value of a set of input, also called independent, variables x1, . . . , xr • The simplest type of relationship these varibles is a linear relationship
  • 6. INTRODUCTION • If this was the linear relationship between Y and the xi, i = 1, . . . , r, then it would be possible – once the βi were learned – to exactly predict the response for any set of input values. • In practice, such precision is almost never attainable • The most that one can expect is that Equation would be valid subject to random error
  • 10. • Suppose that the responses Yi corresponding to the input values xi, i = 1, . . . , n are to be observed and used to estimate α and β in a simple linear regression model. • To determine estimators of α and β we reason as follows: – If A is the estimator of α and B of β, then the estimator of the response corresponding to the input variable xi would be A + Bxi. • Since the actual response is Yi , – the squared difference is (Yi − A − Bxi)2, and so if A and B are the estimators of α and β, – then the sum of the squared differences between the estimated responses – the actual response values—call it SS — is given by
  • 17. DISTRIBUTION OF THE ESTIMATORS • To specify the distribution of the estimators A and B, it is necessary to make additional assumptions about the random errors aside from just assuming that their mean is 0. • The usual approach is to assume that the random errors are independent normal random variables having mean 0 and variance σ2.
  • 18. DISTRIBUTION OF THE ESTIMATORS
  • 19. DISTRIBUTION OF THE ESTIMATORS
  • 20. DISTRIBUTION OF THE ESTIMATORS
  • 21. DISTRIBUTION OF THE ESTIMATORS
  • 22. DISTRIBUTION OF THE ESTIMATORS
  • 23. DISTRIBUTION OF THE ESTIMATORS • Remarks
  • 24. DISTRIBUTION OF THE ESTIMATORS
  • 25. DISTRIBUTION OF THE ESTIMATORS
  • 26. DISTRIBUTION OF THE ESTIMATORS • Notation:
  • 27. DISTRIBUTION OF THE ESTIMATORS
  • 28. DISTRIBUTION OF THE ESTIMATORS
  • 29. EXAMPLE • The following data relate – X: the moisture of a wet mix of a certain product – Y: the density of the finished product • Fit a linear curve to these data. Also determine SSR.
  • 32. STATISTICAL INFERENCES ABOUT THE REGRESSION PARAMETERS • Inferences Concerning β
  • 36. EXAMPLE • An individual claims that the fuel consumption of his automobile does not depend on how fast the car is driven. • To test the plausibility of this hypothesis, the car was tested at various speeds between 45 and 70 miles per hour. The miles per gallon attained at each of these speeds was determined, with the following data resulting is given. • Do these data refute the claim that the mileage per gallon of gas is unaffected by the speed at which the car is being driven?
  • 47. THE COEFFICIENT OF DETERMINATION AND THE SAMPLE CORRELATION COEFFICIENT
  • 48. THE COEFFICIENT OF DETERMINATION AND THE SAMPLE CORRELATION COEFFICIENT
  • 49. THE COEFFICIENT OF DETERMINATION AND THE SAMPLE CORRELATION COEFFICIENT
  • 50. ANALYSIS OF RESIDUALS: ASSESSING THE MODEL The figure shows that, as indicated both by its scatter diagram and the random nature of its standardized residuals, appears to fit the straight-line model quite well.
  • 51. The figure of the residual plot shows a discernible pattern, in that the residuals appear to be first decreasing and then increasing as the input level increases. This often means that higher-order (than just linear) terms are needed to describe the relationship between the input and response. Indeed, this is also indicated by the scatter diagram in this case.
  • 52. The standardized residual plot shows a pattern, in that the absolute value of the residuals, and thus their squares, appear to be increasing, as the input level increases. This often indicates that the variance of the response is not constant but, rather, increases with the input level.
  • 53. TRANSFORMING TO LINEARITY • The mean response is not a linear function • In such cases, if the form of the relationship can be determined it is sometimes possible, by a change of variables, to transform it into a linear form. • For instance, in certain applications it is known that W(t), the amplitude of a signal a time t after its origination, is approximately related to t by the functional form
  • 55. EXAMPLE • The following table gives the percentages of a chemical that were used up when an experiment was run at various temperatures (in degrees celsius). • Use it to estimate the percentage of the chemical that would be used up if the experiment were to be run at 350 degrees.
  • 56. EXAMPLE • Let P(x) be the percentage of the chemical that is used up when the experiment is run at 10x degrees. • Even though a plot of P(x) looks roughly linear, we can improve upon the fit by considering a nonlinear relationship between x and P(x). • Specifically, let us consider a relationship of the form : 1 − P(x) ≈ c(1 − d)x
  • 60. EXAMPLE • Fit a polynomial to the following data.