SlideShare a Scribd company logo
2
Most read
3
Most read
5
Most read
LOGISTIC
REGRESSION
Senior Lecturer King
Faisal University,KSA
By Dr Zahid Khan
1
What is logistic regression?
Logit for short, a specialized form of regression
used when the dependent variable is dichotomous and categorical while
the independent variable(s) could be any type.
• Dependent variable is binary (yes/no, Male/Female ) outcome.
• Independent variables are continuous interval
• Examples:
• Relation of weight and BP to 10 year risk of death
• Relation of CD4 count to 1 year risk of AIDS diagnosis
2
Logistic Regression
• Form of regression that allows the prediction of discrete
variables by a mix of continuous and discrete/nominal
predictors.
• Addresses the same questions that discriminant
function analysis and multiple regression do but with no
distributional assumptions on the predictors (the
predictors do not have to be normally distributed,
linearly related or have equal variance in each group)
• Does not assume a linear relationship between DV and
IV
3
Why we use
logistic ?
No assumptions about the distributions of the predictor variables.
Predictors do not have to be normally distributed
Does not have to be linearly related.
When equal variances , covariance doesn't exist across the groups.
Predictor variables is not parametric and there is no
homoscedasticity. E.g variance of dependent and independent
variables are not equal.
4
Stage 1:
Objectives Of logistic regression
 Identify the independent variable that impact in
the dependent variable
 Establishing classification system based on the
logistic model for determining the group
membership
DECISION PROCESS
Types of logistic regression
• BINARY LOGISTIC REGRESSION
It is used when the dependent variable is dichotomous.
MULTINOMIAL LOGISTIC REGRESSION
It is used when the dependent or outcomes variable has more than
two categories.
Linear Regression
Independent Variable Dependent Variable
7
Logistic Regression
Independent Variable Dependent Variable
8
Binary logistic regression expression
Y = Dependent Variables
ß˚ = Constant
ß1 = Coefficient of variable X1
X1 = Independent Variables
E = Error Term
BINARY
SAMPLE SIZE
Very small samples have so much sampling
errors.
Very large sample size decreases the chances of
errors.
Logistic requires larger sample size than multiple
regression.
Hosmer and Lamshow recommended sample
size greater than 400.
SAMPLE SIZE PER CATEGORY OF THE
INDEPENDENT VARIABLE
 The recommended sample size for each group is at
least 10 observations per estimated parameters.
Estimation of logistic regression model
assessing overall fit
Logistic relationship describe earlier in both estimating the logistic
model and establishing the relationship between the dependent
and independent variables.
Result is a unique transformation of dependent variables which
impacts not only the estimation process but also the resulting
coefficients of independent variables.
Transforming the dependent
variable
S-shaped
Range (0-1)
Description of the data
The data used to conduct logistic regression is from a survey of 30
homeowners conducted by an electricity company about an offer of roof
solar panels with a 50% subsidy from the state government as part of the
state’s environmental policy.
The variables are:
•IVs: household income measured in units of a thousand dollars
age of householder
monthly mortgage
size of family household
•DV: whether the householder would take or decline the offer. Take the
offer was coded as 1 and decline the offer was coded as 0.
WHAT IS THE RESEARCH QUESTION?
To determine whether household income and monthly
mortgage will predict taking or declining the solar panel
offer
Independent Variables: household income and monthly
mortgage
Dependent Variables: Take the offer or decline the offer
Two hypotheses to be tested
• There are two hypotheses to test in relation to the overall fit of the
model:
H0: The model is a good fitting model
H1: The model is not a good fitting model (i.e. the predictors have a
significant effect)
How to perform logistic regression in spss
1) Click Analyze
2) Select Regression
3) Select Binary Logistic
4) Select the dependent variable, the one which is a grouping
variable (0 and 1) and place it into the Dependent Box, in this
case, take or decline offer
5) Enter the predictors (IVs) that you want to test into the
Covariates Box. In this case, Household Income and Monthly
Mortgage
6) Leave Enter as the default method
Continuation of SPSS Steps
•7) If there is any categorical IV, click on Categorical button and enter it.
There is none in this case.
•8) In the Options button, select Classification Plots, Hosmer-Lemeshow
goodness-of-fit, Casewise Listing of residuals. Retain default entries for
probability of stepwise, classification cutoff, and maximum iterations
•9) Continue, then, OK
Any Questions !!!
• Thank You
20

More Related Content

PPTX
Logistic regression
PPTX
logisticregression-120102011227-phpapp01.pptx
PPTX
Logistic regression
PPTX
Module5.slp
PPTX
Module5.slp
PPTX
Module5.slp
PPTX
Logistic regression
PPTX
Correlation & Regression.pptx
Logistic regression
logisticregression-120102011227-phpapp01.pptx
Logistic regression
Module5.slp
Module5.slp
Module5.slp
Logistic regression
Correlation & Regression.pptx

Similar to Log reg pdf.pdf (20)

PDF
Regression analysis made easy
PDF
Introduction to Limited Dependent variable
PPTX
STATISTICAL REGRESSION MODELS
PPTX
Regression analysis - Estimation of relationships between the variables
PDF
Logistic regression sage
PPT
A presentation for Multiple linear regression.ppt
PPTX
A Topic on REGRESSION Analysis conducted pptx
PPTX
Predictive analytics and Type of Predictive Analytics
PPTX
SSP PRESENTATION COMPLETE ( ADVANCE ) .pptx
PDF
Regression
PPTX
PPT_logistic regression.pptx
PPTX
Logit and Probit and Tobit model: Basic Introduction
PPTX
Regression of research methodlogyyy.pptx
PPTX
Multicollinearity PPT
PPTX
Dependence Techniques
PPTX
Lecture_16_Hope_to_skills.pptx
PPTX
Presentation1.econometrics 2 by habtamuu
PPTX
Regression analysis
PPTX
Bus 173_6.pptx
PPTX
An Introduction to Regression Models: Linear and Logistic approaches
Regression analysis made easy
Introduction to Limited Dependent variable
STATISTICAL REGRESSION MODELS
Regression analysis - Estimation of relationships between the variables
Logistic regression sage
A presentation for Multiple linear regression.ppt
A Topic on REGRESSION Analysis conducted pptx
Predictive analytics and Type of Predictive Analytics
SSP PRESENTATION COMPLETE ( ADVANCE ) .pptx
Regression
PPT_logistic regression.pptx
Logit and Probit and Tobit model: Basic Introduction
Regression of research methodlogyyy.pptx
Multicollinearity PPT
Dependence Techniques
Lecture_16_Hope_to_skills.pptx
Presentation1.econometrics 2 by habtamuu
Regression analysis
Bus 173_6.pptx
An Introduction to Regression Models: Linear and Logistic approaches
Ad

Recently uploaded (20)

PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPTX
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
PPTX
IB Computer Science - Internal Assessment.pptx
PDF
Foundation of Data Science unit number two notes
PPTX
Database Infoormation System (DBIS).pptx
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPTX
1_Introduction to advance data techniques.pptx
PDF
Launch Your Data Science Career in Kochi – 2025
PPTX
Moving the Public Sector (Government) to a Digital Adoption
PPT
Quality review (1)_presentation of this 21
PPTX
Global journeys: estimating international migration
PPT
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn
PPTX
Computer network topology notes for revision
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PDF
Mega Projects Data Mega Projects Data
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
IB Computer Science - Internal Assessment.pptx
Foundation of Data Science unit number two notes
Database Infoormation System (DBIS).pptx
STUDY DESIGN details- Lt Col Maksud (21).pptx
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
1_Introduction to advance data techniques.pptx
Launch Your Data Science Career in Kochi – 2025
Moving the Public Sector (Government) to a Digital Adoption
Quality review (1)_presentation of this 21
Global journeys: estimating international migration
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn
Computer network topology notes for revision
Introduction-to-Cloud-ComputingFinal.pptx
Mega Projects Data Mega Projects Data
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
Ad

Log reg pdf.pdf

  • 1. LOGISTIC REGRESSION Senior Lecturer King Faisal University,KSA By Dr Zahid Khan 1
  • 2. What is logistic regression? Logit for short, a specialized form of regression used when the dependent variable is dichotomous and categorical while the independent variable(s) could be any type. • Dependent variable is binary (yes/no, Male/Female ) outcome. • Independent variables are continuous interval • Examples: • Relation of weight and BP to 10 year risk of death • Relation of CD4 count to 1 year risk of AIDS diagnosis 2
  • 3. Logistic Regression • Form of regression that allows the prediction of discrete variables by a mix of continuous and discrete/nominal predictors. • Addresses the same questions that discriminant function analysis and multiple regression do but with no distributional assumptions on the predictors (the predictors do not have to be normally distributed, linearly related or have equal variance in each group) • Does not assume a linear relationship between DV and IV 3
  • 4. Why we use logistic ? No assumptions about the distributions of the predictor variables. Predictors do not have to be normally distributed Does not have to be linearly related. When equal variances , covariance doesn't exist across the groups. Predictor variables is not parametric and there is no homoscedasticity. E.g variance of dependent and independent variables are not equal. 4
  • 5. Stage 1: Objectives Of logistic regression  Identify the independent variable that impact in the dependent variable  Establishing classification system based on the logistic model for determining the group membership DECISION PROCESS
  • 6. Types of logistic regression • BINARY LOGISTIC REGRESSION It is used when the dependent variable is dichotomous. MULTINOMIAL LOGISTIC REGRESSION It is used when the dependent or outcomes variable has more than two categories.
  • 9. Binary logistic regression expression Y = Dependent Variables ß˚ = Constant ß1 = Coefficient of variable X1 X1 = Independent Variables E = Error Term BINARY
  • 10. SAMPLE SIZE Very small samples have so much sampling errors. Very large sample size decreases the chances of errors. Logistic requires larger sample size than multiple regression. Hosmer and Lamshow recommended sample size greater than 400.
  • 11. SAMPLE SIZE PER CATEGORY OF THE INDEPENDENT VARIABLE  The recommended sample size for each group is at least 10 observations per estimated parameters.
  • 12. Estimation of logistic regression model assessing overall fit Logistic relationship describe earlier in both estimating the logistic model and establishing the relationship between the dependent and independent variables. Result is a unique transformation of dependent variables which impacts not only the estimation process but also the resulting coefficients of independent variables.
  • 14. Description of the data The data used to conduct logistic regression is from a survey of 30 homeowners conducted by an electricity company about an offer of roof solar panels with a 50% subsidy from the state government as part of the state’s environmental policy. The variables are: •IVs: household income measured in units of a thousand dollars age of householder monthly mortgage size of family household •DV: whether the householder would take or decline the offer. Take the offer was coded as 1 and decline the offer was coded as 0.
  • 15. WHAT IS THE RESEARCH QUESTION? To determine whether household income and monthly mortgage will predict taking or declining the solar panel offer Independent Variables: household income and monthly mortgage Dependent Variables: Take the offer or decline the offer
  • 16. Two hypotheses to be tested • There are two hypotheses to test in relation to the overall fit of the model: H0: The model is a good fitting model H1: The model is not a good fitting model (i.e. the predictors have a significant effect)
  • 17. How to perform logistic regression in spss 1) Click Analyze 2) Select Regression 3) Select Binary Logistic 4) Select the dependent variable, the one which is a grouping variable (0 and 1) and place it into the Dependent Box, in this case, take or decline offer 5) Enter the predictors (IVs) that you want to test into the Covariates Box. In this case, Household Income and Monthly Mortgage 6) Leave Enter as the default method
  • 18. Continuation of SPSS Steps •7) If there is any categorical IV, click on Categorical button and enter it. There is none in this case. •8) In the Options button, select Classification Plots, Hosmer-Lemeshow goodness-of-fit, Casewise Listing of residuals. Retain default entries for probability of stepwise, classification cutoff, and maximum iterations •9) Continue, then, OK
  • 19. Any Questions !!! • Thank You 20