SlideShare a Scribd company logo
Prediction Method


By Rama Krishna Kompella
Multiple Regression
• MR is an intermediate prediction method, allowing:

• 2 or more (usually continuous) IVs

• 1 Continuous DV

• Want IVs relatively uncorrelated

• Want IVs correlated with DV

• Focus is on weights for IVs
Multiple Regression
• A regression model specifies a relation between a dependent
  variable Y and certain independent variables X1, …,XK.
  – Here “independence” is not in the sense of random variables; rather, it
    means that the value of Y depends on - or is determined by - the Xi
    variables.)
• A linear model sets
      Y = β1 + β1X1 + … + βkXK + ε,
   where ε is the error term.
• To use such a model, we need to have data on values of Y
  corresponding to values of the Xi's.
  – selling prices for various house features, past growth values for various
    economic conditions
When to Use MR?
o Standard: Examines how whole set of IVs relates to DV
o Combines all IVs at once to find multiple correlation
o Hierarchical: Examines several sets of IVs based on theory
o Researcher chooses order of variables entered in steps
o Stepwise: Examines IVs most highly correlated with DV
o Computer selects best IVs related to DV
o Conduct any of above in stand-alone MR analysis
o Conduct set of MRs as follow-up to significant Canonical
                                4
Correlation
Example
• Suppose we have data on sales of houses in some area.
  – For each house, we have complete information about its size,
    the number of bedrooms, bathrooms, total rooms, the size of
    the lot, the corresponding property tax, etc., and also the price
    at which the house was eventually sold.
  – Can we use this data to predict the selling price of a house
    currently on the market?
  – The first step is to postulate a model of how the various
    features of a house determine its selling price.
Example
– A linear model would have the following form:
   selling price = β0 + β1(sq.ft.) + β2 (no. bedrooms) + β3 (no. bath)
                  + β4 (no. acres) + β5 (taxes) + error
   • In this expression, β1 represents the increase in selling price for each
     additional square foot of area: it is the marginal cost of additional area.
   • β2 and β3 are the marginal costs of additional bedrooms and bathrooms,
     and so on.
   • The intercept β0 could in theory be thought of as the price of a house for
     which all the variables specified are zero; of course, no such house could
     exist, but including β0 gives us more flexibility in picking a model.
Example
  – The error reflects the fact that two houses with exactly the same
    characteristics need not sell for exactly the same price.
     • There is always some variability left over, even after we specify the value of a large
       number variables.
     • This variability is captured by an error term, which we will treat as a random
       variable.
• Regression analysis is a technique for using data to identify
  relationships among variables and use these relationships to make
 predictions.
Levels of advertising
• Determine appropriate levels of advertising and promotion for a
  particular market segment.
• Consider the problem of managing sales of beer at large college
  campuses.
   – Sales over, say, one semester might be influenced by ads in the college
     paper, ads on the campus radio station, sponsorship of sports-related
     events, sponsorship of contests, etc.
• Use data on advertising and promotional expenditures at many
  different campuses to tell us the marginal value of dollars spent in
  each category.
• A marketing strategy is designed accordingly.
• Set up a model of the following type:
   sales = β0 + β1(print budget) + β2(radio budget)
          + β3(sports promo budget) + β4(other promo) + error
General Research Questions:

• How do consumers make decisions about
  the foods that they eat?

• How do these decisions vary across
  cultures?



                                          9
More Specific Research Questions:

• What factors influence consumers’
  willingness to purchase genetically modified
  food products?

• Does the influence of these factors vary
  between U.S. and U.K. consumers?
Descriptive Statistics
                        U.S. and U.K. Students
                                                                 US            UK
                      N                                          44            33

                      Willingness to Purchase                   4.86          4.60

                      General Trust                            3.51*         3.22*

                      Cognitive Trust                          5.39*         4.65*

                      Affective Trust                          5.02*         4.41*

                      Technology                               5.17*         4.70*
All student data are on a 7 point scale except general trust which is on a 5 point scale.

†p < .10
*p < .05
Multiple Regression Results
       U.S. and U.K. Students
      Dependent Variable: WTP
             U.S.         U.K.    Combined
              β              β       β

General       .465†       .645†     .520*
Cognitive    -.422†        .121     -.167
Affective     .892*        .271     .649*
Technology     .179        .000      .000
Country         na          na      -1.09

N             44           31        75
R2            .54          .34       .46

               †p < .10
               *p < .05
Questions?

More Related Content

PPT
Prediction of house price using multiple regression
PDF
Average performance prediction of elementary school using multiple regression
PPTX
House Sale Price Prediction
PPTX
Predicting crop yield and response to Nutrients from soil spectra at WCSS 201...
PPT
2010-11 CIARD - Bridging Rural Digital Divide (Brasil) - English
PDF
Ijetcas14 379
PPTX
Regression analysis
PDF
DOSUG Intro to google prediction api
Prediction of house price using multiple regression
Average performance prediction of elementary school using multiple regression
House Sale Price Prediction
Predicting crop yield and response to Nutrients from soil spectra at WCSS 201...
2010-11 CIARD - Bridging Rural Digital Divide (Brasil) - English
Ijetcas14 379
Regression analysis
DOSUG Intro to google prediction api

Viewers also liked (11)

PDF
Analysis of crop yield prediction using data mining techniques
PPTX
Predicting the future with Google Prediction API
DOCX
Scale Invariant Feature Tranform
PPTX
Prediction of House Sales Price
PPT
Data mining in agriculture
PPT
Michal Erel's SIFT presentation
PPT
Simple Linier Regression
PDF
Regression Analysis
PDF
Correlation and Simple Regression
PPT
Regression analysis ppt
PPTX
Slideshare ppt
Analysis of crop yield prediction using data mining techniques
Predicting the future with Google Prediction API
Scale Invariant Feature Tranform
Prediction of House Sales Price
Data mining in agriculture
Michal Erel's SIFT presentation
Simple Linier Regression
Regression Analysis
Correlation and Simple Regression
Regression analysis ppt
Slideshare ppt
Ad

Similar to T16 multiple regression (20)

PPTX
2010 06-03 pilot study 1950s with-basements
PPTX
2010 06-03 pilot study 1950s with-basements
PPTX
Math 221 week 1 lecture feb 2012
PDF
probability distribustion of chapter 7 and
PDF
Home Performance Labelling
PDF
2010 pilot study 1950s with basements
PDF
2010 Pilot Study Regression Analysis of 1950s Housing Stock
PPT
Exploring housing patterns and dynamics in low demand neighbourhoods using Ge...
PPT
demand forecasting
PPT
Chapter 20: investment Analysis & Fund Managemnet
PPT
bbch5.ppt.ppt
PDF
Some results on household subjective probabilities of future house prices
PPT
Economic NotesLipsey ppt ch02
PPT
Chapter 4 - multiple regression
PDF
Getting testing right
PDF
Resolving e commerce challenges with probabilistic programming
PDF
2016_Apres_Lares_EC_29set2016
PPTX
Math 221 week 1 lecture
PPTX
A review of net lift models
2010 06-03 pilot study 1950s with-basements
2010 06-03 pilot study 1950s with-basements
Math 221 week 1 lecture feb 2012
probability distribustion of chapter 7 and
Home Performance Labelling
2010 pilot study 1950s with basements
2010 Pilot Study Regression Analysis of 1950s Housing Stock
Exploring housing patterns and dynamics in low demand neighbourhoods using Ge...
demand forecasting
Chapter 20: investment Analysis & Fund Managemnet
bbch5.ppt.ppt
Some results on household subjective probabilities of future house prices
Economic NotesLipsey ppt ch02
Chapter 4 - multiple regression
Getting testing right
Resolving e commerce challenges with probabilistic programming
2016_Apres_Lares_EC_29set2016
Math 221 week 1 lecture
A review of net lift models
Ad

More from kompellark (20)

PPT
T22 research report writing
PPT
Rubric assignment 2
PPT
Answers mid-term
PDF
Exam paper
PPT
T21 conjoint analysis
PPT
T20 cluster analysis
PPT
T19 factor analysis
PPT
T18 discriminant analysis
PPT
T17 correlation
PPT
T15 ancova
PPT
T14 anova
PPT
T13 parametric tests
PPT
T11 types of tests
PPT
T15 ancova
PPT
T14 anova
PPT
T13 parametric tests
PPT
T12 non-parametric tests
PPT
T11 types of tests
PPT
T16 multiple regression
PPT
T10 statisitical analysis
T22 research report writing
Rubric assignment 2
Answers mid-term
Exam paper
T21 conjoint analysis
T20 cluster analysis
T19 factor analysis
T18 discriminant analysis
T17 correlation
T15 ancova
T14 anova
T13 parametric tests
T11 types of tests
T15 ancova
T14 anova
T13 parametric tests
T12 non-parametric tests
T11 types of tests
T16 multiple regression
T10 statisitical analysis

Recently uploaded (20)

PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Empathic Computing: Creating Shared Understanding
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Encapsulation theory and applications.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Electronic commerce courselecture one. Pdf
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PPT
Teaching material agriculture food technology
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
Mobile App Security Testing_ A Comprehensive Guide.pdf
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Empathic Computing: Creating Shared Understanding
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Encapsulation theory and applications.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
Electronic commerce courselecture one. Pdf
sap open course for s4hana steps from ECC to s4
Assigned Numbers - 2025 - Bluetooth® Document
Teaching material agriculture food technology
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Spectral efficient network and resource selection model in 5G networks
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Advanced methodologies resolving dimensionality complications for autism neur...
The AUB Centre for AI in Media Proposal.docx
Encapsulation_ Review paper, used for researhc scholars
Review of recent advances in non-invasive hemoglobin estimation
Reach Out and Touch Someone: Haptics and Empathic Computing

T16 multiple regression

  • 1. Prediction Method By Rama Krishna Kompella
  • 2. Multiple Regression • MR is an intermediate prediction method, allowing: • 2 or more (usually continuous) IVs • 1 Continuous DV • Want IVs relatively uncorrelated • Want IVs correlated with DV • Focus is on weights for IVs
  • 3. Multiple Regression • A regression model specifies a relation between a dependent variable Y and certain independent variables X1, …,XK. – Here “independence” is not in the sense of random variables; rather, it means that the value of Y depends on - or is determined by - the Xi variables.) • A linear model sets Y = β1 + β1X1 + … + βkXK + ε, where ε is the error term. • To use such a model, we need to have data on values of Y corresponding to values of the Xi's. – selling prices for various house features, past growth values for various economic conditions
  • 4. When to Use MR? o Standard: Examines how whole set of IVs relates to DV o Combines all IVs at once to find multiple correlation o Hierarchical: Examines several sets of IVs based on theory o Researcher chooses order of variables entered in steps o Stepwise: Examines IVs most highly correlated with DV o Computer selects best IVs related to DV o Conduct any of above in stand-alone MR analysis o Conduct set of MRs as follow-up to significant Canonical 4 Correlation
  • 5. Example • Suppose we have data on sales of houses in some area. – For each house, we have complete information about its size, the number of bedrooms, bathrooms, total rooms, the size of the lot, the corresponding property tax, etc., and also the price at which the house was eventually sold. – Can we use this data to predict the selling price of a house currently on the market? – The first step is to postulate a model of how the various features of a house determine its selling price.
  • 6. Example – A linear model would have the following form: selling price = β0 + β1(sq.ft.) + β2 (no. bedrooms) + β3 (no. bath) + β4 (no. acres) + β5 (taxes) + error • In this expression, β1 represents the increase in selling price for each additional square foot of area: it is the marginal cost of additional area. • β2 and β3 are the marginal costs of additional bedrooms and bathrooms, and so on. • The intercept β0 could in theory be thought of as the price of a house for which all the variables specified are zero; of course, no such house could exist, but including β0 gives us more flexibility in picking a model.
  • 7. Example – The error reflects the fact that two houses with exactly the same characteristics need not sell for exactly the same price. • There is always some variability left over, even after we specify the value of a large number variables. • This variability is captured by an error term, which we will treat as a random variable. • Regression analysis is a technique for using data to identify relationships among variables and use these relationships to make predictions.
  • 8. Levels of advertising • Determine appropriate levels of advertising and promotion for a particular market segment. • Consider the problem of managing sales of beer at large college campuses. – Sales over, say, one semester might be influenced by ads in the college paper, ads on the campus radio station, sponsorship of sports-related events, sponsorship of contests, etc. • Use data on advertising and promotional expenditures at many different campuses to tell us the marginal value of dollars spent in each category. • A marketing strategy is designed accordingly. • Set up a model of the following type: sales = β0 + β1(print budget) + β2(radio budget) + β3(sports promo budget) + β4(other promo) + error
  • 9. General Research Questions: • How do consumers make decisions about the foods that they eat? • How do these decisions vary across cultures? 9
  • 10. More Specific Research Questions: • What factors influence consumers’ willingness to purchase genetically modified food products? • Does the influence of these factors vary between U.S. and U.K. consumers?
  • 11. Descriptive Statistics U.S. and U.K. Students US UK N 44 33 Willingness to Purchase 4.86 4.60 General Trust 3.51* 3.22* Cognitive Trust 5.39* 4.65* Affective Trust 5.02* 4.41* Technology 5.17* 4.70* All student data are on a 7 point scale except general trust which is on a 5 point scale. †p < .10 *p < .05
  • 12. Multiple Regression Results U.S. and U.K. Students Dependent Variable: WTP U.S. U.K. Combined β β β General .465† .645† .520* Cognitive -.422† .121 -.167 Affective .892* .271 .649* Technology .179 .000 .000 Country na na -1.09 N 44 31 75 R2 .54 .34 .46 †p < .10 *p < .05