SlideShare a Scribd company logo
Prediction Method


By Rama Krishna Kompella
Multiple Regression
• MR is an intermediate prediction method, allowing:

• 2 or more (usually continuous) IVs

• 1 Continuous DV

• Want IVs relatively uncorrelated

• Want IVs correlated with DV

• Focus is on weights for IVs
Multiple Regression
• A regression model specifies a relation between a dependent
  variable Y and certain independent variables X1, …,XK.
  – Here “independence” is not in the sense of random variables; rather, it
    means that the value of Y depends on - or is determined by - the Xi
    variables.)
• A linear model sets
      Y = β1 + β1X1 + … + βkXK + ε,
   where ε is the error term.
• To use such a model, we need to have data on values of Y
  corresponding to values of the Xi's.
  – selling prices for various house features, past growth values for various
    economic conditions
When to Use MR?
o Standard: Examines how whole set of IVs relates to DV
o Combines all IVs at once to find multiple correlation
o Hierarchical: Examines several sets of IVs based on theory
o Researcher chooses order of variables entered in steps
o Stepwise: Examines IVs most highly correlated with DV
o Computer selects best IVs related to DV
o Conduct any of above in stand-alone MR analysis
o Conduct set of MRs as follow-up to significant Canonical
                                4
Correlation
Example
• Suppose we have data on sales of houses in some area.
  – For each house, we have complete information about its size,
    the number of bedrooms, bathrooms, total rooms, the size of
    the lot, the corresponding property tax, etc., and also the price
    at which the house was eventually sold.
  – Can we use this data to predict the selling price of a house
    currently on the market?
  – The first step is to postulate a model of how the various
    features of a house determine its selling price.
Example
– A linear model would have the following form:
   selling price = β0 + β1(sq.ft.) + β2 (no. bedrooms) + β3 (no. bath)
                  + β4 (no. acres) + β5 (taxes) + error
   • In this expression, β1 represents the increase in selling price for each
     additional square foot of area: it is the marginal cost of additional area.
   • β2 and β3 are the marginal costs of additional bedrooms and bathrooms,
     and so on.
   • The intercept β0 could in theory be thought of as the price of a house for
     which all the variables specified are zero; of course, no such house could
     exist, but including β0 gives us more flexibility in picking a model.
Example
  – The error reflects the fact that two houses with exactly the same
    characteristics need not sell for exactly the same price.
     • There is always some variability left over, even after we specify the value of a large
       number variables.
     • This variability is captured by an error term, which we will treat as a random
       variable.
• Regression analysis is a technique for using data to identify
  relationships among variables and use these relationships to make
 predictions.
Levels of advertising
• Determine appropriate levels of advertising and promotion for a
  particular market segment.
• Consider the problem of managing sales of beer at large college
  campuses.
   – Sales over, say, one semester might be influenced by ads in the college
     paper, ads on the campus radio station, sponsorship of sports-related
     events, sponsorship of contests, etc.
• Use data on advertising and promotional expenditures at many
  different campuses to tell us the marginal value of dollars spent in
  each category.
• A marketing strategy is designed accordingly.
• Set up a model of the following type:
   sales = β0 + β1(print budget) + β2(radio budget)
          + β3(sports promo budget) + β4(other promo) + error
General Research Questions:

• How do consumers make decisions about
  the foods that they eat?

• How do these decisions vary across
  cultures?



                                          9
More Specific Research Questions:

• What factors influence consumers’
  willingness to purchase genetically modified
  food products?

• Does the influence of these factors vary
  between U.S. and U.K. consumers?
Descriptive Statistics
                        U.S. and U.K. Students
                                                                 US            UK
                      N                                          44            33

                      Willingness to Purchase                   4.86          4.60

                      General Trust                            3.51*         3.22*

                      Cognitive Trust                          5.39*         4.65*

                      Affective Trust                          5.02*         4.41*

                      Technology                               5.17*         4.70*
All student data are on a 7 point scale except general trust which is on a 5 point scale.

†p < .10
*p < .05
Multiple Regression Results
       U.S. and U.K. Students
      Dependent Variable: WTP
             U.S.         U.K.    Combined
              β              β       β

General       .465†       .645†     .520*
Cognitive    -.422†        .121     -.167
Affective     .892*        .271     .649*
Technology     .179        .000      .000
Country         na          na      -1.09

N             44           31        75
R2            .54          .34       .46

               †p < .10
               *p < .05
Questions?

More Related Content

PPT
Chapter 2
PPT
2.7 other classifiers
PDF
DOSUG Intro to google prediction api
PPTX
Data mining
PPT
Day 10 prediction and regression
PPTX
Chapter 4 Classification
PDF
Regression Analysis
PPTX
Data mining: Classification and prediction
Chapter 2
2.7 other classifiers
DOSUG Intro to google prediction api
Data mining
Day 10 prediction and regression
Chapter 4 Classification
Regression Analysis
Data mining: Classification and prediction

Similar to T16 multiple regression (20)

PDF
Identification and ugc
PPTX
Marketing Experiment - Part II: Analysis
PPT
BRM-lecture-11.ppt
DOC
Mining Loyalty Card Data for Increased Competitiveness: Case of a leading Ret...
PPT
Analytical Design in Applied Marketing Research
PPT
Market Research using SPSS _ Edu4Sure Sept 2023.ppt
PDF
PPT
Abdm4064 week 04 research methodology
PPTX
How to avoid some of the pitfalls when deploying legacy targeting models da...
PPT
Research Design
PDF
Hy2208 final
PPTX
Hy2208 Final
PDF
Final presentation zg2088
PPTX
Regression Analysis presentation by Al Arizmendez and Cathryn Lottier
PDF
Bmgt 311 week_4
PDF
Cutting Edge Predictive Modeling For Classification
PDF
Bmgt 311 chapter_4
PPT
Research design & secondary data
PPTX
Chris and Andrew 20.4.2023.pptx
PPTX
wt2084 final presentation slides
Identification and ugc
Marketing Experiment - Part II: Analysis
BRM-lecture-11.ppt
Mining Loyalty Card Data for Increased Competitiveness: Case of a leading Ret...
Analytical Design in Applied Marketing Research
Market Research using SPSS _ Edu4Sure Sept 2023.ppt
Abdm4064 week 04 research methodology
How to avoid some of the pitfalls when deploying legacy targeting models da...
Research Design
Hy2208 final
Hy2208 Final
Final presentation zg2088
Regression Analysis presentation by Al Arizmendez and Cathryn Lottier
Bmgt 311 week_4
Cutting Edge Predictive Modeling For Classification
Bmgt 311 chapter_4
Research design & secondary data
Chris and Andrew 20.4.2023.pptx
wt2084 final presentation slides
Ad

More from kompellark (20)

PPT
T22 research report writing
PPT
Rubric assignment 2
PPT
Answers mid-term
PDF
Exam paper
PPT
T21 conjoint analysis
PPT
T20 cluster analysis
PPT
T19 factor analysis
PPT
T18 discriminant analysis
PPT
T17 correlation
PPT
T16 multiple regression
PPT
T15 ancova
PPT
T14 anova
PPT
T13 parametric tests
PPT
T11 types of tests
PPT
T15 ancova
PPT
T14 anova
PPT
T13 parametric tests
PPT
T12 non-parametric tests
PPT
T11 types of tests
PPT
T10 statisitical analysis
T22 research report writing
Rubric assignment 2
Answers mid-term
Exam paper
T21 conjoint analysis
T20 cluster analysis
T19 factor analysis
T18 discriminant analysis
T17 correlation
T16 multiple regression
T15 ancova
T14 anova
T13 parametric tests
T11 types of tests
T15 ancova
T14 anova
T13 parametric tests
T12 non-parametric tests
T11 types of tests
T10 statisitical analysis
Ad

Recently uploaded (20)

PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Approach and Philosophy of On baking technology
PDF
Empathic Computing: Creating Shared Understanding
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
Spectroscopy.pptx food analysis technology
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
A Presentation on Artificial Intelligence
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPT
Teaching material agriculture food technology
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
Machine Learning_overview_presentation.pptx
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
Building Integrated photovoltaic BIPV_UPV.pdf
Chapter 3 Spatial Domain Image Processing.pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
Per capita expenditure prediction using model stacking based on satellite ima...
Agricultural_Statistics_at_a_Glance_2022_0.pdf
gpt5_lecture_notes_comprehensive_20250812015547.pdf
The AUB Centre for AI in Media Proposal.docx
Approach and Philosophy of On baking technology
Empathic Computing: Creating Shared Understanding
Programs and apps: productivity, graphics, security and other tools
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Spectroscopy.pptx food analysis technology
Diabetes mellitus diagnosis method based random forest with bat algorithm
Encapsulation_ Review paper, used for researhc scholars
A Presentation on Artificial Intelligence
Review of recent advances in non-invasive hemoglobin estimation
Teaching material agriculture food technology
“AI and Expert System Decision Support & Business Intelligence Systems”
Machine Learning_overview_presentation.pptx
Advanced methodologies resolving dimensionality complications for autism neur...

T16 multiple regression

  • 1. Prediction Method By Rama Krishna Kompella
  • 2. Multiple Regression • MR is an intermediate prediction method, allowing: • 2 or more (usually continuous) IVs • 1 Continuous DV • Want IVs relatively uncorrelated • Want IVs correlated with DV • Focus is on weights for IVs
  • 3. Multiple Regression • A regression model specifies a relation between a dependent variable Y and certain independent variables X1, …,XK. – Here “independence” is not in the sense of random variables; rather, it means that the value of Y depends on - or is determined by - the Xi variables.) • A linear model sets Y = β1 + β1X1 + … + βkXK + ε, where ε is the error term. • To use such a model, we need to have data on values of Y corresponding to values of the Xi's. – selling prices for various house features, past growth values for various economic conditions
  • 4. When to Use MR? o Standard: Examines how whole set of IVs relates to DV o Combines all IVs at once to find multiple correlation o Hierarchical: Examines several sets of IVs based on theory o Researcher chooses order of variables entered in steps o Stepwise: Examines IVs most highly correlated with DV o Computer selects best IVs related to DV o Conduct any of above in stand-alone MR analysis o Conduct set of MRs as follow-up to significant Canonical 4 Correlation
  • 5. Example • Suppose we have data on sales of houses in some area. – For each house, we have complete information about its size, the number of bedrooms, bathrooms, total rooms, the size of the lot, the corresponding property tax, etc., and also the price at which the house was eventually sold. – Can we use this data to predict the selling price of a house currently on the market? – The first step is to postulate a model of how the various features of a house determine its selling price.
  • 6. Example – A linear model would have the following form: selling price = β0 + β1(sq.ft.) + β2 (no. bedrooms) + β3 (no. bath) + β4 (no. acres) + β5 (taxes) + error • In this expression, β1 represents the increase in selling price for each additional square foot of area: it is the marginal cost of additional area. • β2 and β3 are the marginal costs of additional bedrooms and bathrooms, and so on. • The intercept β0 could in theory be thought of as the price of a house for which all the variables specified are zero; of course, no such house could exist, but including β0 gives us more flexibility in picking a model.
  • 7. Example – The error reflects the fact that two houses with exactly the same characteristics need not sell for exactly the same price. • There is always some variability left over, even after we specify the value of a large number variables. • This variability is captured by an error term, which we will treat as a random variable. • Regression analysis is a technique for using data to identify relationships among variables and use these relationships to make predictions.
  • 8. Levels of advertising • Determine appropriate levels of advertising and promotion for a particular market segment. • Consider the problem of managing sales of beer at large college campuses. – Sales over, say, one semester might be influenced by ads in the college paper, ads on the campus radio station, sponsorship of sports-related events, sponsorship of contests, etc. • Use data on advertising and promotional expenditures at many different campuses to tell us the marginal value of dollars spent in each category. • A marketing strategy is designed accordingly. • Set up a model of the following type: sales = β0 + β1(print budget) + β2(radio budget) + β3(sports promo budget) + β4(other promo) + error
  • 9. General Research Questions: • How do consumers make decisions about the foods that they eat? • How do these decisions vary across cultures? 9
  • 10. More Specific Research Questions: • What factors influence consumers’ willingness to purchase genetically modified food products? • Does the influence of these factors vary between U.S. and U.K. consumers?
  • 11. Descriptive Statistics U.S. and U.K. Students US UK N 44 33 Willingness to Purchase 4.86 4.60 General Trust 3.51* 3.22* Cognitive Trust 5.39* 4.65* Affective Trust 5.02* 4.41* Technology 5.17* 4.70* All student data are on a 7 point scale except general trust which is on a 5 point scale. †p < .10 *p < .05
  • 12. Multiple Regression Results U.S. and U.K. Students Dependent Variable: WTP U.S. U.K. Combined β β β General .465† .645† .520* Cognitive -.422† .121 -.167 Affective .892* .271 .649* Technology .179 .000 .000 Country na na -1.09 N 44 31 75 R2 .54 .34 .46 †p < .10 *p < .05