TitleSubtitleDetecting Attribute by Covariate interaction in discrete choice modelTHINK.CHANGE.DOANZMAC 2010 – Christ Church, New Zealand, 30 November 2010Kyuseop Kwak, Paul Wang, Jordan LouviereUniversity of Technology Sydney
Motivation of the researchDiscrete Choice Model based on Discrete Choice Experiment (DCE) is used widely in studying people’s choice behaviours in various context (Louviere et al 2000)
Identifying preference heterogeneity, e.g. segmentation, may require observable consumer characteristics or covariates
Online panel often provides more than 100 individual characteristics or covariates,
Of which many are categorical or nominally scaled, requiring more dummy coded variables  consumption of degrees of freedom
Thus, a systematic approach to screening important interactions of attribute by covariate is demanded2
Example DCE data: Carbon Trading SchemeCarbon trading scheme Discrete Choice Experiment (DCE) dataFive attributes: (1) starting years, (2) redistribution of tax, (3) exemption of transport sector, (4) R&D investment, and (5) special treatment for energy intensive sector
35 covariates: demographics, attitudes and opinions3
Modelling Preference HeterogeneityUnobserved Heterogeneity ModellingContinuous distribution (mixed logit, Train 2002; Hierarchical Bayes, Rossi and Allenby 2003)
Discrete distribution (latent class, Kamakura and Russell 1989)
Managerially not very helpfulObserved Heterogeneity ModellingCovariates such as demographics and other consumer characteristics may explain individual heterogeneity and are often managerially relevant
Concomitant latent class (Kamakura et al. 1994)
Interaction between attributes and covariates4
Proposed Approach to Identify Important InteractionsSelect cases where the option is chosen (y = 1) or simply weigh the stacked choice data using the dummy coded choice variable (y=1 or 0)Let each attribute be a dependent variable and other covariates such as demographics be independent variablesStage 1Run a series of logistic regressions (i.e., unconditional logit) andidentify significant covariates in the resultsSpecify conditional logit choice model with main effects andinteractions (attributes x covariates) identified in previous stepStage 25
Detecting Attribute by Covariate Interaction – Why it worksSuppose a simple binary choice, i.e., y = 0 or 1, with a single binary attribute, i.e., X = -1 or 1, and a single binary covariate, i.e., Z = -1 or 1
Contingency table where each cell represents number of respondents who make choice, is a simple way of analysing the interaction effects
Assume there are 100 respondents and their preferences are equally distributedNO Interaction between Z and XInteraction between Z and X6
Detecting Attribute by Covariate Interaction – Why it works7

More Related Content

DOCX
Nonparametric Methods and Evolutionary Algorithms in Genetic Epidemiology
PPTX
Banking case study
PDF
M.E.Bontempi-Panel data: Models, estimation,and the role of attrition and Mea...
PDF
Perspective of feature selection in bioinformatics
PDF
Regression models for panel data
PDF
Artikel Original Uji Sobel (Sobel Test)
DOCX
Sensitivity analysis
PDF
Feature selection for classification
Nonparametric Methods and Evolutionary Algorithms in Genetic Epidemiology
Banking case study
M.E.Bontempi-Panel data: Models, estimation,and the role of attrition and Mea...
Perspective of feature selection in bioinformatics
Regression models for panel data
Artikel Original Uji Sobel (Sobel Test)
Sensitivity analysis
Feature selection for classification

What's hot (16)

PDF
Strong Heredity Models in High Dimensional Data
PPT
Mixed models
PPTX
NCME_040916
ODP
Choice Models
PDF
因果関係の推論
PDF
Regression Discontinuity Method
PDF
Lehmann 1990
PPSX
Multivariate Analysis An Overview
PPTX
Sensitivity analysis
PDF
Ensemble models: theory and applications, Figini, Vezzoli. September, 3 2013
PDF
Commentary: Aedes albopictus and Aedes japonicus—two invasive mosquito specie...
PDF
PDF
Modeling Social Data, Lecture 6: Regression, Part 1
PPT
Diffmeans
PPTX
Discriminant analysis
PDF
Detecting Multicollinearity in Regression Analysis
Strong Heredity Models in High Dimensional Data
Mixed models
NCME_040916
Choice Models
因果関係の推論
Regression Discontinuity Method
Lehmann 1990
Multivariate Analysis An Overview
Sensitivity analysis
Ensemble models: theory and applications, Figini, Vezzoli. September, 3 2013
Commentary: Aedes albopictus and Aedes japonicus—two invasive mosquito specie...
Modeling Social Data, Lecture 6: Regression, Part 1
Diffmeans
Discriminant analysis
Detecting Multicollinearity in Regression Analysis
Ad

Viewers also liked (10)

PDF
Valuing Health at the End of Life: Defining Public Preferences
PPTX
Cadth 2015 e3 eq5 d
PPTX
Accessing Farmers' Willingness-To-PaY For Improved Common Beans Varieties
PDF
Consumers' willingness to pay for improved water services in ilorin metropoli...
PPTX
Cadth 2015 a4 2015 cadth values
PPTX
Emerging Picture of Value Based Pricing
PPTX
How to Run Discrete Choice Conjoint Analysis
PDF
Introduction to Design of Experiments by Teck Nam Ang (University of Malaya)
PPT
Writing the 'Discussion and Analysis'
PPT
Chapter 10-DATA ANALYSIS & PRESENTATION
Valuing Health at the End of Life: Defining Public Preferences
Cadth 2015 e3 eq5 d
Accessing Farmers' Willingness-To-PaY For Improved Common Beans Varieties
Consumers' willingness to pay for improved water services in ilorin metropoli...
Cadth 2015 a4 2015 cadth values
Emerging Picture of Value Based Pricing
How to Run Discrete Choice Conjoint Analysis
Introduction to Design of Experiments by Teck Nam Ang (University of Malaya)
Writing the 'Discussion and Analysis'
Chapter 10-DATA ANALYSIS & PRESENTATION
Ad

Similar to Detecting Attributes and Covariates Interaction in Discrete Choice Model (20)

PDF
JSM2013,Proceedings,paper307699_79238,DSweitzer
PPT
Episode 18 : Research Methodology ( Part 8 )
PPT
Episode 12 : Research Methodology ( Part 2 )
PDF
Nbe rcausalpredictionv111 lecture2
PDF
A Modified KS-test for Feature Selection
PDF
the unconditional Logistic Regression .pdf
PDF
2013jsm,Proceedings,DSweitzer,26sep
PPTX
Modeling_fundamentals For MsC environment science
PDF
Empirical Analysis of the Bias-Variance Tradeoff Across Machine Learning Models
PDF
Empirical Analysis of the Bias-Variance Tradeoff Across Machine Learning Models
PDF
Optimal Stopping Report Final
PPTX
Data science
PPT
Topic_6
PPTX
Sensitivity Analysis
PDF
Estimation of parameters of linear econometric model and the power of test in...
PPT
Experimental
PDF
debatrim_report (1)
PDF
Applied Business Statistics ,ken black , ch 3 part 1
PPT
Prote-OMIC Data Analysis and Visualization
PDF
Feature selection in multimodal
JSM2013,Proceedings,paper307699_79238,DSweitzer
Episode 18 : Research Methodology ( Part 8 )
Episode 12 : Research Methodology ( Part 2 )
Nbe rcausalpredictionv111 lecture2
A Modified KS-test for Feature Selection
the unconditional Logistic Regression .pdf
2013jsm,Proceedings,DSweitzer,26sep
Modeling_fundamentals For MsC environment science
Empirical Analysis of the Bias-Variance Tradeoff Across Machine Learning Models
Empirical Analysis of the Bias-Variance Tradeoff Across Machine Learning Models
Optimal Stopping Report Final
Data science
Topic_6
Sensitivity Analysis
Estimation of parameters of linear econometric model and the power of test in...
Experimental
debatrim_report (1)
Applied Business Statistics ,ken black , ch 3 part 1
Prote-OMIC Data Analysis and Visualization
Feature selection in multimodal

Detecting Attributes and Covariates Interaction in Discrete Choice Model

  • 1. TitleSubtitleDetecting Attribute by Covariate interaction in discrete choice modelTHINK.CHANGE.DOANZMAC 2010 – Christ Church, New Zealand, 30 November 2010Kyuseop Kwak, Paul Wang, Jordan LouviereUniversity of Technology Sydney
  • 2. Motivation of the researchDiscrete Choice Model based on Discrete Choice Experiment (DCE) is used widely in studying people’s choice behaviours in various context (Louviere et al 2000)
  • 3. Identifying preference heterogeneity, e.g. segmentation, may require observable consumer characteristics or covariates
  • 4. Online panel often provides more than 100 individual characteristics or covariates,
  • 5. Of which many are categorical or nominally scaled, requiring more dummy coded variables  consumption of degrees of freedom
  • 6. Thus, a systematic approach to screening important interactions of attribute by covariate is demanded2
  • 7. Example DCE data: Carbon Trading SchemeCarbon trading scheme Discrete Choice Experiment (DCE) dataFive attributes: (1) starting years, (2) redistribution of tax, (3) exemption of transport sector, (4) R&D investment, and (5) special treatment for energy intensive sector
  • 8. 35 covariates: demographics, attitudes and opinions3
  • 9. Modelling Preference HeterogeneityUnobserved Heterogeneity ModellingContinuous distribution (mixed logit, Train 2002; Hierarchical Bayes, Rossi and Allenby 2003)
  • 10. Discrete distribution (latent class, Kamakura and Russell 1989)
  • 11. Managerially not very helpfulObserved Heterogeneity ModellingCovariates such as demographics and other consumer characteristics may explain individual heterogeneity and are often managerially relevant
  • 12. Concomitant latent class (Kamakura et al. 1994)
  • 14. Proposed Approach to Identify Important InteractionsSelect cases where the option is chosen (y = 1) or simply weigh the stacked choice data using the dummy coded choice variable (y=1 or 0)Let each attribute be a dependent variable and other covariates such as demographics be independent variablesStage 1Run a series of logistic regressions (i.e., unconditional logit) andidentify significant covariates in the resultsSpecify conditional logit choice model with main effects andinteractions (attributes x covariates) identified in previous stepStage 25
  • 15. Detecting Attribute by Covariate Interaction – Why it worksSuppose a simple binary choice, i.e., y = 0 or 1, with a single binary attribute, i.e., X = -1 or 1, and a single binary covariate, i.e., Z = -1 or 1
  • 16. Contingency table where each cell represents number of respondents who make choice, is a simple way of analysing the interaction effects
  • 17. Assume there are 100 respondents and their preferences are equally distributedNO Interaction between Z and XInteraction between Z and X6
  • 18. Detecting Attribute by Covariate Interaction – Why it works7
  • 19. Empirical StudyMonte-Carlo SimulationBased upon banking choice data (Kamakura and Wedel 1994)
  • 20. Assumed four attributes with three levels each (3^4) and five covariates
  • 21. Varied sample sizes (100, 300, 600, 900 and 1200 individuals) and 9 choice sets per each individual
  • 22. Effect sizes (parameters) of both main and Interactions were manipulatedProposed approach versus Full Interaction ModelIdentify important interactions suggested by proposed approach
  • 23. Calibrate three groups of models: (1) main effects only, (2) main + proposed interactions, and (3) full interactions
  • 24. Compare model fit statistics across various sample sizes
  • 25. Compare parameter recoveries (bias or error) with true parameter values8
  • 27. Monte-Carlo Simulation: Five individual difference variables (covariates)r(G,A) = 0r(G,I) = 0.4r(G,E) = 0r(G,D) = 0Genderr(E,A) = 0r(E,D) = 0.2r(E,I) = 0.7Educationr(I,D) = 0.4r(I,A) = 0.2Incomer(D,A) = 0.3Deposit# AccountGenderEducationIncomeDeposit# Account10
  • 28. Monte-Carlo Simulation: Parameter SetupAttributesNote: ‘n.a.’ stands for ‘Not Applicable’, i.e., no parameter assumed in the simulation setup11
  • 29. Stage 1 - Unconditional Logit– Detecting interactions based on p-values Minimum BalanceCheck FeeMonthly FeeATM FeeThe performance depends on sample size as well as assumed effect sizeHighly continuous variables are relatively hard to be detectedHigh correlations among covariates made detection harder12
  • 30. Stage 2 – Choice Model (Conditional Logit): Fit StatisticsN=300N=600N=900N=1200Fit statistics of proposed model is very close to full interaction modelHOWEVER, ‘BIC’ always picks the proposed model as the best13
  • 31. Stage 2 – Choice Model (Conditional Logit): Parameter Recovery (bias)Mean Absolute Error (MAE)Mean Absolute Percentage Error (MAPE)The models based on the proposed approach produce smaller biases across the samples14
  • 32. Summary of findings and DiscussionsStage 1 - Unconditional logit: Sensitivity to sample size and effect sizeThe larger the sample size is, the better performance in general due to low S.E.
  • 33. Actual effect sizes also influence the performance of the proposed approach
  • 34. Correlations among covariates also play a role  conduct factor analysis first to make variables independentStage 2 - Conditional logit: Fit statistics versus Parameter BiasFit statistics are very close between proposed and full interaction model
  • 35. HOWEVER, Bayesian Information Criteria (BIC) supports the proposed model due to its parsimony
  • 36. In addition, the proposed model produces consistent estimates and lower biases regardless of sample size15
  • 37. ConclusionsA novel quick & easy approach
  • 38. New insights into modelling individual heterogeneity
  • 39. More replication studies are needed using both simulated data and actual SP or RP data
  • 40. More power analyses are required to fully understand the impact of sample size and effect size on the performance of the proposed approach
  • 41. Explore different techniques such as CART, MARS, etc16
  • 43. Stage 2 - Conditional Logit: Parameter Recovery (bias)Mean Error (ME)Duh!, The larger sample, the closer estimatesBoth produce equally consistent estimates18