SlideShare a Scribd company logo
The Case for Generalized Estimating Equations
in State-level Analysis
Tucker Staley
Department of Political Science
University of Central Arkansas
tstaley@uca.edu
Prepared for the Annual Meeting of the Southern Political Science Association, Jan. 9-11, 2014.
New Orleans, LA.
Correlated Data
 Assume data are independent and identically
distributed.
 However, often not the case.
 Panel studies
 Cross-sectional time-series
 Dyadic studies
 Decision making environments
 Specifically concerned with intra-class
correlations resulting from grouped
observations.
Dealing with Intraclass Correlations
 Adjust standard errors of GLM
 Ignore impact of coefficient estimates
 Better Options
 Generalized Linear Mixed Models
 Generalized Estimating Equations
GLMM
 Most common
 Vectors for both fixed and random effects
accounted for
 Generalized for non-normal responses with a
known link function
Issues with GLMM
 Designed specifically for exchangeable
correlation between groups (clusters).
 For the most part, mainly allows for an
individual level interpretation.
GEE
 Marginal model
 population-averaged expectations of the dependent
variables as functions of the covariates
 No individual effects included
 intracluster variation accounted for by adjusting the
covariance matrix
 Average effect across entire sub-population
 Note: We get this interpretation with independent observations
in any model and when there is an identity link in hierarchical
models.
 Flexible correlation structures (“working
correlation”)
GEE Model Overview
 Extension of a generalized linear model (GLM)
 postulate relationship b/t DV and IV and the
conditional mean and variance of DV
 GEE reduces to GLM when T=1
 estimates are solutions to a set of “quasi-score”
differential equations
 residuals from Fisher scoring used to consistently
estimate structure of the unknown parameters
Model Specification
 Most Simply
 Goal: minimize this objective function
Process
 Estimate coefficients iteratively.
 Estimate regression coefficients.
 Use residuals from these to estimate the covariance
term.
 Repeat until convergence.
 Does not require a lot of parametric
assumptions as does MLE.
Issues w/ GEE
 Specify the correct correlation structure
 Should be based on theory. However, asymptotically efficient if incorrect.
 Sample size
 Smaller N may have issues with convergence.
 Missing Data
 Important to specify true correlation matrix.
 Goodness-of-fit
 Correlated residuals, so can't use standard statistics.
 Uncertainty
 Need to adjust s.e. (robust, bootstrapped)
Initiative and Party Power
 Justin Phillips (2008)
 “Direct democracy alters the ability of partisan
legislative majorities...to shape the size...of the
public sector.”
 DV: Tax effort
 IVs:
 Partisan Control
 State-level characteristics
 Interactions
The Case for Generalized Estimating Equations in State-level Analysis
100 Miles of Dry
 John Frendreis & Raymond Tatalovich (2010)
 What factors identified as important for
Prohibition remain important today?
 DV: Dry county
 IVs:
 Religion
 Demographics
 Partisan voting
The Case for Generalized Estimating Equations in State-level Analysis
Conclusions
 Often deal with correlated data in the real-world.
 GEE allows us to deal with intraclass
correlations and produces efficient coefficient
estimates.
 More flexible than GLMM: correlation
structures, interpretation
 Estimates may differ once correlated data are
accounted for.
 Toss in the methodological toolbox.

More Related Content

PPT
Econometric model ing
PDF
Thesis_NickyGrant_2013
PPT
Google earth engine_GEE_Engineering_Presentation2.pptx
PPTX
Ps602 notes part1
PDF
Wooldridge (2011), Difference in difference estimation.pdf
PPT
MEASUREMENT.ppt for business research methods
PDF
Priliminary Research on Multi-Dimensional Panel Data Modeling
PDF
Student Solutions Manual To Introductory Econometrics 2nd Edition Jeffrey M W...
Econometric model ing
Thesis_NickyGrant_2013
Google earth engine_GEE_Engineering_Presentation2.pptx
Ps602 notes part1
Wooldridge (2011), Difference in difference estimation.pdf
MEASUREMENT.ppt for business research methods
Priliminary Research on Multi-Dimensional Panel Data Modeling
Student Solutions Manual To Introductory Econometrics 2nd Edition Jeffrey M W...

Similar to The Case for Generalized Estimating Equations in State-level Analysis (20)

PPTX
GLM_2020_21.pptx
PDF
Session III - Census and Registers - S. Falorsi, A. Fasulo, Census and Soc...
PPTX
correction maximum likelihood estimation method
PDF
Full Download An Introduction to Generalized Linear Models 3rd Edition Annett...
PDF
Nilanjan growth and development
PDF
Warshaw initial evaluation_indiana_210915
PDF
Eviews forecasting
PDF
Greene_econometric_5ta edición.pdf
PPTX
SURE Model_Panel data.pptx
PDF
Advanced Applied Econometrics Panel-Dynamic
PDF
An Introduction to Generalized Linear Models 3rd Edition Annette J. Dobson
PDF
Statistical analysis of correlated data using generalized estimating equation...
PDF
Differences-in-Differences
PDF
An Introduction to Generalized Linear Models 3rd Edition Annette J. Dobson
PDF
Panel data methods for microeconometrics using Stata! Short and good one :)
PDF
Linear models for data science
PDF
Sigma xi showcase_2013_draft0
PDF
1_Introduction_printable.pdf
PPTX
Static Models of Continuous Variables
PPT
14.20 o2 j clare
GLM_2020_21.pptx
Session III - Census and Registers - S. Falorsi, A. Fasulo, Census and Soc...
correction maximum likelihood estimation method
Full Download An Introduction to Generalized Linear Models 3rd Edition Annett...
Nilanjan growth and development
Warshaw initial evaluation_indiana_210915
Eviews forecasting
Greene_econometric_5ta edición.pdf
SURE Model_Panel data.pptx
Advanced Applied Econometrics Panel-Dynamic
An Introduction to Generalized Linear Models 3rd Edition Annette J. Dobson
Statistical analysis of correlated data using generalized estimating equation...
Differences-in-Differences
An Introduction to Generalized Linear Models 3rd Edition Annette J. Dobson
Panel data methods for microeconometrics using Stata! Short and good one :)
Linear models for data science
Sigma xi showcase_2013_draft0
1_Introduction_printable.pdf
Static Models of Continuous Variables
14.20 o2 j clare
Ad

Recently uploaded (20)

PPTX
Lesson notes of climatology university.
PDF
Microbial disease of the cardiovascular and lymphatic systems
PDF
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
PPTX
Radiologic_Anatomy_of_the_Brachial_plexus [final].pptx
PDF
Computing-Curriculum for Schools in Ghana
PDF
LDMMIA Reiki Yoga Finals Review Spring Summer
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PPTX
History, Philosophy and sociology of education (1).pptx
PDF
Complications of Minimal Access Surgery at WLH
PDF
LNK 2025 (2).pdf MWEHEHEHEHEHEHEHEHEHEHE
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
RMMM.pdf make it easy to upload and study
PDF
Trump Administration's workforce development strategy
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PDF
01-Introduction-to-Information-Management.pdf
PDF
What if we spent less time fighting change, and more time building what’s rig...
PPTX
UV-Visible spectroscopy..pptx UV-Visible Spectroscopy – Electronic Transition...
PDF
Yogi Goddess Pres Conference Studio Updates
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
Lesson notes of climatology university.
Microbial disease of the cardiovascular and lymphatic systems
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
Radiologic_Anatomy_of_the_Brachial_plexus [final].pptx
Computing-Curriculum for Schools in Ghana
LDMMIA Reiki Yoga Finals Review Spring Summer
STATICS OF THE RIGID BODIES Hibbelers.pdf
History, Philosophy and sociology of education (1).pptx
Complications of Minimal Access Surgery at WLH
LNK 2025 (2).pdf MWEHEHEHEHEHEHEHEHEHEHE
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
Anesthesia in Laparoscopic Surgery in India
RMMM.pdf make it easy to upload and study
Trump Administration's workforce development strategy
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
01-Introduction-to-Information-Management.pdf
What if we spent less time fighting change, and more time building what’s rig...
UV-Visible spectroscopy..pptx UV-Visible Spectroscopy – Electronic Transition...
Yogi Goddess Pres Conference Studio Updates
2.FourierTransform-ShortQuestionswithAnswers.pdf
Ad

The Case for Generalized Estimating Equations in State-level Analysis

  • 1. The Case for Generalized Estimating Equations in State-level Analysis Tucker Staley Department of Political Science University of Central Arkansas tstaley@uca.edu Prepared for the Annual Meeting of the Southern Political Science Association, Jan. 9-11, 2014. New Orleans, LA.
  • 2. Correlated Data  Assume data are independent and identically distributed.  However, often not the case.  Panel studies  Cross-sectional time-series  Dyadic studies  Decision making environments  Specifically concerned with intra-class correlations resulting from grouped observations.
  • 3. Dealing with Intraclass Correlations  Adjust standard errors of GLM  Ignore impact of coefficient estimates  Better Options  Generalized Linear Mixed Models  Generalized Estimating Equations
  • 4. GLMM  Most common  Vectors for both fixed and random effects accounted for  Generalized for non-normal responses with a known link function
  • 5. Issues with GLMM  Designed specifically for exchangeable correlation between groups (clusters).  For the most part, mainly allows for an individual level interpretation.
  • 6. GEE  Marginal model  population-averaged expectations of the dependent variables as functions of the covariates  No individual effects included  intracluster variation accounted for by adjusting the covariance matrix  Average effect across entire sub-population  Note: We get this interpretation with independent observations in any model and when there is an identity link in hierarchical models.  Flexible correlation structures (“working correlation”)
  • 7. GEE Model Overview  Extension of a generalized linear model (GLM)  postulate relationship b/t DV and IV and the conditional mean and variance of DV  GEE reduces to GLM when T=1  estimates are solutions to a set of “quasi-score” differential equations  residuals from Fisher scoring used to consistently estimate structure of the unknown parameters
  • 8. Model Specification  Most Simply  Goal: minimize this objective function
  • 9. Process  Estimate coefficients iteratively.  Estimate regression coefficients.  Use residuals from these to estimate the covariance term.  Repeat until convergence.  Does not require a lot of parametric assumptions as does MLE.
  • 10. Issues w/ GEE  Specify the correct correlation structure  Should be based on theory. However, asymptotically efficient if incorrect.  Sample size  Smaller N may have issues with convergence.  Missing Data  Important to specify true correlation matrix.  Goodness-of-fit  Correlated residuals, so can't use standard statistics.  Uncertainty  Need to adjust s.e. (robust, bootstrapped)
  • 11. Initiative and Party Power  Justin Phillips (2008)  “Direct democracy alters the ability of partisan legislative majorities...to shape the size...of the public sector.”  DV: Tax effort  IVs:  Partisan Control  State-level characteristics  Interactions
  • 13. 100 Miles of Dry  John Frendreis & Raymond Tatalovich (2010)  What factors identified as important for Prohibition remain important today?  DV: Dry county  IVs:  Religion  Demographics  Partisan voting
  • 15. Conclusions  Often deal with correlated data in the real-world.  GEE allows us to deal with intraclass correlations and produces efficient coefficient estimates.  More flexible than GLMM: correlation structures, interpretation  Estimates may differ once correlated data are accounted for.  Toss in the methodological toolbox.