SlideShare a Scribd company logo
@EpiEllie
COVID-19 and
causal inference
Eleanor Murray, ScD
Department of
Epidemiology
CogX
June 8, 2020
What do we want to know?
What will the world look like in the future?
How would things have changed if the world
had been slightly different?
Treat
now
Treat later
@EpiEllie
How do we estimate causal
effects?
Miguel Hernàn’s two-step causal algorithm:
1. Ask good questions
2. Answer them with appropriate methods
? !
@EpiEllie
What do we want to know?
If we can’t have a time machine, we’d like to
have a randomized trial.
Treat now
Treat later
@EpiEllie
Target trial framework for estimating
causal effects
If we can’t have a randomized trial, we’d like to
emulate what would have happened if we could
have done one.
We can do this with:
Imperfect trials
Observational data
Simulation modeling Treat now
Treat later
@EpiEllie
What assumptions do we need?
No unmeasured confounding: all common causes of
the treatment and outcome are known and measured in
the data
No open colliders: all common effects of the treatment
and outcome are known and not conditioned on in the
data or analysis
@EpiEllie
What assumptions do we
need?
Positivity: there is a non-zero probability of all
levels of treatment for all types of individuals in
our population
@EpiEllie
What assumptions do we
need?
 Consistency: our treatment levels are clearly
specified, aka:
 Well-defined interventions
 Well-defined causal questions
@EpiEllie
Why are well-defined causal questions
important for complex exposures?
When there are multiple possible ‘interventions’
and we don’t specify one, our answer is a weighted
average of all ‘interventions’ but we don’t know
the weights
Murray, 2016. Agent-based models for causal inference. Harvard
University.
@EpiEllie
Why are well-defined causal questions
important for complex exposures?
Worse, if the ‘intervention’ is ill-defined, the
confounding is probably also ill-defined!
Murray, 2016. Agent-based models for causal inference. Harvard
@EpiEllie
All communicable disease by nature
involve complex exposures
We cannot define the causal effect of an
intervention on a communicable disease
without accounting for who is infected and
how people come into contact
This is the problem of interference
Special challenges for COVID19 RCTs
Identifying an appropriate control group
Can we ethically use placebo? What is ‘standard
of care’?
Identifying an appropriate outcome
Many early studies assessed ‘symptom
improvement’ as a surrogate outcome, but failed
to properly account for ICU admission or death
Many later studies looked at death but only in
hospital. Individuals who were discharged were
‘lost to follow-up’ @EpiEllie
Special challenges for COVID19 RCTs
Understanding to whom the results apply
Can a trial run at a hospital in NYC tell us how
to treat patients in Singapore? Kinshasa?
London?
Can a trial run among patients were never on
ventilators tell us how to treat new patients who
may need ventilators?
@EpiEllie
Special challenges for COVID19
observational studies
All the same problems as with RCTs apply,
plus:
Identifying an appropriate control group
Who should we compare to when experimental
treatments are preferentially given to the sickest
patients?
We know we have confounding. How do we
control for it?
@EpiEllie
Special challenges for COVID19
simulation studies
All the same problems as with RCTs &
observational studies apply, plus:
Identifying parameter inputs
We need external validity for not only our effect
estimates but also our model inputs
Understanding uncertainty
This is a big unanswered question in simulation
modelling that needs more statistical attention
@EpiEllie
Uncertainty in simulation models for
causal effects
We cannot generate confidence intervals for
our simulation models because we do not
have a way to capture the full variance.
Modelers recognize three sources of
uncertainty:
Stochastic
Parametric
Structural
 Solution: Increase sample size or number of runs
 Solution: Probabilistic or Bayesian sensitivity analyses
on key parameters.
 Which are key? What distributions should we use? What impact does it have to
assume other parameters don’t have uncertainty?
 Solution: ?? Probably will involve synthesizing across
different model structures but how do we decide which
ones & how do we quantify uncertainty? @EpiEllie
Summary
COVID19 urgently needs good causal effect
estimates for identifying effective prevention
and treatment strategies.
Many existing studies are committing basic
statistical and epidemiologic errors.
There are also important gaps in our
statistical methods for quantifying uncertainty
in simulation model estimates.
@EpiEllie

More Related Content

PPTX
A Cartoon Guide to Causal Inference
PPTX
Causal Inference from Pragmatic Trials.
PPTX
Causal inference for complex exposures: asking questions that matter, getting...
PPTX
COVID and Causal Inference -- NAS CATS 6/2020
PDF
Experimental Causal Inference
PDF
P-values in crisis
PPT
Aron chpt 7 ed effect size f2011
PPTX
Basic Statistical Concepts & Decision-Making
A Cartoon Guide to Causal Inference
Causal Inference from Pragmatic Trials.
Causal inference for complex exposures: asking questions that matter, getting...
COVID and Causal Inference -- NAS CATS 6/2020
Experimental Causal Inference
P-values in crisis
Aron chpt 7 ed effect size f2011
Basic Statistical Concepts & Decision-Making

What's hot (20)

PPTX
Choosing Regression Models
PDF
Prediction research in a pandemic: 3 lessons from a living systematic review ...
PDF
Clinical prediction models: development, validation and beyond
PPTX
The challenge of small data
PDF
Regression shrinkage: better answers to causal questions
PDF
Missing data and non response pdf
PDF
The basics of prediction modeling
PPTX
Replicability and questionable research practices
PPT
First in man tokyo
PDF
Why you need power analysis
PDF
Biostatistics Workshop: Regression
PPTX
Vaccine trials in the age of COVID-19
PPT
Power, effect size, and Issues in NHST
PDF
Improving predictions: Lasso, Ridge and Stein's paradox
PPTX
What is your question
PDF
To p or not to p
PPT
Hypothesistesting2
PPT
Chapter 021
PPTX
In search of the lost loss function
PPT
Is ignorance bliss
Choosing Regression Models
Prediction research in a pandemic: 3 lessons from a living systematic review ...
Clinical prediction models: development, validation and beyond
The challenge of small data
Regression shrinkage: better answers to causal questions
Missing data and non response pdf
The basics of prediction modeling
Replicability and questionable research practices
First in man tokyo
Why you need power analysis
Biostatistics Workshop: Regression
Vaccine trials in the age of COVID-19
Power, effect size, and Issues in NHST
Improving predictions: Lasso, Ridge and Stein's paradox
What is your question
To p or not to p
Hypothesistesting2
Chapter 021
In search of the lost loss function
Is ignorance bliss
Ad

Similar to COVID and Causal Inference -- CogX 6/2020 (20)

PDF
Proactive COVID-19 testing to mitigate spread
PDF
Bias in covid 19 models
PPTX
Lessons from COVID-19: How Are Data Science and AI Changing Future Biomedical...
PDF
Computational Epidemiology Datadriven Modeling Of Covid19 Ellen Kuhl
PDF
Computational Epidemiology Datadriven Modeling Of Covid19 Ellen Kuhl
PDF
Computational Epidemiology Datadriven Modeling Of Covid19 Ellen Kuhl
PDF
Computational Epidemiology Datadriven Modeling Of Covid19 Ellen Kuhl
PPTX
Importance of Basic Understanding of Epidemiology in Mental Health Support Du...
PPTX
Coronavirus pandemic public health - lessons in mathematics
PDF
A Mathematical Model in Public Health Epidemiology: Covid-19 Case Resolution ...
PDF
Role of data science during covid times
PDF
Defing the Epidemiologic of Covid-19
DOCX
Downloadedfromhttpjournals.lww.comjphm
PDF
Differential Equation Analysis on COVID-19_Crimson Publishers
PDF
EXL Analytics
PPTX
Presentation what if the whole world is bad in data-driven decision-making ...
PDF
Computational Epidemiology tutorial featured at ACM Knowledge Discovery and D...
PDF
Covid tto reino_unido Dr. Freddy Flores Malpartida
PDF
Modelo matemático: el confinamiento disminuye significativamente la velocidad...
PDF
Modelling the-spread-of-sars-cov-2
Proactive COVID-19 testing to mitigate spread
Bias in covid 19 models
Lessons from COVID-19: How Are Data Science and AI Changing Future Biomedical...
Computational Epidemiology Datadriven Modeling Of Covid19 Ellen Kuhl
Computational Epidemiology Datadriven Modeling Of Covid19 Ellen Kuhl
Computational Epidemiology Datadriven Modeling Of Covid19 Ellen Kuhl
Computational Epidemiology Datadriven Modeling Of Covid19 Ellen Kuhl
Importance of Basic Understanding of Epidemiology in Mental Health Support Du...
Coronavirus pandemic public health - lessons in mathematics
A Mathematical Model in Public Health Epidemiology: Covid-19 Case Resolution ...
Role of data science during covid times
Defing the Epidemiologic of Covid-19
Downloadedfromhttpjournals.lww.comjphm
Differential Equation Analysis on COVID-19_Crimson Publishers
EXL Analytics
Presentation what if the whole world is bad in data-driven decision-making ...
Computational Epidemiology tutorial featured at ACM Knowledge Discovery and D...
Covid tto reino_unido Dr. Freddy Flores Malpartida
Modelo matemático: el confinamiento disminuye significativamente la velocidad...
Modelling the-spread-of-sars-cov-2
Ad

Recently uploaded (20)

PDF
Assessment of environmental effects of quarrying in Kitengela subcountyof Kaj...
PPT
6.1 High Risk New Born. Padetric health ppt
PPTX
Microbiology with diagram medical studies .pptx
PDF
Sciences of Europe No 170 (2025)
PPTX
neck nodes and dissection types and lymph nodes levels
PPTX
famous lake in india and its disturibution and importance
PPTX
Taita Taveta Laboratory Technician Workshop Presentation.pptx
PDF
. Radiology Case Scenariosssssssssssssss
PPTX
Pharmacology of Autonomic nervous system
PDF
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
PDF
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
PDF
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
PPTX
Classification Systems_TAXONOMY_SCIENCE8.pptx
PDF
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
PDF
Placing the Near-Earth Object Impact Probability in Context
PDF
Lymphatic System MCQs & Practice Quiz – Functions, Organs, Nodes, Ducts
PDF
The scientific heritage No 166 (166) (2025)
PPTX
ECG_Course_Presentation د.محمد صقران ppt
PPTX
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
PPTX
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
Assessment of environmental effects of quarrying in Kitengela subcountyof Kaj...
6.1 High Risk New Born. Padetric health ppt
Microbiology with diagram medical studies .pptx
Sciences of Europe No 170 (2025)
neck nodes and dissection types and lymph nodes levels
famous lake in india and its disturibution and importance
Taita Taveta Laboratory Technician Workshop Presentation.pptx
. Radiology Case Scenariosssssssssssssss
Pharmacology of Autonomic nervous system
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
Classification Systems_TAXONOMY_SCIENCE8.pptx
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
Placing the Near-Earth Object Impact Probability in Context
Lymphatic System MCQs & Practice Quiz – Functions, Organs, Nodes, Ducts
The scientific heritage No 166 (166) (2025)
ECG_Course_Presentation د.محمد صقران ppt
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg

COVID and Causal Inference -- CogX 6/2020

  • 1. @EpiEllie COVID-19 and causal inference Eleanor Murray, ScD Department of Epidemiology CogX June 8, 2020
  • 2. What do we want to know? What will the world look like in the future? How would things have changed if the world had been slightly different? Treat now Treat later @EpiEllie
  • 3. How do we estimate causal effects? Miguel Hernàn’s two-step causal algorithm: 1. Ask good questions 2. Answer them with appropriate methods ? ! @EpiEllie
  • 4. What do we want to know? If we can’t have a time machine, we’d like to have a randomized trial. Treat now Treat later @EpiEllie
  • 5. Target trial framework for estimating causal effects If we can’t have a randomized trial, we’d like to emulate what would have happened if we could have done one. We can do this with: Imperfect trials Observational data Simulation modeling Treat now Treat later @EpiEllie
  • 6. What assumptions do we need? No unmeasured confounding: all common causes of the treatment and outcome are known and measured in the data No open colliders: all common effects of the treatment and outcome are known and not conditioned on in the data or analysis @EpiEllie
  • 7. What assumptions do we need? Positivity: there is a non-zero probability of all levels of treatment for all types of individuals in our population @EpiEllie
  • 8. What assumptions do we need?  Consistency: our treatment levels are clearly specified, aka:  Well-defined interventions  Well-defined causal questions @EpiEllie
  • 9. Why are well-defined causal questions important for complex exposures? When there are multiple possible ‘interventions’ and we don’t specify one, our answer is a weighted average of all ‘interventions’ but we don’t know the weights Murray, 2016. Agent-based models for causal inference. Harvard University. @EpiEllie
  • 10. Why are well-defined causal questions important for complex exposures? Worse, if the ‘intervention’ is ill-defined, the confounding is probably also ill-defined! Murray, 2016. Agent-based models for causal inference. Harvard @EpiEllie
  • 11. All communicable disease by nature involve complex exposures We cannot define the causal effect of an intervention on a communicable disease without accounting for who is infected and how people come into contact This is the problem of interference
  • 12. Special challenges for COVID19 RCTs Identifying an appropriate control group Can we ethically use placebo? What is ‘standard of care’? Identifying an appropriate outcome Many early studies assessed ‘symptom improvement’ as a surrogate outcome, but failed to properly account for ICU admission or death Many later studies looked at death but only in hospital. Individuals who were discharged were ‘lost to follow-up’ @EpiEllie
  • 13. Special challenges for COVID19 RCTs Understanding to whom the results apply Can a trial run at a hospital in NYC tell us how to treat patients in Singapore? Kinshasa? London? Can a trial run among patients were never on ventilators tell us how to treat new patients who may need ventilators? @EpiEllie
  • 14. Special challenges for COVID19 observational studies All the same problems as with RCTs apply, plus: Identifying an appropriate control group Who should we compare to when experimental treatments are preferentially given to the sickest patients? We know we have confounding. How do we control for it? @EpiEllie
  • 15. Special challenges for COVID19 simulation studies All the same problems as with RCTs & observational studies apply, plus: Identifying parameter inputs We need external validity for not only our effect estimates but also our model inputs Understanding uncertainty This is a big unanswered question in simulation modelling that needs more statistical attention @EpiEllie
  • 16. Uncertainty in simulation models for causal effects We cannot generate confidence intervals for our simulation models because we do not have a way to capture the full variance. Modelers recognize three sources of uncertainty: Stochastic Parametric Structural  Solution: Increase sample size or number of runs  Solution: Probabilistic or Bayesian sensitivity analyses on key parameters.  Which are key? What distributions should we use? What impact does it have to assume other parameters don’t have uncertainty?  Solution: ?? Probably will involve synthesizing across different model structures but how do we decide which ones & how do we quantify uncertainty? @EpiEllie
  • 17. Summary COVID19 urgently needs good causal effect estimates for identifying effective prevention and treatment strategies. Many existing studies are committing basic statistical and epidemiologic errors. There are also important gaps in our statistical methods for quantifying uncertainty in simulation model estimates. @EpiEllie