SlideShare a Scribd company logo
Propensity Score
Matching Methods
Day 1, Lecture 3
By Ragui Assaad
Training on Applied Micro-Econometrics and
Public Policy Evaluation
July 25-27, 2016
Economic Research Forum
Type I Methods
• Assumption:
• Conditional exogeneity of placement
• i.e. placement into program does not depend on unobservables
affecting outcome of interest
• In that case Ti is not correlated with error term and is therefore not
endogenous
• Can simply use regression to get at ATE
• This is a fairly strong assumption that is unlikely to be met in most
cases.
Multiple Regression Approach
• Assumes conditional exogeneity of placement (selection
on observables only)
• Run a multiple regression explaining the outcome (Y) as a
function of the Treatment (T) and other controls (X). The
coefficient of T is the average treatment effect.
• Can Differentiate effect of treatment across different
groups by interacting X and T
iiii XTY  
iiiiii XTXTY  
Problems with Regression Approach
• Besides the strong assumption of conditional exogeneity
of placement, linear regression makes other strong
parametric assumptions relating to how treatment and
outcome are related conditional on observables.
• We must assume a particular functional form, like linear,
log-linear, etc.
• We would prefer not to make such assumptions and use a
non-parametric approach.
• Regular regression also gives equal weight to all
comparison observations, irrespective of how likely they
are to participate in the program
Introduction to Propensity Score
Matching (PSM)
• Method is used to find a comparison group of non-
participants with similar pre-intervention characteristics as
the treatment group
• Like regression, method assumes conditional exogeneity
of placement or “selection on observables only”
• Main Question: Which characteristics does one use and
what weight to put on each of them?
• Propensity score matching is a way of answering this
question
What are the advantages of PSM over
Regression?
• PSM does not assume a particular functional form for the
way the X’s affect Y and the way T affects Y
• It is referred to as a non-parametric method
• Regression uses the whole dataset to compare treatment
and controls, even if treatment and control observations
are very different in terms of the observables
• PSM only compares treatment and control observations
that are similar (along observable characteristics)
• We only keep control observations that "close" to the treatment
observations we want to match
Basic Theorem of PSM
• Rosenbaum and Rubin (1983) showed that instead of
having to match on a multitude of dimensions in a vector
of observable characteristics Z, it is only necessary to
match on a single dimension P(Z), which is the propensity
score.
• A treatment and control observation that are "close" in the
propensity score space, will also be "close" along the
various elements making up Z.
What is a propensity score?
• A propensity score is given by:
• P(Z)=Pr(T=1 | Z) where Z is a vector of pre-exposure
characteristics
– Z can include the pre-treatment value of the outcome
– Treatment units are matched to comparison or control units with
similar values of P(Z)
• Impact estimates from Propensity Score Matching (PSM)
will depend on the variables that go into the equation
used to estimate propensity score and also on the
specification of that equation
• Can be estimated using probit or logit
Region of Common Support
Region of common support for propensity score between participants and non-
participants must be large enough to find an adequate comparison group
Assumption Underlying PSM Methods
• Eliminating selection based on observables will reduce
overall bias
• This is the case if and only if the bias based on
unobservables goes in the same direction as bias based
on observables
• If they go in opposite direction, eliminating bias based on
observables will increase overall bias
• Possible in theory, but not likely in practice
What variables go in Z?
• Any variable that can affect program placement, including
the pre-intervention level of the outcome variable
• Can be obtained through interviews
• If one cannot find good observable variables to explain
who participates and who doesn’t, we will do a poor job
eliminating selection bias
PSM in practice
• Use predicted probabilities of participation from a probit or
logit model
• Matching is done based on the distance between each
treatment observation and comparison observations in its
neighborhood, using the propensity score as a metric of
distance.
• The outcome of each treatment is compared to that of its
matched comparison observations to detect whether the
treatment has a statistically significant effect
Matching Methods
• There are many ways to undertake the matching
• Simplest method:
• One-to-one matching
• Match each treatment observation with “nearest” comparison observation
• Can impose a maximum distance, through a “caliper”
• Subject to potential outliers
• Addressing ties in propensity scores
• Nearest n-neighbors
• Uses the average of the nearest n-neighbors among comparison observations
• Can also impose a caliper
Matching Method
• More complicated but probably sounder methods
• Kernel matching
• Kernel functions are symmetric functions around a particular point that
can be used to provide a weight that is inversely proportional to the
horizontal distance from that point
• The weight is the vertical height at each distance from the central point
• The weight typically goes to zero after a certain distance has been
reached (usually referred to as bandwidth)
Some Popular Kernel Function
0
0.2
0.4
0.6
0.8
1
1.2
-1 -0.5 0 0.5 1
Epanechnikov
0
0.2
0.4
0.6
0.8
1
1.2
-1 -0.5 0 0.5 1
Tricube
0
0.2
0.4
0.6
0.8
1
1.2
-1 -0.5 0 0.5 1
Uniform
0
0.2
0.4
0.6
0.8
1
-3 -2 -1 0 1 2 3
Exponential
Epanechnikov Tricube
Uniform Gaussian or Normal
PSM Estimator
• The simplest measure of the impact of program is
measured by:
Where Yj
C is the value for a single “nearest neighbor” from
the comparison group.
• If we are using multiple “nearest neighbors” we take a
weighted mean of all the “nearest neighbors” to form Yj
C
with the weights provided by the kernel function used
(generally declining with distance from observation j)
(1/ NT
) (Yj
T
-Yj
C
)
j=1
NT
å
Notes on Estimating Propensity Score
• Although effect of treatment on outcome is non-
parametric, estimation of propensity score itself
depends on a parametric specification (usually
logit or probit)
• Specification of propensity score equation will
affect quality of matching and, therefore, final
result
• Must try out multiple specifications with lots of
interaction terms between the observable
characteristics to get a good fit
Testing how good the match is
• Need to test the extent to which observable
characteristics are balanced in the matched sample, using
a t-test of of the difference in means of the covariates
across matched samples.
• None of the test should be statistically significant
• Should examine the effect of the matching method on the
area of common support
• The closer we try to make the match, the more likely that some
treatment observation won’t find matches, which reduce the
efficiency of the model
Standard Errors
• Because propensity scores are predicted, analytical
standard errors obtained from a t-test of the outcome for
matched treatment and comparison observations are not
correct
• We typically use bootstrapping to estimate the correct
standard errors.
Sensitivity Analysis
• There are many modeling decisions that can affect the
results, including
• Specification of propensity score equation
• What variables to include
• How many interactions to include and at what level
• Matching method and caliper or bandwidth to use
• Must test sensitivity of results to these decisions.
• If results are not robust to these changes, this should raise a
question mark about their reliability
PSM compared to Social Experiements
• In social experiments PS= constant.
– Everyone has same probability of participating.
• PSM tries to match treatment and control groups based
on them having an equal probability of participation (using
the probability predicted from observables)
• But probability of participation can also be affected by
unobservables raising concerns about remaining selection
bias in PSM methods
Propensity Score
Weighting
Motivation for propensity score
weighting
• Propensity score methods are used to remove the effects
of observable confounders when estimating the effect of a
treatment on an outcome
• Have been discussing matching methods
• Stratification, nearest neighbor, etc.
• Propensity scores can also be used to weight
observations (like a sample weight)
• Specifically use inverse probability of treatment weighting (IPTW)
• This approach can be used in contexts where models are non-linear
• Example: Propensity score weights in survival analysis (Austin, 2011)
23
Creating propensity score weights
• Same creation of propensity scores as for matching
• P(Z)=Pr(Ti=1 | Zi) where Z is a vector of pre-exposure
characteristics
• Predict probabilities of participation using logit or probit:
• Use propensity scores to create weights
• Weights (wi) are the inverse of the probability of receiving the
treatment (or non-treatment) that the subject actually received
• High weights for those who were unlikely to receive treatment but
did and those who were likely to receive treatment but did not
24
ˆP
wi =
Ti
ˆPi
+
1-Ti
1- ˆPi
If treated wi =
1
ˆPi
If control wi =
1
1- ˆPi
Estimates of treatment effects
• There are a number of ways to use IPTW directly
to estimate treatment effects
• Can estimate average treatment effect (ATE) as:
• Average treatment on the treated (ATT) and
average treatment on the untreated (controls,
ATC) use different weights:
25
ATE =
1
n
TiYi
ˆPi1
n
å -
(1-Ti )Yi
(1- ˆPi )1
n
å
wi,ATT = Ti +
ˆPi (1-Ti )
1- ˆPi
wi,ATC =
Ti (1- ˆPi )
ˆPi
+(1-Ti )
Incorporating weights into models
• Regression models can be weighted by the inverse
probability of treatment to estimate the causal effects of
treatment
• Use like a sample weight
• Need to account for weighted nature of synthetic sample
(use robust estimation) when using IPTW
26
Checking weights
• When weights are applied to treatment and control
groups, they should then have statistically
indistinguishable group means
• Weights may be very inaccurate/unstable for subjects with
a very low probability of receive the treatment status they
received
• Very unlikely to get treatment and did
• Very likely to get treatment and did not
• Can stabilize weights by multiplying weights by mean
probability of treatment received (T or C) propensity score
• Can also trim (use region of common support)
27
Example of propensity score weighting
• Southwest China Poverty Reduction Project ran from
1995 to 2005
• Surveys in 1996 and 2000 of 2,000 households in
targeted and non-targeted villages
• Follow-up in 2004/5
• Selection might drive treatment effect estimates if initial
differences in samples were large
• Propensity score weighting in regression models was
used as part of estimating program effects
• Impact evaluation of Chen, Mu, and Ravallion (2008)
found impacts on income, consumption, and education
diminished by 2004/5
28

More Related Content

PPTX
Neural control of bladder
PPTX
PSM - Images.pptx
PDF
Causal Inference
PDF
어떻게 하면 데이터 사이언티스트가 될 수 있나요?
PDF
자습해도 모르겠던 딥러닝, 머리속에 인스톨 시켜드립니다.
PPTX
Equilibrium of Firm Under Perfect Competition
PPT
Osteoporosis diet and control
PPTX
Concepts of disease causation
Neural control of bladder
PSM - Images.pptx
Causal Inference
어떻게 하면 데이터 사이언티스트가 될 수 있나요?
자습해도 모르겠던 딥러닝, 머리속에 인스톨 시켜드립니다.
Equilibrium of Firm Under Perfect Competition
Osteoporosis diet and control
Concepts of disease causation

What's hot (20)

PPTX
Logistic regression with SPSS
PDF
Difference-in-Difference Methods
PDF
Logistic Regression Analysis
PPTX
Logistic regression with SPSS examples
PPTX
Logistic regression
PPTX
Non parametric presentation
PDF
Sample size calculation in medical research
PDF
Normality tests
PPTX
Non parametric-tests
PPT
Part 2 Cox Regression
PPTX
The mann whitney u test
PDF
Linear regression theory
PPT
Multivariate Analysis Techniques
PDF
PPT
Survival Analysis Lecture.ppt
PPTX
Statistical Power
PDF
Instrumental Variables and Control Functions
PPTX
Survival analysis
PPTX
Multivariate data analysis
Logistic regression with SPSS
Difference-in-Difference Methods
Logistic Regression Analysis
Logistic regression with SPSS examples
Logistic regression
Non parametric presentation
Sample size calculation in medical research
Normality tests
Non parametric-tests
Part 2 Cox Regression
The mann whitney u test
Linear regression theory
Multivariate Analysis Techniques
Survival Analysis Lecture.ppt
Statistical Power
Instrumental Variables and Control Functions
Survival analysis
Multivariate data analysis
Ad

Viewers also liked (20)

PDF
Causal Inference and Program Evaluation
PDF
Panel Data Models
PDF
Regression Discontinuity Method
PDF
Potential Solutions to the Fundamental Problem of Causal Inference: An Overview
PDF
Overview and Objectives of the Workshop
PDF
Overview of Mixed Models
PDF
Multilevel Binary Logistic Regression
PPTX
How do we do research in economics
PDF
Lessons for Democratic Transition in the Arab World
PDF
Propensity score matching overview
PDF
How to make democratic accountability work better for development?
PDF
Democracy, Elections, and Development
PDF
Forms of Democracy and Development
PDF
Culture and democratization
PDF
The Arab Spring: Much Violence, Little Democracy
PDF
傾向スコアの概念とその実践
PDF
傾向スコア:その概念とRによる実装
PDF
Propensity score analysis__statistical_methods_and_applications_advanced_quan...
PPTX
Evaluation in Africa RISING
PPT
Funderburk et al 2011 ICHPS_Using_Propensity_Score_& MI_Analysis_10-6-11
Causal Inference and Program Evaluation
Panel Data Models
Regression Discontinuity Method
Potential Solutions to the Fundamental Problem of Causal Inference: An Overview
Overview and Objectives of the Workshop
Overview of Mixed Models
Multilevel Binary Logistic Regression
How do we do research in economics
Lessons for Democratic Transition in the Arab World
Propensity score matching overview
How to make democratic accountability work better for development?
Democracy, Elections, and Development
Forms of Democracy and Development
Culture and democratization
The Arab Spring: Much Violence, Little Democracy
傾向スコアの概念とその実践
傾向スコア:その概念とRによる実装
Propensity score analysis__statistical_methods_and_applications_advanced_quan...
Evaluation in Africa RISING
Funderburk et al 2011 ICHPS_Using_Propensity_Score_& MI_Analysis_10-6-11
Ad

Similar to Propensity Score Matching Methods (20)

PDF
Regression analysis made easy
PPTX
UNIT 5.pptx
PDF
Factor Analysis - Statistics
PPTX
Bio-Statistics in Bio-Medical research
PPT
Inferential statistics_AAF 500L 2021.ppt
PPTX
F unit 5.pptx
PPTX
Research data analysis b.sc nursing .pptx
PPTX
Biostatistics and Research Methodology Semester 8
PDF
BASIC STATISTICS AND THEIR INTERPRETATION AND USE IN EPIDEMIOLOGY 050822.pdf
PDF
Wright_Bennett_PSI2015
PPTX
UNIT_2.2 Calibration and Discrimination.power point presentation format
PPTX
Chapter 13 Data Analysis Inferential Methods and Analysis of Time Series
PPTX
statistical remodelling in pharmaceutical research and developmnent by aina b...
PPTX
Meta analysis
PPTX
Understanding Meta-Analysis: A Comprehensive Guide
PDF
Data Ananlysis lecture 7 Simon Fraser University
PPTX
Inferential statistics quantitative data - single sample and 2 groups
PPTX
Non_parametric_test-n3.pptx ndufhdnjdnfufbfnfcnj
PDF
Selection of appropriate data analysis technique
PDF
Research 101: Inferential Quantitative Analysis
Regression analysis made easy
UNIT 5.pptx
Factor Analysis - Statistics
Bio-Statistics in Bio-Medical research
Inferential statistics_AAF 500L 2021.ppt
F unit 5.pptx
Research data analysis b.sc nursing .pptx
Biostatistics and Research Methodology Semester 8
BASIC STATISTICS AND THEIR INTERPRETATION AND USE IN EPIDEMIOLOGY 050822.pdf
Wright_Bennett_PSI2015
UNIT_2.2 Calibration and Discrimination.power point presentation format
Chapter 13 Data Analysis Inferential Methods and Analysis of Time Series
statistical remodelling in pharmaceutical research and developmnent by aina b...
Meta analysis
Understanding Meta-Analysis: A Comprehensive Guide
Data Ananlysis lecture 7 Simon Fraser University
Inferential statistics quantitative data - single sample and 2 groups
Non_parametric_test-n3.pptx ndufhdnjdnfufbfnfcnj
Selection of appropriate data analysis technique
Research 101: Inferential Quantitative Analysis

More from Economic Research Forum (20)

PPTX
Session 4 farhad mehran, single most data gaps
PDF
Session 3 mahdi ben jelloul, microsimulation for policy evaluation
PDF
Session 3 m.a. marouani, structual change, skills demand and job quality
PPTX
Session 3 ishac diwn, bridging mirco and macro appraoches
PDF
Session 3 asif islam, jobs flagship report
PPTX
Session 2 yemen hlel, insights from tunisia
PPT
Session 2 samia satti, insights from sudan
PPTX
Session 2 mona amer, insights from egypt
PPTX
Session 2 ali souag, insights from algeria
PPTX
Session 2 abdel rahmen el lahga, insights from tunisia
PPTX
Session 1 ragui assaad, moving beyond the unemployment rate
PPTX
Session 1 luca fedi, towards a research agenda
PDF
من البيانات الى السياسات : مبادرة إتاحة البيانات المنسقة
PPTX
The Future of Jobs is Facing the Biggest Policy Induced Price Distortion in H...
PPTX
Job- Creating Growth in the Emerging Global Economy
PPTX
The Role of Knowledge in the Process of Innovation in the New Global Economy:...
PPTX
Rediscovering Industrial Policy for the 21st Century: Where to Start?
PPTX
How the Rise of the Intangibles Economy is Disrupting Work in Africa
PPTX
On Ideas and Economic Policy: A Survey of MENA Economists
PPTX
Future Research Directions for ERF
Session 4 farhad mehran, single most data gaps
Session 3 mahdi ben jelloul, microsimulation for policy evaluation
Session 3 m.a. marouani, structual change, skills demand and job quality
Session 3 ishac diwn, bridging mirco and macro appraoches
Session 3 asif islam, jobs flagship report
Session 2 yemen hlel, insights from tunisia
Session 2 samia satti, insights from sudan
Session 2 mona amer, insights from egypt
Session 2 ali souag, insights from algeria
Session 2 abdel rahmen el lahga, insights from tunisia
Session 1 ragui assaad, moving beyond the unemployment rate
Session 1 luca fedi, towards a research agenda
من البيانات الى السياسات : مبادرة إتاحة البيانات المنسقة
The Future of Jobs is Facing the Biggest Policy Induced Price Distortion in H...
Job- Creating Growth in the Emerging Global Economy
The Role of Knowledge in the Process of Innovation in the New Global Economy:...
Rediscovering Industrial Policy for the 21st Century: Where to Start?
How the Rise of the Intangibles Economy is Disrupting Work in Africa
On Ideas and Economic Policy: A Survey of MENA Economists
Future Research Directions for ERF

Recently uploaded (20)

PPTX
GOVERNMENT-ACCOUNTING1. bsa 4 government accounting
PPTX
Portland FPDR Oregon Legislature 2025.pptx
PDF
Building Bridges (of Hope) over Our Troubled Waters_PART 1
DOC
LU毕业证学历认证,赫尔大学毕业证硕士的学历和学位
PPTX
SOMANJAN PRAMANIK_3500032 2042.pptx
PDF
PPT - Primary Rules of Interpretation (1).pdf
PPTX
School Education Programs for Social Impact Learn with Parramatta Mission
PDF
About Karen Miner-Romanoff - Academic & nonprofit consultant
PDF
oil palm convergence 2024 mahabubnagar.pdf
PPTX
Introduction_to_the_Study_of_Globalization.pptx
PPTX
Developing_An_Advocacy_Agenda_by_Kevin_Karuga.pptx
PDF
CXPA Finland Webinar - Modern Components of Service Quality - Alec Dalton - ...
PDF
buyers sellers meeting of mangoes in mahabubnagar.pdf
PPTX
BHARATIYA NAGARIKA SURAKSHA SAHMITA^J2023 (1).pptx
PDF
PPT Items # 6&7 - 900 Cambridge Oval Right-of-Way
PPTX
DFARS Part 253 - Forms - Defense Contracting Regulations
PPTX
Quiz - Saturday.pptxaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
PPTX
Weekly Report 17-10-2024_cybersecutity.pptx
PDF
PPT Item # 2 -- Announcements Powerpoint
PPT
The Central Civil Services (Leave Travel Concession) Rules, 1988, govern the ...
GOVERNMENT-ACCOUNTING1. bsa 4 government accounting
Portland FPDR Oregon Legislature 2025.pptx
Building Bridges (of Hope) over Our Troubled Waters_PART 1
LU毕业证学历认证,赫尔大学毕业证硕士的学历和学位
SOMANJAN PRAMANIK_3500032 2042.pptx
PPT - Primary Rules of Interpretation (1).pdf
School Education Programs for Social Impact Learn with Parramatta Mission
About Karen Miner-Romanoff - Academic & nonprofit consultant
oil palm convergence 2024 mahabubnagar.pdf
Introduction_to_the_Study_of_Globalization.pptx
Developing_An_Advocacy_Agenda_by_Kevin_Karuga.pptx
CXPA Finland Webinar - Modern Components of Service Quality - Alec Dalton - ...
buyers sellers meeting of mangoes in mahabubnagar.pdf
BHARATIYA NAGARIKA SURAKSHA SAHMITA^J2023 (1).pptx
PPT Items # 6&7 - 900 Cambridge Oval Right-of-Way
DFARS Part 253 - Forms - Defense Contracting Regulations
Quiz - Saturday.pptxaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
Weekly Report 17-10-2024_cybersecutity.pptx
PPT Item # 2 -- Announcements Powerpoint
The Central Civil Services (Leave Travel Concession) Rules, 1988, govern the ...

Propensity Score Matching Methods

  • 1. Propensity Score Matching Methods Day 1, Lecture 3 By Ragui Assaad Training on Applied Micro-Econometrics and Public Policy Evaluation July 25-27, 2016 Economic Research Forum
  • 2. Type I Methods • Assumption: • Conditional exogeneity of placement • i.e. placement into program does not depend on unobservables affecting outcome of interest • In that case Ti is not correlated with error term and is therefore not endogenous • Can simply use regression to get at ATE • This is a fairly strong assumption that is unlikely to be met in most cases.
  • 3. Multiple Regression Approach • Assumes conditional exogeneity of placement (selection on observables only) • Run a multiple regression explaining the outcome (Y) as a function of the Treatment (T) and other controls (X). The coefficient of T is the average treatment effect. • Can Differentiate effect of treatment across different groups by interacting X and T iiii XTY   iiiiii XTXTY  
  • 4. Problems with Regression Approach • Besides the strong assumption of conditional exogeneity of placement, linear regression makes other strong parametric assumptions relating to how treatment and outcome are related conditional on observables. • We must assume a particular functional form, like linear, log-linear, etc. • We would prefer not to make such assumptions and use a non-parametric approach. • Regular regression also gives equal weight to all comparison observations, irrespective of how likely they are to participate in the program
  • 5. Introduction to Propensity Score Matching (PSM) • Method is used to find a comparison group of non- participants with similar pre-intervention characteristics as the treatment group • Like regression, method assumes conditional exogeneity of placement or “selection on observables only” • Main Question: Which characteristics does one use and what weight to put on each of them? • Propensity score matching is a way of answering this question
  • 6. What are the advantages of PSM over Regression? • PSM does not assume a particular functional form for the way the X’s affect Y and the way T affects Y • It is referred to as a non-parametric method • Regression uses the whole dataset to compare treatment and controls, even if treatment and control observations are very different in terms of the observables • PSM only compares treatment and control observations that are similar (along observable characteristics) • We only keep control observations that "close" to the treatment observations we want to match
  • 7. Basic Theorem of PSM • Rosenbaum and Rubin (1983) showed that instead of having to match on a multitude of dimensions in a vector of observable characteristics Z, it is only necessary to match on a single dimension P(Z), which is the propensity score. • A treatment and control observation that are "close" in the propensity score space, will also be "close" along the various elements making up Z.
  • 8. What is a propensity score? • A propensity score is given by: • P(Z)=Pr(T=1 | Z) where Z is a vector of pre-exposure characteristics – Z can include the pre-treatment value of the outcome – Treatment units are matched to comparison or control units with similar values of P(Z) • Impact estimates from Propensity Score Matching (PSM) will depend on the variables that go into the equation used to estimate propensity score and also on the specification of that equation • Can be estimated using probit or logit
  • 9. Region of Common Support Region of common support for propensity score between participants and non- participants must be large enough to find an adequate comparison group
  • 10. Assumption Underlying PSM Methods • Eliminating selection based on observables will reduce overall bias • This is the case if and only if the bias based on unobservables goes in the same direction as bias based on observables • If they go in opposite direction, eliminating bias based on observables will increase overall bias • Possible in theory, but not likely in practice
  • 11. What variables go in Z? • Any variable that can affect program placement, including the pre-intervention level of the outcome variable • Can be obtained through interviews • If one cannot find good observable variables to explain who participates and who doesn’t, we will do a poor job eliminating selection bias
  • 12. PSM in practice • Use predicted probabilities of participation from a probit or logit model • Matching is done based on the distance between each treatment observation and comparison observations in its neighborhood, using the propensity score as a metric of distance. • The outcome of each treatment is compared to that of its matched comparison observations to detect whether the treatment has a statistically significant effect
  • 13. Matching Methods • There are many ways to undertake the matching • Simplest method: • One-to-one matching • Match each treatment observation with “nearest” comparison observation • Can impose a maximum distance, through a “caliper” • Subject to potential outliers • Addressing ties in propensity scores • Nearest n-neighbors • Uses the average of the nearest n-neighbors among comparison observations • Can also impose a caliper
  • 14. Matching Method • More complicated but probably sounder methods • Kernel matching • Kernel functions are symmetric functions around a particular point that can be used to provide a weight that is inversely proportional to the horizontal distance from that point • The weight is the vertical height at each distance from the central point • The weight typically goes to zero after a certain distance has been reached (usually referred to as bandwidth)
  • 15. Some Popular Kernel Function 0 0.2 0.4 0.6 0.8 1 1.2 -1 -0.5 0 0.5 1 Epanechnikov 0 0.2 0.4 0.6 0.8 1 1.2 -1 -0.5 0 0.5 1 Tricube 0 0.2 0.4 0.6 0.8 1 1.2 -1 -0.5 0 0.5 1 Uniform 0 0.2 0.4 0.6 0.8 1 -3 -2 -1 0 1 2 3 Exponential Epanechnikov Tricube Uniform Gaussian or Normal
  • 16. PSM Estimator • The simplest measure of the impact of program is measured by: Where Yj C is the value for a single “nearest neighbor” from the comparison group. • If we are using multiple “nearest neighbors” we take a weighted mean of all the “nearest neighbors” to form Yj C with the weights provided by the kernel function used (generally declining with distance from observation j) (1/ NT ) (Yj T -Yj C ) j=1 NT å
  • 17. Notes on Estimating Propensity Score • Although effect of treatment on outcome is non- parametric, estimation of propensity score itself depends on a parametric specification (usually logit or probit) • Specification of propensity score equation will affect quality of matching and, therefore, final result • Must try out multiple specifications with lots of interaction terms between the observable characteristics to get a good fit
  • 18. Testing how good the match is • Need to test the extent to which observable characteristics are balanced in the matched sample, using a t-test of of the difference in means of the covariates across matched samples. • None of the test should be statistically significant • Should examine the effect of the matching method on the area of common support • The closer we try to make the match, the more likely that some treatment observation won’t find matches, which reduce the efficiency of the model
  • 19. Standard Errors • Because propensity scores are predicted, analytical standard errors obtained from a t-test of the outcome for matched treatment and comparison observations are not correct • We typically use bootstrapping to estimate the correct standard errors.
  • 20. Sensitivity Analysis • There are many modeling decisions that can affect the results, including • Specification of propensity score equation • What variables to include • How many interactions to include and at what level • Matching method and caliper or bandwidth to use • Must test sensitivity of results to these decisions. • If results are not robust to these changes, this should raise a question mark about their reliability
  • 21. PSM compared to Social Experiements • In social experiments PS= constant. – Everyone has same probability of participating. • PSM tries to match treatment and control groups based on them having an equal probability of participation (using the probability predicted from observables) • But probability of participation can also be affected by unobservables raising concerns about remaining selection bias in PSM methods
  • 23. Motivation for propensity score weighting • Propensity score methods are used to remove the effects of observable confounders when estimating the effect of a treatment on an outcome • Have been discussing matching methods • Stratification, nearest neighbor, etc. • Propensity scores can also be used to weight observations (like a sample weight) • Specifically use inverse probability of treatment weighting (IPTW) • This approach can be used in contexts where models are non-linear • Example: Propensity score weights in survival analysis (Austin, 2011) 23
  • 24. Creating propensity score weights • Same creation of propensity scores as for matching • P(Z)=Pr(Ti=1 | Zi) where Z is a vector of pre-exposure characteristics • Predict probabilities of participation using logit or probit: • Use propensity scores to create weights • Weights (wi) are the inverse of the probability of receiving the treatment (or non-treatment) that the subject actually received • High weights for those who were unlikely to receive treatment but did and those who were likely to receive treatment but did not 24 ˆP wi = Ti ˆPi + 1-Ti 1- ˆPi If treated wi = 1 ˆPi If control wi = 1 1- ˆPi
  • 25. Estimates of treatment effects • There are a number of ways to use IPTW directly to estimate treatment effects • Can estimate average treatment effect (ATE) as: • Average treatment on the treated (ATT) and average treatment on the untreated (controls, ATC) use different weights: 25 ATE = 1 n TiYi ˆPi1 n å - (1-Ti )Yi (1- ˆPi )1 n å wi,ATT = Ti + ˆPi (1-Ti ) 1- ˆPi wi,ATC = Ti (1- ˆPi ) ˆPi +(1-Ti )
  • 26. Incorporating weights into models • Regression models can be weighted by the inverse probability of treatment to estimate the causal effects of treatment • Use like a sample weight • Need to account for weighted nature of synthetic sample (use robust estimation) when using IPTW 26
  • 27. Checking weights • When weights are applied to treatment and control groups, they should then have statistically indistinguishable group means • Weights may be very inaccurate/unstable for subjects with a very low probability of receive the treatment status they received • Very unlikely to get treatment and did • Very likely to get treatment and did not • Can stabilize weights by multiplying weights by mean probability of treatment received (T or C) propensity score • Can also trim (use region of common support) 27
  • 28. Example of propensity score weighting • Southwest China Poverty Reduction Project ran from 1995 to 2005 • Surveys in 1996 and 2000 of 2,000 households in targeted and non-targeted villages • Follow-up in 2004/5 • Selection might drive treatment effect estimates if initial differences in samples were large • Propensity score weighting in regression models was used as part of estimating program effects • Impact evaluation of Chen, Mu, and Ravallion (2008) found impacts on income, consumption, and education diminished by 2004/5 28