Propensity Score Matching
Propensity Score Matching
Ani Katchova
2
Propensity Score Matching
Treatment evaluation is the estimation of the average effects of a program or treatment on the
outcome of interest.
Comparison of outcomes between treated and control observations.
3
Propensity score matching methodology
Assign the observations into two groups: the treated group that received the treatment and the
control group that did not.
o Treatment D is a binary variable that determines if the observation has the treatment or
not
o D=1 for treated observations and D=0 for control observations
Estimate a probit/logit model for the propensity of observations to be assigned into the treated
group. Use x variables that may affect the likelihood of being assigned into the treated group.
o The propensity score model is a probit/logit model with D as the dependent variable and
x as independent variables.
1| |
o The propensity score is the conditional (predicted) probability of receiving treatment
given pre-treatment characteristics x.
4
Match observations from treated and control groups based on their propensity scores
o Several matching methods are available: kernel, nearest neighbor, radius, stratification
Calculate the treatment effects: compare the outcomes y between the treated and control
observations, after matching
if 1
if 0
o Counterfactual situation: compare the outcome of the treated observations with the
outcome of the treated observations if they were not treated (find a close match using the
control observations and use their outcome)
5
Matching methods explained
Propensity scores for treated and control groups
1
0.8
0.6
0.4
0.2
0
Treated group Control group
Matching methods: for each treated observation i, we need to find matches of control observation(s)
j with similar characteristics.
6
Kernel matching
1
0.8
0.6
0.4
0.2
0
Treated group Control group
1
0.8
0.6
0.4
0.2
0
Treated group Control group
7
Nearest neighbor matching
For each treated observation i, select a control observation j that has the closest x.
min ∥ ∥
Radius matching
Each treated observation i is matched with control observations j that fall within a specified
radius.
∥ ∥
Kernel matching
Each treated observation i is matched with several control observations, with weights inversely
proportional to the distance between treated and control observations.
With matching based on propensity scores, the weights are defined as:
,
∑
8
Matching with common support
1
0.8
0.6
0.4
0.2
0
Treated group Control group
9
Treatment effects
ATE is the difference between the outcomes of treated and control observations.
∆ | , 1 | , 0
A simple t-test between the outcomes for the treated and control groups.
ATE is fine for random experiments but in observational studies, it may be biased if treated
and control observations are not similar.
10
Average treatment effect on the treated (ATET)
ATET is the difference between the outcomes of treated and the outcomes of the treated
observations if they had not been treated.
∆| 1 | , 1 | , 1
The second term is a counterfactual so it is not observable and needs to be estimated.
After matching on propensity scores, we can compare the outcomes of treated and control
observations.
∆| , 1 | , 1 | , 0
Empirical estimation
Each treated observation i is matched j control observations and their outcomes y0 are weighed
by w.
1
, , ,
∈
11
Assumptions
Unconfoundedness assumption
12
Matching or overlap assumption
For each value of x, there are both treated and control observations.
For each treated observation, there is a matched control observation with similar x.
0 1| 1
Balancing condition
13
Difference-in-differences model
The difference-in-differences model is applied when panel data on outcomes are available
before (b) and after (a) the experiment occurs.
The difference-in-differences model is an improvement over the one-period model.
The difference-in-differences average treatment effect on the treated is specified as:
∆ ∆ | 1 | , 1
| , 1 | , 1
The first term refers to the differences in outcomes before and after the treatment for the
treated group. This term may be biased if there are time trends. The second term uses the
differences in outcomes for the control group to eliminate this bias.
To apply the difference-in-differences model: instead of the outcomes for the treated and
control groups, we use the differences in outcomes after the treatment and before the treatment.
The rest of the analysis is the same.
14