Professional Documents
Culture Documents
Synthetic Control Slides
Synthetic Control Slides
Maximilian Kasy
Fall 2022
Outline
• Permutation inference.
• Applications.
1 / 41
Takeaways for this part of class
2 / 41
Introduction
Applications
Requirements
References
Introduction
3 / 41
Introduction
4 / 41
Setup
• Suppose that we observe J + 1 units in periods 1, 2, . . . , T.
• Unit “one” is exposed to the intervention of interest (that is, “treated”) during
periods T0 + 1, . . . , T.
• The remaining J are an untreated reservoir of potential controls (a “donor pool”).
• Let YitN be the outcome that would be observed for unit i at time t in the
absence of the intervention.
• Let YitI be the outcome that would be observed for unit i at time t if unit i is
exposed to the intervention in periods T0 + 1 to T.
• We aim to estimate the effect of the intervention on the treated unit,
for t > T0 , and Y1t is the outcome for unit one at time t.
5 / 41
Setup continued
• Let W = (w2 , . . . , wJ+1 )0 with wj ≥ 0 for j = 2, . . . , J + 1 and w2 + · · · + wJ+1 = 1.
Each value of W represents a potential synthetic control.
• Let Yjt be the value of the outcome for unit j at time t. For a post-intervention
period t (with t ≥ T0 ) the synthetic control estimator is:
J+1
τb1t = Y1t − ∑ wj∗ Yjt .
j=2
6 / 41
Weighted squared error
• Typically,
!1/2
k
2
kX 1 − X 0 Wk = ∑ vh (Xh1 − w2 Xh2 − · · · − wJ+1 XhJ+1 )
h=1
7 / 41
Application: German reunification
(a) (b)
30000
30000
Per Capita GDP (PPP 2002 USD)
20000
5000 10000
5000 10000
0
0
1960 1970 1980 1990 2000 1960 1970 1980 1990 2000
Year Year
West Germany
OECD
Synthetic West Germany
8 / 41
Application: German reunification
Note: First column reports X 1 , second column reports X 0 W ∗ , and last column reports a
simple average for the 16 OECD countries in the donor pool. GDP per capita, inflation rate,
and trade openness are averages for 1981–1990. Industry share (of value added) is the
average for 1981–1989. Schooling is the average for 1980 and 1985. Investment rate is
averaged over 1980–1984.
9 / 41
Application: German reunification
10 / 41
Introduction
Applications
Requirements
References
Factor model and bias bound
• Abadie et al. (2010) establish a bias bound under the factor model
YitN = δt + θ t Z i + λ t µ i + εit ,
11 / 41
Practice problem
Suppose that the factor model holds,
and that the pre-treatment fit of the synthetic control is exact.
Derive an expression for the estimation error for treatment effects.
12 / 41
Solution
If ∑Tt=1
0
λ 0t λ t is nonsingular, then
J+1
Y1tN − ∑ Yjt =
j=2
!−1
J+1 T0 T0
∑ wj∗ ∑ λ t ∑ λ 0n λ n λ 0s (εjs − ε1s )
j=2 s=1 n=1
J+1
− ∑ wj∗ (εjt − ε1t ).
j=2
13 / 41
Fit and validity
• The bias bound is predicated on close fit, and controlled by the ratio between
the scale of εit and T0 .
14 / 41
Fit and validity
• There are no ex-ante guarantees on the fit. If the fit is poor, Abadie et al. (2010)
recommend against the use of synthetic controls.
• In particular, settings with small T0 , large J, and large noise create substantial
risk of overfitting.
• To reduce interpolation biases and risk of overfitting, restrict the donor pool to
units that are similar to the treated unit.
15 / 41
Permutation inference
• Abadie et al. (2010) propose a mode of inference for the synthetic control
framework that is based on permutation methods.
• The effect of the treatment on the unit affected by the intervention is deemed
to be significant when its magnitude is extreme relative to the permutation
distribution.
16 / 41
Permutation inference
• Permutation inference is complicated by the fact that the pre-intervention fit on
the outcome variable may be of different quality for different sample units.
• This can be addressed by using the ratio between post-treatment and
pre-treatment RMSE as a test statistic. Let
!1/2
t2
1 N 2
Rj (t1 , t2 ) = ∑ (Yjt − Ybjt ) ,
t2 − t1 + 1 t=t 1
Rj (T0 + 1, T)
rj = .
Rj (1, T0 )
17 / 41
Introduction
Applications
Requirements
References
Application: German reunification
West Germany ●
Italy ●
Australia ●
Norway ●
Greece ●
Netherlands ●
USA ●
New Zealand ●
Spain ●
Belgium ●
UK ●
Switzerland ●
Japan ●
France ●
Denmark ●
Austria ●
Portugal ●
5 10 15
19 / 41
Application: California tobacco control program
140
California
rest of the U.S.
120
per−capita cigarette sales (in packs)
100
80
60
40
Passage of Proposition 99
20
0
year
20 / 41
Application: California tobacco control program
140
California
synthetic California
120
per−capita cigarette sales (in packs)
100
80
60
40
Passage of Proposition 99
20
0
year
21 / 41
Application: California tobacco control program
30
gap in per−capita cigarette sales (in packs)
20
10
0
−10
−20
Passage of Proposition 99
−30
year
22 / 41
Application: California tobacco control program
(All States in Donor Pool)
California
30
control states
20
10
0
−10
−20
Passage of Proposition 99
−30
year
23 / 41
Application: California tobacco control program
(Pre-Prop. 99 MSPE ≤ 20 Times Pre-Prop. 99 MSPE for CA)
California
30
control states
20
10
0
−10
−20
Passage of Proposition 99
−30
year
24 / 41
Application: California tobacco control program
(Pre-Prop. 99 MSPE ≤ 5 Times Pre-Prop. 99 MSPE for CA)
California
30
control states
20
10
0
−10
−20
Passage of Proposition 99
−30
year
25 / 41
Application: California tobacco control program
(Pre-Prop. 99 MSPE ≤ 2 Times Pre-Prop. 99 MSPE for CA)
California
30
control states
20
10
0
−10
−20
Passage of Proposition 99
−30
year
26 / 41
Application: California tobacco control program
(All 38 States in Donor Pool)
5
4
3
frequency
California
2
1
0
0 20 40 60 80 100 120
28 / 41
Permutation inference
29 / 41
Introduction
Applications
Requirements
References
Why use synthetic controls?
• Compare to linear regression. Let:
• Y 0 be the (T − T0 ) × J matrix of post-intervention outcomes for the units in the
donor pool.
• X 1 and X 0 be the result of augmenting X 1 and X 0 with a row of ones.
b = (X 0 X 0 )−1 X 0 Y 0 collects the coefficients of the regression of Y 0 on X 0 .
• B 0 0
• The components of W reg sum to one, but may be outside [0, 1], allowing
extrapolation.
30 / 41
Application: German reunification
31 / 41
Why use synthetic controls?
32 / 41
Why use synthetic controls?
• Sparsity. Because the synthetic control coefficients are proper weights and are
sparse, they allow a precise interpretation of the nature of the estimate of the
counterfactual of interest (and of potential biases).
33 / 41
Synthetic Control Method: Sparsity
Sparsity: Geometric interpretation
Sparsity comes from projecting X 1 on the convex hull of X0
X 1
XW0
∗
34 / 41
Why use synthetic controls?
• In some cases, especially in applications with many treated units, the values of
the predictors for some of the treated units may fall in the convex hull of the
columns of X 0 .
35 / 41
Introduction
Applications
Requirements
References
Contextual requirements
• Size of the effect and volatility of the outcome. Small effects will be
indistinguishable from other shocks to the outcome of the affected unit,
especially if the outcome variable of interest is highly volatile.
• Do not adopt interventions similar to the one under investigation during the period
of the study.
• Do not suffer large idiosyncratic shocks to the outcome of interest during the
study period.
• Have characteristics similar to the characteristics of the affected unit.
36 / 41
Contextual requirements
• Convex hull condition. Synthetic control estimates are predicated on the idea
that a combination of unaffected units can approximate the pre-intervention
characteristics of the affected unit.
• Time horizon. The effect of some interventions may take time to emerge or to
be of enough magnitude to be quantitatively detected in the data.
37 / 41
Data requirements
38 / 41
Robustness and Diagnosis Checks
• Backdating. Backdating was discussed before as a way to address
anticipation effects on the outcome variable before an intervention occurs. In
the absence of anticipation effects, he same idea can be applied to assess the
credibility of a synthetic control in concrete empirical applications.
35000
Per Capita GDP (PPP 2002 USD)
25000 Reunification backdated
to 1980
15000
5000
West Germany
Synthetic West Germany
0
35000
Per Capita GDP (PPP 2002 USD)
25000
15000
5000
West Germany
synthetic West Germany
synthetic West Germany (leave−one−out)
0
Abadie, A., Diamond, A., and Hainmueller, J. (2010). Synthetic control meth-
ods for comparative case studies: Estimating the effect of california’s to-
bacco control program. Journal of the American Statistical Association,
105(490):493–505
These slides are based on the slides by Alberto Abadie for previous iterations of
14.385.
41 / 41