Professional Documents
Culture Documents
Economics of Conflict
Eik Swee
AY2022/2023 Trimester 1
1 / 19
Data & Methods
2 / 19
Data & Methods
3 / 19
Data & Methods
4 / 19
Data & Methods
5 / 19
Data & Methods
6 / 19
Data & Methods
7 / 19
Data & Methods
▶ To see how sample selection can confound causal
interpretations, consider another example in history
▶ During WWII, American aircraft that returned from missions
over German positions were covered in bullet holes along their
main bodies and wings
8 / 19
Data & Methods
9 / 19
Data & Methods
10 / 19
Data & Methods
11 / 19
Data & Methods
12 / 19
Data & Methods
13 / 19
Data & Methods
Proxy controls:
▶ Suppose you have an omitted variable problem i.e. you know
there is an o that you don’t observe, but that cov (x, o) ̸= 0
and cov (y , o) ̸= 0
▶ If you happen to observe a proxy variable for o, then you may
be able to (at least partially) remove omitted variable bias, by
including the proxy as a control variable
▶ This is usually a partial solution because the proxy would have
to completely absorb the correlation between x and o in order
to completely get rid of the omitted variable bias (not
impossible, but highly unlikely)
14 / 19
Data & Methods
Fixed effects:
▶ Again, suppose you have an omitted variable problem
▶ If panel data are available, you will be able to directly
corrected for omitted variable bias that is associated with the
observation unit, by including fixed effects (dummy variables)
for each unit
▶ This is again a partial solution because the the fixed effects
cannot get rid of time-varying omitted variables
▶ That being said, it is nonetheless a very effective method that
does not require you to observe everything about the
observation unit, and still be able to parse out all omitted
variables associated with the it
15 / 19
Data & Methods
Insturmental variable (IV):
▶ Suppose that you observe a z such that
B1. z is partially correlated with x, i.e. cov (z, x) ̸= 0
B2. z is not otherwise correlated with y , i.e. cov (z, ϵ) = 0
▶ Then, you may use z as an instrument for x
▶ The logic being that we can use z to tease out the random
variation in x and use that random variation to identify its
causal effect on y
▶ In practice, 2SLS estimation is the most common way of
operationalising the IV method
▶ The so-called relevance condition B1 is testable, but the
exclusion restriction B2 is usually not testable (unless you
have more instruments than endogenous variables)
16 / 19
Data & Methods
Difference-in-differences (DID):
▶ When panel data are available, and some observations receive
the intervention (treatment) while others do not (control), it
is possible to design a natural experiment to address
non-random treatment
17 / 19
Data & Methods
18 / 19
Data & Methods
19 / 19