Professional Documents
Culture Documents
Karim Nchare
January 2020
Correlation does not imply causation!!
Correlation does not imply causation!!
Correlation does not imply causation!!
Definitions of internal and external validity
Yi = β0 + β1 X1i + ui
Yi = β0 + β1 X1i + ui , ui = β2 X2i + wi
I we then have
cov (Yi , X1i ) cov (ui , X1i )
plim β̂1 = = β1 +
n→∞ var (X 1i ) var (X1i )
cov (β2 X2i + wi , X1i ) cov (X2i , X1i )
= β1 + = β1 + β2
var (X1i ) var (X1i )
cov (X2i , X1i )
plim β̂1 = β1 + β2
n→∞ var (X1i )
Omitted variables
I An omitted variable X2i leads to an inconsistent OLS estimate
of the causal effect of X1i if:
I The omitted variable X2i is a determinant of the dependent
variable Yi (β2 6= 0), and
I he omitted variable X2i is correlated with the regressor of
interest X1i (cov (X2i , X1i ) 6= 0)
I If there exists 1 or more variables that satisfy both conditions,
I The OLS estimator does not provide a unbiased and consistent
estimate of the causal effect of X1i
I the OLS regression is not internally valid
I The sign of the bias is given by the sign of β2 cov (X2i , X1i )
I If β2 cov (X2i , X1i ) < 0 we say that OLS underestimates the
causal effect of X1i
I If β2 cov (X2i , X1i ) > 0 we say that OLS overestimates the
causal effect of X1i
I Example: In the returns to education, ability is an omitted
variable.
Functional form misspecification
I Suppose that the true population regression model is
Yi = β0 + β1 X1i + ui , ui = β2 X1i2 + wi
I we then have
cov (Yi , X1i ) cov (ui , X1i )
plim β̂1 = = β1 +
n→∞ var (X1i ) var (X1i )
2
cov (β2 X1i + wi , X1i )
= β1 +
var (X1i )
cov (X1i2 , X1i )
plim β̂1 = β1 + β2
n→∞ var (X1i )
X̃1i = X1i + ωi
Yi = β0 + β1 X̃1i − β1 ωi + ui
Ỹi = Yi + ηi
I and we estimate
Ỹi = β0 + β1 X1i + ei , ei = ui + ηi
Cov (ui , ui ) 6= 0, i 6= j