Professional Documents
Culture Documents
• Example:
y*: annual family savings
y: reported annual savings
6
where y* is the actual value and (1) satisfies all the CLRM assumptions.
where e0 ~ N (0, e2 ) .
ˆ u2 02 u2
• The variance of the OLS estimator is: Var ( 1 ) 2
i
( x x ) ( xi x ) 2
where x* is the actual value and (3) satisfies the first four CLRM assumptions.
• The OLS estimators from equation (4) is biased and inconsistent because
x and η are correlated:
Cov ( x, ) 0
Example
• We estimate the effect of family income on CAS mark average, after
controlling for A-level grade average:
CAS 0 1 fa min c* 2 Alevel u
where faminc* is actual annual family income.
• because, for example, those with high A-level grade average may report
family income more accurately.
▫ Then, Cov ( x, e1 ) 0 .
13
Ho: β1 = 0
• It can be shown that (the expectation of) the OLS estimator for β1 in
equation (2) is:
E ( ˆ1 ) 1 2
( x x )( z z )
( x x )2 (3)
• (You must be able to) Realise that θ is the slope from the regression
of z on x.
16
ˆ
E ( 1 ) 1 2
( x x )( z z )
1
2 (3)
(x x)
bias
ˆ
E ( 1 ) 1 2
( x x )( z z )
1 (3)
2
( xx )
bias
• Equation (3) tells that the sign of the bias in ̂1 depends on the signs of
both β2 and θ:
• The bias is…
1. positive if β2 > 0 and θ > 0.
2. negative if β2 > 0 and θ < 0.
3. negative if β2 < 0 and θ > 0.
4. positive if β2 < 0 and θ < 0.
18
Example
• Consider again the wage equation [the true model]:
E ( ˆ1 ) 1 2
(educ educ)(abil abil )
(educ educ) 2
bias
bias 2
( x x )( z z ) [from equation (3)]
2
(x x)
• Hypothesis testing, such as t test, is wrong.
• Answer: No!
SST (4)
where SST is the total variation in x and R^2 is the R-squared from the
regression of x on z.
• Comparing (3) and (4) reveals that
Var ( ˆ1 ) F Var ( ˆ1 )T
What is simultaneity?
• Simultaneity is a form of endogeneity that arises when…
▫ one or more of the explanatory variables is jointly determined with the
dependent variable.
hi 2 wi 2 z 2i u 2 i (2)’
• Equations (1)’ and (2)’ constitute a simultaneous equations model.
• hi, wi - called endogenous variables.
• z1i, z2i - called exogenous.
• u1i, u2i - called structural errors.
26
• Recall that, in the equilibrium, the labour demand and supply functions
can be expressed as:
hi 1wi 1 z1i u1i (1)’
hi 2 wi 2 z 2i u2i
(2)’
where z1i and z2i are exogenous, so that each is uncorrelated with u1 and u2.
• To show that wi is correlated with u1i, we solve the two equations for wi,
in terms of z1i, z2i, u1i and u2i.
27
Example
• Consider the following model:
score 0 1skipped 2income u
where score = final exam score, skipped = the total number of lectures missed
and income = a student’s income.
IV estimation
• Now let’s see how an IV can be used to consistently estimate β1 in the
following equation:
y 0 1 x u (1)
• Using (1), the covariance between z (= instrument) and y can be written as:
Cov(z,y) = β1Cov(z,x) + Cov(z,u)
• Estimate (2) using sample analogs of (2). After cancelling the sample sizes
in the numerator and denominator:
n
(z i z )( yi y )
ˆ1 i 1
n
(z
i 1
i z )( xi x )
called the IV estimator.
33
• Suppose now that we have two instruments, z1 and z2 for x. That is, z’s
satisfy:
1. Cov( zi , u ) 0 for i = 1, 2
2. Cov( zi , x) 0 for i = 1, 2
• To find the best IV, choose the linear combination that is most highly
correlated with x (call it x*) given by:
x* 0 1 z1 2 z 2 (1)
• We can estimate (1) by OLS to obtain the fitted values for x*:
xˆ* ˆ 0 ˆ1 z1 ˆ 2 z 2
• Once we have x̂ * , we can use it as the IV for x.
35
Example
• To investigate the effect of education on wage, an economist
estimated model (1) using a sample of men in the US in 1976:
US wage equation
• u contains unobserved factors affecting wage,
▫ e.g., ability, how disciplined workers are, etc.
Q: What do we do??
A: use an instrument for educ.
Summary
4. In the next lecture, we will depart from cross section and time
series econometrics and go through panel data econometrics.