Professional Documents
Culture Documents
(A) Simultaneity
For the demand curve shifter we can use the price of substitute
goods (cottonseed)
For the supply curve shifter can use factors that affect costs
(yield per acre) such as weather patterns
Intuitively:
weather related shifts (which shift the supply curve) are used to
trace out the demand curve
Contents
4 Instrumental Variables 2
4 Instrumental Variables
Y = β 0 + β 1X + u (1)
has an endogenous X , that is, whenever Cov (X, u) 6= 0
. reg lwage educ exper expersq black south smsa smsa66 reg661-reg668 ;
------------------------------------------------------------------------------
lwage | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
educ | .0746933 .0034983 21.35 0.000 .0678339 .0815527
exper | .084832 .0066242 12.81 0.000 .0718435 .0978205
Econ 495 - Econometric Review 4
• The main problem is that we have to use common sense and economic
theory to decide if it makes sense to assume Cov (Z, u) = 0
• In the case of multiple instruments, we can use the overid test below
X = π 0 + π 1Z + v (2)
. reg educ nearc4 exper expersq black south smsa smsa66 reg661-reg668 ;
------------------------------------------------------------------------------
educ | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
nearc4 | .3198989 .0878638 3.64 0.000 .1476194 .4921785
exper | -.4125334 .0336996 -12.24 0.000 -.4786101 -.3464566
expersq | .0008686 .0016504 0.53 0.599 -.0023674 .0041046
black | -.9355287 .0937348 -9.98 0.000 -1.11932 -.7517377
south | -.0516126 .1354284 -0.38 0.703 -.3171548 .2139296
smsa | .4021825 .1048112 3.84 0.000 .1966732 .6076918
Econ 495 - Econometric Review 8
. test nearc4;
( 1) nearc4 = 0
F( 1, 2994) = 13.26
Prob > F = 0.0003
Econ 495 - Econometric Review 9
Cov (Z, Y )
so β IV
1 =
Cov (Z, X )
Econ 495 - Econometric Review 10
E (u) = E (Y − β 0 − β 1X ) = m1 = 0
E (Zu) = E [Z (Y − β 0 − β 1X )] = m2 = 0
. ivreg lwage (educ=nearc4) exper expersq black south smsa smsa66 reg661-reg668 ;
------------------------------------------------------------------------------
lwage | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
educ | .1315038 .0549637 2.39 0.017 .0237335 .2392742
exper | .1082711 .0236586 4.58 0.000 .0618824 .1546598
expersq | -.0023349 .0003335 -7.00 0.000 -.0029888 -.001681
black | -.1467757 .0538999 -2.72 0.007 -.2524603 -.0410912
south | -.1446715 .0272846 -5.30 0.000 -.19817 -.091173
smsa | .1118083 .031662 3.53 0.000 .0497269 .1738898
smsa66 | .0185311 .0216086 0.86 0.391 -.0238381 .0609003
Econ 495 - Econometric Review 12
b IV = 0.132 > β
• Notice that β b OLS = 0.075 (see LATE effect below)
educ educ
• But when R2 < 1 in the first stage, IV standard errors are larger than
the OLS
b IV Corr(Z, u) σ u
plimβ 1 = β 1 + · (5)
Corr(Z, X ) σ X
– IV can be very biased (much more than OLS), when the instrument
is not truly exogenous
Y = β 0 + β 1 X 1 + β 2 X 2 + u1 (6)
where X1 is an endogeneous variable and X2 is an exogenous variable
• With more than one instrument, the IV estimator is also called the
two-stage least squares (2SLS) estimator
. reg educ nearc4 nearc2 exper expersq black south smsa smsa66 reg661-reg668 ;
------------------------------------------------------------------------------
educ | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
Econ 495 - Econometric Review 18
. predict peduc;
(option xb assumed; fitted values)
Econ 495 - Econometric Review 19
. test nearc4=nearc2=0;
( 1) nearc4 - nearc2 = 0
( 2) nearc4 = 0
F( 2, 2993) = 7.89
Prob > F = 0.0004
• The ‘2’SLS expression comes from the fact that this estimation strat-
egy is done in two steps
Econ 495 - Econometric Review 21
. ivreg lwage (educ=peduc) exper expersq black south smsa smsa66 reg661-reg668;
------------------------------------------------------------------------------
lwage | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
educ | .1570594 .0525783 2.99 0.003 .0539662 .2601526
exper | .1188149 .0228061 5.21 0.000 .0740977 .1635321
expersq | -.0023565 .0003475 -6.78 0.000 -.0030379 -.0016751
black | -.1232778 .0521501 -2.36 0.018 -.2255313 -.0210242
south | -.1431945 .0284448 -5.03 0.000 -.1989678 -.0874212
smsa | .100753 .0315193 3.20 0.001 .0389512 .1625548
smsa66 | .0150626 .022336 0.67 0.500 -.0287328 .058858
Econ 495 - Econometric Review 22
• While the coefficients are the same, the standard errors from doing
2SLS by hand are incorrect, so let STATA do it for you
Econ 495 - Econometric Review 23
• When we use ivreg2 with the option gmm, STATA will compute 2SLS
residuals in a first step and use these residuals to compute a weighing
matrix that will give the most efficient feasible estimate.
• Thus for overidentified models, the GMM approach makes more ef-
ficient use of the information in the l moment conditions than the
standard 2SLS approach which reduces them to k instrument, and it
is heteroskedascity-efficient.
Econ 495 - Econometric Review 25
. ivreg2 lwage (educ=nearc2 nearc4) exper expersq black south smsa smsa66
reg661-reg668, gmm ;
GMM estimation
--------------
------------------------------------------------------------------------------
| Robust
lwage | Coef. Std. Err. z P>|z| [95% Conf. Interval]
-------------+----------------------------------------------------------------
educ | .1552102 .052387 2.96 0.003 .0525336 .2578867
exper | .1179614 .0228779 5.16 0.000 .0731215 .1628013
expersq | -.0023521 .0003674 -6.40 0.000 -.0030721 -.001632
black | -.1257875 .0514422 -2.45 0.014 -.2266124 -.0249627
Econ 495 - Econometric Review 26
• The idea of the Hausman test is to see if the estimates from OLS and
IV are different
• This test is easily done by including the residual from the first stage
in the OLS regression
. reg lwage educ reseduc exper expersq black south smsa smsa66 reg661-reg668 ;
------------------------------------------------------------------------------
lwage | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
educ | .1570594 .0482814 3.25 0.001 .0623912 .2517275
reseduc | -.0828005 .0484086 -1.71 0.087 -.177718 .0121169
exper | .1188149 .0209423 5.67 0.000 .0777521 .1598776
expersq | -.0023565 .0003191 -7.38 0.000 -.0029822 -.0017308
black | -.1232778 .0478882 -2.57 0.010 -.2171749 -.0293806
south | -.1431945 .0261202 -5.48 0.000 -.1944098 -.0919791
smsa | .100753 .0289435 3.48 0.001 .0440018 .1575042
smsa66 | .0150626 .0205106 0.73 0.463 -.0251538 .0552789
reg661 | -.102976 .0398738 -2.58 0.010 -.1811588 -.0247932
Econ 495 - Econometric Review 30
• Alternatively, you can use the STATA command hausman IV OLS where
the commands est store OLS and est store IV have followed each
estimation command
Econ 495 - Econometric Review 31
Note: the rank of the differenced variance matrix (1) does not equal the number
of coefficients being tested (16); be sure this is what you expect, or
there may be problems computing the test. Examine the output of your
estimators for anything unexpected and possibly consider scaling your
variables so that the coefficients are on a similar scale.
chi2(1) = (b-B)’[(V_b-V_B)^(-1)](b-B)
= 2.92
Prob>chi2 = 0.0873
(V_b-V_B is not positive definite)
• We conclude that educ is correlated with the error terms at the 10%
level of significance.
Econ 495 - Econometric Review 33
• The idea is to regress the predicted residual û1 on all exogenous vari-
ables, including the instrumental variables
. ivreg2 lwage (educ=nearc2 nearc4) exper expersq black south smsa smsa66
reg661-reg668 ;
------------------------------------------------------------------------------
lwage | Coef. Std. Err. z P>|z| [95% Conf. Interval]
-------------+----------------------------------------------------------------
educ | .1570594 .0524383 3.00 0.003 .0542822 .2598366
exper | .1188149 .0227454 5.22 0.000 .0742348 .163395
expersq | -.0023565 .0003466 -6.80 0.000 -.0030358 -.0016772
black | -.1232778 .0520112 -2.37 0.018 -.225218 -.0213376
south | -.1431945 .0283691 -5.05 0.000 -.1987968 -.0875921
smsa | .100753 .0314355 3.21 0.001 .0391406 .1623654
smsa66 | .0150626 .0222765 0.68 0.499 -.0285986 .0587238
reg661 | -.102976 .0433068 -2.38 0.017 -.1878558 -.0180962
reg662 | -.0002286 .0337043 -0.01 0.995 -.0662879 .0658306
reg663 | .0469556 .0325621 1.44 0.149 -.016865 .1107763
reg664 | -.0554084 .0390786 -1.42 0.156 -.132001 .0211842
Econ 495 - Econometric Review 36
. regress res1 exper expersq black south smsa smsa66 reg661-reg668 nearc4 nearc2 ;
------------------------------------------------------------------------------
res1 | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
exper | .0000312 .0070379 0.00 0.996 -.0137685 .0138309
expersq | -3.43e-06 .0003447 -0.01 0.992 -.0006793 .0006724
black | -.0008853 .0196167 -0.05 0.964 -.0393489 .0375784
south | .0011448 .0283118 0.04 0.968 -.0543678 .0566574
smsa | .0006683 .0218892 0.03 0.976 -.0422511 .0435877
smsa66 | -.0005942 .0223401 -0.03 0.979 -.0443978 .0432093
reg661 | .0061073 .0426319 0.14 0.886 -.0774835 .089698
reg662 | .0032348 .0308814 0.10 0.917 -.0573161 .0637857
reg663 | .0060228 .0304496 0.20 0.843 -.0536814 .065727
reg664 | .007307 .0395175 0.18 0.853 -.0701773 .0847913
reg665 | .0054381 .039759 0.14 0.891 -.0725197 .0833959
reg666 | -.0003375 .0438038 -0.01 0.994 -.0862261 .0855511
reg667 | .0052685 .0434338 0.12 0.903 -.0798946 .0904316
reg668 | .0083152 .0512663 0.16 0.871 -.0922056 .108836
Econ 495 - Econometric Review 38
. gen overid2=_N*e(r2);
. di overid2;
1.2481526
. gen pval=chi2tail(1,overid2);
. di pval;
.26390561
. test nearc4=nearc2=0;
( 1) nearc4 - nearc2 = 0
( 2) nearc4 = 0
F( 2, 2993) = 0.62
Econ 495 - Econometric Review 39
. gen Jstat=r(df)*r(F);
. di Jstat;
1.2416182
What are the fundamental causes of the large differences in income per
capita across countries?
One part of the country stagnated under central planning and collective
ownership while the other prospered with private property and a market
economy
At one extreme:
BRA
TUN
ECU
PER
DZA DOM
FJI GTM BLZ
PRY JAM
MAR IDN
8 GUY EGY
SLV
BOL
AGO
LKA
HND
NIC CMR GIN CIV
PAK IND SDN VNMSEN COGMRT GHA
CAF TGO GMB
HTI
LAO
KEN BEN
BGD UGA
ZAR
BFA
TCD NERMDG NGA
BDI
RWA MLI
TZA SLE
ETH
6
4
2 4 6 8
Log of Settler Mortality
Colonies where Europeans faced higher mortality rates are today
substantially poorer than colonies that were healthy for Europeans
Theory implies
SGP USA
HKG CAN
10 AUS
NZL
MLT
BHS CHL
ARG VEN
URY MEX GAB
PAN ZAF
CRI COL MYS
Log GDP per capita, PPP, 1995
TTO BRA
4
4 6 8 10
Average Expropriation Risk 1985-95
USA
SGP
HKG
CAN
10 AUS
NZL
MLT CHLBHS
BRB
MUS ARG
VEN
URY
MEX
MYS
ZAF COL PAN GAB
CRI
TTO
Log GDP per capita, PPP, 1995
BRA
TUN
ECU
PER
DZA DOM
FJI GTM BLZ
PRY JAM
MAR IDN
8 GUY EGY
SLV
BOL
AGO
LKA
HND
NIC CMR GIN CIV
PAK IND SDN VNMSEN COGMRT GHA
CAF TGO GMB
HTI
LAO
KEN BEN
BGD UGA
ZAR
BFA
TCD NERMDG NGA
BDI
RWA MLI
TZA SLE
ETH
6
4
2 4 6 8
Log of Settler Mortality
USA
10 NZL CAN
AUS SGP
Average Expropriation Risk 1985-95
IND GMB
HKG
8 MYS BRA
CHL GAB
MEXBHS IDN
TTO
MLT COL
VEN
MAR
CRI JAM
URY
PRY CIV
TGO
ZAF EGY
ECU TZA GIN
TUNDZA VNM CMR
ARG
DOM GHA
PAK LKA KEN
SEN
6 GUY
ETH PER
PAN SLE
BOL NGA
HND AGO
GTM
BGD NIC
SLV NER
COG
UGA
BFA MDG
SDN MLI
4 HTI
ZAR
2 4 6 8
Log of Settler Mortality