IV Example
logincome float %9.0g log(income)
ssiratio float %9.0g SSI/Income ratio
------------------------------------------------------------------------------
logmedexpe~e | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
healthinsu | .0749595 .0260124 2.88 0.004 .02397 .125949
illnesses | .440653 .0095721 46.04 0.000 .4218897 .4594162
age | -.0025946 .001879 -1.38 0.167 -.0062777 .0010886
logincome | .0172363 .0137865 1.25 0.211 -.009788 .0442607
_cons | 5.780127 .150891 38.31 0.000 5.48435 6.075903
------------------------------------------------------------------------------
For individuals with health insurance, the predicted medical expenses are 7.8% higher than those
for individuals without health insurance, ceteris paribus.
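The 7.8% figure comes from exponentiating the coefficient on the insurance dummy, since the dependent variable is in logs. A minimal check in Python, using the OLS coefficient reported in the table above; the function name `pct_effect` is only illustrative.

```python
import math

def pct_effect(coef):
    """Exact percentage change in y implied by a dummy's coefficient
    in a regression whose dependent variable is log(y)."""
    return 100.0 * (math.exp(coef) - 1.0)

ols_coef = 0.0749595  # healthinsu coefficient from the OLS table above
print(round(pct_effect(ols_coef), 1))  # 7.8
```

For small coefficients the shortcut 100 * coef (here 7.5%) is close, but the exact transformation matters more as the coefficient grows in absolute value.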
Let ssiratio be an instrument for healthinsu and estimate the model by 2SLS, also reporting the first stage:
First-stage regressions
-----------------------
Number of obs = 10,089
F( 4, 10084) = 185.08
Prob > F = 0.0000
R-squared = 0.0684
Adj R-squared = 0.0680
Root MSE = 0.4691
------------------------------------------------------------------------------
healthinsu | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
illnesses | .011351 .0036336 3.12 0.002 .0042285 .0184736
age | -.0085302 .0007125 -11.97 0.000 -.0099268 -.0071337
logincome | .0544246 .0056429 9.64 0.000 .0433634 .0654858
ssiratio | -.1997539 .0141579 -14.11 0.000 -.2275062 -.1720017
_cons | .9591576 .0568776 16.86 0.000 .8476662 1.070649
------------------------------------------------------------------------------
------------------------------------------------------------------------------
logmedexpe~e | Coef. Std. Err. z P>|z| [95% Conf. Interval]
-------------+----------------------------------------------------------------
healthinsu | -.852201 .1983369 -4.30 0.000 -1.240934 -.4634679
illnesses | .4485123 .0102903 43.59 0.000 .4283437 .4686808
age | -.0117975 .0027882 -4.23 0.000 -.0172622 -.0063327
logincome | .0976929 .0224588 4.35 0.000 .0536744 .1417113
_cons | 6.589839 .2346179 28.09 0.000 6.129996 7.049681
------------------------------------------------------------------------------
Instrumented: healthinsu
Instruments: illnesses age logincome ssiratio
Durbin-Wu-Hausman test of endogeneity:
Tests of endogeneity
Ho: variables are exogenous
Durbin (score) chi2(1) = 25.0914 (p = 0.0000)
Wu-Hausman F(1,10083) = 25.139 (p = 0.0000)
------------------------------------------------------------------------------
healthinsu | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
ssiratio | -.1997539 .0141579 -14.11 0.000 -.2275062 -.1720017
illnesses | .011351 .0036336 3.12 0.002 .0042285 .0184736
age | -.0085302 .0007125 -11.97 0.000 -.0099268 -.0071337
logincome | .0544246 .0056429 9.64 0.000 .0433634 .0654858
_cons | .9591576 .0568776 16.86 0.000 .8476662 1.070649
------------------------------------------------------------------------------
( 1) v1hat = 0
F( 1, 10083) = 25.14
Prob > F = 0.0000
The Durbin-Wu-Hausman test compares the OLS and 2SLS coefficient estimates. The null hypothesis
that the regressor is exogenous is rejected (p < 0.001). Therefore, health insurance is an endogenous
regressor, and we need to use an instrumental variables approach.
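The regression-based form of this test (the "v1hat = 0" restriction in the output above) can be sketched in Python as follows. It uses simulated data, not the handout's dataset; all variable names and parameter values are assumptions chosen for illustration.

```python
import numpy as np

def ols_fit(X, y):
    """OLS coefficients, conventional standard errors, and residuals."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    n, k = X.shape
    sigma2 = resid @ resid / (n - k)
    se = np.sqrt(np.diag(np.linalg.inv(X.T @ X)) * sigma2)
    return beta, se, resid

def endogeneity_F(y, endog, exog, instr):
    """Add the first-stage residual to the structural equation and test
    whether its coefficient is zero (F = t^2 for a single restriction)."""
    Z = np.column_stack([exog, instr])        # exogenous regressors + instrument
    _, _, v1hat = ols_fit(Z, endog)           # first-stage residuals
    X_aug = np.column_stack([endog, exog, v1hat])
    beta, se, _ = ols_fit(X_aug, y)
    t = beta[-1] / se[-1]
    return t ** 2

# Simulated example (hypothetical data-generating process).
rng = np.random.default_rng(0)
n = 5000
z = rng.normal(size=n)                        # instrument
w = rng.normal(size=n)                        # exogenous control
u = rng.normal(size=n)                        # structural error
v = 0.8 * u + rng.normal(size=n)              # first-stage error, correlated with u
x = 0.5 * z + 0.3 * w + v                     # endogenous regressor
y = 1.0 - 0.85 * x + 0.5 * w + u
exog = np.column_stack([np.ones(n), w])

F_stat = endogeneity_F(y, x, exog, z)
print(F_stat)  # large values reject exogeneity of x
```

Because the simulated first-stage error is built to be correlated with the structural error, the statistic here is large; with an exogenous regressor it would be small.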
Instead of using the automatic 2SLS command, estimate the coefficients manually in two steps:
------------------------------------------------------------------------------
healthinsu | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
ssiratio | -.1997539 .0141579 -14.11 0.000 -.2275062 -.1720017
illnesses | .011351 .0036336 3.12 0.002 .0042285 .0184736
age | -.0085302 .0007125 -11.97 0.000 -.0099268 -.0071337
logincome | .0544246 .0056429 9.64 0.000 .0433634 .0654858
_cons | .9591576 .0568776 16.86 0.000 .8476662 1.070649
------------------------------------------------------------------------------
------------------------------------------------------------------------------
logmedexpe~e | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
y2hat | -.8522011 .1868427 -4.56 0.000 -1.21845 -.4859521
illnesses | .4485123 .0096939 46.27 0.000 .4295103 .4675143
age | -.0117975 .0026266 -4.49 0.000 -.0169461 -.0066488
logincome | .0976929 .0211572 4.62 0.000 .0562204 .1391653
_cons | 6.589839 .2210212 29.82 0.000 6.156593 7.023084
------------------------------------------------------------------------------
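After instrumenting, individuals with health insurance are predicted to have medical expenses
57.3% lower than individuals without health insurance, ceteris paribus. Note that the
2SLS coefficient estimate is quite different from the OLS estimate and has the opposite sign.
The manual two-step coefficients match those from the automatic 2SLS command, but the
second-stage standard errors do not (0.187 versus 0.198 for health insurance), because the manual
second stage computes its residuals from the fitted values y2hat rather than from healthinsu.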
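A minimal Python sketch of the two-step procedure on simulated data (not the handout's dataset) shows why the manual second-stage standard errors should not be used. The data-generating process and all names are assumptions for illustration; the corrected variance divides by n, one common large-sample convention.

```python
import numpy as np

def ols(X, y):
    """Return OLS coefficients and residuals."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    return beta, y - X @ beta

def two_step_2sls(y, endog, exog, instr):
    """Manual 2SLS. exog must include the constant.
    Returns coefficients, naive second-stage SEs, and corrected 2SLS SEs."""
    Z = np.column_stack([exog, instr])            # all instruments
    pi, _ = ols(Z, endog)                         # first stage
    endog_hat = Z @ pi                            # fitted endogenous regressor
    X_hat = np.column_stack([endog_hat, exog])    # second-stage regressors
    X = np.column_stack([endog, exog])            # original regressors
    beta, resid_naive = ols(X_hat, y)             # second stage
    n, k = X.shape
    XtX_inv = np.linalg.inv(X_hat.T @ X_hat)
    resid_true = y - X @ beta                     # residuals using the actual regressor
    se_naive = np.sqrt(np.diag(XtX_inv) * (resid_naive @ resid_naive) / (n - k))
    se_2sls = np.sqrt(np.diag(XtX_inv) * (resid_true @ resid_true) / n)
    return beta, se_naive, se_2sls

# Simulated example (hypothetical data-generating process).
rng = np.random.default_rng(0)
n = 5000
z = rng.normal(size=n)
w = rng.normal(size=n)
u = rng.normal(size=n)
v = 0.8 * u + rng.normal(size=n)
x = 0.5 * z + 0.3 * w + v
y = 1.0 - 0.85 * x + 0.5 * w + u
exog = np.column_stack([np.ones(n), w])

beta, se_naive, se_2sls = two_step_2sls(y, x, exog, z)
print(beta[0], se_naive[0], se_2sls[0])
```

The point estimates from the two steps are the 2SLS estimates; only the residual variance, and hence the standard errors, needs to be recomputed with the original regressor.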
Alternatively, let ssiratio and firmlocation be the instruments for healthinsu and estimate by 2SLS:
First-stage regressions
-----------------------
------------------------------------------------------------------------------
healthinsu | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
illnesses | .0117912 .0036286 3.25 0.001 .0046785 .0189039
age | -.0079491 .0007184 -11.06 0.000 -.0093573 -.0065409
logincome | .0509146 .0056665 8.99 0.000 .039807 .0620221
ssiratio | -.1909688 .0142168 -13.43 0.000 -.2188365 -.163101
firmlocation | .1156546 .0200232 5.78 0.000 .0764051 .1549041
_cons | .9124637 .0573591 15.91 0.000 .8000285 1.024899
------------------------------------------------------------------------------
------------------------------------------------------------------------------
logmedexpe~e | Coef. Std. Err. z P>|z| [95% Conf. Interval]
-------------+----------------------------------------------------------------
healthinsu | -.9696236 .1863391 -5.20 0.000 -1.334841 -.6044057
illnesses | .4495077 .0104242 43.12 0.000 .4290766 .4699387
age | -.012963 .002727 -4.75 0.000 -.0183079 -.0076181
logincome | .1078825 .0218155 4.95 0.000 .0651249 .1506401
_cons | 6.692387 .2286487 29.27 0.000 6.244244 7.14053
------------------------------------------------------------------------------
Instrumented: healthinsu
Instruments: illnesses age logincome ssiratio firmlocation
With two instruments instead of one, the coefficient on health insurance changed only slightly,
from -0.852 to -0.970.
Test of overidentifying restrictions:
------------------------------------------------------------------------------
logmedexpe~e | Coef. Std. Err. z P>|z| [95% Conf. Interval]
-------------+----------------------------------------------------------------
healthinsu | -.9696236 .1863391 -5.20 0.000 -1.334841 -.6044057
illnesses | .4495077 .0104242 43.12 0.000 .4290766 .4699387
age | -.012963 .002727 -4.75 0.000 -.0183079 -.0076181
logincome | .1078825 .0218155 4.95 0.000 .0651249 .1506401
_cons | 6.692387 .2286487 29.27 0.000 6.244244 7.14053
------------------------------------------------------------------------------
Instrumented: healthinsu
Instruments: illnesses age logincome ssiratio firmlocation
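With two instruments and one endogenous regressor there is one overidentifying restriction. One standard statistic for testing it is the Sargan statistic: n times the R-squared from regressing the 2SLS residuals on all instruments, compared with a chi-squared distribution with one degree of freedom here. The Python sketch below uses simulated data and illustrative names; it is not the handout's test output.

```python
import numpy as np

def tsls_resid(y, X, Z):
    """2SLS residuals: project X on Z, fit y on the projection,
    then evaluate residuals with the original X."""
    X_hat = Z @ np.linalg.lstsq(Z, X, rcond=None)[0]
    beta = np.linalg.lstsq(X_hat, y, rcond=None)[0]
    return y - X @ beta

def sargan_stat(y, X, Z):
    """n * R^2 from regressing the 2SLS residuals on all instruments.
    Z contains a constant, so the residuals have mean zero and the
    uncentered R^2 used here equals the centered one."""
    u_hat = tsls_resid(y, X, Z)
    fitted = Z @ np.linalg.lstsq(Z, u_hat, rcond=None)[0]
    return len(y) * (fitted @ fitted) / (u_hat @ u_hat)

# Simulated example (hypothetical data-generating process).
rng = np.random.default_rng(0)
n = 5000
z1 = rng.normal(size=n)
z2 = rng.normal(size=n)
w = rng.normal(size=n)
u = rng.normal(size=n)
v = 0.8 * u + rng.normal(size=n)
x = 0.5 * z1 + 0.5 * z2 + 0.3 * w + v
const = np.ones(n)
X = np.column_stack([x, const, w])            # regressors
Z = np.column_stack([const, w, z1, z2])       # instruments

y_valid = 1.0 - 0.85 * x + 0.5 * w + u        # both instruments excluded from y
y_invalid = y_valid + 0.5 * z2                # z2 enters y directly: invalid

stat_valid = sargan_stat(y_valid, X, Z)
stat_invalid = sargan_stat(y_invalid, X, Z)
print(stat_valid, stat_invalid)               # compare each with chi2(1)
```

A large statistic, as in the deliberately invalid case, indicates that at least one instrument is correlated with the structural error.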
------------------------------------------------------------------------------
| Robust
logmedexpe~e | Coef. Std. Err. z P>|z| [95% Conf. Interval]
-------------+----------------------------------------------------------------
healthinsu | -.852201 .2113027 -4.03 0.000 -1.266347 -.4380553
illnesses | .4485123 .0100689 44.54 0.000 .4287776 .468247
age | -.0117975 .0029007 -4.07 0.000 -.0174828 -.0061121
logincome | .0976929 .0233306 4.19 0.000 .0519657 .14342
_cons | 6.589839 .245398 26.85 0.000 6.108867 7.07081
------------------------------------------------------------------------------
Instrumented: healthinsu
Instruments: illnesses age logincome ssiratio
             |            Adjusted      Partial       Robust
    Variable |   R-sq.       R-sq.        R-sq.   F(1,10084)   Prob > F
-------------+------------------------------------------------------------
  healthinsu |  0.0684      0.0680       0.0194       68.881     0.0000
--------------------------------------------------------------------------
------------------------------------------------------------------------------
| Robust
logmedexpe~e | Coef. Std. Err. z P>|z| [95% Conf. Interval]
-------------+----------------------------------------------------------------
healthinsu | -.9696236 .1987108 -4.88 0.000 -1.35909 -.5801575
illnesses | .4495077 .0102219 43.98 0.000 .4294731 .4695422
age | -.012963 .0028378 -4.57 0.000 -.018525 -.007401
logincome | .1078825 .0227967 4.73 0.000 .0632018 .1525632
_cons | 6.692387 .2388115 28.02 0.000 6.224325 7.160449
------------------------------------------------------------------------------
Instrumented: healthinsu
Instruments: illnesses age logincome ssiratio firmlocation
--------------------------------------------------------------------------
The test for weak instruments looks at the first-stage F statistic for the joint significance of the
excluded instruments. The statistic is 69 in the model with one instrument and 59 in the model with
two instruments; both are larger than the rule-of-thumb value of 10. Therefore, the instruments are not weak.
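The first-stage F statistic can be computed by comparing the first-stage regression with and without the excluded instruments. The Python sketch below does this on simulated data with illustrative names. It computes the conventional (non-robust) F, whereas the statistics reported above are robust versions, so it shows the idea rather than reproducing those numbers.

```python
import numpy as np

def ssr(X, y):
    """Sum of squared OLS residuals."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    r = y - X @ beta
    return r @ r

def first_stage_F(endog, exog, instr):
    """F statistic for the joint significance of the excluded instruments
    in the first stage. exog must include the constant."""
    instr = np.asarray(instr).reshape(len(endog), -1)
    q = instr.shape[1]                        # number of excluded instruments
    full = np.column_stack([exog, instr])
    n, k = full.shape
    ssr_r = ssr(exog, endog)                  # restricted: no instruments
    ssr_u = ssr(full, endog)                  # unrestricted: with instruments
    return ((ssr_r - ssr_u) / q) / (ssr_u / (n - k))

# Simulated example (hypothetical data-generating process).
rng = np.random.default_rng(0)
n = 5000
z = rng.normal(size=n)
w = rng.normal(size=n)
v = rng.normal(size=n)
exog = np.column_stack([np.ones(n), w])
x_strong = 0.5 * z + 0.3 * w + v              # instrument matters a lot
x_weak = 0.01 * z + 0.3 * w + v               # instrument barely matters

F_strong = first_stage_F(x_strong, exog, z)
F_weak = first_stage_F(x_weak, exog, z)
print(F_strong, F_weak)                       # compare each with the threshold of 10
```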