Professional Documents
Culture Documents
Dougherty
Introduction to Econometrics,
5th edition
Chapter heading
Chapter 6: Specification of
Regression Variables
True model
Y 1 2 X 2 u Y 1 2 X 2 3 X 3 u
Yˆ ˆ1 ˆ2 X 2
Correct specification,
ˆ X
3 3
no problems
1
VARIABLE MISSPECIFICATION II: INCLUSION OF AN IRRELEVANT VARIABLE
True model
Y 1 2 X 2 u Y 1 2 X 2 3 X 3 u
Coefficients are
Yˆ ˆ1 ˆ2 X 2 unbiased (in general), Correct specification,
ˆ X
3 3
but inefficient.
Standard errors are
no problems
The effects are different from those of omitted variable misspecification. In this case the
coefficients in general remain unbiased, but they are inefficient. The standard errors
remain valid, but are needlessly large.
2
VARIABLE MISSPECIFICATION II: INCLUSION OF AN IRRELEVANT VARIABLE
Y 1 2 X 2 u
3
VARIABLE MISSPECIFICATION II: INCLUSION OF AN IRRELEVANT VARIABLE
Y 1 2 X 2 u
Rewrite the true model adding X3 as an explanatory variable, with a coefficient of 0. Now
the true model and the fitted model coincide. Hence b2 will be an unbiased estimator of b2
and b3 will be an unbiased estimator of 0.
4
VARIABLE MISSPECIFICATION II: INCLUSION OF AN IRRELEVANT VARIABLE
Y 1 2 X 2 u
u2 1
2ˆ2
X 2 i X 2 1 rX 2 , X 3
2 2
However, the variance of b2 will be larger than it would have been if the correct simple
regression had been run because it includes the factor 1 / (1 – r2), where r is the correlation
between X2 and X3.
5
VARIABLE MISSPECIFICATION II: INCLUSION OF AN IRRELEVANT VARIABLE
Y 1 2 X 2 u
u2 1
2ˆ2
X 2 i X 2 1 rX 2 , X 3
2 2
The estimator b2 using the multiple regression model will therefore be less efficient than the
alternative using the simple regression model.
6
VARIABLE MISSPECIFICATION II: INCLUSION OF AN IRRELEVANT VARIABLE
Y 1 2 X 2 u
u2 1
2ˆ2
X 2 i X 2 1 rX 2 , X 3
2 2
The intuitive reason for this is that the simple regression model exploits the information
that X3 should not be in the regression, while with the multiple regression model you find
this out from the regression results.
7
VARIABLE MISSPECIFICATION II: INCLUSION OF AN IRRELEVANT VARIABLE
Y 1 2 X 2 u
u2 1
2ˆ2
X 2 i X 2 1 rX 2 , X 3
2 2
The standard errors remain valid, because the model is formally correctly specified, but
they will tend to be larger than those obtained in a simple regression, reflecting the loss of
efficiency.
8
VARIABLE MISSPECIFICATION II: INCLUSION OF AN IRRELEVANT VARIABLE
Y 1 2 X 2 u
u2 1
2ˆ2
X 2 i X 2 1 rX 2 , X 3
2 2
These are the results in general. Note that if X2 and X3 happen to be uncorrelated, there will
be no loss of efficiency after all.
9
VARIABLE MISSPECIFICATION II: INCLUSION OF AN IRRELEVANT VARIABLE
The table shows the output from a logarithmic regression of hourly earnings on years of
schooling, years of work experience, and a male dummy variable, using EAWE Data Set 21.
10
VARIABLE MISSPECIFICATION II: INCLUSION OF AN IRRELEVANT VARIABLE
Now age has been added as an explanatory variable. There is no particular reason to
suppose that age is a relevant explanatory variable and its coefficient is small and
insignificant.
11
VARIABLE MISSPECIFICATION II: INCLUSION OF AN IRRELEVANT VARIABLE
Its correlations with S, EXP, and MALE are –0.04, 0.45, and 0.04, respectively.
12
VARIABLE MISSPECIFICATION II: INCLUSION OF AN IRRELEVANT VARIABLE
Its inclusion does not cause the coefficients of those variables to be biased and they are
little changed.
13
VARIABLE MISSPECIFICATION II: INCLUSION OF AN IRRELEVANT VARIABLE
The effect on the standard errors of the coefficients of S and MALE are likewise negligible,
as would be expected, given their very low correlations with AGE.
14
VARIABLE MISSPECIFICATION II: INCLUSION OF AN IRRELEVANT VARIABLE
However, the correlation of EXP with AGE is large enough to cause a substantial increase in
its standard error, reflecting a loss of efficiency. Both point estimates of the coefficient of
EXP will be unbiased, but that in the first regression will tend to be closer to the true value.
15
Copyright Christopher Dougherty 2016.
Individuals studying econometrics on their own who feel that they might benefit
from participation in a formal course should consider the London School of
Economics summer school course
EC212 Introduction to Econometrics
http://www2.lse.ac.uk/study/summerSchools/summerSchool/Home.aspx
or the University of London International Programmes distance learning course
EC2020 Elements of Econometrics
www.londoninternational.ac.uk/lse.
2016.05.04