Professional Documents
Culture Documents
8 - HETEROSKEDASTICITY 3-27-24
CHAPTER 8
HETEROSKEDASTICITY
We’re now going to consider situations in which one (or more) of the basic
assumptions of the LRM is not present. Two situations we consider involve (i) the
homoscedasticity assumption failing (we call this a situation in which we have
heteroscedasticity), and (ii) the random sampling assumption that the disturbances
are independent of each other fails (the situation we will consider is called serial
correlation). This chapter focuses on the first situation.
Heteroskedasticity: return to
Assumption MLR. 5 Homoskedasticity (7th: 88) “The error u has the same
variance given any value of the explanatory variables. In other words, Var(u|
x1,..,xk) = σ2.”
Recall that it implied, graphically ….
The question we consider in this chapter is what happens to our analysis when we
are presented with a situation in which variances differ across observations?
[Note the subscript: it implies that the variance can be different for different
observations (notation is important in econometrics.)]
We will assume that all of the other assumptions of the LRM apply.
Importantly, we still have a linear regression [put up the LRM equation], still
assume that the expected value of the disturbance conditional on x equals zero, and
so on.
2
ECON335 CH.8 - HETEROSKEDASTICITY 3-27-24
1) Can we still use the OLS estimator as an estimator of the parameters of the
model? Yes.
2) Do we have to change any part of our prior analysis of the OLS estimator?
Yes, the std errors.
3) Is the OLS estimator still the BLUE of the population parameters? No.
4) What is the BLUE of the population parameters? Called the GLS (or WLS)
estimator.
5) How do we estimate the WLS model? What are the relevant R commands?
Hypothesis Testing
Because we do not have to address any of these questions if heteroscedasticity is
not present in data, it makes sense to first consider whether we can perform
hypothesis tests to determine its presence. If it is not present, then we can rely on
the analysis we undertook in ECON335 (with respect to the Multiple Linear
Regression (MLR) model).
The null hypothesis in any test will be that homoscedasticity exists; in other words,
3
ECON335 CH.8 - HETEROSKEDASTICITY 3-27-24
H0: Var(ui|x1i, x2i … xki) = σ2 [Why does this imply homoscedasticity? The variance
doesn’t depend on i.]
Some theoretical econometricians noted that the zero conditional mean and
homoskedasticity assumptions regarding the disturbances (MLR4 & 5) imply that
We can interpret this result as implying that the variance of u does not depend on
any of the x’s; it is constant across the x’s. This observation leads to various
heteroscedasticity tests.
4
ECON335 CH.8 - HETEROSKEDASTICITY 3-27-24
What would a null hypothesis of homoscedasticity imply with respect to (wrt) the
parameters? Think of the null: it implies that the x’s don’t affect the variance. If
they don’t affect the variance, what would have to be true of the parameters?
[only δ0 is non-zero]
HO: δ1 = δ2 = … = δk
HA: HO is not true.
Given what we have learned in this class, we know how to undertake this
hypothesis test.
What would we call the test we just proposed? A test of the overall significance of
the regression.
Problem with the proposed test: don’t have the u’s for individual observations.
5
ECON335 CH.8 - HETEROSKEDASTICITY 3-27-24
What can we do? We will see that it turns out that the OLS estimator is a consistent
estimator of the parameters (betas) of the LRM with heteroscedasticity.
Since the betas are consistent estimators of the population parameters, this is a
consistent estimator of the disturbance.
So, we resolve the problem by first estimating the model using the OLS estimator,
and obtaining the residuals.
Hypothesis Testing Steps for the B-P test using the F statistic [7th: 270]
(1) Estimate the econometric model using OLS and obtain residuals.
u^ 2 = δ0 + δ1•x1 + … + δk•xk + v
(3) Propose a null hypothesis that all “slope” parameters (parameters other than
the intercept) equal zero & establish a level of significance (5%).
(4) Estimate the model and calculate the R2 for this regression: R2u.
6
ECON335 CH.8 - HETEROSKEDASTICITY 3-27-24
(5) Calculate the F statistic for testing the overall significance of the regression
& its prob value.
(6) If the probability of the statistic is less than the level of significance then
reject the null hypothesis. Or, if the value of the sample statistic – F s – is
greater than the critical value of the statistic then reject the null hypothesis.
Note: one does not have to use the exact variables in the regression equation of
interest. One could use a subset of the x’s. Another possibility is to use the
predicted values from the OLS regression.
7
ECON335 CH.8 - HETEROSKEDASTICITY 3-27-24
(2) Propose estimating the following equation (which includes all of the
independent variables, squares of independent variables & cross-products of
independent variables).
(5) Estimate equation (I) and obtain its R2. Then, calculate
If the probability of n•R2 (given HO) is less than the level of significance of the test
we reject the NH & conclude that heteroscedasticity is present.
[Derive the equation which will be estimated & the tested hypotheses ….]
u^ 2 = ….
Note that the square of a dummy variable = the d.v. So, can’t include the square.
(2) A shorter way to test the same thing involves using predicted values from
the OLS regression. We estimate
u^ 2 = δ0 + δ1• ^y + δ2• ^y 2 + v
9
ECON335 CH.8 - HETEROSKEDASTICITY 3-27-24
For this test the null hypothesis is δ1 = δ2 = 0. It has 2 d.f., a lot fewer than if we
included all of the variables, their squares, and their cross products.
Note: this can be done with the B-P test for a subset of the explanatory variables.
The first place we might look is the OLS estimator. Can we still rely on the OLS
estimator to make inferences about the population parameters (the betas)? Let’s
take a look at this.
10
ECON335 CH.8 - HETEROSKEDASTICITY 3-27-24
Theorem 3.2 (7th: 88) Sampling Variances of the OLS Slope Estimators
where, as we know, SSTj is the sum of squared deviations of xj from its mean and
R j is the coefficient of determination from a regression of xj on all of the other
2
independent variables.
Note how the variance of the OLS estimator of beta depends on σ2, the assumed
constant variance of the disturbance. Because we are now assuming that the
variance can differ across observations, it is clear that this formula cannot work for
the model with heteroskedasticity.
So, the OLS formula for standard errors when we assume homoscedasticity are
wrong.
[Implication wrt R: if you use the “lm” command and don’t account for the
possibility of different variances, the standard errors you get will be incorrect.]
11
ECON335 CH.8 - HETEROSKEDASTICITY 3-27-24
To get an idea of the impact of heteroscedasticity on the formula for the variance
of ^β 1 let’s consider the formula for the variance of β1 in simple linear regression
case.
SLR: y = β0 + β1•x1 + u
Note: it weighs each variance by the degree to which it deviates from the overall
mean of x1.
So, if we are to use the OLS estimator, we need to make sure that we use the
correct formula for the variance of the estimator.
☞ Issue: there’s one problem with [8.2]: we don’t know the σ i . Is all lost? No.
2
It has been shown that we can replace the σ 2i with the squared residuals from an
OLS regression.
So, we estimate the model using OLS and get the residuals from the regression: u^ i2
12
ECON335 CH.8 - HETEROSKEDASTICITY 3-27-24
Wooldridge (7th: 264 eq. (8-4)) identifies the formula for the multiple linear
regression case.
Hypothesis Testing
t-Test(7th: 265): all is the same except that we use the new standard error of the
parameter estimate.
13
ECON335 CH.8 - HETEROSKEDASTICITY 3-27-24
Generally, it’s called the Generalized Least Squares (GLS) estimator. In the
context of the presence of heteroscedasticity. It is called the Weighted Least
Squares (WLS) Estimator.
The general idea behind identification of the minimum variance estimator is to re-
characterize the model as one in which all of the assumptions of the MLR model
hold. Specifically, we transform the model into one in which homoscedasticity is
present. Once we have homoscedasticity, we can rely on the Gauss-Markov
Theorem & all of the desirable properties of the OLS estimators. That’s what we
will do.
We make assumptions MLR.1 to 4 and assume that the disturbances can have
different variances.
Because we are going to allow the variances of the disturbances to vary across
observations, I will represent the linear model by adding subscripts for
observations; i.e.,
yi = β0 + β1∙x1i + β2∙x2i + … + βk∙xki + ui
E(yi /σi) = β0/σi + β1∙x1i /σi + β2∙x2i /σi + … + βk∙xki /σi + E(ui/σi)
So, we have a random term with mean = zero & a constant variance = 1.
Since the variance of the random term is constant w/ mean = 0, we have a model in
which homoskedasticity is present. Further, all of the other assumptions of the
LRM apply.
What is the efficient estimator for this model? The OLS estimator!
That’s the idea behind the GLS estimator and, for heteroscedasticity, the WLS
estimator.
15
ECON335 CH.8 - HETEROSKEDASTICITY 3-27-24
We transform the data and then estimate the model using the OLS estimator. So,
how do we implement the estimator?
Interpretation of GLS Estimator: why is it called a WLS Estimator? (7th: 275)
In order to interpret GLS estimation, let’s set up the basic OLS estimation problem.
Recall that we obtain the OLS estimator by minimizing the sum of squared
residuals (7th: 70); i.e.,
^β 0 … ^β k
^β 0 … ^β k
If we let wi = 1/σi2.
^β 0 … ^β k
If we contrast this minimization with that for the LRM model, we see that the
difference lies in the wi.
16
ECON335 CH.8 - HETEROSKEDASTICITY 3-27-24
☞ Discuss: the relationship between σi2 & the size of the weight. Graphically ….
OLS is not efficient because it does not take into account the differences in the
variances.
Letting x = [x1 … xk]′, suppose that the variance of u (conditional on x) takes the
following form:
V(u|x) = σ2h(x).
This gets us a homoscedastic disturbance. So, we can estimate equation (III) using
the OLS estimator.
There are some cases in which we have to obtain estimates of the individual
variances and other cases where we do not. We will consider the latter first.
How would we implement the GLS estimator? We know that we have to divide all
variables (including the intercept) by the square root of h( ).
18
ECON335 CH.8 - HETEROSKEDASTICITY 3-27-24
So, we divide y, the intercept and income by (income)1/2. Go through the detail and
note that there is no intercept (column of ones).
And estimate the model using the OLS estimator (with no intercept).
Notes:
( ) In this type of case, we don’t have to obtain estimates of the individual
variances; we just need to have the x variables.
( ) Remember that the GLS estimator is efficient. So, if the form of the variance is
correct, the estimates are efficient.
19
ECON335 CH.8 - HETEROSKEDASTICITY 3-27-24
Why have the exponential function? Note that variances can’t be negative. If we
ran a linear regression we could get negative predicted values. With the
exponential function we can’t get a negative value.
Issue: don’t have the u’s. We should we do? Because the OLS estimator is
unbiased, we can use residuals from an OLS regression: u^ i2. We have
Issue: how do we estimate the model? It’s not linear in the parameters (because
they are in the power of the exponential function). Taking the natural log, we get
Issues/Notes:
20
ECON335 CH.8 - HETEROSKEDASTICITY 3-27-24
(1) We use FGLS to obtain estimates of the parameters. The fact that the
independent variables (incl. the intercept) are divided by an h( ) function does not
change this result. The transformation gets us efficient estimates of the parameters
of the underlying linear model.
(2) Alternative FGLS model: ln(u^ i2) = δ0 + δ1• ^y + δk• ^y 2 + v [7th: 279 eq 8.34]
(3) [7th: 281] What if the OLS and GLS estimates are notably different (e.g.,
different signs and both s.s.). Since both estimators are consistent this suggests that
there might be some other problem with the regression specification (i.e., some
other LRM assumption has failed, such as the form of the CEF).
If we simply use an h(x) function, the estimates are unbiased because any function
of the x is uncorrelated with the disturbance.
21
ECON335 CH.8 - HETEROSKEDASTICITY 3-27-24
What about standard errors in FGLS? They are biased but we will be okay if we
obtain robust standard errors [7th: 282]. So, suggestion would be to calculate robust
standard errors always.
Is the variance constant? Nope, it depends on the values of x. So, the LPM is
inherently a heteroskedastic model.
What to do? Get estimates of the variances. We know that the predicted values
from an OLS regression are the probabilities. So,
^
V ( yi∨xi) = ^y i•(1- ^y i) = h^ i [8-47]
Issue: what happens if get a predicted value outside of the unit interval? [7th 285]
(2) Use OLS and compute heteroscedasticity robust standard errors. This is
probably preferred.
2) If the predicted values lie between 0 & 1, you can use the MLR model.
4) Wrt WLS, construct the estimated variances from [8.47] (7th: 285)
23