Professional Documents
Culture Documents
Chapter 12 – Heteroskedasticity
We are going to relax the assumption that variance of error terms are
homoskedastic, i.e.
y = β0 + β1 x1 + β2 x2 + ... + βk xk + u
u 2 = δ0 + δ1 x1 + δ2 x2 + ... + δk xk + v
δ1 = δ2 = ... = δk = 0
Since we never know the actual errors in the population model, but we do have
estimates of them: the OLS residuals, ûi , is an estimate of the error ui for
observation i.
Thus, we can estimate the equation:
Rû22 /k
F = ∼ Fk,n−k−1
(1 − Rû22 )/(n − k − 1)
1 Estimate the model by OLS as usual. Obtain the squared OLS residuals, uˆ2
2 Run the regression of uˆ2 against the independent variables (x1 , x2 , ..., xk ). Keep
the R-squared from the regression, Rû22
3 Form the F statistic and compute the p-value (using the Fk,n−k−1 distribution)
4 If the p-value is sufficiently small, that is, below the chosen significance level,
then we reject the null hypothesis of homoskedasticity
200
100
Residuals
0
-100
We can see that in the residual plot, the errors seem to be more dispersed when
the fitted value becomes bigger
We should then carry out the Breusch-Pagan test for heteroskedasticity
1
.5
Residuals
0
-.5
-1
5 5.5 6 6.5
Fitted values
. predict res2,residual
The p-value of the F-test is 0.245 which means we fail to reject the null
hypothesis of homoskedasticity in the model with logarithmic unctional forms
The difference between the White test and the Breusch-Pagan test is that the
former includes the squares and cross products of the independent variables
To obtain function of all the squares and cross products of the independent
variables, we can square the fitted values
This suggests testing for heteroskedasticity by estimating the equation:
where h(x) is some function of the explanatory variables that determines the
heteroskedasticity
Since variance must be positive, so h(x) > 0 for all possible values of the
independent variables
Suppose the heteroskedasticity takes the following form:
savi = β0 + β1 inci + ui
var (ui |inci ) = σ 2 inci
p 2
E [(ui / hi ] = E (ui2 /hi ) = (σ 2 hi )/hi = σ 2
or
yi∗ = β0 xi0
∗ ∗
+ β1 xi1 + ... + βk xik∗ + ui∗
√
∗ = 1/ h and the other starred variables denote the corresponding
where xi0 i
√
original variables divided by hi
Consider our simple example of savings function, the transformed equation is:
In this example, we estimate equations that explain net total financial wealth in
terms of income
nettfa = β0 + β1 inc + u
We use the data on single people (fsize=1 ) for the above regression: