Professional Documents
Culture Documents
Multicollinearity
Multicollinearity
Dwelling size
Electricity Income
consumption
2
Multicollinearity
𝑌𝑖 = 𝛽1 + 𝛽2 𝑋2𝑖 + 𝛽3 𝑋3𝑖 + 𝑢𝑖
3
Multicollinearity
𝜆1 𝑋1 + 𝜆2 𝑋2 + ⋯ + 𝜆𝑘 𝑋𝑘 = 0
• Where 𝜆1 , 𝜆2 , … , 𝜆𝑘 are all constant such that not all of them are simultaneously
equal to zero.
4
Multicollinearity
5
Multicollinearity
𝜆1 𝑋1 + 𝜆2 𝑋2 + ⋯ + 𝜆𝑘 𝑋𝑘 + 𝑣𝑖 = 0
6
Multicollinearity
• 𝑋2𝑖 is not an exact linear combination of other 𝑋 given that there is also the
stochastic error term 𝑣𝑖 .
7
Multicollinearity
• Example:
𝑿𝟐 𝑿𝟑 𝑿𝟑∗ 3. No perfect multicollinearity
10 50 52 between 𝑋2 and 𝑋3∗ .
15 75 75
18 90 97
4. However 𝑋2 and 𝑋3∗ are
24 120 129
closely correlated. Their
30 150 152
correlation coefficient is
0.9959.
• This concept does not apply to non-linear relationships among the control
variables.
𝑌𝑖 = 𝛽0 + 𝛽1 𝑋𝑖 + 𝛽2 𝑋𝑖2 + 𝛽3 𝑋𝑖3 + 𝑢𝑖
9
Multicollinearity
10
Multicollinearity
2
σ 𝑦𝑖 𝑥2𝑖 σ 𝑥3𝑖 − σ 𝑦𝑖 𝑥3𝑖 σ 𝑥2𝑖 𝑥3𝑖
𝛽መ2 = 2 2
σ 𝑥2𝑖 σ 𝑥3𝑖 − σ 𝑥2𝑖 𝑥3𝑖 2
2
σ 𝑦𝑖 𝑥3𝑖 σ 𝑥2𝑖 − σ 𝑦𝑖 𝑥2𝑖 σ 𝑥2𝑖 𝑥3𝑖
መ
𝛽3 = 2 2
σ 𝑥2𝑖 σ 𝑥3𝑖 − σ 𝑥2𝑖 𝑥3𝑖 2
11
Estimation under perfect Multicollinearity
2
σ 𝑦𝑖 𝑥2𝑖 𝜆2 σ 𝑥2𝑖 2
− 𝜆 σ 𝑦𝑖 𝑥2𝑖 𝜆 σ 𝑥2𝑖
𝛽መ2 = 2
2 2 2
σ 𝑥2𝑖 𝜆2 σ 𝑥2𝑖 − 𝜆2 σ 𝑥2𝑖
0
𝛽መ2 =
0
For home: Show that 𝛽መ3 is undetermined as well.
12
Estimation under high but imperfect Multicollinearity
𝑥3𝑖 = 𝜆𝑥2𝑖 + 𝑣𝑖
13
Estimation under high but imperfect Multicollinearity
2
σ 𝑦𝑖 𝑥2𝑖 𝜆2 σ 𝑥2𝑖 + σ 𝑣𝑖2 − 𝜆 σ 𝑦𝑖 𝑥2𝑖 + σ 𝑦𝑖 𝑣𝑖 𝜆 σ 𝑥2𝑖
2
𝛽መ2 = 2
2 2
σ 𝑥2𝑖 𝜆2 σ 𝑥2𝑖 + σ 𝑣𝑖2 − 2
𝜆 σ 𝑥2𝑖
14
Practical consequences of multicollinearity
1. Even though the OLS estimators 𝛽መ𝑘 are BLUE, they show large variances and
covariances. Lack of precision in the estimation!
2. Due to 1), the confidence intervals are wider. This means, it is easier not to
reject 𝐻0 : 𝛽መ𝑘 = 0.
3. Due to 1), the statistic 𝑡 from one or more coefficients tend to be non
significant.
4. Despite the 𝑡 of one or more coefficients being non significant, 𝑅2 , can be very
high.
5. The 𝛽መ𝑘 of OLS and their standard errors are very sensible to small changes in
the data.
15
1. OLS estimators with large variances and covariances
−𝑟2,3 𝜎 2
𝑐𝑜𝑣𝑎𝑟 𝛽መ2 , 𝛽መ3 =
2 2 σ 2
(1 − 𝑟2,3 ) σ 𝑥2𝑖 𝑥3𝑖
16
1. OLS estimators with large variances and covariances
• The speed of increase shown by the variances and covariances can be tracked
with the Variance Inflation Factor (VIF):
1
𝑉𝐼𝐹 = 2
(1 − 𝑟2,3 )
2
What happens with VIF as 𝑟2,3 approaches to 1?
VIF raises. This means that the degree of collinearity rises and so does the variance
of the estimator too.
17
1. OLS estimators with large variances and covariances
𝜎2
𝑣𝑎𝑟 𝛽መ2 = 2 𝑉𝐼𝐹
σ 𝑥2𝑖
𝜎2
𝑣𝑎𝑟 𝛽መ3 = 2 𝑉𝐼𝐹
σ 𝑥3𝑖
18
1. OLS estimators with large variances and covariances
𝜎2 1
𝑣𝑎𝑟 𝛽መ𝑘 =
σ 𝑥𝑘2 1 − 𝑅𝑘2
• This is the same:
𝜎2
𝑣𝑎𝑟 𝛽መ𝑘 = 2 𝐹𝐼𝑉𝑘
σ 𝑥𝑘
For home: With 𝜎 2 and 𝐹𝐼𝑉𝑘 constant, What happens with 𝑣𝑎𝑟 𝛽መ𝑘 if the variation of a
control variable is higher?
19
2. Wider confidence intervals
• The confidence interval is given by:
0.50
𝛽መ2 ± 1.96 (1.33) 𝑣𝑎𝑟(𝛽መ2 )
𝛽መ2 − 𝛽2
𝑡=
𝑒𝑒(𝛽መ2 )
21
4. 𝑹𝟐 is high but not that many 𝒕 are significant
• 𝑡 values which are non significant with high 𝑅2 and a significant 𝐹 value: sign of
multicollinearity.
22
5. OLS estimators and their standard errors sensibility to small changes in the data
Model 1
𝑌 = 𝛽1 + 𝛽2 𝑋2𝑖 + 𝛽3 𝑋3𝑖
Model 2
𝑌 = 𝛽1 + 𝛽2 𝑋2𝑖 + 𝛽3 𝑋𝑐𝑖
Observe 𝑋3 and 𝑋𝑐
23
5. OLS estimators and their standard errors sensibility to small changes in the data
Model 1 Model 2
𝑟2,𝑥𝑐 = 0.8285
𝑟2,3 = 0.5523
24
5. OLS estimators and their standard errors sensibility to small changes in the data
𝑟2,𝑥𝑐 = 0.8285
Model 1 Model 2
26
How to detect multicollinearity?
27
REFERENCE
28