Professional Documents
Culture Documents
11/01/93
Date
9/01/97
The linear paradigm is a useful one. Many apparently non-linear relationships can be made linear by a suitable transformation. On the other hand, it is likely that many relationships in finance are intrinsically non-linear.
There are many types of non-linear models, e.g. - ARCH / GARCH - switching models - bilinear models
Heteroscedasticity Revisited
An example of a structural model is yt = F1 + F2x2t + F3x3t + F4x4t + u t 2 with ut b N(0,W u ). The assumption that the variance of the errors is constant is known as 2 homoscedasticity, i.e. Var (ut) = W u . What if the variance of the errors is not constant? - heteroscedasticity - would imply that standard error estimates could be wrong. Is the variance of the errors likely to be constant over time? Not for financial data.
Introductory Econometrics for Finance Chris Brooks 2002 7
W t ! E0 E1ut21
vt b N(0,1)
The two are different ways of expressing exactly the same model. The first form is easier to understand while the second form is required for simulating from an ARCH model, for example.
10
4. The null and alternative hypotheses are H0 : K1 = 0 and K2 = 0 and K3 = 0 and ... and Kq = 0 H1 : K1 { 0 or K2 { 0 or K3 { 0 or ... or Kq { 0. If the value of the test statistic is greater than the critical value from the G2 distribution, then reject the null hypothesis. Note that the ARCH test is also sometimes applied directly to returns instead of the residuals from Stage 1 above.
12
How do we decide on q? The required value of q might be very large Non-negativity constraints might be violated. When we estimate an ARCH model, we require Ei >0 i=1,2,...,q (since variance cannot be negative) A natural extension of an ARCH(q) model which gets around some of these problems is a GARCH model.
13
Generalised ARCH (GARCH) Models Due to Bollerslev (1986). Allow the conditional variance to be dependent upon previous own lags The variance equation is now (1) Wt2 = E0 + E1 ut21 +FWt-12 This is a GARCH(1,1) model, which is like an ARMA(1,1) model for the variance equation. We could also write Wt-12 = E0 + E1 ut22 +FWt-22
An infinite number of successive substitutions would yield Wt2 = E0 (1+F +F2+...) + E1 ut21 (1+FL+F2L2+...) + FgW02 So the GARCH(1,1) model can be written as an infinite order ARCH model. We can again extend the GARCH(1,1) model to a GARCH(p,q):
Wt2 = E0+E1 ut21 +E2 ut2 2 +...+Eq ut2 q +F1Wt-1 2+F2Wt-22+...+FpWt-p 2 Wt2 = E 0 E i u
i !1 q 2 t i
Wt2 =E0 + E1 ut21 +E0F + E1 F ut22 +F2(E0 + E1 ut23 +FWt-32 ) Wt2 = E0 + E1 ut21 +E0F + E1F ut22 +E0F2 + E1F2 ut23 +F3Wt-32 Wt2 = E0 (1+F +F2) + E1 ut21 (1+FL+F2L2 ) + F3Wt-32
F jW t j
j !1
15
But in general a GARCH(1,1) model will be sufficient to capture the volatility clustering in the data. Why is GARCH Better than ARCH? - more parsimonious - avoids overfitting - less likely to breech non-negativity constraints
16
when E1 F < 1
E1 F u 1 is termed non-stationarity in variance
E1 F = 1
For non-stationarity in variance, the conditional variance forecasts will not converge on their unconditional value as the horizon increases.
Introductory Econometrics for Finance Chris Brooks 2002 17
18
1.
2.
3. The computer will maximise the function and give parameter values and their standard errors
Introductory Econometrics for Finance Chris Brooks 2002 19
Successive values of yt would trace out the familiar bell-shaped curve. Assuming that ut are iid, then yt will also be iid.
20
(2)
(3)
21
1 T ( y t F 1 F 2 xt ) 2 exp LF ( F 1 , F 2 , W ) ! T T 2 t !1 W2 W ( 2T )
2
(4)
Maximum likelihood estimation involves choosing parameter values (F1, F2,W2) that maximise this function. We want to differentiate (4) w.r.t. F1, F2,W2, but (4) is a product containing T terms.
Introductory Econometrics for Finance Chris Brooks 2002 22
Then, using the various laws for transforming functions containing logarithms, we obtain the log-likelihood function, LLF: 1 T ( y t F 1 F 2 xt ) 2 T LLF ! T log W log(2T ) 2 2 t !1 W2 which is equivalent to T T 1 T ( y t F 1 F 2 xt ) 2 LLF ! log W 2 log(2T ) 2 2 2 t !1 W2 Differentiating (5) w.r.t. F1, F2,W2, we obtain ( y t F 1 F 2 xt ).2. 1 1 xLLF ! 2 xF 1 W2
Introductory Econometrics for Finance Chris Brooks 2002
(5)
(6)
23
( y F F x ) ! 0 y F F x ! 0 y TF F x ! 0
t 1 2 t
t 1 2 t t 1 2 t
1 T
F1 ! y F 2 x
1 yt F1 F 2 T
!0
(9)
24
( y F F x ) x ! 0 y x F x F x ! 0 y x F x F x ! 0 F x ! y x ( y F x ) x F x ! y x Tx y F Tx F 2 ( xt2 Tx 2 ) ! y t xt Tx y
t 1 2 t t
t t 1 t
t
2 t
2 t
2 t
2 t
F2
y x Tx y ! ( x Tx )
t t 2 t 2
(10)
T 1 ! 4 From (8), 2 W W
(y
F 1 F 2 xt ) 2
25
(11)
How do these formulae compare with the OLS estimators? (9) & (10) are identical to OLS (11) is different. The OLS estimator was
W2 ! 1 ut2 Tk
Therefore the ML estimator of the variance of the disturbances is biased, although it is consistent. But how does this help us in estimating heteroscedastic models?
Introductory Econometrics for Finance Chris Brooks 2002 26
Unfortunately, the LLF for a model with time-varying variances cannot be maximised analytically, except in the simplest of cases. So a numerical procedure is used to maximise the log-likelihood function. A potential problem: local optima or multimodalities in the likelihood surface. The way we do the optimisation is: 1. Set up LLF. 2. Use regression to get initial guesses for the mean parameters. 3. Choose some initial guesses for the conditional variance parameters. 4. Specify a convergence criterion - either by criterion or by value.
Introductory Econometrics for Finance Chris Brooks 2002 27
Recall that the conditional normality assumption for ut is essential. We can test for normality using the following representation ut = vtWt vt b N(0,1) u vt ! t W t ! E 0 E1ut21 E 2W t21 Wt
ut Wt
Are the vt normal? Typically vt are still leptokurtic, although less so than the ut . Is this a problem? Not really, as we can use the ML with a robust variance/covariance estimator. ML with robust standard errors is called QuasiMaximum Likelihood or QML.
Introductory Econometrics for Finance Chris Brooks 2002 28
Since the GARCH model was developed, a huge number of extensions and variants have been proposed. Three of the most important examples are EGARCH, GJR, and GARCH-M models. Problems with GARCH(p,q) Models: - Non-negativity constraints may still be violated - GARCH models cannot account for leverage effects Possible solutions: the exponential GARCH (EGARCH) model or the GJR model, which are asymmetric GARCH models.
29
log(W t ) ! [ F log(W t 1 ) K
u t 1 W t 1
2
u 2 t 1 E T W 2 t 1
Advantages of the model - Since we model the log(Wt2), then even if the parameters are negative, Wt2 will be positive. - We can account for the leverage effect: if the relationship between volatility and returns is negative, K, will be negative.
Introductory Econometrics for Finance Chris Brooks 2002 30
31
Using monthly S&P 500 returns, December 1979- June 1998 Estimating a GJR model, we obtain the following results.
yt ! 0.172 (3.198)
W t ! 1.243 0.015u t21 0.498W t 1 0.604u t21 I t 1 (16.372) (0.437) (14.999) (5.772)
2 2
32
The news impact curve plots the next period volatility (ht) that would arise from various positive and negative values of ut-1, given an estimated model. News Impact Curves for S&P 500 Returns using Coefficients from GARCH and GJR 0.14 Model Estimates: GARCH
GJR 0.12
0.1
0.08
0.06
0.04
0.02
0 -1 -0.9 -0.8 -0.7 -0.6 -0.5 -0.4 -0.3 -0.2 -0.1 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Value of Lagged Shock
33
GARCH-in Mean
We expect a risk to be compensated by a higher return. So why not let the return of a security be partly determined by its risk? Engle, Lilien and Robins (1987) suggested the ARCH-M specification. A GARCH-M model would be yt = Q + HW t-1+ ut , ut b N(0,Wt2)
35
Given, W how is W , the 2-step ahead forecast for W2 made at time T, calculated? Taking the conditional expectation of the second equation at the bottom of slide 36: 2 2 f 2 W 2f,T = E0 + E1E( u T 1 ;T) +F W 1,T 2 2 where E( uT 1 ;T) is the expectation, made at time T, of uT 1, which is the squared disturbance term.
Introductory Econometrics for Finance Chris Brooks 2002 37
f 2 2 ,T
! E 0 (E 1 F ) i 1 (E 1 F ) s 1 h1f,T
i !1
s 1
38
3. Dynamic hedge ratios The Hedge Ratio - the size of the futures position to the size of the underlying exposure, i.e. the number of futures contracts to buy or sell per unit of the spot good.
Introductory Econometrics for Finance Chris Brooks 2002 39
W F ,t
40
Consider a single parameter, U to be estimated, Denote the MLE as U ~ and a restricted estimate as U .
41
We estimate the model imposing the restriction and observe the maximised LLF falls to 64.54. Can we accept the restriction? LR = -2(64.54-66.85) = 4.62. The test follows a G2(1) = 3.84 at 5%, so reject the null. Denoting the maximised value of the LLF by unconstrained ML as L(U ) ~ and the constrained optimum as L(U ) . Then we can illustrate the 3 testing procedures in the following diagram:
Introductory Econometrics for Finance Chris Brooks 2002 43
L U
LU
B ~ U
~ LU
44
We know at the unrestricted MLE, L(U ), the slope of the curve is zero.
But is it significantly steep at L(U ) ? This formulation of the test is usually easiest to estimate.
Introductory Econometrics for Finance Chris Brooks 2002 45
The Models
The Base Models For the conditional mean
R Mt R Ft ! P 0 P1 ht u t
ht ! E 0 E 1u t21 F 1 ht 1
u t 1 ht 1
1/ 2 u 2 t 1 K ) ht 1 T
or ln(ht ) ! E 0 F 1 ln(ht 1 ) E 1 (U
where RMt denotes the return on the market portfolio RFt denotes the risk-free rate ht denotes the conditional variance from the GARCH-type models while Wt2 denotes the implied variance from option prices.
Introductory Econometrics for Finance Chris Brooks 2002 47
(5)
We are interested in testing H0 : H = 0 in (4) or (5). Also, we want to test H0 : E1 = 0 and F1 = 0 in (4), and H0 : E1 = 0 and F1 = 0 and U = 0 and K = 0 in (5).
Introductory Econometrics for Finance Chris Brooks 2002 48
(5)
49
Notes: t-ratios in parentheses, Log-L denotes the maximised value of the log-likelihood function in each case. G2 denotes the value of the test statistic, which follows a G2(1) in the case of (8.81) restricted to (8.79), and a G2 (2) in the case of (8.81) restricted to (8.81d Source: Day and Lewis (1992). ). Reprinted with the permission of Elsevier Science.
Introductory Econometrics for Finance Chris Brooks 2002 50
u t 1 ht 1 u t 1 ht 1
F1
K K
u t 1 ht 1 u t 1 ht 1
U
(8.82d ) Log-L
G2 8.09 30.89
Notes: t-ratios in parentheses, Log-L denotes the maximised value of the log-likelihood function in each case. G2 denotes the value of the test statistic, which follows a G2(1) in the case of (8.82) restricted ). to (8.80), and a G2 (2) in the case of (8.82) restricted to (8.82d Source: Day and Lewis (1992). Reprinted with the permission of Elsevier Science.
51
(8.83)
R2 0.094 0.024 0.039 0.018 0.022 0.008 0.037 0.026
Notes: Historic refers to the use of a simple historical average of the squared returns to forecast volatility; t-ratios in parentheses; SR and WV refer to the square of the weekly return on the S&P 100, and the variance of the weeks daily returns multiplied by the number of trading days in that week, respectively. Source: Day and Lewis (1992). Reprinted with the permission of Elsevier Science.
54
Encompassing Test Results: Do the IV Forecasts Encompass those of the GARCH Models?
2 2 2 2 W t21 ! b0 b1W It b2W Gt b3W Et b4W Ht \ t 1
(8.86)
b3 b4 0.123 (7.01) 0.118 (7.74) R2 0.027 0.038
Forecast comparison Implied vs. GARCH Implied vs. GARCH vs. Historical Implied vs. EGARCH Implied vs. EGARCH vs. Historical GARCH vs. EGARCH
b0 -0.00010 (-0.09) 0.00018 (1.15) -0.00001 (-0.07) 0.00026 (1.37) 0.00005 (0.37)
0.176 (0.27) -
0.026 0.038
-0.001 (-0.00)
0.018
Notes: t-ratios in parentheses; the ex post measure used in this table is the variance of the weeks daily returns multiplied by the number of trading days in that week. Source: Day and Lewis (1992). Reprinted with the permission of Elsevier Science. Introductory Econometrics for Finance Chris Brooks 2002 55
Conclusions of Paper
Within sample results suggest that IV contains extra information not contained in the GARCH / EGARCH specifications. Out of sample results suggest that nothing can accurately predict volatility!
56
h11t Ht ! h21t
h12 t h22 t
Such a model would be hard to estimate. The diagonal VECH is much simpler and is specified, in the 2 variable case, as follows:
h11t ! E 0 E 1u12t 1 E 2 h11t 1 2 h22t ! F 0 F 1u 2t 1 F 2 h22t 1
An Example: Estimating a Time-Varying Hedge Ratio for FTSE Stock Index Returns (Brooks, Henry and Persand, 2002).
Data comprises 3580 daily observations on the FTSE 100 stock index and stock index futures contract spanning the period 1 January 1985 - 9 April 1999. Several competing models for determining the optimal hedge ratio are constructed. Define the hedge ratio as F. No hedge (F=0) Nave hedge (F=1) Multivariate GARCH hedges: Symmetric BEKK Asymmetric BEKK In both cases, estimating the OHR involves forming a 1-step ahead forecast and computing hCF ,t 1 OHRt 1 ! ;t hF ,t 1
Introductory Econometrics for Finance Chris Brooks 2002 60
OHR Results
In Sample Unhedged F=0 Nave Hedge F=1 Symmetric Time Varying Hedge Asymmetric Time Varying Hedge
Ft !
Return Variance 0.0389 {2.3713} 0.8286 -0.0003 {-0.0351} 0.1718 Out of Sample Unhedged F=0 Nave Hedge F=1
hFC ,t h F ,t
Ft !
hFC ,t hF ,t
Ft !
Return Variance 0.0819 {1.4958} 1.4972 -0.0004 {0.0216} 0.1696
hFC ,t h F ,t
Ft !
hFC ,t hF ,t
61
1.00
0.95
Conclusions - OHR is time-varying and less than 1 - M-GARCH OHR provides a better hedge, both in-sample and out-of-sample. - No role in calculating OHR for asymmetries
0.90
0.85
0.80
0.75
0.70
62