Professional Documents
Culture Documents
Financial Econometrics (Lecture Notes) PDF
Financial Econometrics (Lecture Notes) PDF
Lecture Notes
which is about ten estimates for expected return, ten for volatility,
and 45 for co-variances/correlations.
Overall, 65 estimates are required.
That is a lot given the limited amount of data.
16 Professor Doron Avramov, Financial Econometrics
More generally,
if there are N investable assets, you need:
N estimates for means,
N estimates for volatilities,
0.5N(N-1) estimates for correlations.
Overall: 2N+0.5N(N-1) estimates are required!
Mean, volatility, and correlation estimates are noisy as reflected
through their standard errors.
Beyond such parameter uncertainty there is also model
uncertainty.
It is about the uncertainty about the correct model underlying
the evolution of expected returns, volatilities, and correlations.
17 Professor Doron Avramov, Financial Econometrics
Sample Mean and Volatility
The volatility estimate is typically less noisy than the mean
estimate (more later).
When returns are IID, the mean and volatility are estimated as
σ𝑇𝑡=1 𝑅𝑡 σ𝑇𝑡=1 𝑅𝑡 − 𝑅ത 2
𝑅ത = Less Noise ⟶ 𝜎ො =
𝑇 𝑇−1
We will also depart from the mean variance paradigm and
consider down side risk measures to form as well as evaluate
investment strategies.
1
Note that 𝑝𝑑𝑓 𝜇 = , and further
2𝜋𝜎 2
1
if 𝜎 = 1, then 𝑝𝑑𝑓 𝜇 =
2𝜋
The Cumulative Density Function (CDF) is the integral of the
pdf, e. g. , 𝑐𝑑𝑓 𝜇 = 0.5.
In the next slides, skewness and kurtosis are presented for
other probability distribution functions.
31 Professor Doron Avramov, Financial Econometrics
Skewness
The skewness can be negative (left tail) or positive (right tail).
2
𝜎𝑥𝑦
That is, if 𝜎𝑥𝑦 = 0, then 𝜎𝑦∣𝑥 = 𝜎𝑦2 − = 𝜎𝑦
𝜎𝑥2
meaning that the realization of 𝑥 does not say anything about
the conditional distribution of 𝑦.
It should be noted that random variables can be uncorrelated
yet dependent.
Dependence can be represented by a copula.
Under normality, zero correlation and independence coincide.
2 2
𝜎𝑥𝑦 𝜎𝑥𝑦
𝜎𝑦∣𝑥 = 𝜎𝑦2 − = 𝜎𝑦2 1 − = 𝜎𝑦 1 − 𝑅2
𝜎𝑥2 𝜎𝑥2 ∙ 𝜎𝑦2
Define Z=AX+B.
1
−
𝐴𝑛×𝑚 σ𝑚×𝑚 𝐴′𝑚×𝑛 2 𝑍 − 𝐴𝑛×𝑚 𝜇𝑚×1 − 𝐵𝑛×1 ∼ 𝑁 0, 𝐼
1 1
−2 −2
σ ∙σ = σ−1
1 1 1
−2
What does σ mean? A few rules: σ ∙σ =σ
2 2
1 1
−
σ 2 ∙σ =𝐼2
41 Professor Doron Avramov, Financial Econometrics
The Chi-Squared Distribution
If 𝑋1 ~𝑁(0,1) then 𝑋12 ~𝜒 2 1
𝑋12 + 𝑋22 ~𝜒 2 2
then 𝑋1 + 𝑋2 ~𝜒 2 𝑚 + 𝑛
𝑟ҧ − 𝜇
𝑡=𝑠
ൗ 𝑇−1
45 Professor Doron Avramov, Financial Econometrics
The t Distribution
The pdf of student-t is given by:
𝑣+1
−
1 𝑥2 2
𝑓 𝑥, 𝑣 = ∙ 1+
𝑣 ∙ 𝐵 𝑣 Τ2 , 1Τ2 𝑣
𝑉′ ⋅ σ ⋅ > 0
1×𝑁 𝑁×𝑁 𝑁×1
𝜎12 … … … … … 𝜎1,𝑁
2
σ = 𝜎1,2 2 … … … … … = σ 1 , … , σ 𝑁
, 𝜎
Let
𝑁×𝑁 …………………… 𝑁×1 𝑁×1
𝜎𝑁,1, … … … … … 𝜎𝑁2
Trace of a matrix is the sum of diagonal elements
𝜕𝑍
= 𝑌′𝐴
𝜕𝑋
𝜕𝑍
= 𝑋 ′ 𝐴′
𝜕𝑌
𝑎11 𝐵 … … … … … , 𝑎1𝑚 𝐵
𝐶𝑛𝑝×𝑚𝑔 = ………………………
𝑎𝑛1 𝐵 … … … … … , 𝑎𝑛𝑚 𝐵
𝐴𝑀×𝑀 ⊗ 𝐵𝑁×𝑁 = 𝐴 𝑀 𝐵 𝑁
𝐴 ⊗ 𝐵 𝐶 ⊗ 𝐷 = 𝐴𝐶 ⊗ 𝐵𝐷
𝜇𝑝 = 𝜔1 𝜇1 + 𝜔2 𝜇2 + 𝜔3 𝜇3 = 𝜔′ 𝜇
𝐸 𝑌 = 𝐸𝑥 𝐸 𝑌 ∣ 𝑋
Paul Samuelson shows the relation between the LIE and the
notion of market efficiency – which loosely speaking asserts
that the change in asset prices cannot be predicted using
current information.
𝐷𝑡+1 𝐷𝑡+2
The theoretical stock price: = + 𝑃𝑡∗ 2 ⋯
1+𝑟 1+𝑟
based on actual future dividends
R is therefore:
�𝑇� exp ( 𝑟)
1 𝛾 1 1
𝑢 𝑊𝑇+𝐾 = 𝑊𝑇+𝑘 = 𝑊𝑇 exp ( 𝑟) 𝛾
= 𝑊𝑇 𝛾 exp ( 𝛾𝑟)
𝛾 𝛾 𝛾
𝐶𝐹𝑖
The bond price is: 𝑃 𝑦0 = σ𝑛𝑖=1 where 𝑦0 is the yield to
1+𝑦0 𝑖
maturity.
Therefore:
1 ′ 1 ′′ 2
𝑃 𝑦1 − 𝑃 𝑦0 ≈ 𝑃 𝑦0 𝑦1 − 𝑦0 + 𝑃 𝑦0 𝑦1 − 𝑦0 ≈
1! 2!
≈ 𝑃′ 𝑦0 ∆𝑦 + 12𝑃′′ 𝑦0 ∙ ∆𝑦 2
∆𝑃 𝑃′ 𝑦0 ∙ ∆𝑦 𝑃′′ 𝑦0 ∙ ∆𝑦 2
≈ +
𝑃 𝑃 𝑦0 2𝑃 𝑦0
𝑃′ 𝑦0
Instead of: we can write “-MD” (Modified Duration).
𝑃 𝑦0
𝑃′′ 𝑦0
Instead of: we can write “Con” (Convexity).
𝑃 𝑦0
∆𝑃 𝐶𝑜𝑛 ∙ ∆𝑦 2
≈ −𝑀𝐷 ∙ ∆𝑦 +
𝑃 2
∆𝑃 𝐶𝑜𝑛 ∙ ∆𝑦 2
= −𝑀𝐷 ∙ ∆𝑦 +
𝑃 2
What is the change in the call price when the underlying asset
pricing changes?
𝜕𝐶 1 𝜕2𝐶 2
𝐶(𝑃𝑠 ) ≈ 𝐶(𝑃𝑡 ) + 𝑃𝑠 − 𝑃𝑡 + 𝑃 − 𝑃
𝜕𝑃 2 𝜕𝑃2 𝑠 𝑡
1
≈ 𝐶(𝑃𝑡 ) + Δ 𝑃𝑠 − 𝑃𝑡 + Γ 𝑃𝑠 − 𝑃𝑡 2
2
𝑓 𝛽 = 𝜀𝑡2 = 𝑉 ′ 𝑉 = 𝑌 − 𝑋𝛽 ′ 𝑌 − 𝑋𝛽 = 𝑌 ′ 𝑌 + 𝛽′ 𝑋 ′ 𝑋𝛽 − 2𝛽′ 𝑋 ′ 𝑌
𝑡=1
𝜕𝑓 𝛽
= 2(𝑋 ′ 𝑋)𝛽 − 2𝑋 ′ 𝑌 = 0
𝜕𝛽
𝑋′𝑋 𝛽 = 𝑋′𝑌
⇒ 𝛽መ = (𝑋 ′ 𝑋)−1 𝑋 ′ 𝑌
Recall: X and Y are observations.
Could we choose other 𝛽 to obtain smaller 𝑉 ′ 𝑉 = σ𝑇𝑡=1 𝜀𝑡2 ?
𝐸 𝛽መ − 𝛽
= 𝐸 (𝑋 ′ 𝑋)−1 𝑋 ′ 𝑉
= (𝑋 ′ 𝑋)−1 𝑋 ′ 𝐸 𝑉 = 0 , So – it is indeed unbiased.
Reminder:
𝛽መ − 𝛽 = (𝑋 ′ 𝑋)−1 𝑋 ′ 𝑉
′
𝛽መ − 𝛽 = 𝑉 ′ 𝑋(𝑋 ′ 𝑋)−1
explanatory variables.
𝛽:
Therefore, we can calculate σ
(𝑋 ′ 𝑋)−1 𝑉 ′ 𝑉
𝛽 =
σ
𝑇−𝐾−1
97 Professor Doron Avramov, Financial Econometrics
Maximum Likelihood Estimation (MLE)
We now turn to Maximum Likelihood as a tool for estimating parameters
as well as testing models.
𝑖𝑖𝑑
Assume that 𝑟𝑡 ~ 𝑁 (μ, σ2 )
The goal is to estimate the distribution of the underlying parameters:
𝜇
𝜃= 2 .
𝜎
Intuition: the data implies the “most likely” value of 𝜃.
𝑇
1
𝜇 − 𝑟𝑡
𝑇
𝑡=1 0 𝜎2, 0
𝑇 𝑇 𝑇 ∼𝑁 ,
1 1 0 0,2𝜎 4
𝜎2 − ( 𝑟𝑡 − 𝑟𝑡 )2
𝑇 𝑇
𝑡=1 𝑡=1
Then:
0
𝐸 𝑔𝑡 =
0
That is, we set two momentum conditions.
106 Professor Doron Avramov, Financial Econometrics
Method of Moments (MOM)
There are two parameters: 𝜇, 𝜎 2
𝑟𝑡 − 𝜇
Stage 1: Moment Conditions 𝑔𝑡 =
𝑟𝑡 − 𝜇 2 − 𝜎 2
Stage 2: estimation
𝑇
1
𝑔ො𝑡 = 0
𝑇
𝑡=1
𝑟𝑡 − 𝜇 2 , 𝑟𝑡 − 𝜇 3 − 𝜎 2 𝑟𝑡 − 𝜇
𝑆=𝐸 4
𝑟𝑡 − 𝜇 3 2
− 𝜎 𝑟𝑡 − 𝜇 , 𝑟𝑡 − 𝜇 − 2𝜎 2 𝑟𝑡 − 𝜇 2
+ 𝜎4
𝜎2, 𝜇3
𝑆=
𝜇3 , 𝜇4 − 𝜎 4
𝜇3 = 𝑆𝐾 = 𝑠𝑘 × 𝜎 3
(sk is the skewness of the standardized return)
𝜇4 = 𝐾𝑅 = 𝑘𝑟 × 𝜎 4
(kr is the kurtosis of the standardized return)
Sample estimates of the Skewness and Kurtosis are
𝑇 𝑇
1 3
1 4
𝜇Ƹ 3 = 𝑟𝑡 − 𝑟lj 𝜇Ƹ 4 = 𝑟𝑡 − 𝑟lj
𝑇 𝑇
𝑡=1 𝑡=1
112 Professor Doron Avramov, Financial Econometrics
Under Normality the MOM Covariance Matrix
Boils Down to MLE
𝜎 2, 𝜇3 = 0
σ2𝜃 = = σ1
𝜃
𝜇3 = 0, 𝜇4 − 𝜎 4 = 2𝜎 4
−1
𝑇 𝑇
𝜃መ = 𝑥𝑡 𝑥𝑡′ 𝑥𝑡 𝑟𝑡 = 𝑋 ′ 𝑋 −1 𝑋 ′ 𝑅
𝑡=1 𝑡=1
1, 𝑟𝑚1 𝑟1
𝑋𝑇×2 = ⋯ ⋯ 𝑅= …
1, 𝑟𝑚𝑇 𝑟𝑇
115 Professor Doron Avramov, Financial Econometrics
MOM: Estimating Standard Errors
Estimation of the covariance matrix (assuming no serial
correlation)
Stage 3:
𝑇 𝑇
1 1
መ መ
𝑆𝑇 = 𝑔𝑡 (𝜃)𝑔𝑡 (𝜃) = ( 𝑥𝑡 𝑥𝑡′ )𝜀𝑡Ƹ 2
′
𝑇 𝑇
𝑡=1 𝑡=1
Stage 4:
𝑇 𝑇
1 መ
𝜕𝑔𝑡 (𝜃) 1 𝑋 ′
𝑋
′
𝐷𝑇 = = − ( 𝑥𝑡 𝑥𝑡 ) = −
𝑇 𝜕𝜃መ 𝑇 𝑇
𝑡=1 𝑡=1
Type 1 error
H0 Good decision
𝛼
True state of world
Type 2 error
H1 Good decision
𝛽
After transformation:
𝑇 𝐻0
𝐾 − 3 ∼ 𝑁 0,1
24
Why 𝜒 2 (2) ?
𝑋12 + 𝑋22 ∼ 𝜒 2 2
𝑇 𝑇
𝑆 and 𝐾 − 3 are both standard normal and
6 24
they are independent random variables.
1
−2
Suppose that: 𝑦 = σ 𝑅 − 𝜇 ∼ 𝑁 0, 𝐼
Then: 𝑦 ′ 𝑦 = 𝑅 − 𝜇 ′ σ−1 𝑅 − 𝜇 ∼ 𝜒 2 𝑁
𝐻0 : 𝛽0 = 1, 𝛽2 = 0
𝐻1 : 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
′ −1
Chi squared: 𝑅𝛽መ − 𝑞 𝑅 σ𝛽 𝑅 ′ 𝑅𝛽መ − 𝑞 ~𝜒 2 2
𝑡𝑒𝑠𝑡 𝑠𝑡𝑎𝑡𝑖𝑠𝑡𝑖𝑐
𝑌 = 𝑋𝛽 + 𝐸
Here is a receipt:
1. 𝛽መ = 𝑋 ′ 𝑋 −1 𝑋 ′ 𝑌
2. 𝐸 = 𝑌 − 𝑋𝛽መ
1
3. 𝜎ො𝜀2 = 𝐸 ′ ⋅ 𝐸
𝑇−6
4. 𝛽መ ∼ 𝑁 𝛽, 𝑋 ′ 𝑋 −1 𝜎 2
𝜀 = σ𝛽
𝜇−𝑟𝑓
The Sharpe ratio is equal to 𝑆𝑅(𝜃) =
𝜎
139 Professor Doron Avramov, Financial Econometrics
MLE vs. MOM
To develop a test statistic for the SR, we can implement the
MLE or MOM, depending upon our assumption about the
return distribution.
Let us denote the sample estimates by 𝜃መ
𝑀𝐿𝐸 𝑀𝑂𝑀
𝑇 𝜃መ − 𝜃 ∼ 𝑁 0, σ1𝜃 𝑇 𝜃መ − 𝜃 ∼ 𝑁 0, σ2𝜃
𝜕𝑆𝑅
𝑆𝑅 𝜃 − 𝑆𝑅 𝜃መ = ⋅ 𝜃 − 𝜃መ
𝜕𝜃′ 1×2 2×1
𝜕𝑆𝑅
𝐸 𝑆𝑅 𝜃 − 𝑆𝑅 𝜃መ =𝐸 𝜃 − 𝐸 𝜃መ =0
𝜕𝜃′
whereas
′
𝑉𝐴𝑅 𝜃መ = 𝐸 𝜃መ − 𝜃 𝜃መ − 𝜃
2
𝑉𝐴𝑅 𝑆𝑅 𝜃 − 𝑆𝑅 𝜃መ = 𝐸 𝑆𝑅 𝜃 − 𝑆𝑅 𝜃መ
𝜕𝑆𝑅 ′ 𝜕𝑆𝑅
𝐸 መ መ
𝜃−𝜃 𝜃−𝜃 =
𝜕𝜃 ′ 𝜕𝜃
𝜕𝑆𝑅 ′ 𝜕𝑆𝑅
= ′
𝐸 𝜃 − 𝜃መ 𝜃 − 𝜃መ =
𝜕𝜃 𝜕𝜃
𝜕𝑆𝑅 𝜕𝑆𝑅
= ′
σ𝜃
𝜕𝜃 𝜕𝜃
Let us derive:
𝜕𝑆𝑅 1
=
𝜕𝜇 𝜎
1 2 −0.5
𝜕𝑆𝑅 − 2 𝜎 𝜇 − 𝑟𝑓 − 𝜇 − 𝑟𝑓
2
= 2
=
𝜕𝜎 𝜎 2𝜎 3
The first four are equity while the last two are bond portfolios.
Let us check:
𝜎12 , 𝜎12 𝑤1 2 2 2 2
𝜎𝑝2 = 𝑤1 , 𝑤2 𝑤2 = 𝑤 𝜎
1 1 + 2𝑤 𝑤 𝜎
1 2 12 + 𝑤2 𝜎2
𝜎12 , 𝜎22
So it works!
1
where 𝜄 is the Greek letter iota ɇ𝑁×1 = … ,
1
and where 𝜇𝑝 is the expected return target set by the investor.
Indeed, 𝑤 ′ 𝜄 = 1.
Notice that
𝑈 ′ 𝑊 = 𝜆 exp −𝜆𝑊 > 0
𝑈 ′′ 𝑊 = −𝜆2 exp −𝜆𝑊 < 0
That is
max 𝐸 𝑈 𝑊𝑡+1 |𝑊𝑡
𝑤
𝑒
𝑠. 𝑡. 𝑊𝑡+1 = 𝑊𝑡 1 + 𝑅𝑓 + 𝑤 ′ 𝑅𝑡+1
1 2
−𝐸 exp −𝜆𝑊𝑡+1 = − exp −𝜆𝐸 𝑊𝑡+1 + 𝜆 ⋅ 𝑉𝐴𝑅 𝑊𝑡+1
2
′ 𝑒
1 2 2 ′
= − exp −𝜆𝑊𝑡 1 + 𝑅𝑓 + 𝑤 𝜇 + 𝜆 𝑊𝑡 𝑤 𝑉𝑤
2
1
= − exp −𝜆𝑊𝑡 1 + 𝑅𝑓 exp −𝜆𝑊𝑡 𝑤 ′ 𝜇𝑒 − 𝜆𝑊𝑡 𝑤 ′ 𝑉𝑤
2
1
max 𝑤 𝜇 − 𝛾𝑤 ′ 𝑉𝑤
′ 𝑒
𝑤 2
1 −1 𝑒
The optimal solution is 𝑤∗ = 𝑉 𝜇 .
𝛾
𝑉 −1 𝜇 𝑒
ഥ ∗ = ′ −1 𝑒
𝑤
𝜄𝑉 𝜇
193 Professor Doron Avramov, Financial Econometrics
Exponential: The Optimization Mechanism
Conclusion:
The joint assumption of exponential utility and normally
distributed stock return leads to the well-known mean
variance solution.
𝜕𝑈
= 1 − 𝑏𝑊
𝜕𝑊
𝜕2𝑈
2
= −𝑏 < 0
𝜕𝑊
1
Notice that the first derivative is positive for 𝑏 <
𝑊
195 Professor Doron Avramov, Financial Econometrics
Quadratic Preferences
The utility function looks like
𝑈 𝑊
𝑊
1
𝑏
196 Professor Doron Avramov, Financial Econometrics
Quadratic Preferences
It has a diminishing part – which makes no sense – because we
always prefer higher than lower wealth
𝑈 ′′ 𝑏𝑊
Notice that 𝛾 = 𝑅𝑅𝐴 = − ′𝑊 =
𝑈 1−𝑏𝑊
Let us prove it
−1 𝑒
∗
𝑉 𝜇
𝑤
ഥ = ′ −1 𝑒
𝜄𝑉 𝜇
𝑒′−1 𝑒
∗′ 𝑒 ∗ ∗ 𝑒
𝜇 𝑉 𝜇
𝜇 𝑇𝑃 − 𝑅𝑓 = 𝑤
ഥ 𝜇 + 1−𝑤 ഥ ′𝜄 𝑅𝑓 = 𝑤
ഥ 𝜇 = ′ −1 𝑒
𝜄𝑉 𝜇
𝑒 ′ −1 𝑒
2 ∗′ ∗
𝜇 𝑉 𝜇
𝜎𝑇𝑃 = 𝑤ഥ 𝑉𝑤 ഥ = ′ −1 𝑒 2
𝜄𝑉 𝜇
It turns out that pricing models have crucial roles in various
applications in financial economics – both asset pricing as well
as corporate finance.
𝑉 −1 𝜇𝑒
𝑤𝑇𝑃 = ′ −1 𝑒
𝜄𝑉 𝜇
𝑉 = 𝛽𝛽′ 𝜎𝑚
2
+Σ
𝜇𝑒 = 𝛽𝑀𝐾𝑇 𝜇𝑚
𝑒 +𝛽
𝑆𝑀𝐿 𝜇𝑆𝑀𝐿 + 𝛽𝐻𝑀𝐿 𝜇𝐻𝑀𝐿
𝑉 = 𝛽σ𝐹 𝛽′ + σ
When factors are return spreads the risk premium is the mean
of the factor.
𝑒
𝑟𝑡𝑒 = 𝛼 + 𝛽𝑀𝐾𝑇 × 𝑟𝑀𝐾𝑇,𝑡 + 𝛽𝑆𝑀𝐵 × 𝑆𝑀𝐵𝑡
+𝛽𝐻𝑀𝐿 × 𝐻𝑀𝐿𝑡 + 𝛽𝑊𝑀𝐿 × 𝑊𝑀𝐿𝑡 + 𝜀𝑡
WALD.
Likelihood Ratio.
GMM.
1 𝑇
× exp − σ𝑡=1 𝑟𝑡𝑒 − 𝛼 − 𝛽𝑟𝑚𝑡
𝑒 ′ −1 𝑒 𝑒
σ 𝑟𝑡 − 𝛼 − 𝛽𝑟𝑚𝑡
2
where
−1
2
𝜕 𝑙𝑛𝐿
Σ 𝜃 = −𝐸
𝜕𝜃𝜕𝜃 ′
𝜕 𝑙𝑛 𝐿 𝑇
= σ−1 σ 𝑟𝑡𝑒 − 𝛼 − 𝛽𝑟𝑚𝑡
𝑒
𝜕𝛼 𝑡=1
𝜕 𝑙𝑛 𝐿 𝑇
= σ−1 σ 𝑟𝑡𝑒 − 𝛼 − 𝛽𝑟𝑚𝑡
𝑒 𝑒
× 𝑟𝑚𝑡
𝜕𝛽 𝑡=1
𝜕 𝑙𝑛 𝐿 𝑇 −1 1 −1 𝑇
=− σ + σ σ 𝜀𝑡 𝜀𝑡′ σ−1
𝜕σ 2 2 𝑡=1
𝛼ො = 𝜇ො𝑒 − 𝛽መ ⋅ 𝜇ො𝑚
𝑒
𝜕 2 𝑙𝑛 𝐿 𝜕 2 𝑙𝑛 𝐿 𝜕 2 𝑙𝑛 𝐿
, ,
𝜕𝛼𝜕𝛼 ′ 𝜕𝛼𝜕𝛽′ 𝜕𝛼𝜕 σ′
𝜕 2 𝑙𝑛 𝐿 𝜕 2 𝑙𝑛 𝐿 𝜕 2 𝑙𝑛 𝐿
𝐼 𝜃 =− 𝐸 , ,
𝜕𝛽𝜕𝛼 ′ 𝜕𝛽𝜕𝛽′ 𝜕𝛽𝜕 σ′
𝜕 2 𝑙𝑛 𝐿 𝜕 2 𝑙𝑛 𝐿 𝜕 2 𝑙𝑛 𝐿
, ,
𝜕 σ 𝜕 𝛼′ 𝜕 σ 𝜕 𝛽′ 𝜕 σ 𝜕 σ′
𝑇Σ ∼ 𝑊 𝑇 − 2, Σ
𝑒
𝑟1𝑒 = 𝑋 θ1 + ε1 1, 𝑟𝑚1
𝑇×1 𝑇×22×1 𝑇×1
𝑇×2 = ⋮
⋮ where 𝑒
1, 𝑟 𝑚𝑇
𝑟𝑁𝑒 = 𝑋 θ𝑁 + ε𝑁
𝑇×1 𝑇×22×1 𝑇×1 𝜃𝑖 = 𝛼𝑖 , 𝛽𝑖 ′
Compute 𝐽1 .
𝑇Σ ∗ ∼ 𝑊 𝑇 − 1, Σ
let A ~𝑊 𝜏, Σ where 𝜏 ≥ 𝑁
𝑁×𝑁
′ −1 −1 ′ −1 𝐻0
𝐽4 = 𝑇𝛼ො ′ 𝑅 𝐷𝑇 𝑆𝑇 𝐷𝑇 𝑅 ⋅ 𝛼ො ∼ 𝜒 2 (𝑁)
where
𝑅 = 𝐼𝑁 , 0
𝑁×2𝑁 𝑁×𝑁 𝑁×𝑁
𝑒
1, 𝜇ො𝑚
𝐷𝑇 = − 𝑒 𝑒 2 2 ⊗ 𝐼𝑁
𝜇ො𝑚 , 𝜇ො𝑚 + 𝜎ො𝑚
𝑟𝑡𝑒 = Ƚ + Ⱦ ⋅ 𝐹𝑡 + ε𝑡
𝑁×1 𝑁×1 𝑁×𝐾 𝐾×1 𝑁×1
′ −1 −1 ′ −1
𝐽1 = 𝑇 1 + 𝜇ො𝐹 Σ𝐹 𝜇ො𝐹 𝛼ො Σ 𝛼ො ∼𝜒 𝑁
𝑇−𝑁−𝐾 −1 ′ −1
𝐽3 = 1 + 𝜇ො𝐹′ Σ 𝐹−1 𝜇ො𝐹 𝛼ො Σ 𝛼ො ∼𝐹 𝑁,𝑇−𝑁−𝐾
𝑁
where 𝜇ො𝐹 is the mean vector of the factor based return spreads.
2 𝜀 𝑒 2 2
𝜎 1 1 2 ෝ𝑚
𝜇 ෝ
𝜎
=𝐸 𝑒 2
𝜇ො𝑚 ෝ𝑚2
𝑖
⋅ 𝑒 =
+ 𝑉𝑎𝑟 𝛽𝑖 𝜇ො𝑚 𝜎 𝜀𝑖 𝐸 ෝ𝑚2 + 𝛽𝑖2 𝑚
𝜎 𝑇 𝑇 𝜎 𝑇
1
= 𝑆𝑅𝑚 𝜎 2 𝜀𝑖 + 𝛽𝑖2 𝜎ො𝑚
2
𝑇
where
𝑒 2
ෝ𝑚
𝜇
𝑆𝑅𝑚 = 𝐸 ෝ𝑚2
𝜎
For instance, Fama and French argue that SMB and HML
factors are proxying for underlying state variables in the
economy.
𝛼ො ′ Σ −1 𝛼ො
𝐽1 = 𝑇
1 + 𝑆𝑅𝑚 2
𝐽1
𝐽2 = 𝑇 ⋅ ln +1
𝑇
𝑇 − 𝑁 − 1 𝛼ො ′ Σ −1 𝛼ො
𝐽3 =
𝑁 1 + 𝑆𝑅𝑚 2
𝜎ො𝑚2, 𝛽መ ′ 𝜎ො𝑚
2
Φ =
𝑁+1 × 𝑁+1 𝛽መ 𝜎ො𝑚
2, 𝑉
where
2 is the estimated variance of the market factor.
𝜎ො𝑚
𝛽መ is the N-vector of market loadings and 𝑉 is the covariance
matrix of the N test assets.
𝑉 = 𝛽መ 𝛽መ ′ 𝜎ො𝑚
2
+ Σ
𝑆𝑅 𝑇𝑃
2
= 𝜆መ ′ Φ
−1 𝜆መ
−1 =
2
𝜎ො𝑚 −1
+ 𝛽መ ′ Σ −1 𝛽,
መ −𝛽መ ′ Σ −1
Φ
መ
−Σ −1 𝛽, Σ −1
𝑆𝑅 𝑇𝑃
2
= 𝑆𝑅𝑚
2
+ 𝛼ො ′ Σ −1 𝛼ො
or
𝛼ො ′ Σ −1 𝛼ො = 𝑆𝑅 𝑇𝑃
2
− 𝑆𝑅𝑚
2
If the CAPM is correct then these two Sharpe ratios are
identical in population, but not identical in sample due to
estimation errors.
Under the CAPM, the extra N test assets do not add anything
to improving the risk return tradeoff.
𝑇𝑃
𝑀
Φ2
𝑅𝑓
Φ1
𝛼ො ′ Σ −1 𝛼ො = Φ12 − Φ22
𝑆𝑅 𝑇𝑃
2
− 𝑆𝑅𝑚
2
2 𝑁
𝐽1 = 𝑇 ~𝜒
1 + 𝑆𝑅𝑚2
𝑇 − 𝑁 − 1 𝑆𝑅 𝑇𝑃
2
− 𝑆𝑅𝑚
2
𝐽3 = × ~𝐹 𝑁, 𝑇 − 𝑁 − 1
𝑁 1 + 𝑆𝑅𝑚2
If you only scale the market beta, as is typically the case, we
have an 𝑀 + 𝐾 unconditional factor model.
𝑒
𝑉 𝑟𝑡+1 ห𝑧𝑡 = 𝛽0 + 𝛽1 𝑧𝑡 𝛽0 + 𝛽1 𝑧𝑡 ′ 𝜎𝑚
2 +Σ
𝑉 −1 𝜄
Solution: 𝑤𝐺𝑀𝑉𝑃 =
𝜄′ 𝑉 −1 𝜄
Also notice that the expected return and volatility of the chosen
portfolio are
𝜇𝑝 = 𝜇𝐵 + 𝑥 ′ 𝜇
𝜎𝑝2 = 𝑤 ′ 𝑉𝑤 = 𝜎𝐵2 + 2𝑞′ 𝑉𝑥 + 𝜎𝜉2
where 𝐹 = 𝜄 𝑇 , 𝐹
𝜃 = 𝛼, 𝛽
4. Let Ψ
be a diagonal matrix with the (i,i)-th component being
equal to the (i,i)-th component of Δ
.
1
6. Estimate: σ𝐹 = 𝑉 ′ ⋅ 𝑉
𝐾×𝐾 𝑇 𝐾×𝑇 𝑇×𝐾
and where 𝛾 = σ𝑁 σ 𝑁
(𝑓
𝑖=1 𝑗=1 𝑖𝑗 − 𝑠𝑖𝑗 ) 2
with 𝑠𝑖𝑗 being the (𝑖, 𝑗)
component of 𝑆 and 𝑓𝑖𝑗 is the (𝑖, 𝑗) component of 𝐹.
𝜋 = 𝜋𝑖𝑗
𝑖=1 𝑗=1
𝑇
1 _
𝜋𝑖𝑗 = {(𝑟𝑖𝑡 − 𝑟𝑖ҧ )(𝑟𝑗𝑡 − 𝑟𝑗 ) − 𝑠𝑖𝑗 }2
𝑇
𝑡=1
𝑁 𝑁 𝑁
𝜌ҧ 𝑠𝑗𝑗 𝑠𝑖𝑖
𝜂 = 𝜋𝑖𝑖 + 𝜗𝑖𝑖,𝑖𝑗 + 𝜗𝑗𝑗,𝑖𝑗
2 𝑠𝑖𝑖 𝑠𝑗𝑗
𝑖=1 𝑖=1 𝑗=1,𝑗≠𝑖
𝑇
1 _ 2 _ _
𝜗𝑖𝑖,𝑖𝑗 = (𝑟𝑖𝑡 − 𝑟𝑖 ) −𝑠𝑖𝑖 (𝑟𝑖𝑡 − 𝑟𝑖 ) (𝑟𝑗𝑡 − 𝑟𝑗 ) − 𝑠𝑖𝑗
𝑇
𝑡=1
𝑇
1 _ _
𝜗𝑗𝑗,𝑖𝑗 = (𝑟𝑗𝑡 − 𝑟𝑗ҧ )2 −𝑠𝑗𝑗 (𝑟𝑖𝑡 − 𝑟𝑖 ) (𝑟𝑗𝑡 − 𝑟𝑗 ) − 𝑠𝑖𝑗
𝑇
𝑡=1
and with 𝑟𝑖𝑡 and 𝑟𝑖ҧ being the time 𝑡 return on asset 𝑖 and the time-series
average of return on asset 𝑖, respectively.
317 Professor Doron Avramov, Financial Econometrics
The Shrinkage Intensity – A Naïve Method
If you get overwhelmed by the derivation of the shrinkage
intensity it would still be useful to use a naïve shrinkage
approach, which often even works better. For instance, you can
take equal weights:
1 1
𝑉ത = 𝐹 + 𝑆
2 2
That is, we can run a “horse race” across the various models
searching for the best performer.
That is, you first estimate the GMVPs based on the first 60
observations, then based on 66 observations, and so on, while
in the rolling scheme you always use the last 60 observations.
For instance, you can form the table on the next page and
examine which specification has been able to deliver the best
performance.
FF+ FF+
S F MKT FF LW S F MKT FF LW
MOM MOM
Mean
STD
SR
SP
(5%)
alpha
IR
1 𝑇 1
𝜇ො = σ 𝑟, and 𝜎ො 2 = σ𝑇𝑡=1 𝑟𝑡 − 𝜇ො 2 .
𝑇 𝑡=1 𝑡 𝑇−1
ෝ2
𝜎 𝜎4
2ෝ
We have 𝑉𝑎𝑟
𝜇ො = and 𝑉𝑎𝑟
𝜎ො 2 = .
𝑇 𝑇
ෝ𝑛 2
𝜎
As usual, 𝑉𝑎𝑟 𝜇ො𝑛 =
𝑇𝑁
Therefore we have (also using the results from previous slide):
ෝ𝑛 2
𝑁2 𝜎 𝜎𝑛 2
𝑁ෝ ෝ2
𝜎
𝑉𝑎𝑟 𝜇ො = 𝑁 2 𝑉𝑎𝑟 𝜇ො𝑛 = = = .
𝑇𝑁 𝑇 𝑇
The conclusion regarding 𝜇:ො its variance is the same whether it
is estimated based on 𝑇 annual observations or whether it is
estimated using 𝑇 ∙ 𝑁 high-frequency observations.
There is no gain in using high-frequency observations.
We will see a different result for 𝜎ො 2 .
𝑅1 , … , 𝑅𝑁
𝑇×1 𝑇×1
𝑅෨1 = 𝑅1 − 𝜇ො1 … 𝑅෨𝑁 = 𝑅𝑁 − 𝜇ො𝑁
let
𝑅෨ = 𝑅෨1 , 𝑅෨2 , … , 𝑅෨𝑁
1 ′
𝑉 = 𝑅෨ 𝑅෨
𝑇
max 2
𝑤2′ 𝑉𝑤
𝑤2
𝑠. 𝑡 𝑤2′ 𝑤2 = 1
𝑤1′ 𝑤2 = 0
The second Eigen value is smaller than the first due to the
presence of one extra constraint in the optimization – the
orthogonality constraint.
𝜆𝑗
σ𝑁
𝑖=1 𝜆𝑖
𝜆1, … , 0 𝑤1 ′
𝑉 = 𝑤1 , … , 𝑤𝑁 ⋮ ⋱ ⋮
0, … , 𝜆𝑁 𝑤𝑁 ′
The first factor stands for the term structure level, the second
for the term structure slope, and the third for the curvature of
the term structure.
⋮
𝑟𝑁𝑡 = 𝛼𝑁 + Ⱦ𝑁1 𝑓1𝑡 + Ⱦ𝑁2 𝑓2𝑡 + ⋯ + Ⱦ𝑁𝐾 𝑓𝐾𝑡 + 𝜀𝑁𝑡 ∀𝑡 = 1, … , 𝑇
𝑟ǁ 𝑁𝑡
𝛿𝑖2 𝑣𝑎𝑟 ( 𝑓𝑖 )
σ𝑘𝑗=1 𝛿𝑗2 𝑣𝑎𝑟 ( 𝑓𝑗 )
𝑍1 = ⋅ 𝑤1
𝑇×1 𝑇×𝑀 𝑀×1
⋮
𝑍𝐾 = ⋅ 𝑤𝐾
𝑇×1 𝑇×𝑀 𝑀×1
𝑡𝑟
𝑉
𝜎ො 2 𝐾 =
𝑁
𝑁+𝑇 𝑁𝑇
𝑃𝐶 𝐾 = 𝜎ො 2 𝐾 + 𝐾 𝜎ො 2 𝐾max ⋅ ln
𝑁𝑇 𝑁+𝑇
𝑃 𝑦𝑥 ∝𝑃 𝑦 𝑃 𝑥𝑦
Notice that the right hand side retains only factors related to y, thereby excluding 𝑃 𝑥
= න 𝑃 𝑥, 𝑦 𝑑𝑦
𝑃 𝑦𝑥 ∝𝑃 𝑦 𝑃 𝑥𝑦
Zellner (1971) is an excellent reference
1 1
𝑃 𝑟𝑡 𝜇, 𝜎0 2 = 𝑒𝑥𝑝 − 𝑟𝑡 − 𝜇 2
2𝜋𝜎0 2 2𝜎0 2
Since returns are assumed to be IID, the joint likelihood of all realized returns is
𝑇
1
𝑃 𝑅 𝜇, 𝜎0 2
= 2𝜋𝜎0 2 −2 𝑒𝑥𝑝 − 2𝜎 2 σ𝑇𝑡=1 𝑟𝑡 − 𝜇 2
0
Notice:
σ 𝑟𝑡 − 𝜇 2 2
= σ 𝑟𝑡 − 𝜇Ƹ + 𝜇Ƹ − 𝜇
= ν𝑠 2 + 𝑇 𝜇 − 𝜇Ƹ 2
2 𝜎0 2ൗ
𝜇|𝑅, 𝜎0 ~𝑁 𝜇,Ƹ 𝑇
In classical econometrics:
2 𝜎0 2ൗ
𝜇|𝑅,
Ƹ 𝜎0 ~𝑁 𝜇, 𝑇
That is, in classical econometrics, the sample estimate of µ is stochastic while µ itself is an
unknown constant.
1 𝜇−𝜇 2
∝ 𝑒𝑥𝑝 − 2 𝜎 2
𝜇0 𝑇𝜇Ƹ
𝜇 = 𝜎 2 +
𝜎𝑎 2 𝜎0 2
= 𝑤1 𝜇0 + 𝑤2 𝜇Ƹ
where
1
𝜎𝑎 2 prior precision
𝑤1 = =
1
+
1 prior precision + likelihood precision
𝜎𝑎 2 𝜎0 2ൗ
𝑇
𝑤2 = 1 − 𝑤1
So the posterior mean of µ is the weighted average of the prior mean and the sample mean
with weights depending on the prior and likelihood precisions, respectively.
378 Professor Doron Avramov, Financial Econometrics
What if σ is unknown? – The case of Diffuse Prior
Bayes: 𝑃 𝜇, 𝜎 𝑅 ∝ 𝑃 𝜇, 𝜎 𝑃 𝑅 𝜇, 𝜎
The non-informative prior is typically modeled as
𝑃 𝜇, 𝜎 ∝ 𝑃 𝜇 𝑃 𝜎
𝑃 𝜇 ∝𝑐
𝑃 𝜎 ∝ 𝜎 −1
Thus, the joint posterior of µ and σ is
− 𝑇+1
1
𝑃 𝜇, 𝜎 𝑅 ∝ 𝜎 𝑒𝑥𝑝 − 2 ν𝑠 2 + 𝑇 𝜇 − 𝜇Ƹ 2
2𝜎
The conditional distribution of the mean follows straightforwardly
2
𝜎
𝑃 𝜇 𝜎, 𝑅 is 𝑁 𝜇,Ƹ ൗ𝑇
More challenging is to uncover the marginal distributions, which are obtained as
𝑃 𝜇 𝑅 = න 𝑃 𝜇, 𝜎 𝑅 𝑑𝜎
𝑃 𝜎 𝑅 = න 𝑃 𝜇, 𝜎 𝑅 𝑑𝜇
379 Professor Doron Avramov, Financial Econometrics
Solving the Integrals: Posterior of 𝜇
Let Ƚ = ν𝑠 2 + 𝑇 𝜇 − 𝜇Ƹ 2
Then, ∞
𝛼
𝑃 𝜇𝑅 ∝න 𝜎− 𝑇+1
𝑒𝑥𝑝 − 𝑑𝜎
𝜎=0 2𝜎 2
We do a change of variable
𝛼
𝑥= 2
𝑑𝜎 2𝜎 −11 1 −11
= −2 2 𝛼 2 𝑥 2
𝑑𝑥 𝑇+1
𝛼 −
2
𝜎 −𝑇+1 =
2𝑥
𝑇−2 𝑇𝑇
∞
Then 𝑃 𝜇 𝑅 ∝ 2 𝛼 =𝑥0 𝑥 2−1 𝑒𝑥𝑝
2
−
2 −𝑥 𝑑𝑥
𝑇
∞ −1 𝑇
Notice =𝑥0 𝑥 2 𝑒𝑥𝑝 −𝑥 𝑑𝑥 = Γ 2
Therefore,
𝑇−2 𝑇 −𝑇
𝑃 𝜇𝑅 ∝ 2 2 Γ 𝛼 2
2 ν+1
2 − 2
∝ ν𝑠 2 + 𝑇 𝜇 − 𝜇Ƹ
𝜇−ෝ
𝜇
We get 𝑡 = 𝑠 ~𝑡 ν , corresponding to the Student t distribution with ν degrees of freedom.
ൗ 𝑇
380 Professor Doron Avramov, Financial Econometrics
The Marginal Posterior of σ
1
The posterior on 𝜎 𝑃 𝜎 𝑅 ∝ 𝜎 − 𝑇+1
𝑒𝑥𝑝 − 2𝜎2 ν𝑠 2 + 𝑇 𝜇 − 𝜇Ƹ 2
𝑑𝜇
− 𝑇+1 ν𝑠 2 𝑇 2
∝𝜎 𝑒𝑥𝑝 − 2 𝑝𝑥𝑒 − 2𝜎2 𝜇 − 𝜇Ƹ 𝑑𝜇
2𝜎
𝑇 𝜇−ෝ
𝜇
Let 𝑧 = 𝜎
, then
𝑑𝑧 1
𝑑𝜇
= 𝑇𝜎
−𝑇 ν𝑠 2 2
𝑃 𝜎𝑅 ∝𝜎 𝑒𝑥𝑝 − 2𝜎2 𝑝𝑥𝑒 − 𝑧 ൗ2 𝑑𝑧
2
ν𝑠
∝ 𝜎 −𝑇 𝑒𝑥𝑝 −
2𝜎 2
− ν+1 ν𝑠 2
∝𝜎 𝑒𝑥𝑝 − 2𝜎2
which corresponds to the inverted gamma distribution with ν degrees of freedom and
parameter s
The explicit form (with constant of integration) of the inverted gamma is given by
νൗ
2
2 ν𝑠 2
− ν+1
ν𝑠 2
𝑃 𝜎 ν, 𝑠 = ν 𝜎 𝑒𝑥𝑝 − 2
Γ 2 2 2𝜎
381 Professor Doron Avramov, Financial Econometrics
The Multiple Regression Model
The regression model is given by
𝑦 = 𝑋𝛽 + 𝑢
where
y is a 𝑇 × 1 vector of the dependent variables
X is a 𝑇 × 𝑀 matrix with the first column being a 𝑇 × 1 vector of ones
β is an M × 1 vector containing the intercept and M-1 slope coefficients
and u is a 𝑇 × 1 vector of residuals.
We assume that 𝑢𝑡 ~𝑁 0, 𝜎 2 ∀ 𝑡 = 1, … , 𝑇 and IID through time
The likelihood function is
1
𝑃 𝑦 𝑋, 𝛽, 𝜎 ∝ 𝜎 𝑒𝑥𝑝 − 2 𝑦 − 𝑋𝛽 ′ 𝑦 − 𝑋𝛽
−𝑇
2𝜎
1 ′
∝ 𝜎 𝑒𝑥𝑝 − 2 ν𝑠 2 + 𝛽 − 𝛽መ 𝑋 ′ 𝑋 𝛽 − 𝛽መ
−𝑇
2𝜎
It follows since
′
𝑦 − 𝑋𝛽 ′
𝑦 − 𝑋𝛽 = 𝑦 − 𝑋𝛽መ − 𝑋 𝛽 − 𝛽መ 𝑦 − 𝑋𝛽መ − 𝑋 𝛽 − 𝛽መ
′ ′ ′
= 𝑦 − 𝑋𝛽 𝑦 − 𝑋𝛽 + 𝛽 − 𝛽 𝑋 𝑋 𝛽 − 𝛽መ
መ መ መ
′ −𝑇ൗ2
∝ ν𝑠 2 + 𝛽 − 𝛽መ 𝑋 ′ 𝑋 𝛽 − 𝛽መ
which pertains to the multivariate student t with mean 𝛽መ and T-M degrees of freedom
What about the marginal posterior for 𝜎?
𝑃 𝜎 𝑦, 𝑋 = න 𝑃 𝛽, 𝜎 𝑦, 𝑋 𝑑𝛽
ν𝑠 2
∝ 𝜎− ν+1
𝑒𝑥𝑝 − 2
2𝜎
which stands for the inverted gamma with T-M degrees of freedom and parameter s
You can simulate the distribution of β in two steps without solving analytically the integral,
drawing first 𝜎 from its inverted gamma distribution and then drawing from the conditional
of β which is normal as shown earlier.
385 Professor Doron Avramov, Financial Econometrics
Bayesian Updating/Learning
Suppose the initial sample consists of 𝑇1 observations of 𝑋1 and 𝑦1 .
Suppose further that the posterior distribution of 𝛽, 𝜎 based on those observations is given
by:
1
𝑃 𝛽, 𝜎 𝑦1 , 𝑋1 ∝ 𝜎 −(𝑇1+1) 𝑒𝑥𝑝 − 2 𝑦1 − 𝑋1 𝛽 ′ 𝑦1 − 𝑋1 𝛽
2𝜎
1
∝𝜎 −(𝑇1 +1)
𝑒𝑥𝑝 − 2 ν1 𝑠1 2 + 𝛽 − 𝛽መ1 ′𝑋1 ′𝑋1 𝛽 − 𝛽መ1
2𝜎
where
ν1 = 𝑇1 − 𝑀
𝛽መ1 = 𝑋1 ′𝑋1 −1 𝑋1 𝑦1
′
ν1 𝑠1 2 = 𝑦1 − 𝑋1 𝛽መ1 𝑦1 − 𝑋1 𝛽መ1
You now observe one additional sample 𝑋2 and 𝑦2 of length 𝑇2 observations
The likelihood based on that sample is
1
𝑃 𝑦2 , 𝑋2 𝛽, 𝜎 ∝ 𝜎 −𝑇2 𝑒𝑥𝑝 − 2𝜎2 𝑦2 − 𝑋2 𝛽 ′ 𝑦2 − 𝑋2 𝛽
The sample means are quite noisy – thus asset pricing models -even
if misspecified -could give a good guidance.
where 𝛽መ1 , 𝛽መ2 … 𝛽መ𝐾 are the sample estimates of factor loadings,
and 𝜇ො𝑓1 , 𝜇ො𝑓2 … 𝜇ො𝑓𝐾 are the sample estimates of the factor mean
returns.
−1
−1 −1
μ𝐵𝐿 = ɒ σ + 𝑃′ Ω−1 𝑃 ⋅ ɒ σ μ𝑒𝑞 + 𝑃′ Ω−1 μ𝑣
𝑁×1 1×1 𝑁×𝑁 𝑁×𝐾 𝐾×𝐾 𝐾×𝑁 1×1 𝑁×𝑁 𝑁×1 𝑁×𝐾 𝐾×𝐾 𝐾×1
σ, 𝜇 𝑒𝑞 , 𝑃, 𝜏, Ω, 𝜇𝑣
𝜇𝑚 −𝑅𝑓
Then, the price of risk is 𝛾 = 2 where 𝜇𝑚 and 𝜎𝑚
2
are the
𝜎𝑚
expected return and variance of the market portfolio.
Yes, if…
𝜇 𝑒 = 𝛽 𝜇𝑚
𝑒
𝑁×1 𝑁×1
σ 𝑤𝑀𝐾𝑇
𝐶𝐴𝑃𝑀: 𝜇𝑒 = 2
𝑒 = 𝛾σ𝑤
𝜇𝑚 𝑀𝐾𝑇
𝑁×1 𝜎𝑚
𝑒
𝜇𝑚
then 𝜇 = 𝑒
2 σ 𝑤𝑀𝐾𝑇 = 𝜇 𝑒𝑞
𝜎𝑚
𝜇𝑚 − 𝑅𝑓
𝛾= 2
𝜎𝑚
Then 𝑃 is a 2 × 𝑁 matrix.
The first row is all zero except for the fifth entry which is one.
Likewise, the second row is all zero except for the fifth entry
which is one.
400 Professor Doron Avramov, Financial Econometrics
The 𝑃 Matrix: Relative Views
Let us consider now two "relative views".
Loser stocks are the bottom 10% performers during the past six
months.
Value stocks are 10% of the stocks having the highest book-to-
market ratio.
Growth stocks are 10% of the stocks having the lowest book-to-
market ratios.
Using the relative views above, the first element is the payoff
to momentum trading strategy (sample mean); the second
element is the payoff to value investing (sample mean).
You can also use other values and examine how they perform
in real-time investment decisions.
410 Professor Doron Avramov, Financial Econometrics
Maximizing Sharpe Ratio
The remaining task is to run the maximization program
𝑤 ′ 𝜇𝐵𝐿
max
𝑤 𝑤 ′ 𝛴𝑤
−1 −1 −1
𝜇= Δ + (𝑉𝑠𝑎𝑚𝑝𝑙𝑒 /𝑇) Δ−1 𝜇𝐵𝐿 + (𝑉𝑠𝑎𝑚𝑝𝑙𝑒 /𝑇)−1 𝜇𝑠𝑎𝑚𝑝𝑙𝑒
where
Δ = (𝜏Σ)−1 +𝑃′ Ω−1 𝑃 −1
5%
−𝑉𝑎𝑅95% 𝜇
416 Professor Doron Avramov, Financial Econometrics
Value at Risk (VaR)
−𝑉𝑎𝑅1−𝛼 − 𝜇
= Φ−1 𝛼 ⇒ 𝑉𝑎𝑅1−𝛼 = − 𝜇 + 𝜎Φ−1 𝛼
𝜎
where
After the year passes you count the number of wrong analysts.
An analyst is wrong if he/she predicts up move when the
market is down, or predict down move when the market is up.
𝐻0 : 𝑥 = 12.6 20 − 12.6
→ 𝑍= = 2.14
𝐻1 : 𝑂𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒 0.05 ⋅ 0.95 ⋅ 252
They did find out that the problem was that the actual revenue
departed from the normal distribution.
The ES is formulated as
𝜙 𝛷 −1 𝛼
⇒ 𝐸𝑆1−𝛼 = −𝜇 + 𝜎
𝛼
𝜎𝜑 𝜅0 𝜑 𝛷 −1 𝛼
𝐸 𝑥|𝑥 ≤ −𝑉𝑎𝑅1−𝛼 =𝜇− =𝜇−𝜎
𝛷 𝜅0 𝛼
since
−𝑉𝑎𝑅1−𝛼 − 𝜇
𝜅0 = = Φ−1 𝛼
𝜎
where
𝜙 and Φ are the PDF and CDF of a 𝑁 0,1 variable,
respectively
Of course if ℎ = 𝜇
𝜎2
then 𝜆 ℎ =
2
Useful for an investor who does not know the entry/exit point
and is concerned about the worst outcome.
Assuming that
𝐼𝐼𝐷
𝑟𝑡 ∼ 𝑁 𝜇, 𝜎 2 ∀𝑡 = 1, . . . , 𝑇
𝑇
𝑉𝑟𝑓 = 𝑉0 1 + 𝑅𝑓
= 𝑉0 exp 𝑇𝑟𝑓
𝑆𝑃 = Φ − 𝑇 𝑆𝑅
1 1
𝑃𝑢𝑡 = 𝑁 𝜎 𝑇 − 𝑁 − 𝜎 𝑇
2 2
1
𝑙𝑛(𝑆/𝐾)+(𝑟−𝛿+ 𝜎 2 )𝑇
d1 = 2
and 𝑑2 = 𝑑1 − 𝜎 𝑇
𝜎 𝑇
while 𝑁 −𝑑1 = 1 − 𝑁 𝑑1
𝑁 −𝑑2 = 1 − 𝑁 𝑑2
𝑃𝑢𝑡 = 𝑁 𝑑1 − 𝑁 𝑑2
where
𝜎 𝑇
𝑑1 =
2
𝑑2 = −𝑑1
1 0.08
5 0.18
10 0.25
20 0.35
30 0.42
50 0.52
∞
𝜇+0.5𝜎 2
1 1
−2 𝑣 2
𝐹ത c E(x|x > c) = 𝑒 න 𝑒 𝑑𝑣
2𝛱 𝑙𝑛 𝑐 −𝜇
𝜎 −𝜎
𝜇+0.5𝜎 2
𝑙𝑛 𝑐 − 𝜇 − 𝜎 2
=𝑒 ⋅Φ
𝜎
𝑙𝑛 𝑐 − 𝜇 − 𝜎 2
𝛷
𝜎
ELN (x|x ≤ c) = ELN (x) ⋅
𝑙𝑛 𝑐 − 𝜇
𝛷
𝜎
Analytics follow.
𝑙𝑛 𝑉𝑇 ∣ 𝑉0 − 𝑇𝜇 𝑙𝑛 𝑉𝑎𝑅 ∣ 𝑉0 − 𝑇𝜇
= Pr <
𝑇𝜎 𝑇𝜎
𝑙𝑛 𝑉𝑎𝑅 ∣ 𝑉0 − 𝑇𝜇
=Φ
𝑇𝜎
𝑙𝑛 𝑉𝑎𝑅 ∣ 𝑉0 − 𝑇𝜇
= Φ−1 𝛼
𝑇𝜎
𝑇𝜇+ 𝑇𝜎⋅Φ−1 𝛼
𝑉𝑎𝑅 = 𝑉0 𝑒
𝑣−1
−
1 −𝑣 𝑥2 2
𝑧
= ⋅ 1+ |−∞ =
𝑣 ⋅ 𝐵 𝑣/2,1/2 𝑣−1 𝑣
𝑣+1
− 2 +1
1 −𝑣 𝑧2 𝑣 + 𝑧2
⋅ 1+ = −𝑓 𝑧, 𝑣 ⋅
𝑣 ⋅ 𝐵 𝑣/2,1/2 𝑣 − 1 𝑣 𝑣−1
Let 𝜇𝑡 = 0.01; 𝜎𝑡 = 0.05. The next graphs show normal curve fit
to the sum of 𝑡 r.v.s (over 𝑇 periods); sample estimates vs.
predicted parameters are includes
𝑇 = 10
900
800
500
𝑣=3 400
300
200
100
0
-3 -2 -1 0 1 2 3
𝑇 = 100
400
350
300
𝜇Ƹ = 1.011 𝜇𝑁 = 1
𝜎ො = 0.856 𝜎𝑁 = 0.866
250
200
𝑣=3
150
100
50
0
-4 -3 -2 -1 0 1 2 3 4 5
𝑇 = 10
350
300
𝜇Ƹ = 0.099 𝜇𝑁 = 0.1
250 𝜎ො = 0.174 𝜎𝑁 = 0.177
200
𝑣 = 10
150
100
50
0
-0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8
𝑇 = 100
350
300
𝜇Ƹ = 0.994 𝜇𝑁 = 1
250
𝜎ො = 0.566 𝜎𝑁 = 0.559
200
𝑣 = 10
150
100
50
0
-1 -0.5 0 0.5 1 1.5 2 2.5 3 3.5
1
𝑙𝑛(𝑆/𝐾)+(𝑟−𝛿+2𝜎 2 )𝑇
where d1 = and d2 = d1 − 𝜎 T
𝜎 𝑇
If B&S is correct then the implied volatility (IV) of the matched
pair is equal. Time to maturity plays no role.
Implied
Volatility
Strike
Price
Implied
Volatility
Strike
Price
𝐸 𝜎𝑡4
=3 2 ≥3
𝐸 𝜎𝑡2
so
𝐸 𝜀𝑡4
2 ≥1
𝐸 𝜀𝑡2
𝜀𝑡 = 𝜎𝑡 𝑒𝑡 where 𝑒𝑡 ~𝑁 0,1
2 2
𝜎𝑡2 = 𝑤 + 𝛼𝜀𝑡−1 + 𝛽𝜎𝑡−1
2 2
𝐸 𝜎𝑡2 = 𝑤 + 𝛼𝐸 𝜀𝑡−1 + 𝛽𝐸 𝜎𝑡−1
2 𝑤
𝜎ത =
1−𝛼−𝛽
3 1+𝛼+𝛽 1−𝛼−𝛽
𝜇4 = >3
1−2𝛼𝛽−3𝛼2 −𝛽 2
𝜀𝑡−1 2 2
The first component − = 𝑒𝑡−1 −
𝜎𝑡−1 𝜋 𝜋
You can use AR(1) to model log realized variance and then
predict future values.
1
𝑙𝑛(𝑆/𝐾)+(𝑟−𝛿+2𝜎 2 )𝑇
Where d1 = and d2 = d1 − 𝜎 T
𝜎 𝑇
IV is that volatility that if inserted into the B&S formula would
yield the market price of the call or put option.
e.g. 𝑔=2
𝑑
𝑇 𝑉𝑅2 − 1 →𝑁 0,1
𝑔=3
𝑑 20
𝑇 𝑉𝑅3 − 1 →𝑁 0,
9
𝑟𝑡+1,𝑡+𝐾 = 𝑥𝑡 ′ 𝛽 + 𝜀𝑡+1,𝑡+𝐾
𝑥𝑡 ′ = 1, 𝑧𝑡 ′
𝛽′ = 𝑎, 𝑏 ′
Feasible solution:
1
𝐾∞𝑇 3
1 𝑇
መ
𝑆 = σ𝑡=1 𝜀𝑡2
𝑇
𝑇−1 𝑇 𝑇
1 1 1 1 1
𝑆መ = ⋅ 𝜀𝑡 𝜀𝑡+1 + 𝜀𝑡 + ⋅ 𝜀𝑡 𝜀𝑡−1
2
2 𝑇 𝑇 2 𝑇
𝑡=1 𝑡=−1 𝑡=2
𝑇 𝑇−1
1
= 𝜀𝑡2 + 𝜀𝑡 𝜀𝑡+1
𝑇
𝑡=1 𝑡=1
𝐾 𝑇−𝐾+1
𝐾− 𝑗 1
𝑆መ = ⋅ ⋅ 𝜀𝑡 𝑥𝑡 𝑥𝑡−𝑗 ′ 𝜀𝑡−𝑗
𝐾 𝑇
𝑗=−𝐾 𝑡=1
Can compute the MSE (Mean Square Error) for both models.
In the extreme, the model that drops all predictors is the no-
predictability or IID model.
The one that retains all predictors is the all inclusive model.