Professional Documents
Culture Documents
Questions Short Concept Questions
Questions Short Concept Questions
QUESTIONS
Practice Questions
1.4 Based on the probabilities in the diagram below, what are a. Pr(Ac)
the values of the following? b. P r (D |A (J B (J C )
c. P r(A |A )
d. Pr(B| A)
e. Pr(C| A)
f. Pr(D| A)
g. P r((A U D )c)
h. Pr(A cn D c)
i. Are any of the four events pairwise independent?
ANSW ERS
Solved Problems
1.4 a. Pr(A) = 12% + 11% + 15% + 10% = 48% c. This is trivially 100%.
b. Pr(A| B) = P r(A rW P r(B ) = (11% + 10%)/ d. P r(B H A ) = 9%. The conditional probability is
(11% + 10% + 16% + 18%) = 38.2%
c. Pr(B| A) = P r(A rW P r(A ) = (11% + 10%)/48%
e. There is no overlap and so P r(C n A ) = 0.
= 43.8%
f. P r(D H A ) = 9%. The conditional probability is 30%.
d. P r (A D B n C ) = 10%
g. This is the total probability not in A or D. It is
e. Pr(B|ADC) = Pr(BnADC)/Pr(AnC)
1 - P r(A U D ) = 1 - (Pr(A) + Pr(D) - P r(A D D ))
= 10% /(15% + 10%) = 40%
= 100% - (30% + 36% - 9%) = 43%.
f. Pr(ADB|C) = Pr(AnenC)/Pr(C)
h. This area is the intersection of the space not in A with
= 10%/(15% + 10% + 18 + 10%) = 18.9%
the space not in D. This area is the same as the area
g. P r (A U B |C ) = P r((A (JB )n C )/P r(C ) that is not in A or D, P r((A (JD )c) and so 43% .
= (15% + 10% + 18%)/53% = 81.1%
i. The four regions have probabilities A = 30%,
h. We can use the rule that events are indepen B = 30% , C = 28% and D = 36% . The only region
dent if their joint probability is the product of the that satisfied the requirem ent that the joint probability
probability of each. Pr(A) = 48% , Pr(B) = 55%, is the product of the individual probabilities is A and B
Pr(C) = 53%, P r(A D B ) = 21% # (48% X 55%), because P r(A D B ) = 9% = Pr(A)Pr(B) = 30% X 30%.
P r(A D C ) = 25% ^ (48% X 53%),
1.6 Consider the three scenarios: (High, High), (High, Low)
P r(B D C ) = 28% ^ (55% X 53%). None of these
and (Low, Low).
events are pairwise independent.
We are interested in Pr (Star\High, High) using Bayes'
1 .5 a. 1 - Pr(A) = 100% - 30% = 70%
P r(High, High] Star)Pr(Star)
rule, this is equal t o -------- ^
P
-'
b. This value is P r(D n (A U B U C ))/P r(A (JB (JC ).
The total probability in the three areas A , B, and Stars produce high returns in 20% of years, and so
C is 73% . The overlap of D with these three is Pr (High, H igh\Star) = 20% X 20% Pr (Star) is still 10%.
9% + 8% + 7% = 24% , and so the conditional prob Finally, we need to com pute Pr (High, High), which is
24% Pr (High, H ig h jS ta r) Pr (Star) + Pr(H/gh, High\ Normal)
ability is 33%.
73%
Pr(Normal). This value is 20% X 20% X 10% + 5% X 5% 1.8 We are interested in Pr(Fraud|Flag). This value is
= 90% = 0.625% . Com bing these values, Pr(Fraud D FIag )/Pr(Flag ). The probability that a trans
This is a large increase from the 30% chance after one year. .0009
= .0009. Com bining these values, = 90% . This
.000999
1.7 a. 0.10*0.20 = 0.02 —»• 2% indicates that 10% of the flagged transactions are not
b. Calculate this in two ways: actually fraudulent.
i. Using Equation 1.1: 1.9 If each year was independent, then the probability of
P r(A U B ) = Pr(A) + Pr(B) - P r(A D B ) beating the benchm ark ten years in a row is the product
P r(A U B ) = Pr(A) + Pr(B) - P r(A D B ) of the probabilities of beating the benchm ark each year,
= 10% + 20% - 2% = 28%.
1Y ° = 1 . 1% .
2 1,024
ii. Using the identity: P r(A U A c) = Pr(A) + Pr(Ac) = 1
Let C be the event of no defaults. Then
Pr(C) = (1 - (1 - Pr(A))*(1 - Pr(B)) 最新cfafrm加微信286982279
= (1 - 10%)(1 - 20%) = 0.9*0.8 = 72%.
Then the com plem ent of C is the event of any
defaults Pr(C c) = 1 - Pr(C) = 28%.
QUESTIONS
2 .3 W hat two properties must a PM F satisfy? 2 .7 W hat are the median and interquartile range? W hen is the
2 .4 How are the mean, standard deviation, and variance of Y median equal to the mean?
Practice Questions
2 .8 W hat is E[XE[X]]? Hint: recall that E[X] is a constant. 2 .1 0 Suppose the return on an asset has the following
variable X:
Return Probability
X -4 % 6%
1 - 2 .4 5 6 -3 % 9%
2 - 3 .3 8 8 -2 % 11%
3 - 6 .8 1 6 -1 % 12%
4 1.531 0% 14%
5 1.737 1% 16%
6 - 1 .2 5 4 2% 15%
7 - 1 .1 6 4 3% 8%
8 1.532 4% 5%
9 2.550 5% 4%
15 - 0 .5 7 5
ANSW ERS
2.5 Excess kurtosis is the standard kurtosis measure minus 3. 2.7 The median is the point where 50% of the probability is
This reference value com es from the normal distribution, located on either side. It may not be unique in discrete
and so excess kurtosis m easures the kurtosis above that distributions, although it is common to choose the sm all
of the normal distribution. est value where this condition is satisfied. The IQR is the
2.6 The quantile function is the inverse of the C D F function. difference between the 25% and 75% quantiles. These
That is, if u = Fx(x) returns the C D F value of X, then are the points where 25% and 75% of the probability of
q = F x \u ) is the quantile function. The C D F function is the random variable lies to the left. This difference is a
not invertible if there are regions where there is no prob measure of spread that is like the standard deviation.
ability. This corresponds to a ju m p in the CDF, as shown The median is equal to the mean in any sym m etric distri
in the figures that follow. bution if the first moment exists (is finite).
Solved Problems
2.8 E[XE[X\] = 1 E[X \X Pr(X = x) = E [X ]2 X Pr(X = x) X X Standardized
= E[X] X E[X] = (E[X])2 because E[X] is a constant and
8 1.532 0.898
does not depend on X. In other words, the expected
value of a random variable d o e s not depend on the ran 9 2.550 1.289
dom variable. 10 0.296 0.423
2.9 a. The mean is —0.803 and the variance is 6.762. 11 - 0 .9 7 9 - 0 .0 6 8
b.
12 - 4 .2 5 9 - 1 .3 2 9
X X Standardized 13 2.810 1.389
1 - 2 .4 5 6 - 0 .6 3 6 14 - 1 .6 0 8 - 0 .3 1 0
2 - 3 .3 8 8 - 0 .9 9 4 15 - 0 .5 7 5 0.088
3 - 6 .8 1 6 - 2 .3 1 2 Mean - 0 .8 0 3 0.000
4 1.531 0.897 Variance 6.762 1.000
7 - 1 .1 6 4 - 0 .1 3 9
(X Standardized)3 (X Standardized)4
X X Standardized (Skew) (Kurtosis)
1 - 2 .4 5 6 - 0 .6 3 6 - 0 .2 5 7 0.163
2 - 3 .3 8 8 - 0 .9 9 4 - 0 .9 8 2 0.977
3 - 6 .8 1 6 - 2 .3 1 2 - 1 2 .3 6 4 28.590
6 - 1 .2 5 4 - 0 .1 7 3 - 0 .0 0 5 0.001
7 - 1 .1 6 4 - 0 .1 3 9 - 0 .0 0 3 0.000
11 - 0 .9 7 9 - 0 .0 6 8 0.000 0.000
12 - 4 .2 5 9 - 1 .3 2 9 - 2 .3 4 8 3.120
14 - 1 .6 0 8 - 0 .3 1 0 - 0 .0 3 0 0.009
2.10 a. The mean is E[X] = Pr(X = x) = 0.25% . d. The kurtosis requires computing
Pr(X = x).
= a
QUESTIONS
3.2 Under what conditions can a binomial and a Poisson be 3.6 How many independent standard normal random vari
approxim ated by a normal random variable? ables are required to construct a
3.3 W hat type of events are Poisson random variables used to 3.7 If a Beta-distributed random variable is used to describe
describe? some data, what requirem ent must the data satisfy?
3.4 If X is a standard uniform, what is Pr(0.2 < Z < 0.4)? W hat 3.8 How are the mean and variance of a mixture of normal
about Pr(/ < X < u), where 0 < / < u < 1? random variables related to the mean and variance of the
com ponents of the m ixture?
Practice Questions
3.9 Either using a Z table or the Excel function N O RM .S.D IST, a. The fund has access to a USD 10 million line of credit
com pute that does not count as part of its portfolio. W hat is the
chance that the firm's loss in a month exceeds this line
a. P r ( - 1.5 < Z < 0), where Z ~ N(0, 1)
b. Pr(Z < - 1 .5 ), where Z ~ N(0, 1) of credit?
b. W hat would the line of credit need to be to ensure
c. Pr(—1.5 < X < 0), where X ~ N(1, 2)
d. Pr(X > 2), where X ~ N(1, 2) that the firm's loss was less than the line of credit in
99.9% of months (or equivalently, larger than the LO C
e. Pr(W > 12), where W ~ N(3, 9)
in 0.1% of months)?
3.10 Either using a Z table or the Excel function N O RM .S.IN V,
com pute
3.13 If the kurtosis of some returns on a small-cap stock portfo
lio was 6, what would the degrees of freedom param eter
a. z so that Pr(z < Z) = .95 when Z ~ N(0, 1) be if they were generated by a generalized Student's t„?
b. z so that Pr(z > Z) = .95 when Z ~ N(0, 1)
W hat if the kurtosis was 9?
c. z so that Pr(—z < Z < z) = .75 when Z ~ N(0, 1)
d. a and b so that Pr(a < X < b) = .75 and
3.14 An analyst is using the following exponential function to
Pr(a < X) = 0.125 when X ~ N(2, 4) model corporate default rates:
daily mean of 8%/252 and a daily variance of (20%)2/252, where y is the number of years.
find the values where
a. W hat is the cum ulative probability of default within the
a. Pr(R < r) = .001 first five years?
b. Pr(R < r) = .01 b. W hat is the cum ulative probability of default within the
c. Pr(R < r) = .05 first ten years given that the com pany has survived for
five years?
3.12 The monthly return on a hedge fund portfolio with USD 1
billion in assets is N[.02, .0003). W hat is the distribution of
the gain in a month?
A N SW ERS
Solved Problems
3.9 a. 43.3% . In Excel, the command to com pute this value answer to the previous problem by re-centering on
is N O R M .S.D IST(0,TR U E) - N O R M .S .D IS T (-1 .5 ,T R U E ). the mean and scaling by the standard deviation, so
that a = 2 X —1. 15 + 2 and b = 2 X 1.15 + 2. Note
b. 6.7% . In Excel, the command to com pute this value is
N O R M .S .D IS T (—1.5,TR U E). that the formula is a = a X q + / j l , where q is the
quantile value.
c. 20.1% . In Excel, the command to com pute this
value is N O RM .S.D IST((0 - 1)/SQ RT(2),TRUE) 3.11 a. The mean is 0.031% per day and the variance is
- N O R M .S .D IS T ((-1 .5 -1 )/S Q R T (2 ),T R U E ). 1.58 per day (so that the standard deviation is
1.26% per day). To find these values, we trans
d. 24.0% . In Excel, the command to com pute this value
form the variable to be standard normal, so that
is 1 - N O RM .S.D IST((2 - 1)/SQ RT(2),TRU E).
r - /x
Pr(R < r) = .001 = Pr Z < = .001
e. 0.13% . In Excel, the command to com pute this value \ (T /
is 1 - N O RM .S.D IST((12 - 3)/3,TRU E). The value for the standard normal is —3.09
3.10 a. 1.645. In Excel, the command to com pute this value is (NORM.S.INV(O.OOI) in Excel) so that
one in 1,000 chance that the return would be less than that k [v — 4) = 3{v — 2) so that k v — 4 k = 3v — 6 and
3.86% , if returns were normally distributed. 4 k —6
— 3v = 4 — 6. Finally, solving for v, v = • Plug-
—3
k v k
2 4 -6
deviation is 1.73% . In USD, the monthly change in gmg in v 18 = 6andi^ = = 5. The
6 -3 3 9 -3
portfolio value has a mean of 2% X USD 1 billion =
kurtosis falls rapidly as v grows. For exam ple, if v = 12
USD 20 million and a standard deviation of 1.73% X
then k = 3.75, which is only slightly higher than the kur
USD 1 billion = USD 17.3 million. The probability that
tosis of a normal (3).
the portfolio loses more than USD 10 million is than
(working in millions) 3.14 a. Fy(5) = 1 — exp
V - 20 b. W e need to divide the marginal probability of default
Pr(V < - 1 0 ) = Pr <
17.3 between years five and ten:
= Pr (Z < - 1 .7 3 )
Fy(10) - Fy(5)
Using the normal table, Pr(Z < - 1 .7 3 ) = 4.18% .
By the SURVIVAL probability through year five
b. Here we work in the other direction. First, we find (1 - result from part a).
the quantile where Pr(Z < z) = 99.9% , which gives
z = - 3 .0 9 . This is then scaled to the distribution of Fy(10) - Fy(5) = exp - exp ( - 1% )
the change in the value of the portfolio by m ultiply
ing by the standard deviation and adding the mean, — exp
Fy( 10) ~ Fy(5)
17.3 X —3.09 + 20 = —33.46. The fund would need
1 - Fy(5)
a line of credit of USD 33.46 million to have a 99.9% exp
change of having a change above this level.
QUESTIONS
4.2 W hen are conditional distributions equal to marginal 4.7 If X-| and X 2 are independent continuous random variables
distributions? with PDFs fXl(x 1 ) and fx2 (x 2 )» respectively, what is the joint
4.3 If X-| and X 2 are independent, what is the correlation PD F of the two random variables?
between X-| and X 2? 4.8 If X-| and X 2 both have univariate normal distributions, is
the joint distribution of X-, and X 2 a bivariate normal?
4.4 If X-| and X 2 are uncorrelated (Corr[X-|, X 2] = 0), are they
independent? 4.9 In the previous question, what if X-, and X 2 are
independent?
4.5 W hat is the effect on the covariance between X-| and X 2 of
rescaling X-| by w and X 2 by (1 — w), where w is between 4.10 Are sums of iid normal random variables normally distributed?
0 and 1 ?
Practice Questions
4.11 Suppose that the annual profit of two firm s, one an of the profit of Big Firm when Small Firm either has no
incum bent (Big Firm , X-|) and the other a startup (Small profit or loses money (X 2 < 0)?
Firm , X 2), can be described with the following probability 4.15 The return distributions for the S&P 500 and Nikkei for the
m atrix:
next year are given as:
4.12 Using the probability m atrix for Big Firm and Small Firm, Nikkei 0% 5% 40%
what are the covariance and correlations between the 8% 6% 9%
profits of these two firm s?
Fill in the missing cells to match the original given
4.13 If an investor owned 20% of Big Firm and 80% of Small marginal distributions.
Firm, what are the expected profits of the investor and the
c. For the m atrix in b, what is the conditional distribution
standard deviation of the investor's profits?
of the Nikkei given a 10% return on the S&P?
4.14 In the Big Firm -Sm all Firm exam ple, what are the condi
tional expected profit and conditional standard deviation
ANSW ERS
4.2 W hen the random variables are independent. If the condi 4.6 If the two components X-] and X 2 are independent, then the
tional distribution of X i given X 2 equals the marginal, then conditional expectation of any function of X i given X 2 = x 2
X 2 has no information about the probability of different is always the same as the marginal expectation of the same
values in X-|. function. That is, E[g(X-|)|X 2 = x 2] = E[g(X-|)] for any value x 2.
4.3 Independent random variables always have a correlation of 0. 4.7 The joint PD F is fx1 (x 2 (x i/ x 2 ) = fx 1 (x i)^x2 (x 2 ) because when
This statement is only technically true if the variance of both independent the joint is the product of the marginals.
is well defined so that the correlation is also well defined. 4.8 Not necessarily. It is possible that the joint is not a bivari
4.4 No. There are many ways that two random variables can ate normal if the dependence structure between X-| and
be dependent but not correlated. For exam ple, if there is X 2 is different from what is possible with a normal. The
no linear relationship between the variables but they both distribution that generated the data points plotted below
tend to be large in m agnitude (of either sign) at the same has normal marginals, but this pattern of dependence is
tim e. In this case, there may be tail dependence but not not possible in a normal that always appear elliptical.
linear dependence (correlation). The image below plots
realizations from a dependent bivariate random variable
with 0 correlation. The dependence in this distribution
arises because if X-| is close to 0 then X 2 is also close to 0.
- 2 - 1 0 1 2
Xi
4.10 Yes because iid normal random variables are jointly nor
- 2 - 1 0 1 2
Solved Problems
4.11 a. The marginal distributions are com puted by summing b. They are independent if the joint is the product of the
across rows for Big Firm and down columns for Small marginals. Looking at the upper left cell, 6.77% X 6.70%
Firm. They are is not equal to 1.97%, and so they are not.
c. The conditional distribution is just the row correspond
Big Firm Small Firm ing to USD 100M normalized to sum to 1. The condi
tional is different from the marginal, which is another
- U S D 50M 6.77 - U S D 1M 6.70
dem onstration that the profits are not independent.
USD 0 43.02 USD 0 43.19
- U S D 1M USD 0 USD 2M USD 4M
USD 10M 34.38 USD 2M 34.18
4.12 First, the two means and variances can be com puted from Finally, the conditional expectation is E[X-||X 2 < 0] =
the marginal distributions in the earlier problem . Then 2x-|Pr(Xi = x-||X2 < 0) = USD 3.01 M. The conditional
the covariance can be com puted using the alternative expectation squared is E [X f|X 2 < 0] = 940.31, and so
form , which is E[X-[X 2\ — E[X-|]E[X2]. The means can be the conditional variance is \/[X 1 ] = E[Xf] — E [X - |]2 =
com puted using E[Xj] = I x j Pr(Xy = xj) for j = 1,2. The 940.31 - 3.01 2 = 931.25 and the conditional standard
variance can be com puted using E [X f] - (E[Xy])2, which deviation is USD 30.52M .
requires computing E [X 2] using X x 2 Pr(X/ = xj). 4.15 a. If the two variables are unrelated, then the joint distri
For Big Firm , these values are E[X-|] = USD 15.88M , bution is just given by the products
E[X?j = 1786.63 and V[X-,] = 1534.36.
fx1(x2(*-w *2) =
For Small Firm , these values are E [X 2] = USD 1.25M , For exam ple, the probability of seeing —10% on the
E [X 2] = 3.98 and V[X2] = 2.41.
S&P and —5% on the Nikkei is 25% *20% = 5%:
The expected value of the cross product is
4.14 We need to com pute the conditional distribution given Nikkei 0% 5% 40%
X 2 < 0. The relevant rows of the probability m atrix are 8% 6% 9%
- U S D 1M USD 0 The sum of the rows needs to match the S&P 500 mar
X^Xz
ginal distribution:
- U S D 50M 1.97% 3.90%
Continuing yields: Dividing each entry by the total mass (25% in this case)
The same process works going across the columns, 25% 100%
S&P 500
+ 10%
-5 % 1%
Nikkei 0% 15%
8% 9%
25%
QUESTIONS
5.2 W hat is the difference between the standard deviation of 5.6 W hat do skewness and kurtosis measure?
the sample data and the standard error of the sample mean?
5.7 W hat is the skewness and kurtosis of a normal random
5.3 Is an unbiased estim ator always better than a biased variable?
estim ator?
5.8 How are quantiles estim ated?
5.4 W hat is the sam ple analog to the expectation operator, 5.9 W hat advantages do quantiles have compared to moments?
and how is it used?
5.10 W hen dealing with two random variables, how is coskew
ness interpreted?
Practice Questions
5.11 Suppose four independent random variables X h X 2, X 3, and It is hypothesized that the data comes from a uniform dis
X 4 all have mean /z = 1 and variances o f 1 2, 1 2 <2, and 2, tribution, U(0 , b).
1
respectively. What is the expectation and variance of — a. Calculate the sam ple mean and variance.
5.12 Using the same information from question 1, com pute the b. W hat are the unbiased estim ators of the mean and
1 1 variance?
expectation and variance of 2X-| + 2 X 2 + —X 3 + —X 4.
c. Calculate the b in 1/(0, b) using the formula for the
5.13 Using the same information from question 1, com pute the mean of a uniform distribution and the value of the
2 2 1 1
expectation and variance of —X i + —X? + — Xa + — X 4? unbiased sam ple mean found in part b.
^ 5 1 5 ^ 10 3 10 4
d. Calculate the b in 1/(0, b) using the formula for the
5.14 W hat is the effect on the sam ple covariance between X
variance of a uniform distribution and the value of the
and Y if X is scaled by a constant a? W hat is the effect on
unbiased sam ple variance found in part b.
the sam ple correlation?
5.15 An experim ent yields the following data: 5.16 For the following data:
Observation Value
Trial Number Value
1 0.38
1 0 .0 0
2 0.28
2 0.07
3 0.27
3 0.13
4 0.99
4 0.13
5 0.26
5 0 .2 0
0.43
0.23
6
6
13 0.76
14 0.77
15 0.96
ANSW ERS
Solved Problems
5.13 The expectation is com puted using the same steps:
5.11 The expectation is E —(Xi + X 2 + X 3 + X 4 ) = 2 2 1 1
-X-, + - x 2 + — x 3 + — x 4
1 5 1 5 1 10 3 10 4
+ E [X 2] + E [X 3] + E [X 4]) — — (/x + /x + /x + /x) — /x — 1
= | E [X , ] + | e [X2] + ^ £ [ X 3] + ^ E [ X 4]
The variance is Var — (X-i + X 2 + X 3 + X 4) = T ( V a r [ X , ]
2 2 1 1
5/x + 5 /x + 1 0 jJL + — /JL = /X = 1.
1 0 ‘
+ Var[X2] + Var[X3] + Var[X4]) — — ( — + 2 + ^ + / ~ ^\6
This estim ator is unbiased.
5.12 The expectation is E 2 Xy + 2 X 2 + —X 3 + —X 4 = 2\± The variance of the sum is the sum of the variances
because the random variables are independent.
1 1
+ 2/x + —/jl + = 5/x = 5. The variance is 2Xi + 2X 2 2 2 1 1
Var 77X3 + —-x4
— f x V a r
5 X l + 5 Xa + *10 6 1 0 4
1 1 1 1
+ X 3 + ~X 4 = 4Var[X-,] + 4V ar[X 2] + ~Var[X3]
— Trz’VarfX'i ] + 7rir\/ar[X2] + Var[X3] + - 7 —Var[X4]
77
2 6 2 4
25 1J 25 2 100 1 0 0
1 4 4 2 2 4 1 4 1 1 1
+ - V a r[X 4] = - + - + - + - = 5 .
4 4J 2 2 4 4 — X - + — x - + x 2+ x 2=
25 2 25 2 1 0 0 1 0 0 25
3 0.28 40%
produce p = „ , and the sam ple correlation is
dC Ty(Ty CTy(Ty 4 0.38 60%
invariant to rescaling the data.
5 0.43 80%
5.15 a. Use the standard formula to get the sam ple variance
6 0.99 1 0 0 %
(here, n = 15):
n The median (50%) lies exactly half way between the
jl = n“ 1 2 */ = 0-39, third and fourth ranked observations. Therefore:
n
0.28 + 0.38
a 2 = n“ 1 2 ( X / - A ) 2 = 0.08. Median = ---------------= 0.33
/=1
b. The sam ple mean is already unbiased. b. This requires calculating the 25% and 75% levels
For the variance: The 25% level is 5/20 = 25% of the way between
ranked observations 2 and 3. Therefore:
n
s2 = ct 2 = l 5 0.080 = 0.086.
n - 1 14 q25% = 0-75 * 0.27 + 0.25 * 0.28 = 0.2725
c. The mean for a U(a,b) distribution is given as: Sim ilarly, the 75% level is 75% of the way between
a + b observations 4 and 5:
2 _ (b - a ) 2
fJ “ 12
b2
0.086 = — ^ b = 1.016
12
5.16 a. The first step is to rank order the observations:
Ranked
Position Value
1 0.26
2 0.27
3 0.28
4 0.38
5 0.43
6 0.99
Construct an appropriate null hypothesis and alternative • Understand how a hypothesis test and a confidence
hypothesis and distinguish between the two. interval are related.
Construct and apply confidence intervals for one Explain what the p-value of a hypothesis test measures.
sided and two-sided hypothesis tests, and interpret
the results of hypothesis tests with a specific level of Interpret the results of hypothesis tests with a specific
confidence. level of confidence.
Differentiate between a one-sided and a two-sided test Identify the steps to test a hypothesis about the difference
and identify when to use each test. between two population means.
Explain the difference between Type I and Type II errors Explain the problem of multiple testing and how it can
and how these relate to the size and power of a test. bias results.
最新cfafrm加微信286982279
83
The following questions are intended to help candidates understand the material. They are n o t actual FRM exam questions.
QUESTIONS
6.4 W hat are the trade-offs when choosing the size of a test? 6.9 W hat does the VaR of a portfolio measure?
6.5 W hat is the power of a hypothesis test? 6.10 W hat are three m ethods to evaluate a VaR model of a
portfolio?
Practice Questions
6.11 Suppose you wish to test w hether the default rate of 6.15 You collect 50 years of annual data on equity and bond
bonds that are rated as investm ent grade by S&P is returns. The estim ated mean equity return is 7.3% per
the same as the default rate on bonds rated as invest year, and the sam ple mean bond return is 2.7% per year.
ment grade by Fitch. W hat are the null and alternative The sam ple standard deviations are 18.4% and 5.3% ,
hypotheses? W hat data would you need to test the null respectively. The correlation between the two-return
hypothesis? series is - 6 0 % . Are the expected returns on these two
assets the sam e? Does your answer change if the correla
6.12 Using the Excel function N O R M .S.D IST or a normal prob
ability table, what are the critical values when testing the tion is 0 ?
null hypothesis, H0: /z = /z0, against 6.16 If a p-VaR model is well specified, HITs should be iid
a. a one-sided lower alternative using a size of 1 0 %? Bernoulli^ — p). W hat is the probability of observing two
HITs in a row? Can you think of how this could be used to
b. a one-sided upper alternative using a size of 2 0 %?
c. a two-sided alternative using a size of 2 %? perform a test that the model is correct?
d. a two-sided alternative using a size of . 1 %? 6.17 A data m anagem ent group wants to test the null hypoth
esis that observed data is N(0,1) distributed by evaluating
6.13 Find the p-value for the following t-test statistics of the
the mean of a set of random draws. However, the actual
null hypothesis, H0: /z = /z0.
underlying data is distributed as N(1, 2.25).
a. Statistic: 1.45, Alternative: Two-sided
b. Statistic: 1.45, Alternative: One-sided upper a. If the sam ple size is 10, what is the probability of a
Type II error and the power of the test? Assum e a 90%
c. Statistic: 1.45, Alternative: One-sided lower
d. Statistic: —2.3, Alternative: One-sided upper confidence level on a two-sided test.
b. How many sam ples would need to be taken to reduce
e. Statistic: 2.7, Alternative: Two-sided
the probability of a Type II error to less than 1%?
6.14 If you are given a 99% confidence interval for the mean
return on the Nasdaq 100 of [2.32% , 12.78% ], what is the
6.18 Suppose that for a linear regression, an estimation of
sam ple mean and standard error? If this confidence inter slope is stated as having a one-sided p-value of 0.04.
W hat does this mean?
val is based on 37 years of data, assumed to be iid, what is
the sam ple standard deviation?
ANSW ERS
Solved Problems
6.11 The null is that the IG (investm ent grade) default rate d. This size corresponds to the same critical value as a
is the same for S&P rated firms as it is for Fitch-rated .05% upper-tailed test, which is 3.29.
firms. If Sj are default indicators for S&P rated firms, and
6.13 a. In a two-sided test, the p-value is the area under
Fj are default indicators for Fitch-rated firm s, then the the normal curve for values greater than 1.45 or less
null is H q : As = A f , which states that the mean values than - 1 .4 5 . This is tw ice the area less than - 1 .4 5 , or
are the sam e. The null can be equivalently expressed 2 X 0.074. = .148.
as H0: a s — A f = 0. The alternative is Hy. /j l $ ^ a f or b. In a one-sided upper test, we need the area above
equivalently H^: /xs — a f ^ 0 - The data required to test 1.45. This is half the previous answer, or 0.074.
this hypothesis would be binary random variables where
1 indicates that an IG bond defaulted and 0 if it did not
c. In a one-sided lower test, we need the area under
the curve for values less than 1.45. This is just
within a fixed tim e fram e (e.g ., a quarter).
1 - 0.074 = .926.
6.12 a. The Iower-tailed critical value is —1.28.
d. Here we need to probability under the normal curve
b. The one-sided upper-critical value is 0.84.
for values above —2.3, which is 1 minus the area less
c. A two-sided alternative with a size of 2% uses the than - 2 .3 , or 1 — 0.01 = 0.99.
same critical value as a one-sided upper test, which is
e. Because it is a two-sided alternative, we need that
2.32. Each tail has probability 1% when using the criti
area for values larger than 2.7 or less than —2.7,
cal value.
which is tw ice the area less than —2.7. This value is ± 1.96 using a size of 5%. The null is not rejected. If the
2 X .0034 = .0068. correlation was 0 , then the variance estim ate would be
6.14 The mean is the midpoint of a sym m etric confidence 0.0366, and the test statistic is 1.69. The null would still
not be rejected if the size was 5%, although if the test
interval (the usual type), and so is 7.55% . The 99%
Cl is constructed as [/x — c X & ,jl + c X a] and so size was 10%, then the critical value would be ± 1.645
and the correlation would matter.
c X a = 12.78% — 7.55% = 5.23% . The critical value for
a 99% Cl corresponds to the point where there is 0.5% in 6.16 The probability of a H IT should be 1 — p if the model
5.23% is correct. They should also be independent, and so
each tail, or 2.57, and so a 2.03% needs to
2.57 the probability of observing two HITs in a row should
be surveyed.
be (1 — p)2. The can be form ulated as the null that
6.15 The null hypothesis is H0: /j l e = /j l b . The alternative is Hb: E[H IT i X H ITi+i] = (1 — p ) 2 and tested against the
H-|: fx E ¥= /j l b . The test statistic is based on the difference alternative H-p E[H/T(- X H/T/+1] ^ (1 — p)2. This can be
of the average returns, 8 = 7.3% — 2.7% = 4.6% . im plem ented as a sim ple test of a mean by defining the
The estim ator of the variance of the difference is random variable X/ = HIT} X H/T( + 1 and then testing the
a E + a B - 2(t be, which is 0 .1 842 + 0.0532 - 2 X - 0 .6 X null H0: fJLx = (1 — p ) 2 using a standard test of a mean.
0.184 X 0.053 = 0.0483. The test statistic is
g
6.17 a. W hen the null hypothesis is false, the probability of
—, = 1.47. The critical value for a two-sides test is a Type II error is equal to the probability that the
/O043
hypothesis fails to be rejected, as per the diagram in
V 50
the chapter:
Further recall that for a 90% confidence level on a N(0,1) distribution, the cut-off points are + /—1.65.
-1.65 1.65
Chapter 6 Hypothesis Testing 97
The following questions are intended to help candidates understand the material. They are n o t actual FRM exam questions.
Now, if there are 10 sam ples taken from an N(0,1) In actuality, the true distribution is N (1,2.25), so
then the standard deviation is reduced the a = V 2 .2 5 = 1.5. For a sam ple size of 10, the
expected sam ple standard deviation is
°H> — = = 0.316
V 10
/V
A 5
0.474
O s a m p le
VTo
Therefore, the cut-off points are
Calculating the equivalent distance of + /— 1.65 in this
± 1 .6 5 * 0 .3 1 6 = ± 0 .5 2 2 distribution com pared to a standard N(0,1) yields
6 .1 8 The p-value tells us the "probability of observing a test will correspond to the value of x that solves the following
statistic that is more extrem e than the one com puted equation:
from the observed data when the null hypothesis is tru e." 1 - O(x) = 0.04
In other words, if the estim ate of the slope is true, then in
Using the excel function N O RM SIN V gives x = 1.75.
a randomized trial the probability of getting a larger new
slope estim ate is 4% .Assum ing a normal distribution, this
QUESTIONS
Practice Questions
7.9 Find the O LS estim ators for the following data: 7.11 In a CAPM that regresses W ells Fargo on the m arket, the
coefficients on monthly data are a = 0.1 and [3 = 1. 2.
X y W hat is the expected excess return on W ells Fargo when
1 0.35 7.12 You fit a CAPM that regresses the excess return of
Coca-Cola on the excess m arket return using 20 years
2 6.46
of monthly data. You estim ate a = 0.71, (3 = 1.37,
3 4.09 s 2 = 20.38, <xj = 19.82 and jl x = 0.71.
4 7.34 a. W hat are the standard errors of a and /3?
A
ANSW ERS
7.2 The model that allowed differences in the slope would 7.6 The only tim e /3 = 0 is when Cov(X,V) = 0 and so the
A
be Rj = a + p R mii + ylFOMCRm,i + €i where If o mc is two variables are uncorrelated. The R2 is the squared cor
1 on FO M C days and 0 otherw ise. If y is not zero, relation and so must be 0 .
then the slope is different on FO M C days. This can be 7.7 False. The optim ally hedged portfolio is uncorrelated
extended to both param eters by estim ating the model
with the return on the hedge. It is not necessarily inde
Ot y IFO M C "f P R m ,i y ^ F O M C ^ m .i "f pendent. It would be independent if the returns were
7.3 The 1 — a confidence interval contains the set of null jointly normally distributed.
hypotheses that could not be rejected using a test size 7.8 If the benchm ark is poor, so that the evaluation model is
of a.
m isspecified, then the slope /3 will be m ism easured. As a
7.4 The t-stat is a t-test of the null H0:f3 = 0 against the alterna result, some of the com pensation for system ic risk (which
tive H-\:(3 9^ 0. is captured by (3) may be attributed to skill a.
Solved Problems
7.9 Doing the basic needed calculations:
最新cfafrm加微信286982279
X y x —x y-y 1 .x-xf (x -x )(y -y ) ( y - ( a + /be) ) 2
QUESTIONS
8.2 Both R2 and R2 only increase when adding a new variable. of the nulls H q :/3^ = 0 or H q :/32 = 0 using the t-stat of the
coefficient. Which values of p = C o rr[X 1f X 2] make this
True or false?
scenario more likely?
8.3 W hen is R2 less than 0? Can this measure of fit be larger
than 1 ? 8.6 Suppose you fit a model where both t-statistics are exactly
+ 1.0. Could an F-statistic lead to rejection in this circum
8.4 The value of an F-test can be negative. True or false?
stance? Use a diagram to help explain why or why not.
Practice Questions
8.7 Construct 95% confidence intervals and p-values for the 8.11 Using the data below:
coefficients in the shipping containers industry portfolio
returns.
Trial Number y *1 *2
V; -0 .8 2 0.39
5 -3 .0 8
6 -7.1 -2 .0 8 1.39
W hat is the R2 in a model that regresses Y on X? W hat
7 -4.1 - 1 .0 6 0.75
would the R2 be in a model that regressed X on V?
0.14 -0 .6 3
8.10 A model was estim ated using daily data from the S&P 500
8 0 .0 2
from 1977 until 2017 which included five day-of-the-week 9 -6 .1 3 - 1.6 6 1.31
dummies (n = 10,087). The R2 from this regression was 10 0.74 0 .6 8 -0 .1 5
0.000599. Is there evidence that the mean varies with the
a. A pply O LS linear regression to find the param eters for
day of the w eek?
the following equation:
Y, = a + p-\XVl + (32X 2i + €j
ANSW ERS
8.3 R2 is always less than R2 and so cannot be larger than 1. 8.6 Yes. When the regressors are very positively correlated,
It can be sm aller than 0 if the model explains virtually then this can happen. The t-stats are small because the
nothing and the degree-of-freedom loss is sufficient to variables are highly co-linear, so that the variation in the
push its value negative. left-hand-side variable cannot be uniquely attributed to
8.4 False. An F-test measures the reduction in the sum of either. However, the F-stat is a joint test and so if there is
squared residuals that occurs in the expanded model and an effect in at least one, the Fe an reject even if the t-stats
depends on SSRu — S S R r . This value is always positive do not. See the image below that shows the region where
because the unrestricted model must fit better than the the F would fail to reject, and the two t-stats.
restricted model.
8.5 This is most likely to occur when the regressors are highly
correlated. If the regressors are positively correlated, then
the parameter estimators of the coefficients will be nega
tively correlated. If both values are positive, this would lead
to rejection by the F-test. Similarly, if the regressors were
negatively correlated, then the estimators are positively
correlated and the F will reject if one t is positive and the
other is negative. The figure below shows the case for pos
itively correlated regressors. The shaded region is the area 3 •-------- •-------- --------- 1-------- 1-------- •
where the F would fail to reject. The t-stats are outside this -3 - 2 - 1 0 1 2 3
area even though neither is individually significant.
Solved Problems
8.7 The confidence interval is ft ± 1.96 X se. The p-value is
2(1 — <E>(| 1 1)) where <I>( -) is the standard normal CDF.
Coefficient Estimate t-stat Std. Err. Conf. Int. P-value
8.8 The annualized a is 12a = 12 X 0.517 = 6.204% . The
Pm 1 . 0 1 1 18.63 0.054 [0.905, 0.000 t-stat is unaffected because the standard error is also
1.117]
scaled by 1 2 .
- - 0 .1 6 0.075 [- 0 .1 5 9 , 1.128
Ps 0 .0 1 2
8.9 Because this is a regression with a single explanatory
0.135]
variable, the R2 is the squared correlation. The correla
Pv 0.276 3.51 0.079 [0 . 1 2 1 , 0 .0 0 2
tion is 0 .9 / ^ 3 = .519 and so the R2 = 0.27. The R2 is
0.431]
Yj = /jl + 82D2
S3D3 + 84D4 + S4D5 + Sj, therefore,
+ A verage -2 .8 2 3 -0 .9 1 0 - 0 .1 0 1
1 - 5 .7 6 - 3 .4 8 -1 .3 7 9 -3 .3 0 7 -0 .7 5 0 1.411
3 -0 .2 5 - 0 .5 -1 .0 7 A verage
N>
Trial Number (*! - X t )2 (x2 - x2)2 (x-, - Xt )(y - y) (x2 - x2)(y - y) (*! - )(*2 - * 2)
<<
I
X1(- — S'0
n + 5iX->; + X 1/
1^2/ 2 1.047 -0 .4 0 6 -0 .4 2 5 0.164
X2 - 0 .1 7 2
0.189 3 1.741 -0 .9 1 7 -1 .5 9 6 0.841
0.911
4 -1 .3 7 8 -0 .8 1 6 1.125 0.666
5q = /xXl - ^i Mx2 = -0-91 - 0.189*(—0.101) = -0.929
5 -0 .4 4 0 0.502 -0.221 0.252
Vi ~ To + 71^2/ + 6 -1 .9 0 5 1.341 -2 .5 5 4 1.798
- 1 .4 3 4
- 1 .5 7 4 7 -0 .9 7 4 0.831 -0 .8 0 9 0.691
0.911
8 1.076 -0 .4 1 0 -0 .4 4 2 0.168
7o = Mr - fiM x2 = - 2 .8 2 3 - (—1 .574*—0.101 ) = - 2 .9 8 2
9 -1 .7 8 7 1.315 -2 .3 4 9 1.728
Trial 10 0.338 0.154 0.052 0.024
Number ~2
y xi y*1 xi A verage -1 .0 8 5 0.889
1 - 4 .9 3 4 -2 .8 1 0 13.865 7.896
and
2 2.036 0.792 1.612 0.627
C o v ( K ,X 2) = - 1 .0 8 5 o-| = 0.889
3 1.048 0.227 0.238 0.051
- 1 .0 8 5
4 -1 .3 2 8 0.558 -0.741 0.311 = - 1.221
0.889
f t =
4 -3 .6 1 5 -2 .3 7 3 5.632
C o v (Y ,X J 2.459 5 -1 .0 6 8 0.173 0.030
1.873
1.313 6 -1 .5 0 8 -0 .2 6 7 0.071
Repeating the process for X 2: 7 -1 .2 0 0 0.042 0.002
*2 i = io + + x 2i 8 -0 .6 6 6 0.576 0.331
- 0 .1 7 2 9 -1 .4 2 3 -0.181 0.033
- 0 .1 2 8
1.345
10 -0 .7 1 7 0.525 0.276
£0 = Mx 2 - ^iM x, = -0 .1 0 1 - (- 0 .1 2 8 ) - (- 0 .9 1 ) = - 0 .2 1 7
A verage -1 .2 4 2 0.718
Yj = fjo + + K So a = - 1 .2 4 2 , and the variance of the residuals
2.729 is 0.718.
2.029
1.345
Finally, asserting the presumption of normality: So, the TSS = 75.797 and the ESS = 68.598.
y y (y - y )2 (y - y )2
1 -5 .7 6 0 -6 .0 8 9 8.626 10.664
3 -0 .2 5 0 -0 .8 7 3 6.620 3.802
4 -2 .7 2 0 -0 .3 4 7 0.011 6.131
5 -3 .0 8 0 -3 .2 5 4 0.066 0.185
6 -7 .1 0 0 -6 .8 3 4 18.293 16.085
7 -4 .1 0 0 -4 .1 4 2 1.631 1.741
9 -6 .1 3 0 -5 .9 4 9 10.936 9.774
A verage -2 .8 2 3
QUESTIONS
9.2. W hat conditions are required for a regression to suffer 9.5. W hy does a variable with a large variance inflation factor
from om itted variable bias? indicate that a model may be im proved?
9.3. W hat are the costs and benefits of dropping highly collin- 9.6. The sam ple mean is an O LS estim ator of the model
ear variables? Yj = a + 6,. W hat does the BLU E property imply about
the mean estim ator?
9.4. If you use a general-to-specific model selection with a
test size of a, what is the probability that one or more 9.7. W hat is the strongest justification for using O LS to esti
mate model param eters?
Practice Questions
9.8. In a model with a single explanatory variable, what value there are nonlinear effects of some of these variables,
of the R2 in the second step in a test for heteroskedasticity and so use a R E S E T test including both the squared and
indicates that the null would be rejected for sam ple sizes cubic term . The R2 of the original model is 68.6% , and the
of 100, 500, or 2500? (Hint: Look up the critical values of a R2 from the model that includes both additional term s is
Xq, where q is the number of restrictions in the test.) 68.9% . You have 456 observations. W hat do you conclude
about the specification of the model?
9.9. Suppose the true relationship between Y and two explan
atory variables is Yj = 2 + 1.2X-I,- — 2.1X2/ + e/- 9.12. You are investigating the determ inants of book leverage
of firms. Your model includes an intercept and four addi
a. W hat is the population value of (3-\ in the regression
tional variables: the natural logarithm of sales revenue, the
Yj = a + (3-jX-1 + €j if pxiX2 = 0-6 and ox = 1 and
book-to-market ratio, the ratio of EBlTD A-to-book assets,
<>i = 1 / 2 ?
and Net property-plant-and-equipm ent (Net PPE). Your
b. W hat value of pXlx2 would make f3-\ = 0?
initial model groups firm s in all industries. Suppose you
9.10. In a model with two explanatory variables,
want to test w hether the four slope coefficients are con
Y-, = a + jS-|X1( + /32X 2; + €„ what does the correlation
stant across energy sector firms.
need to be between the two regressors, X | and X 2, for the
variance inflation factor to be above 10? a. W hat additional model would you estim ate to im ple
ment the Chow test?
9.11. You are interested in understanding the determ inants of
b. If your sam ple has 433 observations, what is the distri
the yield spread of corporate bonds above a maturity
bution of the Chow test?
matched sovereign bond. You include three explanatory
c. If the R2 in the original specification is 16.6% , how
variables: the leverage defined as the ratio of long-term
large would the R2 have to be in the regression with
debt to the book value of assets, a dummy variable for
the dummy interactions for the null to be rejected
high yield, and a measure of the volatility of the profit
using a 5% test size?
ability of the issuer. You are interested in testing whether
9.13. G iven the data set: where the data was used for fitting the model versus the
blue indicating the data that was held back):
Y X 1 X2
M1 m2 m3
1 -2 .3 5 3 -0 .4 0 9 -0 .0 0 8
6 -0 .7 3 5 - 1 .3 9 -1 .0 8 6 5 -1 .1 - 0 .6 1.3
9 -2 .4 1 9 -0 .1 5 6 -0 .0 5 5 8 0.9 -1 - 0 .4
15 -7 .0 9 5 0.284 2.622 Y X
-10
i----------------------1---------------------- 1---------------2 0 — a—
ANSW ERS
Solved Problems
9.8. W hen there is a single explanatory variable in the original b. The om itted variable formula shows that when a vari
m odel, the auxiliary regression used for W hite's test will able is excluded, the coefficient on the included vari
have an intercept and two explanatory variables— the ables is /3-i + y/32, where y is the population regression
original variable and the variable squared. W hite's test coefficient from the regression X 2( = 8 + y X i; + I?;.
statistic is nR2, and the null has a X 2 distribution. When This coefficient depends on the correlation between
using a test with a 5% size, the critical value is 5.99. Solv the two and the standard deviations of the two vari-
5.99 ox, VT72
ing for the R2, R2 < . The maximum value of R2 that ables, so that y = pXlx2— = 0-6 = 0.424. The
n
would not reject is 0.0599, 0.011, and 0.0023 when n is coefficient is then 1.2 — 2.1 X 0.424 = 0.309.
100, 500, and 2500, respectively.
p X V T X VTT2
Solving 1.2 — 2.1 X 0 so that
9.9. a. Using the om itted variable form ula, when 1
X 2 is om itted from the regression,
if p = ---------- — = 0.522 then 13i would be 0 in the
<0 /0.6 x VT x V T 2 \ 2.1 X V T 2
f t = 1.2 - 2.1 x ( ----------- ------------ j = - 0 .1 8 ,
om itted variable regression.
, 0.6 x VT x VT2 . .
w h e re ----------- ------------ is the regression slope from
a regression of X 2 on X-|.
Y X1 (Y -Y ) ( X ,—X . H Y - Y ) ( X ,- X ,) 2
1 -2 .3 5 3 -0 .4 0 9 -0 .4 0 9 -0 .3 5 3 0.145 0.167
3 -1 .6 6 5 -0 .8 5 6 -0 .8 5 6 0.335 -0 .2 8 7 0.733
6 -0 .7 3 5 - 1 .3 9 -1 .3 9 0 1.265 -1 .7 5 8 1.932
9 -2 .4 1 9 -0 .1 5 6 -0 .1 5 6 -0 .4 1 9 0.065 0.024
11 -2 .0 8 5 -0 .5 6 2 -0 .5 6 2 -0 .0 8 5 0.048 0.316
12 -2 .9 7 2 -1 .5 5 4 -1 .5 5 4 -0 .9 7 2 1.511 2.415
13 -0 .6 3 3 -1 .1 2 3 -1 .1 2 3 1.367 -1 .5 3 5 1.261
14 -2 .6 7 8 -0 .1 2 4 -0 .1 2 4 -0 .6 7 8 0.084 0.015
Average -2 .0 0 0 0 .0 0 0 0 .0 0 0 0.933
Therefore,
0.000
0.000
0.933
And, of course,
a = Y - f r X , = - 2 .0 0 0
Y = -2.000 最新cfafrm加微信286982279
b. Proceeding as in part a gives:
Y X2 (X 2 - X 2) (Y -Y ) (X 2 - X 2) CY - Y ) (X 2 - x 2f
1 -2 .3 5 3 -0 .0 0 8 -0 .0 0 8 -0 .3 5 3 0.003 0.000
2 -0 .1 1 4 -1 .2 1 6 -1 .2 1 6 1.886 -2 .2 9 3 1.478
6 -0 .7 3 5 -1 .0 8 6 -1 .0 8 6 1.265 -1 .3 7 3 1.179
9 -2 .4 1 9 -0 .0 5 5 -0 .0 5 5 -0 .4 1 9 0.023 0.003
11 -2 .0 8 5 -0 .1 3 5 -0 .1 3 5 -0 .0 8 5 0.012 0.018
12 -2 .9 7 2 -0 .2 9 9 -0 .2 9 9 -0 .9 7 2 0.291 0.089
13 -0 .6 3 3 -1 .0 2 7 -1 .0 2 7 1.367 -1 .4 0 4 1.055
A ve ra g e -2 .0 0 0 0 .0 0 0 -1 .3 8 1 0.934
- 1 .3 8 1
- 1 .4 7 9
0.934
a = Y - (32X 2 = - 2 .0 0 0
y = -2 .0 0 0 - 1 . 4 7 9 X 2
c. The key statem ent from the chapter is "the estim ator
/3-| converges to (3-\ + (328 "
/3i + (328^ = 0
Plugging back in the 2 X 2 system of equations: d. All approaches should provide the same answer.
f t + 0.501 f t = 0
0 .5 0 0 ft + f t = - 1 .4 7 9
M1 M2 M3 M1 M2 M3
7 0.6 0 0 0.36 0 0
The model selected is the one that has the sm allest RSS Y1 is the estim ate using the entire data set
within the blue out-of-sample boxes— this is M3.
Y2 is the estim ate using the first nine observations
9.15. a. Following the m ethodology of the first exam ple:
So the Cook's distance for the last data point is
106.552
Alpha Beta 0.61
2 *8 7 .7 8 3
Entire 3.26 3.49 b. The last data point does not have a major influence on
First nine 0.10 2.96 the O LS param eters. However, it may still be an out
lier, and other tests would need to be conducted to
Com pleting the table: draw such a conclusion.
Y X Y1 Y2 (Y1 —Y2)A2 Y 1 -Y
2 -1 4 .1 2 -4 -1 0 .6 9 2 -1 1 .7 4 5 1.107 3.428
3 -1 4 .7 2 -3 -7 .2 0 4 -8 .7 8 3 2.491 7.516
5 2.3 -1 -0 .2 2 8 -2 .8 5 8 6.919 -2 .5 2 8
Sum 106.552
SumSq/10 87.783
QUESTIONS
10.5. W hat are the key features of the A C F and PA C F of an 10.9. W hat steps should be taken if the A C F and/or PA C F of
model residuals do not appear to be white noise?
MA(q) process?
Practice Questions
10.10. In the covariance-stationary AR(2), Yt = 0.3 + c. Calculate the tim e series for the following ARM A(1,1)
1.4Yt_-| — 0 .6 Y t_ 2 + et, where et ~ WN(0, a 2), what is m odel, taking e0 = 0, q0 = 0, rj = 0.5, and
the long-run mean E[Yt] and variance V[Yt]? (f> = 0 = 0.375:
10.11. If (1 — 0.4L)(1 — 0 .9 L4)Y t = et, what is the form of this 10.14. In the MA(2), Yt = 4.1 + 5et_-| + 6.25et_2 + et, where
model if written as a standard AR(5) with Yt_ 1f. . . , Yt_ 5 et ~ WN[0, a 2), what is the A C F ?
on the right-hand side of the equation?
10.15. Suppose all residual autocorrelations are 1 .5 / V ^ where
10.12. For the equation: T is the sam ple size. Would these violate the confidence
15 5 3 bands in an A C F plot?
10.13. G iven the following data for a set of innovations: with 100 observations. W ould a Ljung-Box Q statistic
reject its null hypothesis using a test with a size of
et 5%? W ould the test reject the null using a size of 10%
1 -0.58 or 1%?
10 -0.88 Which model is selected by the A IC and BIC if the sample size
T = 250?
a. Calculate the tim e series for the following AR(1)
model, taking y0 = 0, S = 0.5, and </> = 0.75: 10.18. The 2018Q 4 value of real G D P growth is
b. Calculate the tim e series for the following MA(1) RGDPGj - t = 2.793. W hat are the forecasts for 2019Q1
model, taking e0 = 0, ju = 0.5, and 0 = 0.75: - 2019Q 4 using the AR(2) model in Table 10.1?
ANSWERS
Solved Problems
10.10. Because this process is covariance-stationary 10.13. a. Yt — 8 + 4>Yt_i+et
0.3
E [Y t] = A = = 1.5 yt
1 - 1.4 - (0.6)
0 .3 : 1 -0 .0 8
V [Y t] = y 0 = = 0.45
1 - 1.4 - (0.6) 2 2.06
6 1.64
10.12.1 — — f c L 2 = 1 - f l + fL*
7 2.54
8 1.76
9 2.67
10 1.62
Each value is generated via the form ula: Each value is generated via the form ula:
.2
and y 2 = 6cr . The autocorrelations are then
Zt Po = 1, py = 0.557 and p2 = 0.096.
6 0.19 Qlb = T 2 ( ) P?
/= 1 T — i
7 1.70
= 1001 ^ ) (0.24)2 + 100 [ ^ j (- 0 .0 4 )2 + 100 ( ^ ) (0.08)2
8 0.46 99 98 97
10 0.26 The critical values for tests sizes of 10%, 5%, and 1% are
6.25, 7.81, and 11.34, respectively. (CHISQ.INV(1 -a, 5)
Each value is generated via the form ula: in Excel, where a is the test size). The null is only be
rejected using a size of 10%.
Zy = 0.5 + 0.75 * 0 - 0.58 = - 0 .0 8
10.17. The ICs and the selected model in bold:
z2 = 0.5 + 0 .7 5 (—0.58) + 1.62 = 1.69
9 0.41
10 0.47
10.18. All forecasts are recursively com puted starting with the The three- and four-step depend entirely on other
first: forecasts:
1.765 + 0.319 X 2.564 + 0.114 X 2.793 = 2.90. 1.765 + 0.319 X 2.98 + 0.114 X 2.90 = 3.05.
The two-step forecast uses the one-step forecast: 1.765 + 0.319 X 3.05 + 0.114 X 2.98 = 3.08.
QUESTIONS
Practice Questions
3
11.6 A Iinear tim e trend model is estim ated on annual real 11.8 The seasonal dummy model Yt = 8 + 2 ; = 1 Yjljt + et is
euro-area GDP, measured in billions of 2010 euros, estim ated on the quarterly growth rate of housing starts,
using data from 1995 until 2018. The estim ated model is and the estim ated param eters are y i = 6.23, 72 = 56.77,
RG D P t = —234178.8 + 121.3 X t + et. The estim ate of 73 = 10.61, and 8 = —15.79 using data until the end
the residual standard deviation is a = 262.8. Construct of 2018. W hat are the forecast growth rates for the four
point forecasts and 95% confidence intervals (assuming quarters of 2019?
Gaussian white noise errors) for the next three years. Note
11.9 A D F tests are conducted on the log of the ten-year
that t is the year, so that in the first observation, t = 1995, US governm ent bond interest rate using data from
and in the last, t = 2018.
1988 until the end of 2017. Th e results of the A D F
11.7 A log-linear trend model is also estim ated on annual euro with different configurations of the determ inistic term s
area G D P for the same period. The estim ated model is are reported in the table below. The final three co l
In R G D P t = —18.15 + .0136 t + et, and the estim ated umns report the num ber of lags included in the test
standard deviation of et is 0.0322. Assum ing the shocks as selected using the A IC and the 5% and 1% critical
are normally distributed, what are the point forecasts of values that are appropriate for the sam ple size and
G D P for the next three years? How do these compare included determ inistic term s. Do interest rates contain
those from in the previous problem ? a unit root?
Deterministic r So Lags 5% CV 1% CV
None - 0 .0 0 3 7 - 1 .9 4 2 - 2 .5 7 2
( - 1 .6 6 6 )
Constant - 0 .0 0 9 0 .0 1 0 4 - 2 .8 7 0 - 3 .4 4 9
(-1 .4 2 5 ) (1.027)
Trend - 0 .0 8 5 0.188 - 0 .0 0 0 3 - 3 .4 2 3 - 3 .9 8 4
11.10 An AR(2) process is presented: The data is to be evaluated using the following regres
4 -0.81
5 -0.31
6 -0 .3 7
7 0.47
8 1.10
9 0.53
10 0.47
11 1.00
12 1.68
13 0.90
14 1.99
15 1.31
16 0.76
17 0.43
18 -1 .0 1
19 0.27
20 -2.21
21 -3.21
22 - 1 .9 0
23 -0 .5 3
24 - 0 .6 0
25 0.34
ANSWERS
EA yr+ h l = E T[e x p (g (T + h) + eT+h)] = exp (g (T + h) 11.3 The time trend becom es apparent as the series is propa
+ Eyfexp eT+h)j, gated backwards:
Solved Problems
11.6 Note that there is no AR or M A com ponent, so the vari And the error bounds on the In are + /— 1.96*0.0322, so
ance remains constant. Therefore, the 95% confidence the bounds are given in proportional term s rather than
interval is + / — 1.96*262.8 = + / — 515.1 about the fixed values.
expected value. Bounds_M ultiplier = exp( ± 1.96 * 0.0322)
A s for the expected means: = e xp (± 0.0631) = 0.939,1.065
E [R G D P 2o i 9] = -2 3 4 1 7 8 .8 + 121.3 X 2019 = 10,725.9 Calculating E[ln R G D P J:
E [R G D P 202o ] = -2 3 4 1 7 8 .8 + 121.3 X 2020 = 10,847.2 E[ln R G D P2019] = - 1 8 .1 5 + 0.0136 * 2019 = 9.308
E [R G D P 202-\] = -2 3 4 1 7 8 .8 + 121.3 X 2021 = 10,968.5
E[ln R G D P2020] = - 1 8 .1 5 + 0.0136 * 2020 = 9.322
11.7 In this case: E[ln R G D P202i ] = - 1 8 .1 5 + 0.0136 * 2020 = 9.336
a
ET[YT] = exp ET[ln Y t ] + Furtherm ore,
0.03222
0.0005
2
(which will only make a small im pact in this exam ple) For the trend m odel, both of these have t-stats with
absolute values > 4, well within the bounds of statisti
So:
cal significance at even the 99% level. Accordingly, the
E [R G D P 2o i 9] = exp(9.308 + 0.0005) = 11,031.4
proper model to study is the last one.
E [R G D P 2 0 2 0 ] = exp(9.322 + 0.0005) = 11,186.9
For this m odel, the t-stat on -y is to the right of the 1%
E[R G D P 202i ] = exp(9.336 + 0.0005) = 11,344.6 CV — therefore, the null hypothesis of having a unit root
And the 95% confidence bands are given as: is rejected at the 99% confidence level. Note that if the
proper model were either the constant or no-trend then
95% ^CBo_ = [0 .9 3 9 * 11031.4,1.065 * 11031.4]J
d RGDP 2019 1 '
the null hypothesis would not be rejected.
= [10,358.5,11,748.4]
11.10 a. G iven that the equation has a unit root means that the
95% c b RGDP2020 = [10,504.5,11,914.0] factorization of the characteristic equation must be of
95% c b RGDp2019 = [10,652.6,12,082.0] the form:
In comparison with #1, the bands are growing in size and (z - 1)(z — c) = z2 - (c + 1)z + c,
overall the results are a bit bigger.
where 0 < < 1
1 1 .8 Though there is variance from quarter-to-quarter, the
The characteristic equation for the original relation
expected value of Y t is the same for any two observa
ship is
tions of the same quarter, regardless of the year.
z2 - k i
Accordingly:
r n
t yt yt - 1 A yt yt - 1 * A y t y?-i
1 -1 .8 1 0 -1.81 0.00 0.00
23 -0 .5 3 - 1 .9 0 1.37 - 2 .6 0 3.61
24 - 0 .6 0 -0 .5 3 -0 .0 7 0.04 0.28
_ 1 x,y, _ - 1 2 .5 0
- 0 .2 7 3
m “ S x ,2 “ 45.76
t y t y t - 1 A yt y t - 1 * A yt yf-i
1 -1.81 0 -1.81 0.00 0.00 -1.81
Using the STD EV function in E X C E L , As the t-stat is less than the 5% point on the D F distri
The standard error of this is bution, the null is rejected at a 95% confidence level,
but the t-stat is not sufficiently negative to enable
(T, 0.95
= 0.140 rejection at the 99% confidence level.
nay<-i 5 * 1.36
So the t-statistic is
m - 0 .2 7 3
t - stat = = - 1 .9 6
(T,m 0.140
QUESTIONS
Practice Questions
12.4. On Black Monday in O ctober 1987, the Dow Jo n es Indus 1 2 .1 1 . The following data is collected for four distributions:
trial A verage fell from the previous close of 2,246.74 to
1,738.74. W hat was the sim ple return on this day? W hat Dataset Skew Kurtosis T
12.5. If the annualized volatility on the S&P 500 is 20% , what is B 0.85 3.00 51
the volatility over one day, one w eek, and one month? C 0.35 3.35 125
12.6. If the mean return on the S&P 500 is 9% per year, what is D 0.35 3.35 250
the mean return over one day, one w eek, and one month?
W hich of these datasets are likely (at the 95% confidence
12.7. If an asset has zero skewness, what is the maximum kurto-
level) to not be drawn from a normal distribution?
sis it can have to not reject normality with a sam ple size of
100 using a 5% test? W hat if the sam ple size is 2,500? 12.12. Calculate the rank correlations for the following data:
12.8. If the skewness of an asset's return is —0.2 and its kurtosis 1
•
X Y
is 4, what is the value of a Jarque-Bera test statistic when
1 0.22 2.73
T = 100? W hat if T = 1,000?
2 1.41 6.63
12.9. Calculate the simple and log returns for the following data:
3 - 0 .3 0 - 2 .1 9
Time Price 4 - 0 .5 9 -6.51
0 100 5 -3 .0 8 -0 .9 9
1 98.90 6 1.08 2.63
2 98.68 7 -0 .4 5 - 3 .4 0
3 99.21 8 0.40 5.10
4 98.16 9 -0 .7 5 - 5 .1 4
5 98.07 10 0.24 1.14
6 97.14
12.13. Find Kendall's Tau for the following data:
7 95.70
X V
•
8 96.57 J
1 3.12 2.58
9 97.65
2 - 1 .2 6 -0 .0 5
10 96.77
3 2.08 -0 .7 2
12.10. The implied volatility for an ATM money option is
4 -0 .2 8 -0 .5 2
reported at 20% , annualized. Based upon this, what
would be the daily implied volatility? 5 - 1 .9 6 - 0 .4 0
ANSW ERS
12.2. Yes. Linear correlation is sensitive in the sense that mov 12.3. Covariance may have either sign, and so the square
ing a single point in X 1f X 2 space far from the true linear root transform ation is not reliable if the sign is negative.
relationship can severely affect the correlation. Effec Transformation to (3 or correlation is preferred when
tively a single outlier has an unbounded effect on the exam ining the magnitude of a covariance.
covariance, and ultimately the linear correlation. Rank
Solved Problems
^7 2 8 7 4 _ 2246 74 12.9. S imple returns use the form ula:
12.4. The simple return i s ------- ------- = —22.6% . The
2246.74
P t ~ P t-
log return is In 1738.74 - In 2246.74 = 25.6% . The Rt =
P t-
difference between the two is increasing in the
m agnitude. Log returns use the form ula:
12.5. The scale factors for the variance are 252, 52, and 12, Pt
rt = In Pt — In Pt_ 1 = In
P t-
respectively. The volatility scales with the square root of
i i r i 20% 20%
the scale factor, and so — ----- = 1.26% , — — = 2.77% , Time Price Simple Log
V252 \/5 2
20 % 0 100
and — 7 = = 5.77% .
V12
1 98.90 -1 .1 0 % -1 .1 1 %
12.6. The mean scales linearly with the scale factor, and so
9% 9% . . 9% . 2 98.68 -0 .2 2 % -0 .2 2 %
— — = .036% , —— = 0.17% , and —— = 0.75% .
252 52 12 3 99.21 0.54% 0.54%
12.7. The Jarque-Bera has X 2 distribution, and the criti
4 98.16 -1 .0 6 % -1 .0 6 %
cal value for a test with a size of 5% is 5.99. The
f S 2 (k - 3)2 5 98.07 -0 .0 9 % -0 .0 9 %
Jarque-Bera statistic is J B = (T - 1)( — -I-----— —
6 97.14 -0 .9 5 % -0 .9 5 %
so that when the skewness S = 0, the test statistic is
7 95.70 -1 .4 8 % -1 .4 9 %
(k - 3)2
(T — 1)— —— . In order to not reject the null, we need 8 96.57 0.91% 0.90%
24
(k - 3)2 5.99 9 97.65 1 .12% 1.11%
( T - 1) < 5.99, and so (k — 3)2 < 24 X
24 T - 1
10 96.77 -0 .9 0 % -0 .9 1 %
5.99
and k —3 T .»/ 24 X . W hen T = 100, this value
T - 1 12.10. Using the equation
is 4.20. W hen T = 2,500 this value is 3.24. This shows
®annual = V 2 5 2 X o Ldaily
;
that the J B test statistic is sensitive to even mild excess
kurtosis. 0.2 = V 2 5 2 X daily
12.8. Using the formula in the previous problem , the value of 0.2
&daily = 1.26%
the J B is 4.785 when T = 100 and 48.3 when T = 1000. V 2 5 2
12.11. The appropriate test statistic is the Jarque-Bera statistic. 12.13. The first step is to rank the data:
1 X Y Ry/
•
R x,
1 3.12 2.58 5 5
This will be distributed as ar|d for a 5% test size the
critical value is 5.99 2 - 1 .2 6 -0 .0 5 2 4
3 2.08 -0 .7 2 4 1
Dataset Skew Kurtosis T JB
4 -0 .2 8 -0 .5 2 3 2
A 0.85 3.00 50 5.90
5 - 1 .9 6 - 0 .4 0 1 3
B 0.85 3.00 51 6.02
The next step is to see which pairs are concordant, which
C 0.35 3.35 125 3.16
are discordant, and which are neither.
D 0.35 3.35 250 6.35
W hich leads to the pair-by-pair com parison:
•
1 X Y R x, Ry; RX , ~ R Xi
1 0.22 2.73 6 8 - 2
2 1.41 6.63 10 10 0 For the first observation, because both the rank of X & Y
3 - 0 .3 0 - 2 .1 9 5 4 1 are the maximum, every other observation will autom ati
cally be concordant with respect to this one.
4 - 0 .5 9 -6.51 3 1 2
By contrast, looking at the second and third observa
S -3 .0 8 -0 .9 9 1 5 -4
tions, the X rank increases while the Y rank decreases.
6 1.08 2.63 9 7 2 Therefore, this pair is discordant.
7 -0 .4 5 - 3 .4 0 4 3 1 „ _ nc - nd _ 5 -5 _
8 0.40 5.10 8 9 -1 T n(n - 1)/2 10
9 -0 .7 5 - 5 .1 4 2 2 0
10 0.24 1.14 7 6 1
Sum Squares 32
n(n 2 - 1) -
QUESTIONS
Practice Questions
13.9 Suppose you are interested in approximating the a. Calculate the option payoffs using the equation
expected value of an option. Based on an initial sample M ax(x — 0.5,0).
of 100 replications, you estimate that the fair value of the b. W hat are the antithetic counterparts for each sce
option is USD 47 using the mean of these 100 replications. nario and their associated values?
You also note that the standard deviation of these 100 c. Assum ing that the sam ple correlation holds for the
replications is USD 12.30. How many simulations would entire population, what is the change in expected
you need to run in order to obtain a 95% confidence inter sam ple standard error attainable by using antithetic
val that is less than 1% of the fair value of the option? How sam pling?
many would you need to run to get within 0.1% ? d. How does the answer to part c change if the payoff
13.10 Plot the variance reduction when using antithetic random function is Max(| x — 0.5 |, 0)?
N(0,1)
1 - 0 .6 5
2 - 0 .1 2
3 0.92
4 1.28
5 0.63
6 - 1 .9 8
7 0.22
8 0.4
9 0.86
10 1.74
ANSW ERS
Solved Problems
1 3 .9 The standard deviation is USD 12.30, and a 95% confi = 0.47 => V n = 2 x 1•0 4 7x 1230 = 102.5 (so 103). Using
dence interval is [/x — 1.96 X A + 1.96 X and .1% , we would need 1,025.8 (replace 0.47 with 0.047)
13.10
13.
13.12 Essentially, the question is to determ ine the cumulative b. Recall that for a N(0,1) distribution:
distribution up to 1.2 on an N(1,5) distribution.
Fxiu rf = —Fx( 1 - Ut )
This can be done with the Excel function
Accordingly, the table is populated as follows:
N O RM D IST(1.2,1 ,sqrt(5),True) and returns the value 0.535.
13.13 a. The formula is simply im plem ented in Excel. For the Option Antithetic Option
first entry: N(0,1) Payoff Sample Payoff
M a x (-0 .6 5 - 1,0) = 0
1 - 0 .6 5 0 0.65 0.15
1 - 0 .6 5 0 3 0.92 0.42 - 0 .9 2 0
2 - 0 .1 2 0 4 1.28 0.78 - 1 .2 8 0
6 - 1 .9 8 0 8 0.4 0 - 0 .4 0
9 0.86 0.36
10 1.74 1.24
c. Using Excel, the correlation is —27% . Therefore, the The correlation between the two option value vec
reduction in error is tors is + 25% , so the IN C R EA S E in standard error is
1 - V i + p = 15% V i T p - 1 = 12%.