This action might not be possible to undo. Are you sure you want to continue?
https://www.scribd.com/doc/62407690/FEbook1MAIN
01/08/2013
text
original
The BIGPicture in a LITTLE Book
James A. Primbs
September 17, 2010
2
Contents
1 Basic Building Blocks and Stochastic Diﬀerential Equation Models 1
1.1 Brownian Motion and Poisson Processes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.1.1 Gaussian Random Variables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.1.2 Brownian Motion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.1.3 Poisson Random Variables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.1.4 Poisson Process . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.1.5 Increments of Brownian Motion and Poisson Processes . . . . . . . . . . . . . . . . . . 3
1.2 Stochastic Diﬀerential Equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.2.1 Diﬀerentials . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.2.2 The Diﬀerential . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.2.3 Compound Poisson Process . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.2.4 Ito Stochastic diﬀerential equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.2.5 Poisson Driven Diﬀerential Equations . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
1.3 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1.4 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
2 Ito’s Lemma 11
2.1 Ito’s Lemma . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
2.1.1 The chain rule of ordinary calculus . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
2.1.2 Ito’s lemma for Brownian motion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
2.1.3 Replacing dz
2
by dt . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
2.1.4 Discussion of Ito’s lemma . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
2.2 Ito’s lemma for Poisson Processes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
2.2.1 Interpretation of Ito’s lemma for Poisson . . . . . . . . . . . . . . . . . . . . . . . . . . 15
2.3 More versions of Ito’s Lemma . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
2.3.1 Ito’s Lemma for Compound Poisson Processes . . . . . . . . . . . . . . . . . . . . . . . 16
2.3.2 Ito’s Lemma for Brownian and Compound Poisson Processes . . . . . . . . . . . . . . 16
2.3.3 Ito’s Lemma for vector processes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
2.4 Ito’s lemma, the product rule, and a rectangle . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
2.5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
2.6 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
3 Standard Stochastic Diﬀerential Equations with Solutions 21
3.1 Geometric Brownian Motion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
3.1.1 Stock Price Interpretation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
3.2 Geometric Poisson Motion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
3.2.1 A conditional lognormal version of geometric Poisson Motion . . . . . . . . . . . . . . 23
3.3 A jumpdiﬀusion model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
3.4 A more general SDE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
3.4.1 The OrnsteinUhlenbeck Process and Mean Reversion . . . . . . . . . . . . . . . . . . 24
i
ii CONTENTS
3.5 CoxIngersollRoss Process . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
3.6 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
3.7 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
4 The Factor Approach to Arbitrage Pricing 29
4.1 The Factor Approach to Arbitrage Pricing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
4.2 Returns and Factors Models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
4.2.1 Returns . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
4.2.2 Stochastic diﬀerential equations and factor models . . . . . . . . . . . . . . . . . . . . 30
4.3 The Factor Approach to Arbitrage using Returns . . . . . . . . . . . . . . . . . . . . . . . . . 31
4.3.1 Arbitrage . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
4.3.2 Null and Range Space Relationship . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
4.3.3 A Useful Absence of Arbitrage Condition . . . . . . . . . . . . . . . . . . . . . . . . . 32
4.3.4 Interpretations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
4.3.5 A Problem with Returns . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
4.4 The Factor Approach using Price Changes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
4.4.1 Price Changes and Arbitrage . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
4.4.2 Proﬁt/Loss and Arbitrage . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
4.5 Two standard examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
4.5.1 Stocks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
4.5.2 Futures contracts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
4.6 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
4.7 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
5 Constructing a Factor Pricing Framework 41
5.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
5.2 A Classiﬁcation of Quantities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
5.2.1 Factors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
5.2.2 Underlying Variables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
5.2.3 Tradables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
5.2.4 A Derivative is a Tradable . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
5.3 Factor Models for Underlying Variables and Tradables . . . . . . . . . . . . . . . . . . . . . . 43
5.3.1 Direct Factor Models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
5.3.2 Factor Models via Ito’s Lemma . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
5.4 Tradables tables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
5.5 Applying the Price APT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
5.5.1 Relative Pricing and Marketed Tradables . . . . . . . . . . . . . . . . . . . . . . . . . 44
5.5.2 Pricing the Derivative . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
5.5.3 Underdetermined and Overdetermined Systems . . . . . . . . . . . . . . . . . . . . . . 45
5.6 Three Step Procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47
5.7 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
5.8 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
6 Application of the Factor Form: Equity Derivatives 49
6.1 Examples from Equity Derivatives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
6.1.1 BlackScholes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
6.1.2 Dividend Paying Stocks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
6.1.3 Cash Dividends . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
6.1.4 Poisson Processes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
6.1.5 Options on Futures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
6.1.6 Jump diﬀusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
6.1.7 Exchange one asset for another . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
CONTENTS iii
6.1.8 Stochastic volatility . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
6.2 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
7 Application of the Factor Form:
Interest Rate and Credit Derivatives 63
7.1 Notation and the Money Market Account . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
7.2 Interest Rate Derivatives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
7.2.1 Single Factor Short Rate Models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
7.2.2 MultiFactor Short Rate Models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
7.2.3 HeathJarrowMorton . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
7.2.4 The LIBOR Market Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
7.3 Credit Derivatives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
7.3.1 Defaultable Bonds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
7.3.2 Defaultable Bonds with Random Intensity of Default . . . . . . . . . . . . . . . . . . . 73
7.4 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74
8 Hedging 77
8.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
8.2 Hedging from a Factor Perspective . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
8.2.1 Description Using a Tradables Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78
8.2.2 The Relationship Between Hedging and Arbitrage . . . . . . . . . . . . . . . . . . . . 78
8.2.3 Hedging Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79
8.2.4 Hedging under Incompleteness . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
8.2.5 A Question of Consistency . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82
8.3 Hedging from an Underlying Variable Sensitivity Perspective . . . . . . . . . . . . . . . . . . 83
8.3.1 BlackScholes Hedging . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
8.3.2 Hedging Bonds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84
8.3.3 Derivatives imply Small Changes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84
8.4 Higher Order Approximations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85
8.4.1 The Greeks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85
8.4.2 A DeltaGamma Hedge . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85
8.4.3 Determining what the error looks like . . . . . . . . . . . . . . . . . . . . . . . . . . . 86
8.5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
8.6 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
9 The Road to Risk Neutrality 89
9.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
9.2 Do the Factors Matter? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
9.2.1 Brownian Factors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90
9.2.2 Poisson Factors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
9.3 Risk Neutral Representations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92
9.3.1 Brownian Factors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92
9.3.2 Poisson Factors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93
9.4 Pricing as an Expectation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94
9.5 Applications of Risk Neutral Pricing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95
9.5.1 How to Apply Risk Neutral Pricing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95
9.5.2 BlackScholes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95
9.5.3 Poisson Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96
9.5.4 HJM . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96
9.5.5 Libor Market Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98
9.6 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98
9.7 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98
iv CONTENTS
Notation
• R  the real line (−∞, ∞).
• R
+
 the nonegative real line [0, ∞).
• S(t) Stock price at time t.
• S
t

∂S
∂t
Partial derivative of S with respect to t.
• S
x

∂S
∂x
Partial derivative of S with respect to x.
• S
xx

∂
2
S
∂x
2
Second partial derivative of S with respect to t.
• z(t)  Brownian motion.
• π(t; λ)  Poisson Process with intensity λ.
• x
T
 the transpose of a vector x.
• A
T
 the transpose of a matrix A.
• B
0
(t)  the time t value of the money market account.
• B(tT)  time t price of a zero coupon bond with maturity T and face value $1 at time t
• B(tT
1
; T
2
)  time t forward price of a zero coupon bond with maturity T
2
and face value $1 when
delivery of the forward contract is at time T
1
.
• r  a vector of returns.
• r
0
 a constant risk free rate of interest (when allowed to be a function of time, see the short rate
process below).
• r
0
(t)  the instantaneous short rate process at time t.
• r(ts)  the instantaneous forward rate at time t between times s and s +ds.
• R(tT)  spot rate for time T −t at the current time t.
• R(tT
1
; T
2
)  forward interest rate between time T
1
and T
2
at time t.
• F(tT)  time t forward price for contract with delivery at time T.
• f(tT)  time t futures price for contract with delivery at time T.
• S(t{T
i
})  time t swap rate for swap dates {T
i
}.
• x(t−)  limit from the left: x(t−) = lim
h↑t
x(h).
• x
−
 notation for limit from the left: x(t−).
• λ  market price of risk.
v
vi CONTENTS
Chapter 1
Basic Building Blocks and Stochastic
Diﬀerential Equation Models
This chapter contains an introduction to the basic mathematics required for derivative pricing and ﬁnancial
engineering. We provide building blocks for modeling assets in the form of Brownian motion and Poisson
processes. With these two building blocks we create more complicated models by using Brownian motion and
Poisson processes to drive diﬀerential equations (which are then known as stochastic diﬀerential equations).
The presentation here is tutorial and heuristic. However, don’t let that fool you. If you gain intuition from
it, then you have received a powerful tool to add to your toolbox for problem solving.
1.1 Brownian Motion and Poisson Processes
Brownian motion and Poisson processes are our fundamental building blocks for creating models of asset
prices. The key features are that Brownian motion has continuous sample paths (with probability 1), and
Poisson processes jump! We begin with Brownian motion which is built on the Gaussian random variable.
1.1.1 Gaussian Random Variables
An ndimensional Gaussian (Normal) random variable is a random variable with density function:
X ∼ f
X
(x) =
1
(2π)
n/2
Σ
1/2
exp
−
1
2
(x −µ)
T
Σ
−1
(x −µ)
(1.1)
where µ ∈ R
n
is the mean and Σ ∈ R
n×n
is the covariance matrix:
µ = E[X], (1.2)
Σ = E[(X −µ)(X −µ)
T
]. (1.3)
1.1.2 Brownian Motion
Brownian motion (also known as a Wiener Processes) is a stochastic process built upon the Gaussian random
variable as follows. A realvalued stochastic process z(t) : t ∈ R
+
is a Brownian Motion if:
1. z(0) = 0.
2. z(t) −z(s) ∼ N(0, t −s) for t > s.
3. z(t
2
) −z(t
1
), z(t
3
) −z(t
2
), . . . , z(t
n
) −z(t
n−1
) are independent for t
1
≤ t
2
≤ · · · ≤ t
n
.
1
2CHAPTER 1. BASIC BUILDINGBLOCKS ANDSTOCHASTIC DIFFERENTIAL EQUATIONMODELS
You should remember the following facts about Brownian motion, as they make Brownian motion an ideal
building block for unpredictable but continuous asset price movements:
• There exists a version of Brownian motion that has continuous sample paths.
• Brownian motion is nowhere diﬀerentiable with probability 1.
The ﬁrst property says that Brownian motion is appropriate for price processes that don’t jump. In many
cases, price processes do jump, hence we will need to introduce the Poisson process next to model jumps.
The second property can be interpreted in the context of predictability. If a curve is diﬀerentiable at a
point, then that means that locally it can be approximated by a line, with the slope of the line being the
derivative of the curve at that point. But this means that we can predict (to order dt) the future value of
the curve. In ﬁnance, we often want to assume that we cannot predict future prices. Nondiﬀerentiability
indicates that in the sense mentioned above, future prices are not predictable.
Therefore, Brownian motion is an ideal building block upon which to build asset price processes. A
sample path of Brownian motion is given in Figure 1.1.
0 0.2 0.4 0.6 0.8 1
−1
−0.8
−0.6
−0.4
−0.2
0
0.2
0.4
time
Sample Path of Brownian Motion
z
(
t
)
Figure 1.1: A typical sample path of Brownian motion.
Just as there are vector Gaussian random variables, we can deﬁne a vector Brownian motion as follows.
A vector Brownian motion z(t) ∈ R
n
with covariance structure Σ ∈ R
n×n
is a stochastic process satisfying
1. z(0) = 0.
2. z(t) −z(s) ∼ N(0, Σ(t −s)) for t > s.
3. z(t
2
) −z(t
1
), z(t
3
) −z(t
2
), . . . , z(t
n
) −z(t
n−1
) are independent for t
1
≤ t
2
≤ · · · ≤ t
n
.
Thus a vector Brownian motion is build upon the vector Gaussian random variable.
1.1. BROWNIAN MOTION AND POISSON PROCESSES 3
Brownian motion has continuous sample paths. That is too well behaved for some events we would like
to model. For instance, market crashes, bankruptcy, etc. are often discontinuous price movements. Hence,
we need a process that jumps! Poisson processes, which are built on the Poisson random variable, are what
we are looking for.
1.1.3 Poisson Random Variables
A discrete random variable X taking values in the whole numbers is Poisson with parameter λ > 0 if
P(X = k) =
λ
k
k!
exp(−λ) k = 0, 1, ... (1.4)
The mean of a Poisson random variable is E[X] = λ and the variance is V ar(X) = λ.
1.1.4 Poisson Process
A Poisson process is a stochastic process built on Poisson random variables as follows. A Poisson process
with parameter (intensity) λ is a stochastic process π(t; λ) : t ∈ R
+
that satisﬁes
1. π(0) = 0.
2. π(t) −π(s) is Poisson distributed with parameter λ(t −s) for t > s.
3. π(t
2
) −π(t
1
), π(t
3
) −π(t
2
), . . . , π(t
n
) −π(t
n−1
) are independent for t
1
≤ t
2
≤ · · · ≤ t
n
.
For us, the most important property of Poisson processes is that they jump! Hence, they are good models for
market crashes, jumps, bankruptcy, and other unexpected discontinuous price movements. A typical sample
path from a Poisson process with intensity λ = 1 is given in Figure 1.2.
The parameter λ is often called the intensity (or sometimes the propensity) of the Poisson process. You
can think of it as the expected number of jumps in a single time period. Alternatively, you expect to see a
single jump every
1
λ
time periods. Therefore, the larger the intensity, the more frequent the jumps.
We will assume that a Poisson process is continuous from the right, and not the left. That is, at the
exact time that a Poisson process jumps, it takes on the new value that it jumped to. Functions that are
rightcontinuous and have leftlimits are called rcll functions (or cadlag or Rfunctions, etc). In a Poisson
process, it is important to remember that at a jump time it takes on the new value, thus making sample
paths of a Poisson process rcll functions.
1.1.5 Increments of Brownian Motion and Poisson Processes
Here are the intuitive pictures that I keep in mind when thinking of Brownian motion and Poisson Processes.
Over a small ∆t Brownian motion and Poisson Processes can be thought of in simple and intuitive ways.
We intuitively think of ∆t as a small increment in t. When dealing with a stochastic process X(t), we will
also think of ∆X(t) as the change the occurs in X over a small time period ∆t. That is
∆X(t) = X(t + ∆t) −X(t). (1.5)
This notion of an increment of a stochastic process will guide our intuition. In this way, we can look at
increments of Brownian motion and Poisson processes.
Brownian Motion
Over a small time ∆t, Brownian motion looks like
∆z(t) = z(t + ∆t) −z(t) ∼ N(0, ∆t) (1.6)
4CHAPTER 1. BASIC BUILDINGBLOCKS ANDSTOCHASTIC DIFFERENTIAL EQUATIONMODELS
0 1 2 3 4 5
0
0.5
1
1.5
2
2.5
3
3.5
4
4.5
5
time
π
(
t
)
Sample Path of a Poisson Processes with Intensity 1
Figure 1.2: A typical sample path of a Poisson process.
or, written slightly diﬀerently
∆z(t) = ǫ
√
∆t where ǫ ∼ N(0, 1) (1.7)
where this follows from the second deﬁning property of Brownian motion. That is, a Brownian motion
diﬀerential looks like a standard Gaussian multiplied by
√
∆t. Thus, we will often use E[∆z] = 0 and
E[(∆z)
2
] = ∆t.
An even simpler picture arises from a binary approximation
∆z ≈
√
∆t w.p. 1/2
−
√
∆t w.p. 1/2
(1.8)
where w.p. stands for ”with probability”. This is depicted in ﬁgure 1.3.
Figure 1.3: Binary model of an increment in Brownian motion .
1.2. STOCHASTIC DIFFERENTIAL EQUATIONS 5
Poisson Process
A Poisson Process can also be approximated over a small time period ∆t. Over ∆t it is
∆π(t; λ) = π(t + ∆t; λ) −π(t; λ) = X where X ∼ Poisson(λ∆t). (1.9)
A binary approximation to a Poisson process is
∆π(λ) ≈
1 w.p. λ∆t
0 w.p. 1 −λ∆t
(1.10)
A simple picture of this heuristic increment model is given in Figure 1.4. Note that for a Poisson process,
Figure 1.4: Binary model of an increment of a Poisson process.
∆π is either 0 or 1 to order ∆t.
Thus, note the key diﬀerence between a Brownian motion and a Poisson process. From the simple binary
model approximations, we see that in a Brownian motion the size of the move scales with the square root of
∆t. Hence over short periods of time the move in a Brownian motion is also small. This is why Brownian
motion has continuous paths. On the other hand, in a Poisson process, from the binary model we see that
the move size can always be 1, regardless of how small ∆t is. On the other hand, the probability of having
a jump of size 1 scales with ∆t and is small if ∆t is small. Thus, Poisson processes jump when they move.
Over small periods of time the probability of a jump is also small. This is the essential diﬀerence between
Brownian motion and the Poisson process.
1.2 Stochastic Diﬀerential Equations
A simple way to think of a stochastic diﬀerential equation is as a diﬀerential equation that is driven by a
stochastic process. We will use this point of view here. Also, to avoid the technicalities of stochastic calculus,
we will present a simple intuitive approach to stochastic diﬀerential equations and stochastic diﬀerentials.
1.2.1 Diﬀerentials
Roughly speaking, the notion of a diﬀerential or inﬁnitesimal of a process is the idea that in an increment
∆X(t) = X(t + ∆t) −X(t) (1.11)
we can take ∆t to be inﬁnitesimally small. In such a case we would write
dX(t) = X(t +dt) −X(t) (1.12)
where dt is ”just a little bit of t”, to quote Gillespie [8]. However, (1.12) has a problem when it comes to
processes that jump that are assumed to be right continuous. The Poisson process is a good example.
6CHAPTER 1. BASIC BUILDINGBLOCKS ANDSTOCHASTIC DIFFERENTIAL EQUATIONMODELS
The Problem with Jumps
We have to be very careful when a process has jumps and we assume right continuity of paths. Here is the
problem. Assume that a Poisson process is currently at 0. That is π(t) = 0. Now, assume that a jump
occurs at time s. Our convention will be to assume that at the exact time of the jump, the Poisson process
jumps to 1. That is π(s) = 1. This means that for us, Poisson processes, and all other processes with jumps,
will be continuous from the right. Hence,
lim
h↓s
π(h) = π(s) (1.13)
A picture of this situation is shown in Figure 1.5.
Figure 1.5: A jump at time s.
Since this is our convention, now let’s consider deﬁning the diﬀerential of a Poisson process as
dπ(t) = π(t +dt) −π(t) (1.14)
where dt > 0.
Now, we know that a jump occurred at time s, so intuitively we should have dπ(s) = 1. However, let us
take the limit of dπ(t) for any t (including s) as dt ↓ 0. We obtain
lim
dt↓0
dπ(t) = lim
dt↓0
π(t +dt) −π(t) = π(t) −π(t) = 0 (1.15)
where this calculation followed by right continuity as deﬁned by equation (1.13). But this indicates that π
never jumps! Something must be wrong!!
What is wrong is that we have assumed that π is right continuous, and then when we add dt to the
current time, we are implicitly taking the limit from the right. Hence, we are guaranteed never to capture
the jump!
This is purely a problem that arises from our convention to assume that Poisson processes are right
continuous. If we had assumed left continuity, then we wouldn’t have any problem. However, it is common
in the literature to assume processes are right continuous, so we have adopted this convention. Therefore,
we need to adjust our notion of a diﬀerential of a stochastic process slightly to account for our convention.
1.2.2 The Diﬀerential
The solution to the above problem is that for a diﬀerential, we should think of the following
dX(t) = X(t +dt) −X(t−) (1.16)
where X(t−) = lim
h↑t
X(h) is the limit from the left of X at time t. By using the limit from the left, we
make sure to capture jumps of the process, no matter how small dt is made.
1.2. STOCHASTIC DIFFERENTIAL EQUATIONS 7
We will develop this point of view (which unfortunately can’t be made rigorous, but provides the proper
intuition). Hence, reviewing from above, with Brownian motion we would have:
dz(t) = z(t +dt) −z(t) ∼ N(0, dt). (1.17)
Note that since Brownian motion has continuous sample paths, z(t−) = z(t). However, for a Poisson process,
we should think of diﬀerentials as
dπ(t; λ) = π(t +dt; λ) −π(t−; λ) ∼ Poisson(λdt) (1.18)
in order to make sure that we capture jumps.
Don’t forget that we also have the binary model approximations of Figures 1.3 and 1.4. Those binary
models provide the proper intuition, and in both cases, sums of them will limit as Brownian motion or a
Poisson process. For the diﬀerential, we simply replace ∆t by dt in (1.8) and (1.10), giving
dz ≈
√
dt w.p. 1/2
−
√
dt w.p. 1/2
(1.19)
and
dπ(λ) ≈
1 w.p. λdt
0 w.p. 1 −λdt
. (1.20)
1.2.3 Compound Poisson Process
When Poisson processes jump, they jump up by 1. We can generalize this and allow them to jump randomly.
Let π(t; λ) be a Poisson process with jump times t
1
, t
2
, .... Construct a new process π
Y
(t; λ), by assigning
jump Y
1
at time t
1
, Y
2
at time t
2
, etc. where Y
1
, Y
2
, ... are iid random variables. This process can be
written as
π
Y
(t; λ) =
π(t;λ)
¸
i=0
Y
i
. (1.21)
That is, at time t it is the sum of π(t; λ) iid copies of Y , where π(t; λ) is a standard Poisson process. Processes
of this form can also conveniently be written as integrals,
π
Y
(t, λ) =
π(t;λ)
¸
i=0
Y
i
=
t
0
Y
s
dπ(s; λ). (1.22)
For this reason, we represent the diﬀerential form of a compound Poisson process by Y dπ(t, λ). That is, we
may write
dπ
Y
= Y dπ. (1.23)
Following along the lines of the binary approximation to a Poisson process as in Figure 1.4, an inﬁnitesimal
model of a compound Poisson process can be thought of as
Y dπ(λ) ≈
Y
i
w.p. λdt
0 w.p. 1 −λdt
(1.24)
and a heuristic inﬁnitesimal picture of this is given in Figure 1.6.
1.2.4 Ito Stochastic diﬀerential equations
Stochastic integrals can be deﬁned in diﬀerent ways. The most useful for us is the Ito stochastic integral. At
this point, I will not delve into the depths of the stochastic integral (because often people are never able to
return!), but merely provide the intuition that you should take away when considering stochastic diﬀerential
equations.
8CHAPTER 1. BASIC BUILDINGBLOCKS ANDSTOCHASTIC DIFFERENTIAL EQUATIONMODELS
Figure 1.6: Inﬁnitesimal model of a compound Poisson process.
A stochastic diﬀerential equation will be written as:
dx(t) = a(x(t), t)dt +b(x(t), t)dz(t) (1.25)
where in this case, it is being driven by Brownian motion z(t). (At this stage, I will ignore the technical
conditions that must be placed on a and b in order to make such an equation well deﬁned.) We will interpret
this equation as follows:
x(t +dt) −x(t) = a(x(t), t)dt +b(x(t), t)(z(t +dt) −z(t)). (1.26)
Since z(t) has independent increments, and a(x(t), t) and b(x(t), t) are evaluated at time t, they are inde
pendent of dz(t) = z(t + dt) − z(t). This is important! It allows us to do the following simple calculations
of the instantaneous drift and variance.
Instantaneous Drift and Variance
We can interpret a(x(t), t) as related to the instantaneous drift and b(x(t), t) as related to the instantaneous
volatility as follows:
E[dxx(t)] = E[a(x(t), t)dt +b(x(t), t)dz(t)x(t)] (1.27)
= a(x(t), t)dt +b(x(t), t)E[dz(t)x(t)] (1.28)
= a(x(t), t)dt. (1.29)
Therefore, a(x(t), t) determines the instantaneous drift. On the other hand, we can compute the instanta
neous variance of x as follows
E[(dx −a(x(t), t)dt)
2
x(t)] = E[b(x(t), t)
2
dz(t)
2
x(t)] (1.30)
= b
2
(x(t), t)E[dz(t)
2
x(t)] (1.31)
= b
2
(x(t), t)dt (1.32)
Hence, b
2
(x(t), t) determines the instantaneous variance of x.
1.2.5 Poisson Driven Diﬀerential Equations
We can also drive a diﬀerential equation by a Poisson process
dx(t) = a(x(t−), t)dt +b(x(t−), t)Y dπ(t; λ) (1.33)
where note that we have written x(t−) in the arguments of a and b. By x(t−) we mean x(t−) = lim
h↓0
x(t−h).
That is, x(t−) is the limit from the left at time t. We will assume that a and b are left continuous in the t
1.3. SUMMARY 9
argument so that we may use t instead of t− in the second argument of a and b. We will also sometimes use
the notation x
−
when we want to suppress the argument t, or even a
−
when suppressing the arguments of
a. The reason for using limits from the left is that in a Poisson process, we interpret our diﬀerential as
dπ(t) = π(t +dt) −π(t−)
and for the Ito integral, we assume that the coeﬃcients a and b are evaluated at the point in time that the
diﬀerential starts from. This is t−.
This limit from the left is also important in a and b because we want a and b to be independent of dπ.
The only way we can do this is to make sure that we use left limits. Note that this means that if π(t) jumps
at time t, which also causes a jump in x at time t, we evaluate x(t−) in a and b which immediately preceeds
the jump.
With that established, once again, we can compute the instantaneous mean and variance:
E[dx(t)x(t−)] = E[a(x(t−), t)dt +b(x(t−), t)Y dπ(t)x(t−)] (1.34)
= a(x(t−), t)dt +b(x(t−), t)E[Y dπ(t)x(t−)] (1.35)
= a(x(t−), t)dt +b(x(t−), t)E[Y ]λdt (1.36)
Hence, in this case, the dπ(t) term can contribute to the instantaneous mean. This can make things messy!
It is often nicer to think of the ﬁrst term as the ”mean” term, and the second as the ”risk” term. To do
this, we would like the second term to have zero instantaneous mean. Hence, we will often ”compensate” the
Poisson process to give it zero mean. This is done by simply subtracting oﬀ the instantaneous mean from
the second term and adding it to the ﬁrst.
dx(t) = (a(x(t−), t) +b(x(t−), t)E[Y ]λ)dt +b(x(t−), t)(Y dπ(t) −E[Y ]λdt) (1.37)
Then we can also compute the instantaneous variance:
E[(dx(t) −(a(x(t−), t) +b(x(t−), t)E[Y ]λ)dt)
2
x(t−)]
= E[b
2
(x(t−), t)(Y dπ(t) −E[Y ]λdt)
2
x(t−)]
= b
2
(x(t−), t)V ar(Y dπ(t))
= b
2
(x(t−), t)(E[Y
2
dπ(t)
2
) −E[Y ]
2
E[dπ(t)]
2
)
= b
2
(x(t−), t)(E[Y
2
]E[dπ(t)
2
] −E[Y ]
2
λ
2
dt
2
)
= b
2
(x(t−), t)(E[Y
2
](λdt +λ
2
dt
2
) −E[Y ]
2
λ
2
dt
2
)
= b
2
(x(t−), t)E[Y
2
]λdt +O(dt
2
)
Hence, to order dt, the instantaneous variance is given by b
2
(x(t−), t)E[Y
2
]λ.
1.3 Summary
Brownian motion (built upon the Gaussian random variable) and the Poisson Process (built upon the Poisson
random variable) are the basic building blocks used to create models of prices. In particular, we use these two
processes to drive diﬀerential equations and that will allow us to capture a wide range of price phenomena.
Due to the continuity of Brownian motion, it is good for modeling price paths and variables that do not
jump. On the other hand, Poisson processes are an essential building block for modeling jumps in price
processes or variables.
Much intuition can be gained from simple ”incremental” and ”diﬀerential” models of processes and
stochastic diﬀerential equations. The simple binary approximations to Brownian motion and Poisson pro
cesses are enough to correctly guide your intuition in the vast majority of cases. Thus, for modeling purposes,
make sure you have a solid understanding of these two building block processes.
10CHAPTER 1. BASIC BUILDINGBLOCKS ANDSTOCHASTIC DIFFERENTIAL EQUATIONMODELS
1.4 Problems
Problem 1.4.1 Verify that our inﬁnitesimal model of a Poisson process over small time dt:
dπ =
1 wp. λdt
0 wp. 1 −λdt
(1.38)
has a mean and variance that agree with a Poisson random variable with parameter λdt to order dt.
Problem 1.4.2 Poisson Processes
Consider the time interval [0, 1]. Chop this time interval into n parts of equal length. Over each interval
deﬁne the independent and identically distributed random variables X
i
where
X
i
=
1 w.p. λ/n
0 w.p. 1 −λ/n
(1.39)
Let
Y =
n
¸
i=1
X
i
(1.40)
(a) What is Pr(Y = 0)?
(b) In your answer in (a), take the limit as n → ∞. What do you get?
(b) What is Pr(Y = 1)?
(c) Again take the limit. What is your answer?
(d) Now consider an arbitrary but ﬁxed k with k < n. What is Pr(Y = k).
(e) Again take the limit as n → ∞, and show that this converges to the Poisson random variable. (You
will probably want to use Stirling’s formula n! ∼
√
2πe
−n
n
n+
1
2
. This calculation is a bit tricky!)
(Note: In this problem we converge to a Poisson random variable with parameter λ since we took the time
interval to be 1. If the time interval is t, we will converge to a Poisson random variable with parameter λt.
As a function of t, we arrive at a Poisson process.)
Problem 1.4.3 Poisson Processes again.
Consider the following Markov chain. Let the state space be the whole numbers x = 0, 1, 2, .... Consider
the following transition probabilities over the time instant dt:
Pr(x(t +dt) = nx(t) = n) = 1 −λdt (1.41)
Pr(x(t +dt) = n + 1x(t) = n) = λdt (1.42)
Let p
n
(t) = Pr(x(t) = n).
(a) Write down a diﬀerential equation for p
0
(t). (hint: to derive a diﬀerential equation, consider the
amount of probability that ﬂows into and out of the state x = 0 over time dt.)
(b) Assume p
0
(0) = 1 (that is, at time zero, x = 0 with probability 1). Solve the diﬀerential equation for
p
0
(t).
(c) Derive a diﬀerential equation for p
n
(t), n > 0. Given your answer in (a), solve for p
1
(t). Explain how
you could solve for p
n
(t) for any n.
(Note: again we have arrived at a Poisson process, but this time through Markov chain theory. A Poisson
process is an example of a continuous time Markov process, and the set of diﬀerential equations you derived
is the ”forward equation” for this process.)
Chapter 2
Ito’s Lemma
2.1 Ito’s Lemma
Ito’s lemma is the chain rule for stochastic calculus. In this chapter we present Ito’s lemma for Brownian
motion and Poisson processes. It has been said that all of math ﬁnance can be done with just the knowledge
of Ito’s lemma. To you, this means that you should make sure that you know (and understand) Ito’s lemma.
In what follows, we will present versions of Ito’s lemma for Brownian motion and Poisson processes.
2.1.1 The chain rule of ordinary calculus
In ordinary calculus, here is how the chain rule works in conjunction with a diﬀerential equation. Let x(t)
follow the diﬀerential equation
dx
dt
= a(x, t). (2.1)
Now consider a function of x(t) and t. Let’s call this function f(x(t), t). Assuming that f is diﬀerentiable,
we can ask what the derivative of f is. To calculate it, we simply apply the chain rule
df(x, t)
dt
=
∂f
∂x
dx
dt
+
∂f
∂t
= f
x
dx
dt
+f
t
(2.2)
where we are using the notation f
x
=
∂f
∂x
and f
t
=
∂f
∂t
. Finally, we can substitute in for
dx
dt
from (2.1), giving
df(x, t)
dt
=
∂f
∂x
a(x, t) +
∂f
∂t
= a(x, t)f
x
+f
t
. (2.3)
This is a fairly straightforward calculation. However, when dealing with stochastic diﬀerential equations,
the simple chain rule of ordinary calculus does not work. The reason is simple. Brownian motion is not
diﬀerentiable so we can’t really take its derivative or the derivative of any function of Brownian motion.
Also, Poisson processes jump, and at these jumps it is not even continuous, let alone diﬀerentiable. Thus,
for stochastic diﬀerential equations we need to develop the correct mathematics for dealing with a function
of a variable that follows a stochastic diﬀerential equation. The guiding result is known as Ito’s lemma.
Multivariables Taylor Series Expansions
Before diving into Ito’s lemma, you should make sure that you recall your multivariables Taylor series
expansions to second order. This is extremely important! Ito’s lemma for Brownian motion is basically just
a modiﬁed Taylor expansion to second order. So, let’s recall up to second order, the Taylor series expansion
11
12 CHAPTER 2. ITO’S LEMMA
of a function f(x, t) around a point (x
0
, t
0
),
f(x, t) = f(x
0
, t
0
) +f
t
(x
0
, t
0
)(t −t
0
) +f
x
(x
0
, t
0
)(x −x
0
) (2.4)
+
1
2
f
tt
(x
0
, t
0
)(t −t
0
)
2
+f
xt
(x
0
, t
0
)(x −x
0
)(t −t
0
) +
1
2
f
xx
(x
0
, t
0
)(x −x
0
)
2
+. . . (2.5)
Now, we will typically denote dx = x−x
0
, dt = t −t
0
, and df = f(x, t) −f(x
0
, t
0
), so that a Taylor series
expansion is
df = f
t
dt +f
x
dx +
1
2
f
tt
dt
2
+f
xt
dxdt +
1
2
f
xx
dx
2
+. . . (2.6)
and the arguments of f
t
, f
x
, ... have been supressed.
In the multivariable case when x ∈ R
n
, the Taylor series expansion is
df = f
t
dt +f
x
dx +
1
2
f
tt
dt
2
+dx
T
f
xt
dt +
1
2
dx
T
f
xx
dx +. . . (2.7)
where
f
x
= [f
x1
, . . . , f
xn
], f
xx
=
f
x1x1
. . . f
x1xn
.
.
.
.
.
.
.
.
.
f
xnx1
. . . f
xnxn
¸
¸
¸ (2.8)
A useful trick in the multivariable case is to note that
dx
T
f
xx
dx = Tr(dx
T
f
xx
dx) = Tr(f
xx
dxdx
T
) (2.9)
where Tr(·) is the Trace of a matrix (i.e. the sum of the diagonal elements).
Operationally, Ito’s lemma for Brownian motion boils down to nothing more than substituting dx =
adt +bdz in the Taylor expansion of (2.6) (or (2.7) in the multivariable case) for dx, interpreting terms such
as dz
2
, and then throwing away terms of order higher than dt.
If that sounds simple, then you are right. Let’s see how it works in more detail.
2.1.2 Ito’s lemma for Brownian motion
Given the diﬀerential of x(t), Ito’s lemma allows us to compute the diﬀerential of a function of x(t) and t.
Hence, it is the ”chain rule” for stochastic diﬀerential equations. The following result is Ito’s lemma when
x(t) is a process governed by a stochastic diﬀerential equation driven by Brownian motion.
(⋆) Ito’s Lemma for Brownian Motion:
Consider the stochastic diﬀerential equation (SDE)
dx = a(x, t)dt +b(x, t)dz (2.10)
and let f(x, t) be a twice continuously diﬀerentiable function of x and t. Then
df(x, t) = (f
t
+a(x, t)f
x
+
1
2
b
2
(x, t)f
xx
)dt +b(x, t)f
x
dz. (2.11)
”Heuristic Proof”: I will suppress the arguments of a and b for convenience. Consider writing the Taylor
expansion of df.
df = f(x(t +dt), t +dt) −f(x(t), t)
= f
t
dt +f
x
dx +
1
2
f
tt
(dt)
2
+
1
2
f
xx
(dx)
2
+f
xt
dxdt +...
2.1. ITO’S LEMMA 13
Next, we will substitute in for dx using dx = adt +bdz which gives
df = f
t
dt +f
x
(adt +bdz) +
1
2
f
tt
(dt)
2
+
1
2
f
xx
(adt +bdz)
2
+f
xt
(adt +bdz)dt +...
= f
t
dt +f
x
adt +f
x
bdz +
1
2
f
tt
(dt)
2
+
1
2
f
xx
(a
2
dt
2
+ 2abdtdz +b
2
dz
2
) +f
xt
(adt
2
+bdzdt) +...
Now we take a crucial step, and only keep terms up to order dt using the following logic. The standard
deviation of dz is of order
√
dt. Hence, we think of dz as being of order dt
1/2
and only keep terms up to
order dt yielding
df = f
t
dt +f
x
adt +f
x
bdz +
1
2
f
xx
b
2
dz
2
...
Finally we replace dz
2
by it’s expectation dt which leads to Ito’s lemma
df = (f
t
+af
x
+
1
2
b
2
f
xx
)dt +bf
x
dz. (2.12)
2.
In this ”derivation” there were a couple of dubious steps. The most glaring was replacing dz
2
by its expec
tation dt. Let’s see why this was a reasonable thing to do ...
2.1.3 Replacing dz
2
by dt
Here is a simple argument as to why it is reasonable to replace dz
2
by dt.
If we can say that dz
2
= dt, then by integrating we would have
T
0
dz
2
=
T
0
dt = T. (2.13)
Hence, let us see if this makes sense. Let’s approximate the integral above by the sum
S(T, ∆t) =
T
∆t
−1
¸
i=0
(z((i + 1)∆t) −z(i∆t))
2
≈
T
0
dz
2
. (2.14)
Now, the claim is that as ∆t → 0, then S(T, ∆t) → T. But note that S(T, ∆t) is a random variable since it
involves z(t). Therefore, we are trying to show that the random variable S(T, ∆t) converges to the constant
T. To do this, we will show that the mean of S(T, ∆t) is equal to T, and its variance approaches zero. This
does the trick, since a random variable with zero variance must be a constant equal to it’s mean. (When we
show that the variance approaches zero, we are proving convergence in mean square, or L
2
(P).)
Computing the mean:
Okay, let’s ﬁrst compute the mean of S(T, ∆t):
E[S(T, ∆t)] = E
T
∆t
−1
¸
i=0
(z((i + 1)∆t) −z(i∆t))
2
¸
¸
=
T
∆t
−1
¸
i=0
E
(z((i + 1)∆t) −z(i∆t))
2
=
T
∆t
−1
¸
i=0
∆t = T.
(2.15)
Hence, the mean is T.
Computing the variance:
Now let’s compute the variance of S(T, ∆t). By independent increments
V ar(S(T), ∆t) = V ar
¸
T
∆t
−1
¸
i=0
(z((i + 1)∆t) −z(i∆t))
2
¸
=
T
∆t
−1
¸
i=0
V ar((z((i + 1)∆t) −z(i∆t))
2
). (2.16)
14 CHAPTER 2. ITO’S LEMMA
But since z((i + 1)∆t) −z(i∆t) is Gaussian with mean zero and variance ∆t, we have
V ar((z((i + 1)∆t) −z(i∆t))
2
) = E[(z((i + 1)∆t) −z(i∆t))
4
] −E[(z((i + 1)∆t) −z(i∆t))
2
]
2
= 3(∆t)
2
−(∆t)
2
= 2(∆t)
2
.
Therefore as ∆t → 0, we have
V ar(S(T, ∆t)) =
T
∆t
−1
¸
i=0
V ar((z((i + 1)∆t) −z(i∆t))
2
) =
T
∆t
−1
¸
i=0
2(∆t)
2
= 2T∆t → 0. (2.17)
Hence, the limit of the variance of S(T, ∆t) is zero. That means that the limit (in L
2
(P)) is a constant and
equal to the mean T. Note that the above argument has much of the ﬂavor of the weak law of large numbers.
This is the essential argument that allows us to use dz
2
= dt and a simpliﬁed version of the argument behind
a real derivation of Ito’s lemma.
Let’s see an example of Ito’s lemma applied to socalled geometric Brownian motion.
Example 2.1.1 Let dx = axdt + bxdz and consider f(x) = ln(x). Then, according to Ito’s lemma, f(x)
satisﬁes
df = (f
t
+axf
x
+
1
2
b
2
x
2
f
xx
)dt +bxf
x
dz (2.18)
= (a −
1
2
b
2
)dt +bdz (2.19)
where we have used that f
t
= 0, f
x
=
1
x
, and f
xx
= −
1
x
2
.
2.1.4 Discussion of Ito’s lemma
At the risk of overdoing an attempt to provide intuition behind Ito’s lemma, I will leave you with the
following thoughts.
In Ito’s lemma, we kept terms up to order dt in the Taylor expansion. We are used to doing things like
this from ordinary calculus, but now that we are dealing with stochastic processes, we might question this
step. In particular, the standard deviation of dz is dt
1/2
. This means that moves in dz are usually much
larger than dt. Why doesn’t this dz term completely dominate and even allow us to ignore terms of order
dt?
Again, I appeal to the wisdom of Gillespie [8]. He gave the following explanation, which is based on the
story of the tortoise and the hare. As the story goes, the tortoise and the hare race each other. The tortoise
being slow, starts the race and steadily works his way toward the ﬁnish line. The hare, on the other hand,
is quick and jumpy. He is much faster than the hare, but runs forward and backwards and easily gets oﬀ
track. In the end of the story, the tortoise wins the race.
Now your asking, ”What does this have to do with Ito’s lemma and stochastic diﬀerential equations?”
Well, the deterministic dt drift term is like the tortoise. It marches forward at a constant dt rate. On the
other hand, the dz term is like the hare. It is quick, and jumps around like order dt
1/2
. But its direction
is random. Some of the time it jumps forward and other times, backwards. Together, both the dt and dz
terms contribute to the stochastic diﬀerential equation and neither term is guaranteed to dominate, just like
we don’t know whether the tortoise or the hare will win the race!
2.2 Ito’s lemma for Poisson Processes
Ito’s lemma for Brownian motion is more subtle that Ito’s lemma for Poisson Processes. (The key diﬀerence
is that Poisson processes have sample paths of ﬁnite variation, and this allows us to deﬁne the stochastic
integral pathwise. Hence, Ito’s lemma in this case is merely an application of the LebesgueStieljies calculus.)
2.2. ITO’S LEMMA FOR POISSON PROCESSES 15
(⋆) Ito’s Lemma for Poisson Processes:
Given a Poisson stochastic diﬀerential equation (SDE)
dx = a(x
−
, t)dt +b(x
−
, t)dπ (2.20)
let f(x, t) be a continuously diﬀerentiable function of x and t. Then
df(x, t) = (f
t
+a(x
−
, t)f
x
)dt + (f(x
−
+b(x
−
, t), t) −f(x
−
, t))dπ. (2.21)
The real derivation of this is using LebesgueStieljies calculus, but once again I will provide a nice heuristic
argument. This time when we consider the diﬀerential df we have to be careful because x
t
jumps! Taylor
expansions work well as an approximation when dx is small, however with jumps dx can be large! Note how
this plays into our derivation.
”Heuristic Proof”: We start by writing out the diﬀerential and substituting in for dx.
df = f(x(t +dt), t +dt) −f(x
−
, t) (2.22)
= f(x(t−) +a
−
dt +b
−
dπ, t +dt) −f(x
−
, t) (2.23)
where a
−
= a(x(t−), t) and b
−
= b(x(t−), t). Now, we note that possible jumps come from dπ. I don’t like
this term, so I will add and subtract a term that doesn’t contain the jump
df = f(x
−
+a
−
dt +b
−
dπ, t +dt) −f(x
−
+a
−
dt, t +dt) +f(x
−
+a
−
dt, t +dt) −f(x
−
, t) (2.24)
Now the ﬁnal two terms don’t have a jump in them, so I will approximate them using ordinary calculus.
df = f(x
−
+a
−
dt +b
−
dπ, t +dt) −f(x
−
+a
−
dt, t +dt) + (f
t
+a
−
f
x
)dt +O(dt
2
) (2.25)
Now let’s analyze the ﬁrst two terms
f(x
−
+a
−
dt +b
−
dπ, t +dt) −f(x
−
+a
−
dt, t +dt) (2.26)
When their is no jump in dπ, this term is zero. No jumps occur with probability 1 −λdt. When there is a
jump we have dπ = 1 and
f(x
−
+a
−
dt +b
−
, t +dt) −f(x
−
+a
−
dt, t +dt). (2.27)
This occurs with probability λdt. Since the a
−
dt term in the argument is of order dt, then overall, we can
think of the eﬀect of the a
−
dt term as overall being of order dt
2
. Therefore, as dt → 0, to order dt overall
this term is replaced by
f(x
−
+a
−
dt +b
−
, t +dt) −f(x
−
+a
−
dt, t +dt) → f(x
−
+b
−
, t) −f(x
−
, t). (2.28)
Hence, combining the above arguments gives
f(x
−
+a
−
dt +b
−
dπ, t +dt) −f(x
−
+a
−
dt, t +dt) → (f(x
−
+b
−
, t) −f(x
−
, t))dπ (2.29)
which completes the derivation of Ito’s lemma. 2
2.2.1 Interpretation of Ito’s lemma for Poisson
Ito’s lemma for Poisson processes simply says that when the Poisson process doesn’t jump, use ordinary
calculus. When it jumps, the move in f is determined completely by the jump. That is it! Let’s see an
example of this.
16 CHAPTER 2. ITO’S LEMMA
Example 2.2.1 Let dx = axdt + (b − 1)xdπ and consider f(x) = ln(x). By Ito’s lemma for Poisson
processes, f(x) satisﬁes
df = (f
t
+f
x
ax)dt + (f(x
−
+ (b −1)x
−
) −f(x
−
))dπ (2.30)
= adt + (ln(bx
−
) −ln(x
−
))dπ (2.31)
= adt + ln(b)dπ (2.32)
where we have used that f
t
= 0 and f
x
=
1
x
.
2.3 More versions of Ito’s Lemma
In this section I will present other useful versions of Ito’s lemma. Heuristic derivations follow along the lines
of those for Brownian motion and the Poisson process. It is worthwhile working your way through a couple
of them, and through some of the problems at the end.
2.3.1 Ito’s Lemma for Compound Poisson Processes
When we are using a compound Poisson process, Ito’s lemma is modiﬁed slightly as follows.
(⋆) Ito’s Lemma for Compound Poisson Processes:
Given an SDE
dx = a(x
−
, t)dt +b(x
−
, t)Y dπ (2.33)
Let f(x, t) be a continuously diﬀerentiable function of x and t. Then
df(x, t) = (f
t
+a
−
f
x
)dt + (f(x
−
+Y b
−
, t) −f(x
−
, t))dπ (2.34)
This version of Ito’s lemma can be derived in a manner similar to Ito’s lemma for Poisson processes.
2.3.2 Ito’s Lemma for Brownian and Compound Poisson Processes
If we combine Brownian motion and compound Poisson processes, then we have the following result.
(⋆) Ito’s for Brownian and Poisson
Given a Poisson stochastic diﬀerential equation (SDE)
dx = a(x
−
, t)dt +b(x
−
, t)dz +Y dπ (2.35)
let f(x, t) be a twice continuously diﬀerentiable function of x and t. Then
df(x, t) = (f
t
+a(x
−
, t)f
x
+
1
2
b(x
−
, t)
2
f
xx
)dt +b(x
−
, t)f
x
dz + (f(x
−
+Y, t) −f(x
−
, t))dπ (2.36)
Again, this can be shown with arguments similar to those given above.
2.3.3 Ito’s Lemma for vector processes
If we have multiple Brownian motions, then we have the following results
(⋆) Ito’s for two correlated Brownians
Given the stochastic diﬀerential equations
dx
1
= a
1
dt +b
1
dz
1
(2.37)
dx
2
= a
2
dt +b
2
dz
2
(2.38)
2.4. ITO’S LEMMA, THE PRODUCT RULE, AND A RECTANGLE 17
with a
1
= a
1
(x
1
, x
2
, t), a
2
= a
2
(x
1
, x
2
, t), b
1
= b
1
(x
1
, x
2
, t), b
2
= b
2
(x
1
, x
2
, t) where z
1
and z
2
are two corre
lated Brownian motions with instantaneous correlation coeﬃcient ρ (i.e. E[dz
1
dz
2
] = ρdt). Let f(x
1
, x
2
, t)
be twice continuously diﬀerentiable function of x
1
, x
2
and t. Then
df(x
1
, x
2
, t) =
f
t
+a
1
f
x1
+a
2
f
x2
+
1
2
b
2
1
f
x1x1
+
1
2
b
2
2
f
x2x2
+ρb
1
b
2
f
x1x2
dt +b
1
f
x1
dz
1
+b
2
f
x2
dz
2
(2.39)
Using vector notation we can generalize the above result
(⋆) Ito’s for vectors
Given a vector stochastic diﬀerential equation
dx = adt +Bdz (2.40)
(2.41)
where x ∈ R
n
, a = a(x, t) ∈ R
n
, B = B(x, t) ∈ R
n×m
and z ∈ R
m
a vector Brownian motion with
instantaneous covariance matrix E[dzdz
T
] = Σdt. Let f(x, t) be twice continuously diﬀerentiable function of
x and t. Then
df =
f
t
+f
x
a +
1
2
Tr(f
xx
BΣB
T
)
dt +f
x
Bdz (2.42)
where
f
x
= [f
x1
, f
x2
, ..., f
xn
] and f
xx
=
f
x1x1
· · · f
x1xn
.
.
.
.
.
.
.
.
.
f
xnx1
· · · f
xnxn
¸
¸
¸.
2.4 Ito’s lemma, the product rule, and a rectangle
In Ito’s lemma for Brownian motion, the mysterious term is
1
2
b
2
f
xx
dt, which enters because we are forced
to keep the dz
2
term which is of order dt. Without this term, Ito’s lemma would follow from the intuition
of ordinary calculus.
Let me give a little argument (which appears in Rogers and Williams [12]) to try to convince you that
Ito’s lemma is actually more intuitive than ordinary calculus. The example I will use is the product rule. In
ordinary calculus, we have the familiar formula
d(uv) = udv +vdu. (2.43)
In fact, this is the beginnings of the integration by parts formula. Let’s try to ”derive” this formula by a
simple ”rectangle” argument. Here it is.
Consider the quantity d(uv). This is a change in the product uv, that is
d(uv) = (u +du)(v +dv) −uv (2.44)
We can think of this as the area of a rectangle with sides of length u + du and v + dv minus the area of a
rectangle with sides of u and v. The rectangle in Figure 2.1 shows a picture of this. Now, it is obvious the
area of d(uv) is equal to the sum of the three colored rectangles in the ﬁgure. That is
d(uv) = udv +vdu +dudv. (2.45)
Hence, it is natural to expect to see a term related to dudv! In fact, it is more mysterious that in ordinary
calculus we are able to ignore this term. Of course, that is because in ordinary calculus, roughly speaking,
the terms du and dv are of order dt and hence dudv is a higher order term.
18 CHAPTER 2. ITO’S LEMMA
Figure 2.1: Ito’s Rectangle
Let us see how far we can get from this simple rectangle. Let us take u = v = z(t). Then our rectangle
formula (product rule) says that
d(z
2
) = d(z · z) = zdz +zdz +dz
2
. (2.46)
Note that the formula above is exact! Now, recalling that dz
2
should be replaced by dt as in the argument
of Section 2.1.3, we have
d(z
2
) = 2zdz +dt (2.47)
which is Ito’s lemma for f(z) = z
2
. This gives us a formula for d(z
2
). Now we can proceed and choose u = z
and v = z
2
. From our rectangle and the formula for d(z
2
) we have
d(z
3
) = d(z · z
2
) = zd(z
2
) +z
2
dz +dzd(z
2
) (2.48)
= z(2zdz +dt) +z
2
dz +dz(2zdz +dt) (2.49)
= 3z
2
dz + 3zdt +o(t) (2.50)
where in the last step we have placed higher order terms in o(t) and replaced dz
2
by dt. This is Ito’s lemma
for f(z) = z
3
.
Continuing on in this manner, we can easily derive Ito’s lemma for any polynomial of any order in z(t)!
At that point, we are not a far cry from Ito’s lemma for C
2
functions, as they can be approximated by limits
of polynomials. Hence, you see that Ito’s lemma is actually quite natural if you just remember the rectangle.
Furthermore, now you should have an easy time remembering Ito’s product rule d(uv) = udv +vdu +dudv.
2.5 Summary
Ito’s lemma is the most important result in stochastic calculus for derivative pricing. There are diﬀerent
versions for Brownian motion and for Poisson processes. You should be familiar with both. Roughly speaking,
for Brownian motion, since the standard deviation of dz is of order
√
dt, second order terms in dz are of
order dt and cannot be ignored. This gives an extra term in Ito’s lemma compared to the ordinary chain
rule of calculus. For Poisson processes, most of the time no jumps are occurring and ordinary calculus is
ﬁne. However, when a jump occurs, it causes a corresponding jump in any function of the Poisson process.
Thus Ito’s lemma for Poisson processes is simply a combination of the ordinary chain rule plus noting that
when the Poisson process jumps, any function of it has a corresponding jump.
2.6. PROBLEMS 19
2.6 Problems
Problem 2.6.1 Consider the Stochastic Diﬀerential Equation:
dx = a(x, t)dt +b(x, t)dz +Y dπ (2.51)
where z is Brownian motion and π is a generalized Poisson process with intensity λ and random jumps of
size Y . Let f(x, t) be a twice continuously diﬀerentiable function of x and t. Find a stochastic diﬀerential
equation for df. (i.e. what is Ito’s lemma in this case).
Problem 2.6.2 Multidimensional Ito Formula for Brownian Motion:
Consider the following vector Ito process:
dx = µdt +Kdz (2.52)
where x ∈ R
n
, µ ∈ R
n
, K ∈ R
n×m
and z ∈ R
m
where z is a vector Brownian motion of m uncorrelated
onedimensional Brownian motions. Let f(x, t) : R
n
×R → R be a twice continuously diﬀerentiable function
of x and t. What is df? (Hint: Use the same Taylor series argument used in class, but this time you can
use E[dzdz
T
] = Idt where I ∈ R
m×m
is the identity matrix.)
Problem 2.6.3 Let
dx = adt +bdz
1
(2.53)
and
dy = fdt +gdz
2
(2.54)
where z
1
and z
2
are correlated Brownian motions with correlation coeﬃcient ρ. (i.e. E(dz
1
dz
2
) = ρdt)
(a) Use Ito’s lemma to ﬁnd d(xy).
(b) Use Ito’s lemma to ﬁnd d(x/y).
Problem 2.6.4 Ito’s Lemma Practice
(a) Given dx = xdt +
√
xdz, use Ito’s lemma to derive df where f(x, t) = x
2
+t.
(b) Given dx = −xdt +dπ
α
, use Ito’s lemma to derive df for f(x, t) = t
2
√
x.
(c) Given
dx = (x +y)dt +xdz
1
(2.55)
dy = ydt +xdz
2
(2.56)
where E[dz
1
dz
2
] = ρdt. Use Ito’s lemma to derive df for a generic twice continuously diﬀerentiable
f(x, y, t).
Problem 2.6.5 Compute
T
0
z
t
dz
t
(2.57)
where z
t
is Brownian motion. (Hint: consider Ito’s lemma for z
2
.)
Problem 2.6.6 (Counterintuition) Let
dx = axdt +bxdz (2.58)
Assume that a > 0. That is, the growth rate of E[x
t
] is positive. Now consider 1/x
t
. Is it possible for
E[1/x
t
] to also have a positive growth rate? Give conditions on a and b for when this is possible. Intuitively,
provide an explanation for this.
20 CHAPTER 2. ITO’S LEMMA
Problem 2.6.7 Intuition behind Ito’s lemma for Brownian motion. (This intuitive look at Ito’s lemma was
communicated to me by Muruhan Rathinam.)
Let
dx = adt +bdz (2.59)
then from Ito’s lemma f(x, t) satisﬁes
df = (f
t
+af
x
+
1
2
b
2
f
xx
)dt +bf
x
dz. (2.60)
Recall our heuristic derivation based on a Taylor expansion:
df = f
t
dt +f
x
dx +
1
2
f
xx
(dx)
2
+... (2.61)
or
df = f
t
dt +f
x
(adt +bdz) +
1
2
f
xx
(a
2
(dt)
2
+b
2
(dz)
2
+ 2abdtdz) +... (2.62)
(a) Compute the mean of df to lowest order in dt using the terms of the Taylor expansion shown above.
(b) Compute the standard deviation of the terms that contain randomness in the above expansion. That
is, compute the standard deviation of the dz, dtdz, (dz)
2
terms separately. Which term has standard
deviation of lowest order in dt? (Hint: The fourth moment of a N(0, σ
2
) is 3σ
4
.)
(c) Use this analysis to argue for the plausibility of Ito’s lemma, (2.60).
Problem 2.6.8 Show that (2.39) follows from (2.42).
Chapter 3
Standard Stochastic Diﬀerential
Equations with Solutions
In this chapter we review some stochastic diﬀerential equations that have closed form solutions. These are
also some of the stochastic diﬀerential equation models used for modeling asset prices and other relevant
ﬁnancial variables. In these solutions, note the important role that Ito’s lemma plays. Most importantly,
not many stochastic diﬀerential equations have closed form solutions. Thus, these are stochastic diﬀerential
equations that everyone should know!
3.1 Geometric Brownian Motion
Geometric Brownian motion is the following stochastic diﬀerential equation where a and b are constants.
dx = axdt +bxdz. (3.1)
A closed form solution to geometric Brownian motion can be found as follows. By Ito’s lemma with f(x) =
ln(x) we have
d ln(x) = (a −
1
2
b
2
)dt +bdz. (3.2)
Since a and b are constants, integrating gives
ln(x
t
) −ln(x
0
) = (a −
1
2
b
2
)t +bz
t
(3.3)
which implies that
x(t) = e
(a−
1
2
b
2
)t+bzt
x(0). (3.4)
Note that geometric Brownian motion is a lognormal process. That is, from (3.3) in logcoordinates x(t) is
a Gaussian process with drift a −
1
2
b
2
and volatility b. Figure 3.1 shows a typical sample path of geometric
Brownian motion.
3.1.1 Stock Price Interpretation
Geometric Brownian motion is the standard model for continuous asset price movements. It comes from the
following. Let x(t) be the price of a stock. Then over the time period dt, the return r on the stock x(t) is
given by
r =
x(t +dt) −x(t)
x(t)
=
dx
x
. (3.5)
21
22 CHAPTER 3. STANDARD STOCHASTIC DIFFERENTIAL EQUATIONS WITH SOLUTIONS
Figure 3.1: Typical Sample path of geometric Brownian motion.
Geometric Brownian motion models this instantaneous return as
dx
x
= adt +bdz (3.6)
where a and b are constants. Thus, the instantaneous return r over the next dt is Gaussian with
E[r] = adt, V ar(r) = b
2
dt. (3.7)
Thus, this is a very natural model of asset prices.
3.2 Geometric Poisson Motion
Geometric Poisson motion is the equivalent of geometric Brownian motion, but being driven by a Poisson
process,
dx = ax
−
dt + (b −1)x
−
dπ. (3.8)
The reason for the term b −1 is as follows. If the current value is x(t) and a jump occurs, then we will jump
by an amount (b − 1)x(t). Thus, we will jump to x(t) + (b − 1)x(t) = bx(t). Hence, by writing (b − 1) we
think of b as indicating the multiple of the current state that we will jump to if a jump occurs. That is,
a jump leads to the transition x(t) → bx(t). Note that if we didn’t use this convention, it would be a bit
messier.
Again, there exists a closed form solution that is obtained by changing to log coordinates. By Ito’s lemma
with f(x) = ln(x)
d ln(x) = adt + ln(b)dπ. (3.9)
Integrating leads to
ln(x(t)) −ln(x(0)) = at +ln(b)π(t) (3.10)
or
x(t) = e
at+ln(b)π(t)
x(0) = e
at
b
π(t)
x(0). (3.11)
Thus, this process is a Poisson process plus drift in logcoordinates.
3.3. A JUMPDIFFUSION MODEL 23
3.2.1 A conditional lognormal version of geometric Poisson Motion
The following stochastic diﬀerential equation uses a compound Poisson process with a log normal jump size.
This results in a conditional lognormal process that is diﬀerent from geometric Brownian motion. It is
modeled as
dx = ax
−
dt + (Y −1)x
−
dπ (3.12)
where Y = e
Z
is a lognormal random variable.
Once again, a change to log coordinates facilitates ﬁnding a closed form solution. Using Ito’s lemma with
f(x) = ln(x) gives
x(t) =
¸
e
at
π(t)
¸
i=1
Y
i
¸
x(0). (3.13)
But since Y is lognormal, we can write it as
¸
πt
i=1
Y
i
= e
π(t)
i=1
Zi
where Z
i
is normal. Hence we have
x(t) =
e
at+
π(t)
i=1
Zi
x(0) (3.14)
which, conditional on π(t), is a lognormal process.
3.3 A jumpdiﬀusion model
The following stochastic diﬀerential equation uses a generalized Poisson process with a log normal jump size
and a Brownian motion. This results in a process that involves jumps and a diﬀusion term. It is known as
a jumpdiﬀusion model.
dx = ax
−
dt +bx
−
dz + (Y −1)x
−
dπ (3.15)
where Y is a lognormal random variable. Again, using Ito’s lemma with f(x) = ln(x) gives the solution,
which is
x(t) = e
(a−
1
2
b
2
)t+bz(t)
π(t)
¸
i=1
Y
i
x(0). (3.16)
But since Y is lognormal, we can write it as
¸
π(t)
i=1
Y
i
= e
π(t)
i=1
Zi
where Z
i
is normal. Hence we have
x(t) =
e
(a−
1
2
b
2
)t+bz(t)+
π(t)
i=1
Zi
x(0) (3.17)
which, conditioned on π(t), follows the lognormal distribution.
This model is nice because it can produce distributions that have heavier tails (i.e. more extreme price
movements) than the lognormal distribution. In this way, it can be used to create implied volatility smiles
and smirks.
3.4 A more general SDE
The following SDE contains a slightly more general description of an SDE driven only by Brownian motion.
The intuition behind our route to the solution comes from the use of integrating factors in basic ODEs.
dx = (a +bx)dt + (c +fx)dz. (3.18)
To solve this, ﬁrst write the SDE as follows:
dx −bxdt −fxdz = adt +cdz. (3.19)
24 CHAPTER 3. STANDARD STOCHASTIC DIFFERENTIAL EQUATIONS WITH SOLUTIONS
In standard ordinary diﬀerential equations, we would try to make the left hand side an exact diﬀerential
by using an integrating factor. The integrating factor is usually related to the solution to the diﬀerential
equation if the right hand side of (3.19) is set to zero. That is,
dx −bxdt −fxdz = 0. (3.20)
The solution to this is
x(t) = x(0)e
(b−
1
2
f
2
)t+fz(t)
. (3.21)
Hence, we will try an integrating factor of the form
e
−(b−
1
2
f
2
)t−fz(t)
. (3.22)
Therefore, let us compute
d(e
−(b−
1
2
f
2
)t−fz(t)
x) = e
−(b−
1
2
f
2
)t−fz(t)
−(b −
1
2
f
2
)xdt −fxdz +
1
2
f
2
xdt
+(a +bx)dt + (c +fx)dz −f(c +fx) dt
= e
−(b−
1
2
f
2
)t−fz(t)
((a −fc)dt +cdz) . (3.23)
Integrating both sides leads to
e
(−(b−
1
2
f
2
)t−fz(t))
x(t) −x(0) =
t
0
e
(−(b−
1
2
f
2
)s−fz(s))
((a −fc)ds +cdz(s)) (3.24)
and rearrangement gives the ﬁnal form
x(t) = e
((b−
1
2
f
2
)t+fz(t))
x(0) +
t
0
e
((b−
1
2
f
2
)(t−s)+f(z(t)−z(s))
((a −fc)ds +cdz(s)). (3.25)
3.4.1 The OrnsteinUhlenbeck Process and Mean Reversion
A very useful model is one in which a price or ﬁnancial variable is mean reverting. That is, the process is
pulled toward some value (in this case, its long term mean level). A standard model for this is a Gaussian
model called the OrnsteinUhlenbeck process [14],
dx = −b(x −a)dt +cdz (3.26)
where a is the level that x(t) reverts to, and b is the mean reversion rate. Thus, x(t) is drawn toward a at a
rate of b. The Brownian driven term cdz just adds noise.
A sample path of a mean reverting process is depicted in Figure 3.2. Mean reverting processes of this
speciﬁc form are sometimes also called a Vasicek model. Vasicek used a process of this form to model the
short rate process in term structure modeling [15].
Since a mean reverting process is of the form of (3.18), it has a closed form solution. The solution is
given by
x(t) = e
−bt
x(0) +
t
0
e
−b(t−s)
(abds +cdz(s)). (3.27)
Thus, observe that x(t) is a Gaussian process as well. One disadvantage of this process is that values of x(t)
can become negative because the Gaussian distribution always has some probability of being negative. This
is undesirable because prices and quantities such as interest rates should not be negative.
3.5. COXINGERSOLLROSS PROCESS 25
0 1 2 3 4 5
1.6
1.8
2
2.2
2.4
2.6
2.8
3
t
x
Simulation of Mean−Reversion Dynamics
Figure 3.2: Simulation of MeanReverting Dynamics, a = 2, b = 5, c = 0.5, x(0) = 3.
3.5 CoxIngersollRoss Process
Another version of a mean reverting process is the CoxIngersollRoss (CIR) process [4]. It is given by
dx = −b(x −a)dt +c
√
xdz (3.28)
and used often in short rate models or stochastic volatility type models. Note that it is very similar to the
OrnsteinUhlenbeck process, except that the driving Brownin motion is multiplied by
√
x. This makes the
noise dependent on the size of x. Furthermore, with correct parameter values, this process will always be
positive.
This process is related to the OrnsteinUhlenbeck process as follows. Consider n Ornstein Uhlenbeck
Processes
dy
1
= −
1
2
αy
1
dt +
1
2
βdz
1
(3.29)
.
.
. (3.30)
dy
n
= −
1
2
αy
n
dt +
1
2
βdz
n
(3.31)
where the Brownian motions z
1
, . . . , z
n
are uncorrelated. Now, consider the process
x(t) = y
2
1
(t) +. . . +y
2
n
(t) (3.32)
By Ito’s lemma, x(t) follows
dx =
n
¸
i=1
2y
i
(t)
−
1
2
αy
i
dt +
1
2
βdz
i
+
n
¸
i=1
β
2
4
dt (3.33)
=
n
¸
i=1
−αy
2
i
+
β
2
4
dt +
n
¸
i=1
(βy
i
dz
i
) (3.34)
=
−αx(t) +n
β
2
4
dt +β
n
¸
i=1
(y
i
dz
i
) (3.35)
26 CHAPTER 3. STANDARD STOCHASTIC DIFFERENTIAL EQUATIONS WITH SOLUTIONS
Now, we perform a little trick by writing the last term as
β
x(t)
n
¸
i=1
y
i
x(t)
dz
i
. (3.36)
Now, it turns out that
n
¸
i=1
y
i
x(t)
dz
i
(3.37)
is actually a Brownian motion!
How can we see this? Well, note that if we interpret each Brownian increment as being normally dis
tributed with mean zero and variance dt, i.e. dz
i
∼ N(0, dt), then (3.37) is just the sum of Gaussians. But,
it is a well known property that the sum of Gaussians is Gaussian. Thus, (3.37) is normally distributed. For
it to be the increment of Brownian motion, heuristically, we only need to show that the mean is zero and
the variance is dt. To compute the mean we have
E
¸
n
¸
i=1
y
i
x(t)
dz
i
¸
=
n
¸
i=1
y
i
x(t)
E[dz
i
]
= 0 (3.38)
and the variance is computed as
V ar
¸
n
¸
i=1
y
i
x(t)
dz
i
¸
=
n
¸
i=1
y
2
i
x(t)
V ar(dz
i
)
=
n
¸
i=1
y
2
i
x(t)
dt
=
dt
x(t)
n
¸
i=1
y
2
i
= dt (3.39)
where we used that the dz
i
’s are independent and equation (3.32). Thus we can actually use the replacement
dz =
n
¸
i=1
y
i
x(t)
dz
i
(3.40)
to convert (3.35) to
dx =
−αx(t) +n
β
2
4
dt +β
x(t)dz (3.41)
Now, it turns out that for n ≥ 2, we are guaranteed to have a positive process (otherwise the process can
and will reach zero at times!). Hence, we should always make sure that our parameter choices correspond
to selecting n ≥ 2.
This is easily done since we can match coeﬃcients in (3.28) and (3.41). This gives
ab =
nβ
2
4
, b = α, c = β (3.42)
From these relationships, we get that n =
4ab
c
2
.
For n corresponding to an integer, we have the solution as the sum of OrnsteinUhlenbeck processes.
Thus, overall, the process x(t) will be ChiSquared distributed. A sample path of a CIR process is depicted
in Figure 3.3.
3.6 Summary
In this chapter we have explored some of the standard stochastic diﬀerential equation models used in ﬁnance.
It is important to have a feel for these processes what their parameters correspond to. Note that most of them
correspond to some transformation of simple Gaussian processes such as the OrnsteinUhlenbeck process or
the Poisson process. You can think of these models as building more and more complicated models from our
basic building blocks of Brownian motion and the Poisson process.
3.7. PROBLEMS 27
0 1 2 3 4 5
1.5
2
2.5
3
t
x
Simulation of CIR Dynamics
Figure 3.3: Simulation of a sample path of a CIR process, a = 2, b = 5, c = 0.5, x(0) = 3.
3.7 Problems
Problem 3.7.1 Solve the following stochastic diﬀerential equations:
(a) dx = µdt +σxdz
(b) dx = µxdt + σxdz + x(Y − 1)dπ where π is a Poisson process with intensity λ and Y is a random
variable.
(c) What does your answer in (b) look like if Y is lognormally distributed.
Problem 3.7.2 Consider the process
dx = −b(x −a)dt +cdz (3.43)
Using Ito’s lemma, derive a stochastic diﬀerential equation for y = e
x
in terms of only y. This model is
lognormal due to the fact that x is normal, but has a mean reverting property in log coordinates.
Problem 3.7.3 Verify equation (3.23). (Hint: Let f(x, z, t) = e
−(b−
1
2
f
2
)t−fz
x where z has the trivial sde
dz = dz and use the multidimensional Ito’s lemma.)
28 CHAPTER 3. STANDARD STOCHASTIC DIFFERENTIAL EQUATIONS WITH SOLUTIONS
Chapter 4
The Factor Approach to Arbitrage
Pricing
4.1 The Factor Approach to Arbitrage Pricing
In this chapter we present absence of arbitrage conditions when returns are described by linear factor models.
In doing so, we rely heavily on Ross’ Arbitrage Pricing Theory [13]. However, we do so in light of the
application to derivative pricing. We will ﬁnd in the following chapters that when coupled with Ito’s lemma,
a simple absence of arbitrage equation results in perhaps the simplest way to obtain the vast majority of
BlackScholes type pdes that appear in derivative pricing. To back up this claim we will derive pdes in a
great number of situations. But ﬁrst we must understand the underlying principle, which is in fact quite
simple. Let’s begin...
4.2 Returns and Factors Models
Most of the modeling in asset pricing theory is done using linear factor models. Hence, we adopt that
paradigm here. That is, a return is assumed to be of the form
r = α +βf (4.1)
where a and b are constants and f is a random variable called a factor. This is a one factor model. We will
also consider multifactor models of the form
r = α +
n
¸
i=1
β
i
f
i
(4.2)
where each f
i
, i = 1, ..., n is a random factor.
4.2.1 Returns
Above I said we would model returns as factor models. However, returns are also related to price changes.
Hence, factor models should arise from price changes as well.
Given a period of time ∆t, the return on an asset is deﬁned as
r =
P(t + ∆t) −P(t)
P(t)
(4.3)
where P(t) is the amount invested at time t and P(t + ∆t) is the amount received at time t + ∆t. Figure
4.1 shows the cash ﬂow diagram corresponding to this where the time increment it denoted as dt.
29
30 CHAPTER 4. THE FACTOR APPROACH TO ARBITRAGE PRICING
Figure 4.1: Cash Flow Diagram
4.2.2 Stochastic diﬀerential equations and factor models
In ﬁnance, we like to model asset prices as stochastic diﬀerential equations. For instance, a stock price could
be
dS = µSdt +σSdz. (4.4)
This is a model of a price change over time dt. Hence, I should be able to use it to compute the return of
an asset over time dt. In fact, it is given by:
r =
S(t +dt) −S(t)
S(t)
=
dS(t)
S(t)
= µdt +σdz. (4.5)
This is also an example of a factor model. In the above formula, if we associate α = µdt, β = σ, and f = dz
then it is a standard linear factor model.
In general, if I am given a stochastic diﬀerential equation
dx(t) = a(x(t), t)dt +b(x(t), t)dz (4.6)
then it is describing price changes over time period dt, and I can use it to determine a model of returns:
r =
dx(t)
x(t)
=
a(x(t), t)
x(t)
dt +
b(x(t), t)
x(t)
dz (4.7)
This also looks like a factor model with the factor being dz. In fact, I like to write it in the form:
r = αdt +βdz (4.8)
where α =
a(x(t),t)
x(t)
and β =
b(x(t),t)
x(t)
(and arguments are being suppressed). This is slightly diﬀerent from
the standard factor model in that I explicitly write out dt in the ﬁrst term. I like to go even a little further
and interpret dt as a special factor which is nonrandom. This is purely for convenience, but I will use this
convention throughout.
Another note is that α and β don’t seem to be constant! However, we should view a factor model from
an SDE as being valid only over dt and conditioned upon information at time t, including x(t) (or t− and
x(t−) if appropriate). Therefore, conditioned upon information at time t, x(t) is known and α and β are
then known constants over time t to t +dt.
Therefore, SDEs lead to instantaneous factor models that apply over time increments of dt. In what
follows, I will use notation that is indicative of SDEs in that factors will be denoted by dz and I will include
a special factor dt when I describe returns. However, for the sections that follow, you may think of the factor
dz as being any random variable, not just Brownian motion, and dt can be any time increment, not just an
instantaneous change in time.
4.3. THE FACTOR APPROACH TO ARBITRAGE USING RETURNS 31
4.3 The Factor Approach to Arbitrage using Returns
We consider the returns of tradable assets. By a tradable asset, I mean an asset that you can actually buy.
For instance, you can purchase a share of stock. If this stock pays a dividend, then when you purchase the
share of stock, you have purchased the dividend stream as well. Note that you cannot just purchase the
”price” of the stock, only a share, and hence you are stuck with everything that comes along with purchasing
a share (such as dividends). So, let’s consider the returns of tradable assets.
Let’s list the returns of tradable assets that are driven by the factor dz
r = αdt +βdz (4.9)
where r ∈ R
n
, α ∈ R
n
are vectors and β ∈ R
n×m
is a matrix. The factors are contained in the vector
dz ∈ R
m
. That is, I have stacked up all the returns in a vector.
Now consider a portfolio of these assets. We will represent the portfolio by a vector x ∈ R
n
which denotes
the dollar amount invested in each tradable asset. Hence, the total cost of our portfolio is
cost =
n
¸
i=1
x
i
= x
T
1 (4.10)
where 1 is a vector of ones.
Over the time period dt, the returns in (4.9) indicate the change in value of each tradable asset. Hence,
the total change in our portfolio is given by:
proﬁt/loss =
n
¸
i=1
x
i
r
i
= x
T
r. (4.11)
We can analyze this return in a little more detail by writing out what each r
i
is
x
T
r = x
T
(αdt +βdz) = (x
T
α)dt + (x
T
β)dz. (4.12)
We can now see that this return will be riskless as long as
x
T
β = 0 (4.13)
since that is the coeﬃcient of the dz term which is providing the randomness. If x
T
β = 0 then the return
on this portfolio is deterministic and given by x
T
αdt.
4.3.1 Arbitrage
Let us consider a simple portfolio. I will choose this portfolio so that it costs nothing and has no risk. That
is
x
T
1 = 0 No cost (4.14)
x
T
β = 0 No risk (4.15)
But, if something costs nothing and has no risk, then it better have no return! If it had a positive return, this
would be an arbitrage, since I would be guaranteed to make money (no risk), and I didn’t use any money to
do it (no cost). If it had a negative return, then the portfolio −x would be an arbitrage. Therefore, to have
no arbitrage, I must have no return as well. This leads us to a key condition which I state as the following
implication
(⋆) A (not very useful) Necessary Absence of Arbitrage Condition
x
T
1 = 0 No cost
x
T
β = 0 No risk
⇒ x
T
α = 0 No return (4.16)
32 CHAPTER 4. THE FACTOR APPROACH TO ARBITRAGE PRICING
which can be written in matrix form as
¸
1
T
β
T
x = 0 ⇒ α
T
x = 0. (4.17)
As the name indicates, the above condition for absence of arbitrage is not terribly useful. However, an equiv
alent condition is extremely useful and will provide the foundation for our derivations of partial diﬀerential
equations. In fact, the condition which is stated next is astonishingly useful! But to derive it, we need to
ﬁrst recall some linear algebra relationships.
4.3.2 Null and Range Space Relationship
Note that the condition (4.17) can be written as
Ax = 0 ⇒ α
T
x = 0 where A =
¸
1
T
β
T
. (4.18)
Now, the set of all vectors x such that Ax = 0 is known as the null space of the matrix N(A). That is
N(A) = {xAx = 0} . (4.19)
On the other hand, the set of all vectors y such that there exists an x with y = Ax is known as the range
space of the matrix A. That is
R(A) = {y  ∃x such that Ax = y } . (4.20)
Finally, we recall the notion of the perpendicular set of a given set of vectors. Let M be a set of vectors,
then M
⊥
is a set of all vectors z such that z is orthogonal to all vectors in M. That is
M
⊥
=
¸
z  z
T
x = 0, ∀x ∈ M
¸
(4.21)
In order to derive a useful condition from (4.17), we will use the following relationship between the null
and range space of a matrix.
(⋆) Null and Range Space Relationship: N(A)
⊥
= R(A
T
).
Proof: The proof of this is rather simple.
x ∈ N(A) ⇒ Ax = 0 ⇒ y
T
Ax = 0 for all y. (4.22)
Then
y
T
Ax = (A
T
y)
T
x = 0 (4.23)
which means that A
T
y is orthogonal to the null space of A for all y. That is exactly saying that
N(A)
⊥
= R(A
T
). (4.24)
2.
4.3.3 A Useful Absence of Arbitrage Condition
Using the null and range space relationship, we can convert (4.17) into a very useful condition. It’s importance
cannot be overstated. Hence it receives two stars!!
(⋆⋆ Return APT) A Useful Necessary Absence of Arbitrage Condition
4.3. THE FACTOR APPROACH TO ARBITRAGE USING RETURNS 33
A necessary and suﬃcient condition for the implication (4.16) to be true is for there to exist a vector
ˆ
λ ∈ R
m+1
such that
1 β
ˆ
λ = λ
0
+βλ = α (4.25)
where
ˆ
λ =
¸
λ
0
λ
(4.26)
with λ
0
∈ R and λ ∈ R
m
.
Many of you will hopefully recognize this as nothing more than the simple version of Ross’ 1976 arbitrage
pricing theory [13]. Its derivation is given below.
Proof: Note that the condition in (4.17) indicates that if a portfolio x is in the null space of the matrix
¸
1
T
β
T
then it also must be orthogonal to α. Another way to put this is that
α ∈
N
¸
1
T
β
T
⊥
(4.27)
where N(·) is the null space. But, using the null and range space relationship, this means that
α ∈ R
¸
1
T
β
T
T
(4.28)
Thus, there exists a vector
ˆ
λ ∈ R
m+1
such that [ 1 β ]
ˆ
λ = α. 2
4.3.4 Interpretations
Let’s try to get a bit of intuition into the mystical λ’s.
Market Price of Risk
The APT equation for a single factor model r = µdt +σdz is
µ = [ 1 σ ]
ˆ
λ = λ
0
+σλ
1
. (4.29)
Let’s look at this equation. On the left side is the expected return (µ), and on the right side is the volatility
(σ). Hence, this equation relates volatility to expected return.
A better way to think of σ is not as volatility, but as the amount of the risk factor dz that you have. If
σ is zero, then you are not exposed to the risk dz at all. If it is large, then you have purchased a lot of that
risk. Therefore, λ
1
tells you how much your expected return is increased (assuming λ
1
is positive) for each
unit of the risk dz that you take on. For this reason, λ
1
is called the ”market price of risk”. In this case, λ
1
is the market price of the risk factor dz. This is a nice interpretation for λ
1
.
The Market Price of Time
But, we also have this pesky λ
0
to deal with. It is not tied to any risk. In fact, if we don’t take on any risk
(i.e. σ = 0), then µ = λ
0
. Intuitively, if we don’t have any risk, then we should be earning the risk free rate.
Hence, we might guess that λ
0
= r
0
, where r
0
denotes the risk free rate of interest. This is in fact correct.
But, let’s justify this through a slightly diﬀerent argument that will also get us a nice interpretation for λ
0
.
34 CHAPTER 4. THE FACTOR APPROACH TO ARBITRAGE PRICING
Let us consider a risk free asset. Its factor model is given by:
dB
0
B
0
= r
0
dt (4.30)
where r
0
is a constant risk free rate of interest. Since it must satisfy our return relationship, we have:
r
0
= λ
0
+ 0(λ
1
) = λ
0
(4.31)
which tells us that λ
0
= r
0
. If the λ’s are the market prices of factors, then we may interpret λ
0
as the
reward for the time factor dt or the ”market price of time”. Intuitively, this makes perfect sense, since the
risk free rate is the amount we are rewarded for taking on time and nothing else.
Now all the λ’s should make sense. They relate the diﬀerent factors dt, dz, etc. to how much we are
rewarded for taking on those factors. In a market with no arbitrage, every factor has a market price. Your
return is just given by looking at how much of each factor you have taken on, and multiplying them by their
market price and adding them up. Quite simple, right?
This is the basic interpretation of the concept of ”market price of risk”. Don’t forget it. It is very useful
to have this intuition.
4.3.5 A Problem with Returns
Using returns to model assets has a disadvantage. There are contracts that involve no upfront cost or price.
Hence, they don’t have a well deﬁned return, since by deﬁnition a return involves dividing by the price,
which can be zero. A futures contract is an example of this, since the marktomarket mechanism resets the
value of a contract to zero every day. Therefore, in the above framework, a futures contract must be dealt
with as a special case. I don’t like special cases, so below we will reformulate absence of arbitrage conditions
but in terms of price changes rather than returns. This eliminates the need for special cases.
4.4 The Factor Approach using Price Changes
I just mentioned that there can be some problems in dealing with returns. Hence, in this section we will
reformulate the factor approach to arbitrage pricing by working with prices rather than returns. In this case,
it will be okay to have an asset with a zero price.
4.4.1 Price Changes and Arbitrage
Let’s return to our arbitrage portfolio and reformulate it in terms of price changes. In our original argument,
we let r ∈ R
n
be the returns of assets and speciﬁed the dollar amount invested in each asset by a vector
x ∈ R
n
. This time we will specify a vector of prices per unit of tradables P ∈ R
n
, cash ﬂows resulting from
the changes in value of the tradables per unit dV ∈ R
n
, and shares or units of each asset purchased y ∈ R
n
.
The cash ﬂow from changes in value over the period will satisfy the factor model
dV = Adt +Bdz (4.32)
where A ∈ R
n
, B ∈ R
n×m
, and dz ∈ R
m
. The simple cash ﬂow diagram is given in ﬁgure 4.2.
4.4.2 Proﬁt/Loss and Arbitrage
In this case, the proﬁt/loss on the portfolio is given by:
y
T
(dV) = y
T
(Adt +Bdz) = (y
T
A)dt + (y
T
B)dz (4.33)
Therefore, an arbitrage portfolio is one that has:
4.4. THE FACTOR APPROACH USING PRICE CHANGES 35
Figure 4.2: Cash Flow Diagram with Prices P and Value Changes dV
• No Cost: y
T
P = 0,
• No Risk: y
T
B = 0,
but has a proﬁt
• Proﬁt: y
T
A = 0.
Of course, we want to eliminate arbitrages, so we would like the following implication to hold.
(⋆) A (not very useful) Necessary Absence of Arbitrage Condition
y
T
P = 0 No cost
y
T
B = 0 No risk
⇒ y
T
A = 0 No return (4.34)
which can be written in matrix form as
¸
P
T
B
T
y = 0 ⇒ A
T
y = 0. (4.35)
Once again, we can convert this to a dual condition that is useful:
(⋆⋆Price APT) A Useful Necessary Absence of Arbitrage Condition
A necessary condition for no arbitrage is for there to exist a vector
ˆ
λ ∈ R
m+1
such that
P B
λ = Pλ
0
+Bλ = A (4.36)
where
ˆ
λ =
¸
λ
0
λ
(4.37)
with λ
0
∈ R and λ ∈ R
m
.
What is the relationship between the return approach and the price change approach. Well, of course
they are highly related. The λ’s in both cases are the same, and have the same interpretation. They are
market prices of risk. The main diﬀerence is that the price approach uses shares or units of the asset to
describe the portfolio, not dollar amount. Furthermore, it describes proﬁt/loss in terms of the cash ﬂow, not
returns. These are really superﬁcial diﬀerences, but it is easier to understand some pricing situations, such
as futures contracts, in the price approach rather than in terms of returns.
36 CHAPTER 4. THE FACTOR APPROACH TO ARBITRAGE PRICING
4.5 Two standard examples
In this section we will give some little examples to help us gain some intuition into the approach. Let’s start
with a stock and a bond.
4.5.1 Stocks
Assume that a stock follows a geometric Brownian motion (GBM) and there is a bond that earns the risk
free rate, r
0
. Their dynamics are given below:
dB = r
0
Bdt (4.38)
dS = µSdt +σSdz. (4.39)
What do our absence of arbitrage conditions say about these assets?
Returns
In terms of returns, we can write that:
dB
B
= r
0
dt (4.40)
dS
S
= µdt +σdz (4.41)
Hence, the APT equation says [ 1 β ]
ˆ
λ = α or
¸
1 0
1 σ
¸
λ
0
λ
1
=
¸
r
0
µ
(4.42)
Solving this equation for λ
0
and λ
1
gives
λ
0
= r
0
(4.43)
λ
1
=
µ −r
σ
(4.44)
Note that the market price of risk for dz is like an instantaneous Sharpe ratio. We will see this equation pop
up many times.
Prices
We can derive the same results using prices. In terms of prices and changes in value, the absence of arbitrage
equation says that [ P B ]
ˆ
λ = A or
¸
B 0
S σS
¸
λ
0
λ
1
=
¸
r
0
B
µS
(4.45)
Solving this equation for λ
0
and λ
1
gives
λ
0
= r
0
(4.46)
λ
1
=
µ −r
σ
, (4.47)
as expected.
4.6. SUMMARY 37
4.5.2 Futures contracts
One can enter into a futures contract without paying any money. This means that a futures contract is a
special case in our setup, and is diﬃcult to understand in the context of returns because its return is not
deﬁned (it has zero price!). It is much more naturally considered in terms of prices and value changes. Below
is the cash ﬂow diagram for a futures contract.
The critical diﬀerence between futures contracts and forward contracts is that futures contracts are
marked to market and settled daily. This means that they always begin the day with a zero price. At the
end of the day, the price change is settled. Note that this price change is not discounted, but rather the
change in the futures price. Hence, the change in value of the portfolio is equal to the change in the futures
price dV = df!
Figure 4.3: Cash Flow Diagram for Futures Contract
Therefore, a futures contract can have zero price, but the cash ﬂow from the change in value will be given
by
df = µfdt +σfdz. (4.48)
Hence, in this case, we may consider a market with a bond and a futures contract. The relevant quantities
can be written as
prices changes factors
B dB = r
0
Bdt
0 df = µfdt +σfdz.
(4.49)
Now let’s look at the price based arbitrage equation [ P B ]λ = A or
¸
B 0
0 σS
¸
λ
0
λ
1
=
¸
r
0
B
µS
. (4.50)
Solving this equation for λ
0
and λ
1
gives
λ
0
= r
0
(4.51)
λ
1
=
µ
σ
. (4.52)
Note that a futures contract refers to the price at a ﬁxed time in the future. Hence, time is ﬁxed, and a
futures contract is purely a bet on the outcome of a random factor. Time is not in the mix. Hence, we see
that the equations separate. The bond is purely time, and the market price of time is r
0
. On the other hand,
the futures contract is purely a bet on the outcome of the factor. Hence time is not mixed into it, and the
market price of risk does not involve the risk free rate r
0
.
4.6 Summary
We have derived forms of an aribtrage pricing theory based either on linear factor models of returns or
value changes. These simple results will allow us to derive many of the partial diﬀerential (or diﬀer
ence/integral/etc.) equations that arise in derivative pricing theory. But before doing so, we develop a
38 CHAPTER 4. THE FACTOR APPROACH TO ARBITRAGE PRICING
derivative pricing framework based upon our APT factor based approach. Then, in following chapters we
tackle examples in equity derivatives, interest rate derivatives, and beyond. What you should take away is
that simple arbitrage ideas that we derived in this chapter are the underpinnings of derivative pricing theory.
4.7 Problems
Problem 4.7.1 (How to construct an arbitrage)
Assume that there does not exist a λ such that α = [1 β]λ. Then this means that an arbitrage opportunity
exists. That is, there exists a vector x specifying dollar amounts invested such that
α
T
x > 0 and 1
T
x = 0, β
T
x = 0. (4.53)
One way to select a speciﬁc arbitrage would be to solve an optimization problem of the form
max
x
α
T
x subject to 1
T
x = 0, β
T
x = 0, x
1
= 1. (4.54)
where the constraint x
1
limits the total dollar amount of long and short positions to equal $1. However,
in practice an alternative but equivalent approach is often used, as follows.
Choose λ such that α −[1 β]λ has minimum 2norm. That is
min
λ
α −[1 β]λ
2
. (4.55)
If λ
∗
is this optimal λ, then we can deﬁne
γ = α −[1 β]λ
∗
. (4.56)
One could then solve the optimization problem
max
x
γ
T
x subject to 1
T
x = 0, β
T
x = 0, x
1
= 1. (4.57)
Show that this optimization problem is equivalent to the optimization problem in (4.54).
Problem 4.7.2 (More Linear Algebra and Dualities)
The Farkas Alternative [7] of linear algebra says the following
(i) Ax = b has a solution x ≥ 0
or (exclusive)
(ii) y
T
A ≥ 0, y
T
b < 0 has a solution y.
(4.58)
We can use the Farkas Alternative for implications as well. Consider the implication:
Ax = 0 ⇒ b
T
x = 0 (4.59)
Prove by use of the Farkas alternative that this implication is true if and only if there exists a λ such that
b = A
T
λ. (4.60)
(Hint: Note that the implication is true if and only if there is no solution to Ax = 0 and b
T
x > 0.)
Problem 4.7.3 (Early Exercise and American Options)
American options allow the holder of the option to exercise at any time prior to expiration. Let’s explore
how this would aﬀect the absense of arbitrage ideas.
4.7. PROBLEMS 39
(a) Assume that the factor model for the option if it is not exercised is
r
1
= α
c
dt +β
c
dz. (4.61)
Explain why the factor model for the return on an American option, where E is the early exercise value and
c is the price of the option, is either
r
1
= α
c
dt +β
c
dz or r
1
=
E −c
cdt
dt (4.62)
depending on whether the option is exercised or not.
(b) Assume that you are long the American option. As the holder of an American option, you get to decide
when to exercise the option. Let α and β correspond to factor models for tradable assets, where the ﬁrst
elements of these vectors α
1
and β
1
corresponds to the American option. Finally, let x be a vector of dollar
amounts invested in each asset (with x
1
corresponding to the American option).
Argue that a necessary condition for you not to be able to arbitrage is that the following implication is
true regardless of whether you are exercising the American option or not,
e
1
T
x = 1, 1
T
x = 0, β
T
x = 0 ⇒ α
T
x ≤ 0. (4.63)
(I.e., the implication must hold regardless of which return in (4.62) is being chosen.)
(c) Assume that the vectors e
1
, 1, β are linearly independent. Using the Farkas Alternative from Problem
4.7.2, derive that an alternative absence of arbitrage condition is that there exists scalars ξ ≥ 0, and λ
0
and
λ
1
such that
α = λ
0
+βλ
1
−e
1
ξ. (4.64)
(d) Finally, using (4.64) and Part (a) of this problem, argue that for the American option, both of the
following conditions must hold
α
c
≤ λ
0
+β
c
λ
1
(4.65)
E ≤ c. (4.66)
Problem 4.7.4 (Modeling the binomial lattice as a factor model)
Consider a stock that can move on a binomial lattice, where at each step the stock price S(k) can either
move up to S(k + 1) = uS(k) or down to S(k + 1) = dS(k) with the probability structure
S(k + 1) =
uS(k) w.p. p
dS(k) w.p. 1 −p
(4.67)
Let ∆Z(k) be a standard binary random variable
∆Z(k) =
1 w.p. p
−1 w.p. 1 −p
(4.68)
Show that ∆S(k) = S(k + 1) −S(k) can be written in the form
∆S(k) = A(k) +B(k)∆Z(k) (4.69)
by ﬁnding A(k) and B(k).
40 CHAPTER 4. THE FACTOR APPROACH TO ARBITRAGE PRICING
Chapter 5
Constructing a Factor Pricing
Framework
5.1 Introduction
This chapter lays the framework for derivative pricing. That is, we try to provide the structure behind the
factor approach. Thus, the goal is to clarify the structure of the modeling involved in the factor approach,
and to provide an almost step by step method for attacking any problem.
As I proceed through this chapter, I will use two derivative pricing examples to illustrate points. The
examples are a call option on a nondividend paying stock, and an absence of arbitrage zero coupon bond
pricing model. In particular, I will also emphasize the relative pricing nature of derivative pricing. That is,
derivative pricing determines an appropriate price relative to other securities that have already been priced
in a market. Before jumping into any of these ideas of derivative pricing, let’s start by classifying the relevant
quantities in a model of any market.
5.2 A Classiﬁcation of Quantities
In this book, we will be interested in the pricing of derivative securities. However, to understand that pricing
theory, it is best to ﬁrst understand the general modeling paradigm.
In our modeling framework, we will classify all quantities into three (possibly overlapping) categories.
They are factors, underlying variables, and tradables.
5.2.1 Factors
Factors are the most basic source of randomness in our models. In general, they are the driving Brownian
motions (z(t)) and Poisson processes (π(t)). Furthermore, I like to think of time as a special factor. Thus,
in our models, the factors will show up as the dz, dπ, and dt terms.
5.2.2 Underlying Variables
Underlying variables are often quantities of interest that are functions of the factors. For example, an interest
rate could be an underlying variable. A stock price S(t) could also be an underlying variable. In general,
underlying variables are ﬁnancially relevant quantities that are functions of the factors, and used in the
modeling process.
41
42 CHAPTER 5. CONSTRUCTING A FACTOR PRICING FRAMEWORK
5.2.3 Tradables
Tradables are the quantities that you can actually trade and include in a portfolio. They are modeled
as being functions of the underlying variables. Some of the time, the functional relationship between the
tradable and an underlying variable is trivial. For example, a stock price S(t) can be an underlying variable,
and also a tradable. In other cases, the tradable is a more complicated function of an underlying variable.
For example, an interest rate r(t) can be an underlying variable, but it is not tradable. Instead, a bond
B(r(t), t) is tradable, and represented as a function of the underlying variable.
Figure 5.1: Picture of the Modeling Paradigm
It is extremely important to be able to separate quantities that are tradable from those that are not.
Why? Because tradables, and only tradables, are the quantities that must satisfy the absence of arbitrage
relationships from the previous chapter. And those absence of arbitrage relationships will lead to our pricing
formulas.
Let’s consider an example to clarify the notion of what is tradable.
Example: A stock paying a dividend
To make sure we understand what a tradable is, let’s consider the example of a stock that pays a dividend.
The stock has price S(t), and let’s assume that it pays a continuous dividend at a rate of q. What this means
is that by reinvesting the dividend back into the stock, if you started by purchasing 1 share of stock at time
0, by time t you would have e
qt
shares.
What is the tradable quantity? You might be tempted to say that the price of the stock S(t) is tradable.
However, note that you cannot just purchase the ”price of the stock”. Instead, you must purchase a share
of stock, and with this share you not only get the price of the stock but also the dividend. The point is that
you can’t decouple the price from the dividend. They come together and that is the tradable.
5.2.4 A Derivative is a Tradable
In this book we are interested in pricing derivative securities. So, it is important to begin by deﬁning what
we mean by a derivative security.
Derivative Security: A derivative security is a tradable whose value depends upon (or is a function of)
other underlying variables. In this case, we say that the derivative security is derivative to the underlying
variables.
Note that this deﬁnition of a derivative does not particularly distinguish it from any other tradable! We
typically model all tradables as being a function of some underlying variables (although often a tradable is
an underlying variable and hence trivially a function of itself).
In principle, there is no real distinction between a derivative and any other tradable. You will see that
the most important distinction is that a derivative is ”what you want to price” and that other tradables are
”already priced” in the market.
The name derivative comes from the fact that the value of a derivative is ”derived” from some underlying
variables. In order to have something to keep in mind, let’s use the following example:
5.3. FACTOR MODELS FOR UNDERLYING VARIABLES AND TRADABLES 43
Example: European Call Option
Consider a European call option, which is the option to purchase a speciﬁed stock, say CocaCola, for a
speciﬁed price (the strike price), at a speciﬁed date (the expiration date). In this case, the option is a
derivative, because its value depends upon the stock price of CocaCola. Thus, the stock price of CocaCola
is the underlying variable.
In this example, CocaCola is an underlying variable and also a tradable. However, it is not always the
case that the underlying variable must also be tradable. In general, derivative securities are quite ﬂexible, and
so are the possible underlying variables. In fact, there are derivatives that depend on underlying variables
such as temperature or even wind speed which clearly are not tradable.
5.3 Factor Models for Underlying Variables and Tradables
To use the APT equations from the previous chapter, what we need are linear factor models for the price
changes of tradables. In the previous chapter we saw that we can interpret SDEs as instantaneous factor
models. Hence, in continuous time models, we would like SDE models for underlying variables and tradables
in particular.
In general, there are two ways that we arrive at factor models for tradables. The ﬁrst is that we directly
model the tradable as a factor model. The second (and most important) is Ito’s lemma.
5.3.1 Direct Factor Models
In many cases, we begin the modeling process by writing down a factor model or SDE. Examples of this
include the geometric Brownian motion model of stock price movement.
dS = µSdt +σSdz (5.1)
or a model of an interest rate (most likely the instantaneous short rate) as an SDE
dr
0
= adt +bdz. (5.2)
Thus, in many case, we begin the modeling process by writing down SDEs (or factor models) for underlying
variables and tradables.
5.3.2 Factor Models via Ito’s Lemma
The second method to obtain factor models is via Ito’s lemma. Since tradables, and derivatives in particular,
are functions of underlying variables, if we have an SDE model for an underlying then we can use Ito’s lemma
to obtain an SDE for the tradable.
Consider the example of a European call option on a stock following geometric Brownian motion as in
(5.1). The call option is a function of the underlying stock price S(t) and time t. We write this as c(S(t), t).
Then, by Ito’s lemma we have
dc = (c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
)dt +σSc
S
dz (5.3)
which is a factor model for the call option c.
In some cases, we will know explicitly the functional relationship between the tradable or derivative and
the underlying variable. In that case, in Ito’s lemma we would be able to explicitly compute the partial
derivative terms c
t
, c
S
, etc.
A picture of the route to SDEs for tradables is given in Figure 5.2
44 CHAPTER 5. CONSTRUCTING A FACTOR PRICING FRAMEWORK
Figure 5.2: Obtaining factor models or SDEs for tradables.
5.4 Tradables tables
We will approach the derivative pricing problem using the Price APT equation (4.36), rather than the Return
APT. But note that everything in this book can also be done use the Return APT equation.
To use the Price APT equation, we need two things: prices and factor models for value changes. The
previous section indicated how to obtain the factor model for value changes. Thus, we can tabulate prices
and factor models for value changes for all tradables. When we list these in a table, we call this a tradable
table. For instance, in a market with a stock, a bond, and a call option on the stock, the tradable table is
Prices
B
S
c
¸
¸
Changes
d
B
S
c
¸
¸ =
Factor Model
r
0
B
µS
(c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
)
¸
¸
dt +
0
σS
σSc
S
¸
¸
dz
(5.4)
This is called a tradable table to emphasize the fact that only tradables need satisfy the absence of arbitrage
conditions. It supplies all the basic information that we need to extract from the market when applying the
Price APT.
5.5 Applying the Price APT
The ﬁnal step to a derivative pricing equation is to apply the Price APT equations. However, before applying
the Price APT equation, we ﬁrst separate marketed tradables from the derivative the we would like to price.
5.5.1 Relative Pricing and Marketed Tradables
Once we have set up the tradables table, we designate certain tradables as marketed. What we mean by a
marketed tradables is a tradable that is ”already priced” by the market.
You should contrast the marketed tradables with the derivative that we would like to price. The derivative
is what we ”want to price” using the information given from the marketed assets. This emphasizes the relative
pricing nature of derivative pricing. We price a derivative security relative to the marketed tradables. That
is, derivative pricing is a method of pricing a new tradable (the derivative) consistently with the existing
price of marketed tradables.
To see how this works, we separate the marketed tradables from the derivative, and put them all in a
tradables table.
Prices
¸
P
m
P
Changes
d
¸
V
m
V
=
Factor Model
¸
A
m
A
dt +
¸
B
m
B
dz
(5.5)
5.5. APPLYING THE PRICE APT 45
The tradables with the subscript m are the marketed tradables, and the tradable without any subscript is
the derivative that we are pricing.
5.5.2 Pricing the Derivative
Now we can use the Price APT to price the derivative. The Price APT equation says there needs to exist
λ
0
and λ such that
A
m
= P
m
λ
0
+B
m
λ (5.6)
A = Pλ
0
+Bλ. (5.7)
To obtain the market prices of risk λ
0
and λ, we only use the marketed tradables. Thus, we solve (5.6) to ﬁnd
the market prices of risk (often in terms of information from the marketed tradables). The market prices of
risk are then plugged into (5.7) which is the pricing equation for the derivative.
One may think of this procedure as ﬁrst calibration to market data, followed by pricing the new derivative.
Using marketed tradables to determine the market prices of risk is like calibrating parameters (market prices
of risk) in our absence of arbitrage pricing model to known data (marketed tradables). Once our model is
calibrated (the market prices of risk are determined), we can apply our absence of arbitrage model to price
other tradables (derivatives) that also must satisfy the same absence of arbitrage relationship.
This discussion has been a little abstract, so let’s see an example
Example: Pricing a European Call Option
Let’s take the tradables table in (5.4) and use it to illustrate pricing. Assume that c is a European call
option on the stock S.
First, we designate the bond B and stock S as being the marketed assets, and c is the derivative that we
would like to price. Thus, the price APT implies that
B
S
c
¸
¸
λ
0
+
0
σS
σSc
S
¸
¸
λ
1
=
r
0
B
µS
c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
¸
¸
. (5.8)
where the top two rows correspond to the marketed tradables. Then, we use the marketed tradables to
determine the market prices of risk. Solving these equations gives λ
0
= r
0
and λ
1
=
µ−r0
σ
.
Finally, we use these market prices of risk in the third equation for our derivative to obtain
c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
= r
0
c +
µ −r
0
σ
σSc
S
(5.9)
Rearranging this gives
c
t
+r
0
Sc
S
+
1
2
σ
2
S
2
c
SS
= r
0
c (5.10)
which is the BlackScholes partial diﬀerential equation for the price of an option. If we specify the boundary
condition for a European call option as
c(S, T) = (S −K)
+
, c(0, t) = 0 (5.11)
then this completely describes the price of the option.
A picture of the application of the Price APT is given in Figure 5.3.
5.5.3 Underdetermined and Overdetermined Systems
In the example above everything was perfect! I had two equations (one for the bond and one for the stock),
and two unknowns (the market price of time λ
0
and the market price of risk λ
1
) corresponding to the
marketed tradables. Thus, I could solve the equations for λ
0
and λ
1
. It was perfect!
Now, what if things aren’t so perfect. That is, what if the system of equations arising from the Price
APT for the marketed tradables is either underdetermined or overdetermined. Let use an example.
46 CHAPTER 5. CONSTRUCTING A FACTOR PRICING FRAMEWORK
Figure 5.3: A picture of the application of the Price APT.
Underdetermined and Incompleteness
Assume that our assets are given by
dB = r
0
Bdt (5.12)
dS = µSdt +σ
1
Sdz
1
+σ
2
Sdz
2
. (5.13)
In this case, the price APT becomes
¸
B 0 0
S σ
1
S σ
2
S
λ
0
λ
1
λ
2
¸
¸
=
¸
r
0
B
µS
(5.14)
And we have three unknowns (λ
0
, λ
1
, and λ
2
), but only two equations! Thus, we can’t uniquely solve for
the market prices of risk! (In this case, we can solve for λ
0
= r
0
, but not uniquely for λ
1
or λ
2
.)
What can we say in this situation and how should we think of this? Well, ﬁrst, we can say that if any
solution exists (it doesn’t have to be unique, we just need for at least one solution to exist), then there is no
arbitrage. This is guaranteed by the APT equations.
Let’s assume that many solutions exists. For example in the above equations, there are multiple possible
values for λ
1
and λ
2
. Thus, there are many possible market prices of risk that satisfy the no arbitrage
condition. This just means that from the tradable assets in the market (B and S), we cannot uniquely infer
the market prices of risk for dz
1
and dz
2
. There are many possibilities, and all are arbitrage free.
The practical consequence of this is that if we are asked to price a new security that depends on dz
1
and/or dz
2
, we will not be able to assign it a unique absence of arbitrage price. This is because the APT
equation (in either return or price form) acts as a pricing equation. (This use will become clear in the
following chapters.) However, it only provides a unique price if we have unique values for the market prices
of risk. This situation is called an incomplete market.
Incomplete markets are common in practice, and you will see in subsequent chapters that in order to
price derivative securities in incomplete markets, we must select values for market prices of risk that are not
uniquely deﬁned. Since market prices of risk relate risk to reward for various factors, selecting a value for
5.6. THREE STEP PROCEDURE 47
a market price of risk is essentially the same as specifying how investors in the market trade oﬀ risk and
return. Thus, in incomplete markets some speciﬁcation of the risk preferences of investors is needed to assign
a unique price to derivative securities. Furthermore, this speciﬁcation of risk preferences is captured by the
selection of the market price of risk.
The above discussion might be a little abstract at this point, but in subsequent chapters you might
want to refer back to it when faced with pricing of derivatives in incomplete markets (see for example,
jumpdiﬀusion models or stochastic volaility).
Overdetermined and Calibration
Now let’s consider the opposite situation. That is, when the set of equations is overdetermined. For example,
let the tradable assets be
dB = r
0
Bdt (5.15)
dS
1
= µ
1
S
1
dt +σ
1
S
1
dz (5.16)
dS
2
= µ
2
S
2
dt +σ
2
S
2
dz. (5.17)
In this case, the price APT becomes
B 0
S
1
σ
1
S
S
2
σ
2
S
¸
¸
¸
λ
0
λ
1
=
r
0
B
µ
1
S
1
µ
2
S
2
¸
¸
(5.18)
In this case there are three equations and only two unknowns (λ
1
and λ
2
)! This system looks to be overde
termined!
Now, we know from the price APT that for no arbitrage to exist there must exist a solution to this set
of equations. However, in general, for an overdetermined system of equations no solution will exist! What
does this mean?
Well, the ﬁrst thing it means is that strictly speaking, there is an arbitrage opportunity! But, the way
this situation often plays out in practice is usually slightly diﬀerent. In practice this situation often leads to
some sort of calibration procedure.
Instead of declaring that an arbitrage exists, a trader will often just assume that the models being used
for B, S
1
and S
2
are not perfect, and that is the reason that no solution exists. Thus, the trader will search
for the λ
1
and λ
2
that best ﬁt the absence of arbitrage equations. They often call this process calibration,
and in practice it may be diﬃcult to recognize directly as searching for best ﬁt λ’s. (Watch for it in situations
such as term structure modeling where a single factor dz is used, but many tradables (bonds of diﬀerent
maturities) exist, or when certain models are ﬁt to volatility smiles and smirks.) Again, you might want to
return to this discussion after reading subsequent chapters.
5.6 Three Step Procedure
To summarize, let’s present a three step procedure to deriving a derivative pricing equation. The three steps
are:
1. Identify the tradable assets, underlying variables, and factors in a model. (See Figure 5.1.)
2. Write factor models (SDEs) for each tradable asset (this may involve Ito’s lemma), and construct a
tradables table. (See Figure 5.2.)
3. Apply the Price APT equation, ﬁrst to the marketed tradables in order to solve for the market prices
of risk (calibration), and then to the derivative using those market prices of risk (this usually results
in a partial diﬀerential equation for the price). (See Figure 5.3.)
48 CHAPTER 5. CONSTRUCTING A FACTOR PRICING FRAMEWORK
In this book, I will consider the limited scope of just deriving partial diﬀerential equations for pricing, and I
will sometimes leave out the important step of actually solving the pde!! Furthermore, often many diﬀerent
derivative securities satisfy the same basic pde and only diﬀer due to their boundary condition. I will also
often sweep that under the rug, and merely note that a particular derivative security will correspond to the
solution of the pde with a particular boundary condition.
5.7 Summary
In this chapter, we have provided a three step procedure for deriving the pricing equation for a derivative
security based on the linear factor approach. This cookie cutter approach will allow us to derive many of
the partial diﬀerential (or diﬀerence/integral/etc.) equations that arise in derivative pricing theory. In the
next chapter we tackle examples in equity derivatives, and in the following chapter we address interest rate
derivatives. What you should take away is that simple arbitrage ideas that we derived in this chapter are
the underpinnings of derivative pricing theory.
5.8 Problems
Problem 5.8.1 (A Stock Paying Continuous Dividends)
Consider a stock S(t) following
dS = µSdt +σSdz (5.19)
that pays a continuous dividend at a rate of q. Also assume that a bond
dB = r
0
Bdt (5.20)
exists. If c(S, t) is a European call option on the stock, then identify the factors, underlying variables, and
create a tradables table.
Problem 5.8.2 (A Single Factor Short Rate Model)
Let r
0
(t) denote the short rate of interest, and assume that it follows
dr
0
= adt +bdz (5.21)
Assume that zero coupon bonds of maturity T are tradables and the price at time t is denoted by B(r, tT).
Let these bonds be a function of the short rate and time t. Furthermore, assume that a money market account
exists that satisﬁes
dB
0
= r
0
(t)dB
0
dt (5.22)
In this model, then identify the factors, underlying variables, and create a tradables table.
Chapter 6
Application of the Factor Form:
Equity Derivatives
The factor approach to absence of arbitrage pricing is one of the quickest and most direct routes to deriving
pdes for derivatives. In this chapter we will see that it can be used to derive almost every BlackScholes type
pde that occurs in derivative pricing.
6.1 Examples from Equity Derivatives
6.1.1 BlackScholes
Black and Scholes started everything with this model [2]. The standard BlackScholes setup involves a bond
earning a risk free rate and a nondividend paying stock that follows a GBM:
dB = r
0
Bdt (6.1)
dS = µSdt +σSdz. (6.2)
Furthermore, we consider a derivative on the stock. Generically we call this derivative c and assume that
it’s price process depends on the stock and time c(S, t) and is twice continuously diﬀerentiable in both its
arguments. By Ito’s lemma,
dc = (c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
)dt +σSc
S
dz. (6.3)
Therefore, the tradable assets are the stock S, bond B, and the derivative c. We can now move to Step 2
and construct a tradable table
B
S
c
¸
¸
d
B
S
c
¸
¸
=
r
0
B
µS
c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
¸
¸
dt +
0
σS
σSc
S
¸
¸
dz. (6.4)
The tradable table contains all the information we need to apply the Price APT equation. Hence, this allows
us to move on to Step 3, which is to solve the Price APT equation Pλ
0
+Bλ
1
= A for absense of arbitrage
conditions.
B
S
c
¸
¸
λ
0
+
0
σS
σSc
S
¸
¸
λ
1
=
r
0
B
µS
c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
¸
¸
. (6.5)
Solving for λ
0
and λ
1
using the ﬁrst two equations gives:
λ
0
= r
0
λ
1
=
µ −r
0
σ
. (6.6)
49
50 CHAPTER 6. APPLICATION OF THE FACTOR FORM: EQUITY DERIVATIVES
Plugging λ
0
and λ
1
into the last equation yields
r
0
c +
µ −r
0
σ
σSc
S
= c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
. (6.7)
Finally, rearranging leads to the BlackScholes equation:
c
t
+r
0
Sc
S
+
1
2
σ
2
S
2
c
SS
= r
0
c. (6.8)
European Call and Put Options
For a European call option, the boundary condition is that at expiration T, the option is worth c(S, T) =
max(S − K, 0) where K is the strike price. We also have the boundary conditions that c(0, t) = 0 for all t
and lim
S→∞
c(S, t) = ∞.
c(S, T) = max(S −K, 0) (6.9)
c(0, t) = 0 t ∈ [0, T] (6.10)
c(∞, t) = ∞ t ∈ [0, T] (6.11)
The solution is
d
1
=
ln(S/K) + (r
0
+
1
2
σ
2
)(T −t)
σ
√
T −t
(6.12)
d
2
= d
1
−σ
√
T −t (6.13)
c(S, t) = SN(d
1
) −Ke
−r0(T−t)
N(d
2
) (6.14)
A European put option is a derivative security p(S, t) with boundary conditions
p(S, T) = max(K −S, 0) (6.15)
p(0, t) = 0 Ke
−r0(T−t)
t ∈ [0, T] (6.16)
p(∞, t) = 0 t ∈ [0, T] (6.17)
The solution to the BlackScholes equation under these boundary conditions is
d
1
=
ln(S/K) + (r
0
+
1
2
σ
2
)(T −t)
σ
√
T −t
(6.18)
d
2
= d
1
−σ
√
T −t (6.19)
p(S, t) = Ke
−r0(T−t)
N(−d
2
) −SN(−d
1
) (6.20)
6.1.2 Dividend Paying Stocks
Let’s assume that the stock is paying a continuous dividend at a rate of q. Then the stock and its dividend
stream is a tradable asset. That is, when we purchase the stock, we are also purchasing its dividend stream.
Hence we must consider them together as a tradable asset. Let’s assume the stock price follows
dS = µSdt +σSdz. (6.21)
What we purchase is a single share of this stock. Let N(t) denote the number of shares held of this stock at
time t. The assumption of a continuous dividend at a rate of q is equivalent to assuming that the number of
shares N(t) grows according to the equation
dN = qNdt. (6.22)
6.1. EXAMPLES FROM EQUITY DERIVATIVES 51
Then over time dt, the portfolio with value N(t)S(t) changes to
N(t)S(t) → (N(t) +dN)(S(t) +dS) (6.23)
= N(t)S(t) +dNS(t) +N(t)dS +dSdN (6.24)
= N(t)S(t) +qN(t)S(t)dt +µN(t)S(t)dt +σN(t)S(t)dz +o(dt). (6.25)
If we denote the value of the share with its dividend stream by v(t) = N(t)S(t) then we have
dv = v(t +dt) −v(t) = (µ +q)v(t)dt +σv(t)dz. (6.26)
Hence, we consider v(t) the tradable.
Finally, we consider the option. Now, the payoﬀ of the option depends on the price of the stock alone,
and does not depend on the dividend stream. Therefore, we assume c = c(S, t) and apply Ito’s lemma to c
to obtain
dc = (c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
)dt +σSc
S
dz. (6.27)
The next step is to write the tradable table
B
v
c
¸
¸
d
B
v
c
¸
¸
=
r
0
B
(µ +q)v
c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
¸
¸
dt +
0
σv
σSc
S
¸
¸
dz. (6.28)
Note that the stock price alone S(t) does not appear in the tradable table since it is not tradable. Only
tradable quantities appear in the tradables table, and only those quantities need to satisfy the absence of
arbitrage conditions.
The ﬁnal step is to solve the Price APT equation Pλ
0
+Bλ
1
= A,
B
v
c
¸
¸
λ
0
+
0
σv
σSc
S
¸
¸
λ
1
=
r
0
B
µv
c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
¸
¸
. (6.29)
Solving for λ
0
and λ
1
gives:
λ
0
= r
0
λ
1
=
µ +q −r
0
σ
, (6.30)
and plugging λ
0
and λ
1
into the last equation gives
r
0
c +
µ +q −r
0
σ
σSc
S
= c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
. (6.31)
Finally, rearranging leads to the BlackScholes equation:
c
t
+ (r
0
−q)Sc
S
+
1
2
σ
2
S
2
c
SS
= r
0
c. (6.32)
What does this apply to?
At ﬁrst thought, a continuous model for a dividend does not seem too realistic. However, a moment of
thought more, we realize that it is a decent approximation for a number of ﬁnancial situations. For instance,
a stock index with many stock that pay dividends at diﬀerent times can be approximated as a continuous
dividend. So can a commodity with a convenience yield. Moreover, foreign currencies that are invested in a
money market account are essentially a security that earns a continuous dividend. So, the point is, don’t let
the notation of calling S a ”stock” limit your thinking about how these models can be applied.
52 CHAPTER 6. APPLICATION OF THE FACTOR FORM: EQUITY DERIVATIVES
European Call Option
Once again, we can consider the value of a European call option, but this time on a stock that pays a
continuous dividend. The solution is
d
1
=
ln(S/K) + (r
0
−q +
1
2
σ
2
)(T −t)
σ
√
T −t
(6.33)
d
2
= d
1
−σ
√
T −t (6.34)
c(S, t) = Se
−q(T−t)
N(d
1
) −Ke
−r0(T−t)
N(d
2
) (6.35)
Note that this is very similar to the formula for a European call option on a nondividend paying stock, except
that q shows up in a couple of places. This is too be expected, since (6.42) diﬀers from the nondividend
BlackScholes equation only slightly.
6.1.3 Cash Dividends
Most individual stocks pay a prespeciﬁed dividend at a prespeciﬁed time. We often call this either a cash or
lump dividend. To compute the equation satisﬁed by an option on a stock that pays a cash dividend prior
to expiration, we can once again use our three step procedure.
Of course, the bond is a tradable. Now, we can purchase a share of the stock, and with this we will get
the stock price plus the dividend. This entire stream together it what is tradable.
To model this, we consider that the value corresponding to a single share is continuous and follows a
geometric Brownian motion. Call this value v(t), where it satisﬁes
dv = µvdt +σvdz. (6.36)
Now, we assume that the stock pays a dividend at the time τ in the amount of D. Therefore, we model the
stock price as dropping by D exactly at the time τ. To model this drop, we will use dirac delta notation
and write it as
dS = (µS −Dδ(t −τ))dt +σdz (6.37)
where δ(t − τ) is a dirac delta function at time τ. This notation is just to indicate that the price drops by
exactly D at time τ.
Now, we assume that c(S, t) is a derivative security, and we apply Ito’s lemma to obtain
dc = (c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
+ (c(S
−
−D, t) −c(S
−
, t))δ(t −τ))dt +σSc
S
dz (6.38)
I have not described how to apply Ito’s lemma in this situation before. However, you may recognize that
this situation is very similar to a process driven by a Poisson process, since they have discrete jumps. In this
case, it is even simpler, since we know exactly when the jump will occur. The above equation just indicates
that we use Ito’s lemma for Brownian motion up to, and after the jump time τ. However, at time τ the
jump occurs, which causes c to drop by exactly D. Once again, the delta function δ(t −τ) is really just used
to indicate that a jump occurs at time τ.
Therefore, we have identiﬁed the tradables B, v, and c, and now we can write the tradables table.
B
v
c
¸
¸
d
B
v
c
¸
¸
=
r
0
B
µv
c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
+ (c(S
−
−D, t) −c(S
−
, t))δ(t −τ)
¸
¸
dt +
0
σv
σSc
S
¸
¸
dz
The ﬁnal step is to solve the Price APT equation
B
v
c
¸
¸
λ
0
+
0
σv
σSc
S
¸
¸
λ
1
=
r
0
B
µv
c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
+ (c(S
−
−D, t) −c(S
−
, t))δ(t −τ)
¸
¸
(6.39)
6.1. EXAMPLES FROM EQUITY DERIVATIVES 53
Solving for λ
0
and λ
1
gives:
λ
0
= r
0
λ
1
=
µ −r
0
σ
(6.40)
Plugging λ
0
and λ
1
into the last equation yields
r
0
c +
µ −r
0
σ
σSc
S
= c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
+ (c(S
−
−D, t) −c(S
−
, t))δ(t −τ) (6.41)
Finally, rearranging leads to the BlackScholes equation:
c
t
+r
0
Sc
S
+
1
2
σ
2
S
2
c
SS
+ (c(S
−
−D, t) −c(S
−
, t))δ(t −τ) = r
0
c. (6.42)
6.1.4 Poisson Processes
This was done by Cox and Ross [5]. This setup involves a bond earning a risk free rate and a nondividend
paying stock that follows a geometric Poisson motion (GPM):
dB = r
0
Bdt (6.43)
dS = µS
−
dt + (k −1)S
−
dπ(ν) (6.44)
where ν is the intensity of the Poisson process π(t; ν). Furthermore, we consider a derivative on the stock.
Generically we call this derivative c and assume that its price process depends on the stock and time c(S, t)
and is continuously diﬀerentiable in both its arguments. By Ito’s lemma,
dc = (c
t
+µSc
S
)dt + (c(kS
−
) −c(S
−
))dπ(ν). (6.45)
This is a case where the random factor does not have zero mean. I would like to write my factor equations
such that the factors are pure risk and don’t have any mean drift. Hence, I will compensate the factor in
order to give it zero drift. In this case, my tradables are
dB = r
0
Bdt (6.46)
dS = (µS +ν(k −1)S)dt + (k −1)S(dπ(ν) −νdt) (6.47)
dc = (c
t
+µSc
S
+ν(c(kS
−
) −c(S
−
)))dt + (c(kS
−
) −c(S
−
))(dπ(ν) −νdt) (6.48)
We can now construct a tradables table
B
S
c
¸
¸
d
B
S
c
¸
¸
=
r
0
B
µS +ν(k −1)S
c
t
+µSc
S
+ν(c(kS
−
) −c(S
−
))
¸
¸
dt +
0
(k −1)S
c(kS
−
) −c(S
−
)
¸
¸
(dπ(ν) −νdt)
This allows us to move on to Step 3, which is to solve the Price APT equation Pλ
0
+Bλ
1
= A for absence
of arbitrage
B
S
c
¸
¸
λ
0
+
0
(k −1)S
c(kS
−
) −c(S
−
)
¸
¸
λ
1
=
r
0
B
µS +ν(k −1)S
c
t
+µSc
S
+ν(c(kS
−
) −c(S
−
))
¸
¸
.
Solving for λ
0
and λ
1
gives:
λ
0
= r
0
λ
1
=
µ −r
0
k −1
+ν. (6.49)
Plugging λ
0
and λ
1
into the last equation yields
r
0
c +
µ −r
0
k −1
+ν
(c(kS
−
) −c(S
−
)) = c
t
+µSc
S
+ν(c(kS
−
) −c(S
−
)) (6.50)
Finally, rearranging leads to the equation:
(µ −r
0
) (c(kS
−
) −c(S
−
)) = (k −1) (c
t
+µSc
S
−r
0
c) . (6.51)
54 CHAPTER 6. APPLICATION OF THE FACTOR FORM: EQUITY DERIVATIVES
European Call Option
One can actually ﬁnd a closed form solution for European call and put options in this model. I will give the
solution for a call option with strike K and expiration T:
c(S, t) = SΨ(x, y) −Ke
−r0(T−t)
Ψ(x, y/k) (6.52)
where
Ψ(α, β) =
∞
¸
i=α
e
−β
β
i
i!
, y =
(r
0
−µ)(T −t)k
k −1
(6.53)
and x is the smallest nonnegative integer greater than
ln(K/S)−µ(T−t)
ln(k)
.
Note that this solution has a structure that is very similar to the BlackScholes fomula. In fact, one can
interpret it as the BlackScholes formula, but with a Poisson random variable replacing the Gaussian random
variable.
6.1.5 Options on Futures
This was essentially done by Black in [1]. Let’s derive the pde satisﬁed by an option on a futures contract.
Assume there exists a bond and a futures contract with futures price f given as
dB = r
0
Bdt (6.54)
df = µfdt +σfdz. (6.55)
Furthermore, we consider a derivative on the futures price. Generically we call this derivative c and assume
that it’s price process depends on the futures price and time c(f, t) and is twice continuously diﬀerentiable
in both its arguments. By Ito’s lemma,
dc = (c
t
+µfc
f
+
1
2
σ
2
f
2
c
ff
)dt +σfc
f
dz. (6.56)
We can now complete Step 2 and construct a tradables table
B
0
c
¸
¸
d
B
f
c
¸
¸
=
r
0
B
µf
c
t
+µfc
f
+
1
2
σ
2
f
2
c
ff
¸
¸
dt +
0
σf
σfc
f
¸
¸
dz. (6.57)
Recall that the futures contract was the special case that motivated us to consider the price approach
to absence of arbitrage, rather than working with returns. The key point was that the marktomarket
mechanism always sets the price of a futures contract to zero while the change in value of the contract over
period dt is given by the change in the futures price df. This corresponds to the tradable table we have
written above.
This allows us to move on to Step 3, which is to solve the Price APT equation Pλ
0
+Bλ
1
= A for absense
of arbitrage.
B
0
c
¸
¸
λ
0
+
0
σf
σfc
f
¸
¸
λ
1
=
r
0
B
µf
c
t
+µfc
f
+
1
2
σ
2
f
2
c
ff
¸
¸
(6.58)
Solving for λ
0
and λ
1
gives:
λ
0
= r
0
λ
1
=
µ
σ
. (6.59)
Plugging λ
0
and λ
1
into the last equation yields
r
0
c +
µ
σ
σfc
f
= c
t
+µfc
f
+
1
2
σ
2
f
2
c
ff
. (6.60)
Finally, rearranging leads to the partial diﬀerential equation:
c
t
+
1
2
σ
2
f
2
c
ff
= r
0
c. (6.61)
6.1. EXAMPLES FROM EQUITY DERIVATIVES 55
European Call Option
Note that the BlackScholes equation in this case (6.61), looks just like the BlackScholes equation on a stock
paying a continuous dividend, however the continuous dividend rate is q = r. Therefore, we can just plug
into that formula to obtain the price of a European call option. The solution is
d
1
=
ln(f/K) + (
1
2
σ
2
)(T −t)
σ
√
T −t
(6.62)
d
2
= d
1
−σ
√
T −t (6.63)
c(f, t) = fe
−r0(T−t)
N(d
1
) −Ke
−r0(T−t)
N(d
2
). (6.64)
6.1.6 Jump diﬀusion
A jump diﬀusion model was ﬁrst solved by Merton [11]. This model is nice because it is related to many
other models in equity and interest rate derivatives. The model includes a risk free bond and an underlying
asset that has a diﬀusion portion and a lognormal jump portion. This model has a closed form solution for
the derivative price which Merton computed in his original paper [11].
We will also ﬁnd that this model creates a problem for our factor approach. Merton originally solved this
problem using a similar technique, and we will bypass the problem in the same manner that Merton did.
Let’s get started. Here are the basic assets
dB = r
0
Bdt (6.65)
dS = (µ +αE[Y −1])Sdt +σSdz +S
−
((Y −1)dπ(α) −αE[Y −1]dt) (6.66)
where the jump portion of the stock has been compensated. A derivative on the stock c(S, t) is a function
of S and t. By Ito’s lemma we have
dc = Lcdt +σSc
S
dz +
(c(Y S
−
) −c(S
−
))dπ −αE[(c(Y S
−
) −c(S
−
))]dt
(6.67)
where
Lc = c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
+αE[(c(Y S
−
) −c(S
−
))] (6.68)
Now we come to the point that we need to write a tradables table. This is where we run into a problem.
The problem is how to identify factors. The randomness associated with the jump term does not enter in
the same way to S and c. We would like to be able to write down a linear factor model in the factors dz
and Y dπ or a compensated version of of Y dπ. However, in this case that is not possible unless we consider
c(Y S
−
) − c(S
−
)dπ a new factor. However, since this risk is driven by the same Poisson process and jump
size Y , we would rather not do this. Yet, at this point we have no choice.
So, let’s proceed by considering ((c(Y S
−
) −c(S
−
))dπ −αE[(c(Y S
−
) −c(S
−
))]dt) and (Y − 1)dπ(α) −
αE[Y −1]dt as two diﬀerent factors. For notational convenience, let’s call them
dψ
1
= (Y −1)dπ(α) −αE[Y −1]dt (6.69)
dψ
2
=
(c(Y S
−
) −c(S
−
))dπ −αE[(c(Y S
−
) −c(S
−
))]dt
(6.70)
Then our tradables table is
B
S
c
¸
¸
d
B
S
c
¸
¸
=
r
0
B
(µ +αE[Y −1])S
Lc
¸
¸
dt +
0 0 0
σS S 0
σSc
S
0 1
¸
¸
dz
dψ
1
dψ
2
¸
¸
(6.71)
Finally, we would solve the Price APT equations
B
S
c
¸
¸
λ
0
+
0 0 0
σS S 0
σSc
S
0 1
¸
¸
λ
1
λ
2
λ
3
¸
¸
=
r
0
B
(µ +αE[Y −1])S
Lc
¸
¸
(6.72)
56 CHAPTER 6. APPLICATION OF THE FACTOR FORM: EQUITY DERIVATIVES
Now, we can write out the full pde which will have a number of unknown market prices of risk. This is
fairly messy and leaves a lot of degrees of freedom. Merton doesn’t do this. Instead he make the assumption
that all jump risk is diversiﬁable. That is, the market price of risk of any risk associated with the jump term
is zero! This is the same as setting λ
2
= λ
3
= 0. This is a strong assumption! But let’s follow Merton and
see where this leads us.
With λ
2
= λ
3
= 0, the Price APT equations Pλ
0
+Bλ = A become
B
S
c
¸
¸
λ
0
+
0
σS
σSc
S
¸
¸
λ
1
=
r
0
B
(µ +αE[Y −1])S
Lc
¸
¸
. (6.73)
Now we can solve for the market prices of risk as,
λ
0
= r
0
, λ
1
=
µ +αE[Y −1]
σ
, (6.74)
and the ﬁnal equation becomes
r
0
c +
µ +αE[Y −1] −r
0
σ
σSc
S
= Lc (6.75)
which can be rewritten as
c
t
+ (r
0
−αE[Y −1])Sc
S
+
1
2
σ
2
S
2
c
SS
= r
0
c −αE[(c(Y S
−
) −c(S
−
))]. (6.76)
Bankruptcy
Let’s consider a special case of complete bankruptcy. That is, Y = 0. Then the above equation becomes
c
t
+ (r
0
+α)Sc
S
+
1
2
σ
2
S
2
c
SS
= (r
0
+α)c. (6.77)
This looks like standard BlackScholes but the interest rate has been increased by the default probability!
We will see that this same relationship will also appear in defaultable bonds. In fact, this model is really
the prototype for defaultable bonds. Of course, we can give a closed form solution in this case based on the
BlackScholes formula.
Note the following counterintuitive observation. In the BlackScholes formula, the value of a European
call option increases with the risk free rate. This means that according to our model above, if the rate
of bankruptcy increases, then the value of call option will actually increase! You should think about this
carefully to understand why that is true...
Lognormal Jumps!
When Y the jump size is lognormal, then conditional on the number of jumps that have occurred before
expiration, stock distribution at expiration is lognormal, and there is a closed form solution to the pde for
European call and put options. For a European call option with strike K and expiration T, it is given by
c(S, t) =
∞
¸
n=0
¸
e
−λ
′
(T−t)
(λ
′
(T −t))
n
n!
¸
c
BS
S, T −t, K, σ
2
+
nδ
2
T −t
, r
0
−λk +
nγ
T −t
(6.78)
where c
BS
(S, T, K, σ, r
0
) is the BlackScholes formula for a European call option with strike K and expiration
T on a nondividend paying stock with current price S, volatility σ, and with risk free rate r
0
. We also have
that k = E[Y −1], λ
′
= λ(1 +k), and γ = ln(1 +k). This formula can be found in Merton’s work [11].
Note that it is basically a combination of the BlackScholes formula and the solution under Poisson
dynamics. The key is that under this jump diﬀusion model, conditional on the number of jumps, the price
6.1. EXAMPLES FROM EQUITY DERIVATIVES 57
distribution at expiration is lognormal, indicating that the solution should look like BlackScholes. The
only question is how many jumps have occurred. Therefore, the solution is basically that conditioned on
the number of jumps that have occurred, the answer is BlackScholes. Hence, for each possible number of
jumps, we have a BlackScholes formula, then we need to weight them by the probability of that number of
jumps, which is Poisson. Simple!
6.1.7 Exchange one asset for another
Margrabe’s [10] exchange one asset for another is one of my favorite derivatives. Why? because many other
derivatives can be thought of as exchanging two diﬀerent assets. For instance, can a vanilla call option be
thought of as exchanging one asset for another? What are the two assets being exchanged?
Anyway, in this case, tradable assets will be called the bond B and two other assets S
1
and S
2
. Their
dynamics are given by:
dB = r
0
Bdt (6.79)
dS
1
= µ
1
S
1
dt +σ
1
S
1
dz
1
(6.80)
dS
2
= µ
2
S
2
dt +σ
2
S
2
dz
2
(6.81)
dc = L
1
cdt + +σS
1
c
S1
dz
1
+σS
2
c
S2
dz
2
(6.82)
where
L
1
c = (c
t
+µ
1
S
1
c
S1
+µ
2
S
2
c
S2
+
1
2
σ
2
1
S
2
1
c
S1S1
+
1
2
σ
2
2
S
2
2
c
S2S2
+ρσ
1
σ
2
S
1
S
2
c
S1S2
) (6.83)
and ρ is the correlation coeﬃcient between z
1
and z
2
. Thus, the tradable table is
B
S
1
S
2
c
¸
¸
¸
¸
d
B
S
1
S
2
c
¸
¸
¸
¸
=
r
0
B
µ
1
S
1
µ
2
S
2
Lc
¸
¸
¸
¸
dt +
0 0
σ
1
S
1
0
0 σ
2
S
2
σ
1
S
1
c
S1
σ
2
S
2
c
S2
¸
¸
¸
¸
¸
dz
1
dz
2
. (6.84)
Finally, we solve the Price APT equations
B
S
1
S
2
c
¸
¸
¸
¸
λ
0
+
0 0
σ
1
S
1
0
0 σ
2
S
2
σ
1
S
1
c
S1
σ
2
S
2
c
S2
¸
¸
¸
¸
¸
λ
1
λ
2
=
r
0
B
µ
1
S
1
µ
2
S
2
Lc
¸
¸
¸
¸
. (6.85)
In this case we have four equations and only three unknowns. Let’s use the ﬁrst three equations to solve
for λ
0
, λ
1
and λ
2
as
λ
0
= r
0
λ
1
=
µ
1
−r
0
σ
1
λ
2
=
µ
2
−r
0
σ
2
. (6.86)
If we plug these into the ﬁnal equation, we obtain
Lc = r
0
c +
µ
1
−r
0
σ
1
σ
1
S
1
c
S1
+
µ
1
−r
0
σ
1
σ
1
S
1
c
S1
(6.87)
which upon rearrangement becomes
c
t
+r
0
S
1
c
s1
+r
0
S
2
c
s2
+
1
2
σ
2
1
S
2
1
c
s1s1
+
1
2
σ
2
2
S
2
2
c
s2s2
+ρσ
1
σ
2
S
1
S
2
c
s1s2
= r
0
c. (6.88)
58 CHAPTER 6. APPLICATION OF THE FACTOR FORM: EQUITY DERIVATIVES
Closed form solution
There is a closed form solution for a European exchange one asset for another. Assume that you have the
option to exchange asset S
1
for asset S
2
at time T. Hence, the boundary condition is
c(S
1
, S
2
, T) = max(S
2
−S
1
, 0), c(0, S
2
, t) = S
2
, c(S
1
, 0, t) = 0. (6.89)
The closed form solution for this option is
c(S
1
, S
2
, t) = S
2
N(d
1
) −S
1
N(d
2
) (6.90)
where
d
1
=
ln(S
2
/S
1
) +
1
2
σ
2
(T −t)
σ
√
T −t
(6.91)
d
2
= d
1
−σ
√
T −t (6.92)
σ =
σ
2
1
+σ
2
2
−2ρσ
1
σ
2
. (6.93)
Note that this solution looks very much like BlackScholes. Here is some intuition into why. Let’s look at
the payoﬀ
c(S
1
, S
2
, T) = max(S
2
−S
1
, 0) = S
1
max(
S
2
S
1
−1, 0).
Now, the portion max(
S2
S1
−1, 0) looks like the payoﬀ of a call option with strike 1 on the asset
S2
S1
! In fact,
dividing S
2
by S
1
is essentially changing units in order to value S
2
in units of S
1
. Therefore, in units of S
1
,
this is just a call option on S
2
with strike 1! The multiplication on the left by S
1
just converts the call option
back to units of dollars. Therefore, we shouldn’t be surprised that the solution looks like BlackScholes,
because this is really just a call option in a diﬀerent set of units.
When we change units by denominating everything in terms of another asset (such as S
1
in this case),
we call that a change of numeraire, and we call the asset S
1
the numeraire asset. Needless to say, some
problems (such as this one) are easier to solve in a convenient set of units.
6.1.8 Stochastic volatility
First, we can list the relevant equations
dB = r
0
Bdt (6.94)
dS = µSdt +
√
vSdz
1
(6.95)
dv = adt +bdz
2
(6.96)
and we assume that z
1
and z
2
are correlated E[dz
1
dz
2
] = ρdt.
Our derivative can depend on S, t, and v. Hence, we write c(S, v, t). (What if I didn’t assume the
derivative was a function of v? This would have been a bad assumption since I know in the BlackScholes
case that c is a function of the volatility. Would I end up getting the wrong answer?)
By Ito’s lemma we have:
dc = Lcdt +
√
vSc
S
dz
1
+bc
v
dz
2
(6.97)
where
Lc = (c
t
+µSc
S
+ac
v
+
1
2
vS
2
c
SS
+
1
2
b
2
c
vv
+ρb
√
vSc
Sv
) (6.98)
In this case the tradables are
dB = r
0
Bdt (6.99)
dS = µSdt +
√
vSdz
1
(6.100)
dc = Lcdt +
√
vSc
S
dz
1
+bc
v
dz
2
(6.101)
6.2. PROBLEMS 59
So the tradable table is
B
S
c
¸
¸
d
B
S
c
¸
¸
=
r
0
B
µS
Lc
¸
¸
dt +
0 0
√
vS 0
√
vSc
S
bc
v
¸
¸
¸
dz
1
dz
2
. (6.102)
Finally, we solve the Price APT equations
B
S
c
¸
¸
λ
0
+
0 0
√
vS 0
√
vSc
S
bc
v
¸
¸
¸
λ
1
λ
2
=
r
0
B
µS
Lc
¸
¸
. (6.103)
Solving for the market prices of risks λ
0
and λ
1
gives
λ
0
= r
0
λ
1
=
µ −r
0
√
v
. (6.104)
Note that we leave λ
2
as an unknown. Plugging these into the ﬁnal equations leads to
c
t
+r
0
Sc
S
+ (a −λ
2
b)c
v
+
1
2
vS
2
c
SS
+
1
2
b
2
c
vv
+ρb
√
vSc
Sv
= r
0
c (6.105)
Under speciﬁc choices of a and b, for a European call option, this pde has a fairly convenient solution via
transform methods. I won’t cover this in detail here. See Heston [9] if interested.
6.2 Problems
Problem 6.2.1 Show that the BlackScholes formula (6.14) is a special case of exchange one asset for
another where one asset is the stock S(t) and the other is the bond B(t). In particular, show that the
exchange one asset for another formula (6.90) reduces to the BlackScholes formula (6.14) in this case.
Problem 6.2.2 (The power of linearity (Breeden and Litzenberger)) We will use linearity to derive the
formula for a European digital option in terms of a European call option price. A digital option is an option
that pays oﬀ 1 if the underlying stock ends up above the strike price K, and pays nothing if it ends up below
the strike price.
(a) Let T be the expiration date for the digital option. Draw a picture of the payoﬀ as a function of S
T
.
(b) Assume that you know the price of European call options for every strike K for the expiration date
T (i.e. c(S, K, t)). Using only this information, derive the price of a digital option with strike K in
terms of c(S, K, t). (Hint: think of how you can construct the payoﬀ of a digital option by buying and
selling call options.)
(c) Taking the above argument one step further, derive the ”price” of a security whose payoﬀ is a dirac
delta function at K at expiration T.
(d) Use the result of (c) to derive a general pricing formula for an arbitrary payoﬀ at time T. Note that if
you have made it this far, you have ”derived” the risk neutral pricing formula in terms of c(S, K, t).
(If you plug in the BlackScholes formula for c(S, K, t) then you have the standard risk neutral pricing
formula when the underlying asset follows a geometric Brownian motion.)
(e) As one ﬁnal task, price a security whose payoﬀ is (S
T
−K)
2
for S
T
≥ K and 0 for S
T
< K.
(Here is a big hint if you are having trouble with this problem. We can think of the payoﬀ of a European
call option as a ramp function. A digital option looks like a step function, and ﬁnally we price a dirac delta
function. What is the relationship between a ramp, step and delta function?)
60 CHAPTER 6. APPLICATION OF THE FACTOR FORM: EQUITY DERIVATIVES
Problem 6.2.3 All about PutCall parity and Early Exercise.
(a) Derive putcall parity for European call and put options on a nondividend paying stock.
(b) Early exercise is not optimal for a standard American call option on a nondividend paying stock.
Derive this.
(c) Assume the payoﬀ of a derivative security on a nondividend paying underlying stock is a convex
function that is nonpositive when the stock is zero. Prove that if this is an American option, it is
never optimal to exercise early.
(d) Consider the European option to exchange one asset, S
2
, for another S
1
(assume they are nondividend
paying) at expiration T. That is, if you currently hold S
2
, this option would give you the right to
exchange it for S
1
at time T. This is a derivative security with payoﬀ c(S
1
, S
2
, T) = max[S
1
−S
2
, 0].
Derive a parity relating S
1
, S
2
and the options to exchange S
1
for S
2
and viceversa.
(e) (Margrabe) Consider an American option to exchange one asset for another (nondividend paying
again). Is it ever optimal to exercise this early?
(f ) Merton showed that it can be optimal to exercise a put option early, even on a nondividend paying
stock. However, we can think of a European put option as exchanging a stock for a bond, in which case,
the American counterpart would never be exercised early according to Margrabe. Is there a contradiction
here?
Problem 6.2.4 (Exchange one asset for another) In this problem you will derive the formula for an option
to exhange one asset for another by reducing the pde to a standard BlackScholes pde for a call option on
geometric Brownian motion.
(a) Let S
1
and S
2
be two assets whose dynamics are given by
dS
1
= µ
1
S
1
dt +σ
1
S
1
dz
1
(6.106)
dS
2
= µ
2
S
2
dt +σ
2
S
2
dz
2
(6.107)
where E[dz
1
dz
2
] = ρdt and assume that a risk free asset exists with constant interest rate r. Write down
the pde for an option to exchange S
2
for S
1
at expiration T. (Hence, the payoﬀ is max(S
1
−S
2
, 0).)
(b) Consider the change of variable v = S
1
/S
2
and assume that the solution of the above pde is of the form
c(S
1
, S
2
, t) = S
2
f(v, t). Using these substitutions, write the pde from (a) in terms of S
2
, t, f, and v.
What equation must f(v, t) satisfy? What is the appropriate boundary condition for a European option
to exchange S
2
for S
1
in terms of the new variables.
(c) Using the BlackScholes formula, write down a formula for the value of an option to exchange S
1
for
S
2
.
Problem 6.2.5 Consider a European call option on a nondividend paying stock with stochastic volatility.
But, this time we model the instantaneous variance with a Poisson process. The idea is that most of the time
the instantaneous variance is σ
2
l
, but at random times the market goes wild and the instantaneous variance
jumps by an amount b. After the jump, the instantaneous variance exponentially decays back to its normal
level σ
l
. First write down the relevant dynamics for this problem, then derive a pde for the price of the
option. Again, this pde can be in terms of a market price of risk.
(Hint: your dynamics should look like:
dS = µSdt +
√
vSdz
dv = a(σ
2
l
−v)dt +bdπ
where π is a Poisson process with intensity α.)
6.2. PROBLEMS 61
Problem 6.2.6 (Matlab Exercise) This exercise will introduce you to the implied volatility curve. If c
m
is
the market value of a call option with strike K, expiration T, and if c
BS
(S, K, T, r
0
, σ) (assuming the stock is
nondividend paying) is the BlackScholes formula for a European call option. Then the value of the volatility
parameter σ
impl
that satisﬁes
c
m
= c
BS
(S, K, T, r
0
, σ
impl
) (6.108)
is known as the implied volatility. In this exercise, you will generate prices under Merton’s jumpdiﬀusion
model, and compute implied volatility curves.
a. Use an initial stock price of S(0) = 1, expiration of T = 0.3, and a risk free rate of r = 0.05. For
a range of strike prices from K = [0.8, 1.2], ﬁrst compute the BlackScholes price of options using
σ = 0.3, then assume that they represent market prices and plot the resulting implied volatility curve
as a function of K.
b. This time, use Merton’s jumpdiﬀusion model to to generate the market prices. Use the same parameter
values as in part (a) (i.e. S(0) = 1, T = 0.3, r = 0.05, and σ = 0.3), and also jump intensity of λ = 2,
jump mean of ν
J
= 0 and jump standard deviation of σ
J
= 0.1. Once again plot the resulting implied
volatility curve. You should see a slight implied volatility ”smile”!
Problem 6.2.7 (American Options) Use the results of Chapter 4, Problem 3 to derive the pde conditions
followed by an American call or put option where the underlying stock (nondividend paying) and bond follow
dS = µSdt +σSdz
dB = r
0
Bdt.
62 CHAPTER 6. APPLICATION OF THE FACTOR FORM: EQUITY DERIVATIVES
Chapter 7
Application of the Factor Form:
Interest Rate and Credit Derivatives
In this chapter, we apply the factor approach to interest rate and credit derivatives. Again, the emphasis is
showing that the pdes describing derivative pricing can all be easily derived from the simple factor approach
equations.
Before getting started, let’s make a few comments about interest rate modeling and derivatives. For any
derivative pricing problem, the ﬁrst challenge before being able to price a derivative is the calibration phase
as in Figure 5.3 of Chapter 5. For many equity derivatives, the calibration phase does not play prominently
into the analysis. In fact, in many equity derivative models the market is either complete in which case the
calibration phase of determining λ is simple, or the market is incomplete in which case the APT equations
are underdetermined and one is left with a degree of freedom in choosing λ.
Interest rate and term structure models tend to be the opposite. That is, we may use a single factor, but
have many, many marketed tradables. This makes the APT equations overdetermined! Thus, the calibration
phase is in determining λ values that best ﬁt all the marketed tradables (In theory if there isn’t a perfect
ﬁt, then an arbitrage is available. However in practice we simply recognize that our model is too simple
and try to ﬁnd a bestﬁt λ). Once calibration is done, then pricing a derivative just proceeds by solving the
appropriate pde with boundary conditions using the calibrated λ. Because calibration is so important in
interest rate and term structure models, in what follows, we will often bring the models up to the point of
calibration, but not consider speciﬁc derivatives after that point.
Beyond this, the truth is that some of these models (such as HJM) are better suited for the risk neutral
approach to derivative pricing (A subject that is touched upon in Chapter 9) simply because the pdes that
we obtain are often too large and diﬃcult to solve in any reasonable manner. Nevertheless, it is extremely
instructive to see that they can be understood via the factor approach. Since I have mentioned risk neutral
pricing, I should also mention that ”calibration” in risk neutral pricing often looks diﬀerent from what I
have called the calibration phase (which is just determining the market prices of risk from the marketed
tradables). However, they are really the same thing, just described using a diﬀerent ”language”. Hopefully,
at the end of your studies of derivative pricing you will be able to ”translate” between them.
Okay, let’s get to pricing...
7.1 Notation and the Money Market Account
In dealing with bonds and interest rate derivatives, we will need to establish some notation. Let’s start with
the short rate process. The short rate r
0
(t) is the interest rate earned from time t to time t + dt quoted
using continuous compounding. (Note that the notation is similar to the constant risk free rate r
0
used in
the previous chapter. This is because the short rate plays the same crucial role as the constant risk free rate
in that it deﬁnes the market price of time, λ
0
.)
63
64CHAPTER 7. APPLICATIONOF THE FACTOR FORM:INTEREST RATE ANDCREDIT DERIVATIVES
On a plot of the term structure, the short rate is where is spot rate curve intersects the yaxis as in
Figure 7.1. The short rate process is important because it can be used to drive the entire spot rate curve.
Figure 7.1: The Short Rate Process
Furthermore, the short rate r
0
(t) describes the instantaneous return on the money market account. That
is, we deﬁne the money market account B
0
(t) as the value of an account that is continuously rolled over at
the instantaneous short rate. That is, you invest your money at the rate r
0
(t) over the next dt, then you
reinvest that amount over the next dt at the rate r
0
(t + dt), and continue this. Hence, the money market
account B
0
(t) follows the dynamics
dB
0
(t) = r
0
(t)B
0
(t)dt (7.1)
The money market account often plays a special role in interest rate derivatives because its dynamics are
not explicitly driven by a random factor. Note that this is the case even though r
0
(t) itself can be random
and follow a stochastic diﬀerential equation of its own. Pay careful attention to the role that the money
market account plays in what follows.
7.2 Interest Rate Derivatives
7.2.1 Single Factor Short Rate Models
The simpliest interest rate derivative model has a single underlying variable: the short rate. Furthermore,
this variable is driven by a single factor. Hence, these models are typically called single factor short rate
models. Vasicek [15] was the ﬁrst to recognize that pricing with absence of arbitrage was possible by modeling
the instantaneous short rate. Hence, we will model this short rate variable as
dr
0
= adt +bdz (7.2)
where the time dependence is supressed in r
0
= r
0
(t), and a = a(r
0
, t), b = b(r
0
, t) can be functions of r
0
and
t. When convenient, we will suppress these arguments, so don’t always assume that a variable is a constant
if no arguments are explicitly listed.
The tradables are a money market account, which we denote by B
0
and zero coupon bonds with face
value of $1 of varying maturity that we denote by B(tT) where t is the current time and T is the maturity
date. This is pictured in Figure 7.2. We assume that the zero coupon bonds are functions of the short rate,
B(r
0
, tT). Then suppressing arguments, we can write
dB
0
= r
0
B
0
dt (7.3)
dB(T) = (B
t
(T) +aB
r
(T) +
1
2
b
2
B
rr
(T))dt +bB
r
(T)dz (7.4)
7.2. INTEREST RATE DERIVATIVES 65
Figure 7.2: Notation for Zero Coupon Bond Prices
where Ito’s lemma was used to derive the equation for dB(T), and B
r
, B
rr
, represent the ﬁrst and second
partial derivatives with respect to the short rate r
0
, respectively. In particular, not that for convenience
when using this partial derivative notation we supress the subscript r
0
and write B
r
= B
r0
.
Now let’s write down the tradable table:
¸
B
0
B(T)
d
¸
B
0
B(T)
=
¸
r
0
B
0
(B
t
(T) +aB
r
(T) +
1
2
b
2
B
rr
(T))
dt +
¸
0
bB
r
(T)
dz (7.5)
Finally, we solve the Price APT equations
¸
B
0
B(T)
λ
0
+
¸
0
bB
r
(T)
λ
1
=
¸
r
0
B
0
(B
t
(T) +aB
r
(T) +
1
2
b
2
B
rr
(T))
(7.6)
The ﬁrst equation gives λ
0
= r
0
which is the random short rate. Then the second equation gives
(B
t
(T) + (a −λ
1
b)B
r
(T) +
1
2
b
2
B
rr
(T)) = r
0
B(T) (7.7)
The boundary condition is of course that B(TT) = 1. This equation must hold for zero coupon bonds of
any maturity. Note that it is in terms of a market price of risk. In fact, since in general we have many bonds,
any one of them should allow us to solve for λ
1
. In practice, diﬀerent bonds will often give diﬀerent values
of λ
1
, indicating that the model is not exactly correct.
When one tries to determine a single best ﬁt λ
1
from the bond data, we are treating all the bonds as
being marketed tradables, and thus this is interpreted as a calibration phase. After this calibration phase,
one could then use this model to price other interest rate derivatives such as caps, ﬂoors, bond options, etc.
7.2.2 MultiFactor Short Rate Models
The single factor short rate models are usually not good enough to describe the term structure well. So,
instead of using a single factor model, we can use a multifactor model as follows.
Let X be a vector in R
n
of underlying variables aﬀecting the term structure. We assume that these
variables follow a stochastic diﬀerential equation model
dX = f(X, t)dt +g(X, t)dz (7.8)
where z ∈ R
n
is a vector of uncorrelated Brownian motions. (Thus, g(X, t) ∈ R
n×n
and f(X, t) ∈ R
n
.)
We then assume that the short rate is a function of the underlying variables X. That is, r
0
(X, t). Note
that one possibility is to have the short rate be one of the variables in X. Thus, it is possible to choose
r
0
(X, t) = X
i
, where X
i
is the i −th factor.
66CHAPTER 7. APPLICATIONOF THE FACTOR FORM:INTEREST RATE ANDCREDIT DERIVATIVES
Now, to derive the pde that zero coupon bonds would follow, we just note that bonds are a function of
X and t. Hence, we have B(X, tT). Via Ito’s lemma, our tradables become
¸
B
0
B(T)
d
¸
B
0
B(T)
=
¸
r
0
(X, t)B
0
B
t
(T)+B
X
(T)f(X, t)+
1
2
Tr[B
XX
(T)g(X, t)g
T
(X, t)]
dt +
¸
0
B
r
(T)g(X, t)
dz
where I have suppressed the X and t arguments in B. Now, by the price APT, we have
¸
B
0
B(T)
λ
0
+
¸
0
B
X
(T)g(X, t)
λ =
¸
r
0
(X, t)B
0
(B
t
(T) +B
X
(T)f(X, t) +
1
2
Tr[B
XX
(T)g(X, t)g
T
(X, t)])
(7.9)
where λ ∈ R
n
. The ﬁrst equation gives λ
0
= r
0
(X, t) which is the random short rate. Then the second
equation gives
(B
t
(T) +B
X
(T)(f(X, t) −g(X, t)λ) +
1
2
Tr[B
XX
(T)g(X, t)g
T
(X, t)]) = r
0
(X, t)B(T) (7.10)
The boundary condition is of course that B(TT) = 1. This can be a pretty complicated pde, and given that
it describes bonds as a function of λ, we could use the current term structure (or equivalently, bond prices)
to calibrate λ. This calibration can be quite diﬃcult because bond prices are described by a pde, and to test
the ﬁt for diﬀerent values of λ, we might have to solve the pde every time. It is much easier if the solution
to the pde is known in closed form as a function of λ, then we can quickly adjust λ to best ﬁt the market
data of bonds.
7.2.3 HeathJarrowMorton
In the HeathJarrowMorton [6] framework, instead of modeling the instantaneous short rate as driving
the term structure, they decided to model instantaneous forward rates. That is, let r(ts) denote the
instantaneous forward rate seen from time t of the forward interest rate between time s and s + ds. With
this notational convention, we have that the instantaneous short rate is given by r(tt) = r(t), and a zero
coupon bond with maturity T seen from time t will be denoted by B(tT) and related to the instantaneous
short rates by
B(tT) = exp
−
T
t
r(ts)ds
. (7.11)
Hence, we also have that
r(tT) = −
∂
∂T
ln(B(tT)). (7.12)
HJM takes the instantaneous forward rates as the underlying variables, and models them as
dr(ts) = µ(ts)dt +σ(ts)dz(t). (7.13)
This model is a bit more complicated than the previous single and multifactor short rate models because
we see through equation (7.11) that B(tT) is a function of an inﬁnite number of underlying variables r(ts)
for s ∈ [t, T]! Because of this, if we were to write pricing formulas for derivatives, they would appear as
inﬁnite dimensional pdes!
Nevertheless, this model has an advantage over the previous single and multifactor short rate models
in that equation (7.11) gives us an explicit relationship between the tradables and the (iniﬁnite) underlying
variables. Thus, instead of using a generic Ito’s lemma relationship between B(tT) and the underlying
factors r(ts), we will be able to make that relationship concrete by plugging in explicit partial derivatives
into Ito’s lemma.
Why is this helpful? Because this will allow us to pull the Price APT relationship for the marketed
tradables B(tT) back to the underlying variables r(ts) which opens up the possibility of doing calibration
7.2. INTEREST RATE DERIVATIVES 67
Figure 7.3: Notation for Zero Coupon Bond Prices
directly on the underlying factors r(ts) instead of the marketed tradables. If it is easier to work with market
data about instantaneous forward rates, then this can simplify the calibration process. To understand the
value of this, you should consider the calibration procedure that has to be done for a single or multifactor
short rate model if no closed form bond pricing formula is known (i.e. no closed form solution to the pricing
pde is known), and compare that with using forward rate data to calibrate in the HJM model that we will
derive.
Okay, with those preliminaries out of the way, let’s dive into the details. We will derive the Price APT
equations marketed tradables which are the bonds B(tT). However, using the explicit relationship between
the bonds B(tT) and the underlying variabels r(ts) will allow us to pull the Price APT relationship back
to the underlying variables r(ts) which would then allow us to calibrate using r(ts) directly.
In this setup, we will take zero coupon bonds as our marketed tradables. That is, B(tT) are tradables,
which means that we need to calculate dB(tT) for our tradables table. This is a bit of a tricky calculation
because B(tT) is really a function of an inﬁnite number of Ito processes r(ts) for s ∈ [t, T] through the
equation
B(tT) = exp
−
T
t
r(ts)ds
(7.14)
To simplify our thinking, let’s begin by assuming that s is indexed by k = 1...n so that B(t; T) will only
depend on n Ito processes. That is, let us think
B(tT) = B({r(ts
k
) : k = 1...n}, tT) (7.15)
Then, by Ito’s lemma, we would have
dB =
¸
B
t
(ts) +
n
¸
i=1
µ(t, s
i
)B
r(tsi)
+
n
¸
i=1
n
¸
j=1
1
2
σ(ts
i
)σ(ts
j
)B
r(tsi)r(tsj)
¸
dt
+
n
¸
i=1
σ(ts
i
)B
r(tsi)
dz
Now, to complete our computation of Ito’s lemma, we need to compute B
t
, B
r(tsi)
, and B
r(tsi)r(tsj)
using
68CHAPTER 7. APPLICATIONOF THE FACTOR FORM:INTEREST RATE ANDCREDIT DERIVATIVES
the discretization of (7.14):
B(tT) = exp
−
n
¸
k=1
r(ts
k
)∆s
k
. (7.16)
This gives
B
t
≈ exp
−
n
¸
k=1
r(ts
k
)∆s
k
r(tt) = exp
−
T
t
r(ts)ds
r(tt) (7.17)
B
r(tsi)
≈ −exp
−
n
¸
k=1
r(ts
k
)∆s
k
∆s
i
= −exp
−
T
t
r(ts)ds
∆s
i
(7.18)
B
r(tsi)r(tsj)
≈ exp
−
n
¸
k=1
r(ts
k
)∆s
k
∆s
i
∆s
j
= exp
−
T
t
r(ts)ds
∆s
i
∆s
j
. (7.19)
Plugging these into Ito’s lemma gives
dB(tT) =
exp
−
n
¸
k=1
r(ts
k
)∆s
k
r(tt) −
n
¸
i=1
µ(ts
i
) exp
−
n
¸
k=1
r(ts
k
)∆s
k
∆s
i
+
1
2
n
¸
i=1
n
¸
j=1
σ(ts
i
)σ(ts
j
) exp
−
n
¸
k=1
r(ts
k
)∆s
k
∆s
i
∆s
j
¸
dt
−
n
¸
i=1
σ(ts
i
) exp
−
n
¸
k=1
r(ts
k
)∆s
k
∆s
i
dz.
Taking continuous limits yields
dB(tT) =
B(tT)r(tt) −B(tT)
T
t
µ(ts)ds +
1
2
B(tT)
T
t
T
t
σ(ts)σ(tr)drds
dt
−
B(tT)
T
t
σ(ts)ds
dz.
Now, applying the Price APT equation leads to
r(tt) −
T
t
µ(ts)ds +
1
2
T
t
T
t
σ(ts)σ(tr)drds = r(tt) −
T
t
σ(ts)ds
λ
1
(7.20)
or
T
t
µ(ts)ds −
1
2
T
t
T
t
σ(ts)σ(tr)drds =
T
t
σ(ts)ds
λ
1
. (7.21)
The upshot of this calculation is that the mean and the volatilities of the forward rates r(ts) must be related
through this equation. In fact, taking the partial derivative with respect to T gives
µ(tT) −σ(tT)
T
t
σ(ts)ds = σ(tT)λ
1
(7.22)
which makes the relationship a little more explicit. Thus, we have ”pulled back” the Price APT relationship
onto the underlying factors.
Again, to remind you, the point of this is that now we can calibrate from forward rate market date r(ts)
rather than having to go through bond prices B(tT).
7.2. INTEREST RATE DERIVATIVES 69
This is just the calibration phase. To price some derivative we would not have an explicit relationship
between the derivative and the underlying variables r(ts), and thus Ito’s lemma (to create the tradables
table) combined with the Price APT would lead to an inﬁnite dimensional pde. The point is that for actual
pricing of a new derivative, using the factor approach and pdes is not easy. This is one case in which risk
neutral pricing (Chapter 9) can help quite a bit.
7.2.4 The LIBOR Market Model
This model is similar to the HJM model except that it uses discrete forward rates instead of instantaneous
forward rates. Again, we will be able to use an explicit relationship between the marketed tradables of bonds
and the underlying variables of discrete forward rates to pull the Price APT relationship onto the underlying
variables. Once again, the idea is that this can make calibration easier. Due to its similarity with HJM, this
is also a model that is easier to price with using the risk neutral approach. Nevertheless, the factor approach
can provide us with an important perspective on this model. For further reading on this approach see [3].
Let R(tT
1
; T
2
) be the forward interest rate between time T
1
and T
2
as seen at time t. For simplicity, let
T
i+1
= T
i
+ τ where τ is a ﬁxed amount of time. Furthermore, we use an interest rate convention so that
the price of a zero coupon bond is
B(tT
1
) =
1
(1 +τR(tT
1
))
(7.23)
and for bond prices and forward rates we have
B(tT
2
) =
B(tT
1
)
(1 +τR(tT
1
; T
2
))
(7.24)
and more generally
B(tT
i
) =
B(tT
i−1
)
(1 +τR(tT
i−1
; T
i
))
(7.25)
etc for i = 1, ..., n.
Now, we will model the forward rates as stochastic diﬀerential equations. For simplicity, we use a
geometric Brownian motion model.
dR(tT
1
; T
2
) = a
1
R(tT
1
; T
2
)dt +b
1
R(tT
1
; T
2
)dz
2
(7.26)
For notational simplicity, let’s write R
i
= R(tT
i
, T
i+1
) and B
i
= B(tT
i
) so that
dR
i
= a
i
R
i
dt +b
i
R
i
dz
i
(7.27)
Then prices of tradables satisfy
B
i
=
B
i−1
1 +τR
i−1
(7.28)
Now, it should be clear that B
i
depends on R
i−1
and hence the factor z
i−1
, but also B
i−1
. Now since B
i−1
depends on R
i−2
and B
i−2
, etc., then B
i
ultimately depends on the factors z
i−1
, z
i−2
, z
i−3
, ..., z
1
. Thus,
let us write generically that
dB
i
= µ
i
B
i
dt +B
i
i−1
¸
j=1
σ
ij
dz
j
. (7.29)
The relationship above leads to a recursive dependence
B
i
= (1 +τR
i
)B
i+1
. (7.30)
70CHAPTER 7. APPLICATIONOF THE FACTOR FORM:INTEREST RATE ANDCREDIT DERIVATIVES
Using Ito’s lemma gives
dB
i
= τdR
i
B
i+1
+ (1 +τR
i
)dB
i+1
+τdR
i
dB
i+1
(7.31)
= τB
i+1
(a
i
R
i
dt +b
i
R
i
dz
i
) + (1 +τR
i
)
¸
µ
i+1
B
i+1
dt +B
i+1
i
¸
j=1
σ
i+1,j
dz
j
¸
(7.32)
+τb
i
R
i
B
i+1
¸
i
¸
j=1
σ
i+1,j
ρ
ij
¸
dt (7.33)
=
¸
τB
i+1
(a
i
R
i
) +τb
i
R
i
B
i+1
i
¸
j=1
σ
i+1,j
ρ
ij
+ (1 +τR
i
)(µ
i+1
B
i+1
)
¸
dt
+τB
i+1
b
i
R
i
dz
i
+ (1 +τR
i
)
¸
B
i+1
i
¸
j=1
σ
i+1,j
dz
j
¸
. (7.34)
This is the expression for our tradables. From this we can get two things. First, we note that by comparing
this to (7.29) we should have that
τB
i+1
b
i
R
i
+ (1 +τR
i
)B
i+1
σ
i+1,i
= 0. (7.35)
Since the coeﬃcient of z
i
in (7.29) is zero. This allows us to solve for σ
i+1,i
as
σ
i+1,i
= −
τb
i
R
i
(1 +τR
i
)
. (7.36)
Futhermore, equating the coeﬃcients of z
j
for j < i in (7.29) and (7.34) gives
(1 +τR
i
)B
i+1
σ
i+1,j
= B
i
σ
i,j
. (7.37)
Now, noting that B
i
= (1 +τR
i
)B
i+1
gives
σ
i+1,j
= σ
i,j
. (7.38)
Thus, by combining (7.36) and (7.38) we obtain
σ
i+1,j
= −
τb
j
R
j
(1 +τR
j
)
, i > j. (7.39)
Once we have established these relationships, we can move to constructing the tradables table and
applying the Price APT. Returning to (7.34), the tradables look like
dB
i
=
¸
τB
i+1
(a
i
R
i
) +τb
i
R
i
B
i+1
i
¸
j=1
σ
i+1,j
ρ
ij
+ (1 +τR
i
)(µ
i+1
B
i+1
)
¸
dt
+τB
i+1
b
i
R
i
dz
i
+ (1 +τR
i
)
¸
B
i+1
i
¸
j=1
σ
i+1,j
dz
j
¸
(7.40)
= B
i
µ
i+1
+
τa
i
R
i
+τb
i
R
i
¸
i
j=1
σ
i+1,j
ρ
ij
1 +τR
i
dt
+B
i
¸
i−1
¸
j=1
σ
i+1,j
dz
j
¸
(7.41)
7.2. INTEREST RATE DERIVATIVES 71
where (7.39) was used in the last equality.
The Price APT then tells us that
µ
i
= r
0
+
i−1
¸
j=1
λ
j
σ
i+1,j
(7.42)
where r
0
is the short rate process, and from (7.41),
µ
i
=
µ
i+1
+
τa
i
R
i
+τb
i
R
i
¸
i
j=1
σ
i+1,j
ρ
ij
1 +τR
i
(7.43)
These equations mix parameters from the underlying variables (a
i
and b
i
), and the marketed tradables (µ
i
and σ
i,j
). We want conditions that don’t involve anything related to the marketed tradables. Thus, we
would like to eliminate µ
i
and σ
i,j
terms.
To do this, let’s substitute (7.42) into (7.43), giving
i−1
¸
j=1
λ
j
σ
i+1,j
=
¸
i−2
¸
j=1
λ
j
σ
i,j
+
τa
i
R
i
+τb
i
R
i
¸
i
j=1
σ
i+1,j
ρ
ij
1 +τR
i
¸
(7.44)
This still contains σ
i,j
terms. But, we can substitute in using (7.39) to get rid of those terms and have
−
i−1
¸
j=1
λ
j
τb
j
R
j
(1 +τR
j
)
=
¸
−
i−2
¸
j=1
λ
j
τb
j
R
j
(1 +τR
j
)
+
τa
i
R
i
−τb
i
R
i
¸
i
j=1
τbjRj
(1+τRj)
ρ
ij
1 +τR
i
¸
(7.45)
or
−λ
i−1
τb
i−1
R
i−1
(1 +τR
i−1
)
=
¸
τa
i
R
i
−τb
i
R
i
¸
i
j=1
τbjRj
(1+τRj)
ρ
ij
1 +τR
i
¸
(7.46)
which is the calibration relationship pulled onto the underlying variables of the discrete forward rates R
i
.
(Note that with n tradables as bond prices, we only have n−1 factors z
i
. Thus, there are only n−1 market
prices of risk, and that is why the above market price of risk is λ
i−1
and not λ
i
.)
Relations to HJM
The Libor Market Model is, of course, highly related to the HJM model. However, there are a couple of
diﬀerence that are helpful to point out. The ﬁrst is that we derived that HJM as a single factor model (a
multifactor model is easy to do as well). That is, all the instantaneous forward rates were driven by the
same factor dz. In the Libor Market Model, each discrete forward rate is driven by a distinct factor dz
i
.
The next thing to note is that in the Libor Market Model, we used a recursive relationship to relate the
underlying variables to the marketed tradables
B
i
= (1 +τR
i
)
B
i−1
(1 +τR
i−1
)
(7.47)
which can be expanded to give
B
i
= (1 +τR
i
)
B
i−1
(1 +τR
i−1
)
=
i−1
¸
j=0
1
(1 +τR
i−1
)
(7.48)
This is, in fact, the discrete analog to the continuous equation used in HJM
B(tT) = exp
−
T
t
r(ts)ds
(7.49)
72CHAPTER 7. APPLICATIONOF THE FACTOR FORM:INTEREST RATE ANDCREDIT DERIVATIVES
Thus, the two models (HJM and Libor Market Model) really are brothers.
Finally, the most important thing to note about the HJM and Libor Market Model is that they can
both be viewed and understood from the factor point of view. This is typically not done because the risk
neutral framework turns out to be better suited for these models. However, it is important to understand
that these models all come from just a simple application of the factor approach, despite their looking quite
complicated. I hope you have found this exercise valuable... let’s move on to credit.
7.3 Credit Derivatives
Credit derivatives usually refer to derivatives that pay oﬀ depending on whether a bankruptcy has occured.
Since a bankruptcy is a sudden event, credit model rely heavily on Poisson processes. In what follows, I will
present the simplest models of a Defaultable Bond when the intensity of default is a constant, and then we
will generalize that to allow the intesity to be random. You should note the similarity between these models
and Merton’s jump diﬀusion model in Chapter 6, Section 6.1.6. In fact, if you understand Merton’s model
then there isn’t too much new here...
7.3.1 Defaultable Bonds
In deriving equations for defaultable bonds, we need two factors. The ﬁrst is the short rate, and the second
is a default factor. We model these as
dr
0
= adt +bdz (7.50)
dN = dπ(α) (7.51)
In this case, a defaultable bond of maturity T is a function or r
0
, N, and t: B(r
0
, N, tT). Since N(t) is a
pure Poisson process, we can start it at N = 0 which is the no default state, and let N = 1 be the default
state. By Ito’s lemma we can write, after suppressing all arguments except N:
dB(N) = LB(N)dt +bB
r
(N)dz + (B(N + 1) −B(N))(dπ(α) −αdt) (7.52)
where I have compensated the Poisson process and
LB(N) = (B
t
(N) +aB
r
(N) +
1
2
b
2
B
rr
(N) +α(B(N) −B(N + 1))) (7.53)
Now let’s write down the tradables table, which includes the money market account:
¸
B
0
B(N)
d
¸
B
0
B(N)
=
¸
r
0
B
0
LB(N)
dt +
¸
0 0
bB
r
(N) (B(N + 1) −B(N))
¸
dz
dπ −αdt
(7.54)
Finally, we solve the Price APT equations
¸
B
0
B(N)
λ
0
+
¸
0 0
bB
r
(N) (B(N + 1) −B(N))
¸
λ
1
λ
2
=
¸
r
0
B
0
LB(N)
(7.55)
From the ﬁrst equation we have λ
0
= r
0
the short rate. The second equation gives
r
0
B(N) +bB
r
(N)λ
1
+ (B(N + 1) −B(N))λ
2
= LB(N) (7.56)
which can be rewritten as
B
t
(N) +aB
r
(N) +
1
2
b
2
B
rr
+α(B(N) −B(N +1)) = r
0
B(N) +λ
1
bB
r
(N) +λ
2
(B(N +1) −B(N)). (7.57)
But, we assume that we start with N = 0 and default is N = 1, so we have
B
t
(0) + (a −λ
1
b)B
r
(0) +
1
2
b
2
B
rr
(0) = (r
0
+α −λ
2
)B(0) + (λ
2
−α)B(1). (7.58)
7.3. CREDIT DERIVATIVES 73
Diﬀerent Types of Recovery
But, since B(0) is the no default state, it is the same as B. Furthermore, B(1) means that we are in default.
Hence, we can assume diﬀerent recoveries in default such as no recovery B(1) = 0, or fractional recovery
B(1) = xB(0), or even ﬁxed recovery B(1) = X.
If we assume no recovery we obtain
B
t
(0) + (a −λ
1
b)B
r
(0) +
1
2
b
2
B
rr
(0) = (r
0
+α −λ
2
)B(0) (7.59)
This looks exactly like the standard model for a bond, except that the short rate is increased by α − λ
2
.
That is, the default rate and its market price of risk just adjust the short rate of interest. Otherwise, the
price is the same. This means that if there is a closed form solution for bond prices under a single factor
short rate model, then there will also be a closed form solution for defaultable bonds!
7.3.2 Defaultable Bonds with Random Intensity of Default
This time we will allow the default intensity to follow a stochastic diﬀerential equation. That is, we have
dr
0
= adt +bdz
1
(7.60)
dN = dπ(α) (7.61)
dα = fdt +gdz
2
(7.62)
where E[dz
1
dz
2
] = ρdt. In this case, a defaultable bond of maturity T is a function of r
0
, N, α and t:
B(r
0
, N, α, tT). Again, we start with N = 0 which is the no default state, and let N = 1 be the default
state. By Ito’s lemma we can write:
dB(N) = LB(N)dt +bB
r
(N)dz
1
+gB
α
(N)dz
2
+ (B(N + 1) −B(N))(dπ(α) −αdt) (7.63)
where again I have compensated the Poisson process and this time
LB(N) = (B
t
(N)+aB
r
(N)+fB
α
(N)+
1
2
b
2
B
rr
(N)+
1
2
g
2
B
αα
(N)+ρbgB
rα
(N)+α(B(N)−B(N+1))) (7.64)
Now let’s write down the tradables table:
¸
B
0
B(N)
d
¸
B
0
B(N)
=
¸
r
0
B
0
LB(N)
dt +
¸
0 0 0
bB
r
(N) gB
α
(N) (B(N + 1) −B(N))
dz
1
dz
2
dπ −αdt
¸
¸
(7.65)
Finally, we solve the Price APT equations
¸
B
0
B(N)
λ
0
+
¸
0 0 0
bB
r
(N) gB
α
(N) (B(N + 1) −B(N))
λ
1
λ
2
λ
3
¸
¸
=
¸
r
0
B
0
LB(N)
(7.66)
From the ﬁrst equation we have λ
0
= r
0
the short rate. The second equation gives
r
0
B(N) +bB
r
(N)λ
1
+gB
α
(N)λ
2
+ (B(N + 1) −B(N))λ
3
= LB(N) (7.67)
which can be rewritten as
B
t
(N) +aB
r
(N) +fB
α
(N) +
1
2
b
2
B
rr
(N) +
1
2
g
2
B
αα
(N) +ρgbB
rα
(N) +α(B(N) −B(N + 1))
= r
0
B(N) +λ
1
bB
r
(N) +λ
2
gB
α
(N) +λ
3
(B(N + 1) −B(N)).
But, we assume that we start with N = 0 and default is N = 1, so we have
B
t
(0)+(a−λ
1
b)B
r
(0)+(f−λ
2
g)B
α
(0)+
1
2
b
2
B
rr
(0)+
1
2
g
2
B
αα
(0)+ρgbB
rα
(0) = (r
0
+α−λ
3
)B(0)+(λ
3
−α)B(1).
(7.68)
But, since B(0) is the no default state, it is the same as B(0). Furthermore, B(1) means that we are in
default. Hence, we can assume diﬀerent recoveries in default such as no recovery B(1) = 0, or fractional
recovery B(1) = xB(0), or even ﬁxed recovery B(1) = X.
74CHAPTER 7. APPLICATIONOF THE FACTOR FORM:INTEREST RATE ANDCREDIT DERIVATIVES
7.4 Problems
Problem 7.4.1 Derive a pde for the price of a European call option on a nondividend paying stock when
interest rates are random, and the short rate follows CoxIngersollRoss dynamics. That is:
dS = µSdt +σSdz
1
(7.69)
dr
0
= a(b −r
0
)dt +c
√
r
0
dz
2
(7.70)
where dz
1
and dz
2
are correlated E[dz
1
dz
2
] = ρdt. Your answer may contain a market price of risk.
Problem 7.4.2 (Options on Forwards and Futures)
(a) Assume interest rates are stochastic and driven by a single factor short rate model. Derive the pde for a
derivative on a futures contract and on a forward contract.
Assume the following
df = µfdt +σfdz
dr
0
= adt +bdz
B
where f is either the forward price or futures price and E[dzdz
B
] = ρdt.
Hint 1: Recall that futures prices are marked to market (See Section 6.1.5 in Chapter 6, whereas forward
prices are not marked to market.
Hint 2: Note that a forward price is not tradable. However, you may assume that the asset that the
forward contract is on is tradable on a spot market and there are standard relationships between the spot
price of an asset and its forward price.
Hint 3: My recommendation is to let your underlying variables be the future and forward price, re
spectively, and the short rate. Additionally, it might be convenient to write bond dynamics generically as
dB = µ
B
Bdt +σ
B
Bdz
B
, where the short rate process is being driven by z
B
.
(b) What if interest rates are not stochastic? Do the pdes for options on forwards and futures look the same?
Problem 7.4.3 (PDE for a derivative on a dividend paying stock under stochastic interest rates)
Consider a market with a stock price process S(t) satisfying
dS = µSdt +σSdz
1
(7.71)
where z
1
(t) is a standard Brownian motion. Assume that this stock also pays a continuous dividend at a rate
of q. Thus, over time dt, the dividend amount is qSdt.
Additionally, assume that interest rates are random, and the term structure is driven by a single factor
short rate model
dr
0
= a(b −r
0
)dt +fdz
2
(7.72)
where z
2
(t) is a standard Brownian motion and E[dz
1
dz
2
] = ρdt. Assume that a money market account
exists that satisﬁes
dB
0
= r
0
B
0
dt. (7.73)
Consider a third asset whose price at time t can be written as a twice diﬀerentiable function of S(t), r
0
(t)
and t. Call this price function c(S(t), r
0
(t), t). If no arbitrage exists in the market, derive the pde that
c(S(t), r
0
(t), t) must satisfy. (Your answer may contain a market price of risk.)
Problem 7.4.4 (Flat Term Structure)
Consider a term structure model where the term structure of interest rates is ﬂat, but moves up and down
randomly. That is, let zero coupon bond prices with face value $1 and maturity T be denoted by B(tT) and
satisfy
B(tT) = exp(−r(T −t)) (7.74)
7.4. PROBLEMS 75
where r is modeled as a stochastic diﬀerential equation
dr = a(r, t)dt +b(r, t)dz. (7.75)
Note that this model applies for all maturities T.
(a) Write down models for the instantaneous return of the money market account and for a generic zero
coupon bond with maturity T (i.e. for B(tT)).
(b) Derive the absence of arbitrage condition in this case, and derive restrictions on a(r, t) and b(r, t). What
does this imply about the allowable dynamics for this term structure?
76CHAPTER 7. APPLICATIONOF THE FACTOR FORM:INTEREST RATE ANDCREDIT DERIVATIVES
Chapter 8
Hedging
8.1 Introduction
We present two main approaches to hedging. In the ﬁrst, we recognize that risk comes from the factors.
Thus, hedging is based upon eliminating factor risk. In the second approach, we consider a derivative to be
a function of underlying variables. We are then hedged against variations in these underlying variables if
the derivative of the value of a portfolio with respect to the underlying variables is zero. Thus, our portfolio
is not sensitive to small variations in the underlying variables.
8.2 Hedging from a Factor Perspective
In abstract terms, hedging involves eliminating factor risk. Once again, we can either work with returns or
prices. In this chapter, we will work with prices.
Let dV
u
be the value change of the asset that we would like to hedge. Furthermore, we will assume that
we hold one unit of this asset. Thus a factor model for our unhedged portfolio proﬁt/loss over the next time
instance is
dV
u
= A
u
dt +B
u
dz (8.1)
and we would like to hedge this position.
By hedging, we actually just mean that we will form a portfolio (that may be dynamically traded over
time) so that the factor model of the portfolio eliminates all factor risk. In this case, we have entirely hedged
out the risk.
Let us consider having various tradable assets to hedge with, and assume that these assets have a value
proﬁt/loss change vector of
dV
h
= A
h
dt +B
h
dz (8.2)
Now, let ˆ y be the holdings of the assets used to hedge with and let dV
0
be the value change of the hedged
portfolio. This change in value is given by
dV
0
= dV
u
+ ˆ y
T
dV
h
= (A
u
+ ˆ y
T
A
h
)dt + (B
u
+ ˆ y
T
B
h
)dz (8.3)
To eliminate the risk, we need to eliminate the coeﬃcients of the factors. Thus, we need to choose ˆ y such
that
B
u
+ ˆ y
T
B = 0 (8.4)
That is really all there is to hedging!
77
78 CHAPTER 8. HEDGING
8.2.1 Description Using a Tradables Table
I like to use a tradables table description, so let’s rework the hedging analysis. Consider a tradables table
description, but add our holdings variable y
Holdings
y
Prices
P
Changes
d
V
=
Factor Model
A
dt +
B
dz
(8.5)
Let us assume that the last tradable is the derivative that we would like to hedge and we assume that
we hold one unit of that derivative. Hence, our holdings vector is actually
y =
¸
ˆ y
1
(8.6)
and the value change in the hedged portfolio is
dV
0
= y
T
dV = y
T
Adt +y
T
Bdz (8.7)
Thus, to hedge, we select ˆ y so that y
T
B = 0 (recalling of course that y is given by (8.6)).
8.2.2 The Relationship Between Hedging and Arbitrage
Hedging is actually quite closely related to arbitrage. In fact, a standard derivation of the BlackScholes
equation begins from a hedging argument. In this section, let’s see how hedging and arbitrage are related.
Recall the arbitrage price implication
y
T
P = 0 No cost
y
T
B = 0 No risk
⇒ y
T
A = 0 No return (8.8)
Note that y
T
B = 0 is one of the conditions of the arbitrage price implication. Thus, from the line following
equation (8.7), a hedged portfolio will automatically satisfy this condition.
The other condition that needs to be satisﬁed to set up the arbitrage is that the total cost or price must
be zero, ˆ y
T
P = 0. Let’s see how we can alter our hedged portfolio to satisfy this no cost condition.
Creating a No Cost Hedge
Let’s break out the tradables table in a little more detail. First, lets assume that there is a tradable with no
direct factor risk
dB
0
= r
0
B
0
dt (8.9)
Additionally, using the notation above, the asset to be hedged satisﬁes
dV
u
= A
u
dt +B
u
dz (8.10)
and the tradables used to hedge this asset are
dV
h
= A
h
dt +B
h
dz (8.11)
We can write this in a tradables table form as
y
0
ˆ y
1
¸
¸
B
0
P
h
P
u
¸
¸
d
B
0
V
h
V
u
¸
¸
=
r
0
B
0
A
h
A
u
¸
¸
dt +
0
B
h
B
u
¸
¸
dz. (8.12)
where y
0
is the holding of B
0
, ˆ y is the holdings of the assets with value changes of dV
h
, and we hold a single
unit of the asset to be hedged.
8.2. HEDGING FROM A FACTOR PERSPECTIVE 79
Now, we make the following important observation. The holding in the ﬁrst asset y
0
has no eﬀect on
the hedge! Since it has no factor risk, including it in a portfolio has no eﬀect on the factors! Hence, we can
choose its holding y
0
arbitrarily without aﬀecting the hedge.
This recognition allows us to select y
0
so that the hedge has zero cost. That is, let ˆ y be the holdings of
a hedged portfolio, and then select y
0
so that the total cost is zero
y
0
B
0
+ ˆ y
T
P
h
+P
u
= 0 (8.13)
Thus, any completely hedged portfolio can be altered to satisfy the price APT implication. Thus, for no
arbitrage to exist, we must have zero proﬁt/loss in this hedged portfolio. This means
y
0
rB
0
+ ˆ y
T
A
h
+A
u
= 0 (8.14)
This proﬁt/loss condition is actually the BlackScholes equation! Thus, a hedging strategy can be used to
derive the BlackScholes equation via the above relationships.
A Simple Explanation
Here is the simple explanation of what we have done above. We hold an asset that we would like to hedge.
First, we use ˆ y to hedge away all the factor risk in our portfolio. Thus, the hedged portfolio has no direct
factor risk. Since it has no factor risk, for no arbitrage to exist it must be the same as the ﬁrst tradable
that also has no factor risk. In the BlackScholes setting, this ﬁrst asset is the risk free asset, and all we
are saying is the the hedged portfolio must earn the risk free rate. In an interest rate derivative setting, the
tradable without direct factor risk is the moneymarket account and we are saying that the hedged portfolio
must earn the short rate. That is really all that we are doing with the above arguments!
8.2.3 Hedging Examples
Let’s see how hedging is done in some examples.
Hedging in BlackScholes
In the BlackScholes setup of Chapter 6, Section 6.1.1 we have the following tradables table with the ﬁrst
column being the holdings. We have assumed that we have one option c and that we are hedging with the
stock S. Thus, we assume the holding of the stock is ˆ y.
0
ˆ y
1
¸
¸
B
0
S
c
¸
¸
d
B
0
S
c
¸
¸
=
r
0
B
0
µS
c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
¸
¸
dt +
0
σS
σSc
S
¸
¸
dz. (8.15)
Now, this portfolio is hedged if y
T
B = 0 which in this case is
ˆ yσS +σSc
S
= 0 (8.16)
Solving for ˆ y gives ˆ y = −c
S
. Thus, we have a hedged portfolio if we hold −c
S
shares of the stock for every
option. The quantity c
S
is typically called the delta of the option.
Pricing
The above portfolio is hedged, but we can convert it to satisfy the arbitrage implication by choosing y
0
so
that the total cost is zero
y
0
B
0
−c
S
S +c = 0 (8.17)
80 CHAPTER 8. HEDGING
This is now a potential arbitrage portfolio. For no arbitrage to be present, we must have no proﬁt/loss which
means that the drift term of our portfolio must be zero
y
0
r
0
B
0
−c
s
µS + (c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
) = 0 (8.18)
Substituting in (8.18) for y
0
B
0
= c
S
S −c from (8.17) gives
r
0
(c
S
S −c) −c
s
µS + (c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
) = 0 (8.19)
or
c
t
+r
0
Sc
S
+
1
2
σ
2
S
2
c
SS
= r
0
c (8.20)
which is the BlackScholes equation.
Once again, the simpler explanation of this derivation is that the hedged portfolio is risk free and hence
must earn the risk free rate. I presented the more structured derivation because I believe that it is always
important to have structure to fall back on when your intuition fails. However, the intuition is important as
well!
Hedging in Bond Pricing
Consider the single factor short rate models of Chapter 7, Section 7.2.1. Following this model, let’s write
down the tradable table with holdings:
0
ˆ y
1
¸
¸
B
0
B
1
(T)
B
2
(T
0
)
¸
¸
d
B
0
B
1
(T)
B
2
(T
0
)
¸
¸
=
r
0
B
0
(B
1
t
(T) +aB
1
r
(T) +
1
2
b
2
B
1
rr
(T))
(B
2
t
(T
0
) +aB
2
r
(T
0
) +
1
2
b
2
B
2
rr
(T
0
))
¸
¸
dt +
0
bB
1
r
(T)
bB
2
r
(T
0
)
¸
¸
dz
(8.21)
Now, this portfolio is hedged if y
T
B = 0 which in this case is
ˆ ybB
1
r
(T) +bB
2
r
(T
0
) = 0 (8.22)
Solving for ˆ y gives ˆ y = −
B
2
r
(T0)
B
1
r
(T)
. Note that hedged portfolios of bonds are sometimes called ”immunized”.
Pricing
Using the same procedure as in the BlackScholes case, we can convert this hedged portfolio into a potential
arbitrage portfolio and derive a pricing equation. Again, we choose y
0
so that the total cost is zero
y
0
B
0
−
B
2
r
(T
0
)
B
1
r
(T)
B
1
(T) +B
2
(T
0
) = 0 (8.23)
and note that for no arbitrage to exist, we must have no proﬁt/loss
y
0
r
0
B
0
−
B
2
r
(T
0
)
B
1
r
(T)
(B
1
t
(T) +aB
1
r
(T) +
1
2
b
2
B
1
rr
(T)) + (B
2
t
(T
0
) +aB
2
r
(T
0
) +
1
2
b
2
B
2
rr
(T
0
)) = 0 (8.24)
Using y
0
B
0
=
B
2
r
(T0)
B
1
r
(T)
B
1
(T) −B
2
(T
0
) from (8.23) and substituting this into the ﬁrst term in (8.24) gives
r
0
B
2
r
(T
0
)
B
1
r
(T)
B
1
(T) −B
2
(T
0
)
−
B
2
r
(T
0
)
B
1
r
(T)
(B
1
t
(T)+aB
1
r
(T)+
1
2
b
2
B
1
rr
(T))+(B
2
t
(T
0
)+aB
2
r
(T
0
)+
1
2
b
2
B
2
rr
(T
0
)) = 0
(8.25)
which can be rewritten as
B
2
t
(T
0
) +aB
2
r
(T
0
) +
1
2
b
2
B
2
rr
(T
0
) −r
0
B
2
(T
0
)
B
2
r
(T
0
)
=
B
1
t
(T) +aB
1
r
(T) +
1
2
b
2
B
1
rr
(T) −r
0
B
1
(T)
B
1
r
(T)
(8.26)
8.2. HEDGING FROM A FACTOR PERSPECTIVE 81
Now, since the left hand side only depends on B
2
and the right hand side only depends on B
1
and B
1
was
a bond of arbitrary maturity T, we must have that
B
1
t
(T) +aB
1
r
(T) +
1
2
b
2
B
1
rr
(T) −r
0
B
1
(T)
B
1
r
(T)
= λ
′
(8.27)
for some constant λ
′
. To make this look like our previously derived bond pricing equation in Chapter 7
Section 7.2.1, let’s let
λ
′
= λb (8.28)
so that (8.27) can be written as
B
1
t
(T) +aB
1
r
(T) +
1
2
b
2
B
1
rr
(T) −r
0
B
1
(T)
B
1
r
(T)
= λb (8.29)
Upon rearranging we have
B
1
t
(T) + (a −λb)B
1
r
(T) +
1
2
b
2
B
1
rr
(T) = r
0
B
1
(T) (8.30)
which is the bond pricing equation where λ is the market price of risk.
The intuitive explanation of this derivation is that ﬁrst we hedge out the factor risk by the choice of ˆ y.
Since this hedged portfolio has no factor risk, it must earn the same return as the tradable with no factor
risk, which is the money market account. This is where the pricing pde comes from!
8.2.4 Hedging under Incompleteness
In some cases it is impossible to eliminate all of the factor risk (this is true in incomplete markets). In this
case, we may still attempt to reduce the eﬀect of the factors on our portfolio. But, we will have choices in
terms of how we want to best try to reduce risk from the factors.
To see how this is done, let’s consider an example. In the jump diﬀusion model of Chapter 6 Section
6.1.6, we had the following tradables table.
0
y
1
¸
¸
B
S
c
¸
¸
d
B
S
c
¸
¸
=
r
0
B
(µ +αE[Y −1])S
Lc
¸
¸
dt +
0 0 0
σS S 0
σSc
S
0 1
¸
¸
dz
dψ
1
dψ
2
¸
¸
(8.31)
where L is the diﬀerential operator corresponding to the drift term in Ito’s lemma, and
dψ
1
= (Y −1)dπ(α) −αE[Y −1]dt (8.32)
dψ
2
=
(c(Y S
−
) −c(S
−
))dπ −αE[(c(Y S
−
) −c(S
−
))]dt
(8.33)
I have added the vector of holdings on the left side of the tradables table. I have assumed that I hold 1 unit
of the derivative c, and that I will hedge with the stock S. Since the bond B is not driven by any of the
risky factors, I do not need to consider it in the hedge.
So, the hedged portfolio takes the form
dV
h
= ydS +dc (8.34)
= y((µ +αE[Y −1])Sdt +σSdz +Sdψ
1
) +Lcdt +Sc
S
dz +dψ
2
(8.35)
= [y(µ +αE[Y −1])S +Lc]dt + [yσS +Sc
S
]dz +ySdψ
1
+dψ
2
(8.36)
One can clearly see that it is not possible to eliminate all the factor risk by choosing y. Thus, we are in an
incomplete market.
82 CHAPTER 8. HEDGING
However, we can select y to eliminate some of the risk over the next dt. In fact, what Merton [11] did in
his pricing formula is equivalent to eliminating the Brownian risk dz. In that case, we would choose
y = −c
S
(8.37)
where c
S
is the partial derivative of Merton’s pricing formula.
But, let’s consider a diﬀerent alternative. Let’s choose y to eliminate the variance of the portfolio over
the next dt. That is
V ar(dV
h
)
=
E[A
2
] −2yE[AB] +y
2
E[B
2
]
λdt +
V ar(A) −2yCov(A, B) +y
2
V ar(B)
λ
2
dt
2
+ [σSc
S
−yσS]
2
dt
(8.38)
where
A = c(Y S(t
−
), t) −c(S(t
−
), t
−
) (8.39)
B = B(Y −1)S. (8.40)
Minimizing over y gives
y =
λE[AB] +λ
2
dtCov(A, B) +σ
2
S
2
c
S
λE[B
2
] +λ
2
dtV ar(B) +σ
2
S
2
(8.41)
and sending dt → 0 leads to
y =
λE[AB] +σ
2
S
2
c
S
λE[B
2
] +σ
2
S
2
. (8.42)
8.2.5 A Question of Consistency
Note that the hedges we have derived above often involve knowledge of a pricing formula for the option c.
This can be seen by noting that y is often a function of c
S
for instance. Thus, our hedging analysis has
presupposed that the derivative c follows a speciﬁed formula that we know.
This presupposition of knowledge can bring up a question of consistency between the hedge and the
assumption of the formula for c. Consider the following example.
Let’s consider a stochastic volatility model:
dB = r
0
Bdt (8.43)
dS = µSdt +
√
vSdz
1
(8.44)
dv = adt +bdz
2
(8.45)
with the tradables table
B
S
c
¸
¸
d
B
S
c
¸
¸
=
r
0
B
µS
Lc
¸
¸
dt +
0 0
√
vS 0
√
vSc
S
bc
v
¸
¸
¸
dz
1
dz
2
. (8.46)
with Lc = c
t
+µSc
S
+ac
v
+
1
2
vS
2
c
SS
+
1
2
b
2
c
SS
+ρ
√
vbSc
Sv
.
Now, this is an incomplete market, so we cannot perfectly hedge away the risk. Nevertheless, let’s consider
a hedged portfolio
dV
h
= y(µSdt+
√
vSdz
1
)+(c
t
+µSc
S
+ac
v
+
1
2
vS
2
c
SS
+
1
2
b
2
c
vv
+ρ
√
vbSc
Sv
)dt+
√
vSc
S
dz
1
+bc
v
dz
2
(8.47)
Clearly, with y, we cannot eliminate all the risk. Let’s consider a simple hedge where we only eliminate the
dz
1
risk. Thus, we choose y
√
vS +
√
vSc
S
= 0 or y = −c
S
.
Thus, our hedge is to hold −c
S
shares of the stock. But, what is c
S
?
Well, how about using the BlackScholes formula for c(S, t) and computing c
S
from that? But, this is
not consistent because to derive the BlackScholes formula we assume that volatility is not stochastic. In
our setting above, we are assuming that volatility is stochastic and thus there is no reason to believe that c
would follow the BlackScholes equation. This is where the inconsistency in hedging often arises.
8.3. HEDGING FROM AN UNDERLYING VARIABLE SENSITIVITY PERSPECTIVE 83
What formula for c?
So, you ask, then what formula should I use for c. The answer is that you should use the formula that best
corresponds to the actual price and movements of c in the market. Thus, if there is stochastic volatility,
then you should use a model that most accurately captures that stochastic volatility and how it is reﬂected
in the movement of the derivative c.
Now, some might argue that you should use a pricing formula that comes from the hedge that you are
doing. As we saw above, pricing and hedging are intimately related. This idea is both right and wrong.
Let’s see why.
Here is the correct part. Any pricing formula should be related to a reasonable hedging scheme via
our analysis above. Why? Because hedging is the mechanism that enforces pricing. If there is no hedging
justiﬁcation for a pricing formula, then there is no good reason to expect that pricing formula to hold.
However, if a pricing formula is tightly related to a hedging strategy, then deviations from that pricing
strategy can be exploited by turning the hedging strategy into an arbitrage (or almost arbitrage) opportunity
as shown in the previous section.
Here is the wrong part. Sometimes, you want to hedge for a reason unrelated to pricing. For example, in
the stochastic volatility case I might just want to hedge out the stock risk S via dz
1
and leave my portfolio
exposed to volatility risk. If I were interested in using some trading strategy that depended on volatility v
but not the stock price S, then this would be perfectly reasonable. But it probably would not be reasonable
to claim that I should be using a pricing formula related to this hedging strategy. The pricing formula c
that I should be using is the one that best reﬂects the actual movement of c in the market.
To summarize, every pricing formula should be intimately connected to a hedging strategy. However,
the reverse is not true. Not every hedging strategy should be turned into a pricing formula. Furthermore,
hedging strategies often rely on a pricing formula for the derivative c. This pricing formula should be the
best pricing formula that reﬂects the actual pricing and movement of the derivative c in the actual market.
In practice, many times that is not done (for many reasons, including computational, ease of use, etc).
Thus, often hedging analysis is inconsistent. For example, people like to rely on the BlackScholes formula
even when its assumptions are not valid. Due to standard ﬁnance conventions, in places below we will fall
into this trap as well, so keep a sharp eye out for where it may be occurring.
8.3 Hedging from an Underlying Variable Sensitivity Perspective
In some cases, there is a simpler and faster way to derive hedges that doesn’t involve an explicit use of the
factors. In this approach, we consider the asset that we are trying to hedge to be a function of underlying
variables, and hedge against those variables by eliminating the sensitivity to those underlying variables. In
the simplest terms, sensitivity is measured by the derivative of the hedged portfolio with respect to the
underlying variable of concern.
8.3.1 BlackScholes Hedging
For example, in the BlackScholes setup, we assume that an option is a function of the underlying stock S
and time t. That is, the price of an option is a function of the variables S and t which we write as c(S, t).
Now if we would like to hedge out the risk in our option, we note that all the risk in the price of an option
comes from its dependence on the stock variable. Hence, we form a hedged portfolio
P
h
= c(S, t) + ˆ yS (8.48)
and want to choose ˆ y so that this portfolio is hedged. But, in this case, since all the risk comes from the
stock variable, we will be hedged if our portfolio has no sensitivity to changes in the stock. Another way of
saying this is that we want the derivative of P
h
with respect to S to be zero. That means that small changes
in S will cause no change in the value of the hedged portfolio P
h
. Thus, we will be hedged.
84 CHAPTER 8. HEDGING
This condition can be written as
∂P
h
∂S
= c
S
+ ˆ y = 0 (8.49)
Solving for ˆ y yields ˆ y = −c
S
which is the same answer that we arrived at using factors. This hedge is known
as a delta hedge and the quantity c
S
is commonly referred to as the delta of the option.
There is a simple graphical interpretation of this hedge. Consider a plot with the underlying variable
on the xaxis, and the value of the tradable as a function of the underlying factor on the yaxis. In this
case, the underlying variable is the stock price S and the tradables are the stock itself S and the option c.
For simplicity, let’s assume the option is a European call option following BlackScholes. Figure 8.1 shows
a plot of the stock, call option, and delta hedged portfolio for an option with strike K = 10 and expiration
T = 0.3 on a stock with volatility σ = 0.3 and an interest rate of r
0
= 0.05. The hedged portfolio on the
right in Figure 8.1 is a portfolio of the stock (left) and call option (middle) so that at the current value of
the underlying variable (S = 10), the derivative of the portfolio value yS +c is zero (right plot).
0 5 10 15
−10
−5
0
5
10
15
Stock Value
Underlying Variable: S
T
r
a
d
a
b
le
: S
0 5 10 15
−10
−5
0
5
10
15
Option Value
Underlying Variable: S
T
r
a
d
a
b
le
: c
0 5 10 15
−10
−5
0
5
10
15
Hedged Portfolio
Underlying Variable: S
T
r
a
d
a
b
le
: H
e
d
g
e
d
P
o
r
tfo
lio
Figure 8.1: Delta Hedge: Left  Plot of the Stock, Middle  Plot of the option, Right  Plot of the hedged
portfolio assuming that the current price of the stock is $10.
To make this a potential arbitrage portfolio, we could add a position in the bond in order to make the
current price of the portfolio (at S = $10) be zero.
8.3.2 Hedging Bonds
We can apply this same underlying variables approach to our hedging of bonds. In that case, we hold a
bond B
2
(r
0
, tT
0
) that is a function of the short rate r
0
(t) and time t. We wish to hedge this bond with
another bond B
1
(r
0
, tT) that is also a function of the short rate r
0
(t) and time t. In this setup, the risk
in the price of a bond comes from the short rate r
0
(t) which is our underlying variable. Thus, a portfolio
P
h
= B
2
(r
0
, tT
0
) + ˆ yB
1
(r
0
, tT) is hedged if its derivative with respect to r
0
is zero. Thus
∂P
h
∂r
= B
2
r
(T
0
) + ˆ yB
1
r
(T) = 0 (8.50)
Solving for ˆ y yields ˆ y = −
B
2
r
(T0)
B
1
r
(T)
which is the same answer that we arrived at using factors.
8.3.3 Derivatives imply Small Changes
Note that this underlying variable approach uses the derivative as a measure of sensitivity. Recall that the
derivative is the change in a function for a small change in the variable. Thus, when we say that a portfolio
is hedged against stock price movements because its derivative with respect to the stock is zero, this means
that the value of the portfolio change is approximately zero for small changes in the price of the stock. But,
the value of the portfolio change could be quite large if the stock change is large. Hence, this approach only
works if we know that the stock price changes will be small. This is the case for stock price movements
driven by Brownian motion since Brownian motion is continuous. However, if the stock price is driven by a
8.4. HIGHER ORDER APPROXIMATIONS 85
Poisson process, then we would expect it to have large jumps at times. Thus, in this case using the derivative
is not a good approach since the stock price change would be large and the derivative would likely not be
a good approximation to the change in the portfolio value. This was the situation in the JumpDiﬀusion
model above, and note that the hedging strategy that we derived was not that same one that would result
from this derivative approach.
8.4 Higher Order Approximations
Using the derivative to model the change in a portfolio due to the change in a variable is a linear approxi
mation of portfolio value as a function of the underlying variable. Of course, one can also use a higher order
approximation to the portfolio value. An easy way to do this is to use the terms of a Taylor expansion. This
leads to the socalled Greeks, presented next.
8.4.1 The Greeks
Now, in general, a call option is not just a function of the price of the stock and time. From the Black
Scholes formula, we see that it depends also on the risk free rate r
0
and the volatility of the option σ. In
our derivation of the BlackScholes formula, we assumed that r
0
and σ were constant (not random factors).
However, recognizing that that is only an approximation, we can allow them to be underlying variables and
change. Then we can ask how their changes might cause the price of an option to change assuming that the
price follows the BlackScholes formula. To do this, we would just construct a multivariable Taylor series
expansion of c(S, t, r
0
, σ) as
∆c(S, t, r
0
, σ) = c
t
∆t +c
S
∆S +c
r
∆r
0
+c
σ
∆σ +
1
2
c
SS
(∆S)
2
+... (8.51)
I only included a single second order term (there are many second order terms) because is it the only named
second order term.
By looking at this Taylor expansion, we see that the various derivatives tell us the sensitivity of the price
of an option to changes in those variables. Each of these derivatives is given a Greek letter name as in Table
8.1.
theta c
t
delta c
S
rho c
r
vega c
σ
gamma c
SS
Table 8.1: The Greeks
If you have a bad memory, like I do, then you can remember some of the Greeks by noting that the ﬁrst
letter of the Greek is the same as the partial derivative. That is, ”theta” starts with ”t” and is the partial
with respect to t. Similarly, ”rho” starts with ”r” and is with respect to r
0
, and ”vega” starts with ”v” and
is the partial with respect to ”v”olatility. Unfortunately, with ”delta” and ”gamma” you are on your own.
Note that in the typical dynamic hedging assumption, we are able to trade continuously. Thus, only
terms of order dt and lower matter. However, this Taylor expansion approach assumes that we are not
trading continuously. Thus, we use a ∆t instead of a dt and furthermore, higher order terms will enter. This
is actually more practical than the typical continuous time trading assumptions.
8.4.2 A DeltaGamma Hedge
One can think of a delta hedge as eliminate the ﬁrst order term in the Taylor expansion. Of course, one can
go further and ask if higher order terms can be eliminated as well. If you elminate the ﬁrst order term in
86 CHAPTER 8. HEDGING
∆S and the second order term (∆S)
2
than this is called a DeltaGamma hedge. For such a DeltaGamma
hedge you need more than just the underlying stock, and furthermore, you need a tradable that depends on
the second order term (∆S)
2
. Let’s show how by using another call option c
(2)
and the underlying stock,
we can DeltaGamma hedge an option c.
In this case, the hedged portfolio is
P = c +y
1
S +y
2
c
(2)
(8.52)
and let’s Taylor expand this to obtain
∆P(S, t) = ∆c(S, t) +y
1
∆S +y
2
∆c
(2)
(8.53)
=
¸
c
t
∆t +c
S
∆S +
1
2
c
SS
(∆S)
2
+y
1
∆S +y
2
¸
c
(2)
t
∆t +c
(2)
S
∆S +
1
2
c
(2)
SS
(∆S)
2
(8.54)
=
c
t
+y
2
c
(2)
t
∆t +
c
S
+y
1
+y
2
c
(2)
S
∆S +
¸
1
2
c
SS
+y
2
1
2
c
(2)
SS
(∆S)
2
(8.55)
To DeltaGamma hedge, we have to eliminate the coeﬃcients of ∆S and (∆S)
2
by choosing y
1
and y
2
. Thus,
we must solve
c
S
+y
1
+y
2
c
(2)
S
= 0 (8.56)
1
2
c
SS
+y
2
1
2
c
(2)
SS
= 0 (8.57)
The solution is
y
1
=
c
SS
c
(2)
SS
c
(2)
S
−c
S
, y
2
= −
c
SS
c
(2)
SS
(8.58)
In the plots of Figure 8.1, this would create a portfolio that has the ﬁrst and second derivative equal to zero
at the current value of the underlying variable.
In practice, DeltaGamma hedges (and other hedges as well) are diﬃcult because transaction costs can
make it expensive to trade too many assets.
8.4.3 Determining what the error looks like
We can use the Taylor expansion to see what the error looks like in a Delta hedge under BlackScholes
assumptions. Consider the hedged portfolio
P(S, t) = c(S, t) −c
S
S (8.59)
and let’s perform a Taylor expansion of this in S and t
∆P(S, t) = ∆c(S, t) −c
S
∆S (8.60)
= c
t
∆t +
1
2
c
SS
(∆S)
2
dt +... (8.61)
= c
t
∆t +
1
2
c
SS
σ
2
S
2
dt +... (8.62)
where I have only kept terms up to order ∆t in (8.62). However, (8.61) is also a very useful equation for
intuituion. So we will use it as well. Note that the only diﬀerence between (8.61) and (8.62) is that (∆S)
2
is
σ
2
S
2
dt + higher order terms. For the analysis that we are looking at, the higher order terms don’t matter.
Now let’s assume that in the market, c satisﬁes the BlackScholes equation corresponding to a volatility
value of σ
i
. Thus,
c
t
+r
0
Sc
S
+
1
2
σ
2
i
S
2
c
SS
= r
0
c (8.63)
I use the subscript i on the volatility σ
i
to denote what is known as implied volatility. Implied volatility
is the value of volatility that when plugged into the BlackScholes formula will make it equal the current
8.5. SUMMARY 87
market price of an option. To be more concrete, let c
m
be the current market price of an option where
the stock price is S
0
, strike price is K, time to expiration is T, and risk free rate is r
0
. Furthermore, let
c
BS
(S, K, T, r
0
, σ) denote the BlackScholes formula. Then the implied volatility is deﬁned as the value of
σ
i
that solves
c
m
= c
BS
(S
0
, K, T, r
0
, σ
i
) (8.64)
Thus, it answers the question: If the market is following BlackScholes, what volatility value are they pluggin
in the BlackScholes formula?
Thus, (8.63) assumes that the market is pricing the option using the BlackScholes formula with a
volatility value of σ
i
. Thus, we can use (8.63) to substitute for c
t
in (8.61) which gives
∆P(S, t) = r
0
(c −c
S
S)dt +
1
2
c
SS
(∆S)
2
−σ
2
i
S
2
dt (8.65)
and ﬁnally noting that P = c −c
S
S gives
∆P(S, t) = r
0
Pdt +
1
2
c
SS
(∆S)
2
−σ
2
i
S
2
dt (8.66)
or purely to order dt using (8.62) instead of (8.61),
∆P(S, t) = r
0
Pdt +
1
2
c
SS
σ
2
−σ
2
i
S
2
dt. (8.67)
Equation (8.66) shows that the gain or loss of our hedged portfolio relative to the risk free rate depends on
the actual change in the stock price over the next ∆t in the term (∆S)
2
relative to the implied volatility
σ
i
in BlackScholes that is being used to price the option in the market. Equation (8.67) shows the same
thing, but in terms of the volatility of the stock σ rather than the stock move ∆S. In particular, for this
hedge where we are long the option and short delta of the stock, we make money if the stock moves more
than implied volatility estimate, and we lose money if it moves less.
8.5 Summary
Hedging can be approach from two diﬀerent points of view. In the ﬁrst point of view, we recognize the fact
that risk comes from the factors. Thus, in hedging we try to eliminate the factor risk. This point of view
is appropriate regardless of what the risky factors are. In the second point of view, tradables are viewed as
functions of underlying variables, and we hedge by constructing a portfolio that eliminates the sensitivity to
moves in the underlying variables. Usually we do this by setting the derivative of the hedged portfolio with
respect to the underlying variable to zero at the current value of the underlying variable. Since we only set
the derivative to zero, and the derivative reﬂects a local approximation to the portfolio, this works as long
as there are only small moves in the underlying variable between rehedging opportunities. Furthermore, if
we have more tradables to place in our portfolio, we can set higher order derivatives of our portfolio to zero
as well and create better hedges. Really, if you underlying Taylor expansions, then you are on your way to
understanding the majority of hedging methods.
8.6 Problems
Problem 8.6.1 Verify equation (8.38).
88 CHAPTER 8. HEDGING
Chapter 9
The Road to Risk Neutrality
9.1 Introduction
Risk neutral absence of arbitrage pricing is a widely used approach to derivative pricing. However, it tends
to be one of the most confusing, misunderstood, and misused pricing approaches, especially for those new
to derivative pricing. Nevertheless, when understood correctly, it is an extremely powerful approach. Thus,
in this chapter, I will explain the risk neutral pricing principle. This introduction to risk neutral pricing
does not follow the standard probabilityheavy route, but instead motivates risk neutral pricing in a simple
manner from the factor approach.
9.2 Do the Factors Matter?
In our tradables table, we have
Prices
P
Changes
d
V
=
Factor Model
A
dt +
B
dz
(9.1)
and the absence of arbitrage condition is
A = Pλ
0
+Bλ (9.2)
Thus, absence of arbitrage only depends on the values of P, A, and B, and not on what the exact factors
dz are. Therefore, the prices P are absence of arbitrage for any value change with A, and B, regardless of
what the driving factors are. This leads us to ponder the following.
Hypothesis: Perhaps by changing the factors from dz to some other random factors dψ, where the A and
B representation of prices changes is the same, I can compute the absence of arbitrage prices P easier.
To see whether something can be made of this hypothesis, we have to start with a more basic question.
Question: Given a set of factors dz, what other set of factors dψ will have the same A and B representation?
At ﬁrst glance, you might be tempted to say that I can choose dψ to be anything I want. It doesn’t have
to relate to the original dz at all because the absence of arbitrage condition ”does not see” dz. However,
you would soon realize that this is not the case.
Why? Because when we deal with derivative securities, their factor model is determined by the application
of Ito’s lemma. Thus, the A and B representation of price changes does see the factors via Ito’s lemma.
Hence, Ito’s lemma puts a constraint on which factors dψ are consistent with the original factors dz. Let’s
be a little more concrete about this.
89
90 CHAPTER 9. THE ROAD TO RISK NEUTRALITY
Let S(t) be a stock price with model
dS = µSdt +σSdz (9.3)
and let c(S, t) be a derivative security. By Ito’s lemma we have
dc =
c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
dt +σSc
S
dz. (9.4)
Thus, the tradables table is
Prices
¸
S
c
Changes
d
¸
S
c
=
Factor Model
¸
µS
c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
dt +
¸
σS
σSc
S
dz.
(9.5)
9.2.1 Brownian Factors
Changing the Mean Works
Let’s try new factors given by the replacement
dz → dψ = d˜ z +βdt (9.6)
That is, I will replace the original Brownian factors dz by a new factor that is a Brownian d˜ z plus a drift
βdt. It is important to consider the new factor to be the entire term dψ = d˜ z +βdt and not just d˜ z!
We can ask whether this will lead to a representation consistent with the original dz. We would have
dS = µSdt +σS(dψ) (9.7)
= µSdt +σS(d˜ z +βdt) (9.8)
= (µ +σβS)dt +σSd˜ z (9.9)
and then by Ito’s lemma we have for c(S, t),
dc =
c
t
+ (µ +σβ)Sc
S
+
1
2
σ
2
S
2
c
SS
dt +σSc
S
d˜ z (9.10)
=
c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
dt +σSc
S
(d˜ z +βdt) (9.11)
=
c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
dt +σSc
S
(dψ) (9.12)
And thus we see that dz and dψ = d˜ z + βdt are consistent in that they produce the same A and B
representation regardless of whether dz or dψ = d˜ z +βdt is the factor!
The upshot is that we can replace any Brownian factors dz by Brownians plus a drift d˜ z +βdt, and the
same absence of arbitrage prices hold!
Changing the Variance Does Not Work
Let’s try new factors given by the replacement
dz → dψ = ηd˜ z (9.13)
That is, I will replace the original Brownian factors by Brownians with a diﬀerent variance. Again, the new
factor should be considered to be the entire term dψ = ηd˜ z and not just the d˜ z. We can ask whether this
factor dψ = ηd˜ z is consistent with the original dz. We would have
dS = µSdt +σS (dψ) (9.14)
= µdt +σS(ηd˜ z) (9.15)
9.2. DO THE FACTORS MATTER? 91
and then by Ito’s lemma we have for c(S, t),
dc =
c
t
+µSc
S
+
1
2
σ
2
η
2
S
2
c
SS
dt +σSc
S
(ηd˜ z) (9.16)
=
c
t
+µSc
S
+
1
2
σ
2
η
2
S
2
c
SS
dt +σηSc
S
(dψ) (9.17)
(9.18)
Thus, we see that A is changed when we change the factor to another Brownian with a diﬀerent variance.
Therefore, this substitution is not allowed.
Similarly, in the case of multiple Brownian factors, changes in the correlations or the covariance structure
is not allowed. The above results lead us to the following principle.
(⋆) Arbitrage Invariance Principle for Brownian Motion: If a set of prices P is absence of arbitrage
under Brownian factors dz ∈ R
n
with E[dzdz
T
] = Σdt, then P is also absence of arbitrage if the factors dz are
replaced by dz → dψ = d˜ z +βdt with β arbitrary and where d˜ z are Brownian factors with E[d˜ zd˜ z
T
] = Σdt.
9.2.2 Poisson Factors
Changing the Intensity Works
Assume that the original factor is a Poisson Process dπ(t; α). Under dπ(t; α) we have
dS = µSdt + (σ −1)Sdπ(t; α) (9.19)
dc = (c
t
+µSc
S
)dt + (c(σS) −c(S))dπ(t; α). (9.20)
Now, let’s consider changing the factor to a Poisson with an altered intensity dψ = d˜ π(t; α + β) where
α +β > 0. In this case, we have
dS = µSdt + (σ −1)Sd˜ π(t; α +β) (9.21)
dc = (c
t
+µSc
S
)dt + (c(σS) −c(S))d˜ π(t; α +β) (9.22)
and we see that the A and B representations remain the same under dψ = d˜ π(t; α +β).
Adding a Drift Doesn’t Work
Again assume that the original factor is a Poisson Process dπ(t; α). Under dπ(t; α) we have
dS = µSdt + (σ −1)Sdπ(t; α) (9.23)
dc = (c
t
+µSc
S
)dt + (c(σS) −c(S))dπ(t; α) (9.24)
Now, let’s consider changing the factor dπ(t; α) → dψ = d˜ π(t; α) +ηdt. In this case, we have
dS = µSdt + (σ −1)Sdψ = µSdt + (σ −1)S(d˜ π(t; α) +ηdt) (9.25)
dc = (c
t
+ (µS + (σ −1)ηS)c
S
)dt + (c(σS) −c(S))d˜ π(t; α) (9.26)
and we see that there is no way to recover the original A and B representations under dψ.
(⋆) Arbitrage Invariance Principle for Poisson: If a set of prices P is absence of arbitrage under
Poisson factors dπ
i
(α
i
), then P is also absence of arbitrage under a diﬀerent set of factors dψ = d˜ π
i
(α
i
+β
i
)
where α
i
+β
i
is the new intensity.
92 CHAPTER 9. THE ROAD TO RISK NEUTRALITY
9.3 Risk Neutral Representations
The Brownian and Poisson factor invariance principles tell us that it is okay to alter or replace the factors in
certain ways. Why is this helpful? Because in some situations it is easier to price a derivative if we replace
the factors by something diﬀerent from the original factors. In fact, in this section we show that we can
always replace the factors and put all tradables in what we will call a risk neutral representation. The fact
that we can do this will ultimately lead us to the risk neutral pricing principle. The basic idea is that for any
pricing problem, we can replace the factors to create the risk neutral representation which leads to simpliﬁed
pricing formulas. But we are getting ahead of ourselves. First, let’s see what the risk neutral representations
are.
9.3.1 Brownian Factors
Consider the tradables table with Brownian factors dz
Prices
P
Changes
d
V
=
Factor Model
A
dt +
B
dz
(9.27)
and the absence of arbitrage condition is A = Pλ
0
+ Bλ. Now, we can substitute this into the tradables
table to obtain
Prices
P
Changes
d
V
=
Factor Model
Pλ
0
+Bλ
dt +
B
dz
(9.28)
Finally, we groups terms diﬀerently to obtain
Prices
P
Changes
d
V
=
Factor Model
Pλ
0
dt +
B
(dz +λdt)
(9.29)
Now, by the Brownian factor invariance under changes to the drift, we can replace dz → dψ = d˜ z − λdt,
which leads to
Prices
P
Changes
d
V
=
Factor Model
Pλ
0
dt +
B
d˜ z
(9.30)
This is the risk neutral representation. It states that we can ﬁnd a replacement set of factors so that all
tradables have a drift equal to the market price of time λ
0
, and absence of arbitrage prices under this
representation will be the same as under the original representation using the actual Brownian factors.
Note that to obtain this risk neutral representation, the factors were replaced by new factors that con
tained the market prices of risk! Let’s formalize this notion of a risk neutral representation.
(⋆) Risk Neutral Representation for Brownians: Let
Prices
P
Changes
d
V
=
Factor Model
A
dt +
B
dz
(9.31)
be a tradables table. Then, under arbitrage invariant substitutions of the factors dz → dψ = d˜ z − λdt, the
following tradable table will produce the same absence of arbitrage prices
Prices
P
Changes
d
V
=
Factor Model
Pλ
0
dt +
B
d˜ z
(9.32)
Equation (9.32) is called the risk neutral representation because all tradables have a drift equal to the market
price of time, regardless of how risky they really are.
9.3. RISK NEUTRAL REPRESENTATIONS 93
9.3.2 Poisson Factors
Consider the tradables table with Poisson factors dπ(t, α).
Prices
P
Changes
d
V
=
Factor Model
A
dt +
B
dπ(t; α)
(9.33)
and the absence of arbitrage condition is
A = Pλ
0
+Bλ (9.34)
Before proceeding, let’s think about the market price of risk for a Poisson factor dπ. Since a Poisson process
either does nothing or jumps up by 1, it is always good to hold a positive amount of a Poisson factor. All
the risk is on the upside. On the other hand, being short a Poisson factor is adding real (downside) risk.
Thus, we would expect the market price of risk for a Poisson factor to be negative. That is, if you are short
a Poisson factor (B is negative), then you should be rewarded for taking on that risk, and a negative market
price of risk would reﬂect that.
One can show this in a much more rigorous fashion, but for our purposes, let’s just note that λ < 0 for a
Poisson factor. I don’t like dealing with a negative quantity, so let’s deﬁne λ
′
= −λ and rewrite the absence
of arbitrage condition in terms of λ
′
as
A = Pλ
0
+Bλ (9.35)
= Pλ
0
+B(−λ
′
) (9.36)
= Pλ
0
−Bλ
′
(9.37)
where λ
′
> 0.
Now, we can substitute this into the tradables table to obtain
Prices
P
Changes
d
V
=
Factor Model
Pλ
0
−Bλ
′
dt +
B
dπ(t; α)
(9.38)
Now, by the Poisson factor invariance under changes to the intensity, we can replace dπ(t; α) → dψ =
d˜ π(t; α + β). Let’s choose β = λ
′
− α so that dψ = d˜ π(t; λ
′
) (here is where it is important that λ
′
> 0, so
that it can be an intensity!), which leads to
Prices
P
Changes
d
V
=
Factor Model
Pλ
0
−Bλ
′
dt +
B
d˜ π(t; λ
′
)
(9.39)
The ﬁnal step is to compensate the Poisson process. That is, we subtract oﬀ the mean of the Poisson process
so that the random factor has mean zero.
Prices
P
Changes
d
V
=
Factor Model
Pλ
0
dt +
B
(dπ(t; λ
′
) −λ
′
dt)
(9.40)
Since the random factor term now has zero mean, we can see that the drift of the value changes V for all
tradables is equal to the market price of time λ
0
. This was the same as in Brownian case.
Thus, for Poisson processes, we can replace the original Poisson factors by new Poisson factors with
diﬀerent intensities so that the drift of all tradables is the market price of time λ
0
(don’t forget to subtract
oﬀ the mean of the Poisson factors so that they have zero mean!). This is the risk neutral representation
under Poisson factors!
Note that in this case, the new intensity of the factors in the risk neutral representation is equal to
(minus) the market price of risk! Let’s make this formal.
94 CHAPTER 9. THE ROAD TO RISK NEUTRALITY
(⋆) Risk Neutral Representation for Poissons: Let
Prices
P
Changes
d
V
=
Factor Model
A
dt +
B
dπ(t; α)
(9.41)
be a tradables table. Then, under arbitrage invariant substitutions of the factors dπ(t; α) → dψ = d˜ π(t; λ
′
),
the following tradable table will produce the same absence of arbitrage prices
Prices
P
Changes
d
V
=
Factor Model
Pλ
0
dt +
B
(d˜ π(t; λ
′
) −λ
′
dt) .
(9.42)
Equation (9.42) is called the risk neutral representation because all tradables have a drift equal to the market
price of time, regardless of how risky they really are.
9.4 Pricing as an Expectation
The risk neutral representations lead to a powerful risk neutral pricing formula. This is because instead of
starting with the real tradables table, we can start with the risk neutral representation. The fact that the
drift of all tradables is equal to the market price of time leads us to a convenient new pricing formula. Let’s
see how it works for the Brownian case.
From the risk neutral representaion equation (9.32) we have
dV = λ
0
Pdt +Bd˜ z. (9.43)
Let’s assume that we are not dealing with a futures contract, so that V = P. Thus, we have
dP = λ
0
Pdt +Bd˜ z (9.44)
which is telling us that the drift of P is λ
0
when we use the factors based on d˜ z.
Now, via Ito’s lemma one can verify that
d
e
−
t
0
λ0(s)ds
P
= e
−
t
0
λ0(s)ds
Bd˜ z. (9.45)
Taking expectations of both sides gives
d
˜
E
e
−
t
0
λ0(s)ds
P
= 0 (9.46)
since ˜ z is a Brownian motion. (We also switched the d and the expectation.) Finally, integration of both
sides says that
P(0) =
˜
E
e
−
t
0
λ0(s)ds
P(t)
. (9.47)
We have arrived at risk neutral pricing.
(⋆) The Risk Neutral Pricing Principle: Absence of arbitrage prices are given by the formula
P(0) =
˜
E
e
−
t
0
λ0(s)ds
P(t)
(9.48)
where the expectation
˜
E(·) is taken under the risk neutral representation and λ
0
is the market price of time.
This risk neutral pricing principle applies in the Poisson case as well. Thus, this is a new pricing point
of view that follows from the factor approach! The presentation has been a little abstract to this point, so
let’s see how it would work in practice.
9.5. APPLICATIONS OF RISK NEUTRAL PRICING 95
9.5 Applications of Risk Neutral Pricing
Let’s see how the risk neutral pricing principle is used in a couple of familiar situations. But ﬁrst, let’s
outline how it is applied.
9.5.1 How to Apply Risk Neutral Pricing
Risk neutral pricing is applied using the following steps.
1. Via an arbitrage invariant substitution of the factors, convert the tradables table to its risk neutral
representation.
2. Apply the Risk Neutral Pricing Formula to the derivative security that is being priced.
That is it! Let’s clarify with some examples.
9.5.2 BlackScholes
Let’s see how risk neutral pricing applies to the BlackScholes setup. Recall that the bond and stock follow
dB = r
0
Bdt (9.49)
dS = µSdt +σSdz (9.50)
dc = (c
t
+µSc
S
+
1
2
σ
2
S
2
c
SS
)dt +σSc
S
dz. (9.51)
Now, according to the risk neutral pricing principle, we have the same absence of arbitrage prices if we set
all the drifts to the risk free rate (market price of time). Hence, we have
dB = r
0
Bdt (9.52)
dS = r
0
Sdt +σSd˜ z (9.53)
dc = r
0
cdt +σSc
S
d˜ z. (9.54)
Now, applying the risk neutral pricing formula to the call option c(S, t) gives
c(S(0), 0) =
˜
E
e
−r0T
c(S(T), T)
. (9.55)
But, if T is expiration, then we know that c(S(T), T) = (S(T) −K)
+
, so the risk neutral pricing formula is
c(S(0), 0) = e
−r0T
˜
E
(S(T) −K)
+
(9.56)
where the e
−r0T
was pulled out of the expectation because it is not random.
The expectation
˜
E(·) is taken under the risk neutral representation (i.e. S(t) follows (9.53), etc.). Per
forming this expectation leads to
d
1
=
ln(S/K) + (r
0
+
1
2
σ
2
)(T)
σ
√
T
(9.57)
d
2
= d
1
−σ
√
T (9.58)
c(S, 0) = SN(d
1
) −Ke
−r0T
N(d
2
) (9.59)
which is the BlackScholes formula! Thus, we were able to obtain a pricing formula without resorting to
partial diﬀerential equations!
96 CHAPTER 9. THE ROAD TO RISK NEUTRALITY
9.5.3 Poisson Model
Let’s try it for a Poisson model of Section 6.1.4. To be consistent with the approach above in Section 9.3.2,
I won’t compensate the original Poisson factor dπ(t; ν) as was done in Section 6.1.4. The tradables table is
B
S
c
¸
¸
d
B
S
c
¸
¸
=
r
0
B
µS
c
t
+µSc
S
¸
¸
dt +
0
(k −1)S
c(kS
−
) −c(S
−
)
¸
¸
dπ(ν)
and the market price of time is λ
0
= r
0
while the market price of risk is
λ
1
=
µ −r
0
k −1
. (9.60)
For the risk neutral representation we use minus the market price of risk
λ
′
1
=
r
0
−µ
k −1
(9.61)
and the risk neutral representation is
B
S
c
¸
¸
d
B
S
c
¸
¸
=
r
0
B
r
0
S
r
0
c
¸
¸
dt +
0
(k −1)S
c(kS
−
) −c(S
−
)
¸
¸
(dπ(λ
′
1
) −λ
′
1
dt) . (9.62)
Finally, we can apply the risk neutral pricing formula
c(S(0), 0) =
˜
E
e
−r0T
c(S(T), T)
. (9.63)
But, if T is expiration, then we know that c(S(T), T) = (S(T) −K)
+
, so the risk neutral pricing formula is
c(S(0), 0) = e
−r0T
˜
E
(S(T) −K)
+
. (9.64)
where the e
−r0T
was pulled out of the expectation because it is not random. In the risk neutral representation,
(9.62) determines the expectation
˜
E(·). Thus, computing the expectation in (9.64) gives
c(S, t) = SΨ(x, y) −Ke
−r0(T)
Ψ(x, y/k) (9.65)
where
Ψ(α, β) =
∞
¸
i=α
e
−β
β
i
i!
, y =
(r
0
−µ)kT
k −1
(9.66)
and x is the smallest nonnegative integer greater than
ln(K/S)−µ(T)
ln(k)
.
9.5.4 HJM
Recall the HJM model of Section 7.2.3. In this model, the underlying variables are given by the instantaneous
forward rates
dr(ts) = µ(ts)dt +σ(ts)dz(t). (9.67)
and the tradables are bonds that follow
dB(tT) =
B(tT)r(tt) −B(tT)
T
t
µ(ts)ds +
1
2
B(tT)
T
t
T
t
σ(ts)σ(tr)drds
dt
−
B(tT)
T
t
σ(ts)ds
dz
9.5. APPLICATIONS OF RISK NEUTRAL PRICING 97
If you recall from Section 7.2.3 this was extremely messy to deal with. So, let’s start over but with the risk
neutral perspective in mind.
Let’s start with the tradables that are the bonds. Since this is a single factor model, the risk neutral
representation under Brownians tells us that we may write the tradables in the form
dB(tT) = r(tt)B(tT)dt +ν(tT)B(tT)d˜ z (9.68)
where d˜ z is the risk neutral factor and r(tt) = r
0
(t) is the instantaneous short rate. Now, we can ask what
this implies about the instantaneous forward rates in the risk neutral world. Well, the relationship between
the bonds B(tT) and the instantaneous forward rates is
r(tT) = −
∂
∂T
ln B(tT). (9.69)
Then, by Ito’s lemma, we have
dr(tT) =
∂
∂T
1
2
ν
2
(tT) −r(tt)
dt −
∂
∂T
ν(tT)d˜ z (9.70)
=
∂
∂T
1
2
ν
2
(tT)
dt −
∂
∂T
ν(tT)d˜ z. (9.71)
If one lets
σ(tT) = −
∂
∂T
ν(tT) (9.72)
then
µ(tT) =
∂
∂T
1
2
ν
2
(tT)
= ν(tT)
∂
∂T
ν(tT) = σ(tT)
T
t
σ(ts)ds (9.73)
where
dr(tT) = µ(tT)dt +σ(tT)dz(t). (9.74)
These equations tell us that under the risk neutral representation, if we know the volatility of the instanta
neous forward rates (σ(tT)), then we can compute what the drift terms must be by equation (9.73).
Another way to get to this same result is to revisit the results of Section 7.2.3 equation (7.22) that said:
µ(tT) −σ(tT)
T
t
σ(ts)ds = σ(tT)λ
1
(9.75)
and note that in the risk neutral representation, we have a zero market price of risk λ
1
= 0 (because all
tradables earn the risk free rate). Thus, the above equation becomes
µ(tT) = σ(tT)
T
t
σ(ts)ds (9.76)
which is what we were looking for.
Thus, if we want to use the risk neutral representation, we would ﬁrst estimate the volatilities of the
instantaneous forward rates σ(tT) from market data, then use (9.73) to compute the risk neutral drifts.
Once we have this, pricing proceeds via expectations as in the risk neutral pricing formula. In the
HJM model, the expectation is often computed by Monte Carlo. That is, one simulates the risk neutral
representation of the instantaneous forward rates. Then, one can compute the payoﬀ value of a derivative,
and compute the expectation of it. This is the risk neutral priceing approach.
98 CHAPTER 9. THE ROAD TO RISK NEUTRALITY
9.5.5 Libor Market Model
The LMM model of Section 7.2.4 is also made quite simple by the use of risk neutrality. Recall that the
LMM is similar to the HJM model, but deals with forward rates between times T
i
and T
i+1
denoted by
R(tT
i
, T
i+1
) and that are assumed to follow
dR(tT
1
; T
2
) = a
1
R(tT
1
; T
2
)dt +b
1
R(tT
1
; T
2
)dz
2
. (9.77)
For notational simplicity, we write R
i
= R(tT
i
, T
i+1
) with
dR
i
= a
i
R
i
dt +b
i
R
i
dz
i
. (9.78)
Recall from equation (7.46) that the calibration relationship on the underlying R
i
variables is
−λ
i−1
τb
i−1
R
i−1
(1 +τR
i−1
)
=
¸
τa
i
R
i
−τb
i
R
i
¸
i
j=1
τbjRj
(1+τRj)
ρ
ij
1 +τR
i
¸
. (9.79)
Now, we can switch to the risk neutral representation by setting all the market prices of risk to zero λ
i−1
= 0.
This reduces the above equation to
0 =
¸
τa
i
R
i
−τb
i
R
i
¸
i
j=1
τbjRj
(1+τRj)
ρ
ij
1 +τR
i
¸
(9.80)
or by simplifying further
τa
i
R
i
−τb
i
R
i
i
¸
j=1
τb
j
R
j
(1 +τR
j
)
ρ
ij
= 0 (9.81)
or ﬁnally
a
i
= b
i
i
¸
j=1
τb
j
R
j
(1 +τR
j
)
ρ
ij
(9.82)
which is quite a bit simplier than what we started with.
Note that this is similar to what we had in the HJM risk neutral model. We can use market data to
estimate the b
i
terms (the volatility of the forward rates), and then use equation (9.82) to obtain their drifts
in the risk neutral representation. Pricing is then done by expectation.
9.6 Summary
The point of this chapter was to show that risk neutral pricing is a logical consequence of the factor approach
to derivative pricing. The idea was that details of the factors do not seem to appear in the factor APT
equations that we used throughout the book. That gave us the idea that perhaps we could change the
factors (as long as we didn’t disturb the basic factor coeﬃcient structure) and still arive at the same absense
of arbitrage price. By using a diﬀerent set of factors (that still preserved the same absense of arbitrage
prices) in many case we can simplify our calculations of the absense of arbitrage prices. This is the basic
notion of risk neutral pricing. In fact, there is quite a bit more that one can do when the full power and
generality of the risk neutral approach is explored. But that, my friends, is the subject of another little
book...
9.7 Problems
Problem 9.7.1 Verify equation (9.45).
Problem 9.7.2 Verify equatin (9.70).
Bibliography
[1] F. Black. The pricing of commodity contracts. Journal of Financial Economics, 3:167–179, 1976.
[2] F. Black and M. Scholes. The pricing of options and corporate liabilities. Journal of Political Economy,
81:637–659, 1973.
[3] A. Brace, D. Gatarek, and M. Musiela. The market model of interest rate dynamics. Mathematical
Finance, 7:127–154, 1997.
[4] J.C. Cox, J. Ingersoll, and S.A. Ross. A theory of the term structure of inteest rates. Econometrica,
53:385–467, 1985.
[5] J.C. Cox and S.A. Ross. The valuation of options for alternative stochastic processes. Journal of
Financial Economics, 3:145–166, 1976.
[6] R. Jarrow D. Heath and A. Morton. Bond Pricing and the Term Structure of Interest Rates: A New
Methodology. Econometrica, 60(1):77–105, 1992.
[7] J. Franklin. Methods of Mathematical Economics. SpringerVerlag, New York, 1980.
[8] Daniel T. Gillespie. Markov Processes: an introduction for physical scientists. Academic Press, 1997.
[9] S. L. Heston. A ClosedForm Solution for Options with Stochastic Volatility with Applications to Bond
and Currency Options. The Review of Financial Studies, 6(2):327–343, 1993.
[10] W. Margrabe. The value of an option to exchange one asset for another. Journal of Finance, 33(1):177–
186, 1978.
[11] Robert C. Merton. Option Pricing when the Underlying Stock Returns are Discontinuous. Journal of
Financial Economics, 5:125–144, 1976.
[12] L.C.G. Rogers and D. Williams. Diﬀusions, Markov Processes and Martingales: Volume 2, Ito Calculus.
Cambridge, 2000.
[13] S. A. Ross. The Arbitrage Theory of Capital Asset Pricing. Journal of Economic Theory, 59:341–360,
1976.
[14] G.E. Uhlenbeck and L.S. Ornstein. On the theory of brownian motion. Phys. Rev., 36:823–841, 1930.
[15] O. Vasicek. An Equilibrium Characterization of the Term Structure. Journal of Financial Economics,
5:177–188, 1977.
99
Index
APT, 32
Price Form, 35
Return Form, 32
Arbitrage
Price Implication, 35
Return Implication, 31
BlackScholes, 49, 95
Formula, 95
Hedging, 80
Brownian Motion, 1
Increment, 3
Calibration, 47
Compound Poisson Process, 7
CoxIngersollRoss, 25
Delta, 79, 84
Delta Hedge, 84
DeltaGamma Hedge, 85
Derivative, 42
Deﬁnition, 42
Dividends, 50
Factor Models
Via Ito’s Lemma, 43
Factors, 41
Futures
Derivative Pricing, 54
Gaussian Random Variable, 1
geometric Brownian Motion
BlackScholes, 49
Greeks, 85
delta, 79, 85
gamma, 85
rho, 85
Taylor Expansion, 85
theta, 85
vega, 85
HeathJarrowMorton, 66
Hedging, 77
BlackScholes, 80
Delta, 84
DeltaGamma, 85
Immunization, 80
Incomplete, 81
Immunization, 80
Implied Volatility, 86
Incomplete, 46, 81
Hedging, 81
intensity, 3
Ito’s Lemma
Obtaining Factor Models, 43
Jump Diﬀusion
Derivative Pricing, 55
Merton, 55
LiborMarketModel, 69
Market
Incomplete, 46
Market Price of Risk, 33
Market Price of Time, 33
Marketed Tradables, 44
Merton, 55
Money Market Account, 63, 64
Dynamics, 64
Normal Random Variable, 1
Null Space, 32
Relation to Range Space, 32
Option
European Call, 43
OrnsteinUhlenbeck, 24, 25
Perpendicular Space, 32
Poisson Process, 1, 3
Compound, 7
intensity, 3
Poisson Processes
Derivative Pricing, 53
Poisson Random Variable, 3
Price APT
100
INDEX 101
Application to Pricing, 44
Range Space, 32
Relation to Perp of Null Space, 32
Relative Pricing, 44
Risk Neutral, 89
Pricing, 95
Principle, 94
Risk Neutral Pricing
Risk Neutral Representation, 92, 94
Risk Neutral Representation, 92, 94
Short Rate, 64
Single Factor Models, 64
Vasicek, 64
Stochastic Process
Compound Poisson Process, 7
Tradables, 42
Marketed, 44
Tradables Table, 44
Underlying Variables, 41
Vasicek, 24, 64
Volatility
Implied, 86
2
Contents
1 Basic Building Blocks and Stochastic Diﬀerential Equation Models 1.1 Brownian Motion and Poisson Processes . . . . . . . . . . . . . . . . . . 1.1.1 Gaussian Random Variables . . . . . . . . . . . . . . . . . . . . . 1.1.2 Brownian Motion . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.1.3 Poisson Random Variables . . . . . . . . . . . . . . . . . . . . . . 1.1.4 Poisson Process . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.1.5 Increments of Brownian Motion and Poisson Processes . . . . . . 1.2 Stochastic Diﬀerential Equations . . . . . . . . . . . . . . . . . . . . . . 1.2.1 Diﬀerentials . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.2.2 The Diﬀerential . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.2.3 Compound Poisson Process . . . . . . . . . . . . . . . . . . . . . 1.2.4 Ito Stochastic diﬀerential equations . . . . . . . . . . . . . . . . . 1.2.5 Poisson Driven Diﬀerential Equations . . . . . . . . . . . . . . . 1.3 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.4 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 Ito’s Lemma 2.1 Ito’s Lemma . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.1.1 The chain rule of ordinary calculus . . . . . . . . . . . . . . . 2.1.2 Ito’s lemma for Brownian motion . . . . . . . . . . . . . . . . 2.1.3 Replacing dz 2 by dt . . . . . . . . . . . . . . . . . . . . . . . 2.1.4 Discussion of Ito’s lemma . . . . . . . . . . . . . . . . . . . . 2.2 Ito’s lemma for Poisson Processes . . . . . . . . . . . . . . . . . . . . 2.2.1 Interpretation of Ito’s lemma for Poisson . . . . . . . . . . . . 2.3 More versions of Ito’s Lemma . . . . . . . . . . . . . . . . . . . . . . 2.3.1 Ito’s Lemma for Compound Poisson Processes . . . . . . . . . 2.3.2 Ito’s Lemma for Brownian and Compound Poisson Processes 2.3.3 Ito’s Lemma for vector processes . . . . . . . . . . . . . . . . 2.4 Ito’s lemma, the product rule, and a rectangle . . . . . . . . . . . . . 2.5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.6 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 Standard Stochastic Diﬀerential Equations with Solutions 3.1 Geometric Brownian Motion . . . . . . . . . . . . . . . . . . . . . . . 3.1.1 Stock Price Interpretation . . . . . . . . . . . . . . . . . . . . 3.2 Geometric Poisson Motion . . . . . . . . . . . . . . . . . . . . . . . . 3.2.1 A conditional lognormal version of geometric Poisson Motion 3.3 A jumpdiﬀusion model . . . . . . . . . . . . . . . . . . . . . . . . . 3.4 A more general SDE . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.4.1 The OrnsteinUhlenbeck Process and Mean Reversion . . . . i . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1 1 1 3 3 3 5 5 6 7 7 8 9 10 11 11 11 12 13 14 14 15 16 16 16 16 17 18 19 21 21 21 22 23 23 23 24
. . . .6 Jump diﬀusion . . . . .5. . . . . . . . . . . . . . . . . . Summary . . . . . . . . . 5. . . . . . . . . . . . . . . . . . . . . . . . . . . . .3 A Useful Absence of Arbitrage Condition . .ii 3. . . . . . . . . . .5. . .2 A Classiﬁcation of Quantities . . . .6 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .3. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4. . . . . . . .5 Two standard examples . . . . . . . . . . . . . . . . .2 Pricing the Derivative . . . . . . . . . . . . . . . . 5. . . . . . . . . . . . . . . . . . . . . .7 Summary . . .1. . . . .1. . . 5. 5. . . . 4. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .7 Exchange one asset for another . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6. . . . . . . . . . . . . . . . . . .3. . . . . . . . . . . . . . . . . . . . . . . . . . . .2. . . . . . . . . 4.1 Arbitrage .4 A Derivative is a Tradable . . . . . . . . . . . . . . . . . . . .2. .1 Stocks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5. . . . . . . . . . . . . 5. . . . . . . 4.2 Proﬁt/Loss and Arbitrage . . . . . . . . . 4. . . . . . .1. . . . . . . . . . . . . .1 Price Changes and Arbitrage . . . . . . . . . .2 Futures contracts . . . . . . . .3. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4. . . . . . . . . . . .5. . . . . . . . .4 Poisson Processes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1. . . . . . . . . . . . . . . .3 Tradables . . .2 Returns and Factors Models . .1 Examples from Equity Derivatives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .5 Applying the Price APT . . . . . . . . . . . . . . . . . . 5. . . . . . . . . . . . . . . . .5. . . . . . . . . . .3. . . . . . . . . . . . . . . . . . . . .5 Options on Futures . . . . . . . . .2. . . . . 6. . . . . . . . . . . . . . . . . . . . . . . . . . . .3 Underdetermined and Overdetermined Systems 5. . . . . . . . . . . . . . . . 4. .5 A Problem with Returns . . . . . . . . . . . . . . . . . 5. . . . . . . . . . . . . . . . . . . . . . .4. . . . . . . . . . . . . . . . . .2 Dividend Paying Stocks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5. . . . . . . . . . . . . . 5. . . . .1. 6 Application of the Factor Form: Equity Derivatives 6. . . . . .2. . . . . . . . 4. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5. . .6 Three Step Procedure . . . .5 3. . . . . . 4. . . . . . . . . . . 4. . .1. . . . . . . . . . . . . . . 5. . . . . . . . . . . . . . . . . . . .2 Factor Models via Ito’s Lemma . . . . 6.1 Returns . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6. . . . . . . . .2 Underlying Variables . .2 Stochastic diﬀerential equations and factor models 4. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4. . . .4 Interpretations .1 The Factor Approach to Arbitrage Pricing . . . . . . . . . . . . . . . . . . . . . . . .1 Direct Factor Models . . . .3 Cash Dividends . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .3. . 4. . . . .1 BlackScholes . . . . . . . . . . . . . . . . 5.3 The Factor Approach to Arbitrage using Returns . . . 6. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .3. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .5. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4. . . . 6. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4. . . . . . . . . . . . . . . . . . .1 Introduction . . . . . . . . .4. . . . . . . . . . . . . . . . . . . 5 Constructing a Factor Pricing Framework 5. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1 Relative Pricing and Marketed Tradables . . .7 Problems . . . . .3. . . . . . . . . . . . . . . . 5. . . . . . . . . . . . . . .4 Tradables tables . . . . . . .2. . . . . . . . . . . . . . . . . . . . . . . .2 Null and Range Space Relationship . . . .4 The Factor Approach using Price Changes . . . .1. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1 Factors . . . . . . . . .2. Problems . . . . . . .6 3. . . . . . . . . . . . . . . . . . . . . .8 Problems . . . . . . . . . . 25 26 27 29 29 29 29 30 31 31 32 32 33 34 34 34 34 36 36 37 37 38 41 41 41 41 41 42 42 43 43 43 44 44 44 45 45 47 48 48 49 49 49 50 52 53 54 55 57 4 The Factor Approach to Arbitrage Pricing 4. . . . . . . . . . . . 6. . . . . . . . 4.7 CONTENTS CoxIngersollRoss Process . . . . . . . . . .3 Factor Models for Underlying Variables and Tradables 5. . . . . . . . . . . . .
. . . . . .4 Problems . . . . . . . 7. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .2 A DeltaGamma Hedge . . . 9. . . . . . . .2. . . . . . . . . .6 Problems . . . . . . . . . . . . . . . . . . 7. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iii 58 59 63 63 64 64 65 66 69 72 72 73 74 77 77 77 78 78 79 81 82 83 83 84 84 85 85 85 86 87 87 89 89 89 90 91 92 92 93 94 95 95 95 96 96 98 98 98 6. . . . . . . . . . . .2 Hedging from a Factor Perspective . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8. . . .2.1 Introduction . . . . 7. . . . . . . . .3 Poisson Model . . . . . . . . . . . . . . . . . .1 Description Using a Tradables Table . . . . . . . . . . . .3. . . . . . . . . . . . . . . . . . . . . . . . 9. .2 BlackScholes . . . . . . . . . . . . . . 7. . . . . . . . . . . . . . . . . . . . . .2.2. . . . .2. . . . . . . . . .1 How to Apply Risk Neutral Pricing . . . . . .2 MultiFactor Short Rate Models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8. . . . . . 9. .2. . . . . . . . . . . . . . . . . .1 BlackScholes Hedging . . . . . . . . . . . . . . . . . . . . 7. .3 Derivatives imply Small Changes . 9. . . . . . . . . . . . . . . . . . . . . .1. . . 8. . . . . . . . . . . . . of Default .2 Poisson Factors . . 8. . . . . Problems . . . . . . . . 9. .2 7 Application of the Factor Form: Interest Rate and Credit Derivatives 7. . . . . 8. . . . . . . . . . . . . . . . . . . . . . . 8. . .3 HeathJarrowMorton . . . . . . . . . . . . .5. . . . . . . . . . . . . . . . . . . . . . . . . . . . .5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8. . . . . . . . . . . . . . . . . . . . . . . . . . . . .3. .1 Single Factor Short Rate Models . . . . . . . . . . . . . . . . .2. . . . . . . .4 The LIBOR Market Model . . . .2 Defaultable Bonds with Random Intensity 7.3. . . . . . . . . . . . . . . . . . . . . .1 Brownian Factors . . . .2 The Relationship Between Hedging and Arbitrage . . . . . . . 7.1 The Greeks . . . . . . . . . . . . . . . . . . . . . . . . . . . .4 HJM . . . . . . . . . . . . . . . 9. . . . . . . . . . . . . . . 9. . . . . . . . . . . . . . . . . . .2. .5. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .3. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8. . . . . . . . . .3. . . . . . . . . .4. . . . . . . . . . . . . . . 7. . . . . . . . . . . . . . . . . . 7. . . . . . . . . .5 Applications of Risk Neutral Pricing . . . 9. . . . . . . . . . . . . . . . . . . . .4 Pricing as an Expectation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9 The Road to Risk Neutrality 9. . . . . . . . . . . . . . . . . . . .2 Interest Rate Derivatives . . . . . . . . . . . 9. . .3 Hedging Examples . . . . . . . . . . . .1 Brownian Factors . . . . . . . . . . . . . . . . . . . . . . . .5 Libor Market Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .2. . . . .2. . . . . . . . . . . . . . . . . . . . . . . . . . . . .5 A Question of Consistency . . . . . . . . . . . . . . . . . . . .7 Problems . . . . . . . . . . . . .5. . . . . . . . . . .8 Stochastic volatility . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .4. . . . . . . . . . . . . . . . . . . . . . . 9. . . . 9. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .CONTENTS 6. . . . . . 8. . . . . . . . . . . . . . . . . . . . . . . . . .5. . . . . 8. . . . .1 Defaultable Bonds . . . . . . . . . .3 Hedging from an Underlying Variable Sensitivity Perspective 8. . . . . . . 9. . . . . . . . . . . . . . . . . . . . . . . . . . . 8. . 9. . . . . . . . . .2. . . . 8 Hedging 8. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9. . . . . . . . . . . . .4. . . .5. . .2 Hedging Bonds . . . . . . . . . .1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . .2 Do the Factors Matter? . . . . . . . . . . . . . . . . . . . . . . . . . . 8. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1 Notation and the Money Market Account . . . . . . . . . .3 Determining what the error looks like . . . . . . . . .3 Risk Neutral Representations . . . . .3. . 9. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .4 Hedging under Incompleteness . .3. . . . . . . . . . . . . . . . . . . . . . . . . .3 Credit Derivatives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8. . . . . . . . . . . . . . . . . . . . 8. 8. . . . .4 Higher Order Approximations . . . . . .2 Poisson Factors . . . . . . . .6 Summary . . . . . . . . . . .
iv CONTENTS .
v . • B0 (t) .market price of risk.time t swap rate for swap dates {Ti }. • r(ts) . • S(t). • x− .time t forward price of a zero coupon bond with maturity T2 and face value $1 when delivery of the forward contract is at time T1 . • B(tT ) .forward interest rate between time T1 and T2 at time t.limit from the left: x(t−) = limh↑t x(h). • AT . • F (tT ) . • R(tT ) .time t forward price for contract with delivery at time T .notation for limit from the left: x(t−).the instantaneous short rate process at time t.Stock price at time t.time t futures price for contract with delivery at time T . • r0 . • R(tT1 .Poisson Process with intensity λ.Brownian motion. see the short rate process below). ∞). T2 ) . ∞). Second partial derivative of S with respect to t. Partial derivative of S with respect to x.time t price of a zero coupon bond with maturity T and face value $1 at time t • B(tT1 . • f (tT ) . • r . • π(t.the nonegative real line [0.a constant risk free rate of interest (when allowed to be a function of time. • S(t{Ti }) .a vector of returns.the instantaneous forward rate at time t between times s and s + ds.Notation • R . • x(t−) .spot rate for time T − t at the current time t.the time t value of the money market account.the transpose of a matrix A. • λ .the transpose of a vector x.the real line (−∞. ∂S ∂x 2 ∂ S ∂x2 • z(t) . • xT . T2 ) . • r0 (t) . λ) . • R+ . • St • Sx • Sxx ∂S ∂t Partial derivative of S with respect to t.
vi CONTENTS .
1 Brownian Motion and Poisson Processes Brownian motion and Poisson processes are our fundamental building blocks for creating models of asset prices.3) 1. We provide building blocks for modeling assets in the form of Brownian motion and Poisson processes. don’t let that fool you. (1.Chapter 1 Basic Building Blocks and Stochastic Diﬀerential Equation Models This chapter contains an introduction to the basic mathematics required for derivative pricing and ﬁnancial engineering. z(0) = 0. However. z(tn ) − z(tn−1 ) are independent for t1 ≤ t2 ≤ · · · ≤ tn . . .1) where µ ∈ Rn is the mean and Σ ∈ Rn×n is the covariance matrix: µ = E[X]. 1. and Poisson processes jump! We begin with Brownian motion which is built on the Gaussian random variable. Σ = E[(X − µ)(X − µ)T ]. With these two building blocks we create more complicated models by using Brownian motion and Poisson processes to drive diﬀerential equations (which are then known as stochastic diﬀerential equations). .2) (1. The key features are that Brownian motion has continuous sample paths (with probability 1).1 Gaussian Random Variables 1 (2π)n/2 Σ1/2 1 exp − (x − µ)T Σ−1 (x − µ) 2 An ndimensional Gaussian (Normal) random variable is a random variable with density function: X ∼ fX (x) = (1.2 Brownian Motion Brownian motion (also known as a Wiener Processes) is a stochastic process built upon the Gaussian random variable as follows.1.1. then you have received a powerful tool to add to your toolbox for problem solving. A realvalued stochastic process z(t) : t ∈ R+ is a Brownian Motion if: 1. z(t3 ) − z(t2 ). z(t) − z(s) ∼ N (0. . 1 . 1. The presentation here is tutorial and heuristic. 3. If you gain intuition from it. 2. z(t2 ) − z(t1 ). t − s) for t > s.
2CHAPTER 1. BASIC BUILDING BLOCKS AND STOCHASTIC DIFFERENTIAL EQUATION MODELS You should remember the following facts about Brownian motion, as they make Brownian motion an ideal building block for unpredictable but continuous asset price movements: • There exists a version of Brownian motion that has continuous sample paths. • Brownian motion is nowhere diﬀerentiable with probability 1. The ﬁrst property says that Brownian motion is appropriate for price processes that don’t jump. In many cases, price processes do jump, hence we will need to introduce the Poisson process next to model jumps. The second property can be interpreted in the context of predictability. If a curve is diﬀerentiable at a point, then that means that locally it can be approximated by a line, with the slope of the line being the derivative of the curve at that point. But this means that we can predict (to order dt) the future value of the curve. In ﬁnance, we often want to assume that we cannot predict future prices. Nondiﬀerentiability indicates that in the sense mentioned above, future prices are not predictable. Therefore, Brownian motion is an ideal building block upon which to build asset price processes. A sample path of Brownian motion is given in Figure 1.1.
Sample Path of Brownian Motion 0.4
0.2
0
−0.2 z(t) −0.4 −0.6 −0.8 −1 0
0.2
0.4 time
0.6
0.8
1
Figure 1.1: A typical sample path of Brownian motion. Just as there are vector Gaussian random variables, we can deﬁne a vector Brownian motion as follows. A vector Brownian motion z(t) ∈ Rn with covariance structure Σ ∈ Rn×n is a stochastic process satisfying 1. z(0) = 0. 2. z(t) − z(s) ∼ N (0, Σ(t − s)) for t > s. 3. z(t2 ) − z(t1 ), z(t3 ) − z(t2 ), . . . , z(tn ) − z(tn−1 ) are independent for t1 ≤ t2 ≤ · · · ≤ tn . Thus a vector Brownian motion is build upon the vector Gaussian random variable.
1.1. BROWNIAN MOTION AND POISSON PROCESSES
3
Brownian motion has continuous sample paths. That is too well behaved for some events we would like to model. For instance, market crashes, bankruptcy, etc. are often discontinuous price movements. Hence, we need a process that jumps! Poisson processes, which are built on the Poisson random variable, are what we are looking for.
1.1.3
Poisson Random Variables
λk exp(−λ) k!
A discrete random variable X taking values in the whole numbers is Poisson with parameter λ > 0 if P (X = k) = k = 0, 1, ... (1.4)
The mean of a Poisson random variable is E[X] = λ and the variance is V ar(X) = λ.
1.1.4
Poisson Process
A Poisson process is a stochastic process built on Poisson random variables as follows. A Poisson process with parameter (intensity) λ is a stochastic process π(t; λ) : t ∈ R+ that satisﬁes 1. π(0) = 0. 2. π(t) − π(s) is Poisson distributed with parameter λ(t − s) for t > s. 3. π(t2 ) − π(t1 ), π(t3 ) − π(t2 ), . . . , π(tn ) − π(tn−1 ) are independent for t1 ≤ t2 ≤ · · · ≤ tn . For us, the most important property of Poisson processes is that they jump! Hence, they are good models for market crashes, jumps, bankruptcy, and other unexpected discontinuous price movements. A typical sample path from a Poisson process with intensity λ = 1 is given in Figure 1.2. The parameter λ is often called the intensity (or sometimes the propensity) of the Poisson process. You can think of it as the expected number of jumps in a single time period. Alternatively, you expect to see a 1 single jump every λ time periods. Therefore, the larger the intensity, the more frequent the jumps. We will assume that a Poisson process is continuous from the right, and not the left. That is, at the exact time that a Poisson process jumps, it takes on the new value that it jumped to. Functions that are rightcontinuous and have leftlimits are called rcll functions (or cadlag or Rfunctions, etc). In a Poisson process, it is important to remember that at a jump time it takes on the new value, thus making sample paths of a Poisson process rcll functions.
1.1.5
Increments of Brownian Motion and Poisson Processes
Here are the intuitive pictures that I keep in mind when thinking of Brownian motion and Poisson Processes. Over a small ∆t Brownian motion and Poisson Processes can be thought of in simple and intuitive ways. We intuitively think of ∆t as a small increment in t. When dealing with a stochastic process X(t), we will also think of ∆X(t) as the change the occurs in X over a small time period ∆t. That is ∆X(t) = X(t + ∆t) − X(t). (1.5)
This notion of an increment of a stochastic process will guide our intuition. In this way, we can look at increments of Brownian motion and Poisson processes. Brownian Motion Over a small time ∆t, Brownian motion looks like ∆z(t) = z(t + ∆t) − z(t) ∼ N (0, ∆t) (1.6)
4CHAPTER 1. BASIC BUILDING BLOCKS AND STOCHASTIC DIFFERENTIAL EQUATION MODELS
Sample Path of a Poisson Processes with Intensity 1 5 4.5 4 3.5 3 π(t) 2.5 2 1.5 1 0.5 0 0 1 2 time 3 4 5
Figure 1.2: A typical sample path of a Poisson process. or, written slightly diﬀerently
√ ∆z(t) = ǫ ∆t
where ǫ ∼ N (0, 1)
(1.7)
where this follows from the second deﬁning property of Brownian motion. That is, a Brownian motion √ diﬀerential looks like a standard Gaussian multiplied by ∆t. Thus, we will often use E[∆z] = 0 and E[(∆z)2 ] = ∆t. An even simpler picture arises from a binary approximation √ √∆t w.p. 1/2 ∆z ≈ (1.8) − ∆t w.p. 1/2 where w.p. stands for ”with probability”. This is depicted in ﬁgure 1.3.
Figure 1.3: Binary model of an increment in Brownian motion .
λ) = π(t + ∆t. in a Poisson process. we see that in a Brownian motion the size of the move scales with the square root of ∆t. From the simple binary model approximations.9) (1. λ) = X A binary approximation to a Poisson process is ∆π(λ) ≈ 1 w.11) where dt is ”just a little bit of t”. regardless of how small ∆t is. 0 w. Over ∆t it is ∆π(t. On the other hand. The Poisson process is a good example. note the key diﬀerence between a Brownian motion and a Poisson process.12) (1. . Note that for a Poisson process.2. Thus.2. On the other hand. λ∆t 1 − λ∆t where X ∼ P oisson(λ∆t). ∆π is either 0 or 1 to order ∆t. Over small periods of time the probability of a jump is also small.4. Poisson processes jump when they move.12) has a problem when it comes to processes that jump that are assumed to be right continuous. we will present a simple intuitive approach to stochastic diﬀerential equations and stochastic diﬀerentials. In such a case we would write dX(t) = X(t + dt) − X(t) (1. from the binary model we see that the move size can always be 1. This is the essential diﬀerence between Brownian motion and the Poisson process.2 Stochastic Diﬀerential Equations A simple way to think of a stochastic diﬀerential equation is as a diﬀerential equation that is driven by a stochastic process. (1. Hence over short periods of time the move in a Brownian motion is also small. to quote Gillespie [8]. We will use this point of view here.10) A simple picture of this heuristic increment model is given in Figure 1.1. This is why Brownian motion has continuous paths. 5 (1. However. to avoid the technicalities of stochastic calculus. λ) − π(t.1 Diﬀerentials Roughly speaking. Figure 1.p. STOCHASTIC DIFFERENTIAL EQUATIONS Poisson Process A Poisson Process can also be approximated over a small time period ∆t. the probability of having a jump of size 1 scales with ∆t and is small if ∆t is small. Also. 1.4: Binary model of an increment of a Poisson process. Thus.p. the notion of a diﬀerential or inﬁnitesimal of a process is the idea that in an increment ∆X(t) = X(t + ∆t) − X(t) we can take ∆t to be inﬁnitesimally small. 1.
5. we need to adjust our notion of a diﬀerential of a stochastic process slightly to account for our convention. . That is π(s) = 1. However. then we wouldn’t have any problem. Since this is our convention. Hence.2. we make sure to capture jumps of the process. Now. and all other processes with jumps. That is π(t) = 0. lim π(h) = π(s) (1.13). We obtain dt↓0 lim dπ(t) = lim π(t + dt) − π(t) = π(t) − π(t) = 0 dt↓0 (1. Now. we know that a jump occurred at time s. now let’s consider deﬁning the diﬀerential of a Poisson process as dπ(t) = π(t + dt) − π(t) (1.13) h↓s A picture of this situation is shown in Figure 1. so we have adopted this convention. BASIC BUILDING BLOCKS AND STOCHASTIC DIFFERENTIAL EQUATION MODELS The Problem with Jumps We have to be very careful when a process has jumps and we assume right continuity of paths.6CHAPTER 1. we should think of the following where X(t−) = limh↑t X(h) is the limit from the left of X at time t. no matter how small dt is made. This means that for us.15) where this calculation followed by right continuity as deﬁned by equation (1. will be continuous from the right. we are implicitly taking the limit from the right. Here is the problem. Hence. Our convention will be to assume that at the exact time of the jump. assume that a jump occurs at time s. Therefore. Poisson processes. the Poisson process jumps to 1. it is common in the literature to assume processes are right continuous.5: A jump at time s. 1. and then when we add dt to the current time. let us take the limit of dπ(t) for any t (including s) as dt ↓ 0. Assume that a Poisson process is currently at 0. Figure 1. so intuitively we should have dπ(s) = 1. If we had assumed left continuity. By using the limit from the left.14) where dt > 0.16) The solution to the above problem is that for a diﬀerential. However. But this indicates that π never jumps! Something must be wrong!! What is wrong is that we have assumed that π is right continuous. we are guaranteed never to capture the jump! This is purely a problem that arises from our convention to assume that Poisson processes are right continuous.2 The Diﬀerential dX(t) = X(t + dt) − X(t−) (1.
22) For this reason.p. λ) − π(t−. The most useful for us is the Ito stochastic integral. STOCHASTIC DIFFERENTIAL EQUATIONS 7 We will develop this point of view (which unfortunately can’t be made rigorous. reviewing from above. 1..19) dz ≈ − dt w. 0 w. an inﬁnitesimal model of a compound Poisson process can be thought of as Y dπ(λ) ≈ Yi 0 w. λ). Let π(t. they jump up by 1. giving √ √dt w. I will not delve into the depths of the stochastic integral (because often people are never able to return!). (1. λ) iid copies of Y . w.2. Y2 at time t2 .p. at time t it is the sum of π(t. . . λ).4. and in both cases.4.. we simply replace ∆t by dt in (1. but provides the proper intuition). for a Poisson process. z(t−) = z(t). This process can be written as π(t.6. we may write dπ Y = Y dπ. where π(t.17) Note that since Brownian motion has continuous sample paths. 1 − λdt (1.. we represent the diﬀerential form of a compound Poisson process by Y dπ(t. but merely provide the intuition that you should take away when considering stochastic diﬀerential equations. (1. 1/2 (1.24) and a heuristic inﬁnitesimal picture of this is given in Figure 1. where Y1 . λ) = π(t + dt. λdt 1 − λdt (1. by assigning jump Y1 at time t1 . (1.2. That is.8) and (1. Construct a new process π Y (t. (1. λdt .21) That is.p. λ) be a Poisson process with jump times t1 . t2 . Hence. For the diﬀerential. At this point. We can generalize this and allow them to jump randomly.2.p. we should think of diﬀerentials as dπ(t.λ) π (t. λ).3 Compound Poisson Process When Poisson processes jump. Those binary models provide the proper intuition.4 Ito Stochastic diﬀerential equations Stochastic integrals can be deﬁned in diﬀerent ways. Processes of this form can also conveniently be written as integrals.λ) t π Y (t. with Brownian motion we would have: dz(t) = z(t + dt) − z(t) ∼ N (0. sums of them will limit as Brownian motion or a Poisson process. λ) = i=0 Y Yi .p. λ) = i=0 Yi = 0 Ys dπ(s. λ) is a standard Poisson process. 1/2 and dπ(λ) ≈ 1 w. Don’t forget that we also have the binary model approximations of Figures 1.p. However. etc.10). .18) in order to make sure that we capture jumps. Y2 . are iid random variables. λ) ∼ P oisson(λdt) (1..20) 1..1.23) Following along the lines of the binary approximation to a Poisson process as in Figure 1. dt). π(t.3 and 1.
t) as related to the instantaneous volatility as follows: E[dxx(t)] = = = E[a(x(t). This is important! It allows us to do the following simple calculations of the instantaneous drift and variance.) We will interpret this equation as follows: x(t + dt) − x(t) = a(x(t). That is.28) (1. 1. t)dt + b(x(t−).30) (1. t) determines the instantaneous variance of x. t)dt Hence. t)dz(t) (1. t) are evaluated at time t. (1.29) Therefore. BASIC BUILDING BLOCKS AND STOCHASTIC DIFFERENTIAL EQUATION MODELS Figure 1. I will ignore the technical conditions that must be placed on a and b in order to make such an equation well deﬁned. t)Y dπ(t. t)(z(t + dt) − z(t)). t)dt)2 x(t)] = = = E[b(x(t). we can compute the instantaneous variance of x as follows E[(dx − a(x(t).25) where in this case. b2 (x(t). t)dt + b(x(t). t) determines the instantaneous drift. (At this stage. (1. t) as related to the instantaneous drift and b(x(t). they are independent of dz(t) = z(t + dt) − z(t). On the other hand. t)dt + b(x(t). x(t−) is the limit from the left at time t. t) and b(x(t).27) (1. and a(x(t). it is being driven by Brownian motion z(t). By x(t−) we mean x(t−) = limh↓0 x(t−h).2. t)dt + b(x(t).6: Inﬁnitesimal model of a compound Poisson process. t)dt.5 Poisson Driven Diﬀerential Equations We can also drive a diﬀerential equation by a Poisson process dx(t) = a(x(t−).31) (1. t)dz(t)x(t)] a(x(t). a(x(t).26) Since z(t) has independent increments. t)E[dz(t)x(t)] a(x(t).8CHAPTER 1. t)2 dz(t)2 x(t)] b2 (x(t). A stochastic diﬀerential equation will be written as: dx(t) = a(x(t).32) b2 (x(t).33) where note that we have written x(t−) in the arguments of a and b. λ) (1. We will assume that a and b are left continuous in the t . Instantaneous Drift and Variance We can interpret a(x(t). t)dt + b(x(t). t)E[dz(t)2 x(t)] (1.
1.3. SUMMARY
9
argument so that we may use t instead of t− in the second argument of a and b. We will also sometimes use the notation x− when we want to suppress the argument t, or even a− when suppressing the arguments of a. The reason for using limits from the left is that in a Poisson process, we interpret our diﬀerential as dπ(t) = π(t + dt) − π(t−) and for the Ito integral, we assume that the coeﬃcients a and b are evaluated at the point in time that the diﬀerential starts from. This is t−. This limit from the left is also important in a and b because we want a and b to be independent of dπ. The only way we can do this is to make sure that we use left limits. Note that this means that if π(t) jumps at time t, which also causes a jump in x at time t, we evaluate x(t−) in a and b which immediately preceeds the jump. With that established, once again, we can compute the instantaneous mean and variance: E[dx(t)x(t−)] = = = E[a(x(t−), t)dt + b(x(t−), t)Y dπ(t)x(t−)] a(x(t−), t)dt + b(x(t−), t)E[Y dπ(t)x(t−)] a(x(t−), t)dt + b(x(t−), t)E[Y ]λdt (1.34) (1.35) (1.36)
Hence, in this case, the dπ(t) term can contribute to the instantaneous mean. This can make things messy! It is often nicer to think of the ﬁrst term as the ”mean” term, and the second as the ”risk” term. To do this, we would like the second term to have zero instantaneous mean. Hence, we will often ”compensate” the Poisson process to give it zero mean. This is done by simply subtracting oﬀ the instantaneous mean from the second term and adding it to the ﬁrst. dx(t) = (a(x(t−), t) + b(x(t−), t)E[Y ]λ)dt + b(x(t−), t)(Y dπ(t) − E[Y ]λdt) Then we can also compute the instantaneous variance: E[(dx(t) − (a(x(t−), t) + b(x(t−), t)E[Y ]λ)dt)2 x(t−)] (1.37)
= b2 (x(t−), t)V ar(Y dπ(t))
= E[b2 (x(t−), t)(Y dπ(t) − E[Y ]λdt)2 x(t−)]
= b2 (x(t−), t)E[Y 2 ]λdt + O(dt2 )
= b2 (x(t−), t)(E[Y 2 ](λdt + λ2 dt2 ) − E[Y ]2 λ2 dt2 )
= b2 (x(t−), t)(E[Y 2 ]E[dπ(t)2 ] − E[Y ]2 λ2 dt2 )
= b2 (x(t−), t)(E[Y 2 dπ(t)2 ) − E[Y ]2 E[dπ(t)]2 )
Hence, to order dt, the instantaneous variance is given by b2 (x(t−), t)E[Y 2 ]λ.
1.3
Summary
Brownian motion (built upon the Gaussian random variable) and the Poisson Process (built upon the Poisson random variable) are the basic building blocks used to create models of prices. In particular, we use these two processes to drive diﬀerential equations and that will allow us to capture a wide range of price phenomena. Due to the continuity of Brownian motion, it is good for modeling price paths and variables that do not jump. On the other hand, Poisson processes are an essential building block for modeling jumps in price processes or variables. Much intuition can be gained from simple ”incremental” and ”diﬀerential” models of processes and stochastic diﬀerential equations. The simple binary approximations to Brownian motion and Poisson processes are enough to correctly guide your intuition in the vast majority of cases. Thus, for modeling purposes, make sure you have a solid understanding of these two building block processes.
10CHAPTER 1. BASIC BUILDING BLOCKS AND STOCHASTIC DIFFERENTIAL EQUATION MODELS
1.4
Problems
dπ = 1 wp. 0 wp. λdt 1 − λdt (1.38)
Problem 1.4.1 Verify that our inﬁnitesimal model of a Poisson process over small time dt:
has a mean and variance that agree with a Poisson random variable with parameter λdt to order dt. Problem 1.4.2 Poisson Processes Consider the time interval [0, 1]. Chop this time interval into n parts of equal length. Over each interval deﬁne the independent and identically distributed random variables Xi where Xi = Let Y =
i=1
1 w.p. 0 w.p.
n
λ/n 1 − λ/n Xi
(1.39)
(1.40)
(a) What is P r(Y = 0)? (b) In your answer in (a), take the limit as n → ∞. What do you get? (b) What is P r(Y = 1)? (c) Again take the limit. What is your answer? (d) Now consider an arbitrary but ﬁxed k with k < n. What is P r(Y = k). (e) Again take the limit as n → ∞, and show that this converges to the Poisson random variable. (You √ 1 will probably want to use Stirling’s formula n! ∼ 2πe−n nn+ 2 . This calculation is a bit tricky!) (Note: In this problem we converge to a Poisson random variable with parameter λ since we took the time interval to be 1. If the time interval is t, we will converge to a Poisson random variable with parameter λt. As a function of t, we arrive at a Poisson process.) Problem 1.4.3 Poisson Processes again. Consider the following Markov chain. Let the state space be the whole numbers x = 0, 1, 2, .... Consider the following transition probabilities over the time instant dt: P r(x(t + dt) = nx(t) = n) P r(x(t + dt) = n + 1x(t) = n) Let pn (t) = P r(x(t) = n). (a) Write down a diﬀerential equation for p0 (t). (hint: to derive a diﬀerential equation, consider the amount of probability that ﬂows into and out of the state x = 0 over time dt.) (b) Assume p0 (0) = 1 (that is, at time zero, x = 0 with probability 1). Solve the diﬀerential equation for p0 (t). (c) Derive a diﬀerential equation for pn (t), n > 0. Given your answer in (a), solve for p1 (t). Explain how you could solve for pn (t) for any n. (Note: again we have arrived at a Poisson process, but this time through Markov chain theory. A Poisson process is an example of a continuous time Markov process, and the set of diﬀerential equations you derived is the ”forward equation” for this process.) = = λdt 1 − λdt (1.41) (1.42)
Chapter 2
Ito’s Lemma
2.1 Ito’s Lemma
Ito’s lemma is the chain rule for stochastic calculus. In this chapter we present Ito’s lemma for Brownian motion and Poisson processes. It has been said that all of math ﬁnance can be done with just the knowledge of Ito’s lemma. To you, this means that you should make sure that you know (and understand) Ito’s lemma. In what follows, we will present versions of Ito’s lemma for Brownian motion and Poisson processes.
2.1.1
The chain rule of ordinary calculus
In ordinary calculus, here is how the chain rule works in conjunction with a diﬀerential equation. Let x(t) follow the diﬀerential equation dx = a(x, t). (2.1) dt Now consider a function of x(t) and t. Let’s call this function f (x(t), t). Assuming that f is diﬀerentiable, we can ask what the derivative of f is. To calculate it, we simply apply the chain rule df (x, t) ∂f dx ∂f dx = + = fx + ft dt ∂x dt ∂t dt where we are using the notation fx =
∂f ∂x
(2.2)
dx dt
and ft =
∂f ∂t .
Finally, we can substitute in for
from (2.1), giving (2.3)
df (x, t) ∂f ∂f = a(x, t) + = a(x, t)fx + ft . dt ∂x ∂t
This is a fairly straightforward calculation. However, when dealing with stochastic diﬀerential equations, the simple chain rule of ordinary calculus does not work. The reason is simple. Brownian motion is not diﬀerentiable so we can’t really take its derivative or the derivative of any function of Brownian motion. Also, Poisson processes jump, and at these jumps it is not even continuous, let alone diﬀerentiable. Thus, for stochastic diﬀerential equations we need to develop the correct mathematics for dealing with a function of a variable that follows a stochastic diﬀerential equation. The guiding result is known as Ito’s lemma. Multivariables Taylor Series Expansions Before diving into Ito’s lemma, you should make sure that you recall your multivariables Taylor series expansions to second order. This is extremely important! Ito’s lemma for Brownian motion is basically just a modiﬁed Taylor expansion to second order. So, let’s recall up to second order, the Taylor series expansion 11
t)fxx )dt + b(x. . . the Taylor series expansion is df where fx = [fx1 . .e.6) (or (2. fx . t) be a twice continuously diﬀerentiable function of x and t. . Consider writing the Taylor expansion of df . Ito’s lemma for Brownian motion boils down to nothing more than substituting dx = adt + bdz in the Taylor expansion of (2. . t)fx dz.6) and the arguments of ft .5) Now. . Operationally. fxx 1 1 = ft dt + fx dx + ftt dt2 + dxT fxt dt + dxT fxx dx + .8) A useful trick in the multivariable case is to note that dxT fxx dx = T r(dxT fxx dx) = T r(fxx dxdxT ) (2. f (x. . 2 2 (2.. The following result is Ito’s lemma when x(t) is a process governed by a stochastic diﬀerential equation driven by Brownian motion. t) = (ft + a(x.. t0 )(x − x0 )2 + . (⋆) Ito’s Lemma for Brownian Motion: Consider the stochastic diﬀerential equation (SDE) dx = a(x. Hence. . and then throwing away terms of order higher than dt. t0 )(t − t0 )2 + fxt (x0 . fx1 xn . ITO’S LEMMA f (x0 . have been supressed. Let’s see how it works in more detail. interpreting terms such as dz 2 . t0 )(x − x0 )(t − t0 ) + fxx (x0 .1.9) where T r(·) is the Trace of a matrix (i. the sum of the diagonal elements).. t0 ) + ft (x0 . t + dt) − f (x(t). 2 2 (2. In the multivariable case when x ∈ Rn . t0 ). dt = t − t0 .11) (2. t) − f (x0 . 2. t) around a point (x0 . 2 2 . fxn xn (2. . .. and df = f (x. . .. t0 )(x − x0 ) 1 1 + ftt (x0 . 2 2 fx1 x1 . Then 1 df (x. then you are right.. Ito’s lemma allows us to compute the diﬀerential of a function of x(t) and t.. = .. . 2 (2. t)dt + b(x.2 Ito’s lemma for Brownian motion Given the diﬀerential of x(t). . t)dz and let f (x. fxn ].7) in the multivariable case) for dx. . df = f (x(t + dt). we will typically denote dx = x − x0 . fxn x1 .12 of a function f (x. t)fx + b2 (x. it is the ”chain rule” for stochastic diﬀerential equations. so that a Taylor series expansion is df = 1 1 ft dt + fx dx + ftt dt2 + fxt dxdt + fxx dx2 + .10) ”Heuristic Proof”: I will suppress the arguments of a and b for convenience. If that sounds simple. t0 )..4) (2. t) = CHAPTER 2. .7) (2. t0 )(t − t0 ) + fx (x0 . . t) 1 1 = ft dt + fx dx + ftt (dt)2 + fxx (dx)2 + fxt dxdt + .
then S(T. the mean is T . then by integrating we would have T T dz 2 = 0 0 dt = T. ∆t) is a random variable since it involves z(t). This does the trick. ∆t) is equal to T . (2. Let’s approximate the integral above by the sum T ∆t −1 T S(T. The most glaring was replacing dz 2 by its expectation dt. we will substitute in for dx using dx = adt + bdz which gives df = = 1 1 ft dt + fx (adt + bdz) + ftt (dt)2 + fxx (adt + bdz)2 + fxt (adt + bdz)dt + . ∆t) = V ar i=0 (z((i + 1)∆t) − z(i∆t)) 2 = i=0 V ar((z((i + 1)∆t) − z(i∆t))2 ).2. In this ”derivation” there were a couple of dubious steps. (When we show that the variance approaches zero. we are trying to show that the random variable S(T. ITO’S LEMMA Next. Hence.. let us see if this makes sense. 0 (2... (2. ∆t) converges to the constant T .13) Hence. Let’s see why this was a reasonable thing to do .. ∆t): T ∆t −1 E[S(T. The standard √ deviation of dz is of order dt..1. since a random variable with zero variance must be a constant equal to it’s mean. (2..15) Hence. But note that S(T. 2 2 13 Now we take a crucial step. the claim is that as ∆t → 0. (2. let’s ﬁrst compute the mean of S(T. ∆t) → T .) Computing the mean: Okay.14) Now. we are proving convergence in mean square. ∆t)] = E i=0 (z((i + 1)∆t) − z(i∆t)) 2 T ∆t −1 T ∆t −1 = i=0 E (z((i + 1)∆t) − z(i∆t)) 2 = i=0 ∆t = T.12) 2. 2 2.. ∆t). ∆t) = i=0 (z((i + 1)∆t) − z(i∆t))2 ≈ dz 2 . If we can say that dz 2 = dt.16) . To do this. we think of dz as being of order dt1/2 and only keep terms up to order dt yielding df = 1 ft dt + fx adt + fx bdz + fxx b2 dz 2 . we will show that the mean of S(T. and its variance approaches zero. 2 2 1 1 2 ft dt + fx adt + fx bdz + ftt (dt) + fxx (a2 dt2 + 2abdtdz + b2 dz 2 ) + fxt (adt2 + bdzdt) + . or L2 (P ). By independent increments T T ∆t −1 ∆t −1 V ar(S(T ). 2 Finally we replace dz 2 by it’s expectation dt which leads to Ito’s lemma 1 df = (ft + afx + b2 fxx )dt + bfx dz.1.3 Replacing dz 2 by dt Here is a simple argument as to why it is reasonable to replace dz 2 by dt. Therefore. Computing the variance: Now let’s compute the variance of S(T. and only keep terms up to order dt using the following logic..
Let’s see an example of Ito’s lemma applied to socalled geometric Brownian motion. We are used to doing things like this from ordinary calculus. This is the essential argument that allows us to use dz 2 = dt and a simpliﬁed version of the argument behind a real derivation of Ito’s lemma.2 Ito’s lemma for Poisson Processes Ito’s lemma for Brownian motion is more subtle that Ito’s lemma for Poisson Processes. (2. Then. on the other hand. ITO’S LEMMA But since z((i + 1)∆t) − z(i∆t) is Gaussian with mean zero and variance ∆t. the limit of the variance of S(T.4 Discussion of Ito’s lemma At the risk of overdoing an attempt to provide intuition behind Ito’s lemma. That means that the limit (in L2 (P )) is a constant and equal to the mean T . starts the race and steadily works his way toward the ﬁnish line. Ito’s lemma in this case is merely an application of the LebesgueStieljies calculus.1 Let dx = axdt + bxdz and consider f (x) = ln(x). backwards. In Ito’s lemma. It is quick. Example 2. On the other hand.19) 1 1 where we have used that ft = 0. according to Ito’s lemma. but now that we are dealing with stochastic processes. It marches forward at a constant dt rate. and fxx = − x2 . ∆t)) = i=0 V ar((z((i + 1)∆t) − z(i∆t)) ) = 2 i=0 2(∆t)2 = 2T ∆t → 0. but runs forward and backwards and easily gets oﬀ track.18) (2. This means that moves in dz are usually much larger than dt. we kept terms up to order dt in the Taylor expansion. just like we don’t know whether the tortoise or the hare will win the race! 2. which is based on the story of the tortoise and the hare. the standard deviation of dz is dt1/2 . In the end of the story. both the dt and dz terms contribute to the stochastic diﬀerential equation and neither term is guaranteed to dominate. and this allows us to deﬁne the stochastic integral pathwise. The tortoise being slow. I appeal to the wisdom of Gillespie [8]. Hence. we might question this step.1. 2. But its direction is random.1. V ar(S(T. and jumps around like order dt1/2 . fx = x .17) Hence. As the story goes. the tortoise and the hare race each other.) . Note that the above argument has much of the ﬂavor of the weak law of large numbers. He is much faster than the hare. the deterministic dt drift term is like the tortoise. He gave the following explanation. In particular. The hare. Why doesn’t this dz term completely dominate and even allow us to ignore terms of order dt? Again. we have V ar((z((i + 1)∆t) − z(i∆t))2 ) = = = Therefore as ∆t → 0. f (x) satisﬁes df = = 1 (ft + axfx + b2 x2 fxx )dt + bxfx dz 2 1 2 (a − b )dt + bdz 2 (2. ”What does this have to do with Ito’s lemma and stochastic diﬀerential equations?” Well. Together. is quick and jumpy. Now your asking. (The key diﬀerence is that Poisson processes have sample paths of ﬁnite variation. ∆t) is zero. the dz term is like the hare. we have T ∆t −1 T ∆t −1 E[(z((i + 1)∆t) − z(i∆t))4 ] − E[(z((i + 1)∆t) − z(i∆t))2 ]2 3(∆t)2 − (∆t)2 2(∆t)2 . Some of the time it jumps forward and other times. the tortoise wins the race. I will leave you with the following thoughts.14 CHAPTER 2.
t + dt) + (ft + a− fx )dt + O(dt2 ) f (x− + a− dt + b− dπ. t) − − (2. t + dt) − f (x− . as dt → 0. That is it! Let’s see an example of this. t + dt) → (f (x− + b− . df = = f (x(t + dt).27) This occurs with probability λdt. t) be a continuously diﬀerentiable function of x and t. When there is a jump we have dπ = 1 and f (x− + a− dt + b− .28) 2. t)dπ let f (x. Therefore. t + dt) − f (x− . we can think of the eﬀect of the a− dt term as overall being of order dt2 . t) and b− = b(x(t−). df = (2. t + dt) Now the ﬁnal two terms don’t have a jump in them. so I will add and subtract a term that doesn’t contain the jump df = f (x− + a− dt + b− dπ. combining the above arguments gives f (x− + a− dt + b− dπ. Since the a− dt term in the argument is of order dt.1 Interpretation of Ito’s lemma for Poisson Ito’s lemma for Poisson processes simply says that when the Poisson process doesn’t jump.29) (2. (2. . however with jumps dx can be large! Note how this plays into our derivation.2. t) = (ft + a(x− . Then df (x.2. to order dt overall this term is replaced by f (x− + a− dt + b− . 15 (2. t) (2. t + dt) − f (x− + a− dt. t))dπ which completes the derivation of Ito’s lemma. t). t) (2. t). t + dt) − f (x .20) (2.21) The real derivation of this is using LebesgueStieljies calculus. t + dt) + f (x− + a− dt. I don’t like this term. t + dt) − f (x− + a− dt. This time when we consider the diﬀerential df we have to be careful because xt jumps! Taylor expansions work well as an approximation when dx is small. 2 (2. Hence. t))dπ. Now. but once again I will provide a nice heuristic argument. so I will approximate them using ordinary calculus. t + dt) − f (x− + a− dt. we note that possible jumps come from dπ. t) − f (x− .23) where a− = a(x(t−). ”Heuristic Proof”: We start by writing out the diﬀerential and substituting in for dx.2. ITO’S LEMMA FOR POISSON PROCESSES (⋆) Ito’s Lemma for Poisson Processes: Given a Poisson stochastic diﬀerential equation (SDE) dx = a(x− .26) When their is no jump in dπ. t + dt) − f (x− + a− dt.25) Now let’s analyze the ﬁrst two terms (2. t) − f (x− . t)fx )dt + (f (x− + b(x− . t). then overall. the move in f is determined completely by the jump. t + dt) − f (x− + a− dt. When it jumps. this term is zero. use ordinary calculus. t + dt) → f (x− + b− .22) − f (x(t−) + a dt + b dπ. t + dt). No jumps occur with probability 1 − λdt. t + dt) − f (x− + a− dt. t)dt + b(x− . t) − f (x− .24) f (x− + a− dt + b− dπ.
t) be a continuously diﬀerentiable function of x and t. t) = (ft + a(x− .3. (⋆) Ito’s Lemma for Compound Poisson Processes: Given an SDE dx = a(x− .30) (2. t)fx + b(x− . 2. 2.32) 1 where we have used that ft = 0 and fx = x . (⋆) Ito’s for Brownian and Poisson Given a Poisson stochastic diﬀerential equation (SDE) dx = a(x− . Ito’s lemma is modiﬁed slightly as follows.3.1 Let dx = axdt + (b − 1)xdπ and consider f (x) = ln(x). t)dz + Y dπ let f (x. Then df (x. t)fx dz + (f (x− + Y. t))dπ This version of Ito’s lemma can be derived in a manner similar to Ito’s lemma for Poisson processes.2 Ito’s Lemma for Brownian and Compound Poisson Processes If we combine Brownian motion and compound Poisson processes.1 Ito’s Lemma for Compound Poisson Processes When we are using a compound Poisson process.34) (2.35) 2.36) (2. t))dπ 2 Again. t) = (ft + a− fx )dt + (f (x− + Y b− . Heuristic derivations follow along the lines of those for Brownian motion and the Poisson process. f (x) satisﬁes df = = = adt + (ln(bx ) − ln(x ))dπ adt + ln(b)dπ (ft + fx ax)dt + (f (x− + (b − 1)x− ) − f (x− ))dπ − − (2. t) − f (x− . and through some of the problems at the end.37) (2. t) − f (x− . t)2 fxx )dt + b(x− . t)dt + b(x− .3 More versions of Ito’s Lemma In this section I will present other useful versions of Ito’s lemma. t) be a twice continuously diﬀerentiable function of x and t.38) .31) (2.3. ITO’S LEMMA Example 2. (2. then we have the following results (⋆) Ito’s for two correlated Brownians Given the stochastic diﬀerential equations dx1 dx2 = = a1 dt + b1 dz1 a2 dt + b2 dz2 (2. It is worthwhile working your way through a couple of them. then we have the following result. By Ito’s lemma for Poisson processes. t)Y dπ Let f (x. this can be shown with arguments similar to those given above. (2. t)dt + b(x− .2.3 Ito’s Lemma for vector processes If we have multiple Brownian motions. Then 1 df (x.33) 2.16 CHAPTER 2.
this is the beginnings of the integration by parts formula. x2 . Then 1 df = ft + fx a + T r(fxx BΣB T ) dt + fx Bdz (2. roughly speaking. t) be twice continuously diﬀerentiable function of x1 . fxn x1 ··· . Then df (x1 . . Here it is. . x2 .4. Let’s try to ”derive” this formula by a simple ”rectangle” argument. t).e. fxn xn 2. Ito’s lemma would follow from the intuition of ordinary calculus. fxn ] and fxx fx1 x1 . x2 . x2 . = . Without this term. b2 = b2 (x1 . Let me give a little argument (which appears in Rogers and Williams [12]) to try to convince you that Ito’s lemma is actually more intuitive than ordinary calculus. t) ∈ Rn . a2 = a2 (x1 . t) ∈ Rn×m and z ∈ Rm a vector Brownian motion with instantaneous covariance matrix E[dzdz T ] = Σdt. a = a(x.42) 2 where fx = [fx1 . it is more mysterious that in ordinary calculus we are able to ignore this term. which enters because we are forced 2 to keep the dz term which is of order dt. x2 . and a rectangle 1 In Ito’s lemma for Brownian motion. That is d(uv) = udv + vdu + dudv. that is because in ordinary calculus. AND A RECTANGLE 17 with a1 = a1 (x1 . .. The example I will use is the product rule. the terms du and dv are of order dt and hence dudv is a higher order term. ··· fx1 xn . it is obvious the area of d(uv) is equal to the sum of the three colored rectangles in the ﬁgure. B = B(x. we have the familiar formula d(uv) = udv + vdu. This is a change in the product uv. t). t) where z1 and z2 are two correlated Brownian motions with instantaneous correlation coeﬃcient ρ (i. the mysterious term is 2 b2 fxx dt. Let f (x1 .. Of course. that is d(uv) = (u + du)(v + dv) − uv (2. .41) where x ∈ Rn .. t) be twice continuously diﬀerentiable function of x and t. Now.4 Ito’s lemma. x2 . Consider the quantity d(uv).39) Using vector notation we can generalize the above result (⋆) Ito’s for vectors Given a vector stochastic diﬀerential equation dx = adt + Bdz (2. . E[dz1 dz2 ] = ρdt).1 shows a picture of this. t) = 1 1 ft + a1 fx1 + a2 fx2 + b2 fx1 x1 + b2 fx2 x2 + ρb1 b2 fx1 x2 1 2 2 2 dt + b1 fx1 dz1 + b2 fx2 dz2 (2. the product rule. THE PRODUCT RULE. t).. . Let f (x. x2 and t.43) In fact.40) (2.44) We can think of this as the area of a rectangle with sides of length u + du and v + dv minus the area of a rectangle with sides of u and v. (2. In ordinary calculus. (2. . The rectangle in Figure 2. ITO’S LEMMA. fx2 .2. b1 = b1 (x1 .45) Hence. it is natural to expect to see a term related to dudv! In fact.
since the standard deviation of dz is of order dt. ITO’S LEMMA Figure 2. Hence. This is Ito’s lemma for f (z) = z 3 .3. 2.1. For Poisson processes. any function of it has a corresponding jump. second order terms in dz are of order dt and cannot be ignored. From our rectangle and the formula for d(z 2 ) we have d(z 3 ) = d(z · z 2 ) = = = zd(z 2 ) + z 2 dz + dzd(z 2 ) z(2zdz + dt) + z 2 dz + dz(2zdz + dt) 3z 2 dz + 3zdt + o(t) (2.18 CHAPTER 2. . we are not a far cry from Ito’s lemma for C 2 functions. for Brownian motion.47) which is Ito’s lemma for f (z) = z 2 .49) (2. recalling that dz 2 should be replaced by dt as in the argument of Section 2. This gives an extra term in Ito’s lemma compared to the ordinary chain rule of calculus. This gives us a formula for d(z 2 ).48) (2. Thus Ito’s lemma for Poisson processes is simply a combination of the ordinary chain rule plus noting that when the Poisson process jumps. There are diﬀerent versions for Brownian motion and for Poisson processes.50) where in the last step we have placed higher order terms in o(t) and replaced dz 2 by dt. Furthermore.1: Ito’s Rectangle Let us see how far we can get from this simple rectangle. Let us take u = v = z(t). most of the time no jumps are occurring and ordinary calculus is ﬁne. now you should have an easy time remembering Ito’s product rule d(uv) = udv + vdu + dudv. Roughly speaking. Continuing on in this manner. (2. we have d(z 2 ) = 2zdz + dt (2. we can easily derive Ito’s lemma for any polynomial of any order in z(t)! At that point. it causes a corresponding jump in any function of the Poisson process.46) Note that the formula above is exact! Now.5 Summary Ito’s lemma is the most important result in stochastic calculus for derivative pricing. when a jump occurs. you see that Ito’s lemma is actually quite natural if you just remember the rectangle. as they can be approximated by limits of polynomials. You should be √ familiar with both. Now we can proceed and choose u = z and v = z 2 . Then our rectangle formula (product rule) says that d(z 2 ) = d(z · z) = zdz + zdz + dz 2 . However.
Intuitively. √ (b) Given dx = −xdt + dπ α .3 Let dx = adt + bdz1 and dy = f dt + gdz2 (2. t)dz + Y dπ (2. (i.51) Problem 2. y. Let f (x.57) (2.6.6 Problems dx = a(x.54) where z1 and z2 are correlated Brownian motions with correlation coeﬃcient ρ.5 Compute 0 T zt dzt where zt is Brownian motion.6. t). Find a stochastic diﬀerential equation for df .56) (2. t) be a twice continuously diﬀerentiable function of x and t. use Ito’s lemma to derive df for f (x. µ ∈ Rn . That is.6. (i.2 Multidimensional Ito Formula for Brownian Motion: Consider the following vector Ito process: dx = µdt + Kdz (2.) Problem 2. Is it possible for E[1/xt ] to also have a positive growth rate? Give conditions on a and b for when this is possible.e. E(dz1 dz2 ) = ρdt) (a) Use Ito’s lemma to ﬁnd d(xy). Use Ito’s lemma to derive df for a generic twice continuously diﬀerentiable f (x. t) = x2 + t. Problem 2. but this time you can use E[dzdz T ] = Idt where I ∈ Rm×m is the identity matrix. Problem 2. (b) Use Ito’s lemma to ﬁnd d(x/y).55) (2. Problem 2. provide an explanation for this.52) where x ∈ Rn .) Problem 2. what is Ito’s lemma in this case). t) = t2 x.4 Ito’s Lemma Practice √ (a) Given dx = xdt + xdz.6. t)dt + b(x.53) where E[dz1 dz2 ] = ρdt.e.6. Now consider 1/xt . (Hint: consider Ito’s lemma for z 2 . .58) Assume that a > 0. What is df ? (Hint: Use the same Taylor series argument used in class. t) : Rn ×R → R be a twice continuously diﬀerentiable function of x and t.6.2.1 Consider the Stochastic Diﬀerential Equation: where z is Brownian motion and π is a generalized Poisson process with intensity λ and random jumps of size Y . the growth rate of E[xt ] is positive. (c) Given dx dy = = (x + y)dt + xdz1 ydt + xdz2 (2. Let f (x.6 (Counterintuition) Let dx = axdt + bxdz (2.6. use Ito’s lemma to derive df where f (x. PROBLEMS 19 2. K ∈ Rn×m and z ∈ Rm where z is a vector Brownian motion of m uncorrelated onedimensional Brownian motions.
62) (a) Compute the mean of df to lowest order in dt using the terms of the Taylor expansion shown above.. (dz)2 terms separately.60) (2.42). t) satisﬁes 1 df = (ft + afx + b2 fxx )dt + bfx dz. .8 Show that (2..6. 2 or 1 df = ft dt + fx (adt + bdz) + fxx (a2 (dt)2 + b2 (dz)2 + 2abdtdz) + . Problem 2. That is. ITO’S LEMMA Problem 2.) Let dx = adt + bdz (2. (b) Compute the standard deviation of the terms that contain randomness in the above expansion. dtdz. compute the standard deviation of the dz.6.20 CHAPTER 2.7 Intuition behind Ito’s lemma for Brownian motion. (2. Which term has standard deviation of lowest order in dt? (Hint: The fourth moment of a N (0.61) (2.39) follows from (2.60). (This intuitive look at Ito’s lemma was communicated to me by Muruhan Rathinam. 2 Recall our heuristic derivation based on a Taylor expansion: 1 df = ft dt + fx dx + fxx (dx)2 + . 2 (2.) (c) Use this analysis to argue for the plausibility of Ito’s lemma. σ 2 ) is 3σ 4 ..59) then from Ito’s lemma f (x..
(3. (3. note the important role that Ito’s lemma plays. these are stochastic diﬀerential equations that everyone should know! 3. Figure 3. Then over the time period dt. the return r on the stock x(t) is given by dx x(t + dt) − x(t) = . 3. These are also some of the stochastic diﬀerential equation models used for modeling asset prices and other relevant ﬁnancial variables.2) d ln(x) = (a − b2 )dt + bdz. In these solutions. integrating gives 1 ln(xt ) − ln(x0 ) = (a − b2 )t + bzt 2 which implies that x(t) = e(a− 2 b 1 2 (3.1 Stock Price Interpretation Geometric Brownian motion is the standard model for continuous asset price movements. (3. By Ito’s lemma with f (x) = ln(x) we have 1 (3.Chapter 3 Standard Stochastic Diﬀerential Equations with Solutions In this chapter we review some stochastic diﬀerential equations that have closed form solutions. A closed form solution to geometric Brownian motion can be found as follows. not many stochastic diﬀerential equations have closed form solutions.1.5) r= x(t) x 21 .1 Geometric Brownian Motion dx = axdt + bxdz.4) Note that geometric Brownian motion is a lognormal process.1 shows a typical sample path of geometric 2 Brownian motion. from (3. Let x(t) be the price of a stock. That is. Most importantly.3) )t+bzt x(0). Thus.1) Geometric Brownian motion is the following stochastic diﬀerential equation where a and b are constants. 2 Since a and b are constants.3) in logcoordinates x(t) is a Gaussian process with drift a − 1 b2 and volatility b. It comes from the following.
(3. a jump leads to the transition x(t) → bx(t). this process is a Poisson process plus drift in logcoordinates.6) 3. this is a very natural model of asset prices. Thus. there exists a closed form solution that is obtained by changing to log coordinates. it would be a bit messier. then we will jump by an amount (b − 1)x(t). Again. but being driven by a Poisson process. Note that if we didn’t use this convention.2 Geometric Poisson Motion Geometric Poisson motion is the equivalent of geometric Brownian motion.7) (3. Thus. (3. By Ito’s lemma with f (x) = ln(x) d ln(x) = adt + ln(b)dπ. Thus.10) . (3. by writing (b − 1) we think of b as indicating the multiple of the current state that we will jump to if a jump occurs. STANDARD STOCHASTIC DIFFERENTIAL EQUATIONS WITH SOLUTIONS Figure 3. Thus.1: Typical Sample path of geometric Brownian motion.11) (3. Geometric Brownian motion models this instantaneous return as dx = adt + bdz x where a and b are constants.8) The reason for the term b − 1 is as follows. That is.22 CHAPTER 3. we will jump to x(t) + (b − 1)x(t) = bx(t). Hence. If the current value is x(t) and a jump occurs. the instantaneous return r over the next dt is Gaussian with E[r] = adt. (3. dx = ax− dt + (b − 1)x− dπ. V ar(r) = b2 dt.9) Integrating leads to ln(x(t)) − ln(x(0)) = at + ln(b)π(t) or x(t) = eat+ln(b)π(t) x(0) = eat bπ(t) x(0).
more extreme price movements) than the lognormal distribution. π(t) i=1 (3. dx = (a + bx)dt + (c + f x)dz.4 A more general SDE The following SDE contains a slightly more general description of an SDE driven only by Brownian motion. is a lognormal process. π(t) i=1 Zi x(0) 3.3.18) . dx = ax− dt + bx− dz + (Y − 1)x− dπ (3.14) x(t) = eat+ which.16) Yi = e )t+bz(t)+ Zi where Zi is normal. This model is nice because it can produce distributions that have heavier tails (i. it can be used to create implied volatility smiles and smirks.19) (3. using Ito’s lemma with f (x) = ln(x) gives the solution. which is π(t) x(t) = e But since Y is lognormal. π(t) i=1 (3.3 A jumpdiﬀusion model The following stochastic diﬀerential equation uses a generalized Poisson process with a log normal jump size and a Brownian motion.1 A conditional lognormal version of geometric Poisson Motion The following stochastic diﬀerential equation uses a compound Poisson process with a log normal jump size. It is modeled as dx = ax− dt + (Y − 1)x− dπ (3. Using Ito’s lemma with f (x) = ln(x) gives π(t) But since Y is lognormal. conditional on π(t).12) where Y = eZ is a lognormal random variable. we can write it as x(t) = eat πt i=1 i=1 Yi = e Yi x(0). This results in a process that involves jumps and a diﬀusion term.e. a change to log coordinates facilitates ﬁnding a closed form solution.17) which. To solve this.15) where Y is a lognormal random variable. 3. Once again. Hence we have (3. (3. follows the lognormal distribution. In this way.13) Zi where Zi is normal. The intuition behind our route to the solution comes from the use of integrating factors in basic ODEs.2. A JUMPDIFFUSION MODEL 23 3. conditioned on π(t). This results in a conditional lognormal process that is diﬀerent from geometric Brownian motion. Again. Hence we have Zi x(t) = e(a− 2 b π(t) i=1 x(0) (3. ﬁrst write the SDE as follows: dx − bxdt − f xdz = adt + cdz. It is known as a jumpdiﬀusion model. we can write it as (a− 1 b2 )t+bz(t) 2 i=1 π(t) i=1 1 2 Yi x(0).3.
it has a closed form solution. Thus. and b is the mean reversion rate. observe that x(t) is a Gaussian process as well. Mean reverting processes of this speciﬁc form are sometimes also called a Vasicek model.21) )t−f z(t) .22) )t−f z(t) x) = e−(b− 2 f 1 2 )t−f z(t) 1 1 −(b − f 2 )xdt − f xdz + f 2 xdt 2 2 +(a + bx)dt + (c + f x)dz − f (c + f x) dt = Integrating both sides leads to e(−(b− 2 f 1 2 e−(b− 2 f 1 2 )t−f z(t) ((a − f c)dt + cdz) . The solution is given by t x(t) = e−bt x(0) + 0 e−b(t−s) (abds + cdz(s)). That is.18). we would try to make the left hand side an exact diﬀerential by using an integrating factor. dx = −b(x − a)dt + cdz (3. One disadvantage of this process is that values of x(t) can become negative because the Gaussian distribution always has some probability of being negative. we will try an integrating factor of the form e−(b− 2 f Therefore. The solution to this is x(t) = x(0)e(b− 2 f Hence.25) 3.26) where a is the level that x(t) reverts to. the process is pulled toward some value (in this case. dx − bxdt − f xdz = 0.20) )t+f z(t) . (3.24) and rearrangement gives the ﬁnal form x(t) = e((b− 2 f 1 2 t )t+f z(t)) x(0) + 0 e((b− 2 f 1 2 )(t−s)+f (z(t)−z(s)) ((a − f c)ds + cdz(s)). STANDARD STOCHASTIC DIFFERENTIAL EQUATIONS WITH SOLUTIONS In standard ordinary diﬀerential equations. Since a mean reverting process is of the form of (3. its long term mean level).19) is set to zero. . (3.27) Thus. (3. A sample path of a mean reverting process is depicted in Figure 3. This is undesirable because prices and quantities such as interest rates should not be negative.23) t )t−f z(t)) x(t) − x(0) = e(−(b− 2 f 0 1 2 )s−f z(s)) ((a − f c)ds + cdz(s)) (3.2. The Brownian driven term cdz just adds noise. let us compute d(e−(b− 2 f 1 2 1 2 1 2 (3. (3.1 The OrnsteinUhlenbeck Process and Mean Reversion A very useful model is one in which a price or ﬁnancial variable is mean reverting. x(t) is drawn toward a at a rate of b. (3. The integrating factor is usually related to the solution to the diﬀerential equation if the right hand side of (3.4. A standard model for this is a Gaussian model called the OrnsteinUhlenbeck process [14]. That is.24 CHAPTER 3. Vasicek used a process of this form to model the short rate process in term structure modeling [15].
with correct parameter values.8 2.6 2. Note that it is very similar to the √ OrnsteinUhlenbeck process.2 2 1. Now.4 x 2. x(t) follows dx = = i=1 1 1 2yi (t) − αyi dt + βdzi 2 2 i=1 n 2 −αyi + n + i=1 β2 4 dt (3. zn are uncorrelated. . consider the process 2 2 x(t) = y1 (t) + . a = 2.34) (3. . This makes the noise dependent on the size of x.3. COXINGERSOLLROSS PROCESS Simulation of Mean−Reversion Dynamics 3 25 2. c = 0. except that the driving Brownin motion is multiplied by x. 3.2: Simulation of MeanReverting Dynamics.32) n By Ito’s lemma. . This process is related to the OrnsteinUhlenbeck process as follows.28) dx = −b(x − a)dt + c xdz and used often in short rate models or stochastic volatility type models. (3.30) 1 1 dyn = − αyn dt + βdzn (3. . Furthermore.5. + yn (t) (3. this process will always be positive. . . .5.33) (3. b = 5.6 0 1 2 t 3 4 5 Figure 3. Consider n Ornstein Uhlenbeck Processes 1 1 dy1 = − αy1 dt + βdz1 (3.31) 2 2 where the Brownian motions z1 .29) 2 2 . x(0) = 3. It is given by √ (3.8 1.5 CoxIngersollRoss Process Another version of a mean reverting process is the CoxIngersollRoss (CIR) process [4].35) β2 4 β2 4 n dt + i=1 n (βyi dzi ) (yi dzi ) i=1 = −αx(t) + n dt + β . .
35) to dx = −αx(t) + n dt + β x(t)dz (3. we should always make sure that our parameter choices correspond to selecting n ≥ 2. Thus we can actually use the replacement n dz = i=1 yi x(t) β2 4 dzi (3. A sample path of a CIR process is depicted in Figure 3.3. heuristically. STANDARD STOCHASTIC DIFFERENTIAL EQUATIONS WITH SOLUTIONS Now. we perform a little trick by writing the last term as n β Now.38) and the variance is computed as n V ar i=1 yi x(t) n dzi = i=1 2 yi V ar(dzi ) x(t) n = i=1 2 yi dt x(t) = dt x(t) n 2 yi = dt i=1 (3. But. i. (3.32). dzi ∼ N (0. overall. 4 b = α.26 CHAPTER 3.6 Summary In this chapter we have explored some of the standard stochastic diﬀerential equation models used in ﬁnance. It is important to have a feel for these processes what their parameters correspond to.41).28) and (3. 3.36) n i=1 dzi (3. c2 For n corresponding to an integer. we get that n = 4ab . You can think of these models as building more and more complicated models from our basic building blocks of Brownian motion and the Poisson process. Hence. the process x(t) will be ChiSquared distributed.37) is actually a Brownian motion! How can we see this? Well.37) is just the sum of Gaussians. we only need to show that the mean is zero and the variance is dt. To compute the mean we have n E i=1 yi x(t) n dzi = i=1 yi x(t) E[dzi ] =0 (3. we are guaranteed to have a positive process (otherwise the process can and will reach zero at times!). This gives ab = nβ 2 .41) Now. c=β (3. . dt).37) is normally distributed. then (3. we have the solution as the sum of OrnsteinUhlenbeck processes. This is easily done since we can match coeﬃcients in (3. (3.42) From these relationships. note that if we interpret each Brownian increment as being normally distributed with mean zero and variance dt. Thus. For it to be the increment of Brownian motion. Note that most of them correspond to some transformation of simple Gaussian processes such as the OrnsteinUhlenbeck process or the Poisson process. Thus. it is a well known property that the sum of Gaussians is Gaussian. it turns out that for n ≥ 2. it turns out that x(t) i=1 yi x(t) yi x(t) dzi .39) where we used that the dzi ’s are independent and equation (3.40) to convert (3.e.
(c) What does your answer in (b) look like if Y is lognormally distributed.5 x 2 1.7. PROBLEMS Simulation of CIR Dynamics 3 27 2.) 1 2 )t−f z x where z has the trivial sde .2 Consider the process dx = −b(x − a)dt + cdz (3. b = 5. c = 0. (Hint: Let f (x.23).43) Using Ito’s lemma. z.3: Simulation of a sample path of a CIR process.7.5. x(0) = 3.7. a = 2. Problem 3. This model is lognormal due to the fact that x is normal.7 Problems Problem 3.3.7.3 Verify equation (3. derive a stochastic diﬀerential equation for y = ex in terms of only y. t) = e−(b− 2 f dz = dz and use the multidimensional Ito’s lemma.1 Solve the following stochastic diﬀerential equations: (a) dx = µdt + σxdz (b) dx = µxdt + σxdz + x(Y − 1)dπ where π is a Poisson process with intensity λ and Y is a random variable.5 0 1 2 t 3 4 5 Figure 3. 3. but has a mean reverting property in log coordinates. Problem 3.
28 CHAPTER 3. STANDARD STOCHASTIC DIFFERENTIAL EQUATIONS WITH SOLUTIONS .
To back up this claim we will derive pdes in a great number of situations. Hence.1 shows the cash ﬂow diagram corresponding to this where the time increment it denoted as dt. Figure 4. 4. However... i = 1. But ﬁrst we must understand the underlying principle. returns are also related to price changes.. a return is assumed to be of the form r = α + βf (4. In doing so.1 Returns Above I said we would model returns as factor models. 29 . the return on an asset is deﬁned as r= P (t + ∆t) − P (t) P (t) (4. which is in fact quite simple. Given a period of time ∆t. n is a random factor. we adopt that paradigm here. . However.3) where P (t) is the amount invested at time t and P (t + ∆t) is the amount received at time t + ∆t. Hence. factor models should arise from price changes as well. we do so in light of the application to derivative pricing.2 Returns and Factors Models Most of the modeling in asset pricing theory is done using linear factor models. 4. We will also consider multifactor models of the form n r =α+ i=1 βi fi (4.Chapter 4 The Factor Approach to Arbitrage Pricing 4.1) where a and b are constants and f is a random variable called a factor.. This is a one factor model. a simple absence of arbitrage equation results in perhaps the simplest way to obtain the vast majority of BlackScholes type pdes that appear in derivative pricing..2. We will ﬁnd in the following chapters that when coupled with Ito’s lemma. Let’s begin.1 The Factor Approach to Arbitrage Pricing In this chapter we present absence of arbitrage conditions when returns are described by linear factor models. we rely heavily on Ross’ Arbitrage Pricing Theory [13].2) where each fi . That is.
4) dS = µSdt + σSdz. In what follows.2 Stochastic diﬀerential equations and factor models In ﬁnance. I like to go even a little further and interpret dt as a special factor which is nonrandom. In fact. and I can use it to determine a model of returns: r= dx(t) a(x(t). and dt can be any time increment.1: Cash Flow Diagram 4. .t) (and arguments are being suppressed).7) This also looks like a factor model with the factor being dz. we should view a factor model from an SDE as being valid only over dt and conditioned upon information at time t.8) where α = a(x(t). not just an instantaneous change in time. S(t) S(t) (4. I like to write it in the form: r = αdt + βdz (4. Hence. you may think of the factor dz as being any random variable. if we associate α = µdt. it is given by: r= S(t + dt) − S(t) dS(t) = = µdt + σdz. t) b(x(t). In general. conditioned upon information at time t.5) This is also an example of a factor model. for the sections that follow. In the above formula. t)dt + b(x(t). However. This is purely for convenience. Therefore.2. and f = dz then it is a standard linear factor model. but I will use this convention throughout. t)dz (4. This is slightly diﬀerent from x(t) x(t) the standard factor model in that I explicitly write out dt in the ﬁrst term. I should be able to use it to compute the return of an asset over time dt.30 CHAPTER 4. Therefore.t) and β = b(x(t). SDEs lead to instantaneous factor models that apply over time increments of dt. if I am given a stochastic diﬀerential equation dx(t) = a(x(t). β = σ. including x(t) (or t− and x(t−) if appropriate). This is a model of a price change over time dt. we like to model asset prices as stochastic diﬀerential equations. t) = dt + dz x(t) x(t) x(t) (4. a stock price could be (4. For instance. THE FACTOR APPROACH TO ARBITRAGE PRICING Figure 4. I will use notation that is indicative of SDEs in that factors will be denoted by dz and I will include a special factor dt when I describe returns.6) then it is describing price changes over time period dt. x(t) is known and α and β are then known constants over time t to t + dt. In fact. Another note is that α and β don’t seem to be constant! However. not just Brownian motion.
Hence. That is xT 1 x β T = = 0 No cost 0 No risk (4. Hence.3. THE FACTOR APPROACH TO ARBITRAGE USING RETURNS 31 4. By a tradable asset. Now consider a portfolio of these assets. the returns in (4. you have purchased the dividend stream as well. the total cost of our portfolio is n cost = i=1 xi = xT 1 (4. If it had a negative return. the total change in our portfolio is given by: n proﬁt/loss = i=1 xi ri = xT r.11) We can analyze this return in a little more detail by writing out what each ri is xT r = xT (αdt + βdz) = (xT α)dt + (xT β)dz. I must have no return as well. If this stock pays a dividend. For instance.9) indicate the change in value of each tradable asset. If xT β = 0 then the return on this portfolio is deterministic and given by xT αdt. That is. then it better have no return! If it had a positive return.10) where 1 is a vector of ones. since I would be guaranteed to make money (no risk). Note that you cannot just purchase the ”price” of the stock. if something costs nothing and has no risk. Therefore. and hence you are stuck with everything that comes along with purchasing a share (such as dividends). then when you purchase the share of stock. I have stacked up all the returns in a vector. this would be an arbitrage. I mean an asset that you can actually buy. and I didn’t use any money to do it (no cost). Let’s list the returns of tradable assets that are driven by the factor dz r = αdt + βdz (4. This leads us to a key condition which I state as the following implication (⋆) A (not very useful) Necessary Absence of Arbitrage Condition xT 1 = 0 No cost xT β = 0 No risk ⇒ xT α = 0 No return (4. The factors are contained in the vector dz ∈ Rm . So. 4. let’s consider the returns of tradable assets.13) (4. I will choose this portfolio so that it costs nothing and has no risk.15) But. only a share.3 The Factor Approach to Arbitrage using Returns We consider the returns of tradable assets.16) .12) since that is the coeﬃcient of the dz term which is providing the randomness. (4.14) (4.3. α ∈ Rn are vectors and β ∈ Rn×m is a matrix. you can purchase a share of stock. We will represent the portfolio by a vector x ∈ Rn which denotes the dollar amount invested in each tradable asset.1 Arbitrage Let us consider a simple portfolio.9) where r ∈ Rn . We can now see that this return will be riskless as long as xT β = 0 (4.4. Over the time period dt. to have no arbitrage. then the portfolio −x would be an arbitrage.
18) Now. That is exactly saying that N (A)⊥ = R(AT ).21) In order to derive a useful condition from (4.3. However. (⋆) Null and Range Space Relationship: N (A)⊥ = R(AT ). Then y T Ax = (AT y)T x = 0 which means that A y is orthogonal to the null space of A for all y.19) On the other hand. (4. That is R(A) = {y  ∃x such that Ax = y } .32 CHAPTER 4. That is N (A) = {xAx = 0} .20) Finally. 2.3 A Useful Absence of Arbitrage Condition Using the null and range space relationship. an equivalent condition is extremely useful and will provide the foundation for our derivations of partial diﬀerential equations. we recall the notion of the perpendicular set of a given set of vectors.17) As the name indicates. we need to ﬁrst recall some linear algebra relationships. (4. That is M ⊥ = z  z T x = 0.23) 4.17) can be written as Ax = 0 ⇒ αT x = 0 where A = . we will use the following relationship between the null and range space of a matrix. ∀x ∈ M (4.24) T (4.2 Null and Range Space Relationship 1T βT Note that the condition (4. (4. the set of all vectors x such that Ax = 0 is known as the null space of the matrix N (A).3. the condition which is stated next is astonishingly useful! But to derive it. 4. (4. Let M be a set of vectors.17). the above condition for absence of arbitrage is not terribly useful. Hence it receives two stars!! (⋆⋆ Return APT) A Useful Necessary Absence of Arbitrage Condition .22) (4. the set of all vectors y such that there exists an x with y = Ax is known as the range space of the matrix A. (4. then M ⊥ is a set of all vectors z such that z is orthogonal to all vectors in M .17) into a very useful condition. In fact. we can convert (4. Proof: The proof of this is rather simple. THE FACTOR APPROACH TO ARBITRAGE PRICING which can be written in matrix form as 1T βT x = 0 ⇒ αT x = 0. x ∈ N (A) ⇒ Ax = 0 ⇒ y T Ax = 0 for all y. It’s importance cannot be overstated.
λ1 is called the ”market price of risk”. If it is large. In this case. On the left side is the expected return (µ). then you have purchased a lot of that risk. This is in fact correct. A better way to think of σ is not as volatility. Intuitively. then we should be earning the risk free rate. If σ is zero. The Market Price of Time But.28) ˆ ˆ Thus. 2 4. But. if we don’t take on any risk (i. Market Price of Risk The APT equation for a single factor model r = µdt + σdz is ˆ µ = [ 1 σ ]λ = λ0 + σλ1 . σ = 0).17) indicates that if a portfolio x is in the null space of the matrix 1T βT then it also must be orthogonal to α. (4. using the null and range space relationship.25) where ˆ λ= with λ0 ∈ R and λ ∈ Rm .4. Its derivation is given below. this means that α∈R 1T βT T (4.16) to be true is for there to exist a vector ˆ λ ∈ Rm+1 such that ˆ 1 β λ = λ0 + βλ = α (4. but as the amount of the risk factor dz that you have. In fact. Hence. Therefore. we also have this pesky λ0 to deal with. Many of you will hopefully recognize this as nothing more than the simple version of Ross’ 1976 arbitrage pricing theory [13]. then you are not exposed to the risk dz at all. if we don’t have any risk.e. let’s justify this through a slightly diﬀerent argument that will also get us a nice interpretation for λ0 . this equation relates volatility to expected return. For this reason. . λ1 is the market price of the risk factor dz.29) Let’s look at this equation.3. THE FACTOR APPROACH TO ARBITRAGE USING RETURNS 33 A necessary and suﬃcient condition for the implication (4. λ1 tells you how much your expected return is increased (assuming λ1 is positive) for each unit of the risk dz that you take on. we might guess that λ0 = r0 .3. where r0 denotes the risk free rate of interest. there exists a vector λ ∈ Rm+1 such that [ 1 β ]λ = α. This is a nice interpretation for λ1 . It is not tied to any risk. But.4 Interpretations Let’s try to get a bit of intuition into the mystical λ’s. and on the right side is the volatility (σ).26) (4. Hence. Another way to put this is that α∈ N 1T βT ⊥ λ0 λ (4.27) where N (·) is the null space. then µ = λ0 . Proof: Note that the condition in (4.
2 Proﬁt/Loss and Arbitrage y T (dV) = y T (Adt + Bdz) = (y T A)dt + (y T B)dz (4. 4. then we may interpret λ0 as the reward for the time factor dt or the ”market price of time”. since the marktomarket mechanism resets the value of a contract to zero every day. It is very useful to have this intuition. etc. since by deﬁnition a return involves dividing by the price. and multiplying them by their market price and adding them up. Your return is just given by looking at how much of each factor you have taken on. The simple cash ﬂow diagram is given in ﬁgure 4. to how much we are rewarded for taking on those factors. so below we will reformulate absence of arbitrage conditions but in terms of price changes rather than returns.4. I don’t like special cases. in the above framework. Hence. Don’t forget it. They relate the diﬀerent factors dt. The cash ﬂow from changes in value over the period will satisfy the factor model dV = Adt + Bdz where A ∈ Rn . this makes perfect sense. they don’t have a well deﬁned return. B ∈ Rn×m .2. which can be zero. Quite simple.34 CHAPTER 4. Therefore. Hence.32) 4. Since it must satisfy our return relationship. we have: r0 = λ0 + 0(λ1 ) = λ0 (4. 4.33) In this case. In this case. we let r ∈ Rn be the returns of assets and speciﬁed the dollar amount invested in each asset by a vector x ∈ Rn . Its factor model is given by: dB0 = r0 dt B0 (4. Intuitively.5 A Problem with Returns Using returns to model assets has a disadvantage. This eliminates the need for special cases.3.31) which tells us that λ0 = r0 . an arbitrage portfolio is one that has: . A futures contract is an example of this. This time we will specify a vector of prices per unit of tradables P ∈ Rn . 4. If the λ’s are the market prices of factors. THE FACTOR APPROACH TO ARBITRAGE PRICING Let us consider a risk free asset. In our original argument. (4. and dz ∈ Rm . a futures contract must be dealt with as a special case.1 Price Changes and Arbitrage Let’s return to our arbitrage portfolio and reformulate it in terms of price changes. There are contracts that involve no upfront cost or price.4. since the risk free rate is the amount we are rewarded for taking on time and nothing else. and shares or units of each asset purchased y ∈ Rn .30) where r0 is a constant risk free rate of interest. it will be okay to have an asset with a zero price. right? This is the basic interpretation of the concept of ”market price of risk”. dz.4 The Factor Approach using Price Changes I just mentioned that there can be some problems in dealing with returns. in this section we will reformulate the factor approach to arbitrage pricing by working with prices rather than returns. In a market with no arbitrage. cash ﬂows resulting from the changes in value of the tradables per unit dV ∈ Rn . the proﬁt/loss on the portfolio is given by: Therefore. every factor has a market price. Now all the λ’s should make sense.
4. not dollar amount. it describes proﬁt/loss in terms of the cash ﬂow. (⋆) A (not very useful) Necessary Absence of Arbitrage Condition y T P = 0 No cost y T B = 0 No risk which can be written in matrix form as PT BT y = 0 ⇒ AT y = 0. THE FACTOR APPROACH USING PRICE CHANGES 35 Figure 4. • No Risk: y T B = 0.4.2: Cash Flow Diagram with Prices P and Value Changes dV • No Cost: y T P = 0. of course they are highly related. we can convert this to a dual condition that is useful: (⋆⋆Price APT) A Useful Necessary Absence of Arbitrage Condition ˆ A necessary condition for no arbitrage is for there to exist a vector λ ∈ Rm+1 such that P where ˆ λ= with λ0 ∈ R and λ ∈ Rm . but it is easier to understand some pricing situations.35) ⇒ y T A = 0 No return (4. such as futures contracts.36) . so we would like the following implication to hold. we want to eliminate arbitrages. The λ’s in both cases are the same. They are market prices of risk. (4. Furthermore. but has a proﬁt • Proﬁt: y T A = 0. The main diﬀerence is that the price approach uses shares or units of the asset to describe the portfolio. What is the relationship between the return approach and the price change approach. and have the same interpretation. Of course.34) Once again. λ0 λ (4. in the price approach rather than in terms of returns.37) B λ = Pλ0 + Bλ = A (4. Well. not returns. These are really superﬁcial diﬀerences.
the absence of arbitrage ˆ equation says that [ P B ]λ = A or B S Solving this equation for λ0 and λ1 gives λ0 λ1 as expected.36 CHAPTER 4.46) (4. Prices We can derive the same results using prices. the APT equation says [ 1 β ]λ = α or 1 0 1 σ Solving this equation for λ0 and λ1 gives λ0 λ1 = = r0 µ−r σ (4.40) (4.43) (4. We will see this equation pop up many times.45) . Their dynamics are given below: dB dS = = r0 Bdt µSdt + σSdz. we can write that: dB B dS S ˆ Hence.44) λ0 λ1 = r0 µ (4.1 Stocks Assume that a stock follows a geometric Brownian motion (GBM) and there is a bond that earns the risk free rate.38) (4. r0 .47) 0 σS λ0 λ1 = r0 B µS (4.39) What do our absence of arbitrage conditions say about these assets? Returns In terms of returns. (4. THE FACTOR APPROACH TO ARBITRAGE PRICING 4.41) Note that the market price of risk for dz is like an instantaneous Sharpe ratio.5 Two standard examples In this section we will give some little examples to help us gain some intuition into the approach. σ (4. Let’s start with a stock and a bond. In terms of prices and changes in value. 4.42) = = r0 dt µdt + σdz (4. = = r0 µ−r .5.
we may consider a market with a bond and a futures contract. and is diﬃcult to understand in the context of returns because its return is not deﬁned (it has zero price!). we see that the equations separate. The relevant quantities can be written as prices changes factors B dB = r0 Bdt (4. and the market price of risk does not involve the risk free rate r0 . and the market price of time is r0 . At the end of the day. Hence time is not mixed into it.52) λ1 = σ Note that a futures contract refers to the price at a ﬁxed time in the future. Hence. (4. the price change is settled.49) 0 df = µf dt + σf dz.) equations that arise in derivative pricing theory. but the cash ﬂow from the change in value will be given by df = µf dt + σf dz. Note that this price change is not discounted. Below is the cash ﬂow diagram for a futures contract. the change in value of the portfolio is equal to the change in the futures price dV = df ! Figure 4.51) r0 µ . SUMMARY 37 4.48) Hence. Hence. in this case.6. Now let’s look at the price based arbitrage equation [ P B ]λ = A or B 0 Solving this equation for λ0 and λ1 gives (4. Time is not in the mix. λ0 = 0 σS λ0 λ1 = r0 B µS . This means that they always begin the day with a zero price. Hence. time is ﬁxed. (4.2 Futures contracts One can enter into a futures contract without paying any money.6 Summary We have derived forms of an aribtrage pricing theory based either on linear factor models of returns or value changes. The bond is purely time.5. The critical diﬀerence between futures contracts and forward contracts is that futures contracts are marked to market and settled daily.3: Cash Flow Diagram for Futures Contract Therefore. These simple results will allow us to derive many of the partial diﬀerential (or diﬀerence/integral/etc. but rather the change in the futures price. On the other hand. we develop a . It is much more naturally considered in terms of prices and value changes. (4. a futures contract can have zero price. But before doing so. and a futures contract is purely a bet on the outcome of a random factor. the futures contract is purely a bet on the outcome of the factor. This means that a futures contract is a special case in our setup.50) 4.4.
3 (Early Exercise and American Options) American options allow the holder of the option to exercise at any time prior to expiration. then we can deﬁne γ = α − [1 β]λ∗ . λ (4.54). in practice an alternative but equivalent approach is often used.38 CHAPTER 4. Then this means that an arbitrage opportunity exists.57) Show that this optimization problem is equivalent to the optimization problem in (4. 4. That is min α − [1 β]λ 2 . What you should take away is that simple arbitrage ideas that we derived in this chapter are the underpinnings of derivative pricing theory.) Problem 4. interest rate derivatives. However. That is. THE FACTOR APPROACH TO ARBITRAGE PRICING derivative pricing framework based upon our APT factor based approach. One way to select a speciﬁc arbitrage would be to solve an optimization problem of the form max αT x x (4. and beyond. in following chapters we tackle examples in equity derivatives.58) Prove by use of the Farkas alternative that this implication is true if and only if there exists a λ such that b = AT λ. as follows.1 (How to construct an arbitrage) Assume that there does not exist a λ such that α = [1 β]λ.2 (More Linear Algebra and Dualities) The Farkas Alternative [7] of linear algebra says the following (i) Ax = b has a solution x ≥ 0 or (exclusive) (ii) y A ≥ 0. there exists a vector x specifying dollar amounts invested such that αT x > 0 and 1T x = 0.55) If λ∗ is this optimal λ. y b < 0 has a solution y. β T x = 0. (4.59) T T (4. Choose λ such that α − [1 β]λ has minimum 2norm.7 Problems Problem 4. Problem 4. (4.7. x 1 = 1. β T x = 0. One could then solve the optimization problem max γ T x x (4. Then. β T x = 0.56) subject to 1T x = 0.7.7.53) subject to 1T x = 0.60) . Let’s explore how this would aﬀect the absense of arbitrage ideas.54) where the constraint x 1 limits the total dollar amount of long and short positions to equal $1. Consider the implication: Ax = 0 ⇒ bT x = 0 (4. We can use the Farkas Alternative for implications as well. x 1 = 1. (Hint: Note that the implication is true if and only if there is no solution to Ax = 0 and bT x > 0. (4.
Let α and β correspond to factor models for tradable assets.p.69) .p. 1T x = 0. PROBLEMS (a) Assume that the factor model for the option if it is not exercised is r1 = αc dt + βc dz.7. e1 T x = 1.62) Problem 4. 1 − p (4. the implication must hold regardless of which return in (4. using (4. (4. β T x = 0 ⇒ αT x ≤ 0. is either r1 = αc dt + βc dz or r1 = depending on whether the option is exercised or not.7. (I.62) is being chosen.66) (4.61) Explain why the factor model for the return on an American option.2.63) E−c dt cdt (4. Argue that a necessary condition for you not to be able to arbitrage is that the following implication is true regardless of whether you are exercising the American option or not.p. where E is the early exercise value and c is the price of the option. Using the Farkas Alternative from Problem 4. 1. derive that an alternative absence of arbitrage condition is that there exists scalars ξ ≥ 0.67) Show that ∆S(k) = S(k + 1) − S(k) can be written in the form ∆S(k) = A(k) + B(k)∆Z(k) by ﬁnding A(k) and B(k). where at each step the stock price S(k) can either move up to S(k + 1) = uS(k) or down to S(k + 1) = dS(k) with the probability structure S(k + 1) = Let ∆Z(k) be a standard binary random variable ∆Z(k) = 1 w.e. let x be a vector of dollar amounts invested in each asset (with x1 corresponding to the American option).68) uS(k) dS(k) w. argue that for the American option.4.64) and Part (a) of this problem. where the ﬁrst elements of these vectors α1 and β1 corresponds to the American option. (4. β are linearly independent.64) (d) Finally. you get to decide when to exercise the option. Finally. both of the following conditions must hold αc E ≤ ≤ λ0 + βc λ1 c.p.65) (4. (b) Assume that you are long the American option. (4.4 (Modeling the binomial lattice as a factor model) Consider a stock that can move on a binomial lattice.7. 1 − p (4. p w. 39 (4. p −1 w.. As the holder of an American option.) (c) Assume that the vectors e1 . and λ0 and λ1 such that α = λ0 + βλ1 − e1 ξ.
THE FACTOR APPROACH TO ARBITRAGE PRICING .40 CHAPTER 4.
underlying variables are ﬁnancially relevant quantities that are functions of the factors.2 Underlying Variables Underlying variables are often quantities of interest that are functions of the factors. I will also emphasize the relative pricing nature of derivative pricing.1 Factors Factors are the most basic source of randomness in our models. to understand that pricing theory. we will be interested in the pricing of derivative securities. dπ. in our models. we will classify all quantities into three (possibly overlapping) categories. In our modeling framework. They are factors. For example. I like to think of time as a special factor. and to provide an almost step by step method for attacking any problem. In particular. we try to provide the structure behind the factor approach.Chapter 5 Constructing a Factor Pricing Framework 5. an interest rate could be an underlying variable. In general. As I proceed through this chapter.1 Introduction This chapter lays the framework for derivative pricing. I will use two derivative pricing examples to illustrate points. and dt terms. and used in the modeling process.2 A Classiﬁcation of Quantities In this book. they are the driving Brownian motions (z(t)) and Poisson processes (π(t)). Before jumping into any of these ideas of derivative pricing. In general. derivative pricing determines an appropriate price relative to other securities that have already been priced in a market. it is best to ﬁrst understand the general modeling paradigm. Furthermore. and an absence of arbitrage zero coupon bond pricing model. The examples are a call option on a nondividend paying stock. and tradables. That is. the factors will show up as the dz. However.2. the goal is to clarify the structure of the modeling involved in the factor approach. That is. 41 . 5. 5.2. Thus. A stock price S(t) could also be an underlying variable. let’s start by classifying the relevant quantities in a model of any market. 5. Thus. underlying variables.
if you started by purchasing 1 share of stock at time 0. In this case. What this means is that by reinvesting the dividend back into the stock.1: Picture of the Modeling Paradigm It is extremely important to be able to separate quantities that are tradable from those that are not. They are modeled as being functions of the underlying variables. Let’s consider an example to clarify the notion of what is tradable.42 CHAPTER 5. They come together and that is the tradable. For example. Derivative Security: A derivative security is a tradable whose value depends upon (or is a function of) other underlying variables.3 Tradables Tradables are the quantities that you can actually trade and include in a portfolio. the tradable is a more complicated function of an underlying variable. an interest rate r(t) can be an underlying variable. the functional relationship between the tradable and an underlying variable is trivial. and represented as a function of the underlying variable. Instead. Instead. let’s use the following example: .4 A Derivative is a Tradable In this book we are interested in pricing derivative securities. In principle. and only tradables. The stock has price S(t). note that you cannot just purchase the ”price of the stock”. 5. Figure 5. In other cases. by time t you would have eqt shares. And those absence of arbitrage relationships will lead to our pricing formulas. So. let’s consider the example of a stock that pays a dividend. CONSTRUCTING A FACTOR PRICING FRAMEWORK 5. and with this share you not only get the price of the stock but also the dividend. For example. you must purchase a share of stock. and let’s assume that it pays a continuous dividend at a rate of q. there is no real distinction between a derivative and any other tradable. Example: A stock paying a dividend To make sure we understand what a tradable is. it is important to begin by deﬁning what we mean by a derivative security. However. t) is tradable. The name derivative comes from the fact that the value of a derivative is ”derived” from some underlying variables. You will see that the most important distinction is that a derivative is ”what you want to price” and that other tradables are ”already priced” in the market. and also a tradable. Note that this deﬁnition of a derivative does not particularly distinguish it from any other tradable! We typically model all tradables as being a function of some underlying variables (although often a tradable is an underlying variable and hence trivially a function of itself). are the quantities that must satisfy the absence of arbitrage relationships from the previous chapter.2. In order to have something to keep in mind.2. we say that the derivative security is derivative to the underlying variables. Why? Because tradables. The point is that you can’t decouple the price from the dividend. a bond B(r(t). a stock price S(t) can be an underlying variable. What is the tradable quantity? You might be tempted to say that the price of the stock S(t) is tradable. Some of the time. but it is not tradable.
We write this as c(S(t). In some cases. In the previous chapter we saw that we can interpret SDEs as instantaneous factor models. if we have an SDE model for an underlying then we can use Ito’s lemma to obtain an SDE for the tradable.5. dS = µSdt + σSdz or a model of an interest rate (most likely the instantaneous short rate) as an SDE dr0 = adt + bdz. In this case. say CocaCola. there are two ways that we arrive at factor models for tradables. Consider the example of a European call option on a stock following geometric Brownian motion as in (5. what we need are linear factor models for the price changes of tradables.1 Direct Factor Models In many cases.2 Factor Models via Ito’s Lemma The second method to obtain factor models is via Ito’s lemma.3. Since tradables. (5.3. In fact.3. In that case. Thus.3) which is a factor model for the call option c. we begin the modeling process by writing down SDEs (or factor models) for underlying variables and tradables. etc. Hence. we would like SDE models for underlying variables and tradables in particular. the option is a derivative. The call option is a function of the underlying stock price S(t) and time t. because its value depends upon the stock price of CocaCola. in continuous time models. A picture of the route to SDEs for tradables is given in Figure 5. are functions of underlying variables. which is the option to purchase a speciﬁed stock. The second (and most important) is Ito’s lemma. for a speciﬁed price (the strike price). we begin the modeling process by writing down a factor model or SDE. the stock price of CocaCola is the underlying variable. In this example. derivative securities are quite ﬂexible. by Ito’s lemma we have 1 dc = (ct + µScS + σ 2 S 2 cSS )dt + σScS dz 2 (5. CocaCola is an underlying variable and also a tradable. it is not always the case that the underlying variable must also be tradable. However. there are derivatives that depend on underlying variables such as temperature or even wind speed which clearly are not tradable. Examples of this include the geometric Brownian motion model of stock price movement.2 . FACTOR MODELS FOR UNDERLYING VARIABLES AND TRADABLES Example: European Call Option 43 Consider a European call option. The ﬁrst is that we directly model the tradable as a factor model. cS . 5. in many case.1). we will know explicitly the functional relationship between the tradable or derivative and the underlying variable. and derivatives in particular.3 Factor Models for Underlying Variables and Tradables To use the APT equations from the previous chapter. in Ito’s lemma we would be able to explicitly compute the partial derivative terms ct .2) (5. Then. 5. t). and so are the possible underlying variables. 5.1) Thus. In general. In general. at a speciﬁed date (the expiration date).
1 Relative Pricing and Marketed Tradables Once we have set up the tradables table. To see how this works.36).44 CHAPTER 5. derivative pricing is a method of pricing a new tradable (the derivative) consistently with the existing price of marketed tradables. This emphasizes the relative pricing nature of derivative pricing. 5. CONSTRUCTING A FACTOR PRICING FRAMEWORK Figure 5.4) 5. rather than the Return APT. we can tabulate prices and factor models for value changes for all tradables. a bond. 5. Factor Model Changes Prices (5. and put them all in a tradables table. To use the Price APT equation. What we mean by a marketed tradables is a tradable that is ”already priced” by the market. and a call option on the stock. For instance. When we list these in a table.2: Obtaining factor models or SDEs for tradables.5 Applying the Price APT The ﬁnal step to a derivative pricing equation is to apply the Price APT equations. we call this a tradable table. in a market with a stock. we need two things: prices and factor models for value changes. Thus. That is. We price a derivative security relative to the marketed tradables. we ﬁrst separate marketed tradables from the derivative the we would like to price. However.5) Bm Am Pm Vm = dz dt + d B A P V . r0 B 0 dt + σS dz µS 1 σScS (ct + µScS + 2 σ 2 S 2 cSS ) (5. we designate certain tradables as marketed. But note that everything in this book can also be done use the Return APT equation. before applying the Price APT equation. The previous section indicated how to obtain the factor model for value changes.5.4 Tradables tables We will approach the derivative pricing problem using the Price APT equation (4. You should contrast the marketed tradables with the derivative that we would like to price. It supplies all the basic information that we need to extract from the market when applying the Price APT. The derivative is what we ”want to price” using the information given from the marketed assets. we separate the marketed tradables from the derivative. the tradable table is Prices B S c Changes B d S c Factor Model = This is called a tradable table to emphasize the fact that only tradables need satisfy the absence of arbitrage conditions.
4) and use it to illustrate pricing.8) 1 2 2 σScS c ct + µScS + 2 σ S cSS where the top two rows correspond to the marketed tradables.5.6) to ﬁnd the market prices of risk (often in terms of information from the marketed tradables). Thus. Solving these equations gives λ0 = r0 and λ1 = µ−r0 . (5. Using marketed tradables to determine the market prices of risk is like calibrating parameters (market prices of risk) in our absence of arbitrage pricing model to known data (marketed tradables). Thus. so let’s see an example Example: Pricing a European Call Option Let’s take the tradables table in (5. 5. (5.6) (5. we only use the marketed tradables. APPLYING THE PRICE APT 45 The tradables with the subscript m are the marketed tradables.5. we use these market prices of risk in the third equation for our derivative to obtain µ − r0 1 σScS (5. µS (5. . Once our model is calibrated (the market prices of risk are determined). One may think of this procedure as ﬁrst calibration to market data. σ Finally.5.10) ct + r0 ScS + σ 2 S 2 cSS = r0 c 2 which is the BlackScholes partial diﬀerential equation for the price of an option. Then.5. This discussion has been a little abstract. The market prices of risk are then plugged into (5. and c is the derivative that we would like to price.11) 5. and two unknowns (the market price of time λ0 and the market price of risk λ1 ) corresponding to the marketed tradables. If we specify the boundary condition for a European call option as c(S.7) which is the pricing equation for the derivative. what if the system of equations arising from the Price APT for the marketed tradables is either underdetermined or overdetermined. That is. I could solve the equations for λ0 and λ1 . the price APT implies that r0 B 0 B S λ0 + σS λ1 = . t) = 0 then this completely describes the price of the option.9) ct + µScS + σ 2 S 2 cSS = r0 c + 2 σ Rearranging this gives 1 (5. First. It was perfect! Now. what if things aren’t so perfect.2 Pricing the Derivative Now we can use the Price APT to price the derivative. Assume that c is a European call option on the stock S.3 Underdetermined and Overdetermined Systems In the example above everything was perfect! I had two equations (one for the bond and one for the stock). T ) = (S − K)+ . we designate the bond B and stock S as being the marketed assets. c(0. and the tradable without any subscript is the derivative that we are pricing.3. followed by pricing the new derivative. The Price APT equation says there needs to exist λ0 and λ such that Am A = = Pm λ0 + B m λ Pλ0 + Bλ. we use the marketed tradables to determine the market prices of risk. we can apply our absence of arbitrage model to price other tradables (derivatives) that also must satisfy the same absence of arbitrage relationship. we solve (5. Let use an example.7) To obtain the market prices of risk λ0 and λ. A picture of the application of the Price APT is given in Figure 5. Thus.
we cannot uniquely infer the market prices of risk for dz1 and dz2 . Underdetermined and Incompleteness Assume that our assets are given by dB dS In this case. Since market prices of risk relate risk to reward for various factors. there are multiple possible values for λ1 and λ2 . we must select values for market prices of risk that are not uniquely deﬁned. This is because the APT equation (in either return or price form) acts as a pricing equation. λ1 . there are many possible market prices of risk that satisfy the no arbitrage condition. For example in the above equations. (5. CONSTRUCTING A FACTOR PRICING FRAMEWORK Figure 5. and λ2 ). Thus. selecting a value for . and you will see in subsequent chapters that in order to price derivative securities in incomplete markets.) However. Incomplete markets are common in practice. The practical consequence of this is that if we are asked to price a new security that depends on dz1 and/or dz2 . (This use will become clear in the following chapters. it only provides a unique price if we have unique values for the market prices of risk. This is guaranteed by the APT equations. ﬁrst. but only two equations! Thus. but not uniquely for λ1 or λ2 . we can’t uniquely solve for the market prices of risk! (In this case.) What can we say in this situation and how should we think of this? Well.46 CHAPTER 5.14) = r0 Bdt = µSdt + σ1 Sdz1 + σ2 Sdz2 .13) And we have three unknowns (λ0 . then there is no arbitrage. Let’s assume that many solutions exists. This just means that from the tradable assets in the market (B and S). we will not be able to assign it a unique absence of arbitrage price. and all are arbitrage free. There are many possibilities. we can solve for λ0 = r0 . This situation is called an incomplete market. we can say that if any solution exists (it doesn’t have to be unique. we just need for at least one solution to exist). the price APT becomes B S 0 σ1 S 0 σ2 S λ0 λ1 = λ2 r0 B µS (5.12) (5.3: A picture of the application of the Price APT.
In practice this situation often leads to some sort of calibration procedure. (See Figure 5. Apply the Price APT equation. or when certain models are ﬁt to volatility smiles and smirks. in general.) 3.17) In this case there are three equations and only two unknowns (λ1 and λ2 )! This system looks to be overdetermined! Now. 5. Identify the tradable assets. when the set of equations is overdetermined. this speciﬁcation of risk preferences is captured by the selection of the market price of risk. The above discussion might be a little abstract at this point. For example. (5.6 Three Step Procedure To summarize. ﬁrst to the marketed tradables in order to solve for the market prices of risk (calibration). That is. there is an arbitrage opportunity! But.5. THREE STEP PROCEDURE 47 a market price of risk is essentially the same as specifying how investors in the market trade oﬀ risk and return. a trader will often just assume that the models being used for B. They often call this process calibration. the way this situation often plays out in practice is usually slightly diﬀerent.3. S1 and S2 are not perfect. Write factor models (SDEs) for each tradable asset (this may involve Ito’s lemma).) Again. and factors in a model.18) = r0 Bdt = µ1 S1 dt + σ1 S1 dz = µ2 S2 dt + σ2 S2 dz. However. the ﬁrst thing it means is that strictly speaking. Overdetermined and Calibration Now let’s consider the opposite situation. let the tradable assets be dB dS1 dS2 In this case.15) (5. (Watch for it in situations such as term structure modeling where a single factor dz is used.) 2. Instead of declaring that an arbitrage exists. and in practice it may be diﬃcult to recognize directly as searching for best ﬁt λ’s. we know from the price APT that for no arbitrage to exist there must exist a solution to this set of equations.2. Furthermore. but in subsequent chapters you might want to refer back to it when faced with pricing of derivatives in incomplete markets (see for example. in incomplete markets some speciﬁcation of the risk preferences of investors is needed to assign a unique price to derivative securities. jumpdiﬀusion models or stochastic volaility). and construct a tradables table. (See Figure 5. you might want to return to this discussion after reading subsequent chapters. Thus. Thus. but many tradables (bonds of diﬀerent maturities) exist. and then to the derivative using those market prices of risk (this usually results in a partial diﬀerential equation for the price).) . the trader will search for the λ1 and λ2 that best ﬁt the absence of arbitrage equations. for an overdetermined system of equations no solution will exist! What does this mean? Well. and that is the reason that no solution exists. underlying variables. the price APT becomes B S1 S2 0 σ1 S σ2 S λ0 λ1 r0 B = µ 1 S1 µ 2 S2 (5.1.16) (5.6. let’s present a three step procedure to deriving a derivative pricing equation. (See Figure 5. The three steps are: 1.
19) Problem 5. often many diﬀerent derivative securities satisfy the same basic pde and only diﬀer due to their boundary condition. 5. then identify the factors. underlying variables. tT ). and in the following chapter we address interest rate derivatives. What you should take away is that simple arbitrage ideas that we derived in this chapter are the underpinnings of derivative pricing theory. underlying variables.2 (A Single Factor Short Rate Model) Let r0 (t) denote the short rate of interest. and create a tradables table. we have provided a three step procedure for deriving the pricing equation for a derivative security based on the linear factor approach.20) exists. Furthermore.21) Assume that zero coupon bonds of maturity T are tradables and the price at time t is denoted by B(r.48 CHAPTER 5. 5. Problem 5.8. Also assume that a bond dB = r0 Bdt (5. Let these bonds be a function of the short rate and time t. and merely note that a particular derivative security will correspond to the solution of the pde with a particular boundary condition. and I will sometimes leave out the important step of actually solving the pde!! Furthermore. then identify the factors. This cookie cutter approach will allow us to derive many of the partial diﬀerential (or diﬀerence/integral/etc.) equations that arise in derivative pricing theory.8. and create a tradables table.8 Problems (5. t) is a European call option on the stock. CONSTRUCTING A FACTOR PRICING FRAMEWORK In this book. In the next chapter we tackle examples in equity derivatives.22) In this model.1 (A Stock Paying Continuous Dividends) Consider a stock S(t) following dS = µSdt + σSdz that pays a continuous dividend at a rate of q. . and assume that it follows dr0 = adt + bdz (5. I will consider the limited scope of just deriving partial diﬀerential equations for pricing.7 Summary In this chapter. assume that a money market account exists that satisﬁes dB0 = r0 (t)dB0 dt (5. I will also often sweep that under the rug. If c(S.
Chapter 6 Application of the Factor Form: Equity Derivatives The factor approach to absence of arbitrage pricing is one of the quickest and most direct routes to deriving pdes for derivatives.3) Solving for λ0 and λ1 using the ﬁrst two equations gives: λ0 = r0 The tradable table contains all the information we need to apply the Price APT equation. we consider a derivative on the stock.5) 1 2 2 c σScS ct + µScS + 2 σ S cSS λ1 = 49 µ − r0 . r0 B 0 B . Hence. bond B.4) 1 c c σScS ct + µScS + 2 σ 2 S 2 cSS (6.1 Examples from Equity Derivatives BlackScholes Black and Scholes started everything with this model [2]. 6. and the derivative c. In this chapter we will see that it can be used to derive almost every BlackScholes type pde that occurs in derivative pricing. We can now move to Step 2 and construct a tradable table r0 B B B 0 S dt + σS dz. 2 (6.1) (6. σ Therefore.1 6. By Ito’s lemma. which is to solve the Price APT equation Pλ0 + Bλ1 = A for absense of arbitrage conditions.6) . The standard BlackScholes setup involves a bond earning a risk free rate and a nondividend paying stock that follows a GBM: dB dS = = r0 Bdt µSdt + σSdz. the tradable assets are the stock S. µS d S = (6. this allows us to move on to Step 3. (6.1.2) Furthermore. t) and is twice continuously diﬀerentiable in both its arguments. S λ0 + σS λ1 = µS (6. Generically we call this derivative c and assume that it’s price process depends on the stock and time c(S. 1 dc = (ct + µScS + σ 2 S 2 cSS )dt + σScS dz.
Let N (t) denote the number of shares held of this stock at time t.17) 0 Ke 0 t ∈ [0. t) The solution is d1 d2 = = ln(S/K) + (r0 + 1 σ 2 )(T − t) 2 √ σ T −t √ d1 − σ T − t (6. Let’s assume the stock price follows dS = µSdt + σSdz. t) = SN (d1 ) − Ke−r0 (T −t) N (d2 ) A European put option is a derivative security p(S.7) Finally.21) What we purchase is a single share of this stock. T ) c(0. when we purchase the stock. t) = = = max(K − S.14) = = = max(S − K. We also have the boundary conditions that c(0.50 CHAPTER 6.10) (6. t) p(∞. t) = ∞. T ] (6.16) (6. Hence we must consider them together as a tradable asset. (6. 0) where K is the strike price.18) (6. c(S. T ] The solution to the BlackScholes equation under these boundary conditions is d1 d2 = = ln(S/K) + (r0 + 1 σ 2 )(T − t) 2 √ σ T −t √ d1 − σ T − t (6. t) = 0 for all t and limS→∞ c(S.8) ∞ t ∈ [0. t) = Ke−r0 (T −t) N (−d2 ) − SN (−d1 ) 6.2 Dividend Paying Stocks Let’s assume that the stock is paying a continuous dividend at a rate of q. t) c(∞. Then the stock and its dividend stream is a tradable asset. 2 European Call and Put Options For a European call option. rearranging leads to the BlackScholes equation: 1 ct + r0 ScS + σ 2 S 2 cSS = r0 c.13) (6. the option is worth c(S. we are also purchasing its dividend stream. APPLICATION OF THE FACTOR FORM: EQUITY DERIVATIVES Plugging λ0 and λ1 into the last equation yields r0 c + 1 µ − r0 σScS = ct + µScS + σ 2 S 2 cSS . The assumption of a continuous dividend at a rate of q is equivalent to assuming that the number of shares N (t) grows according to the equation dN = qN dt. 0) −r0 (T −t) (6. t) with boundary conditions p(S.11) (6. σ 2 (6. That is.12) (6. T ) = max(S − K.19) (6.1. T ] c(S.22) .9) (6. T ) p(0.15) t ∈ [0. 0) 0 t ∈ [0.20) p(S. the boundary condition is that at expiration T . T ] (6. (6.
foreign currencies that are invested in a money market account are essentially a security that earns a continuous dividend. Only tradable quantities appear in the tradables table. σ (6. r0 B B 0 . the payoﬀ of the option depends on the price of the stock alone. rearranging leads to the BlackScholes equation: 1 ct + (r0 − q)ScS + σ 2 S 2 cSS = r0 c.25) If we denote the value of the share with its dividend stream by v(t) = N (t)S(t) then we have dv = v(t + dt) − v(t) = (µ + q)v(t)dt + σv(t)dz.28) Note that the stock price alone S(t) does not appear in the tradable table since it is not tradable. Now. the portfolio with value N (t)S(t) changes to N (t)S(t) → = = (N (t) + dN )(S(t) + dS) N (t)S(t) + dN S(t) + N (t)dS + dSdN N (t)S(t) + qN (t)S(t)dt + µN (t)S(t)dt + σN (t)S(t)dz + o(dt). σ 2 (6.30) Finally. and does not depend on the dividend stream.23) (6. don’t let the notation of calling S a ”stock” limit your thinking about how these models can be applied. However.31) λ1 = µ + q − r0 . v λ0 + σv λ1 = µv (6. we consider v(t) the tradable. and only those quantities need to satisfy the absence of arbitrage conditions. the point is.26) Hence. EXAMPLES FROM EQUITY DERIVATIVES Then over time dt.32) . we consider the option. Finally. For instance.29) c σScS ct + µScS + 1 σ 2 S 2 cSS 2 Solving for λ0 and λ1 gives: λ0 = r0 and plugging λ0 and λ1 into the last equation gives r0 c + 1 µ + q − r0 σScS = ct + µScS + σ 2 S 2 cSS . (6. a moment of thought more. Therefore.1. a stock index with many stock that pay dividends at diﬀerent times can be approximated as a continuous dividend. 51 (6.27) dc = (ct + µScS + σ 2 S 2 cSS )dt + σScS dz. Moreover.24) (6. So. 2 What does this apply to? At ﬁrst thought. The ﬁnal step is to solve the Price APT equation Pλ0 + Bλ1 = A.6. we assume c = c(S. (6. 2 The next step is to write the tradable table r0 B B B 0 v dt + σv dz. we realize that it is a decent approximation for a number of ﬁnancial situations. t) and apply Ito’s lemma to c to obtain 1 (6. (µ + q)v d v = 1 2 2 c c σScS ct + µScS + 2 σ S cSS (6. So can a commodity with a convenience yield. a continuous model for a dividend does not seem too realistic.
but this time on a stock that pays a continuous dividend. (6. the delta function δ(t − τ ) is really just used to indicate that a jump occurs at time τ . since we know exactly when the jump will occur. since (6. v. you may recognize that this situation is very similar to a process driven by a Poisson process. t) is a derivative security. we model the stock price as dropping by D exactly at the time τ . This is too be expected.3 Cash Dividends Most individual stocks pay a prespeciﬁed dividend at a prespeciﬁed time. t))δ(t − τ ))dt + σScS dz 2 (6. r0 B B B 0 v dt + σv dz µv d v = 1 2 2 − − c c σScS ct + µScS + 2 σ S cSS + (c(S − D. we can consider the value of a European call option. t))δ(t − τ ) The ﬁnal step is to solve the Price APT equation r0 B B 0 v λ0 + σv λ1 = µv 1 2 2 − − c σScS ct + µScS + 2 σ S cSS + (c(S − D. 6. Call this value v(t). This notation is just to indicate that the price drops by exactly D at time τ . and with this we will get the stock price plus the dividend. we have identiﬁed the tradables B.36) Now.34) (6. The above equation just indicates that we use Ito’s lemma for Brownian motion up to. However.42) diﬀers from the nondividend BlackScholes equation only slightly. we will use dirac delta notation and write it as dS = (µS − Dδ(t − τ ))dt + σdz (6. Therefore. t) − c(S . and we apply Ito’s lemma to obtain 1 dc = (ct + µScS + σ 2 S 2 cSS + (c(S − − D. since they have discrete jumps. except that q shows up in a couple of places. t) − c(S − . we assume that c(S.38) I have not described how to apply Ito’s lemma in this situation before. To model this.37) where δ(t − τ ) is a dirac delta function at time τ .33) (6. This entire stream together it what is tradable. However. the bond is a tradable. The solution is d1 d2 1 ln(S/K) + (r0 − q + 2 σ 2 )(T − t) √ σ T −t √ = d1 − σ T − t = (6. and now we can write the tradables table. Now. and c. where it satisﬁes dv = µvdt + σvdz.35) c(S. at time τ the jump occurs. Once again.52 CHAPTER 6. Now.39) . Of course. and after the jump time τ . it is even simpler. t) − c(S . To model this drop. we can once again use our three step procedure. To compute the equation satisﬁed by an option on a stock that pays a cash dividend prior to expiration. t))δ(t − τ ) (6. Therefore.1. which causes c to drop by exactly D. we can purchase a share of the stock. APPLICATION OF THE FACTOR FORM: EQUITY DERIVATIVES European Call Option Once again. t) = Se−q(T −t) N (d1 ) − Ke−r0 (T −t) N (d2 ) Note that this is very similar to the formula for a European call option on a nondividend paying stock. we consider that the value corresponding to a single share is continuous and follows a geometric Brownian motion. In this case. We often call this either a cash or lump dividend. we assume that the stock pays a dividend at the time τ in the amount of D.
t) − c(S − . (6. λ1 = S λ0 + µS + ν(k − 1)S (k − 1)S ct + µScS + ν(c(kS − ) − c(S − )) c(kS − ) − c(S − ) c Solving for λ0 and λ1 gives: λ0 = r0 Plugging λ0 and λ1 into the last equation yields r0 c + λ1 = µ − r0 + ν. In this case. Furthermore.1. rearranging leads to the equation: (6.49) µ − r0 + ν (c(kS − ) − c(S − )) = ct + µScS + ν(c(kS − ) − c(S − )) k−1 (µ − r0 ) (c(kS − ) − c(S − )) = (k − 1) (ct + µScS − r0 c) .44) where ν is the intensity of the Poisson process π(t. I will compensate the factor in order to give it zero drift. This setup involves a bond earning a risk free rate and a nondividend paying stock that follows a geometric Poisson motion (GPM): dB dS = = r0 Bdt µS dt + (k − 1)S dπ(ν) − − (6.40) (6. By Ito’s lemma.43) (6. which is to solve the Price APT equation Pλ0 + Bλ1 = A for absence of arbitrage r0 B 0 B . t) − c(S − . t) and is continuously diﬀerentiable in both its arguments. 2 λ1 = 53 µ − r0 σ (6. Generically we call this derivative c and assume that its price process depends on the stock and time c(S.51) .47) (6. I would like to write my factor equations such that the factors are pure risk and don’t have any mean drift.46) (6. ν). my tradables are dB dS dc = = = r0 Bdt (µS + ν(k − 1)S)dt + (k − 1)S(dπ(ν) − νdt) (ct + µScS + ν(c(kS − ) − c(S − )))dt + (c(kS − ) − c(S − ))(dπ(ν) − νdt) (6.41) (6.1. EXAMPLES FROM EQUITY DERIVATIVES Solving for λ0 and λ1 gives: λ0 = r0 Plugging λ0 and λ1 into the last equation yields µ − r0 1 σScS = ct + µScS + σ 2 S 2 cSS + (c(S − − D. (6. Hence. rearranging leads to the BlackScholes equation: r0 c + 1 ct + r0 ScS + σ 2 S 2 cSS + (c(S − − D.6.45) This is a case where the random factor does not have zero mean. k−1 (6.4 Poisson Processes This was done by Cox and Ross [5]. dc = (ct + µScS )dt + (c(kS − ) − c(S − ))dπ(ν).50) Finally. t))δ(t − τ ) σ 2 Finally. we consider a derivative on the stock.42) 6. t))δ(t − τ ) = r0 c.48) We can now construct a tradables table B S c B r0 B 0 dt + (dπ(ν) − νdt) µS + ν(k − 1)S (k − 1)S d S = c ct + µScS + ν(c(kS − ) − c(S − )) c(kS − ) − c(S − ) This allows us to move on to Step 3.
one can interpret it as the BlackScholes formula.57) d f = 1 c c σf cf ct + µf cf + 2 σ 2 f 2 cf f Recall that the futures contract was the special case that motivated us to consider the price approach to absence of arbitrage.52) (6. This allows us to move on to Step 3. ln(k) Note that this solution has a structure that is very similar to the BlackScholes fomula. The key point was that the marktomarket mechanism always sets the price of a futures contract to zero while the change in value of the contract over period dt is given by the change in the futures price df . 1 (6. 6. (6. t) and is twice continuously diﬀerentiable in both its arguments.1. y) − Ke−r0 (T −t) Ψ(x. I will give the solution for a call option with strike K and expiration T : c(S.61) 2 . APPLICATION OF THE FACTOR FORM: EQUITY DERIVATIVES European Call Option One can actually ﬁnd a closed form solution for European call and put options in this model.54) (6.55) Furthermore. σ 2 Finally. By Ito’s lemma. Generically we call this derivative c and assume that it’s price process depends on the futures price and time c(f. In fact.59) σ Plugging λ0 and λ1 into the last equation yields 1 µ (6. (6. 2 We can now complete Step 2 and construct a tradables table r0 B B B 0 0 dt + σf dz. r0 B 0 B 0 λ0 + σf λ1 = µf (6. y/k) where Ψ(α. rather than working with returns. µf (6.53) e−β β i . which is to solve the Price APT equation Pλ0 + Bλ1 = A for absense of arbitrage. (6.54 CHAPTER 6.58) 1 2 2 σf cf c ct + µf cf + 2 σ f cf f Solving for λ0 and λ1 gives: µ λ0 = r0 λ1 = .56) dc = (ct + µf cf + σ 2 f 2 cf f )dt + σf cf dz. This corresponds to the tradable table we have written above. t) = SΨ(x.5 Options on Futures This was essentially done by Black in [1]. Let’s derive the pde satisﬁed by an option on a futures contract.60) r0 c + σf cf = ct + µf cf + σ 2 f 2 cf f . we consider a derivative on the futures price. Assume there exists a bond and a futures contract with futures price f given as dB df = = r0 Bdt µf dt + σf dz. but with a Poisson random variable replacing the Gaussian random variable. β) = i=α ∞ (6. i! y= (r0 − µ)(T − t)k k−1 and x is the smallest nonnegative integer greater than ln(K/S)−µ(T −t) . rearranging leads to the partial diﬀerential equation: 1 ct + σ 2 f 2 cf f = r0 c.
However. in this case that is not possible unless we consider c(Y S − ) − c(S − )dπ a new factor. at this point we have no choice.61).63) (6. A derivative on the stock c(S.1. Merton originally solved this problem using a similar technique.62) (6. we would rather not do this. looks just like the BlackScholes equation on a stock paying a continuous dividend.71) Finally. We would like to be able to write down a linear factor model in the factors dz and Y dπ or a compensated version of of Y dπ. EXAMPLES FROM EQUITY DERIVATIVES European Call Option 55 Note that the BlackScholes equation in this case (6. We will also ﬁnd that this model creates a problem for our factor approach.66) where the jump portion of the stock has been compensated. t) is a function of S and t. we would solve the Price APT equations B 0 0 0 r0 B λ1 S λ0 + σS S 0 λ2 = (µ + αE[Y − 1])S c λ3 σScS 0 1 Lc (6. By Ito’s lemma we have dc = Lcdt + σScS dz + (c(Y S − ) − c(S − ))dπ − αE[(c(Y S − ) − c(S − ))]dt where (6.1.6. since this risk is driven by the same Poisson process and jump size Y . However. let’s proceed by considering ((c(Y S − ) − c(S − ))dπ − αE[(c(Y S − ) − c(S − ))]dt) and (Y − 1)dπ(α) − αE[Y − 1]dt as two diﬀerent factors.65) (6. For notational convenience. Let’s get started. The problem is how to identify factors.67) 1 Lc = ct + µScS + σ 2 S 2 cSS + αE[(c(Y S − ) − c(S − ))] (6. Here are the basic assets dB dS = = r0 Bdt (µ + αE[Y − 1])Sdt + σSdz + S − ((Y − 1)dπ(α) − αE[Y − 1]dt) (6.72) . we can just plug into that formula to obtain the price of a European call option. t) = f e−r0 (T −t) N (d1 ) − Ke−r0 (T −t) N (d2 ). 6. and we will bypass the problem in the same manner that Merton did. Yet.64) c(f. however the continuous dividend rate is q = r.6 Jump diﬀusion A jump diﬀusion model was ﬁrst solved by Merton [11]. let’s call them dψ1 dψ2 = = (Y − 1)dπ(α) − αE[Y − 1]dt (c(Y S − ) − c(S − ))dπ − αE[(c(Y S − ) − c(S − ))]dt 0 S 0 dz 0 0 dψ1 dψ2 1 (6. This model has a closed form solution for the derivative price which Merton computed in his original paper [11]. Therefore. The model includes a risk free bond and an underlying asset that has a diﬀusion portion and a lognormal jump portion. So.69) (6. This model is nice because it is related to many other models in equity and interest rate derivatives. The randomness associated with the jump term does not enter in the same way to S and c. This is where we run into a problem.70) Then our tradables table is 0 r0 B B B S d S = (µ + αE[Y − 1])S dt + σS σScS Lc c c (6. The solution is d1 d2 = = ln(f /K) + ( 1 σ 2 )(T − t) √ 2 σ T −t √ d1 − σ T − t (6.68) 2 Now we come to the point that we need to write a tradables table.
2 (6. conditional on the number of jumps. and γ = ln(1 + k). stock distribution at expiration is lognormal.. This is fairly messy and leaves a lot of degrees of freedom. Y = 0. This means that according to our model above. Lognormal Jumps! When Y the jump size is lognormal. T − t. it is given by ∞ c(S. then the value of call option will actually increase! You should think about this carefully to understand why that is true. Note that it is basically a combination of the BlackScholes formula and the solution under Poisson dynamics. We also have that k = E[Y − 1]. That is. the Price APT equations Pλ0 + Bλ = A become r0 B B 0 S λ0 + σS λ1 = (µ + αE[Y − 1])S . we can write out the full pde which will have a number of unknown market prices of risk. T. r0 − λk + n! T −t T −t ′ (6. if the rate of bankruptcy increases. and there is a closed form solution to the pde for European call and put options.73) Lc c σScS λ1 = µ + αE[Y − 1] . the market price of risk of any risk associated with the jump term is zero! This is the same as setting λ2 = λ3 = 0. With λ2 = λ3 = 0. In fact. σ (6. In the BlackScholes formula. this model is really the prototype for defaultable bonds. Then the above equation becomes 1 ct + (r0 + α)ScS + σ 2 S 2 cSS = (r0 + α)c. Instead he make the assumption that all jump risk is diversiﬁable.74) µ + αE[Y − 1] − r0 σ σScS = Lc (6. Of course.77) This looks like standard BlackScholes but the interest rate has been increased by the default probability! We will see that this same relationship will also appear in defaultable bonds. The key is that under this jump diﬀusion model. and with risk free rate r0 . r0 ) is the BlackScholes formula for a European call option with strike K and expiration T on a nondividend paying stock with current price S.78) where cBS (S. λ′ = λ(1 + k). the value of a European call option increases with the risk free rate. we can give a closed form solution in this case based on the BlackScholes formula. This is a strong assumption! But let’s follow Merton and see where this leads us. 2 Bankruptcy (6. Note the following counterintuitive observation. t) = n=0 e−λ (T −t) (λ′ (T − t))n nδ 2 nγ cBS S. K. K. volatility σ. and the ﬁnal equation becomes r0 c + which can be rewritten as Now.75) 1 ct + (r0 − αE[Y − 1])ScS + σ 2 S 2 cSS = r0 c − αE[(c(Y S − ) − c(S − ))]. For a European call option with strike K and expiration T . APPLICATION OF THE FACTOR FORM: EQUITY DERIVATIVES Now we can solve for the market prices of risk as.76) Let’s consider a special case of complete bankruptcy. the price . Merton doesn’t do this.. σ. σ 2 + . then conditional on the number of jumps that have occurred before expiration. That is. λ0 = r0 . (6.56 CHAPTER 6. This formula can be found in Merton’s work [11].
Let’s use the ﬁrst three equations to solve for λ0 . indicating that the solution should look like BlackScholes.85) In this case we have four equations and only three unknowns.1. tradable assets will be called the bond B and two other assets S1 and S2 . Thus. µ 2 S2 Lc (6. we obtain Lc = r0 c + which upon rearrangement becomes 1 2 2 1 2 2 ct + r0 S1 cs1 + r0 S2 cs2 + σ1 S1 cs1 s1 + σ2 S2 cs2 s2 + ρσ1 σ2 S1 S2 cs1 s2 = r0 c. Therefore. then we need to weight them by the probability of that number of jumps. in this case. we have a BlackScholes formula.88) µ1 − r0 µ1 − r0 σ 1 S1 c S 1 + σ 1 S1 c S 1 σ1 σ1 (6. For instance. λ1 and λ2 as µ2 − r0 µ1 − r0 λ2 = .1. we solve the Price APT equations 0 B σ 1 S1 S1 S2 λ 0 + 0 σ 1 S1 c S 1 c λ1 λ2 (6. (6. The only question is how many jumps have occurred. 2 2 (6. which is Poisson.83) dz1 dz2 .6.87) .81) (6.82) 1 2 2 1 2 2 L1 c = (ct + µ1 S1 cS1 + µ2 S2 cS2 + σ1 S1 cS1 S1 + σ2 S2 cS2 S2 + ρσ1 σ2 S1 S2 cS1 S2 ) 2 2 and ρ is the correlation coeﬃcient between z1 and z2 . EXAMPLES FROM EQUITY DERIVATIVES 57 distribution at expiration is lognormal. Why? because many other derivatives can be thought of as exchanging two diﬀerent assets. for each possible number of jumps.80) (6. Their dynamics are given by: dB dS1 dS2 dc where = = = = r0 Bdt µ1 S1 dt + σ1 S1 dz1 µ2 S2 dt + σ2 S2 dz2 L1 cdt + +σS1 cS1 dz1 + σS2 cS2 dz2 (6.86) λ0 = r0 λ1 = σ1 σ2 If we plug these into the ﬁnal equation. the tradable table is B S1 S2 c r0 B B S1 µ 1 S1 d S2 = µ 2 S2 Lc c 0 σ 1 S1 dt + 0 σ 1 S1 c S 1 0 0 σ 2 S2 σ 2 S2 c S 2 0 0 σ 2 S2 σ 2 S2 c S 2 r0 B µ S = 1 1 . Simple! 6. can a vanilla call option be thought of as exchanging one asset for another? What are the two assets being exchanged? Anyway. (6. the answer is BlackScholes. Hence.84) Finally.79) (6. the solution is basically that conditioned on the number of jumps that have occurred.7 Exchange one asset for another Margrabe’s [10] exchange one asset for another is one of my favorite derivatives.
T ) = max(S2 − S1 .101) . 0). (6. Here is some intuition into why. S1 S1 dividing S2 by S1 is essentially changing units in order to value S2 in units of S1 . 6.58 CHAPTER 6. (What if I didn’t assume the derivative was a function of v? This would have been a bad assumption since I know in the BlackScholes case that c is a function of the volatility. 0. The closed form solution for this option is c(S1 . t) = S2 N (d1 ) − S1 N (d2 ) where d1 d2 σ = = = 1 ln(S2 /S1 ) + 2 σ 2 (T − t) √ σ T −t √ d1 − σ T − t 2 2 σ1 + σ2 − 2ρσ1 σ2 . Our derivative can depend on S. some problems (such as this one) are easier to solve in a convenient set of units.8 Stochastic volatility dB dS dv = r0 Bdt √ = µSdt + vSdz1 = adt + bdz2 (6. in units of S1 . Assume that you have the option to exchange asset S1 for asset S2 at time T . S1 Now. the portion max( S2 − 1.97) dc = Lcdt + vScS dz1 + bcv dz2 √ 1 1 Lc = (ct + µScS + acv + vS 2 cSS + b2 cvv + ρb vScSv ) 2 2 In this case the tradables are dB dS dc = = = r0 Bdt √ µSdt + vSdz1 √ Lcdt + vScS dz1 + bcv dz2 where (6. T ) = max(S2 − S1 .89) (6.95) (6. 0) looks like the payoﬀ of a call option with strike 1 on the asset S2 ! In fact. and we call the asset S1 the numeraire asset. S2 . 0) = S1 max( − 1. S2 . Hence. t) = 0.94) (6.99) (6. t. t). we call that a change of numeraire.100) (6. this is just a call option on S2 with strike 1! The multiplication on the left by S1 just converts the call option back to units of dollars.96) First.1. Let’s look at the payoﬀ S2 c(S1 . APPLICATION OF THE FACTOR FORM: EQUITY DERIVATIVES Closed form solution There is a closed form solution for a European exchange one asset for another. we shouldn’t be surprised that the solution looks like BlackScholes.93) Note that this solution looks very much like BlackScholes. 0). and v. Hence. S2 . we write c(S. S2 . the boundary condition is c(S1 . we can list the relevant equations and we assume that z1 and z2 are correlated E[dz1 dz2 ] = ρdt. v.90) (6.98) (6. Therefore.91) (6. Would I end up getting the wrong answer?) By Ito’s lemma we have: √ (6. Needless to say. c(S1 .92) (6. Therefore. because this is really just a call option in a diﬀerent set of units. c(0. When we change units by denominating everything in terms of another asset (such as S1 in this case). t) = S2 .
this pde has a fairly convenient solution via transform methods.14) in this case. Plugging these into the ﬁnal equations leads to √ 1 1 ct + r0 ScS + (a − λ2 b)cv + vS 2 cSS + b2 cvv + ρb vScSv = r0 c 2 2 (6. and pays nothing if it ends up below the strike price.103) λ1 = µ − r0 √ . Draw a picture of the payoﬀ as a function of ST . derive the price of a digital option with strike K in terms of c(S. What is the relationship between a ramp.2.104) Note that we leave λ2 as an unknown. K.6. for a European call option.2. derive the ”price” of a security whose payoﬀ is a dirac delta function at K at expiration T . (a) Let T be the expiration date for the digital option. Using only this information. t) then you have the standard risk neutral pricing formula when the underlying asset follows a geometric Brownian motion.102) r0 B = µS . See Heston [9] if interested. (If you plug in the BlackScholes formula for c(S. K. step and delta function?) . I won’t cover this in detail here. In particular. We can think of the payoﬀ of a European call option as a ramp function. (6.2.14) is a special case of exchange one asset for another where one asset is the stock S(t) and the other is the bond B(t). K. show that the exchange one asset for another formula (6.105) Under speciﬁc choices of a and b. A digital option looks like a step function.) (e) As one ﬁnal task. c(S. and ﬁnally we price a dirac delta function. 6.) (c) Taking the above argument one step further. (Hint: think of how you can construct the payoﬀ of a digital option by buying and selling call options.1 Show that the BlackScholes formula (6. K. we solve the Price APT equations B √0 S λ0 + √ vS c vScS B S c B r0 B √0 d S = µS dt + √ vS vScS c Lc 0 0 bcv λ1 λ2 0 0 bcv dz1 dz2 . Problem 6. price a security whose payoﬀ is (ST − K)2 for ST ≥ K and 0 for ST < K. Note that if you have made it this far.2 (The power of linearity (Breeden and Litzenberger)) We will use linearity to derive the formula for a European digital option in terms of a European call option price. v (6.2 Problems Problem 6. (d) Use the result of (c) to derive a general pricing formula for an arbitrary payoﬀ at time T. (Here is a big hint if you are having trouble with this problem. t)). t). A digital option is an option that pays oﬀ 1 if the underlying stock ends up above the strike price K. Lc (6. you have ”derived” the risk neutral pricing formula in terms of c(S. PROBLEMS So the tradable table is 59 Solving for the market prices of risks λ0 and λ1 gives λ0 = r0 Finally. (b) Assume that you know the price of European call options for every strike K for the expiration date T (i.e.90) reduces to the BlackScholes formula (6. t).
Derive this. 0]. Derive a parity relating S1 . (c) Using the BlackScholes formula. Using these substitutions. T ) = max[S1 − S2 . t) = S2 f (v. Is there a contradiction here? Problem 6. S2 .106) (6. After the jump. if you currently hold S2 . This is a derivative security with payoﬀ c(S1 . That is. What equation must f (v.4 (Exchange one asset for another) In this problem you will derive the formula for an option to exhange one asset for another by reducing the pde to a standard BlackScholes pde for a call option on geometric Brownian motion. (c) Assume the payoﬀ of a derivative security on a nondividend paying underlying stock is a convex function that is nonpositive when the stock is zero. write the pde from (a) in terms of S2 .107) where E[dz1 dz2 ] = ρdt and assume that a risk free asset exists with constant interest rate r. Is it ever optimal to exercise this early? (f ) Merton showed that it can be optimal to exercise a put option early. f . However.2.) (b) Consider the change of variable v = S1 /S2 and assume that the solution of the above pde is of the form c(S1 .2. but at random times the market goes wild and the instantaneous variance jumps by an amount b. this time we model the instantaneous variance with a Poisson process. write down a formula for the value of an option to exchange S1 for S2 . First write down the relevant dynamics for this problem. (Hint: your dynamics should look like: √ dS = µSdt + vSdz 2 dv = a(σl − v)dt + bdπ where π is a Poisson process with intensity α. But. t). Prove that if this is an American option. APPLICATION OF THE FACTOR FORM: EQUITY DERIVATIVES Problem 6. Problem 6.) . t) satisfy? What is the appropriate boundary condition for a European option to exchange S2 for S1 in terms of the new variables. it is never optimal to exercise early. this pde can be in terms of a market price of risk. t. in which case.3 All about PutCall parity and Early Exercise. the American counterpart would never be exercised early according to Margrabe. (Hence. S2 and the options to exchange S1 for S2 and viceversa. and v. for another S1 (assume they are nondividend paying) at expiration T . the instantaneous variance exponentially decays back to its normal level σl . even on a nondividend paying stock. S2 . 0). the payoﬀ is max(S1 − S2 . (a) Let S1 and S2 be two assets whose dynamics are given by dS1 dS2 = = µ1 S1 dt + σ1 S1 dz1 µ2 S2 dt + σ2 S2 dz2 (6.5 Consider a European call option on a nondividend paying stock with stochastic volatility. (d) Consider the European option to exchange one asset. then derive a pde for the price of the option. (e) (Margrabe) Consider an American option to exchange one asset for another (nondividend paying again). Again. we can think of a European put option as exchanging a stock for a bond. Write down the pde for an option to exchange S2 for S1 at expiration T . this option would give you the right to exchange it for S1 at time T .2. (b) Early exercise is not optimal for a standard American call option on a nondividend paying stock.60 CHAPTER 6. S2 . The idea is that most of the time 2 the instantaneous variance is σl . (a) Derive putcall parity for European call and put options on a nondividend paying stock.
05.108) is known as the implied volatility. T. T. For a range of strike prices from K = [0. you will generate prices under Merton’s jumpdiﬀusion model. Use an initial stock price of S(0) = 1. Then the value of the volatility parameter σimpl that satisﬁes cm = cBS (S. ﬁrst compute the BlackScholes price of options using σ = 0. Use the same parameter values as in part (a) (i. and also jump intensity of λ = 2. K.3. PROBLEMS 61 Problem 6. and a risk free rate of r = 0.3.2. a. and compute implied volatility curves. expiration T . b. T = 0. expiration of T = 0.e. and σ = 0. r0 . 1. S(0) = 1.2].8. K. σ) (assuming the stock is nondividend paying) is the BlackScholes formula for a European call option. You should see a slight implied volatility ”smile”! Problem 6. r = 0.2. jump mean of νJ = 0 and jump standard deviation of σJ = 0.6. This time.3). Once again plot the resulting implied volatility curve.7 (American Options) Use the results of Chapter 4. then assume that they represent market prices and plot the resulting implied volatility curve as a function of K. In this exercise. and if cBS (S. σimpl ) (6.1.3.6 (Matlab Exercise) This exercise will introduce you to the implied volatility curve. Problem 3 to derive the pde conditions followed by an American call or put option where the underlying stock (nondividend paying) and bond follow dS dB = = µSdt + σSdz r0 Bdt. r0 .2.05. use Merton’s jumpdiﬀusion model to to generate the market prices. If cm is the market value of a call option with strike K. .
62 CHAPTER 6. APPLICATION OF THE FACTOR FORM: EQUITY DERIVATIVES .
the truth is that some of these models (such as HJM) are better suited for the risk neutral approach to derivative pricing (A subject that is touched upon in Chapter 9) simply because the pdes that we obtain are often too large and diﬃcult to solve in any reasonable manner. However. we will need to establish some notation. Okay. the calibration phase is in determining λ values that best ﬁt all the marketed tradables (In theory if there isn’t a perfect ﬁt. This makes the APT equations overdetermined! Thus. let’s get to pricing. at the end of your studies of derivative pricing you will be able to ”translate” between them. Hopefully. it is extremely instructive to see that they can be understood via the factor approach. For many equity derivatives. the calibration phase does not play prominently into the analysis. Again. the emphasis is showing that the pdes describing derivative pricing can all be easily derived from the simple factor approach equations. we apply the factor approach to interest rate and credit derivatives. then pricing a derivative just proceeds by solving the appropriate pde with boundary conditions using the calibrated λ. Since I have mentioned risk neutral pricing. in many equity derivative models the market is either complete in which case the calibration phase of determining λ is simple. However in practice we simply recognize that our model is too simple and try to ﬁnd a bestﬁt λ). For any derivative pricing problem.3 of Chapter 5. but not consider speciﬁc derivatives after that point. Interest rate and term structure models tend to be the opposite. 7. That is. the ﬁrst challenge before being able to price a derivative is the calibration phase as in Figure 5. we will often bring the models up to the point of calibration.. (Note that the notation is similar to the constant risk free rate r0 used in the previous chapter. I should also mention that ”calibration” in risk neutral pricing often looks diﬀerent from what I have called the calibration phase (which is just determining the market prices of risk from the marketed tradables). Once calibration is done. Before getting started. Let’s start with the short rate process.Chapter 7 Application of the Factor Form: Interest Rate and Credit Derivatives In this chapter. Beyond this.1 Notation and the Money Market Account In dealing with bonds and interest rate derivatives. we may use a single factor. λ0 . many marketed tradables. The short rate r0 (t) is the interest rate earned from time t to time t + dt quoted using continuous compounding. in what follows. let’s make a few comments about interest rate modeling and derivatives. they are really the same thing. then an arbitrage is available.) 63 . Because calibration is so important in interest rate and term structure models. or the market is incomplete in which case the APT equations are underdetermined and one is left with a degree of freedom in choosing λ. Nevertheless. just described using a diﬀerent ”language”. but have many. In fact. This is because the short rate plays the same crucial role as the constant risk free rate in that it deﬁnes the market price of time..
We assume that the zero coupon bonds are functions of the short rate. Note that this is the case even though r0 (t) itself can be random and follow a stochastic diﬀerential equation of its own.2.2.1: The Short Rate Process Furthermore.3) (7. we deﬁne the money market account B0 (t) as the value of an account that is continuously rolled over at the instantaneous short rate. Figure 7. B(r0 . this variable is driven by a single factor. That is. then you reinvest that amount over the next dt at the rate r0 (t + dt).2) where the time dependence is supressed in r0 = r0 (t). The tradables are a money market account. Furthermore. we can write dB0 dB(T ) = = r0 B0 dt 1 (Bt (T ) + aBr (T ) + b2 Brr (T ))dt + bBr (T )dz 2 (7. Pay careful attention to the role that the money market account plays in what follows. Then suppressing arguments. The short rate process is important because it can be used to drive the entire spot rate curve. t) can be functions of r0 and t. the money market account B0 (t) follows the dynamics dB0 (t) = r0 (t)B0 (t)dt (7. b = b(r0 . so don’t always assume that a variable is a constant if no arguments are explicitly listed. Hence. APPLICATION OF THE FACTOR FORM:INTEREST RATE AND CREDIT DERIVATIVES On a plot of the term structure. we will suppress these arguments. That is. Hence. This is pictured in Figure 7. t).2 7. 7.1) The money market account often plays a special role in interest rate derivatives because its dynamics are not explicitly driven by a random factor.1. Hence. tT ).64CHAPTER 7. and continue this.1 Interest Rate Derivatives Single Factor Short Rate Models The simpliest interest rate derivative model has a single underlying variable: the short rate. these models are typically called single factor short rate models. the short rate r0 (t) describes the instantaneous return on the money market account. you invest your money at the rate r0 (t) over the next dt. and a = a(r0 . which we denote by B0 and zero coupon bonds with face value of $1 of varying maturity that we denote by B(tT ) where t is the current time and T is the maturity date. When convenient. we will model this short rate variable as dr0 = adt + bdz (7. the short rate is where is spot rate curve intersects the yaxis as in Figure 7.4) . Vasicek [15] was the ﬁrst to recognize that pricing with absence of arbitrage was possible by modeling the instantaneous short rate.
bond options.2.2. diﬀerent bonds will often give diﬀerent values of λ1 . So.7) The boundary condition is of course that B(T T ) = 1. instead of using a single factor model. . and Br . Then the second equation gives 1 (Bt (T ) + (a − λ1 b)Br (T ) + b2 Brr (T )) = r0 B(T ) 2 (7. t) ∈ Rn . since in general we have many bonds. t) = X i . In fact. That is. any one of them should allow us to solve for λ1 . where X i is the i − th factor.2 MultiFactor Short Rate Models The single factor short rate models are usually not good enough to describe the term structure well. we solve the Price APT equations B0 B(T ) λ0 + 0 bBr (T ) λ1 = r0 B0 1 (Bt (T ) + aBr (T ) + 2 b2 Brr (T )) (7. indicating that the model is not exactly correct.2: Notation for Zero Coupon Bond Prices where Ito’s lemma was used to derive the equation for dB(T ). one could then use this model to price other interest rate derivatives such as caps. Let X be a vector in Rn of underlying variables aﬀecting the term structure. Note that one possibility is to have the short rate be one of the variables in X.6) The ﬁrst equation gives λ0 = r0 which is the random short rate. After this calibration phase. t)dz (7. t) ∈ Rn×n and f (X. etc. not that for convenience when using this partial derivative notation we supress the subscript r0 and write Br = Br0 . represent the ﬁrst and second partial derivatives with respect to the short rate r0 . and thus this is interpreted as a calibration phase. respectively. In practice. We assume that these variables follow a stochastic diﬀerential equation model dX = f (X. (Thus. This equation must hold for zero coupon bonds of any maturity.7.) We then assume that the short rate is a function of the underlying variables X. t). INTEREST RATE DERIVATIVES 65 Figure 7. When one tries to determine a single best ﬁt λ1 from the bond data. Thus. g(X. Note that it is in terms of a market price of risk. Brr . ﬂoors. r0 (X. t)dt + g(X. it is possible to choose r0 (X. In particular. we can use a multifactor model as follows. we are treating all the bonds as being marketed tradables. 7.5) Finally.8) where z ∈ Rn is a vector of uncorrelated Brownian motions. Now let’s write down the tradable table: B0 B(T ) d B0 B(T ) = r0 B0 1 (Bt (T ) + aBr (T ) + 2 b2 Brr (T )) dt + 0 bBr (T ) dz (7.
t)g T (X. t)]) = r0 (X. we have that the instantaneous short rate is given by r(tt) = r(t). 7. t (7. That is.2. to derive the pde that zero coupon bonds would follow. and to test the ﬁt for diﬀerent values of λ. Why is this helpful? Because this will allow us to pull the Price APT relationship for the marketed tradables B(tT ) back to the underlying variables r(ts) which opens up the possibility of doing calibration . let r(ts) denote the instantaneous forward rate seen from time t of the forward interest rate between time s and s + ds. we have B0 B(T ) λ0 + 0 BX (T )g(X. we have B(X. by the price APT. Thus. t)λ) + T r[BXX (T )g(X. Now. then we can quickly adjust λ to best ﬁt the market data of bonds. t)] where I have suppressed the X and t arguments in B. t)B0 0 dz dt + 1 Br (T )g(X. and given that it describes bonds as a function of λ.13) This model is a bit more complicated than the previous single and multifactor short rate models because we see through equation (7. This can be a pretty complicated pde. With this notational convention.66CHAPTER 7. APPLICATION OF THE FACTOR FORM:INTEREST RATE AND CREDIT DERIVATIVES Now. Hence. Then the second equation gives 1 (Bt (T ) + BX (T )(f (X. t) + 2 T r[BXX (T )g(X. ∂T (7.12) HJM takes the instantaneous forward rates as the underlying variables. t)B(T ) 2 (7. we just note that bonds are a function of X and t. we will be able to make that relationship concrete by plugging in explicit partial derivatives into Ito’s lemma. t)]) (7.11) ∂ ln(B(tT )).3 HeathJarrowMorton In the HeathJarrowMorton [6] framework. t)g T (X. if we were to write pricing formulas for derivatives. bond prices) to calibrate λ. we also have that r(tT ) = − r(ts)ds . t)+ 2 T r[BXX (T )g(X. tT ).11) gives us an explicit relationship between the tradables and the (iniﬁnite) underlying variables. instead of using a generic Ito’s lemma relationship between B(tT ) and the underlying factors r(ts). T ]! Because of this. we might have to solve the pde every time. instead of modeling the instantaneous short rate as driving the term structure. they would appear as inﬁnite dimensional pdes! Nevertheless. t)B0 1 (Bt (T ) + BX (T )f (X.10) The boundary condition is of course that B(T T ) = 1. t) Bt (T )+BX (T )f (X.11) that B(tT ) is a function of an inﬁnite number of underlying variables r(ts) for s ∈ [t. t) which is the random short rate. and models them as dr(ts) = µ(ts)dt + σ(ts)dz(t). The ﬁrst equation gives λ0 = r0 (X. and a zero coupon bond with maturity T seen from time t will be denoted by B(tT ) and related to the instantaneous short rates by T B(tT ) = exp − Hence. t)g T (X. t) − g(X. It is much easier if the solution to the pde is known in closed form as a function of λ. we could use the current term structure (or equivalently. This calibration can be quite diﬃcult because bond prices are described by a pde. they decided to model instantaneous forward rates. Via Ito’s lemma. our tradables become B0 B(T ) d B0 B(T ) = r0 (X. this model has an advantage over the previous single and multifactor short rate models in that equation (7.9) where λ ∈ Rn . t) λ= r0 (X. (7.
. If it is easier to work with market data about instantaneous forward rates. and compare that with using forward rate data to calibrate in the HJM model that we will derive.15) n n dB = Bt (ts) + µ(t. let us think B(tT ) = B({r(tsk ) : k = 1. we will take zero coupon bonds as our marketed tradables. INTEREST RATE DERIVATIVES 67 Figure 7. Okay. we would have n (7. let’s begin by assuming that s is indexed by k = 1. si )Br(tsi ) + i=1 j=1 i=1 1 σ(tsi )σ(tsj )Br(tsi )r(tsj ) dt 2 n + σ(tsi )Br(tsi ) dz i=1 Now. using the explicit relationship between the bonds B(tT ) and the underlying variabels r(ts) will allow us to pull the Price APT relationship back to the underlying variables r(ts) which would then allow us to calibrate using r(ts) directly.. That is. let’s dive into the details. by Ito’s lemma. That is. Br(tsi ) .. to complete our computation of Ito’s lemma.e. To understand the value of this.. T ] through the equation T B(tT ) = exp − r(ts)ds t (7. tT ) Then. no closed form solution to the pricing pde is known). then this can simplify the calibration process. T ) will only depend on n Ito processes. However. We will derive the Price APT equations marketed tradables which are the bonds B(tT ). which means that we need to calculate dB(tT ) for our tradables table. This is a bit of a tricky calculation because B(tT ) is really a function of an inﬁnite number of Ito processes r(ts) for s ∈ [t.n}. you should consider the calibration procedure that has to be done for a single or multifactor short rate model if no closed form bond pricing formula is known (i.3: Notation for Zero Coupon Bond Prices directly on the underlying factors r(ts) instead of the marketed tradables. we need to compute Bt .2.n so that B(t. with those preliminaries out of the way. and Br(tsi )r(tsj ) using . In this setup. B(tT ) are tradables.14) To simplify our thinking.7.
taking the partial derivative with respect to T gives T µ(tT ) − σ(tT ) σ(ts)ds = σ(tT )λ1 t (7. T t t T σ(ts)σ(tr)drds dt t Now.22) which makes the relationship a little more explicit.19) Plugging these into Ito’s lemma gives n n n dB(tT ) = exp − r(tsk )∆sk k=1 n n r(tt) − i=1 µ(tsi ) exp − n r(tsk )∆sk k=1 ∆si 1 + 2 − Taking continuous limits yields i=1 j=1 n i=1 σ(tsi )σ(tsj ) exp − n r(tsk )∆sk k=1 ∆si ∆sj dt σ(tsi ) exp − r(tsk )∆sk k=1 ∆si dz. Again. the point of this is that now we can calibrate from forward rate market date r(ts) rather than having to go through bond prices B(tT ).20) T t T t t µ(ts)ds − σ(ts)σ(tr)drds = t σ(ts)ds λ1 . . we have ”pulled back” the Price APT relationship onto the underlying factors. T dB(tT ) = B(tT )r(tt) − B(tT ) − B(tT ) t T 1 µ(ts)ds + B(tT ) 2 σ(ts)ds dz. In fact.17) Br(tsi ) ≈ − exp − Br(tsi )r(tsj ) ≈ exp − r(tsk )∆sk k=1 ∆si = − exp − ∆si ∆sj = exp − r(ts)ds ∆si t T (7. (7.16) T Bt ≈ exp − r(tsk )∆sk k=1 n r(tt) = exp − r(ts)ds r(tt) t T (7.68CHAPTER 7. to remind you.21) The upshot of this calculation is that the mean and the volatilities of the forward rates r(ts) must be related through this equation. t (7.14): n B(tT ) = exp − This gives n r(tsk )∆sk k=1 . APPLICATION OF THE FACTOR FORM:INTEREST RATE AND CREDIT DERIVATIVES the discretization of (7. (7. Thus.18) n r(tsk )∆sk k=1 r(ts)ds ∆si ∆sj . applying the Price APT equation leads to T r(tt) − or µ(ts)ds + t 1 2 1 2 T t t T T σ(ts)σ(tr)drds = r(tt) − T T σ(ts)ds λ1 t (7.
. T2 ) = a1 R(tT1 . .. Ti )) (7..25) B(tT1 ) (1 + τ R(tT1 . we will model the forward rates as stochastic diﬀerential equations. T2 )) (7. and thus Ito’s lemma (to create the tradables table) combined with the Price APT would lead to an inﬁnite dimensional pde.. Ti+1 ) and Bi = B(tTi ) so that dRi = ai Ri dt + bi Ri dzi Then prices of tradables satisfy Bi = Bi−1 1 + τ Ri−1 (7. Let R(tT1 . T2 )dt + b1 R(tT1 . This is one case in which risk neutral pricing (Chapter 9) can help quite a bit. . INTEREST RATE DERIVATIVES 69 This is just the calibration phase. it should be clear that Bi depends on Ri−1 and hence the factor zi−1 .. using the factor approach and pdes is not easy.7. this is also a model that is easier to price with using the risk neutral approach. let Ti+1 = Ti + τ where τ is a ﬁxed amount of time.27) (7. Thus. the idea is that this can make calibration easier. Again.4 The LIBOR Market Model This model is similar to the HJM model except that it uses discrete forward rates instead of instantaneous forward rates. For simplicity. T2 ) be the forward interest rate between time T1 and T2 as seen at time t. (7. Now since Bi−1 depends on Ri−2 and Bi−2 .. but also Bi−1 . we will be able to use an explicit relationship between the marketed tradables of bonds and the underlying variables of discrete forward rates to pull the Price APT relationship onto the underlying variables. For simplicity.2. we use an interest rate convention so that the price of a zero coupon bond is 1 (7. let us write generically that i−1 dBi = µi Bi dt + Bi j=1 σij dzj . let’s write Ri = R(tTi . then Bi ultimately depends on the factors zi−1 . To price some derivative we would not have an explicit relationship between the derivative and the underlying variables r(ts). n. Furthermore. Due to its similarity with HJM. For further reading on this approach see [3]. Once again. zi−2 .23) B(tT1 ) = (1 + τ R(tT1 )) and for bond prices and forward rates we have B(tT2 ) = and more generally B(tTi ) = B(tTi−1 ) (1 + τ R(tTi−1 ..28) (7.24) etc for i = 1. z1 .30) . 7. the factor approach can provide us with an important perspective on this model.26) Now. T2 )dz2 For notational simplicity. zi−3 .29) The relationship above leads to a recursive dependence Bi = (1 + τ Ri )Bi+1 . dR(tT1 .2. The point is that for actual pricing of a new derivative. etc. Now. we use a geometric Brownian motion model. (7. Nevertheless.
(1 + τ Ri ) (7.i as σi+1.29) is zero.38) we obtain σi+1.39) (7.29) and (7.40) 1 + τ Ri +Bi σi+1.41) .37) Once we have established these relationships. APPLICATION OF THE FACTOR FORM:INTEREST RATE AND CREDIT DERIVATIVES Using Ito’s lemma gives dBi = = τ dRi Bi+1 + (1 + τ Ri )dBi+1 + τ dRi dBi+1 +τ bi Ri Bi+1 i (7. First. Thus. equating the coeﬃcients of zj for j < i in (7.j ρij + (1 + τ Ri )(µi+1 Bi+1 ) dt i j=1 σi+1. This allows us to solve for σi+1.j .34) Futhermore.36) and (7. Returning to (7. we note that by comparing this to (7.31) i j=1 τ Bi+1 (ai Ri dt + bi Ri dzi ) + (1 + τ Ri ) µi+1 Bi+1 dt + Bi+1 j=1 i j=1 = σi+1.j = σi.j dzj (7.j dzj . Since the coeﬃcient of zi in (7.j = − τ bj R j . noting that Bi = (1 + τ Ri )Bi+1 gives σi+1. by combining (7.i = 0.34) gives (1 + τ Ri )Bi+1 σi+1. From this we can get two things.34).36) (7.70CHAPTER 7.38) (7.33) τ Bi+1 (ai Ri ) + τ bi Ri Bi+1 This is the expression for our tradables. we can move to constructing the tradables table and applying the Price APT.j = Bi σi.35) +τ Bi+1 bi Ri dzi + (1 + τ Ri ) Bi+1 σi+1.j dzj (7.32) (7.j dzj dt (7. (7.j ρij σi+1. i > j. the tradables look like i dBi = τ Bi+1 (ai Ri ) + τ bi Ri Bi+1 j=1 +τ Bi+1 bi Ri dzi + (1 + τ Ri ) Bi+1 = Bi µi+1 + i−1 j=1 σi+1.j . Now.29) we should have that τ Bi+1 bi Ri + (1 + τ Ri )Bi+1 σi+1.j ρij + (1 + τ Ri )(µi+1 Bi+1 ) dt i j=1 τ a i R i + τ bi R i i j=1 σi+1. (1 + τ Rj ) (7.i = − τ bi R i .j ρij dt σi+1.
and from (7. we would like to eliminate µi and σi. all the instantaneous forward rates were driven by the same factor dz.47) (7.j ).43) These equations mix parameters from the underlying variables (ai and bi ). the discrete analog to the continuous equation used in HJM T B(tT ) = exp − r(ts)ds t (7.j terms.44) λj σi. of course. Thus. µi = µi+1 + τ a i R i + τ bi R i i j=1 σi+1. To do this.42) into (7.46) which is the calibration relationship pulled onto the underlying variables of the discrete forward rates Ri . we can substitute in using (7.j ρij (7. INTEREST RATE DERIVATIVES where (7. The Price APT then tells us that µi = r0 + j=1 71 i−1 λj σi+1.7.j + λj σi+1. Thus. in fact. there are a couple of diﬀerence that are helpful to point out. and the marketed tradables (µi and σi.39) was used in the last equality.j (7. we only have n − 1 factors zi .48) This is.j ρij 1 + τ Ri (7. (Note that with n tradables as bond prices.41).j terms. there are only n − 1 market prices of risk. each discrete forward rate is driven by a distinct factor dzi . However. The ﬁrst is that we derived that HJM as a single factor model (a multifactor model is easy to do as well). In the Libor Market Model.42) where r0 is the short rate process.2.43). The next thing to note is that in the Libor Market Model. and that is why the above market price of risk is λi−1 and not λi . That is. giving i i−2 i−1 τ ai Ri + τ bi Ri j=1 σi+1. highly related to the HJM model.j = 1 + τ Ri j=1 j=1 This still contains σi. we used a recursive relationship to relate the underlying variables to the marketed tradables Bi = (1 + τ Ri ) which can be expanded to give Bi = (1 + τ Ri ) Bi−1 1 = (1 + τ Ri−1 ) j=0 (1 + τ Ri−1 ) i−1 Bi−1 (1 + τ Ri−1 ) (7.) Relations to HJM The Libor Market Model is.49) . let’s substitute (7.39) to get rid of those terms and have τ bj R i i−2 i−1 τ ai Ri − τ bi Ri j=1 (1+τ Rjj ) ρij τ bj R j τ bj R j λj = − + (7. We want conditions that don’t involve anything related to the marketed tradables. But.45) λj − (1 + τ Rj ) (1 + τ Rj ) 1 + τ Ri j=1 j=1 −λi−1 τ bi−1 Ri−1 (1 + τ Ri−1 ) = τ a i R i − τ bi R i i j=1 τ bj Rj (1+τ Rj ) or ρij 1 + τ Ri (7.
it is important to understand that these models all come from just a simple application of the factor approach. Since a bankruptcy is a sudden event. tT ). 2 (7.1 Defaultable Bonds In deriving equations for defaultable bonds. and the second is a default factor. The second equation gives r0 B(N ) + bBr (N )λ1 + (B(N + 1) − B(N ))λ2 = LB(N ) which can be rewritten as 1 Bt (N ) + aBr (N ) + b2 Brr + α(B(N ) − B(N + 1)) = r0 B(N ) + λ1 bBr (N ) + λ2 (B(N + 1) − B(N )). APPLICATION OF THE FACTOR FORM:INTEREST RATE AND CREDIT DERIVATIVES Thus. the two models (HJM and Libor Market Model) really are brothers.. This is typically not done because the risk neutral framework turns out to be better suited for these models. a defaultable bond of maturity T is a function or r0 .3 Credit Derivatives Credit derivatives usually refer to derivatives that pay oﬀ depending on whether a bankruptcy has occured. N .58) (7. However. Finally. despite their looking quite complicated.53) (7.3.54) (7. You should note the similarity between these models and Merton’s jump diﬀusion model in Chapter 6. so we have 1 Bt (0) + (a − λ1 b)Br (0) + b2 Brr (0) = (r0 + α − λ2 )B(0) + (λ2 − α)B(1). N.. Section 6. and t: B(r0 . we assume that we start with N = 0 and default is N = 1.57) 2 But. which includes the money market account: B0 B(N ) d B0 B(N ) = r0 B0 LB(N ) 0 bBr (N ) dt + 0 bBr (N ) 0 (B(N + 1) − B(N )) λ1 λ2 dz dπ − αdt (7. let’s move on to credit. In fact. if you understand Merton’s model then there isn’t too much new here. we solve the Price APT equations B0 B(N ) λ0 + 0 (B(N + 1) − B(N )) = r0 B0 LB(N ) (7.72CHAPTER 7. In what follows.50) (7. credit model rely heavily on Poisson processes.6. 7. the most important thing to note about the HJM and Libor Market Model is that they can both be viewed and understood from the factor point of view.. The ﬁrst is the short rate. after suppressing all arguments except N : dB(N ) = LB(N )dt + bBr (N )dz + (B(N + 1) − B(N ))(dπ(α) − αdt) where I have compensated the Poisson process and 1 LB(N ) = (Bt (N ) + aBr (N ) + b2 Brr (N ) + α(B(N ) − B(N + 1))) 2 Now let’s write down the tradables table. I hope you have found this exercise valuable. We model these as dr0 dN = = adt + bdz dπ(α) (7.1. we need two factors.51) In this case. and let N = 1 be the default state. 7. I will present the simplest models of a Defaultable Bond when the intensity of default is a constant. By Ito’s lemma we can write. and then we will generalize that to allow the intesity to be random.. (7. Since N (t) is a pure Poisson process.55) From the ﬁrst equation we have λ0 = r0 the short rate.56) .52) Finally. we can start it at N = 0 which is the no default state.
it is the same as B(0). If we assume no recovery we obtain 1 Bt (0) + (a − λ1 b)Br (0) + b2 Brr (0) = (r0 + α − λ2 )B(0) (7. it is the same as B.62) This time we will allow the default intensity to follow a stochastic diﬀerential equation. CREDIT DERIVATIVES Diﬀerent Types of Recovery 73 But. or fractional recovery B(1) = xB(0).68) But. the default rate and its market price of risk just adjust the short rate of interest. B(1) means that we are in default. N. or fractional recovery B(1) = xB(0). B(1) means that we are in default.61) (7. we start with N = 0 which is the no default state. tT ). since B(0) is the no default state. (7.60) (7.66) λ0 + LB(N ) bBr (N ) gBα (N ) (B(N + 1) − B(N )) B(N ) λ3 From the ﬁrst equation we have λ0 = r0 the short rate. Otherwise. The second equation gives r0 B(N ) + bBr (N )λ1 + gBα (N )λ2 + (B(N + 1) − B(N ))λ3 = LB(N ) which can be rewritten as 1 1 Bt (N ) + aBr (N ) + f Bα (N ) + 2 b2 Brr (N ) + 2 g 2 Bαα (N ) + ρgbBrα (N ) + α(B(N ) − B(N + 1)) = r0 B(N ) + λ1 bBr (N ) + λ2 gBα (N ) + λ3 (B(N + 1) − B(N )). so we have 1 1 Bt (0)+(a−λ1 b)Br (0)+(f −λ2 g)Bα (0)+ b2 Brr (0)+ g 2 Bαα (0)+ρgbBrα (0) = (r0 +α−λ3 )B(0)+(λ3 −α)B(1).67) But.64) 2 2 Now let’s write down the tradables table: dz1 0 0 0 r0 B0 B0 B0 dz2 dt + = d bBr (N ) gBα (N ) (B(N + 1) − B(N )) LB(N ) B(N ) B(N ) dπ − αdt (7.2 Defaultable Bonds with Random Intensity of Default dr0 dN dα = = = adt + bdz1 dπ(α) f dt + gdz2 (7. we assume that we start with N = 0 and default is N = 1. a defaultable bond of maturity T is a function of r0 . In this case. or even ﬁxed recovery B(1) = X. .59) 2 This looks exactly like the standard model for a bond. or even ﬁxed recovery B(1) = X. since B(0) is the no default state. This means that if there is a closed form solution for bond prices under a single factor short rate model. Hence. 2 2 (7. Again. α. That is. the price is the same.7. and let N = 1 be the default state.65) Finally.3. we can assume diﬀerent recoveries in default such as no recovery B(1) = 0.63) where again I have compensated the Poisson process and this time 1 1 LB(N ) = (Bt (N )+aBr (N )+f Bα (N )+ b2 Brr (N )+ g 2 Bαα (N )+ρbgBrα (N )+α(B(N )−B(N +1))) (7. we have where E[dz1 dz2 ] = ρdt. Furthermore. N .3. except that the short rate is increased by α − λ2 . Furthermore. By Ito’s lemma we can write: dB(N ) = LB(N )dt + bBr (N )dz1 + gBα (N )dz2 + (B(N + 1) − B(N ))(dπ(α) − αdt) (7. we can assume diﬀerent recoveries in default such as no recovery B(1) = 0. then there will also be a closed form solution for defaultable bonds! 7. we solve the Price APT equations λ1 r0 B0 0 0 0 B0 λ2 = (7. That is. α and t: B(r0 . Hence.
and the term structure is driven by a single factor short rate model dr0 = a(b − r0 )dt + f dz2 (7. If no arbitrage exists in the market.74CHAPTER 7.4 Problems Problem 7. where the short rate process is being driven by zB . let zero coupon bond prices with face value $1 and maturity T be denoted by B(tT ) and satisfy B(tT ) = exp(−r(T − t)) (7. (7.70) a(b − r0 )dt + c r0 dz2 where dz1 and dz2 are correlated E[dz1 dz2 ] = ρdt.4. it might be convenient to write bond dynamics generically as dB = µB Bdt + σB BdzB .72) where z2 (t) is a standard Brownian motion and E[dz1 dz2 ] = ρdt. respectively. Assume that a money market account exists that satisﬁes dB0 = r0 B0 dt. Additionally. That is. However. assume that interest rates are random.) Problem 7. Additionally. That is: dS dr0 = = µSdt + σSdz1 √ (7.5 in Chapter 6. Hint 2: Note that a forward price is not tradable.4 (Flat Term Structure) Consider a term structure model where the term structure of interest rates is ﬂat. and the short rate follows CoxIngersollRoss dynamics. r0 (t). Call this price function c(S(t).1. Hint 3: My recommendation is to let your underlying variables be the future and forward price. Hint 1: Recall that futures prices are marked to market (See Section 6.4.4. whereas forward prices are not marked to market. (Your answer may contain a market price of risk.1 Derive a pde for the price of a European call option on a nondividend paying stock when interest rates are random. Assume the following df dr0 = = µf dt + σf dz adt + bdzB where f is either the forward price or futures price and E[dzdzB ] = ρdt.74) .2 (Options on Forwards and Futures) (a) Assume interest rates are stochastic and driven by a single factor short rate model.3 (PDE for a derivative on a dividend paying stock under stochastic interest rates) Consider a market with a stock price process S(t) satisfying dS = µSdt + σSdz1 (7. Assume that this stock also pays a continuous dividend at a rate of q. the dividend amount is qSdt. but moves up and down randomly.73) Consider a third asset whose price at time t can be written as a twice diﬀerentiable function of S(t). Derive the pde for a derivative on a futures contract and on a forward contract. t). Thus. APPLICATION OF THE FACTOR FORM:INTEREST RATE AND CREDIT DERIVATIVES 7.69) (7.71) where z1 (t) is a standard Brownian motion. Problem 7. r0 (t) and t. r0 (t). and the short rate. t) must satisfy. derive the pde that c(S(t). over time dt. (b) What if interest rates are not stochastic? Do the pdes for options on forwards and futures look the same? Problem 7.4. you may assume that the asset that the forward contract is on is tradable on a spot market and there are standard relationships between the spot price of an asset and its forward price. Your answer may contain a market price of risk.
t) and b(r. What does this imply about the allowable dynamics for this term structure? .75) (a) Write down models for the instantaneous return of the money market account and for a generic zero coupon bond with maturity T (i. t)dt + b(r. for B(tT )). t)dz.4. and derive restrictions on a(r. PROBLEMS where r is modeled as a stochastic diﬀerential equation dr = a(r. 75 (7.7. t).e. Note that this model applies for all maturities T . (b) Derive the absence of arbitrage condition in this case.
APPLICATION OF THE FACTOR FORM:INTEREST RATE AND CREDIT DERIVATIVES .76CHAPTER 7.
we have entirely hedged out the risk. our portfolio is not sensitive to small variations in the underlying variables. we need to choose y such ˆ that Bu + y T B = 0 ˆ (8.3) To eliminate the risk.4) That is really all there is to hedging! 77 .1) and we would like to hedge this position. we consider a derivative to be a function of underlying variables. we will work with prices. Furthermore. Once again. This change in value is given by dV0 = dVu + y T dVh = (Au + y T Ah )dt + (Bu + y T Bh )dz ˆ ˆ ˆ (8. we can either work with returns or prices.2 Hedging from a Factor Perspective In abstract terms.Chapter 8 Hedging 8. 8. hedging is based upon eliminating factor risk. By hedging. we will assume that we hold one unit of this asset. we recognize that risk comes from the factors. In this case. hedging involves eliminating factor risk. In the ﬁrst. and assume that these assets have a value proﬁt/loss change vector of (8. We are then hedged against variations in these underlying variables if the derivative of the value of a portfolio with respect to the underlying variables is zero. Let us consider having various tradable assets to hedge with.1 Introduction We present two main approaches to hedging. In this chapter. In the second approach. Let dVu be the value change of the asset that we would like to hedge.2) dVh = Ah dt + Bh dz Now. we need to eliminate the coeﬃcients of the factors. Thus. Thus a factor model for our unhedged portfolio proﬁt/loss over the next time instance is dVu = Au dt + Bu dz (8. Thus. we actually just mean that we will form a portfolio (that may be dynamically traded over time) so that the factor model of the portfolio eliminates all factor risk. Thus. let y be the holdings of the assets used to hedge with and let dV0 be the value change of the hedged ˆ portfolio.
using the notation above. the asset to be hedged satisﬁes dVu = Au dt + Bu dz and the tradables used to hedge this asset are dVh = Ah dt + Bh dz We can write this in a tradables table form as 0 r0 B0 y0 B0 B0 y Ph ˆ d Vh = Ah dt + Bh dz.9) Additionally.7) y ˆ 1 (8.2 The Relationship Between Hedging and Arbitrage Hedging is actually quite closely related to arbitrage.10) (8. Hence. but add our holdings variable y Holdings y Prices P Changes d V = Factor Model A dt + B dz (8. HEDGING 8. and we hold a single ˆ unit of the asset to be hedged. to hedge. we select y so that y T B = 0 (recalling of course that y is given by (8. Let’s see how we can alter our hedged portfolio to satisfy this no cost condition. our holdings vector is actually y= and the value change in the hedged portfolio is dV0 = y T dV = y T Adt + y T Bdz Thus. Bu Au Vu 1 Pu (8.8) Note that y T B = 0 is one of the conditions of the arbitrage price implication. First.1 Description Using a Tradables Table I like to use a tradables table description. ˆ (8.2. ˆ Creating a No Cost Hedge Let’s break out the tradables table in a little more detail. y is the holdings of the assets with value changes of dVh .6) 8.6)). The other condition that needs to be satisﬁed to set up the arbitrage is that the total cost or price must be zero. Recall the arbitrage price implication y T P = 0 No cost y T B = 0 No risk ⇒ y T A = 0 No return (8. a hedged portfolio will automatically satisfy this condition. from the line following equation (8. y T P = 0.11) (8. let’s see how hedging and arbitrage are related.2. Thus. lets assume that there is a tradable with no direct factor risk dB0 = r0 B0 dt (8. Consider a tradables table description.12) where y0 is the holding of B0 .5) Let us assume that the last tradable is the derivative that we would like to hedge and we assume that we hold one unit of that derivative. In this section. In fact. so let’s rework the hedging analysis.78 CHAPTER 8. a standard derivation of the BlackScholes equation begins from a hedging argument.7). .
the tradable without direct factor risk is the moneymarket account and we are saying that the hedged portfolio must earn the short rate. we use y to hedge away all the factor risk in our portfolio.17) . we have a hedged portfolio if we hold −cS shares of the stock for every ˆ ˆ option.14) This proﬁt/loss condition is actually the BlackScholes equation! Thus. We hold an asset that we would like to hedge.8. this ﬁrst asset is the risk free asset. we assume the holding of the stock is y . any completely hedged portfolio can be altered to satisfy the price APT implication. this portfolio is hedged if y T B = 0 which in this case is y σS + σScS = 0 ˆ (8. ˆ r0 B0 0 B0 0 B0 dt + σS dz. we make the following important observation. We have assumed that we have one option c and that we are hedging with the stock S. The holding in the ﬁrst asset y0 has no eﬀect on the hedge! Since it has no factor risk. we can choose its holding y0 arbitrarily without aﬀecting the hedge. and then select y0 so that the total cost is zero y 0 B 0 + y T P h + Pu = 0 ˆ (8.16) Solving for y gives y = −cS . This means y0 rB0 + y T Ah + Au = 0 ˆ (8. That is really all that we are doing with the above arguments! 8.3 Hedging Examples Let’s see how hedging is done in some examples. This recognition allows us to select y0 so that the hedge has zero cost. First. Pricing The above portfolio is hedged. That is. a hedging strategy can be used to derive the BlackScholes equation via the above relationships. Thus.13) Thus. y S µS ˆ (8. Thus. The quantity cS is typically called the delta of the option. Section 6. Since it has no factor risk. the hedged portfolio has no direct ˆ factor risk.15) d S = 1 2 2 σScS c c 1 ct + µScS + 2 σ S cSS Now. including it in a portfolio has no eﬀect on the factors! Hence. HEDGING FROM A FACTOR PERSPECTIVE 79 Now.2. we must have zero proﬁt/loss in this hedged portfolio. for no arbitrage to exist it must be the same as the ﬁrst tradable that also has no factor risk. Thus. In an interest rate derivative setting. and all we are saying is the the hedged portfolio must earn the risk free rate.1 we have the following tradables table with the ﬁrst column being the holdings. Thus.1. Hedging in BlackScholes In the BlackScholes setup of Chapter 6.2. In the BlackScholes setting. but we can convert it to satisfy the arbitrage implication by choosing y0 so that the total cost is zero y0 B 0 − c S S + c = 0 (8. for no arbitrage to exist. let y be the holdings of ˆ a hedged portfolio. A Simple Explanation Here is the simple explanation of what we have done above.
we must have no proﬁt/loss which means that the drift term of our portfolio must be zero 1 y0 r0 B0 − cs µS + (ct + µScS + σ 2 S 2 cSS ) = 0 2 Substituting in (8. For no arbitrage to be present. this portfolio is hedged if y T B = 0 which in this case is 2 y bBr (T ) + bBr (T0 ) = 0 ˆ 1 B 2 (T ) r (8. the intuition is important as well! Hedging in Bond Pricing Consider the single factor short rate models of Chapter 7.25) which can be rewritten as 1 2 2 1 2 1 Bt (T0 ) + aBr (T0 ) + 2 b2 Brr (T0 ) − r0 B 2 (T0 ) B 1 (T ) + aBr (T ) + 1 b2 Brr (T ) − r0 B 1 (T ) 2 = t 2 (T ) 1 (T ) Br 0 Br (8.24) 2 Br (T0 ) 1 1 Br (T ) B (T ) − B 2 (T0 ) from (8. let’s write down the tradable table with holdings: r0 B0 0 0 B0 B0 1 1 1 1 y B 1 (T ) ˆ d B 1 (T ) = (Bt (T ) + aBr (T ) + 1 b2 Brr (T )) dt + bBr (T ) dz 2 1 2 2 2 2 2 2 2 bBr (T0 ) 1 B (T0 ) B (T0 ) (Bt (T0 ) + aBr (T0 ) + 2 b Brr (T0 )) (8.18) for y0 B0 = cS S − c from (8.17) gives 1 r0 (cS S − c) − cs µS + (ct + µScS + σ 2 S 2 cSS ) = 0 2 or 1 ct + r0 ScS + σ 2 S 2 cSS = r0 c 2 (8.24) gives 2 B 2 (T0 ) 1 1 1 Br (T0 ) 1 1 1 2 2 2 B (T ) − B 2 (T0 ) − r1 (B (T )+aBr (T )+ b2 Brr (T ))+(Bt (T0 )+aBr (T0 )+ b2 Brr (T0 )) = 0 1 Br (T ) Br (T ) t 2 2 (8. Again. I presented the more structured derivation because I believe that it is always important to have structure to fall back on when your intuition fails. we must have no proﬁt/loss y0 r0 B0 − Using y0 B0 = r0 2 1 1 Br (T0 ) 1 1 1 2 2 2 (Bt (T ) + aBr (T ) + b2 Brr (T )) + (Bt (T0 ) + aBr (T0 ) + b2 Brr (T0 )) = 0 1 (T ) Br 2 2 (8.26) .19) (8. However.23) and note that for no arbitrage to exist. ˆ ˆ Pricing Using the same procedure as in the BlackScholes case.2. Once again. we choose y0 so that the total cost is zero y0 B 0 − 2 Br (T0 ) 1 B (T ) + B 2 (T0 ) = 0 1 Br (T ) (8.18) which is the BlackScholes equation. Following this model.21) Now.20) (8.22) Solving for y gives y = − Br1 (T0) . HEDGING This is now a potential arbitrage portfolio. the simpler explanation of this derivation is that the hedged portfolio is risk free and hence must earn the risk free rate.80 CHAPTER 8. Note that hedged portfolios of bonds are sometimes called ”immunized”. Section 7.23) and substituting this into the ﬁrst term in (8.1. we can convert this hedged portfolio into a potential arbitrage portfolio and derive a pricing equation.
2.4 Hedging under Incompleteness In some cases it is impossible to eliminate all of the factor risk (this is true in incomplete markets). ˆ Since this hedged portfolio has no factor risk. Thus. Since the bond B is not driven by any of the risky factors. I do not need to consider it in the hedge. To see how this is done. But.6. we will have choices in terms of how we want to best try to reduce risk from the factors. we must have that 1 1 1 Bt (T ) + aBr (T ) + 1 b2 Brr (T ) − r0 B 1 (T ) 2 = λ′ 1 Br (T ) (8.35) (8. So.1.28) so that (8.33) I have added the vector of holdings on the left side of the tradables table. This is where the pricing pde comes from! 8.29) Upon rearranging we have 1 1 1 1 Bt (T ) + (a − λb)Br (T ) + b2 Brr (T ) = r0 B 1 (T ) 2 (8. In this case. the hedged portfolio takes the form dVh = = = ydS + dc y((µ + αE[Y − 1])Sdt + σSdz + Sdψ1 ) + Lcdt + ScS dz + dψ2 [y(µ + αE[Y − 1])S + Lc]dt + [yσS + ScS ]dz + ySdψ1 + dψ2 (8. To make this look like our previously derived bond pricing equation in Chapter 7 Section 7.32) − − (c(Y S ) − c(S ))dπ − αE[(c(Y S ) − c(S ))]dt (8. 0 0 0 0 r0 B B B dz y S S 0 dψ1 (8. we are in an incomplete market.31) d S = (µ + αE[Y − 1])S dt + σS 1 σScS 0 1 Lc c c dψ2 where L is the diﬀerential operator corresponding to the drift term in Ito’s lemma.2. let’s consider an example. we had the following tradables table. and that I will hedge with the stock S.30) which is the bond pricing equation where λ is the market price of risk.8. we may still attempt to reduce the eﬀect of the factors on our portfolio.27) for some constant λ′ .27) can be written as 1 1 1 1 Bt (T ) + aBr (T ) + 2 b2 Brr (T ) − r0 B 1 (T ) = λb 1 (T ) Br (8.36) One can clearly see that it is not possible to eliminate all the factor risk by choosing y. let’s let λ′ = λb (8. it must earn the same return as the tradable with no factor risk. which is the money market account. The intuitive explanation of this derivation is that ﬁrst we hedge out the factor risk by the choice of y .34) (8. .1. In the jump diﬀusion model of Chapter 6 Section 6. I have assumed that I hold 1 unit of the derivative c. HEDGING FROM A FACTOR PERSPECTIVE 81 Now. since the left hand side only depends on B 2 and the right hand side only depends on B 1 and B 1 was a bond of arbitrary maturity T . and dψ1 dψ2 = = (Y − 1)dπ(α) − αE[Y − 1]dt − − (8.2.
how about using the BlackScholes formula for c(S.41) (8.45) 0 r0 B B √0 dz1 0 d S = µS dt + √ vS .37) where cS is the partial derivative of Merton’s pricing formula. Thus. B) + y 2 V ar(B) λ2 dt2 + [σScS − yσS] dt (8. we are assuming that volatility is stochastic and thus there is no reason to believe that c would follow the BlackScholes equation.38) where A = B = Minimizing over y gives y= and sending dt → 0 leads to y= c(Y S(t− ). this is an incomplete market. Thus. let’s consider a diﬀerent alternative. what is cS ? Well. Consider the following example.46) dz2 Lc c vScS bcv √ 1 with Lc = ct + µScS + acv + 2 vS 2 cSS + 1 b2 cSS + ρ vbScSv . what Merton [11] did in his pricing formula is equivalent to eliminating the Brownian risk dz. But. t) − c(S(t− ). (8. . we choose y vS + vScS = 0 or y = −cS . we would choose y = −cS (8.40) λE[AB] + λ2 dtCov(A.47) 2 2 Clearly. This is where the inconsistency in hedging often arises. t− ) B(Y − 1)S. λE[B 2 ] + σ 2 S 2 (8. In our setting above.43) (8. HEDGING However. t) and computing cS from that? But. This can be seen by noting that y is often a function of cS for instance. That is V ar(dVh ) 2 = E[A2 ] − 2yE[AB] + y 2 E[B 2 ] λdt + V ar(A) − 2yCov(A. In fact. our hedge is to hold −cS shares of the stock. we can select y to eliminate some of the risk over the next dt. with y. Thus. let’s consider a hedged portfolio √ √ √ 1 1 dVh = y(µSdt + vSdz1 ) + (ct + µScS + acv + vS 2 cSS + b2 cvv + ρ vbScSv )dt + vScS dz1 + bcv dz2 (8. so we cannot perfectly hedge away the risk. In that case. Let’s consider a stochastic volatility model: dB dS dv with the tradables table B S c √ = µSdt + vSdz1 = adt + bdz2 = r0 Bdt (8. we cannot eliminate all the risk. Let’s consider a simple hedge where we only eliminate the √ √ dz1 risk. (8. this is not consistent because to derive the BlackScholes formula we assume that volatility is not stochastic.5 A Question of Consistency Note that the hedges we have derived above often involve knowledge of a pricing formula for the option c.82 CHAPTER 8. Nevertheless.2. B) + σ 2 S 2 cS λE[B 2 ] + λ2 dtV ar(B) + σ 2 S 2 λE[AB] + σ 2 S 2 cS . 2 Now. But.44) (8.39) (8. This presupposition of knowledge can bring up a question of consistency between the hedge and the assumption of the formula for c. Let’s choose y to eliminate the variance of the portfolio over the next dt.42) 8. our hedging analysis has presupposed that the derivative c follows a speciﬁed formula that we know.
Hence. pricing and hedging are intimately related. and hedge against those variables by eliminating the sensitivity to those underlying variables. If there is no hedging justiﬁcation for a pricing formula. Furthermore. That means that small changes in S will cause no change in the value of the hedged portfolio Ph .3. many times that is not done (for many reasons. if there is stochastic volatility. t) + y S ˆ (8. Here is the correct part. in the stochastic volatility case I might just want to hedge out the stock risk S via dz1 and leave my portfolio exposed to volatility risk. Another way of saying this is that we want the derivative of Ph with respect to S to be zero. including computational. we will be hedged. we consider the asset that we are trying to hedge to be a function of underlying variables. people like to rely on the BlackScholes formula even when its assumptions are not valid. in the BlackScholes setup. Thus. Now if we would like to hedge out the risk in our option. In this approach. in this case. Not every hedging strategy should be turned into a pricing formula. That is. Sometimes. ease of use. since all the risk comes from the ˆ stock variable. Here is the wrong part. Now. Why? Because hedging is the mechanism that enforces pricing. But. you ask. This pricing formula should be the best pricing formula that reﬂects the actual pricing and movement of the derivative c in the actual market. if a pricing formula is tightly related to a hedging strategy.3. HEDGING FROM AN UNDERLYING VARIABLE SENSITIVITY PERSPECTIVE What formula for c? 83 So. In the simplest terms. For example. then what formula should I use for c. then there is no good reason to expect that pricing formula to hold. we will be hedged if our portfolio has no sensitivity to changes in the stock. For example. Let’s see why. in places below we will fall into this trap as well. so keep a sharp eye out for where it may be occurring. then deviations from that pricing strategy can be exploited by turning the hedging strategy into an arbitrage (or almost arbitrage) opportunity as shown in the previous section. hedging strategies often rely on a pricing formula for the derivative c. The answer is that you should use the formula that best corresponds to the actual price and movements of c in the market. there is a simpler and faster way to derive hedges that doesn’t involve an explicit use of the factors. every pricing formula should be intimately connected to a hedging strategy. However. Thus.48) and want to choose y so that this portfolio is hedged.8. we note that all the risk in the price of an option comes from its dependence on the stock variable. As we saw above. we assume that an option is a function of the underlying stock S and time t. The pricing formula c that I should be using is the one that best reﬂects the actual movement of c in the market. you want to hedge for a reason unrelated to pricing. If I were interested in using some trading strategy that depended on volatility v but not the stock price S. t).3 Hedging from an Underlying Variable Sensitivity Perspective In some cases. the reverse is not true. Any pricing formula should be related to a reasonable hedging scheme via our analysis above. Due to standard ﬁnance conventions. This idea is both right and wrong. In practice. sensitivity is measured by the derivative of the hedged portfolio with respect to the underlying variable of concern. However. some might argue that you should use a pricing formula that comes from the hedge that you are doing. often hedging analysis is inconsistent. 8. then this would be perfectly reasonable. we form a hedged portfolio Ph = c(S. etc).1 BlackScholes Hedging For example. 8. the price of an option is a function of the variables S and t which we write as c(S. But it probably would not be reasonable to claim that I should be using a pricing formula related to this hedging strategy. Thus. . To summarize. then you should use a model that most accurately captures that stochastic volatility and how it is reﬂected in the movement of the derivative c.
the underlying variable is the stock price S and the tradables are the stock itself S and the option c. this approach only works if we know that the stock price changes will be small. Recall that the derivative is the change in a function for a small change in the variable.3 and an interest rate of r0 = 0. For simplicity. Hence.84 This condition can be written as CHAPTER 8.Plot of the Stock.3. tT0 ) + y B 1 (r0 . We wish to hedge this bond with another bond B 1 (r0 . call option.1 is a portfolio of the stock (left) and call option (middle) so that at the current value of the underlying variable (S = 10). when we say that a portfolio is hedged against stock price movements because its derivative with respect to the stock is zero. tT ) is hedged if its derivative with respect to r0 is zero. the risk in the price of a bond comes from the short rate r0 (t) which is our underlying variable. Stock Value 15 15 Option Value 15 Hedged Portfolio 10 10 Tradable: Hedged Portfolio 0 5 10 Underlying Variable: S 15 10 Tradable: S 0 Tradable: c 5 5 5 0 0 −5 −5 −5 −10 0 5 10 Underlying Variable: S 15 −10 −10 0 5 10 Underlying Variable: S 15 Figure 8. Thus. In this case. if the stock price is driven by a . This is the case for stock price movements driven by Brownian motion since Brownian motion is continuous. tT ) that is also a function of the short rate r0 (t) and time t. Thus.3.05. Middle . However. This hedge is known ˆ ˆ as a delta hedge and the quantity cS is commonly referred to as the delta of the option. To make this a potential arbitrage portfolio.Plot of the hedged portfolio assuming that the current price of the stock is $10. In that case. Figure 8. this means that the value of the portfolio change is approximately zero for small changes in the price of the stock. and delta hedged portfolio for an option with strike K = 10 and expiration T = 0. we could add a position in the bond in order to make the current price of the portfolio (at S = $10) be zero.3 on a stock with volatility σ = 0. But.Plot of the option. 8. a portfolio Ph = B 2 (r0 . The hedged portfolio on the right in Figure 8.49) ∂S Solving for y yields y = −cS which is the same answer that we arrived at using factors. ˆ ˆ r (8. the value of the portfolio change could be quite large if the stock change is large.3 Derivatives imply Small Changes Note that this underlying variable approach uses the derivative as a measure of sensitivity. we hold a bond B 2 (r0 . Right . In this setup. let’s assume the option is a European call option following BlackScholes.1 shows a plot of the stock.50) B 2 (T ) 8. the derivative of the portfolio value yS + c is zero (right plot). HEDGING ∂Ph = cS + y = 0 ˆ (8. tT0 ) that is a function of the short rate r0 (t) and time t. Thus ˆ ∂Ph 2 = Br (T0 ) + y Br (T ) = 0 ˆ 1 ∂r Solving for y yields y = − Br1 (T0) which is the same answer that we arrived at using factors. and the value of the tradable as a function of the underlying factor on the yaxis.1: Delta Hedge: Left . Consider a plot with the underlying variable on the xaxis. There is a simple graphical interpretation of this hedge.2 Hedging Bonds We can apply this same underlying variables approach to our hedging of bonds.
we are able to trade continuously. Thus. However. 8. 8. then we would expect it to have large jumps at times. By looking at this Taylor expansion. 2 (8.1. σ) = ct ∆t + cS ∆S + cr ∆r0 + cσ ∆σ + cSS (∆S)2 + . Of course.8. That is.51) I only included a single second order term (there are many second order terms) because is it the only named second order term. This leads to the socalled Greeks. From the BlackScholes formula.2 A DeltaGamma Hedge One can think of a delta hedge as eliminate the ﬁrst order term in the Taylor expansion. we use a ∆t instead of a dt and furthermore. recognizing that that is only an approximation. Thus. we see that it depends also on the risk free rate r0 and the volatility of the option σ. Each of these derivatives is given a Greek letter name as in Table 8. t.4 Higher Order Approximations Using the derivative to model the change in a portfolio due to the change in a variable is a linear approximation of portfolio value as a function of the underlying variable. theta delta rho vega gamma ct cS cr cσ cSS Table 8.. one can go further and ask if higher order terms can be eliminated as well. σ) as 1 ∆c(S. and ”vega” starts with ”v” and is the partial with respect to ”v”olatility. If you elminate the ﬁrst order term in . In our derivation of the BlackScholes formula. and note that the hedging strategy that we derived was not that same one that would result from this derivative approach. Similarly. This was the situation in the JumpDiﬀusion model above. HIGHER ORDER APPROXIMATIONS 85 Poisson process. An easy way to do this is to use the terms of a Taylor expansion. in general. with ”delta” and ”gamma” you are on your own. higher order terms will enter. presented next. Of course. This is actually more practical than the typical continuous time trading assumptions. t. However.4.4. r0 . ”theta” starts with ”t” and is the partial with respect to t. 8. To do this. a call option is not just a function of the price of the stock and time. Unfortunately. like I do. this Taylor expansion approach assumes that we are not trading continuously. one can also use a higher order approximation to the portfolio value. Note that in the typical dynamic hedging assumption.. Then we can ask how their changes might cause the price of an option to change assuming that the price follows the BlackScholes formula. Thus. we would just construct a multivariable Taylor series expansion of c(S. we can allow them to be underlying variables and change. we see that the various derivatives tell us the sensitivity of the price of an option to changes in those variables. ”rho” starts with ”r” and is with respect to r0 . r0 .1 The Greeks Now.4. only terms of order dt and lower matter. we assumed that r0 and σ were constant (not random factors). then you can remember some of the Greeks by noting that the ﬁrst letter of the Greek is the same as the partial derivative. in this case using the derivative is not a good approach since the stock price change would be large and the derivative would likely not be a good approximation to the change in the portfolio value.1: The Greeks If you have a bad memory.
2 (8.59) where I have only kept terms up to order ∆t in (8. For the analysis that we are looking at.52) and let’s Taylor expand this to obtain ∆P (S.61) and (8.62) is that (∆S)2 is σ 2 S 2 dt + higher order terms. Thus. 1 2 ct + r0 ScS + σi S 2 cSS = r0 c (8. this would create a portfolio that has the ﬁrst and second derivative equal to zero at the current value of the underlying variable.3 Determining what the error looks like We can use the Taylor expansion to see what the error looks like in a Delta hedge under BlackScholes assumptions. t) − cS ∆S 1 ct ∆t + cSS (∆S)2 dt + . For such a DeltaGamma hedge you need more than just the underlying stock. y2 = − (8.. In practice.4.61) (8.62) (8.60) (8. c satisﬁes the BlackScholes equation corresponding to a volatility value of σi ...54) (8.61) is also a very useful equation for intuituion. we have to eliminate the coeﬃcients of ∆S and (∆S)2 by choosing y1 and y2 .. Thus.53) (8.56) (8. t) = c(S. t) − cS S and let’s perform a Taylor expansion of this in S and t ∆P (S. Now let’s assume that in the market. and furthermore.57) cSS − cS . 8. Note that the only diﬀerence between (8. t) = = = ∆c(S. However. we can DeltaGamma hedge an option c. Let’s show how by using another call option c(2) and the underlying stock. Consider the hedged portfolio P (S.55) To DeltaGamma hedge.63) 2 I use the subscript i on the volatility σi to denote what is known as implied volatility. Implied volatility is the value of volatility that when plugged into the BlackScholes formula will make it equal the current . the hedged portfolio is P = c + y1 S + y2 c(2) (8. HEDGING ∆S and the second order term (∆S)2 than this is called a DeltaGamma hedge.86 CHAPTER 8. you need a tradable that depends on the second order term (∆S)2 .58) In the plots of Figure 8. 2 1 ct ∆t + cSS σ 2 S 2 dt + . In this case. t) + y1 ∆S + y2 ∆c(2) 1 1 (2) (2) (2) ct ∆t + cS ∆S + cSS (∆S)2 + y1 ∆S + y2 ct ∆t + cS ∆S + cSS (∆S)2 2 2 1 (2) 1 (2) (2) ct + y2 ct ∆t + cS + y1 + y2 cS ∆S + cSS + y2 cSS (∆S)2 2 2 (8. we must solve c S + y1 + y2 c S 1 (2) 1 cSS + y2 cSS 2 2 The solution is y1 = (2) c (2) S cSS (2) = = 0 0 cSS cSS (2) (8. So we will use it as well. the higher order terms don’t matter. t) = = = ∆c(S.1. (8. DeltaGamma hedges (and other hedges as well) are diﬃcult because transaction costs can make it expensive to trade too many assets.62).
.38). for this hedge where we are long the option and short delta of the stock. we recognize the fact that risk comes from the factors. Since we only set the derivative to zero. σ) denote the BlackScholes formula. time to expiration is T .5.61).67) Equation (8. T.64) Thus. tradables are viewed as functions of underlying variables.6 Problems Problem 8. if you underlying Taylor expansions. what volatility value are they pluggin in the BlackScholes formula? Thus.5 Summary Hedging can be approach from two diﬀerent points of view. (8. 8. In the second point of view. r0 . Usually we do this by setting the derivative of the hedged portfolio with respect to the underlying variable to zero at the current value of the underlying variable. t) = r0 P dt + cSS (∆S)2 − σi S 2 dt 2 or purely to order dt using (8. if we have more tradables to place in our portfolio. Equation (8. let cBS (S. Really.61) which gives 1 2 ∆P (S.63) to substitute for ct in (8. To be more concrete.66) (8. K. in hedging we try to eliminate the factor risk. T. Furthermore. we can use (8. and we lose money if it moves less.67) shows the same thing. SUMMARY 87 market price of an option. Furthermore. t) = r0 (c − cS S)dt + cSS (∆S)2 − σi S 2 dt 2 and ﬁnally noting that P = c − cS S gives 1 2 ∆P (S. then you are on your way to understanding the majority of hedging methods. t) = r0 P dt + cSS 2 2 σ 2 − σi S 2 dt. we can set higher order derivatives of our portfolio to zero as well and create better hedges. 8. this works as long as there are only small moves in the underlying variable between rehedging opportunities. (8. it answers the question: If the market is following BlackScholes.6. but in terms of the volatility of the stock σ rather than the stock move ∆S. K. In the ﬁrst point of view. 1 ∆P (S.8. strike price is K. Thus.1 Verify equation (8.63) assumes that the market is pricing the option using the BlackScholes formula with a volatility value of σi . In particular. and risk free rate is r0 . we make money if the stock moves more than implied volatility estimate.65) (8. σi ) (8. Thus. and we hedge by constructing a portfolio that eliminates the sensitivity to moves in the underlying variables. This point of view is appropriate regardless of what the risky factors are. r0 .66) shows that the gain or loss of our hedged portfolio relative to the risk free rate depends on the actual change in the stock price over the next ∆t in the term (∆S)2 relative to the implied volatility σi in BlackScholes that is being used to price the option in the market.62) instead of (8. and the derivative reﬂects a local approximation to the portfolio. Then the implied volatility is deﬁned as the value of σi that solves cm = cBS (S0 . let cm be the current market price of an option where the stock price is S0 .
HEDGING .88 CHAPTER 8.
2 Do the Factors Matter? Prices P Changes d V = Factor Model A dt + B dz In our tradables table. where the A and B representation of prices changes is the same.Chapter 9 The Road to Risk Neutrality 9. we have to start with a more basic question. the A and B representation of price changes does see the factors via Ito’s lemma. Let’s be a little more concrete about this. This introduction to risk neutral pricing does not follow the standard probabilityheavy route. Therefore. it is an extremely powerful approach. I will explain the risk neutral pricing principle. A. absence of arbitrage only depends on the values of P. Hypothesis: Perhaps by changing the factors from dz to some other random factors dψ. 89 . Thus. what other set of factors dψ will have the same A and B representation? At ﬁrst glance. I can compute the absence of arbitrage prices P easier. you might be tempted to say that I can choose dψ to be anything I want. and B. It doesn’t have to relate to the original dz at all because the absence of arbitrage condition ”does not see” dz. when understood correctly. Thus. regardless of what the driving factors are. but instead motivates risk neutral pricing in a simple manner from the factor approach.1) and the absence of arbitrage condition is A = Pλ0 + Bλ (9. Hence. 9. Nevertheless. and B. However. However. Why? Because when we deal with derivative securities. especially for those new to derivative pricing. To see whether something can be made of this hypothesis. and misused pricing approaches. misunderstood. it tends to be one of the most confusing. This leads us to ponder the following.2) Thus. their factor model is determined by the application of Ito’s lemma. the prices P are absence of arbitrage for any value change with A. and not on what the exact factors dz are. we have (9. you would soon realize that this is not the case. Ito’s lemma puts a constraint on which factors dψ are consistent with the original factors dz. Question: Given a set of factors dz. in this chapter.1 Introduction Risk neutral absence of arbitrage pricing is a widely used approach to derivative pricing.
3) (9. By Ito’s lemma we have dc = Thus.9) And thus we see that dz and dψ = d˜ + βdt are consistent in that they produce the same A and B z representation regardless of whether dz or dψ = d˜ + βdt is the factor! z The upshot is that we can replace any Brownian factors dz by Brownians plus a drift d˜ + βdt. 1 ct + µScS + σ 2 S 2 cSS dt + σScS dz.15) .11) (9. THE ROAD TO RISK NEUTRALITY dS = µSdt + σSdz and let c(S.2.12) µSdt + σS(dψ) µSdt + σS(d˜ + βdt) z (µ + σβS)dt + σSd˜ z (9. the new factor should be considered to be the entire term dψ = ηd˜ and not just the d˜. Again. dc = = = 1 z ct + (µ + σβ)ScS + σ 2 S 2 cSS dt + σScS d˜ 2 1 z ct + µScS + σ 2 S 2 cSS dt + σScS (d˜ + βdt) 2 1 ct + µScS + σ 2 S 2 cSS dt + σScS (dψ) 2 (9.1 Brownian Factors Changing the Mean Works Let’s try new factors given by the replacement dz → dψ = d˜ + βdt z (9. 2 (9. t).5) 9. I will replace the original Brownian factors by Brownians with a diﬀerent variance. and the z same absence of arbitrage prices hold! Changing the Variance Does Not Work Let’s try new factors given by the replacement dz → dψ = ηd˜ z (9.4) (9.14) (9.90 Let S(t) be a stock price with model CHAPTER 9.7) (9. We would have dS = = = and then by Ito’s lemma we have for c(S.6) That is. It is important to consider the new factor to be the entire term dψ = d˜ + βdt and not just d˜! z z We can ask whether this will lead to a representation consistent with the original dz.10) (9. We can ask whether this z z factor dψ = ηd˜ is consistent with the original dz. We would have z dS = = µSdt + σS (dψ) µdt + σS(ηd˜) z (9. the tradables table is Prices S c Changes d S c = Factor Model µS 1 ct + µScS + 2 σ 2 S 2 cSS dt + σS σScS dz.8) (9.13) That is. I will replace the original Brownian factors dz by a new factor that is a Brownian d˜ plus a drift z βdt. t) be a derivative security.
Therefore. α) → dψ = d˜ (t. Similarly.16) (9. α) (9. this substitution is not allowed. α) π (9.19) (9. we have π dS dc = = µSdt + (σ − 1)Sdψ = µSdt + (σ − 1)S(d˜ (t. α) (ct + µScS )dt + (c(σS) − c(S))dπ(t.25) (9. let’s consider changing the factor dπ(t. In this case. Now. t).2 Poisson Factors Changing the Intensity Works Assume that the original factor is a Poisson Process dπ(t. α). α). The above results lead us to the following principle. α) + ηdt) π (ct + (µS + (σ − 1)ηS)cS )dt + (c(σS) − c(S))d˜ (t.24) Now. Under dπ(t. π Adding a Drift Doesn’t Work Again assume that the original factor is a Poisson Process dπ(t. α + β) π and we see that the A and B representations remain the same under dψ = d˜ (t.9. in the case of multiple Brownian factors. α + β). then P is also absence of arbitrage if the factors dz are z z z z replaced by dz → dψ = d˜ + βdt with β arbitrary and where d˜ are Brownian factors with E[d˜d˜T ] = Σdt. α + β) π (9. α) we have dS dc = = µSdt + (σ − 1)Sdπ(t. then P is also absence of arbitrage under a diﬀerent set of factors dψ = d˜i (αi + βi ) π where αi + βi is the new intensity. we have dS dc = = µSdt + (σ − 1)Sd˜ (t. 9. we see that A is changed when we change the factor to another Brownian with a diﬀerent variance. α). changes in the correlations or the covariance structure is not allowed. (⋆) Arbitrage Invariance Principle for Brownian Motion: If a set of prices P is absence of arbitrage under Brownian factors dz ∈ Rn with E[dzdz T ] = Σdt. In this case.2.26) and we see that there is no way to recover the original A and B representations under dψ. α) + ηdt.17) (9. (⋆) Arbitrage Invariance Principle for Poisson: If a set of prices P is absence of arbitrage under Poisson factors dπi (αi ).2.23) (9.18) Thus.22) (ct + µScS )dt + (c(σS) − c(S))d˜ (t. . α) we have dS dc = = µSdt + (σ − 1)Sdπ(t. Under dπ(t. DO THE FACTORS MATTER? and then by Ito’s lemma we have for c(S. let’s consider changing the factor to a Poisson with an altered intensity dψ = d˜ (t.20) (ct + µScS )dt + (c(σS) − c(S))dπ(t.21) (9. α) (9. dc = = 1 z ct + µScS + σ 2 η 2 S 2 cSS dt + σScS (ηd˜) 2 1 ct + µScS + σ 2 η 2 S 2 cSS dt + σηScS (dψ) 2 91 (9. α + β) where π α + β > 0.
29) Now. z which leads to Prices Changes Factor Model (9.32) is called the risk neutral representation because all tradables have a drift equal to the market price of time.92 CHAPTER 9. First. and absence of arbitrage prices under this representation will be the same as under the original representation using the actual Brownian factors. In fact. Now. The basic idea is that for any pricing problem.1 Brownian Factors Prices P Changes d V = Factor Model A dt + B dz Consider the tradables table with Brownian factors dz (9. in this section we show that we can always replace the factors and put all tradables in what we will call a risk neutral representation. under arbitrage invariant substitutions of the factors dz → dψ = d˜ − λdt. Then. we can replace the factors to create the risk neutral representation which leads to simpliﬁed pricing formulas. the z following tradable table will produce the same absence of arbitrage prices Prices P Changes d V = Factor Model Pλ0 dt + B d˜ z (9.3. It states that we can ﬁnd a replacement set of factors so that all tradables have a drift equal to the market price of time λ0 . 9. (⋆) Risk Neutral Representation for Brownians: Let Prices P Changes d V = Factor Model A dt + B dz (9.30) = P Pλ0 dt + B d˜ d V z This is the risk neutral representation. Note that to obtain this risk neutral representation. . let’s see what the risk neutral representations are. Why is this helpful? Because in some situations it is easier to price a derivative if we replace the factors by something diﬀerent from the original factors. by the Brownian factor invariance under changes to the drift.27) and the absence of arbitrage condition is A = Pλ0 + Bλ. the factors were replaced by new factors that contained the market prices of risk! Let’s formalize this notion of a risk neutral representation. regardless of how risky they really are. we groups terms diﬀerently to obtain Prices P Changes d V = Pλ0 Factor Model dt + B (dz + λdt) (9.31) be a tradables table. we can replace dz → dψ = d˜ − λdt. we can substitute this into the tradables table to obtain Prices Changes Factor Model (9. The fact that we can do this will ultimately lead us to the risk neutral pricing principle. But we are getting ahead of ourselves.3 Risk Neutral Representations The Brownian and Poisson factor invariance principles tell us that it is okay to alter or replace the factors in certain ways.32) Equation (9. THE ROAD TO RISK NEUTRALITY 9.28) = P Pλ0 + Bλ dt + B dz d V Finally.
α + β). That is. Prices P Changes d V = Pλ0 Factor Model dt + B (dπ(t. let’s think about the market price of risk for a Poisson factor dπ.39) The ﬁnal step is to compensate the Poisson process.33) and the absence of arbitrage condition is A = Pλ0 + Bλ (9. Thus. we can substitute this into the tradables table to obtain Prices P Changes d V = Factor Model Pλ0 − Bλ′ dt + B dπ(t. let’s just note that λ < 0 for a Poisson factor. being short a Poisson factor is adding real (downside) risk. the new intensity of the factors in the risk neutral representation is equal to (minus) the market price of risk! Let’s make this formal. λ′ ) − λ′ dt) (9. RISK NEUTRAL REPRESENTATIONS 93 9. but for our purposes.3. Since a Poisson process either does nothing or jumps up by 1.3. we can see that the drift of the value changes V for all tradables is equal to the market price of time λ0 . α). by the Poisson factor invariance under changes to the intensity. This was the same as in Brownian case.37) where λ′ > 0. That is. Now. which leads to Prices P Changes d V = Factor Model Pλ0 − Bλ′ dt + B d˜ (t. for Poisson processes. if you are short a Poisson factor (B is negative). α) → dψ = d˜ (t. On the other hand. All the risk is on the upside. then you should be rewarded for taking on that risk. we would expect the market price of risk for a Poisson factor to be negative. This is the risk neutral representation under Poisson factors! Note that in this case. α) (9.9.35) (9.36) (9. I don’t like dealing with a negative quantity. λ′ ) (here is where it is important that λ′ > 0.34) Before proceeding. One can show this in a much more rigorous fashion. and a negative market price of risk would reﬂect that. we can replace the original Poisson factors by new Poisson factors with diﬀerent intensities so that the drift of all tradables is the market price of time λ0 (don’t forget to subtract oﬀ the mean of the Poisson factors so that they have zero mean!). it is always good to hold a positive amount of a Poisson factor. we subtract oﬀ the mean of the Poisson process so that the random factor has mean zero. . λ′ ) π (9. Thus. α) (9.40) Since the random factor term now has zero mean. Let’s choose β = λ′ − α so that dψ = d˜ (t.2 Poisson Factors Consider the tradables table with Poisson factors dπ(t.38) Now. so let’s deﬁne λ′ = −λ and rewrite the absence of arbitrage condition in terms of λ′ as A = = = Pλ0 + Bλ Pλ0 + B(−λ′ ) Pλ0 − Bλ′ (9. we can replace dπ(t. Prices P Changes d V = A Factor Model dt + B dπ(t. so π π that it can be an intensity!).
.48) ˜ where the expectation E(·) is taken under the risk neutral representation and λ0 is the market price of time. regardless of how risky they really are. λ′ ) − λ′ dt) . (⋆) The Risk Neutral Pricing Principle: Absence of arbitrage prices are given by the formula ˜ P(0) = E e− t 0 λ0 (s)ds P(t) (9. Thus. z (9. From the risk neutral representaion equation (9. Thus. α) → dψ = d˜ (t. so let’s see how it would work in practice. z Now. under arbitrage invariant substitutions of the factors dπ(t. π (9. λ′ ).94 CHAPTER 9.32) we have dV = λ0 Pdt + Bd˜.46) since z is a Brownian motion. this is a new pricing point of view that follows from the factor approach! The presentation has been a little abstract to this point. α) (9. Then. THE ROAD TO RISK NEUTRALITY (⋆) Risk Neutral Representation for Poissons: Let Prices P Changes d V = A Factor Model dt + B dπ(t. Let’s see how it works for the Brownian case. we can start with the risk neutral representation. integration of both ˜ sides says that t ˜ P(0) = E e− 0 λ0 (s)ds P(t) .43) (9. (We also switched the d and the expectation. This risk neutral pricing principle applies in the Poisson case as well. The fact that the drift of all tradables is equal to the market price of time leads us to a convenient new pricing formula.4 Pricing as an Expectation The risk neutral representations lead to a powerful risk neutral pricing formula. via Ito’s lemma one can verify that d e− Taking expectations of both sides gives ˜ d E e− t 0 t 0 (9. (9.41) be a tradables table. so that V = P.45) λ0 (s)ds P =0 (9. we have dP = λ0 Pdt + Bd˜ z which is telling us that the drift of P is λ0 when we use the factors based on d˜.47) We have arrived at risk neutral pricing.42) is called the risk neutral representation because all tradables have a drift equal to the market price of time. z Let’s assume that we are not dealing with a futures contract. 9.44) λ0 (s)ds P = e− t 0 λ0 (s)ds Bd˜.42) Equation (9. π the following tradable table will produce the same absence of arbitrage prices Prices P Changes d V = Pλ0 Factor Model dt + B (d˜ (t. This is because instead of starting with the real tradables table.) Finally.
51) Now.53). Apply the Risk Neutral Pricing Formula to the derivative security that is being priced. if T is expiration. so the risk neutral pricing formula is ˜ c(S(0). T ) .1 How to Apply Risk Neutral Pricing Risk neutral pricing is applied using the following steps.54) Now.2 BlackScholes Let’s see how risk neutral pricing applies to the BlackScholes setup. we were able to obtain a pricing formula without resorting to partial diﬀerential equations! . 2.5.). T ) = (S(T ) − K)+ . 0) = E e−r0 T c(S(T ). (9.55) But. 1. APPLICATIONS OF RISK NEUTRAL PRICING 95 9.52) (9. t) gives ˜ c(S(0). z (9.53) (9.49) (9. let’s outline how it is applied. S(t) follows (9.57) (9. convert the tradables table to its risk neutral representation. 9.5. applying the risk neutral pricing formula to the call option c(S.e.59) c(S. Via an arbitrage invariant substitution of the factors. we have dB dS dc = = = r0 Bdt r0 Sdt + σSd˜ z r0 cdt + σScS d˜. Hence. Performing this expectation leads to d1 d2 = = ln(S/K) + (r0 + 1 σ 2 )(T ) 2 √ σ T √ d1 − σ T (9. Recall that the bond and stock follow dB dS dc = = = r0 Bdt µSdt + σSdz 1 (ct + µScS + σ 2 S 2 cSS )dt + σScS dz.5 Applications of Risk Neutral Pricing Let’s see how the risk neutral pricing principle is used in a couple of familiar situations. then we know that c(S(T ). 9. etc.58) (9. 2 (9.5. according to the risk neutral pricing principle.56) where the e−r0 T was pulled out of the expectation because it is not random. 0) = SN (d1 ) − Ke−r0 T N (d2 ) which is the BlackScholes formula! Thus. That is it! Let’s clarify with some examples. 0) = e−r0 T E (S(T ) − K)+ (9. ˜ The expectation E(·) is taken under the risk neutral representation (i.50) (9.9. But ﬁrst. we have the same absence of arbitrage prices if we set all the drifts to the risk free rate (market price of time).
The tradables table is 0 r0 B B B dπ(ν) dt + S (k − 1)S µS d S = c(kS − ) − c(S − ) ct + µScS c c (9. ln(k) 9.96 CHAPTER 9. T ) . the underlying variables are given by the instantaneous forward rates dr(ts) = µ(ts)dt + σ(ts)dz(t). (9.4 HJM Recall the HJM model of Section 7. i! i=α ∞ (9. I won’t compensate the original Poisson factor dπ(t. y) − Ke−r0 (T ) Ψ(x. β) = e−β β i . k−1 r0 − µ k−1 Let’s try it for a Poisson model of Section 6. Thus. 0) = E e−r0 T c(S(T ).60) For the risk neutral representation we use minus the market price of risk λ′ = 1 (9. (9. 0) = e−r0 T E (S(T ) − K)+ .61) Finally. In the risk neutral representation. T ) = (S(T ) − K)+ .1.3.64) where the e−r0 T was pulled out of the expectation because it is not random. S (k − 1)S d S = r0 S dt + 1 1 − − c(kS ) − c(S ) r0 c c c ˜ c(S(0).4.67) and the tradables are bonds that follow T dB(tT ) = B(tT )r(tt) − B(tT ) − B(tT ) t T 1 µ(ts)ds + B(tT ) 2 σ(ts)ds dz T t t T σ(ts)σ(tr)drds dt t . t) = SΨ(x. we can apply the risk neutral pricing formula and the risk neutral representation is 0 r0 B B B (dπ(λ′ ) − λ′ dt) .2. To be consistent with the approach above in Section 9.5. In this model.3.3 Poisson Model and the market price of time is λ0 = r0 while the market price of risk is λ1 = µ − r0 . ν) as was done in Section 6.64) gives c(S.2. THE ROAD TO RISK NEUTRALITY 9.4. then we know that c(S(T ).1. computing the expectation in (9. (9.62) (9.62) determines the expectation E(·). ˜ (9.5. y/k) where Ψ(α. so the risk neutral pricing formula is ˜ c(S(0).66) and x is the smallest nonnegative integer greater than ln(K/S)−µ(T ) .63) But.65) y= (r0 − µ)kT k−1 (9. if T is expiration.
one can compute the payoﬀ value of a derivative. if we know the volatility of the instantaneous forward rates (σ(tT )). we have a zero market price of risk λ1 = 0 (because all tradables earn the risk free rate). the relationship between the bonds B(tT ) and the instantaneous forward rates is r(tT ) = − Then. one simulates the risk neutral representation of the instantaneous forward rates.2. So. then we can compute what the drift terms must be by equation (9. In the HJM model. Another way to get to this same result is to revisit the results of Section 7.68) where d˜ is the risk neutral factor and r(tt) = r0 (t) is the instantaneous short rate. let’s start over but with the risk neutral perspective in mind. APPLICATIONS OF RISK NEUTRAL PRICING 97 If you recall from Section 7. Well. we have dr(tT ) = = If one lets σ(tT ) = − then µ(tT ) = where dr(tT ) = µ(tT )dt + σ(tT )dz(t). This is the risk neutral priceing approach. Since this is a single factor model. the above equation becomes T µ(tT ) = σ(tT ) t σ(ts)ds (9. we can ask what z this implies about the instantaneous forward rates in the risk neutral world.71) (9. Thus.3 equation (7. Then. Thus.22) that said: T µ(tT ) − σ(tT ) σ(ts)ds = σ(tT )λ1 t (9. (9.2.3 this was extremely messy to deal with. the expectation is often computed by Monte Carlo.73) These equations tell us that under the risk neutral representation. Let’s start with the tradables that are the bonds. z 2 ∂T ∂ ν(tT ) ∂T (9.76) which is what we were looking for. ∂T (9. That is. and compute the expectation of it.9. .72) σ(ts)ds t (9. if we want to use the risk neutral representation.73).69) ∂ ∂T ∂ ∂T 1 2 ∂ ν (tT ) − r(tt) dt − ν(tT )d˜ z 2 ∂T 1 2 ∂ ν (tT ) dt − ν(tT )d˜. we would ﬁrst estimate the volatilities of the instantaneous forward rates σ(tT ) from market data. by Ito’s lemma. then use (9.73) to compute the risk neutral drifts. Once we have this. pricing proceeds via expectations as in the risk neutral pricing formula.70) (9. the risk neutral representation under Brownians tells us that we may write the tradables in the form dB(tT ) = r(tt)B(tT )dt + ν(tT )B(tT )d˜ z (9. Now.75) and note that in the risk neutral representation.5.74) ∂ ∂T 1 2 ν (tT ) 2 = ν(tT ) ∂ ν(tT ) = σ(tT ) ∂T T ∂ ln B(tT ).
we can switch to the risk neutral representation by setting all the market prices of risk to zero λi−1 = 0. 9. Recall from equation (7. Pricing is then done by expectation. 9. but deals with forward rates between times Ti and Ti+1 denoted by R(tTi . we write Ri = R(tTi . Note that this is similar to what we had in the HJM risk neutral model. T2 ) = a1 R(tT1 . This reduces the above equation to τ bj R i τ ai Ri − τ bi Ri j=1 (1+τ Rjj ) ρij (9.77) (9.7. and then use equation (9..5. my friends.4 is also made quite simple by the use of risk neutrality.82) to obtain their drifts in the risk neutral representation.81) a i = bi j=1 ρij (9. .82) which is quite a bit simplier than what we started with.7. But that. T2 )dz2 . The idea was that details of the factors do not seem to appear in the factor APT equations that we used throughout the book. By using a diﬀerent set of factors (that still preserved the same absense of arbitrage prices) in many case we can simplify our calculations of the absense of arbitrage prices.79) or by simplifying further Now. Ti+1 ) and that are assumed to follow dR(tT1 . Recall that the LMM is similar to the HJM model. We can use market data to estimate the bi terms (the volatility of the forward rates)..46) that the calibration relationship on the underlying Ri variables is τ bj R i τ ai Ri − τ bi Ri j=1 (1+τ Rjj ) ρij τ bi−1 Ri−1 . In fact. THE ROAD TO RISK NEUTRALITY 9.2. is the subject of another little book.6 Summary The point of this chapter was to show that risk neutral pricing is a logical consequence of the factor approach to derivative pricing. there is quite a bit more that one can do when the full power and generality of the risk neutral approach is explored.45).80) 0= 1 + τ Ri i τ a i R i − τ bi R i or ﬁnally j=1 i τ bj R j (1 + τ Rj ) τ bj R j (1 + τ Rj ) ρij = 0 (9.5 Libor Market Model The LMM model of Section 7.7 Problems Problem 9.78) (9. Ti+1 ) with dRi = ai Ri dt + bi Ri dzi .1 Verify equation (9. This is the basic notion of risk neutral pricing. That gave us the idea that perhaps we could change the factors (as long as we didn’t disturb the basic factor coeﬃcient structure) and still arive at the same absense of arbitrage price.2 Verify equatin (9. For notational simplicity.70). = −λi−1 (1 + τ Ri−1 ) 1 + τ Ri (9. Problem 9.98 CHAPTER 9. T2 )dt + b1 R(tT1 .
[14] G. 1993. J. Academic Press. Journal of Financial Economics. A theory of the term structure of inteest rates. Gatarek.C. 7:127–154. The Arbitrage Theory of Capital Asset Pricing. Gillespie. 1930. A. Black. [3] A. 1976. 3:167–179. and M. Cox and S. 5:177–188. Jarrow D. Ingersoll. 3:145–166. Merton. A ClosedForm Solution for Options with Stochastic Volatility with Applications to Bond and Currency Options. 1973. 33(1):177– 186. 99 .C. Journal of Economic Theory. On the theory of brownian motion. Journal of Political Economy. Markov Processes: an introduction for physical scientists. L. [2] F. Heath and A. Journal of Finance. [11] Robert C. 1976. Margrabe. The value of an option to exchange one asset for another.G. 1977. The valuation of options for alternative stochastic processes. Uhlenbeck and L. Mathematical Finance. New York. and S. 60(1):77–105. SpringerVerlag. Williams.E. Phys. Diﬀusions.S. Journal of Financial Economics. 1997. 81:637–659. 1985. [5] J. Rev. 59:341–360.. Econometrica. [8] Daniel T. Ross. [15] O. Heston. [12] L. The pricing of commodity contracts. 1976. 36:823–841. 2000. [4] J. 1976. 1992.Bibliography [1] F. [10] W. Ross. Econometrica. An Equilibrium Characterization of the Term Structure. 6(2):327–343. [9] S. Black and M. The pricing of options and corporate liabilities. Journal of Financial Economics. Bond Pricing and the Term Structure of Interest Rates: A New Methodology. Ito Calculus. Morton. Ornstein. 53:385–467. [13] S. Cox. 5:125–144. Journal of Financial Economics. Brace. The Review of Financial Studies. [7] J.A. 1980.A. Methods of Mathematical Economics. Cambridge. 1997. Scholes. Rogers and D.C. D. [6] R. Markov Processes and Martingales: Volume 2. Franklin. Vasicek. 1978. Option Pricing when the Underlying Stock Returns are Discontinuous. Ross. The market model of interest rate dynamics. Musiela.
50 Factor Models Via Ito’s Lemma. 69 Market Incomplete. 7 intensity. 43 Jump Diﬀusion Derivative Pricing. 77 BlackScholes. 32 Arbitrage Price Implication. 43 Factors. 46. 42 Deﬁnition. 86 Incomplete. 85 delta. 1 Null Space. 35 Return Implication. 53 Poisson Random Variable. 25 Perpendicular Space. 35 Return Form. 85 Derivative. 33 Marketed Tradables. 43 OrnsteinUhlenbeck. 66 Hedging. 84 Delta Hedge. 25 Delta. 32 Relation to Range Space. 81 intensity. 41 Futures Derivative Pricing. 31 BlackScholes. 1 Increment. 7 CoxIngersollRoss. 55 Money Market Account. 3 Price APT 100 . 1. 80 Incomplete. 3 Calibration. 85 rho. 81 Hedging. 33 Market Price of Time. 85 gamma. 63. 46 Market Price of Risk. 49. 95 Hedging. 42 Dividends. 84 DeltaGamma Hedge. 55 Merton. 85 theta. 49 Greeks. 79. 54 Gaussian Random Variable. 3 Poisson Processes Derivative Pricing. 44 Merton. 80 Implied Volatility. 64 Dynamics. 80 Delta. 85 Immunization. 95 Formula. 32 Option European Call. 85 HeathJarrowMorton. 47 Compound Poisson Process. 1 geometric Brownian Motion BlackScholes.Index APT. 55 LiborMarketModel. 84 DeltaGamma. 85 Taylor Expansion. 81 Immunization. 32 Poisson Process. 80 Brownian Motion. 79. 3 Ito’s Lemma Obtaining Factor Models. 32 Price Form. 64 Normal Random Variable. 24. 85 vega. 3 Compound.
44 Range Space. 94 Risk Neutral Representation. 24. 7 Tradables. 92. 94 Risk Neutral Pricing Risk Neutral Representation. 89 Pricing. 86 101 . 32 Relation to Perp of Null Space. 64 Volatility Implied. 64 Vasicek. 94 Short Rate. 44 Underlying Variables. 41 Vasicek. 92. 44 Risk Neutral. 64 Single Factor Models. 42 Marketed. 64 Stochastic Process Compound Poisson Process. 32 Relative Pricing.INDEX Application to Pricing. 95 Principle. 44 Tradables Table.
This action might not be possible to undo. Are you sure you want to continue?