Professional Documents
Culture Documents
Liuren Wu
Investment: Buy if market price is lower than model value; sell otherwise.
How should you value the forward differently if you have inside information
that the company will be bought tomorrow and the stock price is going to
double?
I Shorting a forward at $100 is still safe for you if you can buy the stock
at $100 to hedge.
If you can really forecast the cashflow (with inside information), you
probably do not care much about hedging or no-arbitrage modeling.
I What is risk to the market is an opportunity for you.
I You just lift the market and try not getting caught for insider trading.
If you do not have insights on cash flows and still want to invest in
derivatives, the focus does not need to be on forecasting, but on
cross-sectional consistency.
I No-arbitrage pricing models can be useful.
ST = 22
St = 20
PP
P
P
Comments: ST = 18
Once we make the binomial dynamics assumption, the actual probability of
reaching either node (going up or down) no longer matters.
We have enough information (we have made enough assumption) to price
options that expire in 3 months.
Remember: For derivative pricing, what matters is the list of possible
scenarios, but not the actual probability of each scenario happening.
Liuren Wu Overview Advanced Derivatives Pricing 9 / 88
A 3-month call option
Consider a 3-month call option on the stock with a strike of $21.
Backward induction: Given the terminal stock price (ST ), we can compute
the option payoff at each node, (ST − K )+ .
The zero-coupon bond price with $1 par value is: 1 × e −.12×.25 = $0.9704.
B = 1
T
ST = 22
Bt = 0.9704 cT = 1
St = 20 Q
Q
ct = ? QQ BT = 1
ST = 18
cT = 0
Two angles:
Replicating: See if you can replicate the call’s payoff using stock and bond.
⇒ If the payoffs equal, so does the value.
Hedging: See if you can hedge away the risk of the call using stock. ⇒ If
the hedged payoff is riskfree, we can compute the present value using
riskfree rate.
Liuren Wu Overview Advanced Derivatives Pricing 10 / 88
Replicating
B = 1
T
S T = 22
Bt = 0.9704 cT = 1
St = 20 Q
Q
ct = ? QQ BT = 1
ST = 18
cT = 0
Assume that we can replicate the payoff of 1 call using ∆ share of stock and
D par value of bond, we have
Delta hedging completely erases risk under the binomial model assumption:
The underlying stock can only take on two possible values.
Using two (independent) securities can span the two possible realizations.
I We can use stock and bond to replicate the payoff of an option.
I We can also use stock and option to replicate the payoff of a bond.
I One of the 3 securities is redundant and can hence be priced by the
other two.
What will happen for the hedging/replicating exercise if the stock can take
on 3 possible values three months later, e.g., (22, 20, 18)?
I It is impossible to find a ∆ that perfectly hedge the option position or
replicate the option payoff.
I We need another instrument (say another option) to do perfect
hedging or replication.
One way to set up the tree is to use (u, d) to match the stock return
volatility (Cox, Ross, Rubinstein):
√
u = eσ ∆t
, d = 1/u.
The option prices obtained from the binominal tree often shows a zigzag
behavior across strikes.
One way to smooth it up is to price the options using two trees, one using N
steps and the other using (N + 1) steps. Then, simply averaging the value
from the two trees with generate much better/smoother results.
Yet another simple approach is to replace the last step value by the BMS
model analytical value.
I Even if the option is American, replacing the last step with the
European value does not hurt much.
I The smoothing effect can be even better than the averaging approach.
The binomial model: Discrete states and discrete time (The number of
possible stock prices and time steps are both finite).
The BMS model: Continuous states (stock price can be anything between 0
and ∞) and continuous time (time goes continuously).
Scholes and Merton won Nobel price. Black passed away.
BMS proposed the model for stock option pricing. Later, the model has
been extended/twisted to price currency options (Garman&Kohlhagen) and
options on futures (Black).
I treat all these variations as the same concept and call them
indiscriminately the BMS model.
1 2
Integral form: St = S0 e µt− 2 σ t+σWt
, ln St = ln S0 + µt − 12 σ 2 t + σWt
1.8 0.018
1.4 0.014
PDF of ln(St/S0)
1.2 0.012
PDF of St
1 0.01
0.8 0.008
0.6 0.006
0.4 0.004
0.2 0.002
0 0
−1 −0.5 0 0.5 1 50 100 150 200 250
ln(St/S0) St
0.04
180
0.03
160
0.02
Daily returns
Stock price
0.01 140
0 120
−0.01
100
−0.02
80
−0.03
−0.04 60
0 50 100 150 200 250 300 0 50 100 150 200 250 300
Days days
dxt = µx dt + σx dWt ,
where the random counting measure µ(dz, dt) realizes to a nonzero value for a
given z if and only if x jumps from xt− to xt = xt− + g (z) at time t. The process
ν(z, t)dzdt compensates the jump process so that the last term is the increment
of a pure jump martingale. The transformed variable y = f (x, t) follows,
y = ln St . We have
Z Z
1
d ln St = µ − σ 2 dt + σdWt + zµ(dz, dt) − (e z − 1) ν(z, t)dzdt
2
f (xt− + g (z), t) − f (xt− , t) = ln St − ln St− = ln(St− e z ) − ln St− = z.
The specification (e z − 1) on price guarantees that the maximum downside
jump is no larger than the pre-jump price level St− .
(z−µJ )2
−
1 2σ 2
In Merton, ν(z, t) = λ σ √
2π
e J .
J
Under the binominal model, if we cut time step ∆t small enough, the
binomial tree converges to the distribution behavior of the geometric
Brownian motion.
Reversely, the Brownian motion dynamics implies that if we slice the time
thin enough (dt), it behaves like a binominal tree.
I Under this thin slice of time interval, we can combine the option with
the stock to form a riskfree portfolio, like in the binomial case.
I Recall our hedging argument: Choose ∆ such that f − ∆S is riskfree.
I The portfolio is riskless (under this thin slice of time interval) and must
earn the riskfree rate.
I Since we can hedge perfectly, we do not need to worry about risk
premium and expected return. Thus, µ does not matter for this
portfolio and hence does not matter for the option valuation.
I Only σ matters, as it controls the width of the binomial branching.
∂f ∂f 1 ∂2f
+ (r − q)S + σ 2 S 2 2 = rf
∂t ∂S 2 ∂S
I The only free parameter is σ (as in the binominal model).
Solving this PDE, subject to the terminal payoff condition of the derivative
(e.g., fT = (ST − K )+ for a European call option), BMS derive analytical
formulas for call and put option value.
I Similar formula had been derived before based on distributional
(normal return) argument, but µ was still in.
I The realization that option valuation does not depend on µ is big.
Plus, it provides a way to hedge the option position.
The PDE is generic for any derivative securities, as long as S follows
geometric Brownian motion.
I Given boundary conditions, derivative values can be solved numerically
from the PDE.
Liuren Wu Overview Advanced Derivatives Pricing 34 / 88
Explicit finite difference method
One way to solve the PDE numerically is to discretize across time using N
time steps (0, ∆t, 2∆t, · · · , T ) and discretize across states using M grids
(0, ∆S, · · · , Smax ).
We can approximate the partial derivatives at time i and state j, (i, j), by
the differences:
f −f f +fi,j−1 −2fi,j f −f
fS ≈ i,j+12∆Si,j−1 , fSS ≈ i,j+1 ∆S 2 , and ft = i,j ∆ti−1,j
fi,j −fi−1,j fi,j+1 −fi,j−1 2
The PDE becomes: ∆t + (r − q)j∆S 2∆S + 12 σ 2 j 2 (∆S) = rfi,j
Collecting terms: fi−1,j = aj fi,j−1 + bj fi,j + cj fi,j+1
I Essentially a trinominal tree: The time-i value at j state of fi,j is a
weighted average of the time-i + 1 values at the three adjacent states
(j − 1, j, j + 1).
At (i, j), we can define the difference (ft , fS ) in three different ways:
I Backward: ft = (fi,j − fi−1,j )/∆t.
I Forward: ft = (fi+1,j − fi,j )/∆t.
I Centered: ft = (fi+1,j − fi−1,j )/(2∆t).
Different definitions results in different numerical schemes:
I Explicit: fi−1,j = aj fi,j−1 + bj fi,j + cj fi,j+1 .
I Implicit: aj fi,j−1 + +bj fi,j + cj fi,j+1 = fi+1,j .
I Crank-Nicolson:
−αj fi−1,j−1 + (1 − βj )fi−1,j − γj fi−1,j+1 = αj fi,j−1 + (1 + βj )fi,j + γj fi,j+1 .
ln(Ft /K ) ± 21 σ 2 (T − t)
ct = e −r (T −t) [Ft N(d1 ) − KN(d2 )] , d1,2 = √
σ T −t
Since Ft (or St ) is observable from the underlying stock or futures market,
(K , t, T ) are specified in the contract. The only unknown (and hence free)
parameter is σ.
We can estimate σ from time series return. (standard deviation calculation).
Alternatively, we can choose σ to match the observed option price —
implied volatility (IV).
There is a one-to-one correspondence between prices and implied volatilities.
Traders and brokers often quote implied volatilities rather than dollar prices.
The BMS model says that IV = σ. In reality, the implied volatility calculated
from different options (across strikes, maturities, dates) are usually different.
In the risk-neutral world, investing in all securities make the riskfree rate as
the total return.
If a stock pays a dividend yield of q, then the risk-neutral expected return
from stock price appreciation is (r − q), such as the total expected return is:
dividend yield+ price appreciation =r .
Investing in a currency earns the foreign interest rate rf similar to dividend
yield. Hence, the risk-neutral expected currency appreciation is (r − rf ) so
that the total expected return is still r .
Regard q as rf and value options as if they are the same.
Under the risk-neutral measure (Q), stocks earn the riskfree rate, regardless
of the predictability: dSt /St = (r − q) dt + σdWt
Analogously, the futures price is a martingale under Q, regardless of whether
you can predict the futures movements or not: dFt /Ft = σdWt
Measure change from P to Q does not change the diffusion component
σdW .
The BMS formula still applies.
depend on vt .
For f (S, v , t), we have (bivariate version of Ito):
dft = ft + fS µS + 12 fSS vt S 2 + fv κ (θ − vt ) + 21 fvv ω 2 vt + fSv ωvt ρ dt +
√ √
fS σS vt dWt + fv ω vt dW2t .
Since there are two sources of risk (Wt , W2t ), delta-hedging is not going to
lead to a riskfree portfolio.
The log currency return over period [t, T ], ln STfh /Stfh equals the difference
between the log pricing kernels of the two economies.
Let S denote the dollar price of pound (e.g. St = $2.06), then
pound dollar
ln ST /St = ln Mt,T − ln Mt,T .
If markets are completed by primary securities (e.g., bonds and stocks),
there is one unique pricing kernel per economy. The exchange rate
movement is uniquely determined by the ratio of the pricing kernels.
If the markets are not completed by primary securities, exchange rates (and
currency options) help complete the markets by requiring that the ratio of
any two viable pricing kernels go through the exchange rate.
Liuren Wu Overview Advanced Derivatives Pricing 46 / 88
From pricing kernel to measure change
Consider the following exponential martingale that defines the measure change
from P to Q:
dQ
Rt
dP t = E − 0 γs dWs ,
If the P dynamics is: dSti = µi Sti dt + σ i Sti dWti , with ρi dt = E[dWt dWti ],
then the dynamics of Sti under Q is: dSti =
µi Sti dt + σ i Sti dWti + E[−γt dWt , σ i Sti dWti ] = µit − γσ i ρi Sti dt + σ i Sti dWti
If Sti is the price of a traded security, we need r = µit − γσ i ρi . The risk
premium on the security is µit − r = γσ i ρi .
How do things
change if the
pricing kernel is
given by:
RT RT > RT
Mt,T = exp − t rs ds E − t γs dWs E − t γs> dZs ,
where Zt is another Brownian motion independent of Wt or Wti .
The time-t price of a $1 par riskfree zero-coupon bond that expires at T is:
pt = EPt h[Mt,T] R i
RT T
= EPt exp − t rs ds E − t γs> dXs
h R i
T
= EQ
t exp − t rs ds .
We need to know the short rate dynamics under Q for bond pricing.
Suppose thepricing kernel
is: R
RT RT T
Mt,T = exp − t rs ds E − t γs> dXs E − t ηs> dZs with X , Z
orthogonal, and r = r (X ).
Bond price will only depend on X , not Z .
Let X denote a pure jump process with its compensator being ν(x, t) under
P.
Consider a measure change defined by the exponential martingale:
dQ
dP t = E (−γXt ),
The compensator of the jump process under Q becomes:
ν(x, t)Q = e −γx ν(x, t).
Example: Merton (176)’s compound Poisson jump process,
(x−µJ )2
−
1 2σ 2
ν(x, t) = λ σ √
2π
e J . Under Q, it becomes
J
(x−µ )2
Q
(x−µJ )2 J
−γx− −
1 2σ 2 1 2σ 2
Q
ν(x, t) = λ σ √ e J =λ Q √ e J with µQ = µJ − γσJ2
J 2π σJ 2π
1 2
and λQ = λe 2 γ(γσJ −2µJ ) .
Reference: Küchler & Sørensen, Exponential Families of Stochastic Processes, Springer, 1997.
0.6 80
0.4 70
0.2 60
Delta
Delta
0 50
−0.2 40
−0.4 30
−0.6 20
−0.8 10
−1 0
60 80 100 120 140 160 180 60 80 100 120 140 160 180
Strike Strike
40
(St = 100, T − t = 1, σ = 20%) 40
35 35
30 30
25 25
Vega
Vega
20 20
15 15
10 10
5 5
0 0
60 80 100 120 140 160 180 −3 −2 −1 0 1 2 3
Strike d2
Consider an option portfolio that is delta-neutral but with a vega of −8, 000. We
plan to make the portfolio both delta and vega neutral using two instruments:
The underlying stock
A traded option with delta 0.6 and vega 2.0.
How many shares of the underlying stock and the traded option contracts do we
need?
As before, it is easier to take care of the vega first and then worry about the
delta using stocks.
To achieve vega neutral, we need short/write 60000/10 = 6000 contracts of
the traded option.
With the traded option position added to the portfolio, the delta of the
portfolio becomes 2000 − 0.5 × 6000 = −1000.
We hence also need to long 1000 shares of the underlying stock.
n1 = −6000, n2 = 1000.
When do you want to have non-zero target exposures?
Liuren Wu Overview Advanced Derivatives Pricing 58 / 88
BMS gamma
Gamma (Γ) is the rate of change of delta (∆) with respect to the price of
the underlying asset.
The BMS gamma is the same for calls and puts:
0.02
(St = 100, T − t = 1, σ = 20%) 0.02
0.018 0.018
0.016 0.016
0.014 0.014
0.012 0.012
Vega
Vega
0.01 0.01
0.008 0.008
0.006 0.006
0.004 0.004
0.002 0.002
0 0
60 80 100 120 140 160 180 −3 −2 −1 0 1 2 3
Strike d2
40
ratio is stable over time.
The underlying variable (stock
Call
30 I
20
Dynamic hedging can generate large hedging errors when the underlying
variable (stock price) can jump randomly.
I A large move size per se is not an issue, as long as we know how much
it moves — a binomial tree can be very large moves, but delta hedge
works perfectly.
I As long as we know the magnitude, hedging is relatively easy.
I The key problem comes from large moves of random size.
An alternative is to devise static hedging strategies: The position of the
hedging instruments does not vary over time.
I Conceptually not as easy. Different derivative products ask for different
static strategies.
I It involves more option positions. Cost per transaction is high.
I Monitoring cost is low. Fewer transactions.
gamma that the target call option will have at time u, should the underlying
price level be at K then.
Assumption: The stock price can jump randomly. The only requirement is
that the stock price is Markovian in itself: St captures all the information
about the stock up to time t.
Target Mat 2 4 12 12 12 2 4 12
JPYUSD:
Strategy Mean Std Max Min Skew Kurt
Value 55.40 3.71 67.39 37.77 0.55 2.77
No Hedge -49.46 50.01 0.00 -100.00 -0.02 1.00
Delta -44.24 20.88 76.67 -138.26 0.79 7.86
Vega -44.45 17.50 67.47 -130.67 1.40 10.14
Vanna -45.38 18.01 67.47 -137.32 1.32 10.08
Static -50.86 10.68 48.94 -68.02 6.66 62.00
GBPUSD:
Strategy Mean Std Max Min Skew Kurt
Value 49.35 1.66 54.98 42.86 0.21 4.09
No Hedge -32.21 46.73 0.00 -100.00 -0.76 1.58
Delta -36.39 19.81 70.02 -127.86 1.20 6.34
Vega -38.24 16.58 55.74 -124.94 1.85 8.82
Vanna -39.07 16.33 54.92 -119.33 1.85 8.56
Static -46.55 10.80 49.19 -52.62 5.07 39.81
The BMS model has only one free parameter, the asset return volatility σ.
Call and put option values increase monotonically with increasing σ under
BMS.
Given the contract specifications (K , T ) and the current market
observations (St , Ft , r ), the mapping between the option price and σ is a
unique one-to-one mapping.
The σ input into the BMS formula that generates the market observed
option price (or sometimes a model-generated price) is referred to as the
implied volatility (IV).
Practitioners often quote/monitor implied volatility for each option contract
instead of the option invoice price.
35
Call option value, ct
0 0
0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1
Volatility, σ Volatility, σ
Since options give the holder only rights but no obligation, larger moves
generate bigger opportunities but no extra risk — Higher volatility increases
the option’s time value.
At-the-money option price is approximately linear in implied volatility.
Hence, c = e r (T −t) (F − K ) + p.
I If F < K , put is ITM with intrinsic value e r (T −t) (K − F ), call is OTM.
Hence, p = c + e r (T −t) (K − F ).
I If F = K , both are ATM (forward), intrinsic value is zero for both
options. Hence, c = p.
0.7
0.65
Short−term smile
Implied Volatility
0.6
0.55
0.45
0.2
0.16
0.14
0.12
0.1
0.08
−3 −2.5 −2 −1.5 −1 −0.5 0 0.5 1 1.5 2
Moneyness= ln(K/F
√
σ τ
)
JPYUSD GBPUSD
14 9.8
9.6
13.5
9.4
Average implied volatility
12.5 9
8.8
12
8.6
11.5
8.4
11 8.2
10 20 30 40 50 60 70 80 90 10 20 30 40 50 60 70 80 90
Put delta Put delta
A smile implies that actual OTM option prices are more expensive than
BMS model values.
I ⇒ The probability of reaching the tails of the distribution is higher
A negative skew implies that option values at low strikes are more expensive
than BMS model values.
I ⇒ The probability of downward movements is higher than that from a
normal distribution.
I ⇒ Negative skewness in the distribution.
Implied volatility smiles and skews indicate that the underlying security
return distribution is not normally distributed (under the risk-neutral
measure —We are talking about cross-sectional behaviors, not time series).
If we fit a quadratic function to the smile, the slope reflects the skewness of
the underlying return distribution.
The curvature measures the excess kurtosis of the distribution.
A normal distribution has zero skewness (it is symmetric) and zero excess
kurtosis.
This equation is just an approximation, based on expansions of the normal
density (Read “Accounting for Biases in Black-Scholes.”)
The currency option quotes: Risk reversals measure slope/skewness,
butterfly spreads measure curvature/kurtosis.
Check the VOLC function on Bloomberg.
For single name stocks (AMD), the short-term return distribution is highly
fat-tailed. The long-term distribution is highly negatively skewed.
For stock indexes (SPX), the distributions are negatively skewed at both
short and long horizons.
For currency options, the average distribution has positive butterfly spreads
(fat tails).
Normal distribution assumption does not work well.
Another assumption of BMS is that the return volatility (σ) is constant —
Check the evidence in the next few pages.
0.5
0.45
0.45
0.4
0.4
Implied Volatility
Implied Volatility
0.35
0.35
0.3 0.3
0.25
0.25
0.2
0.2
0.15
0.15
0.1
0.1 0.05
96 97 98 99 00 01 02 03 96 97 98 99 00 01 02 03
JPYUSD GBPUSD
28
12
26
24 11
22 10
Implied volatility
Implied volatility
20
9
18
16 8
14 7
12
6
10
8 5
1997 1998 1999 2000 2001 2002 2003 2004 1997 1998 1999 2000 2001 2002 2003 2004
0.3
0.3
0.25
0.25
0.2
0.2
0.15
0.15
0.1
0.1 0.05
0.05 0
96 97 98 99 00 01 02 03 96 97 98 99 00 01 02 03
JPYUSD GBPUSD
50
10
40
30 5
RR10 and BF10
10
−5
0
−10
−10
−20 −15
1997 1998 1999 2000 2001 2002 2003 2004 1997 1998 1999 2000 2001 2002 2003 2004
Three-month 10-delta risk reversal (blue lines) and butterfly spread (red lines).