A Review of Volatility and Option Pricing: by Sovan Mitra
A Review of Volatility and Option Pricing: by Sovan Mitra
A Review of Volatility and Option Pricing: by Sovan Mitra
Pricing
by
Sovan Mitra
arXiv:0904.1292v1 [q-fin.PR] 8 Apr 2009
Abstract
The literature on volatility modelling and option pricing is a large and diverse area
due to its importance and applications. This paper provides a review of the most signif-
icant volatility models and option pricing methods, beginning with constant volatility
models up to stochastic volatility. We also survey less commonly known models e.g.
hybrid models. We explain various volatility types (e.g. realised and implied volatility)
and discuss the empirical properties.
Key words: Option pricing, volatility models, risk neutral valuation, empirical volatil-
ity.
This paper provides a review of the most significant volatility models and their
related option pricing models, where we survey the development from constant up to
stochastic volatility. We define volatility, the volatility types and study the empirical
characteristics e.g. leverage effect. We discuss the key attributes of each volatility
modelling method, explaining how they capture theoretical and empirical character-
istics of implied and realised volatility e.g. time scale variance. We also discuss less
commonly known models.
The study of volatility has become a significant area of research within financial
mathematics. Firstly, volatility helps us understand price dynamics since it is one of
the key variables in a stochastic differential equation governing an asset price. Sec-
ondly, volatility is the only variable in the Black-Scholes option pricing equation that
is unobservable, hence the ability to model volatility is crucial to option pricing.
Thirdly, volatility is a crucial factor in a wide range of research areas. For example,
contagion effects involve the “transmission” of volatility from one country to another
where:
2
• µ(t, X(t)) denotes the drift of X(t);
• σ(t, X(t)) denotes the volatility (also known as the diffusion term).
The stochastic process is called a diffusion because the equation models diffusions in
physics. For continuous time modelling we let ∆t → 0 and so the stochastic difference
equation becomes:
It is worth noting in equation (4) that the first integral is a standard Riemman integral
whereas the second integral is a stochastic integral, that is, an integral with respect to
a Wiener process.
Definition 2. Let {Ft } denote the set of information that is available to the observer.
If
3
For a given stochastic process X(t), increasingly more information is revealed to
an observer as time progresses. Hence at time t=a (where a is a constant) some
information is revealed and this information is known with certainty at any future time
t > a. To keep track of the information flow that is revealed at time t we introduce
the filtration FtX .
Definition 3. The notation FtX denotes the information generated by process X(t)
on the interval [0,t]. If based upon the observations of the trajectory of X(s) over the
interval s ∈ [0, t] it is possible to determine if event A has occurred, then we say A is
FtX -measurable and write A ∈ FtX . If the stochastic process Z can be determined based
upon the trajectory of X(s) over the interval s ∈ [0, t] and Z(t) ∈ FtX , ∀t ≥ 0, then we
say Z is adapted to the filtration {FtX }t≥0 .
Definition 4. A stochastic process X(s) belongs to the class L2 [a, b] if the following
conditions are satisfied
Theorem 1. (Ito’s Lemma) Assume that X(t) is a stochastic process with the stochas-
tic differential given by
dX(t) = µ(t)dt + σ(t)dW, (6)
where µ and σ are adapted processes. Let Z(t) be a new process defined by Z(t) =
f (X, t) and f is a twice differentiable function, then Z has the stochastic differential:
∂f ∂f 1 2 ∂2f ∂f
df (X(t), t) = +µ + σ 2
dt + σ dW. (7)
∂t ∂X 2 ∂X ∂X
where a, µ and σ are constants. However, Bachelier’s equation was not satisfactory;
theoretically stock prices are always non-negative yet Bachelier’s equation allowed neg-
ative stock prices. We would expect percentage returns to be independent of stock price
4
X(t) yet Bachelier’s equation is not. Samuelson in 1965 [Sam65] introduced the Geo-
metric Brownian motion (GBM) of stock prices, which is the standard model for stock
prices to date:
In equation (10) X(t) is nonnegative with probability one and if X(t1 ) = 0 then
X(t) = 0, ∀t ≥ t1 . (12)
This property reflects financial “bankruptcy” since once X(t) equals 0 it will remain
permanently 0 thereafter.
If we have a portfolio of n assets X(t) = (X1 (t), X2 (t), ...., Xn (t)), governed by
n independent Wiener processes W = (W1 (t), W2 (t), ...., Wn (t)), we have the model
[Eth02]:
n
X
dXi /Xi = µi dt + σij dWj , i=1,..,n and j=1,..,n, (13)
j=1
n
! n
!
1X 2 X
Xi (t) = Xi (0)exp µi − σ t+ σij Wj , (14)
2 j=1 ij j=1
where
5
2.2. Black-Scholes Option Pricing
The Black-Scholes analysis of European options [BS73] yields a closed form solution
to option pricing, only requiring observable variables (except for volatility). The key
insight into their analysis was to construct a dynamically replicating portfolio of a
European option to value it. Options have become one of the most important and
frequently traded derivatives in finance. We give a definition of a derivative security
as follows [Bin04].
Definition 5. A derivative security (also known as a contingent claim) is a financial
contract whose value at expiration time T is precisely determined by the price of an
underlying asset at time T.
As the name implies, an option gives the right (but not the obligation) to buy or sell
an asset at a pre-determined price and time. A call option gives the right to buy the
asset whereas a put gives the right to sell the asset at a predetermined price. One can
purchase European options, which are options that can only be exercised at expiration
T, or American options, which can be exercised any time during the life of the option.
The Black-Scholes portfolio replication argument for European call options begins
by considering a market consisting of two assets -a riskless bond B(t) and a stock X(t)
with equations respectively:
where σ and µ are constants and r is the risk free rate of return. Black and Scholes man-
aged to determine the closed form value of a European call option C on the assumption
of no arbitrage (to be defined in section 2.4):
6
2.3. Risk Neutral Valuation
2.3.1. Martingales
In financial mathematics, martingale processes are important to understanding key
concepts, such as the idea of completeness. We therefore define martingales now.
• E[| Mt |] < ∞, ∀t ≥ 0;
• E[Ms | Ft ] = Mt , for s ≥ t.
where P is the probability measure and r̃ is the discount factor for risk. The difficulty
with this approach is that the discount factor r̃ and the probability measure P vary
7
according to the risk preference of an investor, so were unknown and typically arbi-
trarily chosen. For instance a risk neutral (or risk indifferent) investor does not require
an incentive or disincentive to take on a risky investment, therefore if the discounted
expected value remains constant he will pay the same price for an investment regardless
of the risk. Furthermore for a risk neutral investor we have r̃ equals the risk free rate of
return (which is observable), however P was unknown at the time Samuelson proposed
his option formula.
Cox et al. in [CRR79] realised that under a Black-Scholes model, option pricing is
independent of risk preferences. The investor’s risk aversion increases with the stock’s
drift µ; yet µ does not appear in the Black-Scholes option pricing equation (equation
(21)). By using equation (28), it was shown that all option prices would give the
same option price regardless of the risk preference chosen provided re and P are chosen
consistently. Therefore equation (28) and the Black-Scholes formula would always give
the same answer.
To value options as a risk neutral investor, we discount at the risk free rate and
take expectations under the risk neutral measure. To take expectations under the risk
neutral measure we need to change P in the stochastic differential equation, which
requires Girsanov’s Theorem. Girsanov’s Theorem tells us how a stochastic differential
equation (SDE) changes as probability measure P changes. Essentially, Girsanov’s
Theorem tells us a change in P corresponds to a change in drift µ and the rest of the
SDE remains unchanged. To explain how we change probability measures, let us define
probability spaces.
Definition 7. The triple {Ω, F , P} is called a probability space where Ω denotes the
sample space, the set of all possible events, F denotes a collection of subsets of Ω or
events and P is the probability measure on F or events. The quadruple {Ω, F , {Ft}t≥0 , P}
is called a filtered probability space.
Assume we have the probability space {Ω, F , P} then a change of measure from P
to Q means we have probability space {Ω, F , Q}. We now state Girsanov’s Theorem
for change of probability measures for SDEs.
Definition 8. (Girsanov’s Theorem) Suppose we have a family of information sets Ft
over a period [0,T] where T < ∞. Define over [0,T] the random process (known as the
Doleans exponential) ξt :
Z t Z
P 1 t 2
ξt = exp − λ(u)dW (u) − λ (u)du , (29)
0 2 0
where W P (t) is the Wiener process under probability measure P and λ(t) is an Ft -
measurable process that satisfies the Novikov condition
Z t
P 1 2
E exp λ (u)du < ∞, t ∈ [0, T ].
2 0
8
Then W Q is a Wiener process with respect to Ft under probability measure Q, where
W Q is defined by
Z t
Q P
W (t) = W (t) + λ(u)du, t ∈ [0, T ]. (30)
0
dW Q = dW P + λ(t)dt. (32)
If we choose
we obtain equation
which is a martingale (see section 2.3.1) since λ(t) cancels the drift. This choice of
λ(t) is also known as the market price of risk and the Sharpe ratio (discovered by
Sharpe [Sha66]). Note that only one choice of λ(t) in this SDE gives a martingale (or
alternatively eliminates the drift) hence Q is known as the unique equivalent martingale
measure. The equivalent martingale measure is also known as the risk neutral measure
and pricing under this measure is known as risk neutral valuation.
9
We define the random process ξt :
( n Z Z t )
X t
1
ξt = exp − λi (u)dWiP(u) − λ2i (u)du , (38)
i=1 0 2 0
where dWiP for i=1,...,n is an n-dimensional Wiener process under probability measure
P. Then under the measure Q, WiQ is a multidimensional Wiener process defined by:
Z t
Q P
Wi = Wi + λi (u)du, for i = 1, 2, ...n. (39)
0
When the volatility matrix is invertible then unique λi solutions exist, admitting a
unique risk neutral process. Therefore the discounted risk neutral process for the
multiple stock model is:
n
X
dXi /Xi = σij dWiQ . (43)
j=1
10
Definition 10. An arbitrage possibility in a financial market is a self-financed portfolio
V (t) such that:
• V (0) ≤ 0;
• E[V (T )] ≥ 0.
Then one can produce a riskless profit by shorting the lower drift asset X1 and using
the proceeds to purchase the higher drift asset X2 . For completeness we must have
̟1 ≥ ̟2 to enable us to trade or replicate every possible claim.
The no arbitrage definition enables us to state the put-call parity relationship for
European options, which does not require any assumptions other than the market is
arbitrage free.
11
Proposition 2. (Put-Call Parity)
Assume the market is arbitrage free. If a European call C(t,X) and a European put
P(t,X) with the same underlying asset X, strike K and expiration T exist then we have
the put-call parity relation:
P (t, X) = Ke−r(T −t) + C(t, X) − X(t) + D, (46)
where D is the cash dividend received from the underlying stock during the life of the
option.
dX1 (t) = µ1 (t, X1 (t), X2 (t))dt + σ11 (t, X1 (t), X2 (t))dW1 + σ12 (t, X1 (t), X2 (t))dW2 ,
(53)
dX2 (t) = µ2 (t, X1 (t), X2 (t))dt + σ21 (t, X1 (t), X2 (t))dW1 + σ22 (t, X1 (t), X2 (t))dW2 ,
(54)
12
with initial values at initial time s
X1 (s) = x1 , (55)
X2 (s) = x2 , (56)
with conditions
RT ∂f
0
E[(σ11 (t, X1 (t), X2 (t)) (t, X1 (t), X2 (t)))2
∂x1
∂f
+ (σ12 (t, X1 (t), X2 (t)) (t, X1 (t), X2 (t)))2 ]dt < ∞,
∂x1
RT ∂f
0
E[(σ21 (t, X1 (t), X2 (t)) (t, X1 (t), X2 (t)))2
∂x2
∂f
+ (σ22 (t, X1 (t), X2 (t)) (t, X1 (t), X2 (t)))2 ]dt < ∞,
∂x2
with boundary condition
and satisfy:
3. An Introduction to Volatility
3.1. Different Types of Volatility
Wilmott in [Wil01] distinguishes between four different types of volatility, although
in practise little distinction is made between them. The four types of volatility are:
13
• historic volatility (also known as realised volatility): this is a measure of volatility
using past empirical price data and will be explained further in section 3.2.
• implied volatility: this is the volatility associated with empirical option prices
and will be explained further in section 3.2.
• forward volatility: this is the volatility obtained from some forward instrument.
We note that all four types of volatility in theory should not differ since they all refer
to the same variable σ, however in practise they may be different. For example some
researchers believe actual volatility and implied volatility are 2 separate variables and
treat them differently (see Schonbucher’s model in section 4.4 for instance).
Note that sample variance contains n-1 in the denominator, whereas variance of
a theoretical distribution contains n;
•
X(ti )
χi = ln ; (61)
X(ti−1 )
14
In contrast to historic volatility we obtain implied volatility values from empirical
options data. Using Black-Scholes option pricing, call options C are a function of
C(X,t,T,r,σ,K), with all the independent variables observable except σ. Since the
quoted option price C obs is observable, using the Black-Scholes formula we can therefore
calculate or imply the volatility that is consistent with the quoted options prices and
observed variables. We can therefore define implied volatility I by:
where CBS is the option price calculated by the Black-Scholes equation (equation (21)).
Implied volatility surfaces are graphs plotting I for each call option’s strike K and
expiration T. Theoretically options whose underlying is governed by GBM should have
a flat implied volatility surface, since volatility is a constant; however in practise the
implied volatility surface is not flat and I varies with K and T.
Implied volatility plotted against strike prices from empirical data tends to vary in a
“u-shaped” relationship, known as the volatility smile, with the lowest value normally
at X=K (called “at the money” options). The opposite graph shape to a volatility
smile is known as a volatility frown due to its shape. The smile curve has become a
prominent feature since the 1987 October crash (see for instance [Bat00] and [CP98]).
Various explanations have been proposed to account for volatility smiles. Firstly, it
has been suggested that options are priced containing information about future short
term volatility that is not already contained within past price information (see for
instance [Jor95]). Secondly, implied volatility is influenced more by market sentiment
rather than by pure fundamentals, for instance the VIX (a weighted average of implied
volatilities from S&P 500 index options) is used as a gauge of market sentiment [SW01].
Thirdly, the transaction costs involved in trading options is significantly higher
and more complicated compared to their associated underlyings, creating volatility
smiles (see for instance [PRS99]). Finally, Jarrow and Turnbull [JT96] discuss non-
simultaneous price observation; since option and stock prices are from two different
financial markets, there will always be observation time differences. Such time differ-
ences can cause substantial differences in the estimated implied volatility.
15
the concept of leverage. Leverage (also known as the gearing or debt to equity ratio)
is defined by:
ι
Leverage= ,
MKT
where ι is the company’s total debt and MKT is the market capitalisation (number of
shares × share price). As the share price drops the company becomes riskier, since a
greater percentage of the company is debt financed, hence increasing volatility.
Black argued leverage could not entirely explain volatility since companies with
little or no debt still exhibit high volatility. Other explanations of volatility’s negative
corelation with stock price include portfolio rebalancing, where investors are forced to
liquidate their assets if they fall below a threshold price. Alternatively, threshold prices
may act as triggers for sales when interpretted in terms of prospect theory, as proposed
by Kahneman and Tversky [KT79]. Also there exists ownership concentration, where
an investor owns a substantial percentage of a company’s stock and sells all his holdings
at once (see for instance [CCW08]). In both cases a large volume of selling pushes prices
down further, increasing volatility.
Secondly, volatility (and return distributions) show dependency on the time scale
∆t chosen to measure it, as defined in equation (59). The return distribution becomes
increasingly more Gaussian as ∆t increases [Con01], known as “Gaussian aggregation”,
yet under GBM volatility is theoretically scale invariant. Such empirical observations
have motivated researchers to seek models that exhibit scale variation and is one of the
benefits of mean reverting stochastic volatility models.
Thirdly, Mandelbrot [Man63] and Fama [Fam65] were the first to observe volatility
clusters (positively autocorrelates) with time; large (small) price changes tend to fol-
low large (small) price changes. Such observations motivated GARCH and stochastic
volatility models (see section 4.3) and for this reason volatility clustering is sometimes
known as the “GARCH effect” [Con04]. The autocorrelation is significant over time
scales ∆t of days and weeks but insignificant over longer time scales. This is because
the autocorrelation strength decays following a power law with ∆t increasing.
The autocorrelation’s slow power law of decay has been cited as evidence of the
existence of long memory in volatility [Con01] and empirical evidence can be found
in [LGS99]. A stationary stochastic process X(t) has long memory if its covariance
υ(τ ) = cov(X(t), X(t + τ )) follows the power law of decay [Tan06]
16
and
∞
X
υ(τ ) = ∞. (65)
τ =0
As described in section 2.1.3, Bachelier [Bac00] proposed a model for stock prices,
with constant volatility (see equation (8)). Bachelier reasoned that investing was the-
oretically a “fair game” in the sense that statistically one could neither profit nor lose
from it. Hence Bachelier included the Wiener process to incorporate the random na-
ture of stock prices. Osborne [Osb59] conducted empirical work supporting Bachelier’s
model. Samuelson [Sam65] continued the constant volatility model under the Geomet-
ric Brownian Motion stock price model (see equation (10)) on the basis of economic
justifications.
Over time, empirical data and theoretical arguments found constant volatility to
be inconsistent with the observed market behaviour (such as the leverage effect, as
discussed in section 3.3). A plot of the empirical daily volatility of the S&P 500 index
clearly shows volatility is far from constant. This consequently led to the develop-
17
ment of dynamic volatility modelling. Volatility modelling may be classified into four
categories:
1. Constant volatility σ;
2. Time dependent volatility σ(t);
3. Local volatility: volatility dependent on the stock price σ(X(t));
4. Stochastic volatility: volatility driven by an additional random process σ(ω).
These models will now be discussed in some detail in the subsequent sections.
It is worth mentioning that some models do not use Wiener processes to capture
price movements. For example, Madan and Seneta [MS90] propose a Variance-Gamma
model where price movements are purely governed by a discontinuous jump process.
Carr et al. discuss various jump processes in [CGMY02] such as the pure jump process
of Cox and Ross [CR76]. The advantages of pure jump models are: firstly, they real-
istically reflect the fact that trading occurs discontinuously, secondly price jumps are
consistent with information releases according to Fama’s “Efficient Market Hypothesis”
[Fam65] and finally, stock price processes jump even on an intraday time scale [CT04].
Merton [Mer73] was the first to propose a formula for pricing options under time
dependent volatility. The option price associated with X is still calculated by the
standard Black-Scholes formula (equation (21)) except we set σ = σc where:
s
Z T
1
σc = σ 2 (τ )dτ , (67)
T −t t
18
The equation (67) converts σ(t) to its constant volatility equivalent σc over time period
t to T. The distribution of X(t) is given by:
1 2 2
log(XT /Xt ) ∼ N (µ − σc )(T − t), σc (T − t) . (70)
2
Note that the constant volatility σc changes in value as t and T change. This property
enables time dependent volatility to account for empirically observed implied volatilities
increasing with time (for a given strike).
The term “local” arises from knowing volatility with certainty “locally” or when X is
known -for a stochastic volatility model we never know the volatility with certainty.
The advantages of local volatility models are that firstly, no additional (or un-
tradable) source of randomness is introduced into the model. Hence the models are
complete, unlike stochastic volatility models. It is theoretically possible to perfectly
hedge contingent claims. Secondly, local volatility models can be also calibrated to per-
fectly fit empirically observed implied volatility surfaces, enabling consistent pricing of
the derivatives (an example is given in [Dup97]). Thirdly, the local volatility model is
able to account for a greater degree of empirical observations and theoretical arguments
on volatility than time dependent volatility (for instance the leverage effect). We will
now look at some common local volatility models.
19
extreme models. Additionally, n captures the level of leverage effect since volatility
increases as stock price decreases, as can be seen from equation (73).
The CEV model is analytically tractable, which is in contrast to the local volatility
model in section 4.2.4 where numerical solutions for derivatives become analytically
intractable [W+ 98]. Additionally, by appropriate choices of a and n we can fit CEV to
volatility smiles (see for instance Beckers [Bec80]).
The CEV model has been developed over the years giving various CEV modified
models. For instance the square root CEV model [CR76], which is similar to the
CIR interest rate model [CIJR85], Schroder [Sch89a] re-expresses the CEV model in
terms of a chi-squared distribution, enabling derivation of closed form solutions. Hsu
et al. [HLL08] determine the CEV model’s probability density function while Lo et al.
[LYH00] derive the option pricing formula for CEV with time dependent parameters.
The term w̃i (t) is a weighting for each component i at a point in time t, pei (t, X) denotes
the probability density of component i at a specific point in time for a specific stock
price X(t). We assume each pei (t, X) has the same mean µ but different variances σi2 (t).
Brigo and Mercurio then proved that the stock price follows the process:
The mixture distribution model has been developed further by Brigo et al. (see
for instance [BM02],[BMS03], [BMRS04]). Alexander has also developed the mix-
20
ture distribution model; Alexander has applied it to the areas of stress testing portfo-
lios [AS08], combining it with GARCH processes [AL06] and bivariate option pricing
[AS04]. GARCH processes will be covered in more detail in section 4.4.
The mixture distribution models have been used to model more complicated volatil-
ity models due to their ability to capture a variety of distributions. For example, Leisen
[Lei05] shows how a mixture distribution model approximates Merton’s jump diffusion
model (to be covered in section 4.4) amongst others, Lewis [Lew02] shows how mixture
models can approximate stochastic volatility models.
where p(X(T)) is the risk neutral probability density function for X(T). Breeden and
Litzenberger [BL78] then showed from equation (79) that the risk neutral cumulative
distribution function F (·) at K is:
∂C
= −e−rT F (X(T ) ≥ K). (80)
∂K
Furthermore, the risk neutral probability density function p(X(T)=K) is
∂ 2 C rT
e = p(X(T ) = K). (81)
∂K 2
Hence we can recover the risk neutral density p(X(T)) from option data. This proba-
bility can be interpreted as the current view of the future outcome of the stock price.
Dupire [Dup94] then showed by applying p(X(T)) from equation (81) (obtained
from Breeden and Litzenberger’s work) to the Fokker-Planck equation, one could obtain
Dupire’s equation:
∂C X 2 ∂2C ∂C
= σ 2 (X, T ). . 2
− (r − D)X. − DC, (82)
∂T 2 ∂X ∂X
where D is the dividend. Rearranging equation (82) gives:
v
u ∂C ∂C
u + (r − D)X + DC
u
σ(X, T ) = u ∂T ∂X . (83)
t X 2 ∂2C
2 ∂X 2
21
Therefore the local volatility σ(X, T ) can be fully extracted from option data.
It can be seen from equation (83) that calculating σ requires partial differentials
with respect to T and K. We therefore require a continuous set of options data for
all K and T. This is highly unrealistic and quoted option prices tend to suffer from
significant illiquidity effects, affecting option bid-ask spreads [Nor03]. Furthermore,
Pinder [Pin03] shows that option bid-ask spreads are related to volatility, expiry and
trading volume. Since option data is discrete we require some interpolation method
to convert it to continuous data, for example Monteiro et al. [MTV08] apply a cubic
spline method. However, Wilmott [Wil00] states that local volatility computation is
highly sensitive to interpolation methods.
Dupire’s model implicitly assumes the options data contains all the information on
the underlying’s volatility if we calibrate to options data alone. However there is evi-
dence to show historic and implied volatility differ significantly [CP98]. Furthermore,
calibrating the local volatility surface to the options data tends to be unstable with
time, since the surface significantly changes from one week to another [DFW98].
Numerical computation of local volatility has been implemented by Andersen and
Brotherton-Ratcliffe using a finite difference method [ABR98]. Derman and Kani
([DKC96], [DKZ96]) and Rubinstein [Rub94] determine local volatilities by fitting a
unique binomial tree to the observed option prices. Tree fitting also has the computa-
tional advantage of not being affected by different interpolation methods.
22
only be positive. The Wiener processes have instantaneous correlation ρ ∈ [−1, 1] de-
fined by:
corr(dW1(t), dW2 (t)) = ρdt.
Empirically ρ tends to be negative in equity markets due to the leverage effect (see
section 3.3) but close to 0 in the currency markets. Although ρ can be a function of
time we assume it is a constant throughout this thesis.
The key difference between local and stochastic volatility is that local volatility is
not driven by a random process of its own; there exists only one source of randomness
(dW1 ). In stochastic volatility models, volatility has its own source of randomness
(dW2 ) making volatility intrinsically stochastic. We can therefore never definitely de-
termine the volatility’s value, unlike in local volatility.
The key advantages of stochastic volatility models are that they capture a richer
set of empirical characteristics compared to other volatility models [MR05]. Firstly,
stochastic volatility models generate return distributions similar to what is empirically
observed. For example, the return distribution has a fatter left tail and peakedness
compared to normal distributions, with tail asymmetry controlled by ρ [Dur07]. Sec-
ondly, Renault and Touzi [RT96] proved volatility that is stochastic and ρ=0 always
produces implied volatilities that smile (note that volatility smiles do not necessarily
imply volatility is stochastic).
Thirdly, historic volatility shows significantly higher variability than would be ex-
pected from local or time dependent volatility, which could be better explained by
a stochastic process. A particular case in point is the dramatic change in volatility
during the 1987 October crash (Schwert [Sch90] gives an empirical study on this).
Finally, stochastic volatility accounts for the volatility’s empirical dependence on the
time scale measured (as discussed in section 3.3), which should not occur under local
or time dependent volatility.
The disadvantages of stochastic volatility are firstly that these models introduce
a non-tradable source of randomness, hence the market is no longer complete and
we can no longer uniquely price options or perfectly hedge. Therefore the practical
applications of stochastic volatility are limited. Secondly stochastic volatility models
tend to be analytically less tractable. In fact, it is common for stochastic volatility
models to have no closed form solutions for option prices. Consequently option prices
can only be calculated by simulation (for example Scott’s model in section 4.3.3).
23
to their tractability, theoretical or empirical appeal and we can categorise stochastic
volatility models according to them. Many stochastic volatility models favour a mean
reverting driving process. A mean reverting stochastic volatility process is of the form
[FPS00a]:
σ = f (Y ), (85)
dY = α(m − Y )dt + βdW2, (86)
where:
• β ≥ 0 and β is a constant;
Mean reversion is the tendency for a process to revert around its long run mean value.
We can economically account for the existence of mean reversion through the cob-
web theorem, which claims prices mean revert due to lags in supply and demand
[LH92]. The inclusion of mean reversion (α) within volatility is important, in particu-
lar, it controls the degree of volatility clustering (burstiness) if all other parameters are
unchanged. Volatility clustering is an important empirical characteristic of many eco-
nomic or financial time series [Eng82], which neither local nor time dependent volatility
1
models necessarily capture. Additionally, a high α
can be thought of as the time re-
quired to decorrelate or “forget” its previous value.
The equation (86) is an Ornstein-Uhlenbeck process in Y with known solution:
Z t
−αt
Y (t) = m + (Y (0) − m)e +β e−αt dW2 , (87)
0
Note that alternative processes to equation (86) could have been proposed to define
volatility as a mean reverting stochastic volatility model, for example the Feller or
Cox-Ingersoll-Ross (CIR) process [FPS00b]:
√
dY = α(m − Y )dt + β Y dW2 . (89)
24
4.3.3. Significant Stochastic Volatility Models
There is no generally accepted canonical stochastic volatility model and a large
number of them exist, therefore we review here the most significant ones.
where corr(dW1(t), dW2 (t)) = ρdt. A Monte Carlo method is proposed to deter-
mine the price of options under the stochastic volatility process by risk neutral
valuation. Johnson’s and Shanno’s computational results show that their option
prices are consistent with what is empirically observed (see section 3.3), that is
they exhibit a volatility smile and an increase in value with expiry.
2. Scott Model
Scott in 1987 [Sco87] considered the case where one assumes a geometric process
for stock prices and an Ornstein-Uhlenbeck process for the volatility:
25
where σe2 is a random variable. The option price is computed using the standard
p
Black-Scholes formula, under a risk neutral measure γ, with volatility σe2 . In
other words:
p
C(t, X, K, T, σ(ω)) = E [CBS (t, X, K, T, σe2 )].
γ
(97)
The option pricing equation (97) provides results consistent with empirically
observed currency options, where empirically ρ ≃ 0. Furthermore equation (97)
is still valid for any stochastic volatility process provided ρ = 0 [FPS00a]. For
correlated volatility, option prices are obtained using Monte Carlo simulation.
4. Stein and Stein Model
Stein and Stein in 1991 [SS91] proposed an Ornstein-Uhlenbeck process for volatil-
ity based on tractability and empirical considerations:
where ρ = 0. Note that although equation (99) implies volatility can be negative,
Stein and Stein state that only σ 2 is ever applied in calculation [SS91]. A closed
form solution to option prices is derived for particular choices of risk neutral
measures, which is in contrast to Johnson and Shanno [JS87] and Scott [Sco87],
who only provide numerical methods to option pricing.
5. Heston model
Heston’s model [Hes93] created in 1993 stands out from other stochastic volatility
models because there exists an analytical solution for European options that takes
in account correlation between dW1 and dW2 (although this requires assumptions
on the risk neutral measure):
The dσ 2 modelling process originated from CIR interest rate model [CIJR85].
To price an option under risk neutral valuation, we must specify a risk neutral
measure (due to market incompleteness) and this is chosen on economic justifica-
tions [Hes93]. Heston then finds an analytical solution for options using Fourier
transforms; the reader is referred to Heston [Hes93] and Musiela and Rutkowski
[MR05] for a derivation.
Due to the existence of an analytic solution to option pricing the Heston model
has been subsequently developed by various researchers. For example, Scott in-
26
cludes stochastic interest rates [Sco97], Pan includes stochastic dividends [Pan02]
and Bates adds jumps to the stochastic process [Bat96].
where
A Poisson process counts the number of events that occur in a given period. We define
a Poisson process P(ν) as [LM66]:
e−νt (νt)k
p(X = k) = , k = 0, 1, 2, .... (103)
k!
where
• ν is called the rate parameter, where ν is the expected number of events per unit
time;
• p(X=k) is the probability that the random number of events from 0 to time t
equals k.
In the case of the jump diffusion model the events are the jumps themselves. To obtain
a closed form solution for option prices under the jump diffusion model, Merton makes
a key argument as follows. We know that a replicating portfolio constructed in the
derivation of the original Black-Scholes equation ([BS73]) will eliminate the risk arising
from the GBM and so must earn the risk free rate of return. Next Merton assumes the
jump process represents nonsystematic risk and risk that is not priced into the market.
Therefore the same replicating portfolio applied in the Black-Scholes equation must
27
earn the risk free rate of return. Merton then proved that the option price is the sum
of an infinite series of options, valued by the standard Black-Scholes option pricing
equation without the jumps.
∞
X ′
e−θ τ (θ′ τ )n
C(X(t), K, t, T, r, σ) = CBS (X(t), K, t, T, rn , σn ),
n=0
n!
where
τ = T − t,
θ′ = θ(1 + k),
nln(1 + k)
rn = r − θk + ,
τ
nς 2
σn2 = σ 2 + .
τ
The nth option in the series is valued by the standard Black-Scholes equation, CBS ,
assuming n jumps have occurred before expiry. Since the series converges exponentially
it can be implemented computationally [CT04]. The term ς 2 arises from the fact that
a Poisson process can be approximated by a normal distribution. Therefore we can
assume the logarithm of the jumps are normally distributed with variance ς 2 . The
jumps fatten the return distribution’s tail, therefore the model is more consistent with
empirically observed distributions compared to GBM. Merton accounts for jumps as the
arrival of new information that have more than a marginal effect on price movements.
Another class of volatility models is to use a stochastic process to model the evo-
lution of the implied volatility directly. In all the models discussed so far, we have as-
sumed implied volatility and the underyling’s volatility are the same. Implied volatility
methods model an option’s (or any derivative’s) implied volatility σ ∗ separately from
the underlying’s volatility σ. One significant implied volatility model is Schonbucher’s
stochastic implied volatility model [Sch99]. Schonbucher models the implied volatility
σ ∗ of a vanilla option as a SDE and the underlying’s SDE separately:
Another class of volatility models is the lattice approach, with its origins from bi-
nomial trees by Cox et al. [CRR79]. Britten-Jones and Neuberger [BJN00] model
stochastic volatility by fitting a lattice model that is consistent with observed option
prices. The model is parameterised by up and down price movements and their asso-
ciated probabilities for each branch, using empirical option data as its input. This is
28
an improvement on Derman’s and Kani’s model, who fit a tree to local volatility only.
Britten-Jones and Neuberger also show how a variety of stochastic volatility models
(such as regime switching volatility) can be fitted to be consistent with observed option
price data.
Discrete time volatility models are another class of models that exist, such as
GARCH(p,q) [Bol86], the generalised autoregressive conditional heteroscedasticity model:
q p
X X
σt 2 = a0 + ai ε2t−i + 2
bi σt−i , for εt ∼ N (0, σt2), (106)
i=1 i=1
where ai and bi are weighting constants. We obtain the ARCH(q) [Eng82] model by
setting p=0. Due to the success of GARCH in econometrics it has been substan-
tially extended by various researchers, for example Nelson [Nel91], Sentana [Sen95]
and Zakoian [Zak94]. Whereas with continuous time modelling one is able to derive
closed form solutions, which can reduce computation and provide new insights, this is
generally not possible with discrete time models.
Finally, one class of volatility models are “hybrid” models; combining various
volatility models into one. For instance Alexander [Ale04] extends Brigo’s and Mer-
curio’s mixture model [BM00] by combining it with the binomial tree of Cox et al.
[CRR79] to incorporate stochastic volatility. The SABR model [HKLW02] is a stochas-
tic extension of the CEV model [CR76], where SABR is an abbreviation for stochastic
alpha, beta and rho in its equations. The SABR model captures the dynamics of a
forward price F of some asset (e.g. stock) under stochastic volatility; its risk neutral
dynamics under measure Q is:
A common shortfall in all the volatility models reviewed so far has been that all
these models are short term volatility models. The models implicitly ignore any long
term or broader economic factors influencing the volatility model, which is empirically
unrealistic and theoretically inconsistent. Furthermore, although some models may
specify a closed form solution for option pricing, they provide no method or recom-
mendation for calibration, which is important to modelling and option pricing.
5. Conclusions
This paper has surveyed the key volatility models and developments, highlighting
the innovation associated with each new class of volatility models. In conclusion it can
29
be seen from our review of volatility models that the development of has progressed in
a logical order to address key shortcomings of previous models.
Time dependent models addressed option prices varying with expiration dates, local
volatility also addressed volatility smiles and the leverage effect, whereas stochastic
volatility could incorporate all the effects captured by local volatility and a range of
other empirical effects e.g. greater variability in observed volatility. However the trade-
off associated with improved volatility modelling has been at the expense of analytical
tractability.
30
References
[Bat96] D.S. Bates. Jumps and stochastic volatility: exchange rate processes im-
plicit in deutsche mark options. Review of Financial Studies, 9(1):69–107,
1996.
[Bat00] D.S. Bates. Post-87 crash fears in the S&P 500 futures option market.
Journal of Econometrics, 94(1-2):181–238, 2000.
[Bau03] D. Baur. Testing for contagion mean and volatility contagion. Journal of
Multinational Financial Management, 13(4-5):405–422, 2003.
[Bec80] S. Beckers. The Constant Elasticity of Variance Model and Its Implications
For Option Pricing. The Journal of Finance, 35(3):661–673, 1980.
31
[Bjö04] T. Björk. Arbitrage Theory in Continuous Time. Oxford University Press,
2004.
[Bla90] David Blake. Financial Market Analysis. Oxford University Press, 1990.
[BS73] F. Black and M. Scholes. The Pricing of Options and Corporate Liabilities.
The Journal of Political Economy, 81(3):637–654, 1973.
[CCW08] C. Cao, E.C. Chang, and Y. Wang. An empirical analysis of the dynamic
relationship between mutual fund flow and market return volatility. Journal
of Banking and Finance, 32(10):2111–2123, 2008.
32
[CGMY02] P. Carr, H. Geman, D.B. Madan, and M. Yor. The Fine Structure of Asset
Returns: An Empirical Investigation. The Journal of Business, 75(2):305–
332, 2002.
[CIJR85] J.C. Cox, J.E. Ingersoll Jr, and S.A. Ross. A Theory of the Term Structure
of Interest Rates. Econometrica, 53(2):385–407, 1985.
[Con01] R. Cont. Empirical properties of asset returns: stylized facts and statistical
issues. Quantitative Finance, 1(2):223–236, 2001.
[CP98] B.J. Christensen and N.R. Prabhala. The Relation Between Implied and
Realized Volatility. Journal of Financial Economics, 50(2):125–150, 1998.
[CR76] J.C. Cox and S.A. Ross. The Valuation of Options for Alternative Stochas-
tic Processes. Journal of Financial Economics, 3(1), 1976.
[CRR79] J.C. Cox, S.A. Ross, and M. Rubinstein. Option Pricing: A Simplified
Approach. Journal of Financial Economics, 7(3):229–263, 1979.
[CT04] R. Cont and P. Tankov. Financial Modelling with Jump Processes. CRC
Press, 2004.
[DKC96] E. Derman, I. Kani, and N. Chriss. Implied trinomial trees of the volatility
smile. Journal of Derivatives, 3(4):7–22, 1996.
[DKZ96] E. Derman, I. Kani, and J.Z. Zou. The Local Volatility Surface: Unlock-
ing the Information in Index Option Prices. Financial Analysts Journal,
52(4):25–36, 1996.
[Duf01] D. Duffie. Dynamic asset pricing theory. Princeton University Press, 2001.
33
[Dup97] B. Dupire. Pricing and hedging with smiles. Mathematics of Derivative
Securities, 1(1):103–111, 1997.
[Dur07] G.B. Durham. SV mixture models with application to S&P 500 index
returns. Journal of Financial Economics, 85(3):822–856, 2007.
[EH81] B.S. Everitt and D.J. Hand. Finite mixture distributions. Monographs on
Applied Probability and Statistics, London: Chapman and Hall, 1981.
[Fam65] E.F. Fama. The behaviour of stock market prices. Journal of Business,
38(1):34–105, 1965.
[Fre80] K.R. French. Stock Returns and the Weekend Effect. Journal of Financial
Economics, 8(1):55–69, 1980.
[Hes93] S.L. Heston. A Closed-Form Solution for Options with Stochastic Volatility
with Applications to Bond and Currency Options. Review of Financial
Studies, 6(2):327–43, 1993.
[HKLW02] P.S. Hagan, D. Kumar, A.S. Lesniewski, and D.E. Woodward. Managing
smile risk. Wilmott Magazine, 1(1):84–108, 2002.
34
[HLL08] Y.L. Hsu, T.I. Lin, and C.F. Lee. Constant elasticity of variance (CEV)
option pricing model: Integration and detailed derivation. Mathematics
and Computers in Simulation, 79(1):60–71, 2008.
[HW87] J. Hull and A. White. The Pricing of Options on Assets with Stochastic
Volatilities. The Journal of Finance, 42(2):281–300, 1987.
[JS87] H. Johnson and D. Shanno. Option Pricing when the Variance is Changing.
The Journal of Financial and Quantitative Analysis, 22(2):143–151, 1987.
[JT96] R.A. Jarrow and S.M. Turnbull. Derivative securities. South-Western Col-
lege Pub., 1996.
[Lei05] D.P.J. Leisen. Mixed Lognormal Distributions for Derivatives Pricing. Mod-
elling, Simulation, and Optimization: Fifth IASTED International Confer-
ence Proceedings, 2005.
[Lew02] A.L. Lewis. The Mixing Approach To Stochastic Volatility and Jump Mod-
els. Working Paper, 2002.
[LH92] R.G. Lipsey and C.D. Harbury. First Principles of Economics. Oxford
University Press, 1992.
[LYH00] C.F. Lo, P.H. Yuen, and C.H. Hui. Constant Elasticity of Variance Option
Pricing Model With Time-Dependent Parameters. International Journal
of Theoretical and Applied Finance, 3(4):661–674, 2000.
35
[Mer73] R.C. Merton. Theory of rational option pricing. Bell Journal of Economics
and Management Science, 4(1):141–183, 1973.
[Mer76] R.C. Merton. Option pricing when underlying stock returns are discontin-
uous. Journal of Financial Economics, 3(1-2):125–144, 1976.
[MS90] D.B. Madan and E. Seneta. The Variance Gamma (VG) Model for Share
Market Returns. The Journal of Business, 63(4):511–524, 1990.
[MT97] W.R. Melick and C.P. Thomas. Recovering an Assets Implied PDF from
Option Prices: An Application to Crude Oil during the Gulf Crisis. Journal
of Financial and Quantitative Analysis, 32(1):91–115, 1997.
[MTV08] A.M. Monteiro, R.H. Tütüncü, and L.N. Vicente. Recovering risk-
neutral probability density functions from options prices using cubic splines
and ensuring nonnegativity. European Journal of Operational Research,
187(2):525–542, 2008.
[Nor03] L. Nordén. Asymmetric option price distribution and bid–ask quotes: con-
sequences for implied volatility smiles. Journal of Multinational Financial
Management, 13(4-5):423–441, 2003.
[Osb59] M.F.M. Osborne. Brownian Motion in the Stock Market. Operations Re-
search, 7(2):145–173, 1959.
[Pan02] J. Pan. The jump-risk premia implicit in options: evidence from an in-
tegrated time-series study. Journal of Financial Economics, 63(1):3–50,
2002.
36
[Pin03] S. Pinder. An empirical examination of the impact of market microstruc-
ture changes on the determinants of option bid-ask spreads. International
Review of Financial Analysis, 12(5):563–577, 2003.
[Reb04] R. Rebonato. Volatility and Correlation: The Perfect Hedger and the Fox.
Wiley & Sons, Chichester, UK, 2004.
[Ros96] P.E. Rossi. Modelling Stock Market Volatility: Bridging the Gap to Con-
tinuous Time. Academic Press, 1996.
[Sam65] P.A. Samuelson. Proof that properly anticipated prices fluctuate randomly.
Industrial Management Review, 6(2):41–49, 1965.
[Sch89b] G.W. Schwert. Why Does Stock Volatility Change Over Time? Journal of
Finance, 44(5):1115–1153, 1989.
[Sch90] G.W. Schwert. Stock volatility and the crash of’87. Review of Financial
Studies, 3(1):77–102, 1990.
[Sch99] P.J. Schonbucher. A Market Model for Stochastic Implied Volatility. Philo-
sophical Transactions: Mathematical, Physical and Engineering Sciences,
357(1758):2071–2092, 1999.
[Sco87] L.O. Scott. Option Pricing when the Variance Changes Randomly: Theory,
Estimation, and an Application. The Journal of Financial and Quantitative
Analysis, 22(4):419–438, 1987.
37
[Sco97] L.O. Scott. Pricing Stock Options in a Jump Diffusion Model with Stochas-
tic Volatility and Interest Rates: Applications of Fourier Inversion Meth-
ods. Mathematical Finance, 7(4):413–426, 1997.
[SS91] E. Stein and J. Stein. Stock price distributions with stochastic volatility:
an analytic approach. Review of Financial Studies, 4(4):727–752, 1991.
[Sto99] L.A. Stout. Why the Law Hates Speculators: Regulation and Private Or-
dering in the Market for OTC Derivatives. Duke Law Journal, 48(4):701–
786, 1999.
[SW01] D.P. Simon and R.A. Wiggins. S & P futures returns and contrary senti-
ment indicators. Journal of Futures Markets, 21(5):447–462, 2001.
[W+ 98] P. Wilmott et al. Derivatives: the theory and practice of financial engi-
neering. John Wiley & Sons, 1998.
38