Stochan
Stochan
and H. Maassen
Mathematical Institute
University of Nijmegen
Toernooiveld 1, 6525 ED Nijmegen
The Netherlands
1
Contents
1 Introduction 3
2 Brownian motion 7
2.1 Construction of Brownian Motion 7
2.2 Non-smoothness of paths 11
2.3 More Brownian motion 14
3 The Itô-integral 16
3.1 Step functions 17
3.2 Arbitrary functions 19
3.3 Martingales 22
3.4 Continuity of paths 23
4 Stochastic integrals and the Itô-formula 25
4.1 The one-dimensional Itô-formula 25
4.2 Some examples 27
4.3 The multi-dimensional Itô-formula 28
4.4 Local times of Brownian motion 30
5 The Martingale Representation Theorem 33
6 Stochastic differential equations 38
6.1 Strong solutions 38
6.2 Weak solutions 41
7 Itô-diffusions and one-parameter semigroups 43
7.1 Introduction and motivation 43
7.2 Basic properties 43
7.3 Generalities on generators 47
7.4 Applications 49
8 Transformations of diffusions 51
8.1 The Feynman-Kac formula 51
8.2 The Cameron-Martin formula 52
8.3 Killing and drift 54
9 The Black and Scholes option pricing formula. 56
9.1 Stocks, bonds and stock options 56
9.2 The martingale case 57
9.3 The effect of stock trading: the case μ = 0 58
9.4 Motivation 59
9.5 Results 60
9.6 Inclusion of the interest rate: r = 0 61
2
1 Introduction
In stochastic analysis one studies random functions of one variable and various kinds of
integrals and derivatives thereof. The argument of these functions is usually interpreted
as ‘time’, so the functions themselves can be thought of as the path of a random process.
Here, like in other areas of mathematics, going from the discrete to the continuous yields
a pay-off in simplicity and smoothness, at the price
t of a formally more complicated analy-
n
sis. Compare, to make an analogy, the integral 0 x 3 dx with he sum k=1 k3 . The integral
requires a more refined analysis for its definition and its properties, but once this has
been done the integral is easier to calculate. Similarly, in stochastic analysis you will be-
come acquainted with a convenient differential calculus as a reward for some hard work
in analysis.
Stochastic analysis can be applied in a wide variety of situations. We sketch a few exam-
ples below.
1. Some differential equations become more realistic when we allow some random-
ness in their coefficients. Consider for example the following growth equation, used
among other places in population biology:
d
St = (r + “Nt ”)St . (1)
dt
Here, St is the size of the population at time t, r is the average growth rate of the
population, and the “noise” Nt models random fluctuations in the growth rate.
2. At time t = 0 an investor buys stocks and bonds on the financial market, i.e., he
divides his initial capital C0 into A0 shares of stock and B0 shares of bonds. The
bonds will yield a guaranteed interest rate r . If we assume that the stock price St
satisfies the growth equation (1), then his capital Ct at time t is
Ct = At St + Bt er t , (2)
where At and Bt are the amounts of stocks and bonds held at time t. With a keen eye
on the market the investor sells stocks to buy bonds and vice versa. If his tradings
are ‘self-financing’, then dCt = At dSt + Bt d(er t ). An interesting question is:
- What would he be prepared to pay for a so-called European call option, i.e.,
the right (bought at time 0) to purchase at time T > 0 a share of stock at a
predetermined price K?
The rational answer, q say, was found by Black and Scholes (1973) through an anal-
ysis of the possible strategies leading from an initial investment q to a payoff CT .
Their formula is being used on the stock markets all over the world.
3. The Langevin equation describes the behaviour of a dust particle suspended in a
fluid:
d
m Vt = −ηVt + “Nt ”. (3)
dt
Here, Vt is the velocity at time t of the dust particle, the friction exerted on the
particle due to the viscosity η of the fluid is −ηVt , and the “noise” Nt stands for the
disturbance due to the thermal motion of the surrounding fluid molecules colliding
with the particle.
3
4. The path of the dust particle in example 3 is observed with some inaccuracy. One
measures the perturbed signal Z(t) given by
Zt = Vt + “Ñt ”. (4)
Here Ñt is again a “noise”. One is interested in the best guess for the actual value
of Vt , given the observation Zs for 0 ≤ s ≤ t. This is called a filtering problem: how
to filter away the noise Ñt . Kalman and Bucy (1961) found a linear algorithm, which
was almost immediately applied in aerospace engineering. Filtering theory is now a
flourishing and extremely useful discipline.
5. Stochastic analysis can help solve boundary value problems such as the Dirichlet
problem. If the value of a harmonic function f on the boundary of some bounded
regular region D ⊂ Rn is known, then one can express the value of f in the interior
of D as follows:
E f Bτx = f (x), (5)
t
where Btx := x + 0 Nt dt is an “integrated noise” or Brownian motion, starting at x,
and τ denotes the time when this Brownian motion first reaches the boundary. (A
harmonic function f is a function satisfying Δf = 0 with Δ the Laplacian.)
The goal of this course is to make sense of the above equations, and to work with them.
In all the above examples the unexplained symbol Nt occurs, which is to be thought of
as a “completely random” function of t, in other words, the continuous time analogue of
a sequence of independent identically distributed random variables. In a first attempt to
catch this concept, let us formulate the following requirements:
1. Nt is independent of Ns for t ≠ s;
2. The random variables Nt (t ≥ 0) all have the same probability distribution μ;
3. E (Nt ) = 0.
However, when taken literally these requirements do not produce what we want. This is
seen by the following argument. By requirement 1 we have for every point in time an
independent value of Nt . We shall show that such a “continuous i.i.d. sequence” Nt is not
measurable in t, unless it is identically 0.
Let μ denote the probability distribution of Nt , which by requirement 2 does not depend
on t, i.e., μ([a, b]) := P[a ≤ Nt ≤ b]. Divide R into two half lines, one extending from a
to −∞ and the other extending from a to ∞. If Nt is not a constant function of t, then
there must be a value of a such that each of the half lines has positive measure. So
Now consider the set of time points where the noise Nt is low: E := { t ≥ 0 : Nt ≤ a }.
It can be shown that with probability 1 the set E is not Lebesgue measurable. Without
giving a full proof we can understand this as follows. Let λ denote the Lebesgue measure
on R. If E would be measurable, then by requirement 1 and Eq. (6) it would be reasonable
to expect its relative share in any interval (c, d) to be p, i.e.,
4
On the other hand, it is known from measure theory that every measurable set E is
arbitrarily thick somewhere with respect to the Lebesgue measure λ, i.e., for all α < 1 an
interval (c, d) can be found such that
(cf. Halmos (1974) Th. III.16.A). This clearly contradicts Eq. (7), so E is not measurable.
This is a bad property of Nt : for, in view of (1), (3), (4) and (5), we would like to integrate
Nt .
For this reason, let us approach the problem from another angle. Instead of Nt , let us
consider the integral of Nt , and give it a name:
t
Bt := Ns ds.
0
The three requirements on the evasive object Nt then translate into three quite sensible
requirements for Bt .
BM1. For 0 = t0 ≤ t1 ≤ · · · ≤ tn the random variables Btj+1 − Btj (j = 0, . . . , n − 1) are
independent;
BM2. Bt has stationary increments, i.e., the joint probability distribution of
Bt1 +s − Bu1 +s , Bt2 +s − Bu2 +s , . . . , Btn +s − Bun +s
Excercise. 1.1 Show that BM5 implies the following: For any ε > 0
as n → ∞.
Exercise 1.1 helps us to specify the increments of Brownian motion in the following way.
Excercise. 1.2 Suppose BM1, BM2, BM4 and (8) hold. Apply the Central Limit Theorem
(Lindeberg’s condition) to
Xn,k := B kt − B (k−1)t
n n
5
and conclude that Bt − Bs , s < t has a normal distribution with variance t − s, i.e.
1 x2
P (Bs+t − Bs ∈ A) = √ e− 2t dx.
2π t A
Introducing
BM 2’. If s, t ≥ 0 then
1 x2
P (Bs+t − Bs ∈ A) = √ e− 2t dx.
2π t A
6
2 Brownian motion
t−s t−s
P [Bs ∈ dx, Bθ ∈ dy, Bt ∈ dz] = p(s, 0, x)p( , x, y)p( , y, z)dx dy dz
2 2
1 (y −μ)2
−
= p(s, 0, x)p(t − s, x, z) · √ e 2σ 2 dx dy dz
σ 2π
we obtain
1 (y −μ)2
−
P [Bθ ∈ dy|Bs ∈ dx, Bt ∈ dz] = √ e 2σ 2 dy,
σ 2π
7
which is our claim.
This suggests that we might be able to construct Brownian motion on [0, 1] by interpola-
tion.
(n)
To carry out this program, we begin with a sequence {ξk , k ∈ I(n), n ∈ N0 } of indepen-
dent, standard normal random variables on some probability space (Ω, F , P ). Here
denotes the set of odd, positive integers less than 2n . For each n ∈ N0 we define a process
(n)
B (n) := {Bt : 0 ≤ t ≤ 1} by recursion and linear interpolation of the preceeding process,
(n) (n−1)
as follows. For n ∈ N, Bk/2n−1 will agree with Bk/2n−1 , for all k = 0, 1, . . . , 2n−1 . Thus for
(n)
each n we only need to specify the values of Bk/2n for k ∈ I(n). We start with
(n)
We shall show that, almost surely, Bt converges uniformly in t to a continuous function
Bt (as n → ∞) and that Bt is a Brownian motion.
We start with giving a more convenient representation of the processes B (n) , n = 0, 1, . . . .
We define the following Haar functions by H10 (t) ≡ 1, and for n ∈ N, k ∈ I(n)
⎧
⎪
⎪ (n−1)/2 , k−1 k
⎨ 2 2n ≤ t < 2n
(n) k k+1
Hk (t) := −2(n−1)/2 , 2n ≤ t < 2n
⎪
⎪
⎩ 0 otherwise.
(0) (n)
Note that S1 (t) = t, and that for n ≥ 1 the graphs of Sk are little tents of height
2−(n+1)/2 centered at k/2n and non overlapping for different values of k ∈ I(n). Clearly,
(0) (0) (0)
Bt = ξ1 S1 (t), and by induction on n, it is readily verified that
(n)
n (m) (m)
Bt (ω) = ξk (ω)Sk (t), 0 ≤ t ≤ 1, n ∈ N. (9)
m=0 k∈I(m)
(n)
Lemma 2.1 As n → ∞, the sequence of functions {Bt (ω), 0 ≤ t ≤ 1}, n ∈ N0 , given by
(9) converges uniformly in t to a continuous function {Bt (ω), 0 ≤ t ≤ 1} for almost every
ω ∈ Ω.
8
(n)
Proof. Let bn := maxk∈I(n) |ξk |. Oberserve that for x > 0 and each n, k
∞
2 2 /2
e−u
(n)
P (|ξk | > x) = du
π x
∞
2 u −u2 /2 2 1 −x 2 /2
≤ e du = e ,
π x x πx
which gives
(n) (n) 2 2n −n2 /2
P (bn > n) = P( {|ξk | > n}) ≤ 2n P (|ξ1 | > n) ≤ e ,
π n
k∈I(n)
, the Borel-Cantelli-Lemma implies that there is a set Ω̃ with P (Ω̃) = 1 such that for ω ∈ Ω̃
there is an n0 (ω) such that for all n ≥ n0 (ω) it holds true that bn (ω) ≤ n. But then
n2−(n+1)/2 < ∞;
(n) (n)
|ξk (ω)Sk (t)| ≤
n≥n0 (ω) k∈I(n) n≥n0 (ω)
(n)
so for ω ∈ Ω̃, Bt (ω) converges uniformly in t to a limit Bt . The uniformity of the
convergence implies the conitunuity of the limit Bt .
The following exercise facilitates the construction of Brownian motion substantially:
holds true.
9
Theorem 2.2 With the above notations
(n)
Bt := lim Bt
n→∞
Proof. In view of our definition of Brownian motion it suffices to prove that for 0 = t0 <
t1 . . . < tn ≤ 1, the increments (Btj − Btj−1 )j=1,... ,n are independent, normally distributed
with mean zero and variance (tj − tj−1 ). For this we will show that the Fourier √ transforms
satisfy the appropriate condition, namely that for λj ∈ R (and as usual i := −1)
n
n
1
E exp i λj (Btj − Btj−1 ) = exp − λ2j (tj − tj−1 ) . (12)
2
j=1 j=1
To derive (12) it is most natural to exploit the construction of Bt form Gaussian random
(n)
variables. Set λn+1 = 0 and use the independence and normality of the ξk to compute
for M ∈ N
n
(M)
E exp −i (λj+1 − λj )Btj
j=1
M
n
(m) (m)
= E exp −i ξk (λj+1 − λj )Sk (tj )
m=0 k∈I(m) j=1
M
n
(m) (m)
= E exp −iξk (λj+1 − λj )Sk (tj )
m=0 k∈I(m) j=1
M 1
n
(m) 2
= exp − (λj+1 − λj )Sk (tj )
m=0 k∈I(m)
2
j=1
1
n n M
(m) (m)
= exp − (λj+1 − λj )(λl+1 − λl ) Sk (tj )Sk (tl )
2 m=0
j=1 l=1 k∈I(m)
10
Then
n−1
L2 − lim (Bt(n) − Bt(n) )2 = T .
n→∞ j+1 j
j=0
(n) (n)
Proof. Abbreviate ΔBj = Bt(n) − Bt(n) and Δtj = tj+1 − tj . Let δn = maxj Δtj . Then
j+1 j
2 2
(ΔBj )2 − T = E (ΔBj )2 − T
j j
=E (ΔBi )2 (ΔBj )2 − 2T E (ΔBj )2 + T 2
i,j j
= E((ΔBj )4 ) + E((ΔBi )2 )E((ΔBj )2 ) − 2T Δtj + T 2
j i≠j j
2 2
= 3(Δtj ) + (Δti )(Δtj ) − T
j i≠j
=2 (Δtj )2
j
≤ 2δn Δtj
j
= 2δn T ,
where again we have used the
fact that the fourth moment of a centered Gaussian random
variable ξ is given by E ξ 4 = 3 Var(ξ)2 . Let n → ∞ to get the claim.
We may write the message of Lemma 2.4 symbolically as
(dBt )2 = dt,
11
saying that ‘Brownian motion has quadratic variation growing linearly with time’. This ex-
pression will acquire a precise meaning during the sequel of this course. For the moment,
let us just say that Bt has large fluctuations at a small scale, namely
dBt is of order dt dt.
To prove the infinite variation part of the above Theorem 2.3 we need one more prepara-
tory lemma, which applies to a general sequence of random variables.
∞ p
Proof. Choose a subsequence such that k=1 E(Xnk ) < ∞. By Chebyshev’s inequality
we have for all m ∈ N,
p
1 p
P Xnk ≥ ≤ m E Xnk ,
m
n−1 2
lim B(nk ) (ω) − B(n ) (ω) = T .
k→∞ tj+1 tj k
j=0
12
Then limk→∞ εnk = 0 by the uniform continuity of t
→ Bt . It follows that
k −1
n k −1
n
1 1
|ΔBj | ≥ |ΔBj |2 ∼ T →∞ as k → ∞.
j=0 j=0
εnk εnk
To derive form here that also the paths of Brownian motion are almost surely nowhere
differentiable, we need the following
Excercise. 2.2 Let (Bt )0≤t≤T be a Brownian motion on [0.T ]. Then for each c > 0 the
following stochastic process
(c · Bt/c 2 )0≤t≤T
is a Brownian motion on [0, T /c 2 ].
Now we are ready to proof that the paths of Brownian motion are almost surely nowhere
differentiable.
Proof. Let Xn,k := maxj=k,k+1,k+2 |B j − B j−1 |. For ε > 0 we have
2n 2n
where the second step follows from the above exercise. Thus for Yn := mink≤T ·2n Xn,k we
obtain
Denote
A := {ω ∈ Ω : t
→ Bt (ω) is differentiable somewhere}.
Let ω ∈ A, t
→ Bt (ω) be differentiable in t0 := t0 (ω), and let D denote its derivative.
Then there exists δ := δ(ω, t0 ) such that
1 δ
n
< , n0 > (|D| + 1) and n0 > t0 .
2 0 2
k k+1
For n ≥ n0 choose k such that 2n ≤ t0 < 2n . Then
j
|t0 − |<δ for j = k, k + 1, k + 2.
2n
Thus
1 n
Xn,k (ω) ≤ (|D| + 1) n
≤ n,
2 2
n n
and, since n > t0 > k/2n , also Yn (ω) ≤ 2n . Therefore A ⊂ An := {Yn (ω) ≤ 2n } for n
large enough and hence also
A ⊆ lim inf An .
13
But (13) implies
P (An ) ≤ n2n (2n/2+1 n2−n )3 < ∞
n n
as n → ∞, such that P (lim inf An ) = 0. Thus almost surely t → Bt (ω) is nowhere differ-
entiable.
E(Xt |Ft ) ≥ Xs
(respectively,
E(Xt |Ft ) ≤ Xs ).
We say that it is a martingale if is both, a supermartingale and a submartingale.
Excercise. 2.3 Prove that Brownian motion (Bt )0≤t≤T together with the canconical filtra-
tion
Ft ) := σ {Xs , 0 ≤ s ≤ t}
is a martingale.
Once we have construced Brownian motion in one dimension another natural question to
ask is, whether there is a multidimensional analogue to it. The following defintion seems
most natural.
14
4. P -a.s. the paths of t
→ Bt are continuous.
Note that the above definition implies that the coordinate processes of a d-dimensional
Brownian motion are a one dimensional Brownian motion.
This concludes the construction of Brownian motion. In the next sections we shall see
that Brownian motion is the building block of stochastic analysis.
It turns out that all random variables in L2 (Ω, P) can be represented in a natural way as
integrals of products of increments of Brownian motion (‘Wiener chaos expansion’). This
shows that Brownian motion is really the basic random process in L2 (Ω, P).
15
3 The Itô-integral
for some random function f which, of course, still would be a random object. The
Lebesgue-Stieltjes integral roughly follows the following idea. In the construction of the
Lebesgue (or Riemann) integral we give each interval I a weight which is equivalent to its
length |I|. Now a natural generalization of this concept is to assign a weight to I that de-
pends on its location. This can happen in the following way: Take a montonely increasing
function
G : [0, T ] → R
with G(0) = 0 and for a continuous function
f : [0, T ] → R
We now take a sequence of partitions (τν )ν∈N such that maxti ∈τν |ti+1 −ti | → 0 as ν → ∞.
The Lebesgue-Stieltjes integral of f with respect to G is then defined as
T
τ
f (t)dG(t) := lim IGν (f ).
0 ν→∞
16
It can be shown, that the Lebesgue-Stieltjes integral is well defined and may even be
extended to the case of non-increasing G, which reflects that an interval I might have
negative measure. Indeed, it turns out that the appropriate requirement is that G has
finite variation
lim |G(ti+1 ) − G(ti )| < ∞.
ν→∞
ti ∈τν
Now we have shown in Theorem 2.3 that the paths of Brownian motion have infinite vari-
ation on every time interval. Hence T the concept of Lebesgue-Stieltjes integration cannot
be simply carried over to define 0 f (t, ω)dBt (ω). We will soon see what goes wrong,
when we follow the standard ideas to define an integral, i.e. we first define the integral of
a step function and then continue by approximating “arbitrary functions” by step func-
tions. However, we shall see that f (t, ω) cannot be completely arbitrary, but has in some
way to be fitting to ω
→ Bt (ω). The construction was pioneered by K. Itô in the 1940’s.
n−1
φ(t, ω) = cj (ω)1[tj ,tj+1 ) (t)
j=0
by
T
n−1
φ(t, ω)dBt (ω) := cj (ω) Btj+1 (ω) − Btj (ω) . (14)
0
j=0
The next thing to do would be to approximate f by step functions and define f dBt to
be the limit of their stochastic integrals. But here we meet a difficulty!
n−1
φn (t, ω) := Btj (ω)1[tj ,tj+1 ) ,
j=0
n−1
ψn (t, ω) := Btj+1 (ω)1[tj ,tj+1 ) ,
j=0
where t0 , t1 , . . . , tn are defined as in Lemma 2.4 in Section 2.4 (and are not ω-dependent).
However, from our definition (14) we find that
T T
n−1
ψn dBt − φn dBt = (ΔBj )2 ,
0 0
j=0
which, according to Lemma 2.4, does not tend to 0 as n → ∞but to the constant T . In
other words, the variation of the path t
→ Bt is too large for Bt dBt to be defined in a
straightforward way.
17
We now introduce a requirement for the approximation of simple functions, and hence
also for the integrands.
We shall often abbreviate this space by L2 (B). The natural inner product that makes
L2 (B) into a real Hilbert space is
T
f , g := dt f (t, ω)g(t, ω)P(dω)
0 Ω
T
=E f (t, ·)g(t, ·)dt .
0
We note that the step functions φn in the last example are adapted, since φn (t, ω)= Btj
for t ∈ [tj , tj+1 ), so that φn (t, ω) only depends on past values of B. On the other hand,
ψn is not adapted, since at time t ∈ [tj , tj+1 ) it already anticipates the Brownian motion
at time tj+1 : ψn (t, ω)= Btj+1 (ω).
The next theorem is a crucial property of stochastic integrals of step functions.
Proposition 3.1 (The Itô-isometry) Let φ be a step function in L2 (B, [0, T ]), and let
T
I0 (φ)(ω) := φ(t, ω)dBt (ω)
0
i.e.,
2 T
T
P(dω) φ(t, ω)dBt (ω) = φ2 (t, ω)P(dω)dt.
Ω 0 Ω 0
18
Proof. By adaptedness, ci in (14) is independent of ΔBj := Btj+1 − Btj for i ≤ j. Therefore
n−1
2
I0 (φ)2 = E cj ΔBj
j=0
n−1
n−1
= E ci cj (ΔBi )(ΔBj )
i=0 j=0
n−1
= E cj2 (ΔBj )2 + 2 E ci cj (ΔBi ) E ΔBj
j=0 i<j
n−1
= E(cj2 )E (ΔBj )2
j=0
n−1
= E(cj2 )Δtj ,
j=0
where we use that E(ΔBj ) = 0, E((ΔBj )2 ) = Δtj (recall BM3-BM4 in Section 1). On the
other hand,
T n−1
2
φ2 = E cj 1[tj ,tj+1 ) (t) dt
0
j=0
T
n−1
n−1
= 1[ti ,ti+1 ) (t)1[tj ,tj+1 ) (t)dt E(ci cj )
0
i=0 j=0
n−1
= Δtj E(cj2 ).
j=0
Lemma 3.2 Every function f ∈ L2 (B, [0, T ]) can be approximated arbitrarily well by step
functions in L2 (B, [0, T ]).
On the basis of Proposition 3.1 and Lemma 3.2 we can now define the Itô-integral of a
function g ∈ L2 (B, [0, T ]) as follows. Approximate g by step functions φn ∈ L2 (B, [0,
T ]), i.e., φn → g in L2 (B, [0, T ]). Apply I0 to each of the φn . Since I0 is an isometry, the
sequence I0 φn has a limit in L2 (Ω, P). This is what we define to be the Itô-integral Ig of
g:
T
g(t, ω)dBt (ω) := (Ig)(ω) = L2 − lim (I0 φn )(ω).
0 n→∞
Proof of Lemma 3.2. We divide the proof into three steps of successive approximation.
1. Every bounded (pathwise) continuous g ∈ L2 (B) can be approximated by a se-
quence of step functions.
19
Proof. Partition the interval [0, T ] into n pieces by times (tj ) in the customary
way. Define
n−1
φn (t, ω) := g(tj , ω)1[tj ,tj+1 ) (t).
j=0
Then, since t
→ g(t, ω) is continuous and maxj |Δtj | → 0 for all ω ∈ Ω, we have
T
2
lim g(t, ω) − φn (t, ω) dt = 0.
n→∞ 0
2. Every bounded h ∈ L2 (B) can be approximated by a sequence of bounded continu-
ous functions in L2 (B).
Proof. ******* Suppose |h| ≤ M. For each n, let the “mollifier” ψn be a non-negative
continuous function of the form given in Figure 3.2, with the properties ψn (x) = 0
∞
for x ∉ [0, 1/n] and −∞ ψn (x)dx = 1. Define
t
gn (t, ω) := ψn (t − s)h(s, ω)ds.
0
Then t
→ gn (t, ω) is continuous for all ω, and |gn | ≤ M. Moreover, for all ω,
T
2
lim gn (s, ω) − h(s, ω) ds = 0,
n→∞ 0
3. Every f ∈ L2 (B) can be approximated by bounded functions in L2 (B). (This is a
general result on L2 -spaces.)
Proof. Let f ∈ L2 (B) and put hn (t, ω) := (−n) ∨ (n ∧ f (t, ω)). Then
T
f − hn 2L2 (B) ≤ dt P(dω) 1[n,∞)(|f (t, ω)|)f (t, ω)2 ,
0 Ω
20
0 1/n
FIG: the function ψn .
Here is an example of a stochastic integral.
where we use the shorthand notation Bj := Btj and ΔBj := Btj+1 − Btj . Note that Bi =
j<i ΔBj . We therefore have
2
BT2 = ΔBj
j
= (ΔBi )2 + 2 (ΔBi )(ΔBj )
i i<j
= (ΔBi )2 + 2 Bj (ΔBj )
i j
T
2
= (ΔBi ) + 2 φn (t)dBt .
0
i
21
3.3 Martingales
In section 4.4 we shall prove that the Itô-integral w.r.t. Brownian motion of an adapted
square integrable stochastic process always has a continuous version. For this we shall
need an interlude on martingales. We start with a reminder of what we already defined
in Section 2.3.
In words, Et (X)(ω) is the best estimate (in the sense of least mean square error) that can
be made of X(ω) on the basis of the knowledge of Bs (ω) for 0 ≤ s ≤ t.
Es (Mt ) = Ms for 0 ≤ s ≤ t ≤ T .
In words, a martingale is a ‘fair game’: the expected value at any time in the future is
equal to the current value. Note that Brownian motion itself is a martingale, since for
0 ≤ s ≤ t ≤ T,
Theorem 3.3 The stochastic integral of an adapted step function is a martingale with
continuous paths.
Proof. This directly follows from the fact that Brownian motion has continuous paths
and satisfies the martingale property (use the definition of the stochastic integral of a
step function given in (14) in Section 4.1).
The following powerful tool will help us prove that the Itô-integral of any process in
L2 (B) possesses a continuous version.
1
P[ sup |Mt | > λ] ≤ E(|MT |p ).
0≤t≤T λp
Proof. We may assume that E(|MT |p ) < ∞ for all s ∈ [0, T ]. Let Zt := |Mt |p . Then, since
x
→ |x|p is a convex function, Zt is sub-martingale, meaning that for all 0 ≤ s ≤ t ≤ T ,
22
It follows in particular that E(|MT |p ) < ∞ for all s ∈ [0, T ]. Let us discretise time and
first prove a discrete version of Doob’s inequality. To that end we fix n ∈ N and put
tk = kT /n. Let K(ω) denote the smallest value of k for which Ztk ≥ λp , if this occurs at
all. Otherwise, put K(ω) = ∞. Then we may write, since [K = k] ∈ Ftk ,
n
P[ max |Mtk | > λ] = P[K = k]
0≤k≤n
k=0
n
1
≤ E(1[K=k] Ztk )
λp
k=0
n
1
≤ p
E 1[K=k] E(ZT |Ftk )
k=0
λ
n
1
= E(1[K=k] ZT )
λp
k=0
1
≤ E(ZT ).
λp
Here, the second inequality uses the sub-martingale property. Now let An denote the
event An := [max0≤k≤n |Mtk | > λ]. Then we have A1 ⊂ A2 ⊂ A4 ⊂ A8 ⊂ · · · , and so,
t
→ Mt being continuous,
∞
1
P[ sup |Mtk | > λ] = P A2n = lim P(A2n ) ≤ p E(|MT |p ).
0≤t≤T n=0
n→∞ λ
Then there exists a version Jt of It with continuous paths, i.e., t → Jt (ω) is continuous
for almost all ω ∈ Ω.
Proof. The point of the proof is to turn continuity in L2 (Ω, P) into continuity of paths.
This requires some estimates.
Let φn ∈ L2 (B) be an approximation of f by step functions. Put
t
In (t, ω) = φn (s, ω)dBs (ω).
0
23
By Lemma 3.3 in Section 4.3, In is a pathwise continuous martingale for all n. The same
holds for the differences In − Im . Therefore, by the martingale inequality and the Itô-
isometry, we have
1
P[ sup |In (t) − Im (t)| > ] ≤ 2 E (In (t) − Im (t))2
0≤t≤T
1 T
= 2 E (φn (t) − φm (t))2 dt
0
1
= 2 φn − φm 2L2 (B) ,
which tends to 0 as n, m → ∞ because φn is a Cauchy sequence. We can therefore choose
an increasing sequence n1 , n2 , n3 , . . . of natural numbers such that
P[ sup |Ink+1 (t) − Ink (t)| > 2−k ] ≤ 2−k .
0≤t≤T
Hence for almost all ω there exists K(ω) such that for all k ≥ K(ω),
sup |Ink+1 (t, ω) − Ink (t, ω)| ≤ 2−k ,
0≤t≤T
The last equality uses that the orthogonal projection in L2 (Ω, P) is continuous.
t
From now on we shall always take 0 f (s, ω)dBs to mean a t-continuous version of the
integral.
We finish this section by extending Theorem 3.3 to arbitrary functions f ∈ L2 (B, [0, T ])
Proof. This follows from Theorem 3.3, the almost sure t-continuity of Mt , Doob’s mar-
tingale inequality combined with the Itô-isometry.
We have completed our construction of stochastic integrals. In the next sections we shall
investigate their main properties.
24
4 Stochastic integrals and the Itô-formula
In this chapter we shall treat the Itô-formula, a stochastic chain rule that is of great help
in the formal manipulation of stochastic integrals.
We say that a process Xt is a stochastic integral if there exist (square integrable adapted)
processes Ut , Vt ∈ L2 (B, [0, T ]) such that for all t ∈ [0, T ],
t t
Xt = X0 + Us ds + Vs dBs . (15)
0 0
The first integral on the r.h.s. is of finite variation, being pathwise differentiable almost
everywhere. The second integral is an Itô-integral and therefore a martingale. A decom-
position of a process into a martingale and a process of finite variation is called a Doob-
Meyer decomposition. Processes in L2 (B, [0, T ]) whave such a decomposition are called
‘semi-martingales’. Equation (15) is conveniently rewritten in differential form:
Example In Section 4.2 it was shown that the process Bt2 satisfies the equation
∂g ∂g 1 ∂ 2g 2
dYt = (t, Xt )dt + (t, Xt )dXt + 2 (t, Xt ) (dXt ) , (18)
∂t ∂x ∂x 2
with
1 ∂2g
Ut
∂g ∂g
= ∂t (t, Xt ) + ∂x (t, Xt ) Ut + 2 ∂x 2 (t, Xt ) Vt2
Vt
∂g
= ∂x (t, Xt ) Vt ,
25
which in its turn stands for
T T
YT = Y0 + Us ds + Vs dBs .
0 0
(The occurrence of the third term in the r.h.s. of (18) is sometimes called ‘the Itô-correction’.)
We shall prove Theorem 4.1 via the following extension of Lemma 2.4 in Section 2.4.
n−1 2 T
Atj ΔBj → At dt in L2 (Ω, P) as n → ∞.
0
j=0
∂g ∂g
(tj , Xj )ΔXj = tj , Xj Uj Δtj + Vj ΔBj + o(1)
∂x ∂x
j j
T T
∂g ∂g
→ (t, Xt ) Ut dt + (t, Xt ) Vt dBt .
0 ∂x 0 ∂x
26
The third and the fourth tend to zero. For instance, if in the fourth term we substitute
ΔXj = Uj Δtj + Vj ΔBj , then a term
∂ 2g
(tj , Xj )Vj Δtj ΔBj =: cj Δtj ΔBj
j
∂t∂x j
arises. But, because cj is Ftj -measurable and |cj | ≤ M for all j, it follows that (4.1) tends
to zero because
2
E cj Δtj ΔBj = E(cj2 )(Δtj )3 → 0.
j j
1
∂ 2g 2
1
∂ 2g 2
2 ΔX j = 2 (t j , X j ) Uj Δt j + V j ΔBj + o(1)
j
∂x 2 j
∂x 2
∂ 2g
1 2 2 2 2
= 2 (t j , X j ) Uj (Δt j ) + 2Uj V j Δt j ΔBj + V j (ΔBj ) + o(1)
∂x 2
j
T 2
1 ∂ g
→ 2 (t , Xj )Vt2 dt
2 j
0 ∂x
Example
T 2 With the help of the Itô-formula it is possible to quickly calculate an integral
like 0 Bt dBt , in much the same way as ordinary integrals are calculated: we make a guess
for the primitive, calculate its derivative, see if the guess is correct, if not then we adapt
our guess.
In the present case our guess is that we should have something like Bt3 , so we calculate
(use Theorem 4.1 with g(t, x) = x 3 , Ut ≡ 0, Vt ≡ 1):
1
⇒ Bt2 dBt = 3 d Bt3 − Bt dt
T 1 T
⇒ 0 Bt2 dBt = 3 BT3 − 0 Bt dt.
T
Example Let f be differentiable. Then the noise Nf = 0 f (t)dBt satisfies (use Theorem
4.1 with g(t, x) = f (t)x, Ut ≡ 0, Vt ≡ 1):
27
Example We want to solve the stochastic differential equation
This is obviously growing too fast: the second term in the r.h.s., which is the Itô-correction,
must be compensated. We thus try Xt = exp (−αt)Yt , which yields
1
dXt = βXt dBt + 2 β2 − α Xt dt.
1
The second term in the r.h.s. is zero for α = 2 β2 , so we find the solution
1 2
Xt = eβBt − 2 β t .
m
dXi (t) = Ui (t)dt + Vij (t)dBj (t) (i = 1, . . . , n), (19)
j=1
for some processes Ui (t) and Vij (t) in L2 (B, [0, T ]). We sometimes abbreviate (19) in the
vector notation
dX = U dt + V dB.
∂gi n ∂gi 1 n ∂ 2 gi
dYi (t) = ∂t (t, Xt ) dt + j=1 ∂xj (t, Xt ) dXj (t) + 2 j,k=1 ∂xj ∂xk (t, Xt ) dXj dXk
(i = 1, . . . , p),
(20)
where the product dXj dXk has to be evaluated according to the rules
28
Equation (20 is the multi-dimensional version of Itô’s formula, which can be proved in the
same way as its one-dimensional counterpart. (Be careful to keep track of all the indices.
The easiest case is m = n, p = 1.)
Example (‘Bessel process’) Let Rt (ω) = Bt (ω), where Bt is m-dimensional Brownian
motion and · is the Euclidean norm. Apply the Itô-formula to the function r : Rm →
∂r xi ∂2r 1 xi2
R+ : x
→ x. We compute ∂xi = r and ∂xi 2
= r − r3
. So we find
⎛ ⎞
2
m
Bj
m
1 B m
Bj m−1
dR =
1
dBj + 2 ⎝ − j ⎠ dt = dBj + dt.
R R R 3 R 2R
j=1 j=1 j=1
The next theorem gives a way to construct martingales out of Brownian motion. A func-
2
tion f is called harmonic if Δf = 0, with Δ = i ∂ 2 the Laplacian.
∂xi
f"
fk"
29
This is an Itô-integral and hence a martingale.
An alternative way to understand Theorem 4.3 is that a harmonic function f has the
property that its value in a point x is the average over its values on any sphere around
x. This property, together with the fact that Brownian motion is ‘isotropic’, explains why
f (Bt ) is a ‘fair game’.
The following extension of Itô’s formula will be useful later on.
Lemma 4.4 Itô’s formula for Yt = g(Bt ) still holds if g : R → R is C1 everywhere and C2
outside a finite set { z1 , . . . , zN }, with g locally bounded outside this set.
Proof. Take fk ∈ C2 (R) such that fk → g and fk → g as k → ∞, both uniformly and
such that for x ∉ { z1 , . . . , zN }:
⎧
⎨f (x) → g (x)
k
⎩|f (x)| ≤ M in a neighbourhood of { z1 , . . . , zN } .
k
(Fig. 4.3 shows the graph of fk for a simple example of g that has a jump.) For fk we
have the Itô formula
t t
fk (Bs )dBs + 2 fk (Bs )ds.
1
fk (Bt ) = fk (B0 ) +
0 0
1
Lt := lim λ ({ s ∈ [0, t] |Bs ∈ (−ε, ε) }) ,
ε↓0 2ε
(Think of Lt as the density per unit length of the total time spent close to the origin up
to time t.)
30
−ε ε
as shown in Fig. 4.4. Then gε is C2 , except in the points { −ε, ε }, and it is C1 everywhere
on R. Apply Lemma 4.4 to get
t t
gε (Bs )dBs ,
1
2 gε (Bs )ds = gε (Bt ) − gε (B0 ) − (21)
0 0
and
Now, the limit as ε ↓ 0 of the l.h.s. of (21) is precisely Lt . (The time spent by Bt in ±ε
is zero.) Moreover, we trivially have gε (Bt ) → |Bt | and gε (B0 ) → |B0 | as ε ↓ 0. Hence it
suffices to prove that the integral in the r.h.s. of (21) converges to the appropriate limit:
t
gε (Bs ) − sgn(Bs ) dBs → 0 in L2 (Ω, P).
0
t
≤ 0 P (Bs ∈ (−ε, ε)) ds
→ 0 as ε ↓ 0,
31
where in the second equality we use the Itô-isometry and the last statement holds because
Bs (s > 0) has an absolutely continuous distribution. It follows that Lt exists and can be
expressed as in the statement of the theorem.
Note that for smooth functions f :
t
|f (t)| − |f (0)| − sgn (f (s)) f (s)ds = 0
0
because
d
f (t) = sgn (f (t)) f (t) (f (t) ≠ 0).
dt
Thus, the local time is an Itô-correction to this relation, caused by the fact that d|Bt | ≠
sgn (Bt ) dBt : if Bt passes the origin during the time interval Δtj , then |Btj+1 − Btj | need
not be equal to sgn(Btj )ΔBtj . The difference is a measure of the time spent close to the
origin.
The existence of the local times of Brownian motion was proved by Lévy in the 1930’s
using hard estimates. The above approach is shorter and more elegant. What is described
above is the local time at the origin: Lt = Lt (0). In a completely analogous way one can
prove the existence of the local time Lt (x) at any site x ∈ R. The process x → Lt (x)
plays a key role in many applications associated with Brownian motion.
32
5 The Martingale Representation Theorem
Let B(t) = (B1 (t), . . . , Bd (t)) be d-dimesnional Brownian motion. In Section 3, Theorem
3.3, and Corollary 3.6, we have proved that if f ∈ L2 then the Itô integral
t
Xt = X0 + f (s, ω)dB(s); t ≥ 0
0
is always a martingale with respect to the filtration Ft of Brownian motion. This might
not be too surprising because also Brownina motion itself is a martingale. In this section
we prove a result, which is really stunning, namely that the converse also is true: Any Ft -
martingale with respect to P can be represented as an Itô-integral. This result, called the
martingale representation theorem, is important in many applications, e.g. mathematical
finance. In this section we will only prove ist one dimension, but essentailly the same
proof works for arbitrary (finite) d.
We start by establishing some auxiliary results
Excercise. 5.1 Look up the proof the following result, sometimes called the Doob-Dynkin
lemma, e.g. in M. M. Rao, Prop. 3, p.7, or B. Øksendal, Lemma 2.1.2,p.9.
Lemma 5.1 Let (Ω, F , P ) be a probability space and X, Y : Ω → Rd be two random vari-
ables. Denote
σ (X) := {X −1 (B), B ∈ F }.
Then Y ist σ (X)-measurable if and only if there exists a Borel-measurable function g :
Rn → Rn such that
Y = g(X).
Excercise. 5.2 Look up the proof the following result, called the Martingale convergence
theorem, e.g. in B. Øksendal, Corollary C.9.
F∞ := σ {Fk , k = 1, 2 . . . }.
Then
E[X|Fk ] →k→∞ E[F |F∞ ]
P -a.e. and in L1 (P ).
is dense in L2 (FT , P ).
33
Proof. Let {ti }∞
i=1 be a dense subset of [0, T ] and for each n = 1, 2, . . . let Hn denote
the σ -algebra generated by Bt1 , . . . , Btn . Clearly
Hn ⊆ Hn+1
and
FT = σ {Hn , n = 1, 2, . . . }.
Choose g ∈ L2 (FT , P ). Then by the martingale convergence theorem 5.2 we have that
P -a.e. and in L2 (FT , P ). By the Doob-Dynkin lemma 5.1 we can write, for each n,
is dense in L2 (FT , P ).
for all λ = (λ1 , . . . , λn ) ∈ Rn and all t1 , . . . tn ∈ [0.T ]. The function G(λ) is real analytic
and hence has an analytic extension to the complex space Cn given by
G(z) := exp{z1 Bt1 (ω) + . . . + zn Btn (ω)}g(ω)dP (ω)
Ω
34
where
−n/2
φ̂(y) = (2π ) φ(x)e−ix·y dx
Rn
is the Fourier transform and we have used the inverse Fourier transform theorem
φ(x) = (2π )−n/2 φ̂(y)eix·y dy.
Rn
By (23) and Lemma 5.3 g is orthogonal to a dense subset of L2 (FT , P ) and thus we
conculde that g ≡ 0. Therefore the linear span of the functions in (22) must be dense in
L2 (FT , P ) as claimed.
Let B(t) = (B1 (t), . . . , Bd (t)) be d-dimensional Brownian motion. If f (ω, s) ∈ L2 (B, [0, T ])
then the random vaiable T
V (ω) := f (ω, s)dB(s)
0
so V ∈ L2 (FT , P ).
The next result states that any F ∈ L2 (FT , P ) can be represented this way:
Theorem 5.5 (The Itô representation theorem) Let F ∈ L2 (FT , P ). Then there exists a
unique stochastic process f (ω, s) ∈ L2 (B, [0, T ]) such that
T
F (ω) = E[F ] + f (ω, s)dB(s).
0
Proof. Again we will only treat the case of d = 1. First assume that F has the form (22),
i.e. T !
1 T 2
F (ω) = exp h(t)dBt (ω) − h (t)dt ,
0 2 0
1 2 1
dYt = Yt (h(t)dBt − h (t)dt) + Yt (h(t)dBt )2 = Yt h(t)dBt ,
2 2
so that t
Yt = 1 + Ys h(s)dBs ; t ∈ [0, T ].
0
Thus T
F = YT = 1 + Ys h(s)dBs
0
35
and hence E[F ] = 1. So in this case the claim of the lemma holds true.
By linearity it also holds true for linear combinations of functions of the form (22). So, if
F ∈ L2 (FT , P ) we approximate it by linear combinations Fn of functions of the form (22).
Then for each n we have
T
Fn (ω) = E[Fn ] + fn (ω, s)dB(s)
0
the limit being taken in L2 (B, [0, T ]). Hence the representation part of the theorem fol-
lows.
To see the uniqueness we again employ the Itô isometry: Suppose
T T
F (ω) = E[F ] + f1 (ω, s)dB(s) = E[F ] + f2 (ω, s)dB(s)
0 0
Theorem 5.6 ((The martingale representation theorem)) Let B(t) = (B1 (t), . . . , Bd (t)),
0 ≤ t ≤ T , be d-dimensional Brownian motion and let Mt be an Ft - martingale with
respect to P , such that Mt ∈ L2 (P ) for all 0 ≤ t ≤ T . Then there exists a unique stochastic
process g(ω, s) ∈ L2 (B, [0, T ]) and
t
Mt (ω) = E[M0 ] + g(ω, s)dB(s) a.e.
0
for all 0 ≤ t ≤ T .
Proof. Again we just treat the case d = 1. By the Itô representation theorem applied to
T = t and F = Mt , we have that there exists a unique f (t) (ω, s) ∈ L2 (B, [0, T ]) such that
t t
(t)
Mt (ω) = E[Mt ] + f (ω, s)dBs (ω) = E[M0 ] + f (t) (ω, s)dBs (ω).
0 0
36
Now assume that 0 ≤ t1 < t2 Then
" #
t2
Mt1 = E[Mt2 |Ft1 ] = E[M0 ] + E f (t2 ) (ω, s)dBs (ω)|Ft1
0
t1
= E[M0 ] + f (t2 ) (ω, s)dBs (ω). (24)
0
and therefore
f (t1 ) (ω, s) = f (t2 ) (ω, s) for a.a.(ω, s) ∈ [0, t1 ] × Ω.
Now putting
f (ω, s) = f (T ) (ω, s)
gives the result.
37
6 Stochastic differential equations
A stochastic differential equation for a process X(t) with values in Rn is an equation of
the form
m
dXi (t) = bi (t, Xt )dt + σij (t, Xt )dBj (t) (i = 1, · · · , n). (26)
j=1
Here, B(t) = (B1 (t), B2 (t), · · · , Bm (t)) is an m-dimensional Brownian motion, i.e., an m-
tuple of independent Brownian motions on R. The functions bi and σij from R × Rn to R
with i = 1, 2, · · · , n and j = 1, 2, · · · , m form a field b of n-vectors and a field σ of n×m-
matrices. A process X ∈ L2 (B, [0, T ]) for which (26) holds is called a (strong) solution of
the equation. In more pictorial language, such a solution is called an Itô-diffusion with
drift b and diffusion matrix σ σ ∗ .
In this section we shall formulate a result on the existence and the uniqueness of Itô-
diffusions. It will be convenient to employ the following notation for the norms on vectors
and matrices:
n
n
m
x2 := xi2 (x ∈ Rn ); σ 2 := 2
σij = tr (σ σ ∗ ) (σ ∈ Rn×m ).
i=1 i=1 j=1
Also, we would like to take into account an initial condition X(0) = Z, where Z is an Rn -
valued random variable independent of the Brownian motion. All in all, we enlarge our
probability space and our space of adapted processes as follows. We choose a probability
measure μ on Rn and put
Ω := Rn × Ωm ;
Ft := B(Rn ) ⊗ Ft⊗m (t ∈ [0, T ]);
⊗m
P := μ ⊗ P ;
n
Z : Ω → R : (z, ω)
→ z;
L (B, [0, T ]) := {X ∈ L2 (Rn , μ) ⊗ L2 (Ω, FT , P)⊗m ⊗ L2 [0, T ] ⊗ Rn | ω
→ Xt,i
2 x
(ω) is Ft -measurable}.
This change of notation being understood, we shall drop the primes again.
Theorem 6.1 Fix T > 0. Let b : [0, T ] × Rn → Rn and σ : [0, T ] × Rn → Rn×m be measur-
able functions, satisfying the growth conditions
38
Proof. The proof comes in three parts.
1. Uniqueness. Suppose X, Y ∈ L2 (B, [0, T ]) are solutions of (26) with continuous paths.
Put
Applying the inequality (a+b)2 ≤ 2(a2 +b2 ) for real numbers t a and b, the independence
t
of the components of B(t), the Cauchy-Schwarz inequality ( 0 g(s)ds)2 ≤ t 0 g(s)2 ds for
an L2 -function g, the multi-dimensional Itô-isometry and finally the Lipschitz condition,
we find
n
E Xt − Yt 2 := E (Xi (t) − Yi (t))2
i=1
⎛⎛ ⎞2 ⎞
n t m t
⎜ ⎟
= E ⎝⎝ Δbi (s)ds + Δσij (s)dBj (s)⎠ ⎠
0 0
i=1 j=1
⎛ ⎞2 ⎞
2 ⎛ m
n
⎜
t t
⎟
≤2 E⎝ Δbi (s)ds +⎝ Δσij (s)dBj (s)⎠ ⎠
0 0
i=1 j=1
⎛ 2 ⎞
n t
=2 E⎝ Δbi (s)ds ⎠
0
i=1
⎛ ⎞
n m t
+2 E⎝ (Δσij (s))2 ds ⎠
0
i=1 j=1
⎛ ⎞ ⎛ ⎞
t
n t
n
m
≤ 2t E⎝ Δbi (s)2 ⎠ ds + 2 E⎝ Δσij (s)2 ⎠ ds
0 0
i=1 i=1 j=1
t t
= 2t E(Δb(s)2 )ds + 2 E(Δσ (s)2 )ds
0 0
t
≤ 2D 2 (T + 1) EXs − Ys 2 ds.
0
So the function f : t
→ E Xt − Yt 2 satisfies the integral inequality
t
0 ≤ f (t) ≤ A f (s)ds
0
for the constant A = 2D 2(T + 1). This inequality implies that f = 0 (“Gronwall’s inequal-
t
ity”). Indeed, put F (t) = 0 f (s)ds. Then F (t) is C1 and F (t) = f (t) ≤ AF (t). Therefore
d −tA
e F (t) = e−tA (f (t) − AF (t)) ≤ 0.
dt
Since F (0) = 0, it follows that e−tA F (t) ≤ 0 implying F (t) ≤ 0. So we have 0 ≤ f (t) ≤
AF (t) ≤ 0 and hence f (t) = 0.
39
Thus we have E Xt − Yt 2 = 0 for all t ∈ [0, T ]. In particular, for all t ∈ [0, T ] ∩ Q and
almost all ω:
Xt (ω) = Yt (ω).
Now let
and since Xt and Yt have continuous paths we conclude that for almost all ω:
Let us start with the constant process Xt0 := Z, and define recursively
(k+1) (k)
Xt := X̃t (k ≥ 0).
The calculation in the uniqueness part of this proof can be used to conclude that
2 t
E X̃t − Ỹt ≤ A E Xs − Ys 2 ds for any Xs , Ys ∈ L2 (B).
0
≤ 2T 2 C 2 E (1 + Z)2 + 2T C 2 E (1 + Z)2
≤ 2C 2 (T 2 + T )E (1 + Z)2 ,
40
which is finite by the growth condition and the requirement that Z has finite variance.
2 (k)
-
Now let X = L (B)- limk→∞ X . The existence of this limit follows because k Ak t k /k! <
∞. Then
2 (k)
X̃ = L (B)- lim X ˜= L2 (B)- lim X (k+1) = X.
k→∞ k→∞
So X is a solution of (26).
3. Continuity and adaptedness. By Theorem 3.5 the paths t
→ Xt (ω) can be assumed
continuous for almost all ω ∈ Ω. The fact that the solution is adapted is immediate from
the construction.
Definition 6.1 A weak solution of (26) is a pair (Bt , Xt ), measurable w.r.t. some filtration
(Gt )t∈[0,T ] on some probability space (Ω, G, P), such that Bt is m-dimensional Brownian
motion and such that (26) holds.
The key point here is that the filtration need not be (Ft )t∈[0,T ] = σ ((Bs )s∈[0,t] ). (If it is,
then we have a strong solution.) Strong uniqueness is uniqueness of a strong solution.
Weak uniqueness holds if, given σ and b, all weak solutions have the same law. Strong
uniqueness implies weak uniqueness.
By the following example we illustrate the difference between these two concepts.
Example (Tanaka) This is related to our example of Brownian local time in Section 5.4.
Let Bt be a Brownian motion and define
t
B̃t := sgn(Bs )dBs .
0
with (dB̃t )2 = sgn(Bt )2 (dBt )2 = (dBt )2 = dt. Now, Lévy’s characterization of Brownian
motion (which we do not prove here) says that any martingale with quadratic variation t
must be a Brownian motion. Hence B̃t is a Brownian motion.
Turning matters around, Bt is itself a solution of the stochastic differential equation
41
t t t
since Bt = 0 dBs = 0 sgn(Bs )2 dBs = 0 sgn(Bs )dB̃s . Because every solution of (27) is a
Brownian motion, we have weak uniqueness. However, because −Bt obviously also is a
solution of (27), there are two solutions of (27) for a given process B̃t . In other words, we
do not have strong uniqueness.
Taking this argument one step further we find (cf. Theorem 4.5):
where Lt is the local time at the origin. By its definition, Lt is adapted to the filtra-
tion (Gt )t∈[0,T ] generated by (|Bt |)t∈[0,T ] . Hence so is B̃t . It follows that F̃t ⊂ Gt , where
(F̃t )t∈[0,T ] is the filtration generated by (B̃t )t∈[0,T ] . However, since Bt is not Gt -measurable
(its sign is not determined by Gt ), it is not F̃t -measurable.
Note that (27) is an Itô-diffusion with b(t, x) = 0 and σ (t, x) = sgn(x). The latter is not
Lipschitz, which is why Theorem 6.1 does not apply.
42
7 Itô-diffusions and one-parameter semigroups
7.1 Introduction and motivation
An Itô-diffusion Xtx (with initial condition X0x = x ∈ Rn ) is a Markov process in contin-
uous time. If we suppose that the field b of drift vectors and the field σ of diffusion
matrices are both time-independent, i.e., Xtx is the solution starting at x of the stochastic
differential equation
(where b and σ satisfy the Lipschitz conditions mentioned in Theorem 6.1), then this
Markov process is stationary (= time-homogeneous) and can be characterised by its tran-
sition probabilities
. /
Pt (x, B) := P Xtx ∈ B (t ≥ 0),
where B runs through the Borel subsets of Rn . These transition probabilities satisfy the
one-parameter Chapman-Kolmogorov equation
Pt+s (x, B) = Pt (x, dy)Ps (y, B). (29)
y∈Rn
St+s = St ◦ Ss , (31)
i.e., the transformations (St )t≥0 form a one-parameter semigroup. Such a semigroup is
determined by its generator A, defined by
1
Af := lim (St f − f ) . (32)
t↓0 t
In this chapter we study the interplay between the diffusion Xtx and its generator A.
t,x x
Lemma 7.1 (Stationarity) The processes s
→ Xs and s
→ Xs−t are equal in law for s ≥ t.
43
and
x x x
dXs−t = b(Xs−t )ds + σ (Xs−t )d(Bs−t )
respectively, for s ≥ t. Since Bs −Bt and Bs−t are Brownian motions on [t, T ], both starting
at 0, the two processes have the same law by weak uniqueness.
Proposition 7.2 (Markov property) Let Xtx (t ≥ 0) denote the solution of (28) with initial
condition X0x = x. Fix t ≥ 0. Then for all s ≥ 0 the conditional expectation E(f (Xt+s
x
)|Ft )
x
depends on ω ∈ Ω only via Xt . In fact,
x
E(f (Xt+s )|Ft ) = (Ss f )(Xtx ).
On the other hand, the random variable Ys (y) depends on ω ∈ Ω only via the Brownian
motion u
→ Bu (ω) − Bt (ω) for u ≥ t, so Ys (y) is stochastically independent of Ft . By
strong uniqueness we must have, for s ≥ 0,
Ys Xtx = Xt+sx
a.s.,
since both sides are solutions of the same stochastic differential equation (28) for s ≥ 0
with initial condition Y0 (Xtx ) = Xtx . As Xtx is Ft -measurable, we now have
x
E f (Xt+s |Ft ) (ω) = f Ys (Xtx (ω))(ω ) P(dω ) = (Ss f ) Xtx (ω) .
ω ∈Ω
In our study of diffusions and their one-parameter semigroups we choose for B the space
C0 (Rn ) of all continuous functions Rn → R that tend to 0 at infinity. The natural norm
on this space is the supremum norm
f := sup |f (x)|.
x∈Rn
44
For f ∈ C0 (Rn ) we define
(St f )(x) := E f (Xtx ) ,
where Xtx (t ≥ 0) denotes the solution of (28) with initial condition X0x = x.
Theorem 7.3 The operators St with t ≥ 0 form a jointly continuous one-parameter semi-
group of contractions C0 (Rn ) → C0 (Rn ).
We thus obtain
y
EXtx − Xt 2 ≤ 2x − y2 + 2AF (t) ≤ 2x − y2 e2At . (33)
Now choose ε > 0. Since f is uniformly continuous on Rn , there exists a δ > 0 such that
ε
x − y ≤ δ ⇒ |f (x ) − f (y )| ≤ .
2
√ -
It follows that for x, y ∈ Rn not more than δ := δ εe−At /2 2f apart we may estimate
y
|St f (x) − St f (y)| = E f (Xtx ) − E f (Xt )
y
≤ E f (Xtx ) − f (Xt )
ε
y
≤ + 2f P Xtx − Xt > δ
2
ε 1
≤ + 4f e2At x − y2
2 (δ )2
< ε,
45
where the third inequality uses the Markov inequality and (33). So St f is uniformly con-
tinuous as well.
approach to 0 at infinity. Since t
→ St f is continuous in the supremum norm, it suffices
to prove that there exists a τ > 0 such that for all t ∈ [0, τ] we have
lim St f (x) = 0.
x→∞
(Physically speaking, we must prove that the diffusion cannot escape to infinity in a finite
time.) We start with the estimate
Ak t k
x,(k+1) x,(k) 2
E Xt − Xt ≤ Dt(1 + x)2
k!
for the iterates of the map X
→ X̃ in Section 6.1, starting from the constant process
x,0 x,(k+1)
Xt = x. Since Xtx = x + k (Xt − X x,(k)), it follows that
2
E Xtx − x ≤ Dt (1 + x)2 . (34)
-
with Dt = Dt( k Ak t k /k!)2 . Now choose ε > 0. Let M > 0 be such that |f (x )| ≤ 2 for
ε
all x ∈ Rn with x ≥ M, and let τ > 0 be such that 16Dτ f < 2 var epsilon. Then,
1
46
2
Note that from (34) it follows that limt↓0 E Xtx − x = 0. Now again take ε > 0. Let
δ > 0 be such that x − y < δ ⇒ |f (x) − f (y)| < ε/2. Let t0 > 0 be such that
EXtx − x2 < εδ2 /4f for 0 ≤ t ≤ t0 . Then for t ∈ [0, t0 ] we have, again by Chebyshev’s
inequality,
x
(St f ) (x) − f (x) = f Xt (ω) − f (x) P (dω)
Ω
ε . /
≤ + 2 f P Xtx (ω) − x ≥ δ
2
ε 2f
≤ + EXtx − x2
2 δ2
<ε.
47
So fε ∈ Dom(A), and hence Dom(A) is dense in B.
Theorem 7.5 Let (St )t≥0 be the one-parameter semigroup on B = C0 (Rn ) associated
to an Itô-diffusion with coefficients b and σ . Let A be the generator of (St )t≥0 . Then
Cc2 (Rn ) ⊂ Dom(A), and for all f ∈ Cc2 (Rn ):
∂f ∂ 2f
(Af )(x) = bi (x) (x) + 1
2 σ (x)σ (x)∗ ij (x).
i
∂xi i,j
∂xi ∂xj
Using the condition X0x = x, we conclude that for any f ∈ Cc2 (Rn ),
t t
∂f x x
f Xtx − f (x) = Xs σij Xsx dBj (s) + A f Xs ds. (35)
0
i,j
∂xi 0
The first part on the r.h.s. is a martingale and therefore, by the continuity of t
→ Xtx and
the definition of A,
1
(Af ) (x) = lim t E f Xtx − f (x)
t↓0
t
A f Xsx ds
1
= lim E t
t↓0 0
= A f (x) .
48
Note that the martingale part drops out after taking the expectation.
Having thus identified the generator associated with Itô-diffusions, we next formulate an
important formula for stopped Itô-diffusions.
Proof. The proof is a bit ‘sketchy’, but the details are easily filled in. By applying (35) to
x
the stopped process t
→ Xt∧τ and letting t → ∞ afterwards, we find:
τ τ
∂f
f Xτx = f (x) + (X x )σij (Xsx )dBj (s) + (Af ) Xsx ds.
0 ∂xi s 0
i,j
After taking expectations we get the claim (stopped martingales starting at 0 have expec-
tation 0).
7.4 Applications
The simplest example of a diffusion is Brownian motion itself: Xt = Bt (b ≡ 0, σ ≡ id).
Its generator is 1/2 times the Laplacian Δ.
Problem 1: Consider Brownian motion Bta := a + Bt starting at a ∈ Rn . Let R > a. What
is the average time Bta spends inside the ball DR = { x ∈ Rn : x ≤ R }?
Solution: Choose f ∈ Cc2 (Rn ) such that f (x) = x2 for x ≤ R. Let τRa denote the first
time the Brownian motion hits the sphere. Then τRa is a stopping time. Put τ := τRa ∧ T
and apply Dynkin’s formula, to obtain
τ
a
1 a
E f (Bτ ) = f (a) + E 2 (Δf ) Bs ds
0
2
= a + nE(τ),
1
where we use that 2 Δf ≡ n. Obviously, E f (Bτa ) ≤ R 2 . Therefore
1
E(τ) ≤ n R 2 − a2 .
1
As this holds for all T , it follows that E(τRa ) ≤ n R 2 − a2 < ∞. Hence τ → τRa as
T → ∞ by monotone convergence. But then we must have f (Bτa ) → Bτaa 2 = R 2 as
R
T → ∞, and so in fact
1
E τRa = n R 2 − a2
by dominated convergence.
Problem 2: Let b ∈ Rn be a point outside the ball DR . What is the probability that the
Brownian motion starting in b ever hits DR ?
49
Solution: We cannot use Dynkin’s formula directly, because we do not know if the Brow-
nian particle will ever hit the sphere. In order to obtain a well-defined stopping time, we
b
need a bigger ball DM := { x ∈ Rn |x < M } enclosing the point b. Let σM,R be the first
b b
exit time from the ‘annulus’ AM := DM \DR starting from b. Then clearly σM,R = τM ∧ τRb .
1
Now take A = 2 Δ, the generator of Brownian motion, and suppose that fM : AM → Rn
satisfies the following requirements:
(i) ΔfM = 0, i.e., fM is harmonic,
(ii) fM (x) = 1 for x = R,
(iii) fM (x) = 0 for x = M.
We then find
b
P B
b
= R = E fM B b
b = fM (b),
σM,R σM,R
where the first equality uses (ii) and (iii) and the second equality uses Dynkin’s formula
in combination with (i.) (Incidentally, this says that fM is uniquely determined if it exists!)
Next we let M → ∞ to obtain
P τRb < ∞ = P ∃M≥b : τRb < τM b
,
because any path of Brownian motion that hits the boundary of DR must be bounded.
From the latter we in turn obtain
P τRb < ∞ = P ∪M≥b [τRb < τM b
] = lim P τRb < τM
b
M→∞
b
= lim P BσM = R = lim fM (b).
M→∞ M→∞
Thus, the only thing that remains to be done is to calculate this limit, i.e., we must solve
(i)-(iii.)
For n = 2 we find, after some calculation:
It follows that Brownian motion with n = 1 or n = 2 is recurrent, since it will hit any
sphere with probability 1. But for n ≥ 3 Brownian motion is transient.
50
8 Transformations of diffusions
In this section we treat two useful formulas in the theory of diffusions, which are proved
using Itô-calculus.
Theorem 8.1 (Feynman-Kac formula) For x ∈ Rn , let Xtx be an Itô-diffusion with gener-
ator A and initial condition X0x = x. Let v : Rn → [0, ∞) be continuous, and let Stv f for
f ∈ C0 (Rn ) and t ≥ 0 be given by
v x t
x
St f (x) = E f Xt exp − v(Xu )du .
0
Proof. It is not difficult to show, by the techniques used in the proof of Theorem 7.3,
that Stv f lies again in C0 (Rn ), and that the properties 1, 2 and 4 in Definition 7.1 hold. It
is illuminating, however, to explicitly prove Property 3.
For 0 ≤ s ≤ t, let
t
x s,x
Zs,t := exp − v Xu du .
s
x
Property 3 in Definition 7.1 is preserved due to the particular form of the process Z0,t . In
fact, let f ∈ C0 (R ). Then
n
v x
x
St+s f (x) = E Z0,t+s f Xt+s
x Xtx x
= E Z0,t Zt,t+s f Xt+s
x
x Xt t,Xtx
= E Z0,t E Zt,t+s f Xt+s |Ft
x
= E Z0,t g Xtx
= Stv g (x) ,
where
y t,y y y
g(y) = E Zt,t+s f Xt+s |Ft = E Z0,s f (Xs ) = Ssv f y
by stationarity. So indeed
v
St+s f (x) = Stv ◦ Ssv f (x) .
Let us finally show that A − v is the generator. To that end we calculate (with the help of
Itô’s formula and dXt = b(Xt )dt + σ (Xt )dBt ):
d Z0,t f (Xt ) =(dZ0,t )f (Xt ) + Z0,t d(f (Xt )) + (dZ0,t )(df (Xt ))
. /
= −v (Xt ) f (Xt ) + (Af ) (Xt ) Z0,t dt + Z0,t σ (Xt )T ∇f (Xt ) , dBt .
51
In short, dStv f = Stv ((A − v)f ) dt which means that A − v is indeed the generator of the
semigroup (Stv )t≥0 .
We can give the semigroup (Stv )t≥0 a clear probabilistic interpretation. The positive num-
ber v(y) is viewed as a ‘hazard rate’ at y ∈ Rn , the probability per unit time for the
diffusion process to be ‘killed’. Let us extend the state space Rn by a single point ∂, the
‘coffin’ state, where the system ends up after being killed. Then it can be shown that there
exists a stopping time τ, the ‘killing time’, such that the process Ytx given by
⎧
⎨X x if t ≤ τ
x t
Yt :=
⎩∂ if t > τ
satisfies
x x t x
E f Yt = E f Xt exp − v Xu du ,
0
provided we define f (∂) := 0. The proof of this requires an explicit construction of the
killing time τ, which we shall not give here.
The Feynman-Kac formula was originally formulated as a non-rigorous ‘path-integral for-
mula’ in quantum mechanics by R. Feynman, and was later reformulated in terms of
diffusions by M. Kac. The connection with quantum mechanics can be stated as follows.
1
If Xt is Brownian motion, then the generator of Stv is 2 Δ − v. This is (−1)× the Hamilton
n
operator of a particle in a potential v in R . According to Schrödinger, the evolution in
time of the wave function ψ ∈ L2 (Rn ) describing such a particle is given by ψ
→ Utv ψ,
where Utv is a group of unitary operators given by
1
Utv = exp it 2 Δ − v (t ∈ R).
Finally, if v fails to be nonnegative, then the Feynman-Kac formula may still hold. For
instance, it suffices that v be bounded from below.
where b is bounded and Lipschitz continuous. This induces a probability measure P̃x
b on
Ω.
52
Theorem 8.2 (Cameron-Martin formula) For all x ∈ Rn the measures Px and P̃x
b are
mutually absolutely continuous with Radon-Nikodym derivative given by
T
dP̃x T x x 2
b
= exp x 1
b Bs , dBs − 2 b Bs ds .
dPx 0 0
Proof. Fix x ∈ Pn . The idea of the proof is to show that Xtx has the same distribution
under Px (or P0 ) as Btx has under ρT ·Px , where ρT denotes the Radon-Nikodym derivative
of the theorem. In other words, we shall show that for all 0 ≤ t1 ≤ t2 ≤ · · · ≤ tn ≤ T and
all f1 , f2 , · · · fn ∈ C0 (Rn ):
Ex f1 Xt1 × · · · × fn Xtn = Ex ρT f1 Btx1 × · · · × fn Btxn , (36)
Let us start with the l.h.s. First we note that for 0 ≤ s ≤ t ≤ T , for all F in the algebra
As of all bounded Fs -measurable functions on Ω and for all f ∈ C0 (Rn ) we have by the
property of conditional expectations and the Markov property,
Ex (F f (Xt )) = Ex E(F f (Xt ))|Fs
= Ex F E(f (Xt ))|Fs
= Ex F (St−s f )(Xt ) .
If we now apply this result repeatedly on the product f1 (Xt1 ) × · · · × fn (Xtn ), first pro-
jecting onto Atn−1 , then on Atn−2 , and continue down to At1 , we obtain (37).
Now consider the r.h.s. of (36), which is more difficult to handle. We shall show that it is
also given by (37) in three steps. Namely,
1. For all t ∈ [0, T ], F ∈ At ,
Ex (F ρT ) = Ex (F ρt ) . (38)
53
3. We apply (39) repeatedly, first with t = tn , s = tn−1 , F ∈ Atn−1 , then with t = tn−1 ,
s = tn−2 , F ∈ Atn−2 , continuing until t = t1 , and we obtain (37).
Thus, to complete the proof it remains to prove (38) and (39).
1
Proof of 1: Put ρt := exp (Zt ) with dZt = b (Bt ) , dBt − 2 b (Bt )2 dt. Then ρT is as
1
defined above and dρt = exp (Zt ) dZt + 2 exp (Zt ) (dZt )2 = ρt b (Bt ) , dBt . It follows
that t
→ ρt is a martingale. Therefore for F ∈ At :
Ex (F ρT ) = Ex (E (F ρT |Ft ))
= Ex (F E (ρT |Ft ))
= Ex (F ρt ) .
d x d x
E (F ρT f (Bt )) = E (F ρt f (Bt ))
dt dt
= Ex (F ρt (Af ) (Bt ))
= Ex (F (Af ) (Bt ) ρT ) ,
1
where A := 2 Δ + b, ∇ is the generator of (Xt ). Therefore the left and the right hand
side of (39) have the same derivative with respect to t for t ≥ s. Since they are equal for
t = s, they must be equal for all t ≥ s.
and let (St ) be the associated contraction semigroup, which can be expressed using the
Cameron-Martin formula as
t
t
x x x 1 x 2
(St f ) := E(f (Xt )) = E f (Bt ) exp − ∇h(Bs ), dBs − 2 h(Bs ) ds .
0 0
where
1
v (x) := 2 ∇h (x)2 − Δh (x) . (40)
54
Then from (40) and the equality
t t
h(Btx ) = h(x) + ∇h(Bsx ), dBs + 12 Δh(Bsx )ds
0 0
1 − 1 x2
Ω(x) = e 2 .
c
55
9 The Black and Scholes option pricing formula.
In 1973 Black and Scholes published a paper (after two rejections) containing a formula
for the fair price of a European call option on stocks. This formula now forms the basis of
pricing practice on the option market. It is a fine example of applied stochastic analysis,
and marks the beginning of an era were banks employ probabilists that are well versed
in Itô calculus.
A (European) call option on stocks is the right, but not the obligation, to buy at some
future time T a share of stock at price K. Both T and K are fixed ahead. Since at time T
the value of the stock may be higher than K, such a right-to-buy in itself has a value. The
problem of this section is: what is a reasonable price for this right?
We model the stock price S_t as the solution of the stochastic differential equation dS_t = S_t( μ dt + σ dB_t ). The constant μ ∈ R (usually positive) describes the relative rate of return of the shares. The constant σ > 0 is called the volatility and measures the size of the random fluctuations in the stock value. Shares of stock represent some real asset, for example partial ownership of a company. They can be bought or sold at any time t at the current price S_t.
Let us suppose that on the market certain other securities, called bonds, are available that yield a riskless return rate r ≥ 0. This is comparable to the interest on bank accounts. The value β_t of a bond satisfies the differential equation

dβ_t = r β_t dt, so that β_t = β_0 e^{rt}.
The question we are addressing is: How much would we be willing to pay at time 0 for the
right to buy at time T one share of stock at the price K > 0 fixed in advance? Such a right is
called a European stock option. The time T is called the expiry time, the price K is called
the exercise price. This option pricing problem turns out to be a problem of stochastic
control.
Another type of option is the American stock option, where the time at which the shares can be bought is not fixed beforehand but only has an upper limit. The pricing problem for American stock options contains, apart from a stochastic control element, also an optimal stopping problem. It is therefore more difficult, and we leave it aside.
The solution to the European option pricing problem is given by the Black and Scholes option pricing formula (50) appearing at the end of this Section. As this formula does not look wholly transparent at first sight, we shall introduce the result in three steps, raising the level of complexity slowly by introducing the ingredients one by one.
In this stationary world (the case μ = r = 0) our option pricing problem is easy: a fair price q at time 0 of the right to buy at time T a share of stock at the price K is

q := E( (S_T − K)^+ ), (44)
where x^+ stands for max(0, x). Indeed, if the stock value S_T turns out to be larger than
the exercise price K, then the holder of the option makes a profit of ST − K by buying
the share of stock at the price K that he is entitled to, and then immediately selling it
again at its current value ST . On the other hand, if ST ≤ K, then his option expires as a
worthless contract.
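Anticipating the explicit form of S_T used in the computation below, the fair price (44) can already be estimated by simulation. A minimal sketch in Python (the parameter values and names are only illustrative):

    import numpy as np

    rng = np.random.default_rng(2)
    S0, K, sigma, T = 100.0, 105.0, 0.2, 1.0
    paths = 1_000_000

    # in the case mu = r = 0 the stock price is the exponential martingale
    # S_T = S0 * exp(sigma * B_T - sigma^2 * T / 2), with B_T ~ N(0, T)
    B_T = rng.normal(0.0, np.sqrt(T), size=paths)
    S_T = S0 * np.exp(sigma * B_T - 0.5 * sigma ** 2 * T)

    q_estimate = np.mean(np.maximum(S_T - K, 0.0))   # Monte Carlo estimate of (44)
    print(q_estimate)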
Since we know the price process St to be an exponential martingale, we can explicitly
evaluate the option price q:
q = E( (S_T − K)^+ )

  = E( ( S_0 e^{σ B_T − ½σ²T} − K )^+ )

  = ∫_{−∞}^{∞} ( S_0 e^{w − ½σ²T} − K )^+ ϕ_{σ²T}(w) dw (45)

  = ∫_{log(K/S_0) + ½σ²T}^{∞} ( S_0 e^{w − ½σ²T} − K ) ϕ_{σ²T}(w) dw

  = S_0 Φ_{σ²T}( log(S_0/K) + ½σ²T ) − K Φ_{σ²T}( log(S_0/K) − ½σ²T ),

where Φ_λ : u ↦ Φ(u/√λ) is the distribution function of the normal law with mean 0 and variance λ (Φ denoting the standard normal distribution function), and ϕ_λ(w) = (1/√(2πλ)) exp(−w²/2λ) is the associated density function. In (45) we have made use of the equalities

e^{w − ½λ} ϕ_λ(w) = ϕ_λ(w − λ)

and

∫_x^{∞} ϕ_λ(w) dw = Φ_λ(−x).
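The last line of (45) is easy to evaluate in code. The sketch below (Python; the function names and parameter values are ours and only illustrative) implements it; for the parameters of the Monte Carlo sketch following (44) the two numbers agree up to sampling error.

    import math
    from statistics import NormalDist

    def Phi(u, lam):
        # normal distribution function with mean 0 and variance lam: Phi_lam(u) = Phi(u / sqrt(lam))
        return NormalDist().cdf(u / math.sqrt(lam))

    def call_price_no_interest(S0, K, sigma, T):
        # the fair option price (45) in the case mu = r = 0
        lam = sigma ** 2 * T
        x = math.log(S0 / K)
        return S0 * Phi(x + 0.5 * lam, lam) - K * Phi(x - 0.5 * lam, lam)

    print(call_price_no_interest(S0=100.0, K=105.0, sigma=0.2, T=1.0))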
The above option pricing formula covers the case μ = r = 0. Surprisingly, it also covers the case μ ≠ 0, r = 0; i.e., μ plays no role in the final result. In fact, the full Black and Scholes option pricing formula is obtained by substituting Ke^{−rT} for K to take devaluation into account. It was this surprising disappearance of μ from the formula that caused the difficulties that Black and Scholes experienced in getting their result accepted.
And indeed, for a justification of these statements we need a considerable extension of
our theoretical background.
We note that this equation expresses a kind of 'time delay': the amount a_{t_i} of stock bought at time t_i only has its effect at time t_{i+1}. This delay will remain even after the continuous-time limit is taken, when it will give rise to an Itô-term.
Now consider the portfolio value C_t at time t:

C_t := a_t S_t + b_t.

For a self-financing strategy the increments (ΔC)_i := C_{t_{i+1}} − C_{t_i} and (ΔS)_i := S_{t_{i+1}} − S_{t_i} then satisfy

(ΔC)_i = a_i (ΔS)_i.
Now, since there is no essential limit to the frequency of trade, the partition of [0, T] generated by the sequence of times 0 = t_0 < t_1 < t_2 < · · · < t_{n−1} < t_n = T can be made arbitrarily fine. It is therefore reasonable to make the following idealisation: a pair (a, b) of adapted processes with continuous paths is called a self-financing strategy if the portfolio value C_t := a_t S_t + b_t satisfies

dC_t = a_t dS_t ,

the right hand side being understood as an Itô integral with respect to the price process S.
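The discrete-time bookkeeping behind this idealisation can be made explicit in a few lines of Python (the rebalancing rule below is arbitrary and purely illustrative; only the self-financing update of the cash position b matters):

    import numpy as np

    rng = np.random.default_rng(3)
    n, S0, sigma, dt = 250, 100.0, 0.2, 1.0 / 250

    # one simulated stock path (mu = 0, for definiteness)
    increments = sigma * rng.normal(0.0, np.sqrt(dt), n) - 0.5 * sigma ** 2 * dt
    S = np.concatenate(([S0], S0 * np.exp(np.cumsum(increments))))

    a = rng.uniform(0.0, 1.0, n)   # arbitrary stock holdings on the intervals [t_i, t_{i+1})
    b = np.empty(n)
    b[0] = 10.0                    # arbitrary initial cash position
    for i in range(n - 1):
        # self-financing: the rebalancing at t_{i+1} is paid for out of the cash position
        b[i + 1] = b[i] - (a[i + 1] - a[i]) * S[i + 1]

    C = np.empty(n + 1)            # portfolio value C_i = a_i S_{t_i} + b_i
    C[:n] = a * S[:n] + b
    C[n] = a[n - 1] * S[n] + b[n - 1]

    # the increments of C are exactly a_i (Delta S)_i, whatever the rebalancing rule
    print(np.max(np.abs(np.diff(C) - a * np.diff(S))))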
We are now in a position to define what we mean by the ‘fair price’ of a claim or option.
Let g : [0, ∞) → [0, ∞) be measurable. By a claim to g(ST ) we mean the right to cash in
at time T the amount g(ST ), which depends on the current stock value ST at time T .
Definition 9.2 A claim to g(S_T) is called redundant if there exists a self-financing strategy (a, b) such that with probability 1,

C_T := a_T S_T + b_T = g(S_T).

In that case the fair price of the claim is defined as

F( g(S_T) ) := C_0 = a_0 S_0 + b_0.
9.4 Motivation
In the economic literature the above definition is usually motivated by the following
argument (a so-called arbitrage argument).
Suppose that claims to g(S_T) were traded at time 0 at a price p higher than the fair price q := F(g(S_T)). Then it
would be possible to make an unbounded and riskless profit (an ‘arbitrage’) by selling
n such claims for the market price p, then to reserve an amount nq as initial capital
for the self-financing strategy (na, nb) — yielding with probability 1 the amount ng(ST )
needed to satisfy the claims — and then to pocket the difference n(p − q). Conversely, if
the market price p of the claim were lower than q, then one could buy n claims and
apply the strategy (−na, −nb) yielding an immediate gain of n(q − p) at time 0. At time
T one could cancel one’s debts by executing the n claims to g(ST ). It should be admitted
that this second strategy, involving negative share holdings (or short sales of stock), is
somewhat more artificial than the first. But clearly, the possibility of arbitrage is not fair.
This concludes the motivation of Definition 9.2. In economic theory one often goes one
step further and assumes that arbitrage in fact does not occur. It is claimed that the
possibility of arbitrage would immediately be used by one of the parties on the market,
and this would set the market price equal to the fair price.
9.5 Results
Theorem 9.1 Let g ∈ C_0(0, ∞). On a stock market without bonds or interest (i.e., with r = 0), the fair price at time 0 of a claim to g(S_T) at time T > 0 is

F( g(S_T) ) = E( g(X_T) ),

where (X_t)_{t∈[0,T]} is the exponential martingale with parameter σ starting at S_0, i.e., the solution of the stochastic differential equation

dX_t = X_t ( σ dB_t ) with X_0 = S_0. (49)
Corollary 9.2 In the absence of bonds or interest the fair price at time 0 of an option to buy a share of stock at time T at the exercise price K is

F( (S_T − K)^+ ) = E( (X_T − K)^+ ),

where X_t is the exponential martingale of Theorem 9.1. The right hand side is given explicitly by (45).
Proof. Define f_t(x) := ( e^{(T−t)A} g )(x), where A := ½ σ² x² d²/dx² is the generator of the exponential martingale (X_t), and put a_t := f_t'(S_t), b_t := f_t(S_t) − S_t f_t'(S_t), so that C_t := a_t S_t + b_t = f_t(S_t) and C_T = g(S_T). By the Itô formula the dt-terms cancel, whatever the value of μ, and

d f_t(S_t) = f_t'(S_t) dS_t = a_t dS_t ,

so the strategy (a, b) is self-financing.
It follows that the fair price at time 0 is given by

F( g(S_T) ) := a_0 S_0 + b_0 = f_0(S_0) = ( e^{TA} g )(S_0) = E( g(X^{S_0}_T) ).
Note that there is no μ in this proof! Apparently the fair price at time 0 is not influenced
by μ! To begin to understand this fact, we take the case g(x) = x. Clearly the fair price
at time 0 of a share at time T is just S0 , not E(ST ): St is automatically a ‘martingale under
F’.
Explicit calculation of the strategy for the case g(x) = (x − K)^+ of a stock option yields

a_t = Φ_{σ²(T−t)}( log(S_t/K) + ½ σ²(T−t) )

and

b_t = −K Φ_{σ²(T−t)}( log(S_t/K) − ½ σ²(T−t) ).

These expressions describe a smooth steering mechanism, moving from the initial value (a_0, b_0) to the final value (a_T, b_T), given by

(a_T, b_T) = (0, 0) if S_T ≤ K, and (a_T, b_T) = (1, −K) if S_T > K.
Thus the pair (a_t, b_t) always moves inside [0, 1] × [−K, 0]: the strategy always involves borrowing money in order to buy up to one single share of stock. In cases where the option is cheap, relatively much money has to be borrowed in order to imitate the workings of the option.
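These formulas can be tested by simulation. The sketch below (Python; the time grid, the drift μ and the other parameter values are illustrative choices of ours) starts with the fair price (45) as initial capital, trades according to the strategy (a_t, b_t) above at finitely many times, and compares the terminal portfolio value with the payoff (S_T − K)^+. The drift μ is deliberately taken nonzero, to illustrate that it plays no role.

    import math
    import numpy as np
    from statistics import NormalDist

    def Phi(u, lam):
        # normal distribution function with mean 0 and variance lam
        return NormalDist().cdf(u / math.sqrt(lam))

    S0, K, sigma, mu, T, n = 100.0, 105.0, 0.2, 0.15, 1.0, 2000
    dt = T / n
    rng = np.random.default_rng(4)

    def stock_holding(t, S):
        # a_t from the formula above
        lam = sigma ** 2 * (T - t)
        return Phi(math.log(S / K) + 0.5 * lam, lam)

    lam0 = sigma ** 2 * T
    C0 = S0 * Phi(math.log(S0 / K) + 0.5 * lam0, lam0) - K * Phi(math.log(S0 / K) - 0.5 * lam0, lam0)
    S, a = S0, stock_holding(0.0, S0)
    b = C0 - a * S0                     # b_0, so that a_0 S_0 + b_0 is the fair price (45)

    for i in range(1, n + 1):
        # exact simulation of the stock price, geometric Brownian motion with drift mu
        S *= math.exp((mu - 0.5 * sigma ** 2) * dt + sigma * math.sqrt(dt) * rng.normal())
        if i < n:
            a_new = stock_holding(i * dt, S)
            b -= (a_new - a) * S        # self-financing rebalancing (r = 0)
            a = a_new

    print(a * S + b, max(S - K, 0.0))   # terminal portfolio value vs. option payoff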
This result (the substitution of Ke^{−rT} for K mentioned above) will come out as a combined effect of an upward drift and a discount.
In the presence of bonds, a self-financing strategy is to be defined as a pair (a, b) of adapted processes with continuous paths such that the total portfolio value C_t := a_t S_t + b_t β_t satisfies

dC_t = a_t dS_t + b_t dβ_t.
Theorem 9.3 On a stock market with stocks and bonds the fair price at time 0 of a claim to g(S_T) at time T > 0 is

F( g(S_T) ) = E( e^{−rT} g(Y_T) ),

where (Y_t)_{t∈[0,T]} is the solution of the stochastic differential equation dY_t = Y_t( r dt + σ dB_t ) with Y_0 = S_0.
Proof. Let B := r x ∂/∂x + ½ σ² x² ∂²/∂x² be the generator of the diffusion Y_t and define f_t(x) := ( e^{(T−t)(B−r)} g )(x), a_t := f_t'(S_t) and b_t := ( f_t(S_t) − S_t f_t'(S_t) ) / β_t, so that a_t S_t + b_t β_t = f_t(S_t) and f_T(S_T) = g(S_T). By the Itô formula,

d f_t(S_t) = r ( f_t(S_t) − S_t f_t'(S_t) ) dt + f_t'(S_t) dS_t
           = b_t dβ_t + a_t dS_t ,

so the strategy (a, b) is self-financing. It follows that

F( g(S_T) ) := f_0(S_0) = ( e^{T(B−r)} g )(S_0) = E( e^{−rT} g(Y_T) ).
Corollary 9.4 On a stock market with stocks and bonds the fair price at time 0 of an option to buy a share of stock at time T at the exercise price K is

F( (S_T − K)^+ ) = E( e^{−rT} (Y_T − K)^+ ) = E( (X_T − Ke^{−rT})^+ )
                = S_0 Φ_{σ²T}( log(S_0/K) + rT + ½σ²T ) − K e^{−rT} Φ_{σ²T}( log(S_0/K) + rT − ½σ²T ). (50)

Proof. The first equality is Theorem 9.3, the second equality follows from the fact that Y_t = e^{rt} X_t, while the third equality is (45) with Ke^{−rT} substituted for K.
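In code, the substitution of Ke^{−rT} for K amounts to a one-line change in the routine given after (45); the sketch below (Python; names and parameter values are again only illustrative) also checks the resulting price against a Monte Carlo estimate of E( e^{−rT} (Y_T − K)^+ ).

    import math
    import numpy as np
    from statistics import NormalDist

    def Phi(u, lam):
        # normal distribution function with mean 0 and variance lam
        return NormalDist().cdf(u / math.sqrt(lam))

    def black_scholes_call(S0, K, sigma, r, T):
        # the r = 0 formula (45) with K replaced by K * exp(-r T)
        Kd = K * math.exp(-r * T)
        lam = sigma ** 2 * T
        x = math.log(S0 / Kd)
        return S0 * Phi(x + 0.5 * lam, lam) - Kd * Phi(x - 0.5 * lam, lam)

    S0, K, sigma, r, T = 100.0, 105.0, 0.2, 0.03, 1.0
    print(black_scholes_call(S0, K, sigma, r, T))

    # Monte Carlo check: Y_t = e^{rt} X_t, so Y_T = S0 * exp((r - sigma^2 / 2) T + sigma B_T)
    rng = np.random.default_rng(5)
    B_T = rng.normal(0.0, math.sqrt(T), size=500_000)
    Y_T = S0 * np.exp((r - 0.5 * sigma ** 2) * T + sigma * B_T)
    print(math.exp(-r * T) * np.mean(np.maximum(Y_T - K, 0.0)))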