Classical Wavelet Theory: Jonathan R. Partington, University of Leeds, School of Mathematics April 29, 2010
Classical Wavelet Theory: Jonathan R. Partington, University of Leeds, School of Mathematics April 29, 2010
http://www.maths.leeds.ac.uk/∼ pmt6jrp/gradcourse0910.html
LECTURE 1
Books:
Ingrid Daubechies, Ten lectures on wavelets, SIAM.
C.K. Chui, An introduction to wavelets, Academic Press.
Gerald Kaiser, A friendly guide to wavelets, Birkhäuser.
1 Introduction
Fourier series and transforms
and Z 1
ck = f (t)e−2πikt dt = fˆ(k), say.
0
Likewise for suitable functions f on the real line we have
Z ∞
f (t) = fˆ(w)e2πiwt dw, where
−∞
Z ∞
ˆ
f(w) = f (t)e−2πiwt dt.
−∞
These are useful formulae, but not localised – if you want to examine f near
t = t0 , say, you still need all the information in fˆ.
1
Windowed Fourier transform (Gabor, 1946)
Let g be a function which is ‘well-localised’ – i.e., big near 0, say, and with g(t)
small for large t.
Examples:
2 /2
• g(t) = e−t ;
Let Z ∞
win ′
T f (w, t ) = f (t)g(t − t′ )e−2πiwt dt.
−∞
Wavelet transforms
In particular, we want Z ∞
ψ(t) dt = ψ̂(0) = 0.
−∞
where
a,b −1/2 t−b
ψ (t) = |a| ψ .
a
2
The number a 6= 0 controls the resolution (scaling) and b the position of the
function.
Then in fact there is an inversion formula
Z ∞Z ∞
−1 da db
f = Cψ ψ a,b hf, ψ a,b i 2 ,
−∞ −∞ a
where Z ∞
|ψ̂(w)|2
Cψ = 2π dw.
−∞ |w|
Again, to reconstruct f we only need a discrete version. Let
−n/2
ψn,m (t) = a0 ψ(a−n
0 t − mb0 )
Then for suitable a0 , b0 (the ones above are OK for the Mexican hat), we can find
‘dual’ functions ψ̃n,m such that
∞
X ∞
X
f= hf, ψn,m iψ̃n,m .
n=−∞ m=−∞
a condition which makes (ψn,m ) a frame. Sometimes we can do better (e.g. Haar
wavelets, Littlewood–Paley wavelets) and (ψn,m ) form an orthonormal basis, i.e.,
ψ̃n,m = ψn,m and A = B = 1.
Multi-resolution approximations
. . . ⊂ V−2 ⊂ V−1 ⊂ V0 ⊂ V1 ⊂ V2 ⊂ . . . .
T S
Also Vn = {0} and Vn is dense in L2 (R).
We also note that f (t) ∈ Vn ⇐⇒ f (2t) ∈ Vn+1 (for each n), and f (t) ∈ V0 ⇐⇒
f (t − m) ∈ V0 (for each m).
3
In this case we also have that the function g(t) = χ(0,1) (t) has the property that
(g(t − m))m∈Z is an orthonormal basis for V0 .
From this we will later construct “wavelets” and find a function ψ that gives an
orthonormal basis of L2 (R) of the form
Applications
LECTURE 2
2.1 Definition
Let 1 ≤ p < ∞. We define the space Lp (R) as the space of (measurable) functions
f : R → C satisfying
Z ∞ 1/p
p
kf kp := |f (t)| dt < ∞.
−∞
4
Again we regard two functions f, g as equal if f (t) = g(t) a.e. Then L∞ (R), and
more generally L∞ (a, b), is a Banach space.
Fourier series
We work on the interval [0, 1]; other intervals can be treated by a change of
variables.
Let
en (t) = exp(2πint),
for each n ∈ Z.
2.2 Theorem
(en )n∈Z is an orthonormal basis of L2 (0, 1), that is, it satisfies:
(
1 if n = m,
1. hen , em i = δnm =
0 for n 6= m.
P
2. Every function f ∈ L2 (0, 1) has an expansion f = ∞ n=−∞ cn en , converging
in L2 norm, i.e.,
N
X
kf − cn en k2 → 0 as N → ∞.
n=−N
P∞
Moreover kf k2 = n=−∞ |cn |2 (Parseval’s formula), and
Z 1
cn = hf, en i = f (t)e−2πint dt
0
N
X
2 2
kf k ≥ kfN k = |cn |2 .
n=−N
P∞
Hence N =−∞ |cN |2 ≤ kf k2 , and
X
kfN − fM k2 = |cn |2 , if N < M,
N <|n|≤M
5
from which we deduce that (fN ) is a Cauchy sequence, and so converges to g,
say. The hard part is to show that f = g, i.e., (en ) is a complete orthonormal
basis, and then
X∞
2 2
kf k = lim kfN k = |cn |2 .
n=−∞
2.4 Remarks
In fact fN → f a.e. when f ∈ L2 , but this is much harder (Carleson, 1962). If
f only lies in L1 , then it is possible for its Fourier series to diverge everywhere
(Kolmogorov, 1926). Even for continuous periodic functions the Fourier series
may diverge at some points, and certainly need not converge uniformly.
L2 (0, 1) → ℓ2 (Z)
f 7→ (fˆ(n))n := (hf, en i)n .
Fourier transforms.
2.5 Proposition
Suppose that f ∈ L1 (R). Then the integral
Z ∞
f (t) exp(−2πiωt) dt
−∞
converges absolutely for all ω ∈ R, and defines a function which we write as fˆ(ω),
the Fourier transform of f . Moreover kfˆk∞ ≤ kf k1 .
LECTURE 3
2.6 Theorem
6
Let f ∈ L1 (R) ∩ L2 (R). Then the function fˆ exists and satisfies kfk
ˆ 2 = kf k2 ;
thus the Fourier transform can be extended continuously to the whole of L2 (R),
and satisfies the following:
2.7 Corollary
There is a linear operator F : L2 (R) → L2 (R) such that
Z ∞
(F f )(w) = f (t) exp(−2πiwt) dt a.e.
−∞
hf, gi = hF f, F gi
for f, g ∈ L2 (R).
Let V0 denote the subspace of L2 (R) consisting of all functions f that are constant
on every interval of the form (n, n + 1), where n is an integer.
7
3.1 Lemma
Every function f ∈ V0 has the form
∞
X ∞
X
f (t) = ak φ(t − k), with kf k22 = |ak |2 ,
k=−∞ k=−∞
where (
1 if 0 < t < 1,
φ(t) = χ(0,1) (t) =
0 otherwise.
Indeed, ak is the value of f on (k, k + 1). Thus the functions (φk )k∈Z , where
φk (t) = φ(t − k), form an orthonormal basis of V0 .
3.2 Theorem
For j ∈ Z letVj denote the space of functions f constant on all intervals of the
form 2kj , k+1
2j
, with k ∈ Z. Then
. . . V−2 ⊂ V−1 ⊂ V0 ⊂ V1 ⊂ V2 ⊂ . . . .
T S
Also j∈Z Vj = {0} and j∈Z Vj = L2 (R).
A function f lies in Vj if and only if the function t 7→ f (2−j t) lies in V0 , and Vj
has an orthonormal basis consisting of the functions
for each j, and if kf k < ∞, this means that c = 0 and f = 0 a.e. on (0, ∞).
Similarly for (−∞, 0). So f = 0 a.e.
S
Now to show that j∈Z Vj is dense, take f ∈ L2 (R) and ǫ > 0. By standard
results on L2 functions we have that ∃N ∈ N and a continuous function g such
that g(t) = 0 for |t| ≥ N and kf − gk2 < ǫ/2.
8
Now g is uniformly
√ continuous on [−N, N], so ∃j > 0 such that g doesn’t vary
by more than ǫ/ 8N on any interval (k/2j , (k + 1)/2j ). Hence there is a step
function h ∈ Vj such that
Z N 1/2
ǫ2
kg − hk2 ≤ dt = ǫ/2.
−N 8N
S
Thus kf − hk2 < ǫ by the triangle inequality, and so j∈Z Vj is dense.
If f ∈ Vj then f (t) is constant for t ∈ 2kj , k+1
2j
, for all k ∈ Z.
−j
This happens if and only if f (2 t) is constant for t ∈ (k, k + 1), for all k, i.e.,
f (2−j t) ∈ V0 .
Finally, any function in Vj has the form
∞
X
−j
f (2 t) = ak φ(t − k),
k=−∞
Now
hφ(2j t − k), φ(2j t − ℓ)i = 2−j δkℓ ,
and so clearly (2j/2 φ(2j t − k))k∈Z is an orthonormal basis of Vj .
LECTURE 4
Now (Vj ) provides a multi-resolution approximation of L2 (R), the ‘resolution’
being 2−j at the jth level. The best approximation fj ∈ Vj to f ∈ L2 (R) is given
by
∞
X
fj = PVj f = hf, φjk iφjk ,
k=−∞
We now want to use the orthonormal bases for the Vj to get a grand orthonormal
basis for L2 (R).
To do this we introduce the Haar wavelet ψ given by
Clearly ψ ∈ V1 \ V0 .
9
3.3 Lemma
The functions t 7→ ψ(t − k), k ∈ Z, form an orthonormal basis of a space W0 such
that V1 has the orthogonal decomposition V1 = V0 ⊕ W0 .
3.4 Lemma
For each j ∈ Z, Vj+1 = Vj ⊕ Wj , where Wj has the orthonormal basis
(2j/2 ψ(2j t − k))k∈Z .
3.5 Theorem
L2 (R) = . . . ⊕ W−2 ⊕ W−1 ⊕ W0 ⊕ W1 ⊕ W2 ⊕ . . . as an orthogonal direct sum;
that is, the functions
LECTURE 5
10
Any closed subspace V of L2 (R) has an orthogonal complement V ⊥ , i.e.,
L2 (R) = V ⊕ V ⊥ , where
⊥
= Vn ⊕ Wn ⊕ Vn+1 ,
so
3.6 Corollary
Every f ∈ L2 (R) has a multi-resolution expansion
∞
X ∞
X
f= hf, ψjk iψjk ,
j=−∞ k=−∞
11
converging in L2 (R) norm.
R∞
WARNING: Note that −∞ ψjk = 0 for all j and k. Hence if f ∈ L1 (R) ∩ L2 (R)
R∞ 2
and −∞ f 6= 0 (e.g. e−t /2 ), then the finite sums in Cor. 3.6 converge in L2 norm
but not in L1 norm, since
Z ∞ ∞
X ∞
X Z ∞
(f − cjk ψjk ) = f (constant) 6= 0,
−∞ j=−∞ k=−∞ −∞
4.1 Definition
Let b > 0. The Paley–Wiener space, P W (b), is the space of all functions
f ∈ L2 (R) such that fˆ(w) = 0 for all |w| > b. Also written P W (−b, b).
4.2 Theorem
If f ∈ P W (b), then f is the restriction to R of a function analytic on the whole
of C (which we also call f ), satisfying
for z = x + iy ∈ C.
12
Proof: Define
Z b
f (z) = ˆ
f(w)e2πiwz
dw,
−b
and
Z b
g(z) = ˆ
(2πiw)f(w)e2πiwz
dw.
−b
Clearly,
Z b
|f (z)| ≤ |fˆ(w)||e2πiwz | dw
−b
Thus the integral for f converges absolutely, and similarly for g. Also,
Z v Z vZ b
g(z) dz = ˆ
(2πiw)f(w)e 2πiwz
dw dz,
0 0 −b
and, changing the order of integration and using Fubini’s theorem, we get that
this is Z b
fˆ(w) e2πiwv − 1 dw = f (v) − f (0),
−b
′
and hence f (v) = g(v), i.e., f is analytic on C.
Functions satisfying such an inequality are said to be of exponential type. There
is a converse, which we don’t prove:
if f is analytic on C such that f|R lies in L2 (R) and, for some A > 0, we have
|f (z)| ≤ Ae2πb|y| for all z = x + iy ∈ C, then f ∈ P W (b).
4.3 Corollary
A function cannot be both time-limited and band-limited, unless it is identically
zero.
Proof: If f is analytic on C and f (t) = 0 for all real t with |t| > T , then f ≡ 0
by the principle of isolated zeroes.
LECTURE 6
We’re now working towards the sampling theorem for band-limited functions.
13
4.4 Lemma
Let f ∈ P W (b). Then
1 X m mπiw/b 1 X n −nπiw/b
∞ ∞
ˆ
f (w) = f − e = f e , (1)
2b n=−∞ 2b 2b n=−∞ 2b
and Z b
1 1 m
hfˆ, em i = √ fˆ(w)e−mφiw/b dw = √ f − .
2b −b 2b 2b
The other identity is obtained on writing n = −m.
converging in L2 (R) (i.e., P W (b)) norm, as well as uniformly. (As usual, we give
sin x
x
the value 1 at x = 0.)
Proof: Since the Fourier transform preserves L2 norms, we can invert (1) to
get
Z
1 X n b −nπiw/b 2πitw
∞
f (t) = f e e dw
2b n=−∞ 2b −b
∞ Z b
1 X
= f (tn ) e−2πitn w e2πitw dw,
2b n=−∞ −b
∞ b
1 X e2πi(t−tn )w
= f (tn ) ,
2b n=−∞ 2πi(t − tn ) −b
14
which gives the result since
4.6 Theorem
n √1 kt ,
Let tn = 2b for n ∈ Z. Then the functions 2b n
for n ∈ Z, form an orthonormal
basis for P W (b). Moreover,
Definition. A reproducing kernel Hilbert space is a Hilbert space H consisting
of functions on a set S, such that for each s ∈ S there is a ks ∈ H such that
hf, ks i = f (s) for all f ∈ H.
Examples.
sin 2πb(t−s)
1. P W (b) with ks (t) = π(t−s)
.
P∞
2. The Hardy space H 2 of all power series f (z) = n=0 an z
n
in the disc
D = {z ∈ C : |z| < 1}, with
∞
X
2
kf k = |an |2 < ∞,
n=0
15
and inner product
∞
X ∞
X ∞
X
h an z n , bn z n i = an bn .
n=0 n=0 n=0
1
For w ∈ D we take kw (z) = 1−wz
, the Cauchy–Szegö kernel.
a(j) = aj = ha, ej i
for j ∈ S, with ej the jth standard basis vector (all coordinates 0, except
that the jth is 1).
4.7 Proposition
Let H be a reproducing kernel Hilbert space with r.k. ks ∈ H for s ∈ S. Then
for s, t ∈ S we have ks (t) = kt (s). Also ks (s) > 0 unless f (s) = 0 for all s ∈ H.
Proof: We have
ks (t) = hks , kt i = hkt , ks i = kt (s).
Also ks (s) = hks , ks i = kks k2 , and ks 6= 0 unless f (s) = hf, ks i = 0 for all f ∈ H.
Note in P W (b) we have ks (t) = kt (s) = k0 (t − s) (real and symmetric).
LECTURE 7
Remark. The sampling theorem gives a way of reconstructing f from values
n
f (tn ), with tn = 2b , but is not robust – small errors in measurements can eas-
ily add up. Oversampling helps avoid this problem: since P W (b) ⊂ P W (c) for
n
c > b, we can reconstruct f from values f ( 2c ) if we wish.
1
We take b = 2
from now on, and write V0 = P W (1/2). Then V0 has an orthonor-
mal basis
sin π(t − k)
t 7→ φ(t − k) = , (k ∈ Z),
π(t − k)
sin πt
where φ(t) = k0 (t) = πt
.
4.8 Theorem
16
For j ∈ Z, let Vj = P W (2j−1). We have
. . . V−2 ⊂ V−1 ⊂ V0 ⊂ V1 ⊂ V2 ⊂ . . . .
S T
Also j∈Z Vj = L2 (R) and j∈Z Vj = {0}. Moreover, for each j, we have
f (t) ∈ V0 ⇐⇒ f (2j t) ∈ Vj . Finally, Vj has an orthonormal basis consisting
of the functions 2j/2 φ(2j t − k), with k ∈ Z.
17
In the transform space, we want ψ̂ supported on [−1, 1] and orthogonal to all
functions fˆ in L2 (− 12 , 12 ). So we define
(
1 if 12 ≤ |w| ≤ 1,
ψ̂(w) =
0 otherwise.
Proof:
Z ∞
ψ̂k (w) = ψ(t − k)e−2πiwt dt
Z−∞
∞
= ψ(x)e−2πiw(x+k) dx, with x = t − k,
−∞
−2πiwk
= e ψ̂(w).
Now
Z Z !
−1/2 1
hψ̂k , ψ̂ℓ i = + e−2πiwk e2πiwℓ dw
−1 1/2
(
1 if k = ℓ,
=
0 if k 6= ℓ.
LECTURE 8
Finally, since
1 1 1 1
L2 (−1, 1) = L2 (− , ) ⊕ L2 ((−1, − ) ∪ ( , 1)),
2 2 2 2
it is sufficient to verify that (ψ̂k ) is an o.n.b. for L2 ((−1, − 12 ) ∪ ( 12 , 1)) (we have
already seen that they are orthonormal).
Note that (ek (w))k∈Z := (e2πiwk )k∈Z is an o.n.b. for L2 (0, 1).
18
Given γ ∈ L2 ((−1, − 12 ) ∪ ( 21 , 1)), define e
γ ∈ L2 (0, 1) by
(
γ(w − 1) if 0 < w < 21 ,
γe(w) =
γ(w) if 21 < w < 1.
Given ǫ > 0 there is an N and a−N , . . . , aN ∈ C such that
Z 1 N
X
2
−2πiwk
e
γ (w) − ak e dw < ǫ2 .
0
k=−N
This equals
Z Z ! 2
−1/2 1 N
X
−2πiwk
+ γ(w) − ak e dw,
−1 1/2
k=−N
PN
and so kγ − −N ak ψ̂k k2 < ǫ, as required.
4.10 Theorem
The functions ψjk (t) = 2j/2 ψ(2j t − k), (j, k ∈ Z), form an orthonormal basis
for L2 (R), the Littlewood–Paley wavelets.
Examples.
1. Any orthonormal basis (φk ), and then we can take A = B = 1.
2. In Cn , any basis {φ1 , . . . , φn }.
19
5.2 Theorem
The sequence (φk ) is a Riesz basis for H if and only if there is a bounded invert-
ible linear operator U : H → H such that (Uφk ) is an orthonormal basis for H.
Also,
X
X
ak (Uφk )
≤ kUk
ak φk
, and so
X
X 1/2
ak φk
≤ kUk−1 |ak |2 .
Conversely, if (φk ) is a Riesz basis, then let (ψk ) be any o.n.b. for H. Define
U : H → H by X X
U ak φk = ak ψk
for finite sums. Then
X
X
−1
ak ψk
≤ A
ak φk
, and
X
X
ak φk
≤ B
ak ψk
,
so that U and U −1 are bounded on finite sums. Since the closed linear span of
the (φk ) and (ψk ) both equal H, we can extend U to an isomorphism from H to
H.
5.3 Definition
A sequence (φk ) in a Hilbert space H is called a frame, if there are constants
A, B > 0 such that
X
Akφk2 ≤ |hφ, φk i|2 ≤ Bkφk2 ∀φ ∈ H. (2)
Examples.
1. Orthonormal bases, with A = B = 1.
2. Riesz bases, since
X X X
|hφ, φk i|2 = |hφ, U −1 Uφk i|2 = |h(U −1 )∗ φ, ψk i|2 = k(U −1 )∗ φk2 ,
which can be bounded above by k(U −1 )∗ k2 kφk2 , and below by kU ∗ k−2 kφk2 .
20
However, frames needn’t be linearly independent. If (φk ) is a frame, then adding
in the zero vector, or repeating each vector a variable number ≤ N times, does
not stop it being a frame. Indeed, any finite set spanning Cn is a frame in Cn
(exercise).
We call the numbers hφ, φk i the frame coefficients of φ and would like to use
them to reconstruct φ.
form a frame for L2 (R) with B/A ≈ 1.116. Used in vision analysis, and process-
ing seismic data.
A frame is tight, if A = B, e.g., any orthonormal basis. There are other examples;
e.g., in C2 , the set
( √ √ )
1 3 1 3
{φ1 , φ2 , φ3 } = (1, 0), (− , ), (− , − )
2 2 2 2
satisfies
3
X 3
|hφ, φk i|2 = kφk2
k=1
2
for all φ ∈ C2 .
LECTURE 9
From now on we are looking at countably infinite frames in a separable Hilbert
space.
5.4 Theorem
A sequence (φk ) in a Hilbert space H is a frame for H if and only if there is a
surjective bounded linear operator T : ℓ2 → H such that
X
T ((ak )) = ak φk for all (ak ) ∈ ℓ2 .
21
Proof: Suppose that T exists, and consider T ∗ : H → ℓ2 . Since
X X
hT ∗ ψ, (ak )i = hψ, T ((ak ))i = hψ, ak φk i = ak hψ, φk i,
where φek = S −1 φk .
5.5 Theorem
22
The sequence (φek ) = (S −1 φk ) forms a frame in H, called the dual frame to (φk ).
Moreover, the dual frame to (φek ) is (φk ) again.
Since S is self-adjoint and invertible, we can say that ∃A′ , B ′ > 0 such that
Se = TeTe∗ = S −1 T T ∗ S −1 = S −1 SS −1 = S −1 .
e
Thus φek = Se−1 φek = S φek = φk .
5.6 Theorem
We have
2
B−A
I − S
≤ < 1, (3)
B+A
B+A
and so ∞ n
−1 2 X 2
S = I− S . (4)
B + A n=0 B+A
Proof: Let C denote (A + B)/2. Then
23
Hence, if
I − B+A
2
S
is small, we have a rapidly converging series for S −1 . Then
we can use this, since
X X
ψ= hψ, φk i(S −1 φk ) = S −1 hψ, φk iφk ,
LECTURE 10
5.7 Corollary
PN
and φek =
2 (N ) 2
Let R = I − B+A
S, B+A n=0 Rn φk for k = 1, 2, . . .. Then
X
B − A N +1
hψ, φk iφek
≤
(N )
ψ − kψk.
B+A
Proof: ∞
2 X n
φek = R φk = φek + RN +1 φek .
(N )
B + A n=0
Hence
X
X
(N )
ψ − hψ, φk iφek
=
hψ, φk iRN +1 φek
= kRN +1 ψk.
But kRk ≤ (B − A)/(B + A), by Theorem 5.6, and hence the result.
24
eˆ
fe(w, t) = e−2πiwt f(−t, w), where the windowed transform of fˆ is formed using
the window ĝ.
Proof: We have
e t) = hf, gw,ti = hf,
f(w, ˆ ĝw,ti,
So Z ∞
fe(w, t) = e−2πiwt fˆ(ν)e2πiνt ĝ(ν − w) dν,
−∞
as required.
−πt2
Hence if g and ĝ are both well-localised (e.g. g(t) = 21/4 e and ĝ(w) =
1/4 −πw 2
2 e ), then the windowed Fourier transform gives localised behaviour of
both f and fˆ simultaneously. But there is a limitation as to how well we can do.
25
More generally, for all real t0 and w0 we have, for g ∈ L2 (R), that
Z ∞ 1/2 Z ∞ 1/2
2 2 2 2 1
(t − t0 ) |g(t)| dt (w − w0 ) |ĝ(w)| dw ≥ kgk22.
−∞ −∞ 4π
ˆ
f(w) = g(w + w0 )e2πiwt0 e2πiw0 t0 .
LECTURE 11
Proof of Theorem 6.2: Without loss of generality we may assume that f is
ˆ
C 1 and that |f (t)| = O(1/t2) at ±∞ and |f(w)| = O(1/w 2) at ±∞. This is a
dense set of functions, and the general case follows by approximation.
Now
Z ∞
2
kf k2 = 1.f f
−∞
h i∞ Z ∞
= tf (t)f (t) − t[f ′ (t)f (t) + f (t)f ′ (t)] dt,
−∞ −∞
ˆ
again integrating by parts, so that kf ′k2 = kF f ′k2 = 2πkw f(w)k 2.
Hence
ˆ
kf k22 ≤ 4πktf (t)k2kw f(w)k 2.
The inversion formula.
We write C for kgk2.
6.3 Theorem
26
Z ∞Z ∞
1
f (u) = 2 fe(w, t)g(u − t)e2πiwu dw dt,
C −∞ −∞
assuming that the integrals converge absolutely.
Hence
Z ∞ Z ∞ Z ∞
2
f (u)|g(u − t)| dt = e t)g(u − t)e2πiwu dw dt.
f(w, (5)
−∞ −∞ −∞
where gmn (u) = e2πimw0 u g(u − nt0), so by the results of Section 5, we get a stable
inversion formula if and only if the (gmn ) form a frame.
It turns out to be necessary, but not sufficient in general, that w0 t0 < 1 (so sam-
pling rapidly enough); for the Gaussian window this is also sufficient (Lyubarskii,
1989).
27
7.1 Definitions
Let ψ ∈ L2 (R) satisfy the admissibility condition
Z ∞
|ψ̂(w)|2
Cψ := dw < ∞. (6)
−∞ |w|
We write
x,y −1/2 t−y
ψ (t) = |x| ψ ,
x
for x, y ∈ R with x 6= 0.
For example, take ψ(t) to be the Haar wavelet, so that
(1 − e−iπwt )2
ψ̂(w) = = O(w) as w → 0.
2πiw
√
Then ψ x,y (for x > 0 at least) takes values ±1/ x on the interval between y and
x + y.
Now define
◦
f (x, y) = hf, ψ x,y i
Z ∞
−1/2 t−y
= f (t)|x| ψ ,
−∞ x
the Wavelet Transform of f .
Note that if ψ̂ is continuous, then by (6) it must satisfy ψ̂(0) = 0, i.e.,
Z ∞
ψ(t) dt = 0.
−∞
So ψ cannot be a positive function like the Gaussian, but it could be the Mexican
hat function.
LECTURE 12
7.2 Theorem
For all f, g ∈ L2 (R), we have
Z ∞Z ∞ ◦
◦ dx dy
f (x, y)g (x, y) 2 = Cψ hf, gi.
−∞ −∞ x
Proof: (Fuller calculations given in lectures.) The LHS above equals
Z ∞
dx dy
A := hf, ψ x,y i hψ x,y , gi 2 .
−∞ x
28
Now
d
ψ b
x,y (w) = |x|1/2 e−2πiwy ψ(xw),
so Z ∞ Z ∞
b ψbx,y i hψbx,y , b dx dy dx
A= hf, gi = hF̌x , Ǧx i ,
−∞ |x| −∞ |x|
where F̌x (y) denotes the inverse Fourier transform of Fx (w) = fb(w)ψ(xw).
b Thus
Z ∞ Z ∞
dx dy b
b ψ(xw)b b dx dw
A = hFx , Gx i = f(w) g (w)ψ(xw)
−∞ |x| −∞ |w|
= Cψ hfb, b
g i = Cψ hf, gi.
Remark. We can interpret this as saying that
Z ∞Z ∞ ◦
−1 ψ x,y
f = Cψ f (x, y) 2 dx dy,
−∞ −∞ x
where the integral is taken in a weak sense (i.e., take the inner product of both
sides with any g ∈ L2 (R).
In fact, it converges in norm as a vector-valued integral, too, in the sense that
Z Z
◦ ψ x,y
f − C −1
f (x, y) dx dy
→0
ψ
x2
|y|≤c a≤|x|≤b
7.3 Corollary
For f ∈ L2 (R), Z Z
∞ ∞ ◦ dx dy
kf k22 = Cψ−1 | f (x, y)|2 ,
−∞ −∞ x2
2
i.e., the wavelet transform maps L (R) isometrically into a closed subspace H of
L2 (R2 , dµ), where the measure on R2 is given by dµ = Cdxψdy
x2
.
Further, H is a reproducing kernel Hilbert space with kernel
k(s,t) (x, y) = hψ s,t , ψ x,y i.
Proof: The first part follows directly from Theorem 7.2, on putting f = g.
For the reproducing kernel properties, note that
◦ ◦ ◦
f (s, t) = hf, ψ s,t i = hf, ψ s,t iL2 (R2 ,dµ)
Z ∞Z ∞ ◦
−1
◦ dx dy
= Cψ f (x, y)ψ s,t(x, y) 2
−∞ −∞ x
◦
= hf , k(s,t) iL2 (R2 ,dµ) ,
29
where ◦
s,t
k(s,t) (x, y) =ψ (x, y) = hψ s,t , ψ x,y i.
Remark. This transform has found applications in the analysis of radar and
sonar data; there are also connections with quantum mechanics and group rep-
resentations.
It is considered superior to the windowed Fourier transform (WFT) because the
WFT cannot deal efficiently with several scalings/resolutions at once (the “win-
dow” is a fixed size). So the wavelet transform tends to be more efficient.
Fix a > 1 and b > 0. For j, k ∈ Z set x = a−j and y = ka−j b. Write
LECTURE 13
. . . V−2 ⊂ V−1 ⊂ V0 ⊂ V1 ⊂ V2 ⊂ . . . .
S T
such that j∈Z Vj = L2 (R) and j∈Z Vj = {0}.
We also ask that for j, k ∈ Z
30
• there is a function φ ∈ V0 such that (φk )k∈Z form an o.n.b. of V0 , where
φk (t) = φ(t − k).
Then for j ∈ Z the space Vj has an orthonormal basis consisting of the functions
Wj = {f ∈ Vj+1 : hf, gi = 0 ∀g ∈ Vj }
= {f ∈ Vj+1 : hf, φjk i = 0 ∀k ∈ Z},
By the same arguments as before, once we have a MRA we can deduce that
and
π
θ(u) + θ(1 − u) =
2
for all u ∈ [0, 1]. The way we shall do this is to let Pk be an odd polynomial of
(m)
degree 2d + 1 satisfying Pk (1) = 1 and Pk = 0 for m = 1, 2, . . . , d (no condition
here if d = 0).
Thus we can take P0 (x) = x, and P1 (x) = 23 x − 21 x3 .
31
Define
π 1 2
θ(u) = (1 + Pk (6u − 3)) for θ ∈ , .
4 3 3
Finally we define the “father wavelet” by means of its Fourier transform, taking
b
φ(w) = cos(θ(|w|)) on [−1, 1]
and 0 outside that interval. Clearly φb ∈ C d and has compact support. This
implies that φ is smooth and rapidly-decreasing, in that td φ(t) lies in L2 (R).
8.2 Proposition
For any φ ∈ L2 (R) the orthonormality property
hφk , φℓ i = δk,ℓ
is equivalent to the condition that
∞
X
K(w) := b + k)|2 = 1
|φ(w a.e.
k=−∞
(exchanging the integral sum is O.K. by the monotone convergence theorem, for
example). So K ∈ L1 (0, 1) and
Z ∞
hφk , φℓ i = hφbk , φbℓ i = b
φ(w)e−2πikw b
φ(w)e2πiℓw dw
−∞
Z ∞ Z 1
= b
|φ(w)| 2 −2πi(k−ℓ)w
e dw = K(w)e−2πi(k−ℓ)w dw,
−∞ 0
8.3 Proposition
2
Let (Vn ) and φ give a MRA
P of2 L (R). Then
2
(i) ∃(hk )k ∈ ℓ (Z) with |hk | = 1 such that
X
φ= hk φ1,k .
P∞ 2πikt
(ii) There is a 1-periodic function h(t) = k=−∞ hk e (convergence in L2 ),
such that khkL2 (0,1) = 1 and
∞
X
hk hk+2ℓ = δℓ,0 , (ℓ ∈ Z).
k=−∞
P
(iii) A function gk φ1,k lies in W0 if and only if (gk ) ∈ ℓ2 and
∞
X
gk hk+2ℓ = 0, (ℓ ∈ Z).
k=−∞
and X X
φ(t + ℓ) = hj 21/2 φ(2t + 2ℓ − j) = hn+2ℓ 21/2 φ(2t − n),
n
33
with n = j − 2ℓ. Now the result follows since
X
hφ−ℓ , φi = hn+2ℓ hn = δℓ,0 .
n
P
(iii) A function χ = gk φ1,k lies in W0 if and only if hφ−ℓ , χi = 0 for all ℓ ∈ Z;
the calculation is now similar to (ii) above.
The Haar wavelet revisited
We have
1 1
φ = √ φ1,0 + √ φ1,1 ,
2 2
and so h(t) = √12 (1 + e2πit ).
In this case we defined
1 1
ψ = √ φ1,0 − √ φ1,1 ,
2 2
so our two sequences are
1 1
(hj ) = (. . . , 0, 0, √ , √ , 0, 0, . . .),
2 2
1 1
(gj ) = (. . . , 0, 0, √ , − √ , 0, 0, . . .), (8)
2 2
so that (gj ) is orthogonal to (hj ) and all its translates by an even number of steps.
LECTURE 15
Back to the general case
8.4 Proposition
Let h be as above, and consider it as a function in L2 (0, 1). then
(i)
Z 1
1
|h(t)|2 e4πiℓt dt = δ0,ℓ , (ℓ ∈ Z); and |h(t)|2 + |h(t + )|2 = 2 a.e.
0 2
P
(ii) a function g ∈ L2 (0, 1) corresponds to gk φ1,k in W0 if and only if
Z 1
h(t)g(t)e4πiℓt dt = 0 (ℓ ∈ Z).
0
Equivalently,
1 1
h(t)g(t) + h(t + )g(t + ) = 0 a.e.
2 2
34
We may equivalently write g(t) = λ(t)h(t + 12 ), with λ a 1-periodic function such
that λ(t) + λ(t + 12 ) = 0 a.e., or, equivalently
1
g(t) = e2πit µ(2t)h(t + ),
2
where µ is 1-periodic.
1 1
(|h(t)|2 − 1)e2πit = (|h(t + )|2 − 1)e2πi(t+ 2 ) ,
2
or |h(t)|2 − 1 = −(|h(t + 21 )|2 − 1), as required.
(ii) Similarly, h(t)g(t)e2πit is 12 -periodic, i.e.,
1 1 1
h(t)g(t)e2πit = h(t + )g(t + )e2πi(t+ 2 ) .
2 2
Now if h(t + 21 ) = 0 then h(t) 6= 0, so g(t) = 0, and we can write g(t) =
λ(t)h(t + 12 ) a.e., with λ 1-periodic.
Now
1 1 1 1 1
h(t)g(t) + h(t + )g(t + ) = h(t)λ(t)h(t + ) + h(t + )λ(t + )h(t) = 0,
2 2 2 2 2
and so we have λ(t) + λ(t + 21 ) = 0 (if h(t + 21 ) = 0, then we can define λ(t) to
ensure this).
This means we can write λ(t) = e2πit µ(2t) with µ 1-periodic.
Example. Take µ ≡ 1. So g(t) = h(t + 12 ).
Thus X X 1
gn e2πint = hk e−2πik(t+ 2 ) e2πit ,
n k
n
or gn = (−1) h1−n . This fits in with the example (8) above.
35
8.5 Lemma
With φ and h as above we have
b 1 b
φ(w) = √ h(w/2)φ(w/2).
2
P
Letting ψ = gn φ1,n where gn = (−1)n h1−n , so that g(w) = e2πiw h(w + 1/2),
we have
b 1 b 1 b
ψ(w) = √ g(w/2)φ(w/2) = √ eπiw h((w + 1)/2)φ(w/2).
2 2
P
Proof: With φ = hk φ1,k we have
Z ∞√
b
φ1,k (w) = 2φ(2t − k)e−2πiwt dt
−∞
Z ∞
1
= √ φ(x)e−2πiw(x+k)/2 dx (x = 2t − k)
2 −∞
1 b −πiwk
= √ φ(w/2)e .
2
b b P b
So φ(w) = √12 φ(w/2) k hk e
−πiwk
with L2 convergence, which equals √12 h(w/2)φ(w/2).
b
Similarly ψ(w) b
= √12 g(w/2)φ(w/2).
Another example. √
For the Littlewood–Paley wavelet we had φb = χ(−1/2,1/2) , and h(w) = 2φ(2w)/
b b
φ(w),
which is 1-periodic.
b
We get that ψ(w) = eπiw χ(−1,−1/2)∪(1/2,1) (w). By choosing µ differently we can
obtain the ψ we obtained before.
8.6 Theorem
In the MRA situation of this chapter, let ν(w) be 1-periodic and satisfy |ν(w)| = 1
a.e. Define ψ by
b 1 b
ψ(w) = √ ν(w)h((w + 1)/2)φ(w/2).
2
Then (ψk )k∈Z , given by ψk (t) = ψ(t − k), generate an o.n.b. for W0 ; hence
(ψj,k )j,k∈Z, given by ψj,k (t) = 2j/2 ψ(2j − k), form an orthonormal basis for L2 (R).
LECTURE 16
36
9 Spline wavelets and compactly-supported wavelets
9.1 Definition
Suppose that a0 < a1 < . . . < aN are real and that n ≥ 1. A spline func-
tion of degree n, with knots at a0 , . . . , aN is a function f : [a0 , aN ] → R such that
f ∈ C n−1 [a0 , aN ] and f|[aj ,aj+1 ] is a polynomial of degree n for each 0 ≤ j ≤ N −1.
9.2 Proposition
With φ as above we have:
(i)
2
b sin πw
φ(w) = ;
πw
(ii)
1 1
φ(x) = φ(2x + 1) + φ(2x) + φ(2x − 1),
2 2
i.e., P
φ = hk φ1,k , where h−1 = h1 = 2√1 2 , and h0 = √12 , and the rest 0; as usual
φj,k (t) = 2j/2 φ(2j t − k).
(iii)
X∞
K(w) := b + k)|2 = 2 + 1 cos 2πw.
|φ(w
3 3
k=−∞
37
Proof: We leave parts (i) and (ii) as exercises. For (iii), note that K is a
period-1 function whose mth Fourier coefficient is hφ, φmi, as in the proof of
(8.2).
Now Z 1
2
hφ, φ0i = 2 (1 − x)2 dx = ,
0 3
and Z 1
1
hφ, φ1 i = hφ, φ−1i = (1 − x)x dx = ,
0 6
with all other inner products zero, so
2 1 2πiw 1 −2πiw 2 1
K(w) = + e + e = + cos 2πw.
3 6 6 3 3
In general, it is possible to construct similar smoother φ that is a finite linear
combination on translates, and with
n+1
b −iπrw sin πw
φ(w) = e ,
πw
satisfies 0 < A ≤ K(w) ≤ B for almost all w ∈ R. Then the family (φk )k∈Z
forms a Riesz basis for its closed linear span V0 , and an orthonormal basis can be
obtained by taking the functions (φ♯k )k∈Z defined by
38
Proof: We calculate
2
2
2
X
X
X
b
−2πilw b
cl φ l
=
cl φ l
=
cl e φ(w)
l 2 l 2 l
Z ∞ X
2
−2πilw b
= cl e |φ(w)|2 dw
−∞
l
∞ Z k+1 X
2
X 2
−2πilw b
= cl e φ(w) dw
k=−∞ k l
X∞ Z 1 X 2 2
−2πilw b
= cl e φ(w + k) dw
k=−∞ 0 l
Z 1 X
2
−2πilw
= cl e K(w) dw.
0 l
Thus
2
X
X
X
2
A |cl | ≤
cl φ l
≤ B |cl |2 ,
l l 2 l
We should remark that (φk ) and (φ♯k ) have the same closed linear span, since
V0 := {f ∈ L2 (R) : fb = ν φ,
b ν ∈ L2 (0, 1), extended 1-periodically}
= {f ∈ L2 (R) : fb = ν1 φb♯ , ν1 ∈ L2 (0, 1), extended 1-periodically},
Now φ♯ is still a spline, but no longer compactly supported. It does have exponen-
tial decay, however. V0 consists of all continuous L2 functions that are piecewise
39
linear on every [n, n + 1]. We could now work out ψ as in Section 8, but it gets
very complicated.
LECTURE 17
Compactly-supported wavelets
For good localization we would like wavelets which decrease rapidly as |t| → ∞.
The following result links the behaviour of f and fb.
9.4 Proposition
(i) Suppose that f, f ′ , . . . , f (n) are all L1 (R) functions (so that in fact f, f ′ , . . . , f (n−1)
are all absolutely continuous as well). Then fb(w) = O(|w|−n) as |w| → ∞.
b
(ii) Suppose that f(w) = O(|w|−n) as |w| → ∞. Then f is C n−2 .
(iii) Parts (i) and (ii) hold if we swap the roles of f and fb.
Proof: (i)
Z ∞
b
f(w) = f (t)e−2πiwt dt
−∞
Z ∞
1
= f ′ (t)e−2πiwt dt
2πiw −∞
= ... Z ∞
1
= f (n) (t)e−2πiwt dt,
(2πiw)n −∞
9.5 Corollary
40
If f is C ∞ with all derivatives in L1 (R), then fb decreases more rapidly than any
power of w. Likewise, if fb decreases rapidly, then f ∈ C ∞ .
9.6 Proposition
Suppose that φ is a compactly-supported function, φ(0)b = 0, that φ generates a
MRA (Vk ), and
√ (φk ) is an o.n.b. for V0 . Then h(t) is a trigonometric polynomial
in t, h(0) = 2, and ψ is also compactly-supported.
Proof: Since φ and φ1,k are compactly-supported, it’s clear that only finitely-
many φ1,k have supports meeting supp φ, so hk = 0 for all but finitely-many
k. √
b
Also φ(w) b
= √12 h(w/2)φ(w/2); put w = 0 and we see that h(0) = 2.
P
Finally, gk = h1−k (−1)k 6= 0 for only finitely-many k and so ψ = gk φ1,k is
compactly-supported as each φ1,k is.
41
LECTURE 18
9.7 Proposition
The scaling function h determines φ (up to a constant) by the identity
Y∞
b b 1 m
φ(w) = φ(0) √ h(w/2 ) .
m=1
2
√
Proof: Since h(0) = 2 and h is a trigonometric polynomial, we have
w
h(w/2m ) = h(0) + m h′ (ξ),
2
for some 0 < ξ < w/2m . Thus there is a constant C > 0 such that
1
√ h(w/2 ) − 1 ≤ Cw/2m
m
2
9.8 Construction
√
We take h to be given by h(w) = 2P (e2πiw ), with P a polynomial of degree
2N − 1; here N ≥ 1 is a fixed integer.
If we have |P (z)|2 + √
|P (−z)|2 = 1 whenever |z| √
= 1, then, with z = exp(2πiw),
we will have h(w) = 2P (z) and h(w + 1/2) = 2P (−z), so we will obtain
|h(w)|2 + |h(w + 1/2)|2 = 2,
which is one of the necessary conditions.
We shall choose N
1+z
P (z) = W (z),
2
with deg W = N − 1. Thus P (−1) = 0 and we will take P (1) = 1 by making
W (1) = 1.
This property leads to smoothness of φ (details omitted).
42
9.9 Proposition
For N = 1 the construction produces P (z) = (1+z)/2, and |P (z)|2 +|P (−z)|2 = 1
for |z| = 1.
Then h(w) = √12 (1 + e2πiw ), the scaling function for the Haar wavelet.
and if |P (z)|2 equals the sum of the first N terms, then P satisfies the conditions
of Construction 9.8.
43
and indeed
2N
X −1
2 2 2N − 1
|P (z)| + |P (−z)| = C 4N −2−2k S 2k = 1.
k=0
k
Note that z = −1 is when w = 1/2 and C = 0, and |P (z)|2 has a zero of degree
4N − 2 − 2(N − 1) = 2N there, so P will have a zero of degree N.
Example. N = 2.
N
X −1
2 2N − 1
|P (z)| = C 4N −2−2k S 2k
k
k=0
X1
3
= C 6−2k S 2k
k
k=0
= C + 3C 4 S 2
6
1 + z 4 2
= (C + 3S 2 ).
2
We now need to do a spectral factorization, choose W a polynomial of degree 1
with W (1) = 1 such that
or
z+z
|W (z)|2 = 2 −
2
when |z| = 1.
We try W (z) = a + bz with a, b real, so that |W (z)|2 = (a + bz)(a + bz), and
hence 2 = a2 + b2 and − 21 = ab. Also a + b = W (1) = 1.
The solution to these equations is
2
1 1+z h √ √ i
P (z) = (1 ± 3) + (1 ∓ 3)z .
2 2
9.11 Final details
P determines h, and h determines φb and hence φ, by Prop. 9.7. Using the coef-
ficients of h we can now obtain ψ too.
44
1. φ is compactly-supported – the idea is to show that the infinite product
defining φb produces an entire function lying in a suitable Paley–Wiener
space, and hence φ itself has compact support contained in [0, 2N − 1].
THE END
45