Burdman - Lecture 03
Burdman - Lecture 03
Canonical Quantization
Since, as we saw before, Quantum Field Theory (QFT) emerges as we attempt to combine
quantum mechanics with special relativity it is natural to start with quantum mechanics
of a single particle. We will see that when trying to make this conform with relativistic
dynamics, we will naturally develop a way of thinking of the solution to this problem
that goes by the name of canonical quantization. Besides being conceptually natural, this
formalism will be useful when trying to understand the statistics of di↵erent states.
@ 1
i (x, t) = r2 (x, t) , (3.1)
@t 2m
where we set h̄ = 1. In terms of states and operators, we can define the wave function
as (x, t) = hx| , ti, i.e. in term of the state | , ti projected onto the position state |xi.
More generally, eq. (3.1) can be written as
@
i | , ti = H | , ti , (3.2)
@t
p2
H= , (3.3)
2m
resulting in (3.1). We would like to generalize this for the relativistic case, i.e. choosing
1
2 LECTURE 3. CANONICAL QUANTIZATION
p
H=+ p2 + m 2 , (3.4)
where again we use c = 1. If one uses this Hamiltonian in the Schrödinger equation one
gets
@ p
i (x, t) = r2 + m2 (x, t) . (3.5)
@t
But this is problematic for a relativistic equation since time and space derivatives are of
di↵erent order. If the equation has to have any chance of being Lorentz invariant, it needs
to have the same number of time and space derivatives. One simple way to do this is to
apply the time derivative operator twice on both sides. This results in
@2
(x, t) = r2 + m 2 (x, t) . (3.6)
@t2
This is the Klein-Gordon equation for the wave function (x, t), and is clearly consistent
with the relativistic dispersion relation (3.4), once we make the identifications
@
i $H ir $ p , (3.7)
@t
where H and p are the hamiltonian and momentum operators. In covariant notation, and
using
@ @ @ @
⌘ @ µ = ( , r) , ⌘ @ µ = ( , r) , (3.8)
@xµ @t @xµ @t
This is manifestly Lorentz invariant. However it has several problems. The fact that
this equation has two time derivatives implies for instance that | (x, t)|2 is not generally
time independent, so we cannot interpret it as a conserved probability, as it is in the case
of the Scrödinger equation. This issue is tackled by Dirac, which derives a relativistic
equation for the wave-function that is first order in both time and space derivatives. But
this equation will be valid for spinors, not scalar wave-functions. We will study it in the
next lecture in detail. But it does not resolve the central issue, as we see below.
Both the Klein-Gordon and the Dirac equations admit solutions with negative energies.
This would imply that the system does not have a ground state, since it would be always
3.2. CANONICAL DESCRIPTION OF QUANTUM FIELDS 3
energetically favorable to go to the negative energy states. Since the Dirac equation
describes fermions, one can use Pauli’s exclusion principle and argue, as Dirac did, that all
the negative energy states are already occupied. This is the so-called Dirac sea. According
to this picture, an electron would not be able to drop to negative energy states since these
are already filled. Interestingly, this predicts that in principle it should be possible to
kick one of the negative energy states to a positive energy state. Then, one would see
an electron appear. But this would leave a hole in the sea, which would appear as a
positively charged state. This is Dirac’s prediction of the existence of the positron. Is
really nice, but now we need an infinite number of particles in the sea, whereas we were
supposed to be describing the wave-function of one particle. Besides, this only works for
wave-functions describing fermions. What about bosons ?
What we are seeing is the inadequacy of the relativistic description of the one-particle
wave-function. At best, as in the case of fermions, we were driven from a one-particle
description to one with an infinite number of particles. At the heart of the problem is the
fact that, although now we have the same number of time and space derivatives, position
and time are not treated on the same footing in quantum mechanics. There is in fact a
position operator, whereas time is just a parameter labeling the states.
On the other hand, we can consider operators labeled by the spacetime position xµ = (t, x),
such as in
(t, x) = (x) . (3.10)
These objects are called quantum fields. They are clearly in the Heisenberg picture,
whereas if we choose the time-independent Schrödinger picture quantum fields they are
only labeled by the spatial component of the position as in (x). These quantum fields
will be our dynamical degrees of freedom. All spacetime positions have a value of (x)
assigned. As we will see in more detail below, the quantization of these fields will result
in infinitely many states. So we will abandon the idea of trying to describe the quantum
dynamics of one particle. This formulation will allow us to include antiparticles and (in
the presence of interactions) also other particles associated with other quantum fields.
It solves one of the problems mentioned earlier, the fact that relativity and quantum
mechanics should allow the presence of these extra particles as long as there is enough
energy, and/or the intermediate process that violates energy conservation by E lasts a
time t such that E t ⇠ h̄.
The behavior of quantum fields under Lorentz transformations will define their prop-
erties. We can have scalar fields (x), i.e. no Lorentz indices; fields that transform as
four-vectors: µ (x); as spinors: a (x), with a a spinorial index; as tensors, as in the rank
2 tensor µ⌫ (x); etc. We will start with the simplest kind, the scalar field.
dynamical variable obeys a relativistic equation relating space and time derivatives:
(@ 2 + m2 ) (x) = 0 , (3.11)
†
(x) = (x) , (3.12)
where we already anticipate to elevate the field to an operator, hence the †. It is interesting
to solve the Klein-Gordon equation for the classical field in momentum space. The most
general solution has the following form
Z
d3 p
(x, t) = N p ap e i(!p t p·x)
+ b†p ei(!p t p·x)
, (3.13)
(2⇡)3
p
where, as defined earlier, !p = + p2 + m2 . Here, Np is a momentum-dependent nor-
malization to be determined later, and the momentum-dependent coefficients ap and b†P
will eventually be elevated to operators. In general ap and b†p are independent. However,
when we impose (3.12), this results in
ap = b p . (3.14)
This is not the case, for instance, if (x) is a complex scalar field.
At this point and before we quantize the system, we remind ourselves of the fact that the
Klein-Gordon equation (3.11) is obtained from the lagrangian density
1 1 2
L = @µ @ µ m 2
. (3.15)
2 2
To convince yourself of this just use the Eüler-Lagrange equations from the previous
lecture to derive (3.11) from (3.15). Then, since (x) is our dynamical variable, the
canonically conjugated momentum is
@L
⇡(x) = = ˙, (3.16)
@ ˙ (x)
Z
d3 x
⇡(x, t) = Np i!p ap e i(!p t p·x)
+ i!p a†p ei(!p t p·x)
. (3.17)
(2⇡)3
Having the field and its conjugate momentum defined we can then impose quantization
conditions. It is useful first to refresh our memory on how this is done in quantum
mechanics.
1 2
L= q̇ V (q) , (3.18)
2
where V (q) is some still unspecified potential, which we assume it does not depend on
the velocities. The associated hamiltonian is
1 2
H= p + V (q) , (3.19)
2
where the conjugate momentum is p = @L/@ q̇ = q̇. To quantize the system we elevate p
and q to operators and impose the commutation relations
Notice that if we are in the Heisenberg description, the commutators should be evaluated
at equal time, i.e. [q(t), p(t)] = i, etc. We change to a description in terms of the operators
1
a ⌘ (! q + ip)
2!
1
a† ⌘ (! q ip) , (3.21)
2!
a|0i = 0 , (3.23)
where the 0 in the state refers to the absence of quanta. Then, assuming the ground state
(or vacuum) is a normalized state, we have
a† |0i = |1i ,
a|1i = |0i , (3.26)
which means that a and a† are ladder operators. We interpret the state |1i as a state with
one particle. In this way a and a† can also we called annihilation and creation operators.
The simplest example is, of course, the simple harmonic oscillator, with V (q) = ! 2 q 2 /2.
Here we see that the commutator is defined at equal times, as it should for Heisenberg
operators. All other possible commutators vanish, i.e.
Now, when we turned (x) into an operator, so did ap and a†p . In order to see what the
imposition of (3.27) and (3.28) implies for the commutators of the operators ap and a†p ,
3.2. CANONICAL DESCRIPTION OF QUANTUM FIELDS 7
we write out (3.27) using the explicit expressions (3.13) and (3.17) for the field and its
momentum in terms of them. We obtain
Z Z n
d3 p d 3 p0 i(!p !p0 )t ip·x ip0 ·x0
0
[ (x, t), ⇡(x , t)] = Np N p 0 i!p0 e e [ap , a†p0 ]
(2⇡)3 (2⇡)3
o
ip·x+ip0 ·x0
i! ep0
i(!p !p0 )t
e [a†p , ap0 ] , (3.29)
The question is what are the commutation rules for [ap , a†p0 ]. Now we will show that in
order for (3.27) to be satisfied, we need to impose
Z n o
0 d3 p 2 x0 ) ip·(x x0 )
[ (x, t), ⇡(x , t)] = i N !p eip·(x +e . (3.32)
(2⇡)3 p
Z Z
(3) 0 d3 p ip·(x x0 ) d3 p ip·(x x0 )
(x x)= e = e , (3.33)
(2⇡)3 (2⇡)3
then if
1
Np2 !p = , (3.34)
2
as long as
8 LECTURE 3. CANONICAL QUANTIZATION
1
Np = p . (3.36)
2!p
We can now go back to the expression (3.13) for the real scalar field, and rewrite it in
covariant form as
Z
d3 p ipµ xµ µ
(x) = p ap e + a†p eipµ x , (3.37)
(2⇡)3 2!p
pµ xµ = p0 x0 p · x = !p t p·x (3.38)
ap |0i = 0 , (3.39)
we conclude that ap and a†p are ladder operators, just as in the quantum mechanical case
seen above. In other words we have
where |1p i corresponds to the state containing one particle of momentum p. Conversely,
and analogously to the quantum mechanical case, we have
†
This allows us to interpret the operators (x) and (x) in the following form:
The operator (x):
Of course in our case, a real scalar field, particles and anti-particles are the same due
to (3.14). On the other hand, if was for instance complex, particles and anti-particles
would be created and annihilated by di↵erent operators, and they would carry di↵erent
“charges” under the global U (1) symmetry of the lagrangian. We will discuss this later
in the course.
3.3 Statistics
One of the advantages of this formulation is that we will be able to determine the statistics
of our Klein-Gordon quanta. In general a state of N particles can be defined by
But since the a†p ’s commute, then the state on the left is even under the exchange of any
two particles. Also, it is not a problem to have any two of the particles to have the same
momentum. For instance, if we have two particles
the fact that [a†p , a†q ] = 0 implies that it is allowed to have p = q. All this implies that the
quanta of the real scalar field (x) obeying the Klein-Gordon equation obey Bose-Einstein
statistics.
We will see that for other fields, we will need to impose anti-commuting rules . These
anti-commuting operators satisfy
meaning that the state is odd under the particle exchange. These particles (fermions)
obey Fermi statistics. This clearly means that these two particles cannot have the same
10 LECTURE 3. CANONICAL QUANTIZATION
momentum. This is called the Pauli exclusion principle. In the next lecture we will see
that fermions must have ladder operators obeying anti-commutations rules such as (3.44),
and therefore are subject to the Pauli exclusion principle.
Since now we know we can use the field operator (x) to create and annihilate, it looks
reasonable to compute the following amplitude
From the momentum expansion of (x) in terms of creation and annihilation operators
ap and a†p it is easy to realize that only terms like ap a†p0 will contribute to the vacuum
expectation value (VEV) in (3.47). This is because
whereas
Then (3.47) is
3.4. THE FEYNMAN PROPAGATOR 11
Z Z
d3 p d3 k
h0| (x) (y)|0i = p p e ip·x +ik·y
e h0|ap a†k |0i
(2⇡)3 2!p (2⇡)3 2!k
Z
d3 p ip·(x y)
= e ⌘ D(x y) , (3.50)
(2⇡)3 2!p
where in the last line we used (3.49) and we define the two-point function D(x y),
the propagator. But, as we discussed at length in the first lecture, this just gives us the
amplitude for one particular temporal order: x0 > y0 . We actually need an amplitude
that is causal, i.e. that is valid for whatever temporal order.
We will see that the following object is the propagator we are looking for. We define the
Feynman propagator as the two-point function given by
Z
d4 p i ip·(x y)
DF (x y) ⌘ e , (3.51)
(2⇡)4 p2 m2 + i✏
where m is the mass of the particle and ✏ is an infinitesimal, positive, real constant. Its
meaning will soon be clear. Let us just say for now that the choice of sign with wich ✏
enters in (3.51) is what matters, not its precise value.
To expose the issue of temporal order we rewrite the Feynman propagator as
Z
d3 p i ip0 (x0 y0 ) ip·(x y)
DF (x y) = dp0 2 e e , (3.52)
(2⇡)4 p0 !p2 + i✏
We will perform the integral in p0 . For this purpose we analytically continue it to the
complex plane. This is the reason for the introduction of the small imaginary part i✏.
First, we identify the poles of the integrand. They are at
⇢
!p i✏
p0 = , (3.54)
!p + i✏
in other words infinitesimally below and above the real p0 axis. To perform the integral
in the complex p0 plane we need to choose how to close the contour to apply Cauchy’s
residue theorem. Of course, we want the contour to only contribute for real values of p0
so we get the integral we actually want to perform. But there are two possible choices to
12 LECTURE 3. CANONICAL QUANTIZATION
close the contour: at very large and positive Im[p0 ] or very large and negative Im[p0 ]. As
we can see from (3.52), the choice that leads to a vanishing contribution from the infinte
radius semicircle depends on the sign of x0 y0 , that is on the temporal order. We will
consider each case separately.
x 0 > y0 :
For this temporal order, the factor
ip0 (x0 y0 )
e (3.55)
Figure 3.2: Closing the contour below for x0 > y0 to perform the p0 integral.
Z Z
d3 p i ip·(x y)
DF (x y) = dp0 e
(2⇡)4 (p0 !p )(p0 + !p )
Z
d3 p i ip·(x y)
= 4
( 2⇡i) e . (3.57)
(2⇡) 2!p
We then conclude that for the temporal order x0 > y0 the Feynman propagator is
3.4. THE FEYNMAN PROPAGATOR 13
Z
d3 p ip·(x y)
DF (x y) = e = D(x y) , (3.58)
(2⇡)3 2!p
which is the value of the amplitude computed in (3.50). We will now evaluate the Feynman
propagator for the other possible temporal order.
y 0 > x0 :
For this case, of course, we need to close the contour above in order to have an exponen-
tially decaying factor in (3.55). That is, we need
The contour is shown is Figure 3.3. We see that now the pole that contributes is p0 =
!p + i✏.
Figure 3.3: Closing the contour below for y0 > x0 to perform the p0 integral.
Z
d3 p i ip·(x y)
DF (x y) = (+2⇡i) e . (3.60)
(2⇡)4 2!p
This, together with the fact that we can change p ! p by switching the integration
limits in the d3 p integrals, allows us to make the replacement
14 LECTURE 3. CANONICAL QUANTIZATION
ip·(x y) ip·(y x)
e !e , (3.62)
for this temporal order. Then, for y0 > x0 the Feynman propagator is
Z
d3 p 1 ip·(y x)
DF (x y) = e = D(y x) . (3.63)
(2⇡)3 2!p
We can summarize the results of (3.58) and (3.63) for both temporal orders as
Defining the time order operator T , we can write (3.65) in a compact form as
This means that the Feynman or causal propagator is given by the VEV of the time-
ordered product of the fields. Finally, we can see that we obtained a causal propagator
with the choice of positive ✏. Had we chosen ✏ to be negative we would have obtained a
non-causal object. Thus, the choice of ✏ is crucial in order to get the correct propagator.
• Quantum Field Theory in a Nutshell, by A. Zee. See the discussion in Chapter 1.8
about canonical quantization.