Foundations I: States and Ensembles: 2.1 Axioms of Quantum Mechanics
Foundations I: States and Ensembles: 2.1 Axioms of Quantum Mechanics
[), (2.3)
for all vectors [), [) (where here I have denoted A[) as [A)). A is
self-adjoint if A = A
.
If A and B are self adjoint, then so is A + B (because (A + B)
=
A
+ B
) but (AB)
= B
n
a
n
P
n
. (2.4)
2.1. AXIOMS OF QUANTUM MECHANICS 3
Here each a
n
is an eigenvalue of A, and P
n
is the corresponding or-
thogonal projection onto the space of eigenvectors with eigenvalue a
n
.
(If a
n
is nondegenerate, then P
n
= [n)n[; it is the projection onto the
corresponding eigenvector.) The P
n
s satisfy
P
n
P
m
=
n,m
P
n
P
n
= P
n
. (2.5)
(For unbounded operators in an innite-dimensional space, the deni-
tion of self-adjoint and the statement of the spectral theorem are more
subtle, but this need not concern us.)
3. Measurement. In quantum mechanics, the numerical outcome of a
measurement of the observable A is an eigenvalue of A; right after the
measurement, the quantum state is an eigenstate of A with the mea-
sured eigenvalue. If the quantum state just prior to the measurement
is [), then the outcome a
n
is obtained with probability
Prob (a
n
) =| P
n
[) |
2
= [P
n
[); (2.6)
If the outcome is a
n
is attained, then the (normalized) quantum state
becomes
P
n
[)
([P
n
[))
1/2
. (2.7)
(Note that if the measurement is immediately repeated, then according
to this rule the same outcome is attained again, with probability one.)
4. Dynamics. Time evolution of a quantum state is unitary; it is gener-
ated by a self-adjoint operator, called the Hamiltonian of the system. In
the Schrodinger picture of dynamics, the vector describing the system
moves in time as governed by the Schrodinger equation
d
dt
[(t)) = iH[(t)), (2.8)
where H is the Hamiltonian. We may reexpress this equation, to rst
order in the innitesimal quantity dt, as
[(t + dt)) = (1 iHdt)[(t)). (2.9)
4 CHAPTER 2. FOUNDATIONS I: STATES AND ENSEMBLES
The operator U(dt) 1 iHdt is unitary; because H is self-adjoint
it satises U
), (2.12)
that preserves the absolute values of inner products
[[)[ = [
)[, (2.13)
for all [) and [). According to a famous theorem due to Wigner, a mapping
with this property can always be chosen (by adopting suitable phase conven-
tions) to be either unitary or antiunitary. The antiunitary alternative, while
important for discrete symmetries, can be excluded for continuous symme-
tries. Then the symmetry acts as
[) [
) = U[), (2.14)
where U is unitary (and in particular, linear).
Symmetries form a group: a symmetry transformation can be inverted,
and the product of two symmetries is a symmetry. For each symmetry op-
eration R acting on our physical system, there is a corresponding unitary
transformation U(R). Multiplication of these unitary operators must respect
the group multiplication law of the symmetries applying R
1
R
2
should be
equivalent to rst applying R
2
and subsequently R
1
. Thus we demand
U(R
1
)U(R
2
) = Phase (R
1
, R
2
)U(R
1
R
2
) (2.15)
The phase is permitted in eq. (2.15) because quantum states are rays; we
need only demand that U(R
1
R
2
) act the same way as U(R
1
)U(R
2
) on
rays, not on vectors. U(R) provides a unitary representation (up to a phase)
of the symmetry group.
So far, our concept of symmetry has no connection with dynamics. Usu-
ally, we demand of a symmetry that it respect the dynamical evolution of
the system. This means that it should not matter whether we rst transform
the system and then evolve it, or rst evolve it and then transform it. In
other words, the diagram
2.2. THE QUBIT 7
Initial Final
New Initial New Final
-
-
? ?
dynamics
dynamics
rotation rotation
is commutative. This means that the time evolution operator e
itH
should
commute with the symmetry transformation U(R) :
U(R)e
itH
= e
itH
U(R), (2.16)
and expanding to linear order in t we obtain
U(R)H = HU(R) (2.17)
For a continuous symmetry, we can choose R innitesimally close to the
identity, R = I + T, and then U is close to 1,
U = 1 iQ+ O(
2
). (2.18)
From the unitarity of U (to order ) it follows that Q is an observable,
Q = Q
] = i
km
J
m
, (2.23)
where
km
is the totally antisymmetric tensor with
123
= 1, and repeated
indices are summed. To implement rotations on a quantum system, we nd
self-adjoint operators J
1
, J
2
, J
3
in Hilbert space that satisfy these relations.
The dening representation of the rotation group is three dimensional,
but the simplest nontrivial irreducible representation is two dimensional,
given by
J
k
=
1
2
k
, (2.24)
where
1
=
_
0 1
1 0
_
,
2
=
_
0 i
i 0
_
,
3
=
_
1 0
0 1
_
, (2.25)
are the Pauli matrices. This is the unique two-dimensional irreducible rep-
resentation, up to a unitary change of basis. Since the eigenvalues of J
k
are
1
2
, we call this the spin-
1
2
representation. (By identifying J as the angular-
momentum, we have implicitly chosen units with = 1).
The Pauli matrices also have the properties of being mutually anticom-
muting and squaring to the identity,
k
= 2
k
1, (2.26)
2.2. THE QUBIT 9
So we see that ( n )
2
= n
k
n
= n
k
n
k
1 = 1. By expanding the
exponential series, we see that nite rotations are represented as
U( n, ) = e
i
2
n
= 1cos
2
i n sin
2
. (2.27)
The most general 2 2 unitary matrix with determinant 1 can be expressed
in this form. Thus, we are entitled to think of a qubit as the state of a spin-
1
2
object, and an arbitrary unitary transformation acting on the state (aside
from a possible rotation of the overall phase) is a rotation of the spin.
A peculiar property of the representation U( n, ) is that it is double-
valued. In particular a rotation by 2 about any axis is represented nontriv-
ially:
U( n, = 2) = 1. (2.28)
Our representation of the rotation group is really a representation up to a
sign
U(R
1
)U(R
2
) = U(R
1
R
2
). (2.29)
But as already noted, this is acceptable, because the group multiplication is
respected on rays, though not on vectors. These double-valued representa-
tions of the rotation group are called spinor representations. (The existence
of spinors follows from a topological property of the group it is not simply
connected.)
While it is true that a rotation by 2 has no detectable eect on a spin-
1
2
object, it would be wrong to conclude that the spinor property has no
observable consequences. Suppose I have a machine that acts on a pair of
spins. If the rst spin is up, it does nothing, but if the rst spin is down, it
rotates the second spin by 2. Now let the machine act when the rst spin
is in a superposition of up and down. Then
1
2
([ )
1
+[ )
1
) [ )
2
1
2
([ )
1
[ )
1
) [ )
2
. (2.30)
While there is no detectable eect on the second spin, the state of the rst
has ipped to an orthogonal state, which is very much observable.
In a rotated frame of reference, a rotation R( n, ) becomes a rotation
through the same angle but about a rotated axis. It follows that the three
components of angular momentum transform under rotations as a vector:
U(R)J
k
U(R)
= R
k
J
. (2.31)
10 CHAPTER 2. FOUNDATIONS I: STATES AND ENSEMBLES
Thus, if a state [m) is an eigenstate of J
3
J
3
[m) = m[m), (2.32)
then U(R)[m) is an eigenstate of RJ
3
with the same eigenvalue:
RJ
3
(U(R)[m)) = U(R)J
3
U(R)
U(R)[m)
= U(R)J
3
[m) = m(U(R)[m)) . (2.33)
Therefore, we can construct eigenstates of angular momentum along the axis
n = (sin cos , sin sin, cos ) by applying a rotation through , about the
axis n
2
n
_
= exp
_
2
_
0 e
i
e
i
0
__
=
_
cos
2
e
i
sin
2
e
i
sin
2
cos
2
_
, (2.34)
and applying it to
_
1
0
_
, the J
3
eigenstate with eigenvalue 1, we obtain
[(, )) =
_
e
i/2
cos
2
e
i/2
sin
2
_
, (2.35)
(up to an overall phase). We can check directly that this is an eigenstate of
n =
_
cos e
i
sin
e
i
sin cos
_
, (2.36)
with eigenvalue one. So we have seen that eq. (2.11) with a = e
i/2
cos
2
, b =
e
i/2
sin
2
, can be interpreted as a spin pointing in the (, ) direction.
We noted that we cannot determine a and b with a single measurement.
Furthermore, even with many identical copies of the state, we cannot com-
pletely determine the state by measuring each copy only along the z-axis.
This would enable us to estimate [a[ and [b[, but we would learn nothing
about the relative phase of a and b. Equivalently, we would nd the compo-
nent of the spin along the z-axis
(, )[
3
[(, )) = cos
2
2
sin
2
2
= cos , (2.37)
2.2. THE QUBIT 11
but we would not learn about the component in the xy plane. The problem
of determining [) by measuring the spin is equivalent to determining the
unit vector n by measuring its components along various axes. Altogether,
measurements along three dierent axes are required. E.g., from
3
) and
1
) we can determine n
3
and n
1
, but the sign of n
2
remains undetermined.
Measuring
2
) would remove this remaining ambiguity.
Of course, if we are permitted to rotate the spin, then only measurements
along the z-axis will suce. That is, measuring a spin along the n axis is
equivalent to rst applying a rotation that rotates the n axis to the axis z,
and then measuring along z.
In the special case =
2
and = 0 (the x-axis) our spin state is
[
x
) =
1
2
([
z
) +[
z
)) , (2.38)
(spin-up along the x-axis). The orthogonal state (spin down along the
x-axis) is
[
x
) =
1
2
([
z
) [
z
)) . (2.39)
For either of these states, if we measure the spin along the z-axis, we will
obtain [
z
) with probability
1
2
and [
z
) with probability
1
2
.
Now consider the combination
1
2
([
x
) +[
x
)) . (2.40)
This state has the property that, if we measure the spin along the x-axis, we
obtain [
x
) or [
x
), each with probability
1
2
. Now we may ask, what if we
measure the state in eq. (2.40) along the z-axis?
If these were probabilistic classical bits, the answer would be obvious.
The state in eq. (2.40) is in one of two states, and for each of the two, the
probability is
1
2
for pointing up or down along the z-axis. So of course we
should nd up with probability
1
2
when we measure along the z-axis.
But not so for qubits! By adding eq. (2.38) and eq. (2.39), we see that
the state in eq. (2.40) is really [
z
) in disguise. When we measure along the
z-axis, we always nd [
z
), never [
z
).
We see that for qubits, as opposed to probabilistic classical bits, proba-
bilities can add in unexpected ways. This is, in its simplest guise, the phe-
nomenon called quantum interference, an important feature of quantum
information.
12 CHAPTER 2. FOUNDATIONS I: STATES AND ENSEMBLES
It should be emphasized that, while this formal equivalence with a spin-
1
2
object applies to any two-level quantum system, of course not every two-level
system transforms as a spinor under rotations!
2.2.2 Photon polarizations
Another important two-state system is provided by a photon, which can
have two independent polarizations. These photon polarization states also
transform under rotations, but photons dier from our spin-
1
2
objects in two
important ways: (1) Photons are massless. (2) Photons have spin-1 (they
are not spinors).
Now is not a good time for a detailed discussion of the unitary represen-
tations of the Poincare group. Suce it to say that the spin of a particle
classies how it transforms under the little group, the subgroup of the Lorentz
group that preserves the particles momentum. For a massive particle, we
may always boost to the particles rest frame, and then the little group is
the rotation group.
For massless particles, there is no rest frame. The nite-dimensional
unitary representations of the little group turn out to be representations of
the rotation group in two dimensions, the rotations about the axis determined
by the momentum. Of course, for a photon, this corresponds to the familiar
property of classical light the waves are polarized transverse to the direction
of propagation.
Under a rotation about the axis of propagation, the two linear polarization
states ([x) and [y) for horizontal and vertical polarization) transform as
[x) cos [x) + sin[y)
[y) sin[x) + cos [y). (2.41)
This two-dimensional representation is actually reducible. The matrix
_
cos sin
sin cos
_
(2.42)
has the eigenstates
[R) =
1
2
_
1
i
_
[L) =
1
2
_
i
1
_
, (2.43)
2.3. THE DENSITY MATRIX 13
with eigenvalues e
i
and e
i
, the states of right and left circular polarization.
That is, these are the eigenstates of the rotation generator
J =
_
0 i
i 0
_
=
y
, (2.44)
with eigenvalues 1. Because the eigenvalues are 1 (not
1
2
) we say that
the photon has spin-1.
In this context, the quantum interference phenomenon can be described
this way: Suppose that we have a polarization analyzer that allows only
one of the two linear photon polarizations to pass through. Then an x or y
polarized photon has prob
1
2
of getting through a 45
o
rotated polarizer, and
a 45
o
polarized photon has prob
1
2
of getting through an x and y analyzer.
But an x photon never passes through a y analyzer. If we put a 45
o
rotated
analyzer in between an x and y analyzer, then
1
2
the photons make it through
each analyzer. But if we remove the analyzer in the middle no photons make
it through the y analyzer.
A device can be constructed easily that rotates the linear polarization of
a photon, and so applies the transformation Eq. (2.41) to our qubit. As
noted, this is not the most general possible unitary transformation. But if
we also have a device that alters the relative phase of the two orthogonal
linear polarization states
[x) e
i/2
[x)
[y) e
i/2
[y) , (2.45)
the two devices can be employed together to apply an arbitrary 22 unitary
transformation (of determinant 1) to the photon polarization state.
2.3 The density matrix
2.3.1 The bipartite quantum system
The last lecture was about one qubit. This lecture is about two qubits.
(Guess what the next lecture will be about!) Stepping up from one qubit to
two is a bigger leap than you might expect. Much that is weird and wonderful
about quantum mechanics can be appreciated by considering the properties
of the quantum states of two qubits.
14 CHAPTER 2. FOUNDATIONS I: STATES AND ENSEMBLES
The axioms of 2.1 provide a perfectly acceptable general formulation
of the quantum theory. Yet under many circumstances, we nd that the
axioms appear to be violated. The trouble is that our axioms are intended
to characterize the quantum behavior of the entire universe. Most of the
time, we are not so ambitious as to attempt to understand the physics of the
whole universe; we are content to observe just our little corner. In practice,
then, the observations we make are always limited to a small part of a much
larger quantum system.
In the next several lectures, we will see that, when we limit our attention
to just part of a larger system, then (contrary to the axioms):
1. States are not rays.
2. Measurements are not orthogonal projections.
3. Evolution is not unitary.
We can best understand these points by considering the simplest possible
example: a two-qubit world in which we observe only one of the qubits.
So consider a system of two qubits. Qubit A is here in the room with us,
and we are free to observe or manipulate it any way we please. But qubit
B is locked in a vault where we cant get access to it. Given some quantum
state of the two qubits, we would like to nd a compact way to characterize
the observations that can be made on qubit A alone.
Well use [0)
A
, [1)
A
and [0)
B
, [1)
B
to denote orthonormal bases for
qubits A and B respectively. Consider a quantum state of the two-qubit
world of the form
[)
AB
= a[0)
A
[0)
B
+ b[1)
A
[1)
B
. (2.46)
In this state, qubits A and B are correlated. Suppose we measure qubit A by
projecting onto the [0)
A
, [1)
A
basis. Then with probability [a[
2
we obtain
the result [0)
A
, and the measurement prepares the state
[0)
A
[0)
B
. (2.47)
with probability [b[
2
, we obtain the result [1)
A
and prepare the state
[1)
A
[1)
B
. (2.48)
2.3. THE DENSITY MATRIX 15
In either case, a denite state of qubit B is picked out by the measurement. If
we subsequently measure qubit B, then we are guaranteed (with probability
one) to nd [0)
B
if we had found [0)
A
, and we are guaranteed to nd [1)
B
if
we found [1)
A
. In this sense, the outcomes of the [0)
A
, [1)
A
and [0)
B
, [1)
B
A
0[
B
0[ + b
B
1[
B
1[) (M
A
1
B
)
(a[0)
A
[0)
B
+ b[1)
A
[1)
B
)
= [a[
2
A
0[M
A
[0)
A
+[b[
2
A
1[M
A
[1)
A
, (2.50)
(where we have used the orthogonality of [0)
B
and [1)
B
). This expression
can be rewritten in the form
M
A
) = tr (M
A
A
) , (2.51)
A
= [a[
2
[0)
A A
0[ +[b[
2
[1)
A A
1[, (2.52)
and tr() denotes the trace. The operator
A
is called the density operator
(or density matrix) for qubit A. It is self-adjoint, positive (its eigenvalues are
nonnegative) and it has unit trace (because [) is a normalized state.)
Because M
A
) has the form eq. (2.51) for any observable M
A
acting
on qubit A, it is consistent to interpret
A
as representing an ensemble of
possible quantum states, each occurring with a specied probability. That
is, we would obtain precisely the same result for M
A
) if we stipulated that
qubit A is in one of two quantum states. With probability p
0
= [a[
2
it is
in the quantum state [0)
A
, and with probability p
1
= [b[
2
it is in the state
16 CHAPTER 2. FOUNDATIONS I: STATES AND ENSEMBLES
[1)
A
. If we are interested in the result of any possible measurement, we can
consider M
A
to be the projection E
A
(a) onto the relevant eigenspace of a
particular observable. Then
Prob (a) = p
0 A
0[E
A
(a)[0)
A
+ p
1 A
1[E
A
(a)[1)
A
, (2.53)
which is the probability of outcome a summed over the ensemble, and weighted
by the probability of each state in the ensemble.
We have emphasized previously that there is an essential dierence be-
tween a coherent superposition of the states [0)
A
and [1)
A
, and a probabilistic
ensemble, in which [0)
A
and [1)
A
can each occur with specied probabilities.
For example, for a spin-
1
2
object we have seen that if we measure
1
in the
state
1
2
([
z
) +[
z
)), we will obtain the result [
x
) with probability one.
But the ensemble in which [
z
) and [
z
) each occur with probability
1
2
is
represented by the density operator
=
1
2
([
z
)
z
[ +[
z
)
z
[)
=
1
2
1, (2.54)
and the projection onto [
x
) then has the expectation value
tr ([
x
)
x
[) =
1
2
. (2.55)
In fact, we have seen that any state of one qubit represented by a ray can
be interpreted as a spin pointing in some denite direction. But because
the identity is left unchanged by any unitary change of basis, and the state
[(, )) can be obtained by applying a suitable unitary transformation to
[
z
), we see that for given by eq. (2.54), we have
tr ([(, ))(, )[) =
1
2
. (2.56)
Therefore, if the state [)
AB
in eq. (2.57) is prepared, with [a[
2
= [b[
2
=
1
2
,
and we measure the spin A along any axis, we obtain a completely random
result; spin up or spin down can occur, each with probability
1
2
.
This discussion of the correlated two-qubit state [)
AB
is easily general-
ized to an arbitrary state of any bipartite quantum system (a system divided
into two parts). The Hilbert space of a bipartite system is H
A
H
B
where
2.3. THE DENSITY MATRIX 17
H
A,B
are the Hilbert spaces of the two parts. This means that if [i)
A
is an
orthonormal basis for H
A
and [)
B
is an orthonormal basis for H
B
, then
[i)
A
[)
B
is an orthonormal basis for H
A
H
B
. Thus an arbitrary pure
state of H
A
H
B
can be expanded as
[)
AB
=
i,
a
i
[i)
A
[)
B
, (2.57)
where
i,
[a
i
[
2
= 1. The expectation value of an observable M
A
1
B
, that
acts only on subsystem A is
M
A
) =
AB
[M
A
1
B
[)
AB
=
j,
a
j
(
A
j[
B
[) (M
A
1
B
)
i,
a
i
([i)
A
[)
B
)
=
i,j,
a
j
a
i A
j[M
A
[i)
A
= tr (M
A
A
) , (2.58)
where
A
= tr
B
([)
AB AB
[)
i,j,
a
i
a
j
[i)
A A
j[ . (2.59)
We say that the density operator
A
for subsystem A is obtained by per-
forming a partial trace over subsystem B of the density matrix (in this case
a pure state) for the combined system AB.
From the denition eq. (2.59), we can immediately infer that
A
has the
following properties:
1.
A
is self-adjoint:
A
=
A
.
2.
A
is positive: For any [)
A A
[
A
[)
A
=
i
a
i A
[i)
A
[
2
0.
3. tr(
A
) = 1: We have tr
A
=
i,
[a
i
[
2
= 1, since [)
AB
is normalized.
It follows that
A
can be diagonalized, that the eigenvalues are all real and
nonnegative, and that the eigenvalues sum to one.
If we are looking at a subsystem of a larger quantum system, then, even
if the state of the larger system is a ray, the state of the subsystem need
18 CHAPTER 2. FOUNDATIONS I: STATES AND ENSEMBLES
not be; in general, the state is represented by a density operator. In the
case where the state of the subsystem is a ray, and we say that the state is
pure. Otherwise the state is mixed. If the state is a pure state [)
A
, then
the density matrix
A
= [)
A A
[ is the projection onto the one-dimensional
space spanned by [)
A
. Hence a pure density matrix has the property
2
= .
A general density matrix, expressed in the basis in which it is diagonal, has
the form
A
=
a
p
a
[
a
)
a
[, (2.60)
where 0 < p
a
1 and
a
p
a
= 1. If the state is not pure, there are two
or more terms in this sum, and
2
,= ; in fact, tr
2
=
p
2
a
<
p
a
= 1.
We say that is an incoherent superposition of the states [
a
); incoherent
meaning that the relative phases of the [
a
) are experimentally inaccessible.
Since the expectation value of any observable M acting on the subsystem
can be expressed as
M) = trM =
a
p
a
a
[M[
a
), (2.61)
we see as before that we may interpret as describing an ensemble of pure
quantum states, in which the state [
a
) occurs with probability p
a
. We have,
therefore, come a long part of the way to understanding how probabilities
arise in quantum mechanics when a quantum system A interacts with another
system B. A and B become entangled, that is, correlated. The entanglement
destroys the coherence of a superposition of states of A, so that some of the
phases in the superposition become inaccessible if we look at A alone. We
may describe this situation by saying that the state of system A collapses
it is in one of a set of alternative states, each of which can be assigned a
probability.
2.3.2 Bloch sphere
Lets return to the case in which system A is a single qubit, and consider the
form of the general density matrix. The most general self-adjoint 22 matrix
has four real parameters, and can be expanded in the basis 1,
1
,
2
,
3
.
Since each
i
is traceless, the coecient of 1 in the expansion of a density
2.3. THE DENSITY MATRIX 19
matrix must be
1
2
(so that tr() = 1), and may be expressed as
(
P) =
1
2
_
1 +
P
_
1
2
(1 + P
1
1
+ P
2
2
+ P
3
3
)
=
1
2
_
1 + P
3
P
1
iP
2
P
1
+ iP
2
1 P
3
_
. (2.62)
We can compute det =
1
4
_
1
P
2
_
. Therefore, a necessary condition for
to have nonnegative eigenvalues is det 0 or
P
2
1. This condition is
also sucient; since tr = 1, it is not possible for to have two negative
eigenvalues. Thus, there is a 1 1 correspondence between the possible
density matrices of a single qubit and the points on the unit 3-ball 0 [
P[
1. This ball is usually called the Bloch sphere (although of course it is really
a ball, not a sphere).
The boundary
_
[
P[ = 1
_
of the ball (which really is a sphere) contains
the density matrices with vanishing determinant. Since tr = 1, these den-
sity matrices must have the eigenvalues 0 and 1. They are one-dimensional
projectors, and hence pure states. We have already seen that every pure
state of a single qubit is of the form [(, )) and can be envisioned as a spin
pointing in the (, ) direction. Indeed using the property
( n )
2
= 1, (2.63)
where n is a unit vector, we can easily verify that the pure-state density
matrix
( n) =
1
2
(1 + n ) (2.64)
satises the property
( n ) ( n) = ( n) ( n ) = ( n), (2.65)
and, therefore is the projector
( n) = [( n))( n)[; (2.66)
that is, n is the direction along which the spin is pointing up. Alternatively,
from the expression
[(, )) =
_
e
i/2
cos
2
e
i/2
sin
2
_
, (2.67)
20 CHAPTER 2. FOUNDATIONS I: STATES AND ENSEMBLES
we may compute directly that
(, ) = [(, ))(, )[
=
_
cos
2
2
cos
2
sin
2
e
i
cos
2
sin
2
e
i
sin
2
2
_
=
1
2
1 +
1
2
_
cos sine
i
sine
i
cos
_
=
1
2
(1 + n ) (2.68)
where n = (sin cos , sin sin, cos ). One nice property of the Bloch
parametrization of the pure states is that while [(, )) has an arbitrary
overall phase that has no physical signicance, there is no phase ambiguity
in the density matrix (, ) = [(, ))(, )[; all the parameters in
have a physical meaning.
From the property
1
2
tr
i
j
=
ij
(2.69)
we see that
n )
P
= tr
_
n (
P)
_
= n
P . (2.70)
Thus the vector
P in Eq. (2.62) parametrizes the polarization of the spin. If
there are many identically prepared systems at our disposal, we can determine
i,
a
i
[i(t))
A
[(t))
B
, (2.77)
where
[i(t))
A
= U
A
(t)[i(0))
A
,
[(t))
B
= U
B
(t)[(0))
B
, (2.78)
dene new orthonormal basis for H
A
and H
B
(since U
A
(t) and U
B
(t) are
unitary). Taking the partial trace as before, we nd
A
(t) =
i,j,
a
i
a
j
[i(t))
A A
j(t)[
= U
A
(t)
A
(0)U
A
(t)
. (2.79)
2.4. SCHMIDT DECOMPOSITION 23
Thus U
A
(t), acting by conjugation, determines the time evolution of the
density matrix.
In particular, in the basis in which
A
(0) is diagonal, we have
A
(t) =
a
p
a
U
A
(t)[
a
(0))
A A
a
(0)[U
A
(t). (2.80)
Eq. (2.80) tells us that the evolution of
A
is perfectly consistent with the
ensemble interpretation. Each state in the ensemble evolves forward in time
governed by U
A
(t). If the state [
a
(0)) occurs with probability p
a
at time 0,
then [
a
(t)) occurs with probability p
a
at the subsequent time t.
On the other hand, it should be clear that eq. (2.80) applies only under
the assumption that systems A and B are not coupled by the Hamiltonian.
Later, we will investigate how the density matrix evolves under more general
conditions.
2.4 Schmidt decomposition
A bipartite pure state can be expressed in a standard form (the Schmidt
decomposition) that is often very useful.
To arrive at this form, note that an arbitrary vector in H
A
H
B
can be
expanded as
[)
AB
=
i,
a
i
[i)
A
[)
B
i
[i)
A
[
i)
B
. (2.81)
Here [i)
A
and [)
B
are orthonormal basis for H
A
and H
B
respectively,
but to obtain the second equality in eq. (2.81) we have dened
[
i)
B
a
i
[)
B
. (2.82)
Note that the [
i)
B
s need not be mutually orthogonal or normalized.
Now lets suppose that the [i)
A
basis is chosen to be the basis in which
A
is diagonal,
A
=
i
p
i
[i)
A A
i[. (2.83)
We can also compute
A
by performing a partial trace,
A
= tr
B
([)
AB AB
[)
24 CHAPTER 2. FOUNDATIONS I: STATES AND ENSEMBLES
= tr
B
(
ij
[i)
A A
j[ [
i)
B B
j[) =
ij
B
j[
i)
B
([i)
A A
j[) .
(2.84)
We obtained the last equality in eq. (2.84) by noting that
tr
B
_
[
i)
B B
j[
_
=
k
B
k[
i)
B B
j[k)
B
=
k
B
j[k)
B B
k[
i)
B
=
B
j[
i)
B
, (2.85)
where [k)
B
is an orthonormal basis for H
B
. By comparing eq. (2.83) and
eq. (2.84), we see that
B
j[
i)
B
= p
i
ij
. (2.86)
Hence, it turns out that the [
i)
B
are orthogonal after all. We obtain
orthonormal vectors by rescaling,
[i
)
B
= p
1/2
i
[i)
B
(2.87)
(we may assume p
i
,= 0, because we will need eq. (2.87) only for i appearing
in the sum eq. (2.83)), and therefore obtain the expansion
[)
AB
=
p
i
[i)
A
[i
)
B
, (2.88)
in terms of a particular orthonormal basis of H
A
and H
B
.
Eq. (2.88) is the Schmidt decomposition of the bipartite pure state [)
AB
.
Any bipartite pure state can be expressed in this form, but of course the
bases used depend on the pure state that is being expanded. In general, we
cant simultaneously expand both [)
AB
and [)
AB
H
A
H
B
in the form
eq. (2.88) using the same orthonormal bases for H
A
and H
B
.
Using eq. (2.88), we can also evaluate the partial trace over H
A
to obtain
B
= tr
A
([)
AB AB
[) =
i
p
i
[i
)
B B
i
[. (2.89)
We see that
A
and
B
have the same nonzero eigenvalues. Of course, there
is no need for H
A
and H
B
to have the same dimension, so the number of zero
eigenvalues of
A
and
B
can dier.
If
A
(and hence
B
) have no degenerate eigenvalues other than zero,
then the Schmidt decomposition of [)
AB
is essentially uniquely determined
2.4. SCHMIDT DECOMPOSITION 25
by
A
and
B
. We can diagonalize
A
and
B
to nd the [i)
A
s and [i
)
B
s,
and then we pair up the eigenstates of
A
and
B
with the same eigenvalue
to obtain eq. (2.88). We have chosen the phases of our basis states so that no
phases appear in the coecients in the sum; the only remaining freedom is
to redene [i)
A
and [i
)
B
by multiplying by opposite phases (which of course
leaves the expression eq. (2.88) unchanged).
But if
A
has degenerate nonzero eigenvalues, then we need more infor-
mation than that provided by
A
and
B
to determine the Schmidt decompo-
sition; we need to know which [i
)
B
gets paired with each [i)
A
. For example,
if both H
A
and H
B
are N-dimensional and U
ij
is any N N unitary matrix,
then
[)
AB
=
1
N
N
i,j=1
[i)
A
U
ij
[j
)
B
, (2.90)
will yield
A
=
B
=
1
N
1 when we take partial traces. Furthermore, we are
free to apply simultaneous unitary transformations in H
A
and H
B
,
[)
AB
=
1
i
[i)
A
[i
)
B
=
1
ijk
U
ij
[j)
A
U
ik
[k
)
B
; (2.91)
this preserves the state [)
AB
, but illustrates that there is an ambiguity in
the basis used when we express [)
AB
in the Schmidt form.
2.4.1 Entanglement
With any bipartite pure state [)
AB
we may associate a positive integer, the
Schmidt number, which is the number of nonzero eigenvalues in
A
(or
B
)
and hence the number of terms in the Schmidt decomposition of [)
AB
. In
terms of this quantity, we can dene what it means for a bipartite pure state
to be entangled: [)
AB
is entangled (or nonseparable) if its Schmidt number
is greater than one; otherwise, it is separable (or unentangled). Thus, a
separable bipartite pure state is a direct product of pure states in H
A
and
H
B
,
[)
AB
= [)
A
[)
B
; (2.92)
then the reduced density matrices
A
= [)
A A
[ and
B
= [)
B B
[ are
pure. Any state that cannot be expressed as such a direct product is entan-
gled; then
A
and
B
are mixed states.
26 CHAPTER 2. FOUNDATIONS I: STATES AND ENSEMBLES
One of our main goals this term will be to understand better the signif-
icance of entanglement. It is not strictly correct to say that subsystems A
and B are uncorrelated if [)
AB
is separable; after all, the two spins in the
separable state
[ )
A
[ )
B
, (2.93)
are surely correlated they are both pointing in the same direction. But
the correlations between A and B in an entangled state have a dierent
character than those in a separable state. Perhaps the critical dierence is
that entanglement cannot be created locally. The only way to entangle A and
B is for the two subsystems to directly interact with one another.
We can prepare the state eq. (2.93) without allowing spins A and B to
ever come into contact with one another. We need only send a (classical!)
message to two preparers (Alice and Bob) telling both of them to prepare a
spin pointing along the z-axis. But the only way to turn the state eq. (2.93)
into an entangled state like
1
2
([ )
A
[ )
B
+[ )
A
[ )
B
) , (2.94)
is to apply a collective unitary transformation to the state. Local unitary
transformations of the form U
A
U
B
, and local measurements performed by
Alice or Bob, cannot increase the Schmidt number of the two-qubit state,
no matter how much Alice and Bob discuss what they do. To entangle two
qubits, we must bring them together and allow them to interact.
As we will discuss later, it is also possible to make the distinction between
entangled and separable bipartite mixed states. We will also discuss various
ways in which local operations can modify the form of entanglement, and
some ways that entanglement can be put to use.
2.5 Ambiguity of the ensemble interpretation
2.5.1 Convexity
Recall that an operator acting on a Hilbert space H may be interpreted as
a density operator if it has the three properties:
(1) is self-adjoint.
2.5. AMBIGUITY OF THE ENSEMBLE INTERPRETATION 27
(2) is nonnegative.
(3) tr() = 1.
It follows immediately that, given two density matrices
1
, and
2
, we can
always construct another density matrix as a convex linear combination of
the two:
() =
1
+ (1 )
2
(2.95)
is a density matrix for any real satisfying 0 1. We easily see that
() satises (1) and (3) if
1
and
2
do. To check (2), we evaluate
[()[) = [
1
[) + (1 )[
2
[) 0; (2.96)
()) is guaranteed to be nonnegative because
1
) and
2
) are. We have,
therefore, shown that in a Hilbert space H of dimension N, the density
operators are a convex subset of the real vector space of N N hermitian
matrices. (A subset of a vector space is said to be convex if the set contains
the straight line segment connecting any two points in the set.)
Most density operators can be expressed as a sum of other density oper-
ators in many dierent ways. But the pure states are special in this regard
it is not possible to express a pure state as a convex sum of two other states.
Consider a pure state = [)[, and let [
[[
) = 0 =
[
1
[
)
+ (1 )
[
2
[
). (2.97)
Since the right hand side is a sum of two nonnegative terms, and the sum
vanishes, both terms must vanish. If is not 0 or 1, we conclude that
1
and
2
are orthogonal to [
). But since [
P) =
1
2
(1 +
P ), (2.102)
with 0 < [
P) = ( n
1
) + (1 )( n
2
), (2.103)
if
P = n
1
+ (1 ) n
2
(or in other words, if
P lies somewhere on the line
segment connecting the points n
1
and n
2
on the sphere). Evidently, for any
P, there is a solution associated with any chord of the sphere that passes
through the point
P; all such chords comprise a two-parameter family.
This highly ambiguous nature of the preparation of a mixed quantum
state is one of the characteristic features of quantum information that con-
trasts sharply with classical probability distributions. Consider, for exam-
ple, the case of a probability distribution for a single classical bit. The two
extremal distributions are those in which either 0 or 1 occurs with 100%
probability. Any probability distribution for the bit is a convex sum of these
two extremal points. Similarly, if there are N possible states, there are N
extremal distributions, and any probability distribution has a unique decom-
position into extremal ones (the convex set of probability distributions is a
simplex). If 0 occurs with 21% probability, 1 with 33% probability, and 2
with 46% probability, there is a unique preparation procedure that yields
this probability distribution!
2.5.3 Faster than light?
Lets now return to our earlier viewpoint that a mixed state of system
A arises because A is entangled with system B to further consider the
implications of the ambiguous preparation of mixed states. If qubit A has
density matrix
A
=
1
2
[
z
)
A A
z
[ +
1
2
[
z
)
A A
z
[, (2.104)
this density matrix could arise from an entangled bipartite pure state [)
AB
with the Schmidt decomposition
[)
AB
=
1
2
([
z
)
A
[
z
)
B
+[
z
)
A
[
z
)
B
) . (2.105)
2.5. AMBIGUITY OF THE ENSEMBLE INTERPRETATION 31
Therefore, the ensemble interpretation of
A
in which either [
z
)
A
or [
z
)
A
is prepared (each with probability p =
1
2
) can be realized by performing a
measurement of qubit B. We measure qubit B in the [
z
)
B
, [
z
)
B
basis;
if the result [
z
)
B
is obtained, we have prepared [
z
)
A
, and if the result
[
7
)
B
is obtained, we have prepared [
z
)
A
.
But as we have already noted, in this case, because
A
has degenerate
eigenvalues, the Schmidt basis is not unique. We can apply simultaneous
unitary transformations to qubits A and B (actually, if we apply U to A
we must apply U
2
([
n
)
A
[
n
)
B
+[
n
)
A
[
n
)
B
) . (2.106)
We see that by measuring qubit B in a suitable basis, we can realize any
interpretation of
A
as an ensemble of two pure states.
Bright students, upon learning of this property, are sometimes inspired
to suggest a mechanism for faster-than-light communication. Many copies of
[)
AB
are prepared. Alice takes all of the A qubits to the Andromeda galaxy
and Bob keeps all of the B qubits on earth. When Bob wants to send a one-
bit message to Alice, he chooses to measure either
1
or
3
for all his spins,
thus preparing Alices spins in either the [
z
)
A
, [
z
)
A
or [
x
)
A
, [
x
)
A
ensembles.
1
To read the message, Alice immediately measures her spins to
see which ensemble has been prepared.
But exceptionally bright students (or students who heard the previous
lecture) can see the aw in this scheme. Though the two preparation meth-
ods are surely dierent, both ensembles are described by precisely the same
density matrix
A
. Thus, there is no conceivable measurement Alice can
make that will distinguish the two ensembles, and no way for Alice to tell
what action Bob performed. The message is unreadable.
Why, then, do we condently state that the two preparation methods
are surely dierent? To qualm any doubts about that, imagine that Bob
either (1) measures all of his spins along the z-axis, or (2) measures all of his
spins along the x-axis, and then calls Alice on the intergalactic telephone. He
does not tell Alice whether he did (1) or (2), but he does tell her the results of
all his measurements: the rst spin was up, the second was down, etc. Now
1
U is real in this case, so U = U
and n = n
.
32 CHAPTER 2. FOUNDATIONS I: STATES AND ENSEMBLES
Alice performs either (1) or (2) on her spins. If both Alice and Bob measured
along the same axis, Alice will nd that every single one of her measurement
outcomes agrees with what Bob found. But if Alice and Bob measured along
dierent (orthogonal) axes, then Alice will nd no correlation between her
results and Bobs. About half of her measurements agree with Bobs and
about half disagree. If Bob promises to do either (1) or (2), and assuming no
preparation or measurement errors, then Alice will know that Bobs action
was dierent than hers (even though Bob never told her this information)
as soon as one of her measurements disagrees with what Bob found. If all
their measurements agree, then if many spins are measured, Alice will have
very high statistical condence that she and Bob measured along the same
axis. (Even with occasional measurement errors, the statistical test will still
be highly reliable if the error rate is low enough.) So Alice does have a
way to distinguish Bobs two preparation methods, but in this case there is
certainly no faster-than-light communication, because Alice had to receive
Bobs phone call before she could perform her test.
2.5.4 Quantum erasure
We had said that the density matrix
A
=
1
2
1 describes a spin in an inco-
herent superposition of the pure states [
z
)
A
and [
z
)
A
. This was to be
distinguished from coherent superpositions of these states, such as
[
x
,
x
) =
1
2
([
z
) [
z
)) ; (2.107)
in the case of a coherent superposition, the relative phase of the two states
has observable consequences (distinguishes [
x
) from [
x
)). In the case of an
incoherent superposition, the relative phase is completely unobservable. The
superposition becomes incoherent if spin A becomes entangled with another
spin B, and spin B is inaccessible.
Heuristically, the states [
z
)
A
and [
z
)
A
can interfere (the relative phase
of these states can be observed) only if we have no information about whether
the spin state is [
z
)
A
or [
z
)
A
. More than that, interference can occur
only if there is in principle no possible way to nd out whether the spin
is up or down along the z-axis. Entangling spin A with spin B destroys
interference, (causes spin A to decohere) because it is possible in principle
for us to determine if spin A is up or down along z by performing a suitable
measurement of spin B.
2.5. AMBIGUITY OF THE ENSEMBLE INTERPRETATION 33
But we have now seen that the statement that entanglement causes de-
coherence requires a qualication. Suppose that Bob measures spin B along
the x-axis, obtaining either the result [
x
)
B
or [
x
)
B
, and that he sends his
measurement result to Alice. Now Alices spin is a pure state (either [
x
)
A
or [
x
)
A
) and in fact a coherent superposition of [
z
)
A
and [
z
)
A
. We have
managed to recover the purity of Alices spin before the jaws of decoherence
could close!
Suppose that Bob allows his spin to pass through a SternGerlach ap-
paratus oriented along the z-axis. Well, of course, Alices spin cant behave
like a coherent superposition of [
z
)
A
and [
z
)
A
; all Bob has to do is look
to see which way his spin moved, and he will know whether Alices spin is
up or down along z. But suppose that Bob does not look. Instead, he care-
fully refocuses the two beams without maintaining any record of whether his
spin moved up or down, and then allows the spin to pass through a second
SternGerlach apparatus oriented along the x-axis. This time he looks, and
communicates the result of his
1
measurement to Alice. Now the coherence
of Alices spin has been restored!
This situation has been called a quantum eraser. Entangling the two
spins creates a measurement situation in which the coherence of [
z
)
A
and
[
z
)
A
is lost because we can nd out if spin A is up or down along z by
observing spin B. But when we measure spin B along x, this information
is erased. Whether the result is [
x
)
B
or [
x
)
B
does not tell us anything
about whether spin A is up or down along z, because Bob has been careful
not to retain the which way information that he might have acquired by
looking at the rst SternGerlach apparatus.
2
Therefore, it is possible again
for spin A to behave like a coherent superposition of [
z
)
A
and [
z
)
A
(and
it does, after Alice hears about Bobs result).
We can best understand the quantum eraser from the ensemble viewpoint.
Alice has many spins selected from an ensemble described by
A
=
1
2
1, and
there is no way for her to observe interference between [
z
)
A
and [
z
)
A
.
When Bob makes his measurement along x, a particular preparation of the
ensemble is realized. However, this has no eect that Alice can perceive
her spin is still described by
A
=
1
2
1 as before. But, when Alice receives
Bobs phone call, she can select a subensemble of her spins that are all in
the pure state [
x
)
A
. The information that Bob sends allows Alice to distill
2
One often says that the welcher weg information has been erased, because it sounds
more sophisticated in German.
34 CHAPTER 2. FOUNDATIONS I: STATES AND ENSEMBLES
purity from a maximally mixed state.
Another wrinkle on the quantum eraser is sometimes called delayed choice.
This just means that the situation we have described is really completely sym-
metric between Alice and Bob, so it cant make any dierence who measures
rst. (Indeed, if Alices and Bobs measurements are spacelike separated
events, there is no invariant meaning to which came rst; it depends on the
frame of reference of the observer.) Alice could measure all of her spins to-
day (say along x) before Bob has made his mind up how he will measure his
spins. Next week, Bob can decide to prepare Alices spins in the states
[
n
)
A
and [
n
)
A
(that is the delayed choice). He then tells Alice which
were the [
n
)
A
spins, and she can check her measurement record to verify
that
1
)
n
= n x . (2.108)
The results are the same, irrespective of whether Bob prepares the spins
before or after Alice measures them.
We have claimed that the density matrix
A
provides a complete physical
description of the state of subsystem A, because it characterizes all possible
measurements that can be performed on A. One sometimes hears the objec-
tion
3
that the quantum eraser phenomenon demonstrates otherwise. Since
the information received from Bob enables Alice to recover a pure state from
the mixture, how can we hold that everything Alice can know about A is
encoded in
A
?
I dont think this is the right conclusion. Rather, I would say that quan-
tum erasure provides yet another opportunity to recite our mantra: Infor-
mation is physical. The state
A
of system A is not the same thing as
A
accompanied by the information that Alice has received from Bob. This in-
formation (which attaches labels to the subensembles) changes the physical
description. One way to say this mathematically is that we should include
Alices state of knowledge in our description. An ensemble of spins for
which Alice has no information about whether each spin is up or down is a
dierent physical state than an ensemble in which Alice knows which spins
are up and which are down.
4
3
For example, from Roger Penrose in Shadows of the Mind.
4
This state of knowledge need not really be the state of a human mind; any (inani-
mate) record that labels the subensemble will suce.
2.5. AMBIGUITY OF THE ENSEMBLE INTERPRETATION 35
2.5.5 The GHJW theorem
So far, we have considered the quantum eraser only in the context of a single
qubit, described by an ensemble of equally probable mutually orthogonal
states, (i.e.,
A
=
1
2
1). The discussion can be considerably generalized.
We have already seen that a mixed state of any quantum system can be
realized as an ensemble of pure states in an innite number of dierent ways.
For a density matrix
A
, consider one such realization:
A
=
i
p
i
[
i
)
A A
i
[,
p
i
= 1. (2.109)
Here the states [
i
)
A
are all normalized vectors, but we do not assume
that they are mutually orthogonal. Nevertheless,
A
can be realized as an
ensemble, in which each pure state [
i
)
A A
i
[ occurs with probability p
i
.
Of course, for any such
A
, we can construct a purication of
A
, a
bipartite pure state [
1
)
AB
that yields
A
when we perform a partial trace
over H
B
. One such purication is of the form
[
1
)
AB
=
p
i
[
i
)
A
[
i
)
B
, (2.110)
where the vectors [
i
)
B
H
B
are mutually orthogonal and normalized,
B
i
[
j
)
B
=
ij
. (2.111)
Clearly, then,
tr
B
([
1
)
AB AB
1
[) =
A
. (2.112)
Furthermore, we can imagine performing an orthogonal measurement in sys-
tem B that projects onto the [
i
)
B
basis.
5
The outcome [
i
)
B
will occur with
probability p
i
, and will prepare the pure state [
i
)
A A
i
[ of system A. Thus,
given the purication [)
AB
of
A
, there is a measurement we can perform
in system B that realizes the [
i
)
A
ensemble interpretation of
A
. When the
measurement outcome in B is known, we have successfully extracted one of
the pure states [
i
)
A
from the mixture
A
.
What we have just described is a generalization of preparing [
z
)
A
by
measuring spin B along z (in our discussion of two entangled qubits). But
5
The [
i
)
B
s might not span H
B
, but in the state [)
AB
, measurement outcomes
orthogonal to all the [
i
)
B
s never occur.
36 CHAPTER 2. FOUNDATIONS I: STATES AND ENSEMBLES
to generalize the notion of a quantum eraser, we wish to see that in the state
[
1
)
AB
, we can realize a dierent ensemble interpretation of
A
by performing
a dierent measurement of B. So let
A
=
)
A A
[, (2.113)
be another realization of the same density matrix
A
as an ensemble of pure
states. For this ensemble as well, there is a corresponding purication
[
2
)
AB
=
)
A
[
)
B
, (2.114)
where again the [
)
B
s are orthonormal vectors in H
B
. So in the state
[
2
)
AB
, we can realize the ensemble by performing a measurement in H
B
that projects onto the [
)
B
basis.
Now, how are [
1
)
AB
and [
2
)
AB
related? In fact, we can easily show
that
[
1
)
AB
= (1
A
U
B
) [
2
)
AB
; (2.115)
the two states dier by a unitary change of basis acting in H
B
alone, or
[
1
)
AB
=
)
A
[
)
B
, (2.116)
where
[
)
B
= U
B
[
)
B
, (2.117)
is yet another orthonormal basis for H
B
. We see, then, that there is a single
purication [
1
)
AB
of
A
, such that we can realize either the [
i
)
A
ensemble
or [
)
A
ensemble by choosing to measure the appropriate observable in
system B!
Similarly, we may consider many ensembles that all realize
A
, where
the maximum number of pure states appearing in any of the ensembles is
n. Then we may choose a Hilbert space H
B
of dimension n, and a pure
state [)
AB
H
A
H
B
, such that any one of the ensembles can be realized
by measuring a suitable observable of B. This is the GHJW
6
theorem. It
expresses the quantum eraser phenomenon in its most general form.
6
For Gisin and Hughston, Jozsa, and Wootters.
2.6. SUMMARY 37
In fact, the GHJW theorem is an almost trivial corollary to the Schmidt
decomposition. Both [
1
)
AB
and [
2
)
AB
have a Schmidt decomposition, and
because both yield the same
A
when we take the partial trace over B, these
decompositions must have the form
[
1
)
AB
=
k
_
k
[k)
A
[k
1
)
B
,
[
2
)
AB
=
k
_
k
[k)
A
[k
2
)
B
, (2.118)
where the
k
s are the eigenvalues of
A
and the [k)
A
s are the corresponding
eigenvectors. But since [k
1
)
B
and [k
2
)
B
are both orthonormal bases for
H
B
, there is a unitary U
B
such that
[k
1
)
B
= U
B
[k
2
)
B
, (2.119)
from which eq. (2.115) immediately follows.
In the ensemble of pure states described by Eq. (2.109), we would say that
the pure states [
i
)
A
are superposed incoherently an observer in system
A cannot detect the relative phases of these states. Heuristically, the reason
that these states cannot interfere is that it is possible in principle to nd
out which representative of the ensemble is actually realized by performing a
measurement in system B, a projection onto the orthonormal basis [
i
)
B
.
However, by projecting onto the [
)
B
basis instead, and relaying the in-
formation about the measurement outcome to system A, we can extract one
of the pure states [
)
A
from the ensemble, even though this state may be a
coherent superposition of the [
i
)
A
s. In eect, measuring B in the [
)
B
P) =
1
2
(1 +
P ) (2.120)
where
P is a three-component vector of length [
P[ = 1.
Schmidt decomposition. For any quantum system divided into two
parts A and B (a bipartite system), the Hilbert space is a tensor product H
A
H
B
. For any pure state [)
AB
of a bipartite system, there are orthonormal
bases [i)
A
for H
A
and [i
)
B
for H
B
such that
[)
AB
=
p
i
[i)
A
[i
)
B
; (2.121)
Eq. (2.121) is called the Schmidt decomposition of [)
AB
. In a bipartite pure
state, subsystems A and B separately are described by density operators
A
and
B
; it follows from eq. (2.121) that
A
and
B
have the same nonvanish-
ing eigenvalues (the p
i
s). The number of nonvanishing eigenvalues is called
the Schmidt number of [)
AB
. A bipartite pure state is said to be entangled
if its Schmidt number is greater than one.
Ensembles. The density operators on a Hilbert space form a convex set,
and the pure states are the extremal points of the set. A mixed state of a
system A can be prepared as an ensemble of pure states in many dierent
ways, all of which are experimentally indistinguishable if we observe system
A alone. Given any mixed state
A
of system A, any preparation of
A
as an ensemble of pure states can be realized in principle by performing a
2.7. EXERCISES 39
measurement in another system B with which A is entangled. In fact given
many such preparations of
A
, there is a single entangled state of A and
B such that any one of these preparations can be realized by measuring a
suitable observable in B (the GHJW theorem). By measuring in system B
and reporting the measurement outcome to system A, we can extract from
the mixture a pure state chosen from one of the ensembles.
2.7 Exercises
2.1 Fidelity of a random guess
A single qubit (spin-
1
2
object) is in an unknown pure state [), se-
lected at random from an ensemble uniformly distributed over the Bloch
sphere. We guess at random that the state is [). On the average, what
is the delity F of our guess, dened by
F [[)[
2
. (2.122)
2.2 Fidelity after measurement
After randomly selecting a one-qubit pure state as in the previous prob-
lem, we perform a measurement of the spin along the z-axis. This
measurement prepares a state described by the density matrix
= P
[P
[) +P
[P
[) (2.123)
(where P
,
denote the projections onto the spin-up and spin-down
states along the z-axis). On the average, with what delity
F [[) (2.124)
does this density matrix represent the initial state [)? (The improve-
ment in F compared to the answer to the previous problem is a crude
measure of how much we learned by making the measurement.)
2.3 Schmidt decomposition
For the two-qubit state
=
1
2
[ )
A
_
1
2
[ )
B
+
3
2
[ )
B
_
+
1
2
[ )
A
_
3
2
[ )
B
+
1
2
[ )
B
_
,
(2.125)
40 CHAPTER 2. FOUNDATIONS I: STATES AND ENSEMBLES
a. Compute
A
= tr
B
([)[) and
B
= tr
A
([)[).
b. Find the Schmidt decomposition of [).
2.4 Tripartite pure state
Is there a Schmidt decomposition for an arbitrary tripartite pure state?
That is if [)
ABC
is an arbitrary vector in H
A
H
B
H
C
, can we nd
orthonormal bases [i)
A
, [i)
B
, [i)
C
such that
[)
ABC
=
p
i
[i)
A
[i)
B
[i)
C
? (2.126)
Explain your answer.
2.5 Quantum correlations in a mixed state
Consider a density matrix for two qubits
=
1
8
1 +
1
2
[
[ , (2.127)
where 1 denotes the 4 4 unit matrix, and
[
) =
1
2
([ )[ ) [ )[ )) . (2.128)
Suppose we measure the rst spin along the n axis and the second spin
along the m axis, where n m = cos . What is the probability that
both spins are spin-up along their respective axes?