Hughes - The Basics of Gravitational Waves
Hughes - The Basics of Gravitational Waves
The MIT Faculty has made this article openly available. Please share
how this access benefits you. Your story matters.
Citation: Flanagan, Eanna E, and Scott A Hughes. “The Basics of Gravitational Wave
Theory.” New J. Phys. 7 (September 29, 2005): 204–204. © IOP Publishing Ltd and Deutsche
Physikalische Gesellschaft
As Published: http://dx.doi.org/10.1088/1367-2630/7/1/204
Version: Final published version: final published article, as it appeared in a journal, conference
proceedings, or other formally published context
This content has been downloaded from IOPscience. Please scroll down to see the full text.
(http://iopscience.iop.org/1367-2630/7/1/204)
View the table of contents for this issue, or go to the journal homepage for more
Download details:
IP Address: 50.187.205.234
This content was downloaded on 15/02/2016 at 23:42
Einstein’s special relativity [1] taught us that space and time are not simply abstract, external
concepts, but must in fact be considered measured observables, like any other quantity in physics.
This reformulation enforced the philosophy that Newton sought to introduce in laying out his
laws of mechanics [2]:
Absolute space, in its own nature, without relation to anything external, remains always
similar and immovable
Absolute, true, and mathematical time, of itself, and from its own nature, flows equably
without relation to anything external.
Special relativity put an end to these abstractions: time is nothing more than that which
is measured by clocks, and space is that which is measured by rulers. The properties of space
and time thus depend on the properties of clocks and rulers. The constancy of the speed of
light as measured by observers in different reference frames, as observed in the Michelson–
Morley experiment, forces us inevitably to the fact that space and time are mixed into spacetime.
Ten years after his paper on special relativity, Einstein endowed spacetime with curvature and
made it dynamical [5]. This provided a covariant theory of gravity [6], in which all predictions
for physical measurements are invariant under changes in coordinates. In this theory, general
relativity, the notion of ‘gravitational force’is reinterpreted in terms of the behaviour of geodesics
in the curved manifold of spacetime.
To be compatible with special relativity, gravity must be causal: any change to a gravitating
source must be communicated to distant observers no faster than the speed of light, c. This
leads immediately to the idea that there must exist some notion of ‘gravitational radiation’. As
demonstrated by Schutz [7], one can actually calculate with surprising accuracy many of the
properties of gravitational radiation simply by combining a time-dependent Newtonian potential
with special relativity.
The first calculation of gravitational radiation in general relativity is due to Einstein.
His initial calculation [8] was ‘marred by an error in calculation’ (Einstein’s words), and
was corrected in 1918 [9] (albeit with an overall factor of two error). Modulo a somewhat
convoluted history (discussed in great detail by Kennefick [10]) owing (largely) to the
difficulties of analysing radiation in a nonlinear theory, Einstein’s final result stands today as the
leading-order ‘quadrupole formula’ for gravitational wave emission. This formula plays a role
in gravity theory analogous to the dipole formula for electromagnetic radiation, showing that
gravitational waves (hereafter abbreviated GWs) arise from accelerated masses exactly as
electromagnetic waves arise from accelerated charges.
The quadrupole formula tells us that GWs are difficult to produce—very large masses
moving at relativistic speeds are needed. This follows from the weakness of the gravitational
interaction. A consequence of this is that it is extremely unlikely there will ever be an interesting
laboratory source of GWs. The only objects massive and relativistic enough to generate detectable
GWs are astrophysical. Indeed, experimental confirmation of the existence of GWs has come from
the study of binary neutron star systems—the variation of the mass quadrupole in such systems
is large enough that GW emission changes the system’s characteristics on a timescale short
enough to be observed. The most celebrated example is the ‘Hulse–Taylor’ pulsar, B1913+16,
reported by Hulse and Taylor in 1975 [11]. Thirty years of observation have shown that the orbit
is decaying; the results match with extraordinary precision general relativity’s prediction for such
a decay due to the loss of orbital energy and angular momentum by GWs. For a summary of the
most recent data, see figure 1 of [12]. Hulse and Taylor were awarded the Nobel Prize for this
When we discuss linearized theory, we will sometimes be sloppy and sum over adjacent spatial
indices in the same position. Hence,
ui vi ≡ ui vi ≡ ui vi ≡ ui vi
i
The most natural starting point for any discussion of GWs is ‘linearized gravity’. Linearized
gravity is an adequate approximation to general relativity when the spacetime metric, gab , may
be treated as deviating only slightly from a flat metric, ηab :
Here ∂a means the partial derivative ∂/∂xa . Since we use ηab to raise and lower indices, spatial
indices can be written either in the ‘up’position or the ‘down’position without changing the value
of a quantity: f x = fx . Raising or lowering a time index, by contrast, switches sign: f t = −ft .
The Riemann tensor we construct in linearized theory is then given by
where h = ha a is the trace of the metric perturbation and = ∂c ∂c = ∇ 2 − ∂2t is the wave
operator. Contracting once more, we find the curvature scalar:
Gab = Rab − 21 ηab R = 21 (∂c ∂b hc a + ∂c ∂a hbc − hab − ∂a ∂b h − ηab ∂c ∂d hc d + ηab h). (2.6)
Here, ω = |k|. The complex coefficients Aab (k) depend on the wavevector k but are independent
of x and t. They are subject to the constraint ka Aab = 0 (which follows from the Lorentz gauge
condition), with ka = (ω, k), but are otherwise arbitrary. These solutions are gravitational waves.
3
Fairly recently, it has become widely recognized that this gauge was in fact invented by Ludwig Lorenz, rather
than by Hendrik Lorentz. The inclusion of the ‘t’ seems most likely due to confusion between the similar names; see
[38] for a detailed discussion. Following the practice of Griffiths ([39], p 421), we bow to the weight of historical
usage in order to avoid any possible confusion.
h = hi i = 0. (2.19)
The Lorentz gauge condition (2.10) then implies that the spatial metric perturbation is transverse:
∂i hij = 0. (2.20)
This is called the transverse-traceless gauge, or TT gauge. A metric perturbation that has been
put into TT gauge will be written as hTT TT
ab . Since it is traceless, there is no distinction between hab
and h̄TT
ab .
The conditions (2.18) and (2.19) comprise five constraints on the metric, while the residual
gauge freedom in Lorentz gauge is parametrized by four functions that satisfy the wave equation.
It is nevertheless possible to satisfy these conditions, essentially because the metric perturbation
satisfies the linearized vacuum Einstein equation. When the TT gauge conditions are satisfied
the gauge is completely fixed.
One might wonder why we would choose TT gauge. It is certainly not necessary; however,
it is extremely convenient, since the TT gauge conditions completely fix all the local gauge
freedom. The metric perturbation hTT ab therefore contains only physical, non-gauge information
about the radiation. In TT gauge, there is a close relation between the metric perturbation and
the linearized Riemann tensor Rabcd (which is invariant under the local gauge transformations
(2.8) by equation (2.3)), namely
Ritjt = − 21 ḧTT
ij . (2.21)
In a globally vacuum spacetime, all non-zero components of the Riemann tensor can be obtained
from Ritjt via Riemann’s symmetries and the Bianchi identity. In a more general spacetime, there
will be components that are not related to radiation; this point is discussed further in section 2.2.
Transverse traceless gauge also exhibits the fact that gravitational waves have two
polarization components. For example, consider a GW which propagates in the z direction:
ij = hij (t − z) is a valid solution to the wave equation hij = 0. The Lorentz condition
hTT TT TT
zj = 0 implies that hzj (t − z) = constant. This constant must be zero to satisfy the condition
∂z hTT TT
xx = −hyy ≡ h+ (t − z);
hTT TT
(2.22)
The quantities h+ and h× are the two independent waveforms of the GW (see figure 1).
For globally vacuum spacetimes, one can always satisfy the TT gauge conditions. To
see this, note that the most general gauge transformation ξ a that preserves the Lorentz gauge
condition (2.10) satisfies ξ a = 0, from equation (2.12).A general solution to this equation can be
written as
ξ = Re d3 kCa (k)ei(k·x−ωt)
a
(2.24)
for some coefficients Ca (k). Under this transformation the tensor Aab (k) in equation (2.17)
transforms as
Achieving the TT gauge conditions (2.20) and (2.19) therefore requires finding, for each k, a
Ca (k) that satisfies the two equations
δt b is the Kronecker delta—zero for b = t, unity otherwise. An explicit solution to these equations
is given by
Weyl has classified plane gravitational waves into three types, viz. (1) longitudinal-
longitudinal; (2) longitudinal-transverse; (3) transverse-transverse. The present
investigation leads to the conclusion that transverse-transverse waves are propagated
with the speed of light in all systems of co-ordinates. Waves of the first and second
types have no fixed velocity—a result which rouses suspicion as to their objective
existence. Einstein had also become suspicious of these waves (in so far as they occur
in his special co-ordinate system) for another reason, because he found that they convey
no energy. They are not objective, and (like absolute velocity) are not detectable by
any conceivable experiment. They are merely sinuosities in the co-ordinate system,
and the only speed of propagation relevant to them is “the speed of thought." . . . It is
evidently a great convenience in analysis to have all waves, both physical and spurious,
travelling with one velocity; but it is liable to obscure physical ideas by mixing them up
so completely. The chief new point in the present discussion is that when unrestricted
co-ordinates are allowed the genuine waves continue to travel with the velocity of light
and the spurious waves cease to have any fixed velocity.
Unfortunately, Eddington’s wry dismissal of unphysical modes as propagating with ‘the speed of
thought’ is often taken by skeptics (and crackpots) as applying to all gravitational perturbations.
Eddington in fact showed quite the opposite. We do so now using a somewhat more modern
notation; our presentation is essentially the flat-spacetime limit of Bardeen’s [41] gauge-invariant
cosmological perturbation formalism. A similar treatment can be found in the lecture notes by
Bertschinger [42].
We begin by defining the decomposition of the metric perturbation hab , in any gauge, into
a number of irreducible pieces. Assuming that hab → 0 as r → ∞, we define the quantities φ,
βi , γ, H, εi , λ and hTT
ij via the equations
ij = 0 (three constraints),
∂i hTT (2.34)
ij = 0 (one constraint).
δij hTT (2.35)
φ → φ − Ȧ, (2.38)
βi → βi − Ḃi , (2.39)
γ → γ − A − Ċ, (2.40)
H → H − 2∇ 2 C, (2.41)
λ → λ − 2C, (2.42)
εi → εi − 2Bi , (2.43)
ij → hij .
hTT TT
(2.44)
≡ −φ + γ̇ − 21 λ̈, (2.45)
≡ 13 (H − ∇ 2 λ), (2.46)
i ≡ βi − 21 ε̇i ; (2.47)
hTT
ij is gauge-invariant without any further manipulation. In the Newtonian limit, reduces to the
Newtonian potential N , while = −2 N . The total number of free, gauge-invariant functions
is six: one function ; one function ; three functions i , minus one due to the constraint
∂i i = 0; and six functions hTT
ij , minus three due to the constraints ∂i hij = 0, minus one due to
TT
ij = 0. This is in keeping with the fact that in general the 10 metric functions
the constraint δij hTT
contain six physical and four gauge degrees of freedom.
We would now like to enforce Einstein’s equation. Before doing so, it is useful to first
decompose the stress–energy tensor in a manner similar to that of our decomposition of the
metric. We define the quantities ρ, Si , S, P, σij , σi and σ via the equations
Ttt = ρ, (2.48)
Tti = Si + ∂i S, (2.49)
∂i Si = 0, (2.51)
∂i σi = 0, (2.52)
∂i σij = 0, (2.53)
S → 0, σi → 0, σ → 0, ∇ 2σ → 0 (2.55)
as r → ∞. These quantities are not all independent. The variables ρ, P, Si and σij can be specified
arbitrarily; stress–energy conservation (∂a Tab = 0) then determines the remaining variables S,
σ and σi via
∇ 2 S = ρ̇, (2.56)
∇ 2 σ = − 23 P + 23 Ṡ, (2.57)
∇ 2 σi = 2Ṡ i . (2.58)
Gtt = −∇ 2 , (2.59)
Gti = − 21 ∇ 2 i − ∂i ,
˙ (2.60)
Gij = − 21 hTT
ij − ∂(i j) − 2 ∂i ∂j (2
˙ 1
+ ) + δij [ 21 ∇ 2 (2 + ) − ].
¨ (2.61)
We finally enforce Einstein’s equation Gab = 8πTab and simplify using the conservation relations
(2.56)–(2.58); this leads to the following field equations:
∇ 2 = −8πρ, (2.62)
∇ 2 i = −16πSi , (2.64)
hTT
ij = −16πσij . (2.65)
Notice that only the metric components hTT ij obey a wave-like equation. The other
variables , and i are determined by Poisson-type equations. Indeed, in a purely vacuum
spacetime, the field equations reduce to five Laplace equations and a wave equation:
∇ 2 vac = 0, (2.66)
∇2 vac
= 0, (2.67)
∇ 2 vac
i = 0, (2.68)
hTT,vac
ij = 0. (2.69)
This manifestly demonstrates that only the hTT ij metric components—the transverse, traceless
degrees of freedom of the metric perturbation—characterize the radiative degrees of freedom in
the spacetime. Although it is possible to pick a gauge in which other metric components appear
to be radiative, they will not be: their radiative character is an illusion arising due to the choice
of gauge or coordinates.
The field equations (2.62)–(2.65) also demonstrate that, far from a dynamic, radiating source,
the time-varying portion of the physical degrees of freedom in the metric is dominated by hTT ij .
If we expand the gauge-invariant fields , , i and hTT ij in powers of 1/r, then, at sufficiently
large distances, the leading-order O(1/r) terms will dominate. For the fields , and i ,
the coefficients of the 1/r pieces are simply the conserved mass d3 x ρ or the conserved linear
momentum − d3 x Si , from the conservation relations (2.56)–(2.58). Thus, the only time-varying
New Journal of Physics 7 (2005) 204 (http://www.njp.org/)
14 Institute of Physics ⌽ DEUTSCHE PHYSIKALISCHE GESELLSCHAFT
piece of the physical degrees of freedom in the metric perturbation at order O(1/r) is the TT
piece hTT
ij . An alternative proof of this result is given in exercise 19.1 of Misner et al [4].
Although the variables , , i and hTT ij have the advantage of being gauge-invariant, they
have the disadvantage of being non-local. Computation of these variables at a point requires
knowledge of the metric perturbation hab everywhere. This non-locality obscures the fact that
the physical, non-radiative degrees of freedom are causal, a fact which is explicit in Lorentz
gauge 4 . On the other hand, many observations that seek to detect GWs are sensitive only to the
value of the Riemann tensor at a given point in space (see section 3). For example, the Riemann
tensor components Ritjt , which are directly observable by detectors such as LIGO, are given in
terms of the gauge-invariant variables as
Ritjt = − 21 ḧTT
ij + ,ij
˙ (i,j) − 1 δ
+ 2
¨ ij . (2.70)
Thus, at least certain combinations of the gauge-invariant variables are locally observable.
The usual notion of ‘gravitational force’ disappears in general relativity, replaced instead by the
idea that freely falling bodies follow geodesics in spacetime. Given a spacetime metric gab and
a set of spacetime coordinates xa , geodesic trajectories are given by the equation
d 2 xa b
a dx dx
c
+ bc = 0, (3.1)
dτ 2 dτ dτ
where τ is a proper time as measured by an observer travelling along the geodesic. By writing the
derivatives in the geodesic equation (3.1) in terms of coordinate time t rather than proper time
τ, and by combining the a = t equation with the spatial, a = j equations, we obtain an equation
for the coordinate acceleration:
d 2 xi
= −(itt + 2itj vj + ijk vj vk ) + vi (ttt + 2ttj vj + tjk vj vk ), (3.2)
dt 2
where vi = dxi /dt is the coordinate velocity.
Let us now specialize to linearized theory, with the non-flat part of our metric dominated
by a GW in TT gauge. Further, let us specialize to non-relativistic motion for our test body. This
implies that vi 1, and to a good approximation we can neglect the velocity-dependent terms
in equation (3.2):
d 2 xi
+ i tt = 0. (3.3)
dt 2
In linearized theory and TT gauge,
i tt = itt = 21 2∂t hTT
jt − ∂j htt
TT
=0 (3.4)
Does this result mean that the GW has no effect? Certainly not! It just tells us that, in TT
gauge the coordinate location of a slowly moving, freely falling body is unaffected by the GW.
In essence, the coordinates move with the waves.
This result illustrates why, in general relativity, it is important to focus upon coordinate-
invariant observables—a naive interpretation of the above result would be that freely falling
bodies are not influenced by GWs. In fact, the GWs cause the proper separation between two
freely falling particles to oscillate, even if the coordinate separation is constant. Consider two
spatial freely falling particles, located at z = 0, and separated on the x-axis by a coordinate
distance Lc . Consider a GW in TT gauge that propagates down the z-axis, hTT ab (t, z). The proper
distance L between the two particles in the presence of the GW is given by
Lc Lc
√
L= dx gxx = dx 1 + hTTxx (t, z = 0)
0 0
Lc
xx (t, z = 0)] = Lc [1 + 2 hxx (t, z = 0)].
dx[1 + 21 hTT 1 TT
(3.5)
0
see, e.g., [36], chapter 21. This equation is valid to linear order in La ; fractional corrections
to this equation will scale as L/L, where L is the lengthscale over which the curvature
varies.
For application to GW detectors, the shortest such lengthscale L is the wavelength λ of
the GWs. Thus, the geodesic deviation equation will have fractional corrections of order L/λ.
For ground-based detectors L is a few km, while λ 3000 km (see section 6.1); thus the
approximation will be valid. For detectors with L λ (e.g. the space-based detector LISA), the
analysis here is not valid and other techniques must be used to analyse the detector.
A convenient coordinate system for analysing the geodesic deviation equation (3.7) is the
local proper reference frame of the observer who travels along the first geodesic. This coordinate
system is defined by the requirements
d2 Li (t)
= −Ritjt (t, 0)Lj (t). (3.10)
dt 2
Note that the key quantity entering into the equation, Ritjt , is gauge-invariant in linearized theory,
so we can use any convenient coordinate system to evaluate it. Using the expression (2.21) for
the Riemann tensor in terms of the TT gauge metric perturbation hTT ij , we find that
d2 Li 1 d2 hTT
ij
2
= 2
Lj . (3.11)
dt 2 dt
Integrating this equation using Li (t) = Li0 + δLi (t) with |δL| |L0 | gives
j
δLi (t) = 21 hTT
ij (t)L0 . (3.12)
components. We take the ends of one of the interferometer’s two arms as defining the two nearby
geodesics; the first geodesic is defined by the beam splitter at x = 0, the second by the end-mirror.
From equation (3.12) we then find that the distances L = |L| of the arms’ ends from the beam
splitter vary with time as
δLx δLy
= 21 h+ , = − 21 h+ . (3.13)
L L
(Here the subscripts x and y denote the two different arms, not the components of a vector.)
These distance changes are then measured via laser interferometry. Notice that the GW (which
is typically a sinusoidally varying function) acts tidally, squeezing along one axis and stretching
along the other. In this configuration, the detector is sensitive only to the + polarization of the
GW. The × polarization acts similarly, except that it squeezes and stretches along a set of axes
New Journal of Physics 7 (2005) 204 (http://www.njp.org/)
18 Institute of Physics ⌽ DEUTSCHE PHYSIKALISCHE GESELLSCHAFT
z z
x x
+ Polarization X Polarization
y y
Figure 1. Lines of force for a purely + GW (left), and for a purely × GW (right).
Figure kindly provided by Kip Thorne; originally published in [44].
that are rotated with respect to the x and y axes by 45◦ . The force lines corresponding to the two
different polarizations are illustrated in figure 1.
Of course, we do not expect nature to provide GWs that so perfectly align with our detectors.
In general, we will need to account for the detector’s antenna pattern, meaning that we will be
sensitive to some weighted combination of the two polarizations, with the weights depending
upon the location of a source on the sky, and the relative orientation of the source and the detector.
See [45], equations (104a,b) and associated text for further discussion.
Finally, in our analysis so far of detection, we have assumed that the only contribution
to the metric perturbation is the GW contribution. However, in reality time-varying near-zone
gravitational fields produced by sources in the vicinity of the detector will also be present. From
equation (3.10) we see that the quantity that is actually measured by interferometric detectors is
the spacetime–spacetime or electric-type piece Ritjt of the Riemann tensor (or more precisely the
time-varying piece of this within the frequency band of the detector). From the general expression
(2.70) for this quantity, we see that Ritjt contains contributions from both hTT ij describing GWs,
and also additional terms describing the time-varying near-zone gravitational fields. There is
no way for the detector to separate these two contributions, and the time-varying near-zone
gravitational fields produced by motions of bedrock, air, human bodies, and tumbleweeds can
all contribute to the output of the detector and act as sources of noise [46]–[48].
cf equation (2.15) (presented here in Lorentz gauge). In this section we will compute the leading-
order contribution to the spatial components of the metric perturbation for a source whose internal
motions are slow compared to the speed of light (‘slow-motion sources’). We will then compute
the TT piece of the metric perturbation to obtain the standard quadrupole formula for the emitted
radiation.
where f(t, x) is the radiative field, depending on time t and position x, and s(t, x) is a source
function. The Green’s function G(t, x; t , x ) is the field which arises due to a delta function
source; it tells how much field is generated at the ‘field point’ (t, x) per unit source at the ‘source
point’ (t , x ):
The field which arises from our actual source is then given by integrating the Green’s function
against s(t, x):
f(t, x) = dt d3 x G(t, x; t , x )s(t , x ). (4.4)
The Green’s function associated with the wave operator is very well known (see, e.g. [49]):
δ(t − [t − |x − x |/c])
G(t, x; t , x ) = − . (4.5)
4π|x − x |
The quantity t − |x − x |/c is the retarded time; it takes into account the lag associated with the
propagation of information from events at x to position x . The speed of light c has been restored
here to emphasize the causal nature of this Green’s function; we set it back to unity in what
follows.
Applying this result to equation (4.1), we find
Tab (t − |x − x |, x )
h̄ab (t, x) = 4 d3 x . (4.6)
|x − x |
As already mentioned, the radiative degrees of freedom are contained entirely in the spatial
part of the metric, projected transverse and traceless. Firstly, consider the spatial part of the
metric:
T ij (t − |x − x |, x )
h̄ij (t, x) = 4 d3 x . (4.7)
|x − x |
We have raised indices on the right-hand side, using the rule that the position of spatial indices
in linearized theory is irrelevant.
We now evaluate this quantity at large distances from the source. This allows us to replace
the factor |x − x | in the denominator with r = |x|. The corresponding fractional errors scale as
∼L/r, where L is the size of the source; these errors can be neglected. We also make the same
replacement in the time argument of Tij :
∂t T tt + ∂i T ti = 0, (4.10)
∂t T ti + ∂j T ij = 0. (4.11)
∂2t T tt = ∂k ∂l T kl . (4.12)
Multiply both sides of this equation by xi xj . We first manipulate the left-hand side:
This identity is easily verified6 by expanding the derivatives and applying the identity ∂i xj = δi j .
We thus have
This yields
4 3 4
d x Tij = d3 x 21 ∂2t (T tt xi xj ) + ∂k (T ik xj + T kj xi ) − 21 ∂k ∂l (T kl xi xj )
r r
2
= d3 x ∂2t (T tt xi xj )
r
2 ∂2
= d3 x T tt xi xj
r ∂t 2
2 ∂2
= d3 x ρ xi xj . (4.16)
r ∂t 2
6
Although one of us (SAH) was unable to do this simple calculation while delivering lectures at a summer school
in Brownsville, TX. Never attempt to derive the quadrupole formula while medicated.
2 d2 Iij (t − r)
h̄ij (t, x) = . (4.18)
r dt 2
When we subtract the trace from Iij , we obtain the quadrupole moment tensor:
This tensor eliminates vector components parallel to n, leaving only transverse components.
Thus,
Substituting equation (4.18) into (4.22), we obtain our final quadrupole formula:
2 d2 Ikl (t − r)
ij (t, x) =
hTT Pik (n)Pjl (n). (4.23)
r dt 2
where ηab ≡ diag(−1, 1, 1, 1). When gravity is weak this definition coincides with our previous
definition of h̄ab as a trace-reversed metric perturbation. We impose the harmonic gauge condition
√
∂a ( −ggab ) = ∂a h̄ab = 0. (4.25)
where flat ≡ ηab ∂a ∂b is the flat-spacetime wave operator and t ab is a pseudo-tensor that is
constructed from h̄ab . Taking a coordinate divergence of this equation and using the gauge
condition (4.25), shows that stress–energy conservation can be written as
∂a (T ab + t ab ) = 0. (4.27)
Equations (4.25)–(4.27) are precisely the same equations as are used in the linearized-
gravity derivation of the quadrupole formula, except for the fact that the stress–energy tensor
T ab is replaced by T ab + t ab . Therefore, the derivation of the last subsection carries over, with the
modification that the formula (4.17) for Iij is replaced by
Iij (t) = d3 x [T tt (t, x ) + t tt (t, x )]xi xj . (4.28)
In this equation the term t tt describes gravitational-binding energy, roughly speaking. For systems
with weak gravity, this term is negligible in comparison with the term T tt describing the rest-
masses of the bodies. Therefore, the quadrupole formula (4.23) and the original definition (4.17)
of Iij continue to apply to the more general situation considered here.
(More accurately, this is the total mass-energy of the source.) Next, we define the dipole moment:
M1 ≡ ρ xi d3 x = MLi . (4.30)
M 5/3 2/3
h= . (4.44)
r
Finally, we insert numbers corresponding to plausible sources:
5/3
−21 M 1 h 2/3 1 kiloparsec
h 10
2M P r
5/3
−22 M 0.01 second 2/3 100megaparsecs
10 . (4.45)
2.8M P r
The first line corresponds roughly to the mass, distance and orbital period (P = 2π/) expected
for the many close binary white dwarf systems in our galaxy. Such binaries are so common that
they are likely to be a confusion-limited source of GWs for space-based detectors, acting in
some cases as an effective source of noise. The second line contains typical parameter values
for binary neutron stars that are on the verge of spiralling together and merging. Such waves are
targets for the ground-based detectors that have recently begun operations. The tiny magnitude
of these waves illustrates why detecting GWs is so difficult.
At the most fundamental level, GWs can only be defined within the context of an approximation
in which the wavelength of the waves is much smaller than lengthscales characterizing the
background spacetime in which the waves propagate. In this section, we discuss perturbation
theory of curved spacetimes, describe the approximation in which GWs can be defined, and
derive the effective stress tensor which describes the energy content of GWs. The material in
this section draws on the treatments given in chapter 35 of Misner et al [4], section 7.5 of Wald
[51], and the review papers [31, 32].
gab = gab
B
+ εhab + ε2 jab + O(ε3 ). (5.1)
8
In units with G = 1 and for circular orbits of radius R, R3 2 = M.
= B bc
a
+ εδabc + O(ε2 ). (5.3)
Here B bc
a B
are the connection coefficients of the background metric gab , and the first-order
corrections to the connection coefficients are given by
where ∇aB is the covariant-derivative operator associated with the background metric.
Equation (5.4) can be derived more easily, at any given point in spacetime, by evaluating the
expression (5.2) in a coordinate system in which the background connection coefficients vanish
at that point, so that ∂a = ∇aB . The result (5.4) for general coordinate systems then follows from
general covariance.
Next, insert the expansion (5.3) of the connection coefficients into the formula
for the Riemann tensor. Evaluating the result in a coordinate system in which B bc
a
= 0 at the
point of evaluation gives
Ra bcd = ∂c B bd
a
− ∂d B bc
a
+ ε(∂c δabd − ∂d δabc ) + O(ε2 )
Here RB abcd is the Riemann tensor of the background metric and δRabcd = ∂c δabd − ∂d δabc is the
linear perturbation to the Riemann tensor. It follows from general covariance that the expression
for δRabcd in a general coordinate system is
δRa bcd = 21 (∇cB ∇bB ha d + ∇cB ∇dB ha b − ∇cB ∇ B a hbd − ∇dB ∇bB hac − ∇dB ∇cB ha b + ∇dB ∇ B a hbc ). (5.8)
Contracting on the indices a and c yields the linearized Ricci tensor δRbd :
where B ≡ ∇aB ∇ B a , indices are raised and lowered with the background metric and h = haa .
Reversing the trace to obtain the linearized Einstein tensor δGbd , and writing the result in terms
of the trace-reversed metric perturbation
∇ B a h̄ab = 0 (5.14)
by requiring that ξb satisfies the wave equation B ξb = ∇ B a h̄ab . We can further specialize the
gauge to satisfy h = 0. Dropping the primes, the metric perturbation is thus traceless and
transverse:
∇ B a hab = h = 0. (5.15)
(Note, however, that one cannot in this context impose the additional gauge conditions h0a = 0
used in the definition of TT gauge for perturbations of flat spacetime.)
To see that the traceless condition h = 0 can be achieved, note that the trace transforms as
h → h = h − 2∇ B a ξa . (5.17)
∇ B a ξa − h/2 = 0. (5.18)
We can choose initial data for ξa on any Cauchy hypersurface for which the quantity (5.18)
and also its normal derivative vanish. Since the quantity (5.18) satisfies the homogeneous wave
equation by equations (5.11) and (5.14), it will vanish everywhere.
The wave equation (5.16) differs from its flat spacetime counterpart (2.16) in two respects:
firstly, there is an explicit coupling to the background Riemann tensor; and secondly, there is
a coupling to the background curvature through the connection coefficients that appear in the
covariant wave operator B . In the limit (discussed below) where the wavelength of the waves is
much smaller than the lengthscales characterizing the background metric, these couplings have
the effect of causing gradual evolution in the properties of the wave. These gradual changes can
be described using the formalism of geometric optics, which shows that GWs travel along null
geodesics with slowly evolving amplitudes and polarizations. See [31] for a detailed description
of this formalism. Outside the geometric optics limit, the curvature couplings in equation (5.16)
can cause the dynamics of the metric perturbation to be strongly coupled to the dynamics of the
background spacetime. An example of such coupling is the parametric amplification of metric
perturbations during inflation in the early Universe [52].
Here the angular brackets · · · denote an average over lengthscales large compared to λ but small
compared to L; a suitable covariant definition of such averaging has been given by Brill and Hartle
[53]. A useful analogy to consider is the surface of an orange, which contains curvatures on two
different lengthscales: An overall, roughly spherical background curvature (analogous to the
background metric), and a dimpled texture on small scales (analogous to the GW). The regime
λ L is called the geometric optics regime.
We will argue below that the short-wavelength perturbation εhab gives rise to an effective
stress tensor of order ε2 h2 /λ2 , where h is a typical size of hab . This effective stress tensor
λ
εh . (5.21)
L
Since we are assuming that λ L, it follows that the short-wavelength piece εhab of the metric
is small compared to the background metric, and so we can use the perturbation formalism of
section 5.1. Consider now the splitting of the Riemann tensor into a background piece plus a
perturbation given by equation (5.6):
the same unique and local procedure as for the metric perturbation (5.20). This Riemann tensor
perturbation is often called the GW Riemann tensor; it is a tensor characterizing the GWs that
B
propagate in the background metric gab .
The operational meaning of the GW fields εhab and εδRabcd follows directly from the
equivalence principle and from their meaning in the context of flat spacetime (section 2).
Specifically, suppose that P is a point in spacetime and pick a coordinate system in which
B
gab = ηab and B bc
a
= 0 at P . Then we have
2
x
gab = ηab + O + εhab + O(ε2 ), (5.25)
L2
where x is the distance from P . Therefore, within a spacetime region around P in which x L,
the flat-spacetime perturbation theory and measurement analysis of section 2 can be applied.
Thus, the gravitational waveforms seen by observers performing local experiments will just
be given by components of the GW Riemann tensor in the observer’s local proper reference
frames.
We remark that the splitting of the metric into a background plus a linear perturbation
can sometimes be uniquely defined even in the regime λ ∼ L. Some examples are when
the background spacetime is static (e.g. perturbations of a static star), or homogeneous
(e.g. Friedman–Robertson–Walker cosmological models). In these cases the dynamic metric
perturbation are not actually GWs, although their evolution can be computed using the linearized
Einstein equation. For example, consider the evolution of a metric perturbation mode which is
parametrically amplified during inflation in the early Universe. At early times during inflation,
the mode’s wavelength λ is smaller than the Hubble scale (L); the mode is said to be ‘inside the
horizon’. Any excitation of the mode is locally measurable (although such modes are usually
assumed to start in their vacuum state). As inflation proceeds, the mode’s wavelength redshifts
New Journal of Physics 7 (2005) 204 (http://www.njp.org/)
30 Institute of Physics ⌽ DEUTSCHE PHYSIKALISCHE GESELLSCHAFT
and becomes larger due to the rapid expansion of the Universe, and eventually becomes larger
than the Hubble scale L; the mode is then ‘outside the horizon’. At this point, excitations in the
mode are not locally measurable and are thus not GWs. Finally, after inflation ends, the mode
‘re-enters the horizon’ and excitations of the mode are locally measurable. The mode is now a
true GW once again.
Finally, we note that for perturbations of flat spacetime, the definition of GWs given here
does not always coincide with the definition in terms of the TT component of the metric given
in section 2.2. However, far from sources of GWs (the regime relevant to observations), the
two definitions do coincide. This is because the TT piece of the metric will vary on scales of
a wavelength λ which is short compared to the lengthscale ∼r over which other pieces of the
metric vary (except for other dynamic pieces of the metric such as the time-varying quadrupole
term in the gauge-invariant field ; those pieces vary on short lengthscales but are unimportant
since they are smaller than the TT piece by a factor ∼λ2 /r 2 or smaller).
be computed by extending the computation of section 5.1 to one higher order, and is a sum of
terms of the form hab ∇cB ∇dB hef and (∇aB hbc )(∇dB hef ) with various index contractions; see equation
(35.58b) of MTW [4]. It is worth recalling that jab is a second-order metric perturbation. We
must take the calculation to second order to compute the effective stress–energy tensor of the
waves, since an averaging is involved—the first-order contribution vanishes by the oscillatory
nature of the waves.
Equating to zero the coefficients of the different powers of ε, we obtain the vacuum Einstein
equation for the background spacetime
B
Gab [gcd ] = 0, (5.27)
together with the equation for the second-order metric perturbation jab
(1)
Gab [jcd ; gef
B
] = −G(2)
ab [hcd ; gef ].
B
(5.29)
gab = (gab
B
+ ε2 jab ) + (εhab + ε2 jab ) + O(ε3 ), (5.31)
where the first term varies slowly on lengthscales ∼L and the second term varies rapidly on
lengthscales ∼λ. Consider next the average of the second-order Einstein equation (5.29). Using
the fact that the averaging operation · · · commutes with derivatives, we get
(1)
Gab [jcd ; gef
B
] = −G(2)
ab [hcd ; gef ].
B
(5.32)
Subtracting equation (5.32) from equation (5.27) gives an equation for jab :
(1)
Gab [jcd ] = −G(2)
ab [hcd ; gef ] + Gab [hcd ; gef ].
B (2) B
(5.33)
1 B
GW,eff
Tab = ∇a h̄cd ∇bB h̄cd − 21 ∇aB h̄∇bB h̄ − ∇aB h̄bc ∇dB h̄cd − ∇bB h̄ac ∇dB h̄cd . (5.37)
32π
1
GW,eff
Tab = ∇ B hcd ∇bB hcd . (5.38)
32π a
For example, for the plane wave propagating in the z direction in flat spacetime, given by
h20 ω2 h2 ω2
T tt = T tz = cos2 (ωt − ωz) = 0 . (5.40)
16π 32π
If we restore factors of G and c, and insert numbers typical of bursts of waves that we hope to
detect, we get the energy flux
2 2
−2 h0 f
T = 1.5 mW m
tz
, (5.41)
10−22 1 kHz
where f = ω/(2π). Note that this is a large energy flux by astronomical standards, despite the
tiny value of h0 ; it is comparable to the flux of reflected sunlight from a full moon [32].
Having now reviewed the basic theory and properties of GWs, we conclude this paper by very
briefly surveying the properties of important potential sources of GWs. Our goal is to give
some indication of the value that GWs may provide for astronomical observations; much of this
material is updated from a previous survey paper [29]. We note that since the focus of this paper
is intended to be the theory of GW sources (and that this paper is significantly longer than was
intended or requested), we are quite a bit more schematic in our treatment here than we have been
Electromagnetic waves interact strongly with matter; GWs do not. The weak interaction of GWs
is both blessing and curse: it means that they propagate from emission to Earth-bound observers
with essentially zero absorption, making it possible to probe astrophysics that is hidden or dark
to electromagnetic observations—e.g. the coalescence and merger of black holes, the collapse
of a stellar core and the dynamics of the early Universe. It also means that detecting GWs is
very difficult. Also, because many of the best sources are hidden or dark, they are very poorly
understood today—we know very little about what are likely to be some of the most important
sources of GWs.
Electromagnetic radiation typically has a wavelength smaller than the size of the emitting system,
and so can be used to form an image of the source. This is because electromagnetic radiation is
usually generated by moving charges in the environment of some larger source—e.g. an atomic
transition in interstellar gas, or emission from hot plasma in a stellar environment. By contrast,
the wavelength of gravitational radiation is typically comparable to or larger than the size of
the radiating source. GWs are generated by the bulk dynamics of the source itself—e.g. the
motion of neutron stars in a binary. As a consequence, GWs cannot be used to form an image:
the radiation simply does not resolve the generating system. Instead, GWs are best thought of
as analogous to sound—the two polarizations carry a stereophonic description of the source’s
dynamics.
INITIAL LIGO
SU
10-19
SP
EN
SI
SEISM
NO
TH
ER
GR
IC
RA
M
AVI
AL
DI
AT
TY
IO
10-21
N
GR
PR
OT
AD
h(f) [Hz-1/2]
ES
SH
SU
IEN
RE
TN
OIS
TES
E
T MA
10-23 SS IN
TER
NAL
RESIDUAL GASS, 10-6 TORR H2
ST
RA
Y
LI FACILITY
GH
T RESIDUAL GAS, 10-9 Torr H2
10-25
1 10 100 1000 10000
Frequency (Hz)
Figure 2. Sensitivity goals of the initial LIGO interferometers, and facility limits
on the LIGO sensitivity (taken from [25]).
We now survey the more well-understood possible sources of measurable GWs in the high-
frequency band. We emphasize at this point that such a listing of sources can in no way be
considered comprehensive: we are hopeful that some GW sources may surprise us, as has been
the case whenever we have studied the Universe with a new type of radiation.
6.1.1. Coalescing compact binaries. Compact binaries—binary star systems in which each
member is a neutron star or black hole—are currently the best-understood sources of GWs.
Double neutron stars have been studied observationally since the mid-1970s; five such systems
[12]–[16], tight enough to merge within a few 108 or 109 years have been identified in our
Galaxy. Extrapolation from these observed binaries in the Milky Way to the Universe at large
[61]–[64] indicates that GW detectors should measure at least several and at most several hundred
binary neutron star mergers each year (following detector upgrades; the expected rate for initial
detectors is of the order of one event per several years, so that measurement of an event is plausible
but of fairly low probability). Population synthesis (modelling evolution of stellar populations)
indicates that the measured rate of binaries containing black holes should likewise be interestingly
large (perhaps even for initial detectors) [65]–[68]. The uncertainties of population synthesis
calculations are rather large, however, due to poorly understood aspects of stellar evolution
6.1.2. Stellar core collapse. Core collapse in massive stars (the engine of type II supernova
explosions) has long been regarded as likely to be an important source of GWs; see, for example,
[69] for an early review. Stellar collapse certainly exhibits all of the necessary conditions for
strong GW generation—large amounts of mass (1–100M ) flow in a compact region (hundreds
to thousands of kilometres) at relativistic speeds (v/c 1/5). However, these conditions are not
sufficient to guarantee strong emission. In particular, the degree of asymmetry in collapse is not
particularly well understood.
If the core of a star is very rapidly rotating during collapse, then instabilities may develop
which lead to strong GW emission [70]. If such instabilities develop, core collapse GWs could
be detected from events as far away as 10 Megaparsecs [71], a distance encompassing enough
galaxies that several events per year would be likely. Most models of massive stars, however,
indicate that such rapid rotation is not likely (e.g. [72]). Even without the growth of instabilities,
the asymmetric dynamics of core collapse is likely to lead to wave emission which would be
detectable within the Local Group of galaxies, with perhaps an event every few years detectable
by advanced interferometers [73]. The wave strength is likely to correlate strongly with the degree
of asymmetry in the supernova. If the GW event has an electromagnetic or neutrino counterpart,
we may gain a wealth of knowledge regarding the state of the precollapse core [74].
6.1.3. Periodic emitters. Periodic sources of GWs radiate at constant or nearly constant
frequency, like radio pulsars. In fact, the prototypical source of continuous GW is a rotating
neutron star, or GW pulsar. A non-axisymmetry in a neutron star crust (caused, for example, by
an oblateness that is misaligned with the star’s spin axis) will radiate GWs with characteristic
amplitude
G I f 2
h∼ . (6.2)
c4 r
Here I is the star’s moment of inertia, f is the wave frequency, r is the distance to the source
and is the dimensionless fractional distortion = (Ixx − Iyy )/I, where Iij is the moment of
inertia tensor. The crucial parameter characterizes the degree to which the star is distorted; it
is rather poorly understood. Upgraded interferometers in LIGO could set an upper limit on of
order 10−6 for sources at ∼10 kpc [27]. Various mechanisms have been proposed to explain how
a neutron star can be distorted to give a value of that is interesting as a GW source; see [75, 76]
for further discussion. Examples of some interesting mechanisms include misalignment of a
star’s internal magnetic field with the rotation axis [77] and distortion by accreting material from
a companion star [78, 79] (discussed in more detail below).
Whatever the mechanism generating the distortion, it is clear that will be small, so that
h ∼ 10−24 or smaller—quite weak. Measuring these waves will require coherently tracking their
signal for a large number
√ of wave cycles. Coherently tracking N cycles boosts the signal-to-noise
ratio by a factor ∼ N. This is actually fairly difficult, since the signal is strongly modulated by the
Earth’s rotation and orbital motion, ‘smearing’the waves’power across multiple frequency bands.
Searching for periodic GWs means demodulating the motion of the detector, a computationally
intensive problem since the modulation is different for every sky position. Unless one knows
in advance the position of the source, one needs to search over a huge number of sky position
New Journal of Physics 7 (2005) 204 (http://www.njp.org/)
38 Institute of Physics ⌽ DEUTSCHE PHYSIKALISCHE GESELLSCHAFT
‘error boxes’, perhaps as many as 1013 . One rapidly becomes computationally limited10 . (Radio
pulsar searches face this same problem, with the additional complication that radio pulses are
dispersed by the interstellar medium. However, radio observers usually use shorter integration
times, and often target their searches on small regions of the sky, so their computational cost is
usually not as great.) For further discussion, see [80]; for ideas about doing hierarchical searches
that require less computer power, see [81].
As mentioned above, one particularly interesting mechanism for distorting a neutron star
comes from accretion of material from a companion star. Accretion provides a spin-up torque to
a neutron star,
(dJ/dt)spin-up ∼ R2 ∗ Ṁ (6.3)
(where J is the spin angular momentum, ∗ is the orbital frequency of the accreting matter as it
plunges onto the star, R is the star’s radius and Ṁ is the mass accretion rate). Without any kind
of braking mechanism, the neutron star would presumably spin-up until it reaches the ‘breakup
limit’, i.e., the spin frequency at which centrifugal forces would begin to break it apart; the
breakup frequency is typically around 2000–3000 Hz.
Observations have shown [82] that accreting neutron stars do, in fact, appear to have a
‘speed limit’—no accreting neutron star has been observed to spin faster than 619 Hz [83]. This
is consistent with the fact that the fastest radio pulsar11 has a spin period of 641 Hz [84]. This
suggests that some mechanism is removing angular momentum from the neutron star. A plausible
and very attractive possibility of how this angular momentum is removed is via GW emission.
Because the spin-down torque due to GW emission grows sharply with spin frequency,
the limiting spin obtained by balancing the torques (6.2) and (6.4) is relatively insensitive to
the mass accretion rate Ṁ. Such a mechanism was originally suggested by Wagoner [85], and
was revived by Bildsten [78] to explain the narrow clustering in the spin frequency of accreting
low-mass x-ray binaries (LMXBs). Various mechanisms could provide the spin-down torque—
Bildsten originally suggested that a quadrupole moment in the spinning star could be induced
by a thermally varying electron capture mechanism, but also noted that the r-mode instability
(see, e.g. [86] for a review) could be excited, leading to a similar spin-down law. Whatever
the mechanism, accreting neutron stars are obvious and very attractive targets for observing
campaigns with GW detectors, particularly given that their sky positions are well known.
10
This rather large number of patches on the sky is driven by the possible need to search for high-frequency pulsars
over several months of observation. The difference f between the Doppler frequency shifts for two adjacent
sky patches separated by an angle δθ is of order f ∼ v⊕ fδθ/c, where v⊕ ∼ 3 × 104 ms−1 is the Earth’s orbital
velocity and f is the gravitational wave frequency. The phase error over an observation time Tobs is of order f Tobs .
Demanding that this be less than unity yields δθ c/(v⊕ f T). The number of independent sky patches is then
Np ∼ 4πδθ −2 ∼ 4πv2⊕ f 2 Tobs
2
/c2 ∼ 1013 for f = 1000 Hz and Tobs = 1/3 year. Fewer positions would be needed
if either the maximum frequency or the integration time is reduced; the figures given here set the maximum values
that are plausible. See [80] for more details.
11
The so-called ‘recycled’ radio pulsars spin at frequencies ∼ several hundred Hertz; they are believed to be the
fossils of accreting neutron stars.
60
o
Sun
Mercury
Venus
local ground motions, gravitational coupling to fluctuations in the local mass distribution
ultimately limits the sensitivity to frequencies f 1 Hz. Nonetheless, many extremely
interesting sources radiate in this band. The only way to measure these waves is to build a
detector in the quiet environment of space, far removed from low-frequency noise sources.
Such an instrument is currently being designed jointly by NASA in the United States and
ESA, the European Space Agency: LISA, the Laser Interferometer Space Antenna. If all goes
well, LISA will be launched into orbit in 2013 or so. LISA will be a laser interferometer, similar
in concept to the ground-based detectors: changes in the distance between widely separated test
masses will be monitored for GWs. However, LISA’s scale is vastly larger than that of its ground-
based cousins, and so details of its operations are quite different. In particular, LISA will have
armlengths L 5 × 106 km. The three spacecrafts which delineate the ends of LISA’s arms are
placed into orbits such that LISA forms a triangular constellation orbiting the sun, inclined 60◦
with respect to the plane of the ecliptic and following the Earth with a 20◦ lag. This configuration
is sketched in figure 3. Since it essentially shares Earth’s orbit, the constellation orbits the sun
once per year, ‘rolling’ as it does so. This motion plays an important role in pinpointing the
position of sources by modulating the measured waveform—the modulation encodes the source
location and makes position determination possible.
Each of the three spacecrafts contains two optical assemblies, each of which houses a 1 W
laser and a 30 cm telescope. Because of the extreme lengths of the interferometer’s arms, Fabry–
Perot interferometry as in the ground-based detectors is not possible: diffraction spreads the
laser beam over a diameter of about 20 km as it propagates from one spacecraft to the other. A
portion of that 20 km wavefront is sampled with the telescope. That light is then interfered with
a sample of light from the on-board laser. Each spacecraft thus generates two interference data
streams; six signals are generated by the full LISA constellation. From these six signals, we can
construct the time variations of LISA’s armlengths and then build both GW polarizations. More
information and details can be found in [93].
Note that the LISA armlengths are not constant—as the constellation orbits, the distances
between the various spacecrafts vary by about 1% (including effects such as planetary
perturbations). These variations are far larger than the displacements produced by GWs, which
are of the order of picometers. However, these variations occur over timescales of the order of
months, and are extremely smooth and well modelled. It will not be difficult to remove them from
the data leaving clean data in the interesting frequency band. Picometer scale variations are not too
6.2.1. Periodic emitters. In the high-frequency band, the source of most periodic GWs is
expected to be isolated neutron stars. LISA’s periodic GWs will come primarily from binary star
systems in the Milky Way, primarily close white dwarf binaries. Most of these systems do not
generate waves strong enough to backreact significantly, so that their frequencies do not change
measurably over the course of LISA’s observations. Certain systems are well known in advance
to be sources of periodic waves for the LISA band. These sources are understood well enough
from optical observations that they may be regarded as ‘calibrators’—LISA should detect them
or else something is wrong!
Aside from these sources that are known in advance, it is expected that LISA will discover
a good number of binary systems that are too faint to detect with telescopes. Joint observations
by LISA and other astronomical instruments are likely to be more fruitful than observations with
a single instrument alone. For example, it is typically difficult for telescopes to determine the
inclination of a binary to the line of sight (a quantity needed to help pin down the masses of the
binary’s members). GWs measure the inclination angle almost automatically, since this angle
determines the relative magnitude of the polarizations h+ and h× .
The total number of periodic binaries radiating in LISA’s band is expected to be so large that
they will constitute a confused, stochastic background at low frequencies—there are likely to be
several thousand galactic binaries radiating in each resolvable frequency bin. This background
will constitute a source of ‘noise’ (from the standpoint of measuring other astrophysical sources)
that is larger than that intrinsic to the instrument noise at f 10−3 Hz.
6.2.2. Coalescing binary systems containing black holes. Coalescing binary black hole systems
will be measurable by LISA to extremely large distances—essentially to the edge of the
observable Universe. Even if such events are very rare, the observed volume is enormous so
an interesting event detection rate is very likely. One class of such binaries consists of systems
in which the member holes are of roughly equal mass (∼105 –108 M ). These binaries can form
following the merger of galaxies (or pregalactic structures) containing black holes in their cores.
Depending on the mass of the binary, the waves from these coalescences will be detectable
to fairly large redshifts (z ∼ 5–10), possibly probing an early epoch in the formation of the
Universe’s structure [94].
The other major class of binary systems consists of relatively small bodies (black holes
with mass ∼10–100M , neutron stars or white dwarfs) that are captured by larger black holes
(M∼105 –107 M ). These ‘extreme mass ratio’ binaries are created when the smaller body is
captured onto an extremely strong field, highly relativistic orbit, generating strong GWs. Such
New Journal of Physics 7 (2005) 204 (http://www.njp.org/)
42 Institute of Physics ⌽ DEUTSCHE PHYSIKALISCHE GESELLSCHAFT
systems are measurable to a distance of a few Gigaparsecs if the in-spiralling body is a 10M
black hole, and to a distance of a few hundred Megaparsecs if the body is a neutron star or
white dwarf. LISA will measure the waves that come from the last year or so of the smaller
body’s in-spiral, and thence probe the nature of the larger black hole’s gravitational field deep
within the hole’s potential. The rates for such events are not too well understood and depend
on the details of stellar dynamics in the cores of galaxies. Extremely conservative estimates
typically find that the rate of measurable events for LISA should be at least several per year
[95, 96]. Recent thinking suggests that these rates are likely to be rather underestimated—black
holes (which are measurable to much greater distances) are likely to dominate the measured rate,
perhaps increasing the rate to several dozen or several hundred per year [97].
Finally, it is worth noting that many events involving intermediate mass black holes—those
with masses in the band running from a few 102 to a few 105 M —would generate GWs in LISA’s
sensitive band. There is a large body of tentative evidence for the existence of black holes in this
mass band (see, e.g. [98] for a review), though as yet we have no ‘smoking gun’ unambiguous
signature for such a hole. If such black holes do exist and undergo mergers in sufficient numbers,
measurement of their waves will make possible a wealth of interesting tests of relativity [99],
and could untangle some of the mysteries surrounding supermassive black hole formation and
growth.
gw h2100 < 9.3 × 10−8 , 4 × 10−9 < f < 4 × 10−8 Hz (6.8)
(where the limit is a 95% confidence limit and h100 is the Hubble constant in units of
100 km second−1 Mpc−1 ).
The upper limit (6.8) already places constraints on some cosmological models (in particular
those involving cosmic strings). With further observations and the inclusion of additional pulsars
in the datasets, it is likely to improve quite soon. It is possible that the background in this band
will be dominated by many unresolved coalescing massive binary black holes [105]—binaries
that are either too massive to radiate in the LISA band, or else are in-spiralling towards the
LISA band en route to a final merger several centuries or millenia hence. Constraints from pulsar
observations in this band will remain an extremely important source of data on stochastic waves
in the future—the limits they can set on gw are likely to be better than can be set by any of the
laser interferometric detectors.
where mP is the Planck mass. Measuring these GWs would be a direct probe of inflationary
physics and would determine the inflation energy scale, which is currently unknown to within
many orders of magnitude. These waves have been described as the ‘smoking gun’ signature of
inflation [106].
During inflation, quantum fluctuations impact both the scalar field which drives inflation
(the inflaton φ) and the metric of spacetime. There exist independent scalar fluctuations (coupled
fluctuations in the inflaton and scalar-type fluctuations in the metric) and tensor fluctuations
(tensor-type fluctuations in the metric). The Fourier modes of these scalar and tensor perturbations
are describable as harmonic oscillators in the expanding Universe [107]. Each mode undergoes
zero-point oscillations in the harmonic potential. However, the potential itself is evolving due
to the expansion of the Universe. The evolution of this potential parametrically amplifies these
zero-point oscillations, creating quanta of the field [87]. During inflation, the Universe’s scale
factor a(t) grows faster than the Hubble length H −1 , and so each mode’s wavelength likewise
grows faster than the Hubble length. The mode’s wavelength eventually becomes larger than the
Hubble length, or the mode ‘leaves the horizon’. After inflation ends the mode subsequently re-
enters the horizon. For gravitational perturbations, the number of quanta generated in the mode
is proportional to the factor by which the Universe expands between the two different horizon
crossings. Fluctuations in the inflaton seed density fluctuations, δρ(r) = δφ(r)(∂V/∂φ) (where
on the sphere of the ‘ordinary’ spherical harmonics Ylm (n̂); see [109] for details. Because scalar
perturbations have no handedness, they only induce gradient-type polarization. GWs induce both
gradient- and curl-type polarization. Thus, an unambiguous detection of the curl-type polarization
would confirm production of GWs by inflation. (A caveat is that gravitational lensing can convert
E-modes to B-modes; this so-called ‘cosmic shear’ ultimately limits the sensitivity to GWs of
CMB polarization studies [110].)
7. Conclusion
This paper has summarized many of the most important topics in the theory of GWs. Due to
space and time limitations, we sadly were not able to cover all topics with which students of
this field should be familiar. In particular, we had hoped to include a discussion of strong field
relativity and GW emission. We confine ourselves, in this conclusion, to a (very) brief discussion
of important aspects of this subject for GW science, as well as pointers to the relevant literature.
Linearized theory as described in sections 2 and 5 is entirely adequate to describe the
propagation of GWs through our Universe and to model the interaction of GWs with our detectors.
In some cases, it is even adequate to describe the emission of waves from a source, as described
in section 4 (although for sources with non-negligible self-gravity such as binary star systems,
one has to augment linearized theory as described in section 4.2). However, many sources have
very strong self-gravity where the linearized treatment is completely inadequate. A variety of
formalisms have been developed to handle these cases.
• Post-Newtonian (PN) theory. PN theory is one of the most important of these formalisms,
particularly for modelling binary systems. Roughly speaking, PN theory analyses sources
using an iterated expansion in two variables: the ‘gravitational potential’, φ ∼ M/r, where
M is a mass scale and r characterizes the distance from the source; and velocities of internal
motion, v. (In linearized theory, we assume φ is small but place no constraints on v.)
Newtonian gravity emerges as the first term in the expansion, and higher-order corrections
are found as the expansion is iterated to ever higher order. Our derivation of the quadrupole
We thank R Price and J Pullin for the invitation to write this paper, and are profoundly grateful to
T Smith at the Institute of Physics for patiently and repeatedly extending our deadline as we wrote
this paper. We are grateful to two anonymous referees for detailed and helpful comments, as well
as to helpful comments from Hongbao Zhang. SAH thanks the Caltech TAPIR group and the
Kavli Institute for Theoretical Cosmology at the University of Chicago for their hospitality while
this paper was completed. EEF is supported by NSF grant PHY-0140209; SAH is supported by
NSF grant PHY-0244424 and NASA grant NAGW-12906.
In this appendix, we show that one can always find TT gauges in local vacuum regions in
linearized gravity. More precisely, suppose that V is a connected open spatial region and (t0 , t1 )
is an open interval of time. Then one can find a gauge on the product R ≡ (t0 , t1 ) × V that
satisfies htt = hti = δij hij = ∂i hij = 0, as long as Tab = 0 throughout R.
The proof involves a generalization of the gauge-invariant formalism of section 2.2 to finite
spacetime regions. We define a decomposition of the metric perturbation hab in terms of quantities
φ, βi , γ, hTT
ij , H, εi and λ using the same equations (2.29)–(2.35) as before. However we replace
the boundary conditions (2.36) with
t
γ|∂V = dt φ|∂V , (A.1)
t0
λ|∂V = 0, (A.2)
where n is the unit outward-pointing unit normal to ∂V . The reason for this particular choice
of boundary conditions will be explained below. These boundary conditions define a unique
decomposition of the metric within R.
Next, we compute how the variables φ, βi , γ, hTT
ij , H, εi and λ transform under general gauge
transformations. We use the same parametrization (2.37) of the gauge transformation as before,
except that we impose now the boundary condition C|∂V = 0. We find that the transformation
laws (2.38)–(2.44) are replaced by the following equations which contain some extra terms:
φ → φ − Ȧ, (A.5)
βi → βi − Ḃi − ∂i ψ, (A.6)
γ → γ − A − Ċ + ψ, (A.7)
H → H − 2∇ 2 C, (A.8)
λ → λ − 2C, (A.9)
Here ψ is the time-independent, harmonic function defined by ∇ 2 ψ = 0 and ψ|∂V = A|∂V ,t=t0 .
Similarly ηi is the time-independent, harmonic transverse vector defined by ∇ 2 ηi = 0 and
(n × n)|∂V = n × B|∂V ,t=t0 .
We define the variables , and i by the same equations (2.45)–(2.47) as before. From
the transformation laws (A.5)–(A.11) these variables are still gauge-invariant, while hTT
ij is no
longer gauge-invariant in the present context. Next, imposing the linearized vacuum Einstein
equations using the expressions (2.59)–(2.61) yields
∇ 2 = 0, ∇ 2 i = −2∂i ,
˙ ∇2 = 23
¨ (A.12)
in V . The boundary conditions (A.1)–(A.4) together with the definitions (2.45)–(2.47) imply that
the boundary conditions on the gauge-invariant variables are
(This is why we choose those particular boundary conditions.) Therefore, all the gauge-invariant
variables vanish, = = i = 0 in R.
It is now straightforward to show that one can choose a gauge in which φ = βi = γ = H =
εi = λ = 0. From the transformation laws (A.5)–(A.11), we can choose C to make λ = 0, choose
Ȧ to make φ = 0 and choose Ḃi to make βi = 0. The residual gauge freedom is then parametrized
by functions A and Bi that are time-independent. Next, from equation (A.13) together with the
definitions (2.45)–(2.47) it follows that
0 = = 13 H, (A.14)
0= = −2γ̇, (A.15)
0 = i = − 21 ε̇i . (A.16)
Thus the only remaining non-zero pieces of the metric other than the TT piece are γ and εi ,
and these are both time-independent. Finally, we can use the residual gauge freedom given by
time-independent functions A and Bi to set γ and εi to zero, by equations (A.7) and (A.10). (For
this purpose A and Bi will vanish on ∂V , by equations (A.1) and (A.4), so ψ and ηi vanish.)
References
[1] Einstein A 1905 On the electrodynamics of moving bodies Ann. Phys. 17 891
[2] Newton I 1687 Philosophiae Naturalis Principia Mathematica 2nd edn (London: Streater); from the General
Scholum added at the end of the 3rd book in the 2nd edn of 1713
[3] Newton I 1687 Philosophiae Naturalis Principia Mathematica 1st edn (London: Streater). All quotes from
the Principia are taken from [4], Box 1.10
[4] Misner C W, Thorne K S and Wheeler J A 1973 Gravitation (San Francisco: Freeman)
The crucial second term in the square brackets was left out of the published version.
We thank Ryan Lang at MIT for bringing this error to our attention.