To Differential Geometry: Robert Bartnik
To Differential Geometry: Robert Bartnik
to
Differential Geometry
Robert Bartnik
January 1995
References
125
1 Surfaces
Outline:
One advantage of an index notation is that the generalisation of many of our calcula-
tions to the case of n-dimensional surfaces in Rn+l, n ~ 2, is then very simple; another
advantage is that we may use the Einstein summation convention:
The summation convention a!lows us to express concisely many otherwise lengthy and
repetitive formulae.
The coordinate tangent vectors to S are the vectors X 11 X 2 in R 3 defined by
ax ax
x1 = Xu = au = au1'
X_ aX ax
x2 = v - av = au2.
(1)
126
where ya, a = 1, 2, are the coefficients of V in the basis X a.
Using the inner product (·, ·) for vectors in R 3 we define the metric or first funda-
mental form of S, by
g(V, W) = (V, W) (2)
for any tangent vectors V, Won S. By linearity we may express the metric in terms of
the components
(4)
defines a curve inS; more precisely, "Y: [0; 1] ---+ R 2 describes a curve in a coordinate
chart of S, and the curve in S c R 3 is given by
d)' - d"'(a X - . aX
dt - dt a-"'( a
127
A unit normal vector N to Sis determined up to ±N, and may be described using,
the vector cross product in R 3 by the formula
N= Xu X Xv.
IXu X Xvl
This formula does not generalise so easily 1 to the case of hypersurfaces in Rn+l, n :2:: 3;
but this is not a significant problem, since the existence of a normal vector is not in
question.
Iff: Rn --T R, and if 'Y: t 1----t 'Y(t) ERn is a curve in Rn, then f o "f: R --T R
is a function of one variable and the chain rule gives
d n aj d"(i
d/ 0 'Y(t) = ~ axi dt
•=1
·i aj ( )
="~-a··
x• 6
which depends on the tangent vector .:Y E Rn to the curve 'Y· We call this the directional
derivative off in the direction /y,
·iaf
D -r f ="f-a.. (7)
x•
If V is a vector in Rn, based at a point x E Rn, then we may always find a curve
through x in the direction V (for example, 'Y(t) = x +tV), and then we define the
directional derivative by
Dv f = .:Yi(o)aaf_
x•
(x), (8)
where x = "1(0) and V = .:Y(O). Notice that (7) shows that this expression is independent
of the choice of curve representing V, so the definition (8) is unambiguous. We may
rewrite (8) in the elegant form
iaf
Dvf=V-a.,
x•
which makes the relation with the chain rule very explicit.
The definition of directional derivative of a function may be easily extended to
vector fields in Rn. Thus, for example, if Y, Z are two vector fields in R 3 , so Y =
(Y 1 ( x, y, z), Y 2 ( x, y, z), Y3 ( x, y, z)), then the directional derivative of Z in the direction
Y is the vector field
DvZ = y1~z + y2~z + ya~z
ax ay az
.a
= yt_a.z.
x•
1 The generalisation defines the normal vector using n (coordinate) tangent vectors X~, ... , x .. and
128
If Y, Z are vector fields tangent to the surface S, then we may decompose Dy Z into
components tangential and normal to S,
is called the covariant derivative on S of the vector field Z in the direction Y, and the
normal component
II(Y, Z) = (DyZ, N) (11)
is the second fundamental form of S. Since
it follows that
II(Y, Z) = - (Z, Dy N) (13)
and thus we may interpret II geometrically as describing the "bending" of the normal
vector as we move around the surface. It is clear from (13) that II(Y, Z) depends
linearly on the tangent vectors Y, Z;
(14)
(15)
(16)
Since Xab = EP Xjauaaub = Xba, this implies IIab = IIba or in terms of general tangent
vectors Y, Z,
II(Y, Z) = II(Z, Y). (17)
129
Thus the second fundamental form is a symmetric bilinear form on tangent vectors to
s.
The principal curvatures of S are the eigenvalues of II with respect to the metric g;
equivalently, they are the roots of the polynomial equation
The principal vectors e 1 , e2 of II are tangent vectors which are orthonormal with respect
to g,
where .\ 1 , .\ 2 are the principal curvatures. These relations may be written more suc-
cinctly as
g(ea,eb) (19)
II(ea, eb) (20)
(22)
where (gab) = (9ab)- 1 is the inverse metric. Another interpretation of the principal
curvatures is as the negative eigenvalues of the Weingarten map (21), or using (22), as
the eigenvalues of the matrix
II a=
b
gben ac· (23)
The principal vectors are then the eigenvectors of II~, normalised to unit length -
because IIab is symmetric, the Principal Axis Theorem of elementary linear algebra
implies that the eigenvectors of II~ are orthogonal with respect to gab·
130
The mean curvature H and Gauf3 curvature K of S are defined using symmetric
functions of the principal curvatures .A 1 , .A2 by
Unlike the second fundamental form II(Y, Z), the covariant derivative \i'yZ cannot
depend only on the value of the vectors Y, Z at a point (see (14)), but must involve the
derivative of the coefficients of Z, since the total directional derivative DyZ involves
the derivative of Z. Explicitly, by expanding Y, Z in the basis Xa we obtain
( ya~zb)
aua X b + (yazbrc) X ab c
(29)
Defining r abc = 9cdf~b we obtain using (15)
r abc 9cdr~b = ('V Xaxb, Xc)
(DxaXb,Xc)
(Xab, X c) (30)
which gives a simple formula for computing the Christoffel symbol r~b = gcdr abd and
hence the covariant derivative (28). Remarkably, the Christoffel symbol may be ex-
pressed by formulae which use only the metric 9ab,
fabc =
1(a
2 a
auagbc + oub9ac- ouc9ab '
a )
c 1 cd( a a a ) (31)
rab = 2g aua9bd + oub9ad- oud9ab .
131
These formulae follow from the useful identity
DxJg(Xb,Xc))
g('\7 Xaxb, Xc) + g(Xb, '\7 xaXc)· (33)
where [Dv, Dw] denotes the commutator of the differential operators Dv, Dw. By
.a
expanding V = V'-;:;-:-, W
.a
= W 1 -;:;-:- we find that
uX' uXJ
.a . . a ·) a
D[V,wJf = (VJ axj W' - WJ axj V' axi j,
using the fact that partial derivatives commute, and thus we see that [V, W] is a vector
field with coefficients
(35)
If Y, Z are tangent vectors to S, and ya, zb are their coefficients with respect to the
basis X a a = 1, 2 of tangent vectors to S (rather than the basis a; = a~i, i = 1, 2, 3 of
tangent vectors to R 3 ), then their Lie bracket is given by
(36)
(37)
132
This can be seen most easily by noting that
(41)
where
Rabcd R()(a,)(b,)(c,)(d)
g( (\1 X a \1 xb - \1 Xb \1 xJ)(c, )(d)
a a
auarbcd- aubracd +rae bde- be ade
er rer (42)
since [Xa, Xb] = 0. Notice in particular from (31) and (42) that the curvature Rabcd may
be expressed by a formula which involves only the metric gab and its first and second
derivatives. Further properties of the Riemann curvature will be developed later.
Alternatively we may use (42) to express Rabcd in terms of)( and its derivatives
)(a = axjaua, )(ab = a 2)(jauaaub and )(abc = (fJ )(jauaaubauc. By systematically
expanding into tangential and normal components we may obtain some fundamental
identities of surface theory. From (9), (15), (16) and (29) we have
(43)
(44)
133
and taking the normal component gives the Codazzi-Mainardi identity
(45)
(46)
which should be compared with the expression (34) for the covariant derivative of a
vector Z.
The tangential component of (44) gives the Gauft identity
(47)
and thus, for sur:faces .S in R 3 , essentially the only non-vanishing curvature component
is R 1221 • From (47) we derive a relation between the GauB curvature K = A1 A2 , defined
using the bending of the embedding S C R 3 and the Riemann curvature, defined using
the intrinsic length measure gab, namely
AA _ K _ R1221 = R1221
(49)
1 2 - - det(gab) gug22 - (gl2) 2 .
134
2 Manifolds
Outline:
The aim of this chapter is to introduce the fundamental concept of a manifold. The
systematic formulation of this definition was a major achievement of early 20th century
geometry, and laid the foundations for a vast amount of work in topology, analysis and
geometry.
Roughly speaking, a manifold is an n-dimensional surface, but without the repre-
sentation into Euclidean space that proved so useful to us when we described surfaces
8 in R 3 . Some examples of useful and common manifolds are
• the set {(u, v) E 8 2 x 8 2 ; u..Lv }, which turns out to be the same as 80(3), the
real orthogonal group.
2 More precisely and more generally, the space of affine lines in Rn is an open dense subset of
the Grassmanian of two planes (through 0) in Rn+l. This may be seen by identifying a line
G 2 ,(n+l)•
l c R n with the line (£, 1) c R n+l, which spans a unique 2-plane through 0 in R n+l. The missing
lines correspond to the G2,n at infinity in G2,(n+l}.
135
For some of these examples, it's not immediately obvious how they might be realised
as subsets of some Euclidean space, and even less obvious that any such representation
is an essential feature of the set. Instead, we aim to consider these sets independent of
any particular representation as a subset of some Rm.
A key observation on the path to constructing such an intrinsic definition of "sur-
face" is that for a parameterised surface X: (u 1 ,u2 ) f---7 X(ul,u 2 ) ESC R 3 , many
(but not all!) of the computations may be written solely in terms of the coordinates
(ua) and quantities which may be defined as functions of (ua), such as 9ab, Rabcd, etc.
The definition of manifold below will start with coordinate charts, but (somehow) not
make any reference to any ambient Euclidean space.
For another clue about the general definition of manifold, consider the definition of
a c= Ck, k 2: 1) function on S c R 3 . If f : S --+ R, then one possible definition
of coo would that f be the restriction of some coo function j: R 3 - 1 R. This
obvious definition has the aesthetic drawback that it involves values of j at points
which do not lie on S. Note that this idea of extension away from Sis used
in many of the surface calculations of the for in
of tv,ro
deriva.tive
A.n alternative definition of c= uses the coordinates we can require that the
136
or heuristically, v = v(u) = (Y- 1 o X)(u). Since
the two coordinate representations off : S ---+ R are related by the transition function
u 1---7 v(u) = (Y- 1 o X)(u), and we may relate differentiability off in u to differen-
tiability off in v by using the chain rule of multivariable calculus and the (assumed)
smoothness of the transition function u 1---7 v(u). This imposes a compatibility con-
dition on the coordinate choices, namely that the resulting transition functions have
sufficient regularity (eg coo, Ck etc).
We can now construct the definition of a manifold. Ann-dimensional (topological)
manifold M is a set with a topology which is
(a) Hausdorff (i.e. if x, y EM, x =f. y, then there are open sets U C M, V C M such
that x E U, y E V and U n V = 0);
(b) Separable (i.e. the topology on M has a basis of open sets which is countable);
and
(c) Locally Euclidean (i.e. for any x E M, there is an open set x E U C Manda
homeomorphism ¢>: U---+ cf>(U) ERn).
is Ck as a map between Euclidean spaces. Notice that the coordinate charts we are
using(¢>: U c M---+ Rn) go the opposite direction to the parameterisations X: U c
R 2 ---+ S c R 3 we used when considering surfaces in R 3 ; this turns out to be more
convenient.
137
A Ck-atlas of the manifold M is a family of coordinate charts
(where A is the indexing set of the family), such that the sets U00 a E A, cover M
are all Ck. A maximal Ck atlas is an atlas which contains all Ck compatible coordinate
charts; because
cPa"! = cPa(3 ° cP(3"f'
at least where both functions are defined, and because the composition of ck functions
is again Ck, it follows that in order to construct a maximal Ck atlas, it suffices to find
just one family of Ck-compatible coordinate charts which covers M. Finally, a Ck
manifold is a topological manifold with a Ck maximal atlas.
Henceforth, for simplicity we will consider all manifolds to be c=, unless explicitly
indicated otherwise; the changes needed to consider Ck manifolds are usually very
minor. We also use smooth as a synonym for coo.
It's easy to see that Rn is a smooth manifold, since the identity map I d: Rn --+ Rn
defines a chart which covers all of Rn and hence gives a c= atlas.
The definition of a smooth function f : M --+ R is now clear: we must have the
composition
a smooth function, for every coordinate chart (¢a, Ua) on M. Again, because
where both functions are defined, and because cPa(3 E c= always, it follows that this
definition is consistent, and smoothness need only be verified on single atlas of charts.
Similarly, a map J : Mm --+ Nn between manifolds is c= if the maps
"(:R--+M.
138
The graph of a smooth function f: Rn ---7 RP,
(Subtle point: if f E Ck, k < oo is not smooth, then this construction still shows
graph(!) is a c= manifold, but the inclusion graph(!) <-+ Rn+p will no longer be a
coo map between manifolds).
The space sn c Rn+l is not a graph, but it can be locally represented as a graph
(over some coordinate n-plane, for example). This provides a covering of sn by co-
ordinate charts, and it is easy to check that the resulting transition functions are coo
and hence sn with the topology induced from Rn+l is a smooth manifold. More gener-
ally, any connected subset of Rk which can locally be represented as a graph of a c=
function is a smooth manifold, by the same argument.
This leads to the general question: when can the level set F- 1 (0) = {x ERn, F(x) =
0}, where F : Rn ---7 Rm, rn < n, be given a manifold structure? By the above dis-
cussion, it suffices to show that F- 1 (0) is locally described as a graph, over some
coordinate plane for example. Conditions which ensure this is possible are provided by
Proof: Consider F: Rn ---7 Rn, F(u, v) = (u, f(u, v)). Clearly F is coo and
139
is invertible where Dvf is invertible. In particular, DF( u 0 , v 0 ) is invertible, and the
Inverse Function Theorem gives a c= function G : Rn --+ Rn which is an inverse of
F in some neighbour hood of (u 0 , v0 ),
for (x, y) near (u 0 , f(u 0 , v0 )) = (u 0 , 0). Write G(x, y) = (g1(x, y), g2(x, y)), so
F(G(x, y)) F(g1, g2) = (g1(x, y), f(gl(x, y), g2(x, y)))
(x, y).
rank Df(x) =m
at all points x for which f(x) =Yo· Then f- 1 (yo) = {x ERn; f(x) =Yo} is a smooth
manifold under the topology induced from Rn.
Proof : The rank condition implies that Dvf is invertible, for v in some coordinate
plane, and hence by the Implicit Function Theorem, f- 1 (y 0 ) is always locally repre-
sentable as a graph over some coordinate plane. Ill
Another way of stating this is that f- 1 (y 0 )
is an embedded submanifold. More gen-
erally, f : Mm --+ Nn is an embedding if the induced topology on M from its inclusion
in N coincides with the topology on M, and iff : M --+ f(M) is a diffeomorphism,
i.e. a smooth map with a smooth inverse. If f : M --+ N is locally an embedding
(i.e. for every x E M, there is a neighbourhood x E U C M such that flu is an em-
bedding), then f is an immersion. Note that an immersed submanifold can still have
self-intersections.
Example: It was claimed earlier that the set M = { ( u, v) E S 2 x S 2 , u l_ v} is a
manifold. To show this, note that M = f- 1 (0) where
140
and thus
141
3 Vectors and Tensors
Outline:
tangent vectors and covectors; coordinate vectors; differentials; tangent
and cotangent bundles; vector fields; tensor and exterior algebras; push
forward and pull back maps; Riemannian manifold.
Here¢ o ry: R--+ Rn is a curve in Rn, and (¢ o ry)'(O) is its tangent vector in Rn at
t = 0. Note that the chain rule ensures that the condition (50) is in fact independent
of the choice of chart (¢, U) andhence the equivalence relation "' is well defined.
The directional derivative is defined using representing curves. If v is a tangent
vector at x EM, and f: M--+ R, then
Dv(f)(x) = :/ o ry(t)lt=O
where ry : R --+ M is any curve representing v. Again the chain rule and (50)
combine to show this definition is independent of the choice of representing curve ry for
v. Explicitly, if¢ : U --+ Rn is a chart about p0 , then we set
f 0 ry = f 0 q;- 1 0 ¢ 0 ry = j 0 i
where j = f o q;- 1 : ¢(U) --+Rand i = ¢ o ry : R--+ Rn are both maps between
Euclidean spaces. Then
d - di
-(! o ry)(O) Df · -(0)
dt dt
- dJ
Df(¢- 1 (x)) · dtCJ(O) by (50)
d
dt (! 0 5)(0),
142
where 6 : R----+ M is another representative of v.
If ¢ : U ----+ R n, U C M is a coordinate chart, then we define the coordinate
tangent vectors (of the chart ¢) as the tangent vectors of the coordinate lines li
R----+ M, i = 1, · · ·, n
d -
d/(¢(Po) + te;) It=O
aj
axi (¢(Po)); (51)
that is, the directional derivative in the coordinate tangent vector direction is just the
usual partial derivative, when all calculations are translated to Rn using the chart ¢.
This will be an extremely useful observation. For example, we adopt the notation a;
for the coordinate tangent vector, and then (51) may be rewritten as
If we now agree to forego the j notation, then this can be written more simply as
a
De.f
'
= -a
X'
f = aJ (52)
We now show that the coordinate tangent vectors form a basis for the vector space of
tangent vectors (at the point p 0 ). If vis a tangent vector at p 0 with representing curve
1 : R ----+ M and f : M ----+ R is any function, then
and in the xi coordinates, ry(t) = (¢ o l)(t) = (i' 1 (t), · · ·, ;:yn(t)). By the chain rule and
(52),
a] di'i (o)
ax• dt
.i
i' (0) DeJ. (53)
143
. .i
If we let v' = i' (0), then the curve
n
6(t) qy- 1(¢(Po) + t l::Viei)
i=1
(54)
That is, {8 1 , • • ·, 8n} is a basis for the tangent vectors. It is not hard to verify that
this vector space structure is independent of the choice of coordinate chart.
If we let r; : Rn ----7 R, i = 1, · · ·, n denote the standard Cartesian coordinate
functions and if(¢, U) is a chart on M, then then functions
(57)
Appropriately, the proof of (56) follows from the chain rule. Let x 0 = ¢(p0 ), then
f)
DaxJ ax;(f o ¢-1)(xo)
f)
~((! o 'lj;- 1 ) o ('1/J o ¢- 1 )) (xo)
ux'
f) f)yi
~u o 'I/J- 1 )(y(xo)) · ~ (xo)
uyJ ux'
f)yi
~Da .f.
ux' yJ
144
The vector space of tangent vectors at p E M is denoted TpM, and the space of
all tangent vectors on M is T M = UpEM TpM; T M has a natural manifold structure
based on charts derived from coordinate charts on M. There is a natural projection
T M ----+ M defined by (p, v) r--t p where v E TpM, so that the fibre over p E M is the
tangent space at p. A section of T M is a lifting p r--t vP, or in other words, a vector
field on M. The space of all smooth vector fields is denoted X ( M).
Constructions with a vector space V (such as forming the dual space V*, the tensor
products V Q9 ° Q9 V Q9 V* Q9
• • Q9 V*, and the exterior products 1\ k V) can be easily
° • •
carried over to the tangent bundle T M, by simply applying them to each vector space
TpM,p EM. We now briefly review these vector space constructions.
If V is a vector space then the dual space V* is the vector space of linear functionals
TJ : V --+ R. The dual basis of V*, dual to a basis v1 , · 0
• , Vn of V, is the set of linear
functionals {TJj, j = 1, ... , n} satisfying
,.,)
'I
(v·)
1,
= oz'
1 i = 1, ... ,n.
The dual vector space of TpM is called the space of cotangent vectors or simply
covectors and is denoted by r;
Mo The cotangent bundle T* M is the space of all
r;
covectors, T* M = upEM M, and also is naturally a manifold, of dimension 2n. Just
as tangent vectors are naturally associated with curves in the manifold, there is a
geometric interpretation of cotangent vectors, based on functions:
Given any function f : M ----+ R, there is a linear functional df on tangent vectors
defined for v E TpM by
df(v) = Dvf. (58)
The linearity of df follows from the coordinate representation (53). Denoting the
restriction of df to vectors at the point p by dfP (similarly we may indicate that a
vector vis based at p by writing vp), we have
dfP E r;Mo
A natural basis for r; M can be derived from a coordinate chart. The differentials dxi
of the coordinate functions xi : U ----+ R satisfy
Daj (xi)
a -i
8x)x
145
In other words, { dx 1 , · · · , dx"} is the basis of r; M dual to the coordinate vector basis
{ o1, ···,on} of TpM.
The expansion of df in the basis dxi, i = 1, ... , n is obtained by noting that
of
df(oi) = DaJ = oxi,
and hence
8f i
df = ~dx, (59)
ux'
which should be compared with the chain rule.
The tensor product of two vector spaces V, W, with bases {v1 , · · · , Vn}, {w1 , · · · , Wm}
®:VxW------+V®W
(60)
r
yizj Vi Q9 Vj
146
Tensors in 0s V* are identified with multilinear functionals on V. For example if
a, (3 E V* then a® (3 is the bilinear functional defined by
(61)
which satisfies % = gji (symmetric) and% > 0 (positive definite). Here % are the
coefficients of the metric tensor with respect to the local basis dxi ® dxi, 1 :=; i, j :=; n,
of T~ 0 • 2 ) M, and thus by duality of 8i and dxJ we have%= g(8i, 8j).
A Riemannian manifold is an n-dimensional manifold with a metric g - a symmet-
ric positive definite bilinear form on vectors. Note that this definition of metric does
not make any reference to any relation with the metric induced from an immersion in
any Euclidean space; in fact, a priori it is far from clear whether or not a given metric
can be realised by an immersion into some Rk. That this is indeed true is the content
of a deep theorem of John Nash.
The second example (0, 2)-tensor is the second fundamental form II. Again this is
also symmetric, but not in general positive definite. In local coordinates (ua) on the
surface S we have
has the structure of an associative algebra over R with identity 1 E 0° V, and product
operation ®. Taking the quotient of 0 V by the two-sided ideal (v ® v) generated by
elements of the form v ® v, v E V, gives the exterior algebra of V,
f\Y=@Vj(v®v). (62)
147
in AV by the definition (62), it follows from linearity that
and hence
dim A V
k
= (n)k = k!(nn!_ k)!, k S: n
and Ak V = 0 for k > n. It follows by repeated application of (64) that
(65)
(66)
aEperm(l, ... ,k)
The other identification would insert a factor of 1/2 in the first relation, and 1/k! in
the general relation (66). The convention adopted here has the advantage that the
bases v1 , I= (i 1, ... ,ik), 1 S: i 1 < ... < ik S: nand r"/ are dual, r/(vJ) = 8}. The
interior or cut product ~v: Ak+ 1 V* --t AkV* is defined using the identification with
alternating forms by
(67)
148
As before, we may apply the exterior product construction to the tangent or cotan-
gent bundles, yielding the spaces 1\ T M, and 1\ T* M. The space Ak(M) of sections of
the bundle 1\ kT* M is particularly important- sections of Ak(M) are called k-forms.
Note that 1-form is thus a synonym for cotangent vector field, since 1\ 1 V = V.
A map ¢ : Mm ----+ Nn of manifolds may be used to transport objects from one
manifold to the other. For example, if"( : R -t M is a curve through p E M then
¢ o 'Y : R -t N is a curve through ¢(p) E N. The push forward ¢.vP of the tangent
vector Vp = 'Y'(O) to 'Y at pis then the tangent vector at ¢(p) to¢ o 'Y· Other notations
for push forward are
¢.v = (¢ o 'Y)'(O) = d¢(v) = T¢(v).
Note that although individual vectors may be pushed forward, the push forward of a
vector field is not a vector field - a point q E N may have more than one preimage
under ¢, or none at all. One example of push forward of a vector has already been
given- the tangent vectors X a to a surface may be viewed as the push forward of the
coordinate tangent vectors in U c R 2 ,
and is called the pull back of a. Unlike the push forward of a vector field, the pull back
of a covector field is always a covector field. Note that iff : N -t R then we may also
define the pull back of f by ¢* f = f o ¢ : M -t R, and then we have an interesting
relationship with the differential,
We also note that ¢* respects the algebra structure of 1\M ie. ¢*( a/\/3) = ¢*(a )/\¢*(/3),
which follows from the linearity of ¢*, ¢. and the definition of ¢* applied to 1\kT* N.
In particular, ¢* : Ak(N) ----+ Ak(M).
149
4 Curvature
Outline:
for any vector fields X, Y E X(M) and function f E C 00 (M), and the metric compati-
bility and torsion-free conditions
defines the covariant derivative on any Riemannian manifold, and is easily seen to
have the required properties (72), (73), (74), (75). It also follows that \7 is uniquely
determined by these properties.
There are two interesting special cases of (76). If we restrict attention to coordinate
tangent vector fields 8i, then [8i, 8i] = 0 and (76) gives the Christoffel symbol formula
If on the other hand we restrict attention to vector fields e1 , ... , en forming an or-
thonormal frame
150
(or more generally, g(ei, e1) =canst), then the first three terms in (76) drop out and
we obtain
The covariant derivative may be extended to act on tensors more general than just
vector fields, by requiring the obvious linearity and derivation rules (cf. (72)), together
with a Leibnitz or product rule property. Thus, for example, V acting on (2, D)-tensors
satisfies
v(Y ® Z) = (\7Y) ® Z + Y ® ('i7Z),
and \7 acting on a cotangent field a satisfies
The formula (46) for the covariant derivative of the (0, 2)-tensor ll is a direct general-
isation of (82). More directly exploiting the Leibnitz rule, we have a coordinate-free
description of the covariant derivative of ll:
(\7 xll)(Y, Z) = Dx(IT(Y, Z))- Il(\7 xY, Z)- IT(Y, 'i7xZ). (83)
151
Comparing (74) with (83), we see that the metric compatibility condition is equivalent
to \1 g = 0.
The Riemann curvature tensor is defined as before on vector fields X, Y, Z, W E
X(M) by
R(X, Y, Z, W) = g((\1 x\ly- \ly\1 x- V[x,Yj)Z, W). (84)
That this is in fact a tensor, ie. a multilinear function in each of the four slots, follows
from the definition, the derivation property (72) and the definition of the Lie bracket.
First, it is immediately clear that R(X, Y, Z, JW) = f R(X, Y, Z, W) for any function
j E C 00 (M). Since
[f X, Y] = f[X, Y] - Dy j X (85)
we have
R(X,Y,jZ,W) = g(\lx(f\lyZ+DyjZ)-\ly(J\lxZ+DxfZ), W)
- g(J\l[x,Y]Z, W) - g(D[x,YJ] Z, W)
g(Dxf \lyZ + DxDy f Z + Dy f \1 xZ, W)
- g(Dyf\lxZ +DyDxf Z +Dxf\lyZ, W)
- g(D[x,YJf Z, W) + j R(X, Y, Z, W)
j R(X, Y, Z, W).
Thus the Riemann curvature depends only on the values of the vectors and not on the
first or second derivatives; in other words, Riem = R(·, ·, ·,·)is a (0, 4)-tensor. By the
linearity property and expanding all vectors in a basis frame of coordinate vectors, we
may write R(X, Y, Z, W) in index form
. . k R
R(X,Y,Z, W) X'Y 1 Z W R;jkt,
152
Note that it is not essential that indices refer to a basis frame of coordinate vectors
8 1 , ... , On - already with the connection 1-form Wij we have seen an example where
the indices refer to an orthonormal frame of vectors e1 , ... , en. In general, the frame
used to define the indices in any formula will be either understood, or explicitly men-
tioned. Particularly in Riemannian geometry, it is often much more useful to perform
computations in an orthonormal frame rather than a coordinate frame.
The curvature tensor has a number of symmetries, the first being the obvious
R(X, Y, Z, W) = -R(Y,X, Z, W). Expanding out the identity
shows that
R(X, Y, Z, W) = -R(X, Y, W, Z), (87)
whilst the first Bianchi identity
since the Lie bracket satisfies the easily verified Jacobi identity
[X, [Y, Z]] + [Y, [Z, X]] + [Z, [X, Y]] = 0. (89)
Finally, combining (87), (88) gives a symmetry between the (12) and (34) positions,
as follows:
153
+ R(Y, W, X, Z) + R(W, X, Y, Z)
R(X, Y, Z, W) + R(X, Y, Z, W)
+ R(Y, Z, W, X)+ R(Z, X, W, Y)
- R(Z, W, Y, X)- R(W, Z, X, Y)
2R(Z, W, X, Y).
where Rijk e = Rjkp9pf and (g;j) = (gi/) is the inverse metric. This process of raising
and lowering of indices using the metric and its inverse, corresponds to the canonical
isomorphisms between TpM and r; M defined by the inner product g. Using the ·;;
notation for covariant derivative yields the so-called Ricci identity
(92)
(93)
Taking the covariant derivative of gradf gives the Hessian, or second covariant deriva-
tive matrix of f,
v7jf = !;ij = aiajJ- r:jakf, (95)
and the trace of \7 2 f is the Laplace-Beltrami operator of the metric g
(96)
154
where V§ = Vde[g.
Another method for describing the curvature and covariant derivatives was de-
veloped by E. Cartan, and uses orthonormal frames and forms, rather than vectors.
Because forms behave better than vectors under maps of manifolds, the Cart an calculus
is frequently better suited to actual computations.
For this we need the exterior derivative operator
(97)
(c) d 2 = 0.
These properties uniquely determine d, since they lead to an explicit expression for d
in any local coordinate system. Firstly, (a) determines d acting on A(M) = C 00 (M).
By (b) and (c), for any two functions j, g E coo (M) we have
which determines don A 1 (M) since alll-forms may be written as a linear combination
of terms of the form f dg. Now
(98)
which shows that the conditions (a), (b), (c) determine don Ak(M). To show that
this definition is consistent, we note that any k-form may be written as a = a dxi 1 1\
... 1\ dxik = a 1 dx 1 where the coefficients ai 1 ···ik = a 1 are uniquely determined and
I= (i 1 < · · · < ik) is a multi-index. Then the local definition
Oaf i I
da = ~dx 1\ dx , (99)
ux'
155
satisfies the required properties, and (98) shows that any definition of dis unique (since
the differentials dg1 are defined already), hence there exists a unique operator with the
properties (a), (b), (c). By iterating (99) we see that the property d2 = 0 reflects the
commutativity of second partial derivatives.
By combining (71) and (98) we see that d behaves naturally under pullback by a
map 1> : M -+ N,
cf>*(da) = d(cf>*a), Va E A(N). (100)
By computing in local coordinates, we can easily show that for any a E A 1 (M) and
vector fields X, Y E X(M),
- e;([ej, ek])
- g(e;, 'Veiek- 'Vekei)
W;j(ek)- Wik(eJ)
- (wie 1\ ee) (ej, ek),
(102)
where w;1 = g(e;, \lei) is the connection 1-form. A similar computation yields the
second structure equation,
(103)
where rl;1 is the curvature 2-form and is related to the Riemann curvature tensor by
(104)
where the indices in R;1ke refer to components in the basis e1 , ... , en-
The first Bianchi identity (88) is equivalent to
(105)
156
and is derived (much more easily this time!) by taking the exterior derivative of (102).
Taking the exterior derivative of D;j using (103) gives the Cartan form of the second
Bianchi identity
(106)
which is equivalent to
a form which can be verified directly, but not so simply, from the definition of Riem
and \7 Riem. The index notation versions of (105) and (106) are
157
5 Integration and Stokes' Theorem
Outline:
(109)
where d£~ is then-dimensional Lebesgue measure and w(o1 1\ ... 1\ On) = w(o 1 , ••• , On)
is the multilinear pairing between w and the infinitesimal coordinate parallelepiped
o1 1\ ... 1\ On, normalised by (dx 1 1\ ... 1\ dxn) (o1 1\ ... 1\ On) = 1. To see that this
definition is independent of the choice of coordinate chart, suppose y = (yi) = y(x) is
some other coordinate chart covering the support of w. The alternating property and
(56) imply that
(110)
(111)
oy
det ox > 0, (112)
158
restricting the admissible coordinate charts.
We say that M is orientable ifthere is f.L E An(M) such that f.L # 0 everywhere in M;
an oriented coordinate chart then is required to satisfy f.L(81 /\ .. . l\8n) > 0. Equivalently,
an orientable manifold is one with a covering by coordinate charts satisfying (112).
The problem of integrating a general w E An(M) is reduced to the case of support
within a coordinate neighbourhood by the use of a partition of unity: a family of
functions ,\"' E coo ( M), a E A, satisfying 0 :::; ,\"' (p) :::; 1 and La Aa (p) = 1 for all
p E M, where only a finite number of terms in the sum are non-zero. If M is separable,
then a partition of unity exists subordinate to any locally finite covering of M.
Writing w =La AaW and using the linearity of JM then determines fM w. Note that
this definition of JM w depends strongly on the choice of orientation f.L E An(M) of
M: denoting by -M the manifold with the reverse orientation -f.L, we have f(-M)w =
-fMw.
If ¢ : Mm --+ Nn and if a E Am(N) then the integral of a over M is defined
naturally by pull-back:
r
}q,(M)
a= r ¢*(a).
}M
(113)
(114)
and thus the volume form f.L defined by (114) is independent of the choice of oriented
coordinate system.
If M is Riemannian but not oriented then it is still possible to define an integral for
functions (but not n-forms), by using the measure Jdet(g) d£~ in coordinate patches.
159
Although viable and sometimes useful, this non-oriented integral does not satisfy the
very elegant Stokes' Theorem, a.k.a. the generalised fundamental theorem of calculus.
In order to state Stokes' theorem, we need a definition of manifold with boundary.
This is a pair (M, oM) where M is a smooth manifold under the induced topology from
M c MUoM, and where points in oM admit boundary coordinate charts. A boundary
coordinate chart is (c/J, U), U c M u oM, where ¢J: U-+ R+ = {(x\ ... , xn), xn 2 0}
and as usual we require the boundary charts to be C 00 -compatible in the interior set M,
and such that the charts (c/JiaM, UnoM) define a coo structure on oM. The definitions
of tangent vector, smooth function etc, extend naturally to the case of a manifold with
boundary. Note, however, that C00 (M u oM) # C 00 (M).
If Misoriented by 11 E An(M), then there is a natural induced orientation on oM
defined by ( -1) no1 1\ ... On_ 1 , where (x 1 , ... , xn) is any oriented boundary coordinate
chart with the boundary defined by xn 2 0 (ie. On is inward pointing). The induced
orientation form on oM is then [1. = -Lan/1·
Theorem 4 (Stokes) Suppose (M, oM) is an oriented manifold with boundary, and
wE An- 1 (M). Then
r dw = JaM
}M
1 w. (115)
Proof : By invoking a partition of unity we may distinguish two cases, with w sup-
ported in either an interior coordinate chart or a boundary chart. In the interior case,
by linearity we may assume w = f(x)dx 2 1\ ... 1\ dxn with f E C~(Rn), and then
dw = 8~ 1 f dx 1 1\ ... 1\ dxn. By Fubini's theorem we have
which vanishes by the fundamental theorem of calculus, since f has compact support.
If w is supported in a boundary chart (c/J, U) with boundary defined by xn 2 0, then
we distinguish two subcases,
160
In care (a} we have
.
dw = {- "1..\_"--1
:Jc.J
8.-J 1
.J;; d. ,.,_
-;;--ux A ... A X ,
uxn-
and thus
Since-
In coordinates,
= X~. (117)
so. that
{ divX J.L = ,£- g{X, n)it. (Uft)
fM JaM
161
This reduces to the well-known Gaul3 and Green's formulae in dimensions' 3 and .2
respectively.
A more general form of (119) is sometimes useful, for example when integrating
along a null hypersurface in a Lorentzian manifold, where the metric volume ·form
vanishes and it is not possible to define a unit length normal vectpr. If f.L and jl
are orientation forms on M and 8 M ,respectively, then there is a 1-form v such that
f.L = v 1\ P,; vis called a conormal to a}vf. Then (118) gives
(123)
where Dnf is the outer normal derivative of f. Combining (121) and (123) gives the
Green's identities for two functions ¢, 'ljJ E c=(M),
r
}M
'l/JD.g¢dv - ( g(gradcp,grad'lj;)dv+
}M
1 '1/JDncpdS,
~M
(124)
162
Proof: Applying (124) with¢= '1/J gives
JM g(grad¢, grad¢) dv = 0.
Since g is positive definite, it follows that grad¢ = 0 identically, hence ¢ is constant.
II
In a similar fashion we may prove uniqueness for solutions u E C 2 (M) of the
Dirichlet problem
in M } (126)
on 8M
where f E C 0 (M) and ¢ E C 0 (8M) are given. For, suppose u 1, u2 are two solutions
of (126), then the difference w = u 1 - u 2 satisfies the Dirichlet problem with f = 0
and zero boundary conditions. The previous argument carries over and shows that w
is constant, hence w = 0 identically by the zero boundary conditions.
163