Vdocuments - MX - Classical Mechanics With Mathematica PDF
Vdocuments - MX - Classical Mechanics With Mathematica PDF
Series Editor
Nicola Bellomo
Politecnico di Torino
Torino, Italy
A. Bertozzi L. Preziosi
Department of Mathematics Dipartimento di Matematica
University of California Los Angleles Politecnico di Torino
Los Angeles, CA, USA Torino, Italy
A. Deutsch Y. Sone
Center for Information Services Department of Aeronautical Engineering
and High Performance Computing Kyoto University
Technische Universität Dresden Kyoto, Japan
Dresden, Germany
Classical Mechanics
with Mathematicar
Antonio Romano
Università degli Studi di Napoli
Dipartimento di Matematica e Applicazioni
Napoli, Italia
Please note that additional material for this book can be downloaded from
http://extras.springer.com
Mathematics Subject Classification (2010): 70–01, 70E05, 70E17, 70H03, 70H05, 70H06, 70H09,
70H15, 70H20, 70H25, 70H30, 76–00
Books on classical mechanics can roughly be divided into two classes. Books of
the first class present the subject using differential calculus and analysis, whereas
books of the second class discuss the subject by resorting to the advanced language
of differential geometry. The former books, though more accessible to readers
with a standard background in calculus, do not allow for a deep understanding of
the geometric structures underlying modern mathematical modeling of mechanical
phenomena. On the other hand, books of the second class, requiring a wide
knowledge of differential geometry, are accessible to readers who are already
acquainted with the many aspects of mechanics but wish to understand the most
modern developments in the subject. The present book aims to bridge the gap
between these two classes.
Before discussing the contents of the book in detail, I wish to clarify why I
decided to follow a historical approach to presenting mechanics. Over the long
period (35 years) during which I taught mechanics to students in physics at the
University of Naples Federico II, I saw that students could grasp the fundamentals
of the subject if they could understand the historical process that led to the
comprehension of mechanical phenomena. This process is like a staircase: as you
climb the stairs, each successive step gives you an increasingly broader perspective.
To take in a wider horizon, you must go up to the next step. Then, I believe, students
needed to remain on a given step until they reached the limit of their perspective
and felt the urge to climb to the next step. Clearly, students would achieve the
broadest possible view if they started on the last step. However, this approach would
allow for, at most, a technical comprehension of the subject without a profound
understanding of the roots out of which grew the great tree of classical mechanics.
In the history of science, mechanics was the first to resort to mathematical models
in an attempt to describe the reality around us. Already Leonardo da Vinci (1452–
1519), conscious of this feature of mechanics, stated that mechanics was heaven
for mathematics. Later, Galileo Galilei (1564–1642), having discovered many of
the fundamental principles of mechanical theory, stated that Nature was written
in the language of geometry. The first almost complete picture of the mechanical
world is due to Isaac Newton (1642–1727), heir to Galileo and Johannes Kepler
v
vi Preface
(1571–1630). He expressed his gratitude to these two great minds stating: “If I
have seen farther it is by standing on the shoulders of giants.” In proposing his
fundamental laws of dynamics, Newton understood that the mathematics of his
time was not up to the task of solving the problems posed by his description of
the mechanical world. To make explicit some of the consequences of his laws, he
laid down, together with Gottfried Wilhelm Leibniz (1646–1716), the foundations
of infinitesimal analysis. However, believing that this new approach was too difficult
for his contemporaries to understand, he adopted the language of geometry in
writing his magnum opus, Philosophiae Naturalis Principia Mathematica.
Although Newton’s laws, in principle, describe the physical behavior of any
mechanical system, the description appears in such an implicit form that drawing it
out is a very difficult task. We could say that the history of mechanics from Newton
to our day is the history of the process of making explicit a part of the hidden content
of Newton’s laws. In this process of making things explicit, powerful mathematical
descriptions of mechanical systems have been discovered that can be applied to
many other branches of physics.
The first step in this process was made by Leonhard Euler (1707–1783), who
introduced many notations and definitions still in use today. After formulating the
fundamental balance equations of mechanics, he proposed a model of rigid bodies.
This model introduced an extension to Newton’s model, which was essentially
conceived with respect to material points, in particular for the solar system.
Further, Euler formulated the balance equations of perfect fluids, i.e., of particular
deformable and extended bodies, laying down the foundations of fluidodynamics.
The systematic treatment of a system S of rigid bodies subject to smooth
constraints is due to Joseph-Louis Lagrange (1736–1813), who in his famous
treatise Mécanique analytique reduced the analysis of the dynamical behavior of
such a system to determining a curve of <n whose parametric equations q h .t/
are solutions of the famous Lagrange equations. These second-order differential
equations possess the following advantages:
• They contain the lowest number of parameters needed to determine the position
of S , the so-called Lagrangian coordinates of S .
• They allow one to determine the motion of S without knowing the reactive forces
due to the presence of constraints.
With Lagrange, mechanics resolutely opted for the language of analysis.
William Rowan Hamilton (1805–1865) and Carl Gustav Jacob Jacobi (1804–
1851) proposed a new description of the dynamics of a system S of rigid bodies
with smooth constraints and acted upon by conservative forces. This description
is obtained by the Legendre map, which transforms the solutions of the Lagrange
equations into curves belonging to the phase space and satisfying Hamilton’s equa-
tions. Adopting this formalism, it is possible to determine some general properties
of any mechanical system. Finally, the Hamiltonian formalism is fundamental in
geometric optics, statistical mechanics, and quantum mechanics.
In the early twentieth century, the Hamiltonian formulation of mechanics
underwent a profound change brought about by a new geometric interpretation of its
Preface vii
structures. This new point of view gave rise to symplectic mechanics, which, using
the powerful instruments of differential geometry, shed new light on many aspects
of mechanics and allowed for the solution of some open problems. Today mechanics
is again written in the language of (differential) geometry.
This book surveys all the aforementioned models showing their field of appli-
cation and their limitations. It is divided into two parts. The first part contains an
introduction to linear algebra and differential geometry. In presenting this subject,
I preferred to put forward as evidence the concrete meaning of geometric concepts,
often sacrificing the formal aspects. Chapter 1 contains a brief introduction to vector
spaces and linear mappings. Chapters 2 and 3 deal with tensor and exterior algebras.
Euclidean and symplectic spaces are discussed in Chaps. 4 and 5. Chapters 6–9 con-
tain some fundamental concepts of differential geometry: manifolds, tensor fields,
one-parameter groups of diffeomorphisms, exterior differentiation and integration,
and affine connections. Finally, Chap. 10 presents some elements of dynamical
systems. Chapters 1, 2, 4, and 10 are sufficient for understanding Chaps. 11–16. The
remaining chapters of the first part are necessary for understanding the Lagrangian
and Hamiltonian formalisms (Chaps. 17–22) of the second part.
The second part, starting with Chap. 11, is devoted to mechanics. More precisely,
the kinematics of a material point and a rigid body are presented, respectively,
in Chaps. 11 and 12. Newton’s laws and some general theorems of dynamics are
discussed in Chap. 13, whereas the applications of these laws to a single material
point or to systems of material points are considered in Chap. 14. Further, Chaps. 15
and 16 are devoted to the dynamics of a rigid body and its applications. Lagrange’s
coordinate formalism on a configuration manifold is introduced in Chap. 17, which
also includes a geometric formulation of Maupertuis’ (Pierre-Louis Moreau de
Maupertuis, 1698–1759) principle, an analysis of the principal oscillations of a
mechanical system about a stable equilibrium position, and a transcription of the
Lagrange equations in the velocity space.
Some fundamental aspects of Hamiltonian mechanics are treated in Chap. 18.
First, it is shown that Legendre’s (Adrien-Marie Legendre, 1752–1833) transforma-
tion maps a phase portrait of the solutions of the Lagrange equations into a phase
portrait of the solutions of Hamilton’s equations. When the dynamics is independent
of time, the existence on the phase state of a symplectic structure characterizing the
canonical or symplectic coordinates is demonstrated. Then, the Hamilton–Jacobi
theory about these coordinates is presented in Chap. 19. A description of the
behavior of completely integrable Hamiltonian systems can be found in Chap. 20.
This chapter also contains an introduction to Hamiltonian perturbation theory whose
purpose is to determine the approximate behavior of mechanical systems that are
almost completely integrable systems. In particular, this theory shows that small
perturbations may generate large effects and that chaos lurks in the corner of many
mechanical systems.
An elementary introduction to Maxwell’s (James Clerk Maxwell, 1831–1879)
kinetic theory of perfect gases and to Gibbs’s (Josiah Willard Gibbs, 1839–1903)
formulation of the statistical mechanics of equilibrium can be found in Chap. 21.
In Chap. 22, the balance equations and the Lagrange equations are extended to
viii Preface
ix
x Contents
Part II Mechanics
References .. .. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 499
Index . . . . . . . . .. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 501
Part I
Introduction to Linear Algebra and
Differential Geometry
Chapter 1
Vector Space and Linear Maps
Vectors are usually introduced with a comment that many physical quantities
(e.g., displacements, velocities, accelerations, forces, and torques) are conveniently
described by oriented segments characterized by intensity, direction, and versus.
Then, a vector space E is defined as the set of the oriented segments starting from
a given point O. Algebraic operations are introduced in the set E as the addition
x C y of two vectors x; y 2 E, the multiplication ax of a real number a by a vector
x, the scalar product x y, and the cross or vector product x y.
In this chapter, in agreement with the spirit of modern mathematics, vectors will
be defined as algebraic objects, i.e., as quantities that are not characterized by their
nature but by the algebraic operations that are defined on them. This approach,
which is followed in many fields of mathematics (for instance, in the theory of
numbers), leads us to attribute a more general meaning to a vector.1
Let < be the field of real numbers whose elements we denote by lowercase Latin
letters a, b; : : :, and let E be an arbitrary set whose elements we denote by bold
Latin letters x, y; : : :. In the set E we assign an internal composition operation,
called addition, denoted by C,
.x; y/ 2 E E ! x C y 2 E; (1.1)
and satisfying the following properties:
x C .y C z/ D .x C y/ C z; (1.2)
x C y D y C x; (1.3)
1
The topics contained in the Chapters 1–9 can also be found in [2, 5, 7–9, 11, 13, 14, 16–18, 21, 52].
90 2 E W x C 0 D x; (1.4)
8x 2 E; 9.x/ 2 E W x C .x/ D 0; (1.5)
8x; y; z 2 E.
Besides the addition, we suppose that another composition law is defined by
.a; x/ 2 E ! ax 2 E; (1.6)
which associates to any pair .a; x/, with a 2 < and x 2 E, a new element ax 2 E
in such a way that the following properties are satisfied:
x C 0 D x; 8x 2 E;
so that, in particular, it is 00 C 0 D 00 . On the other hand, from the relation
x C 00 D x; 8x 2 E;
x0 C x D x00 C x D 0:
On the other hand, in view of (1.2), we have that
a1 x1 C C an xn D 0: (1.11)
Proof. If .ei / is a basis, then the system .x; .ei // is linearly dependent so that n C 1
scalars .a; ai / exist such that
ax C ai ei D 0: (1.13)
x D x i ei :
0 D x i ei :
Since the quantities xi are uniquely determined, this relation implies that x 1 D D
x n D 0 and the vector system .ei / is free. To prove that .ei / is a basis, we must verify
that any vector system .u1 ; : : : ; un ; x/ of order n C 1 is linearly dependent. That is
true if .ui / is linearly dependent. By contrast, if .ui / is free, then from (1.12) we
derive the relation
u1 D ai ei ;
in which at least one of the scalars ai does not vanish since u1 ¤ 0. Supposing that
a1 ¤ 0, the preceding relation leads us to
n
1 1 X i
e1 D u1 a ei :
a1 a1 i D2
In view of this result and (1.12), we can state that any vector x 2 En can be
represented as a linear combination of the vectors .u1 ; e2 ; : : : ; en /. Consequently,
we can say that
u2 D b 1 u1 C b 2 e2 C C b n en ;
where at least one of the scalars b 2 ; : : : ; b n does not vanish. If fact, if all these
coefficients were equal to zero, then we would have u2 D b 1 u1 , and the vector
system .u1 ; : : : ; un / would be linearly dependent. If b 2 ¤ 0, we can write that
e2 D c 1 u1 C c 2 u2 C c 3 e3 C C en :
In view of (1.12), this relation implies that any vector x 2 En is a linear combination
of .u1 ; u2 ; e3 ; : : : ; en /. Iterating the procedure, we conclude that (1.12) and the
hypothesis that .ui / is free imply the relation x D x i ui , i.e., x x i ui D 0. Then, the
vector system .u1 ; : : : ; un ; x/ is linearly dependent and the theorem is proved.
1.4 Vector Subspaces 7
Definition 1.5. The unique coefficients expressing any vector x as a linear combi-
nation of the basis .ei / are called the contravariant components of x in the basis
.ei / or relative to the basis .ei /.
Let .ei / and .e0i / be two bases of the n-dimensional vector space En . Then, x
admits the two representations
x D x i ei D x 0i e0i : (1.14)
Further, since .ei / and .e0i / are bases of En , any vector of the basis .e0i / can be
expressed as a linear combination of the vectors belonging to the basis .ei /, i.e.,
j
e0j D Aij ei ; ei D .A1 /i e0j ; (1.15)
j
where ..A1 /i / is the inverse matrix of .Aij /. Introducing (1.15) into (1.14), we
obtain the condition
j
x D x 0j e0j D .A1 /i x i e0j ;
which, given the uniqueness of the representation of a vector with respect to a basis,
implies the equations
j
x 0j D .A1 /i x i (1.16)
relating the contravariant components of the same vector evaluated in two different
bases. It is evident that the inverse formulae of (1.16) are
x i D Aij x 0j : (1.17)
Notice that the contravariant components of the vector x are transformed with
the inverse matrix of the basis change (1.14).
x D x 1 e1 C C x m em ; (1.19)
8 1 Vector Space and Linear Maps
W D U p ˚ Vq (1.20)
the direct sum of the subspaces Up and Vq , that is, the vector set formed by all the
vectors u C v, where u 2 Up and v 2 Vq .
Theorem 1.2. The direct sum W D Up ˚ Vq of the vector subspaces Up and Vq is
a vector subspace of dimension m D p C q.
Proof. Let .u1 ; : : : ; up / and .v1 ; : : : ; vq / be bases of Up and Vq , respectively. By
Definition 1.7, we can state that none of these vectors belongs to both bases.
Moreover, any w 2 W can be written in the form w D u C v, where u and v
are unique vectors belonging to Up and Vq , respectively. Therefore, we have that
p q
X X
i
wD u ui C vi v i ;
i D1 i D1
1.5 Algebras
Definition 1.8. Let E be a vector space. Let us assign an internal mapping .x; y/ 2
E E ! xy 2 E, called a product, that for any choice of the scalars a and b and
any choice of the vectors x, y, and z verifies the following properties:
xe D ex D x: (1.23)
Definition 1.9. We say that the preceding algebra is a Lie algebra if the product
• Is skew-symmetric, i.e.,
xy D yxI (1.24)
• Verifies Jacobi’s identity
x.yz/ C z.xy/ C y.zx/ D 0: (1.25)
1.6 Examples
In this section we present some interesting examples of vector spaces and algebras.
• The vector space of oriented segments. Let E3 be a three-dimensional Euclidean
!
space. We denote by AB an oriented segment starting from the point A 2 E3 and
ending at the point B 2 E3 . In the set of all oriented segments of E3 we introduce
the following equivalence relation R: two oriented segments are equivalent if
they are equipollent. Then we consider the set E D E3 =R and denote by x D
! !
ŒAB the equivalence class of an arbitrary oriented segment AB. Since all these
equivalence classes are in one-to-one correspondence with the oriented segments
!
starting from a fixed point O 2 E, we can introduce the notation x D ŒOA. If
!
y D ŒOB, then we introduce the following operations in E:
!
ax D ŒaOA; (1.26)
! !
x C y D ŒOA C OB; (1.27)
! !
where in (1.27) the oriented segment OAC OB is the diagonal of a parallelogram
! !
with sides OA and OB. It is plain to verify that the foregoing operations equip
E with the structure of a three-dimensional vector space since three unit vectors,
which are orthogonal to each other, form a basis of E.
It is also easy to verify that if we introduce into E the cross product x y of
two vectors, the vector space E becomes a Lie algebra.
• The vector space of polynomials of degree n. Denote by E the set of
polynomials P .x/ of degree r, where 0 r n. Any polynomial can be written
as P .x/ D p0 C p1 x C C pn x n , where some of its coefficients pi may vanish.
If, for any a 2 <, we define
P0 D 1; : : : ; Pn D x n (1.28)
0 P0 .x/ C C n Pn .x/ D 0
and the fundamental theorem of algebra, it follows that all the real numbers
0 ; : : : ; n vanish. On the other hand, any polynomial of degree n can be
expressed as a linear combination of the n C 1 polynomials (1.28). We can
conclude that E is an .n C 1/-dimensional vector space.
• The vector space of continuous functions on the interval Œa; b. Denote by
C 0 Œa; b the set of continuous functions on the closed and bounded interval
Œa; b. With the ordinary operations of multiplication of a function by a number
and the addition of two functions, C 0 Œa; b becomes a vector space. Since the
polynomials of any degree are continuous functions, the vector space of the
polynomial of degree n, where n is an arbitrary integer, is a vector subspace
of C 0 Œa; b. Consequently, E has infinite dimension since it contains free vector
systems of any order.
• The vector space of n n matrices. Let E be the set of the n n matrices
A D .aij /. The operations
equip E with the structure of a vector space. It is a very simple exercise to verify
that the n2 matrices
0 1 0 1
1 0 0 0 0 0
@ A ;::: ; @ A
0 0 0 0 0 0 0 1
AB ¤ BA: (1.31)
Definition 1.10. Let En and Gm be two vector spaces on < with dimensions n and
m, respectively. We say that the map
F W En ! Gm (1.33)
Since the vectors F .x/ belong to Gm , they can be represented as a linear
combination of the vectors of a basis .gi / of Gm , i.e.,
where, for any fixed i , the scalars Fih denote the components of the vector F .ei /
relative to the basis .gh /. Considering the upper index of Fih as a row index and the
lower index as a column index, the numbers Fih become the elements of an m n
matrix F with m rows and n columns. When the bases .ei / in En and .gh / in Gm are
chosen, such a matrix uniquely determines the linear map F since (1.35) and (1.36)
imply that
y D F .x/ D Fih x i gh : (1.37)
For this reason, F D .Fih / is called the matrix of the linear map F relative to the
bases .ei / and .gh /. Adopting the matrix notation, (1.37) becomes
Y D FX; (1.38)
is the kernel of F .
A linear map (1.33) is an epimorphism if Im.F / G; further, F is called a
monomorphism if no two different elements of E are sent by F into the same
element of G. In particular, an epimorphism F W En ! En is called endomorphism
and the set of all endomorphisms is denoted by Lin.En /. The linear map (1.33) is an
isomorphism if it is a monomorphism and F .En / D Gm . In this case, there exists
the inverse linear map F 1 W Gm ! En . Finally, an isomorphism F W En ! En is
said to be an automorphism.
Theorem 1.4. Im.F / and ker.F / are vector subspaces of Gm and En , respectively.
Moreover, F is a monomorphism if and only if ker.F / D f0g.
Proof. If y1 , y2 2 Im.F /, then there are x1 , x2 2 En such that y1 D F .x1 / and
y2 D F .x2 /. Then, 8a; b 2 < the linearity of F implies that
Proof. (a)) (b). The condition i F .ei / D F .i ei / D 0 and (1.41) imply that
i ei D 0. Since .ei / is a basis of En , we have that i D 0, i D 1; : : : ; n, and
the vectors .F .ei // are independent. Moreover, if F is an isomorphism, 8y 2
Gm there exists one and only one x 2 Gm such that y D F .x/ D x i F .ei /.
Consequently, the vectors F .ei / form a basis of Gm and m D n.
(b)) (c). In view of (1.37), the relation i F .ei / D 0 becomes i Fih gh D 0. But
(b) implies that the linear system i Fih D 0 has only the solution i D 0,
i D 1; : : : ; n. Consequently, m D n and the matrix F is not singular.
(c)) (a). Equation (1.38) is the matrix form of the mapping F with respect to the
bases .ei / and .gh /. For any choice of the vector y 2 Gm , (1.38) is a linear system
on n equations in the n unknown x i . Owing to (c), this system admits one and
only one solution so that F is an isomorphism.
We conclude this section with the following remark. Consider a basis change
j
g0h D Ghk gk ; e0i D Ei ej (1.42)
in the two vector spaces En and Gm . Then, the matrix F relative to the bases .eh / and
.gh / and the matrix F0 relative to the bases .e0h / and .g0h / are related by the following
equation:
F0 D G1 FE (1.43)
(Exercise 4).
1.8 Exercises
1. In the vector space En2 of the n n matrices, prove that Jacobi’s identity is
verified.
We start by noting that
Jacobi’s identity is proved applying the foregoing identity to the three terms of
the expression ŒA; ŒB; C C ŒC; ŒA; B C ŒB; ŒC; A .
2. Let
x
e2
e’2
e1
be a basis change in the vector space E2 (Fig. 1.1). Evaluate the components
.x 0i / in the basis .e0i / of the vector that has the components .1; 2/ in the
basis .ei /.
Hint: It is sufficient to verify that the inverse matrix of the basis change is
cos ' sin '
.A1 /ij D
sin ' cos '
y D y h gh D Fih x i gh ;
y D y 0h g0h D Fi0h x i g0h :
Taking into account the basis changes (1.42), this condition becomes
6. Determine the vectors .a; b; c/ of the vector space V generated by the vectors
u D .2; 1; 0/, v D .1; 1; 2/, and w D .0; 3; 4/.
Hint: We must find the real numbers x, y, and z such that
2x C y D a;
3y 6z D b;
0 D 2a 4b 3c:
This chapter contains an introduction to tensor algebra. After defining covectors and
dual bases, the space of covariant two-tensor is introduced. Then, the results derived
for this space are extended to the general space of the .r; s/-tensors.
Theorem 2.1. Let En be a vector space with finite dimension n. Then, E has the
same dimension n. Moreover, if .ei / is a basis of En , then the n covectors such that
i .ej / D ıji 2 (2.4)
1
For the contents of Chaps. 2–9, see [7, 8, 10, 11, 13, 14].
2
Here ıji is the Kronecker symbol
0; i ¤ j;
ıji D
1; i D j:
define a basis of E .
Proof. First, we remark that, owing to (2.4), the linear forms i are defined over the
whole space En since, 8x D x i ei 2 En , we have that
i .x/ D i .x j ej / D x j i .ej / D x i : (2.5)
To show that En is n-dimensional, it is sufficient to verify that any element of En
can be written as a unique linear combination of the covectors . i /. Owing to the
linearity of any ! 2 En , we have that
!.x/ D x i !.ei / D x i !i ; (2.6)
where we have introduced the notation
!i D !.ei /: (2.7)
On the other hand, (2.5) allows us to write (2.6) in the following form:
! D !i i : (2.9)
To prove the theorem, it remains to verify that the quantities !i in (2.9) are uniquely
determined. If another representation ! D !i0 i existed, then we would have 0 D
.!i !i0 / i , i.e.,
.!i !i0 / i .x/ D 0; 8x 2 En :
Remark 2.2. Bearing in mind the foregoing results, we can state that the com-
ponents of a covector relative to a dual basis are transformed according to the
covariance law (i.e., as the bases of the vector space En ). By contrast, the dual
bases are transformed according to the contravariance law (i.e., as the components
of a vector x 2 En ).
Remark 2.3. Since the vector spaces En and En have the same dimension, it is
possible to build an isomorphism between them. In fact, choosing a basis .ei / of
En and a basis . i / of En , anPisomorphism between En and En is obtained by
associating the covector ! D i x i i with the vector x D x i ei 2 En . However,
owing to the different transformation character of the components of a vector and
a covector, the preceding isomorphism depends on the choice of the bases .ei /
and . i /. Later we will show that, when En is a Euclidean vector space, it is
possible to define an isomorphism between En and En that does not depend on
the aforementioned choice, i.e., it is intrinsic.
2.2 Biduality
We have already proved that En is itself a vector space. Consequently, it is possible
to consider its dual vector space En containing all the linear maps G W En ! <.
Moreover, to any basis . i / 2 En we can associate the dual basis .fi / 2 En
defined by the conditions [see (2.4)]
j
fi . j / D ıi ; (2.13)
F D F i fi : (2.14)
At this point we can consider the idea of generating “ad libitum” vector spaces by
the duality definition. But this cannot happen since En and En are isomorphic, i.e.,
they can be identified. In fact, let us consider the linear map
x 2 En ! Fx 2 En
such that
Fx .!/ D !.x/; 8! 2 En : (2.15)
20 2 Tensor Algebra
F i D xi ; i D 1; : : : ; n: (2.17)
The correspondence (2.17) refers to the bases .ei /, . i /, and .fi /. To prove that
the isomorphism (2.17) does not depend on the basis, it is sufficient to note that the
basis change (1.15) in En determines the basis change (2.11) in En . But .fi / is the
dual basis of . i /, so that it is transformed according to the formulae
j
f0i D Ai fj :
T W En En ! <
Theorem 2.2. Let .ei / be a basis of En and let . i / be the dual basis in En . Then
. i ˝ j / is a basis of T2 .En /, which is a n2 -dimensional vector space.
Proof. Since T 2 T2 .En / is bilinear, we have that
On the other hand, in view of (2.19) and (2.5), it also holds that
i ˝ j .x; y/ D x i y j ; (2.22)
Since this identity holds for any x; y 2 En , we conclude that the set . i ˝ j /
generates the whole vector space T2 .En /:
T D Tij i ˝ j : (2.23)
Then, the covariant 2-tensors . i ˝ j / form a basis of T2 .En / if they are linearly
independent. To prove this statement, it is sufficient to note that from the arbitrary
linear combination
aij i ˝ j D 0
we obtain
j
aij i ˝ j .eh ; ek / D aij ıhi ık D 0;
and taking into account (2.24) we obtain the following transformation formulae of
the components of a covariant 2-tensor T 2 T2 .En / under a basis change (2.24):
Remark 2.4. If T D .Tij / is a matrix whose elements are the components of T, and
A D .Aij / is a matrix of the basis change (1.15), then the matrix form of (2.26) is
T0 D AT TA: (2.27)
It is evident that the set T 2 .En / of all contravariant 2-tensors becomes a vector
space by the introduction of the standard operations of addition of two contravariant
2-tensors and the product of a 2-tensor by a real number.
Definition 2.5. The tensor product of two vectors x; y 2 En is the contravariant
2-tensor
x ˝ y.!; / D !.x/ .y/; 8!; 2 En : (2.29)
Theorem 2.3. Let .ei / be a basis of the vector space En . Then .ei ˝ ej / is a basis
of T 2 .En /, which is an n2 -dimensional vector space.
Proof. 8!; 2 En ,
aij ei ˝ ej D 0
so that, taking into account (2.34), we derive the following transformation formulae
for the components of T:
j
T 0ij D .A1 /ih .A1 /k T hk : (2.36)
Exercise 2.1. Verify that the matrix form of (2.36) is [see (2.27)]
Once again, the set T11 .En / of all mixed 2-tensors becomes a vector space by the
introduction of the standard operations of addition of two mixed 2-tensors and the
product of a mixed 2-tensor by a real number.
Definition 2.7. The tensor product of a vector x and a covector 2 En is the
mixed 2-tensor
Theorem 2.4. Let .ei / be a basis of the vector space En and let . i / be the dual
basis in En . Then .ei ˝ j / is a basis of T11 .En /, which is an n2 -dimensional vector
space.
In view of this theorem we can write
T D Tji ei ˝ j ; (2.40)
Exercise 2.2. Verify that the matrix form of (2.43) is [see (2.27)]
It is quite clear how to transform the set Tsr .En / of all .r; s/-tensors (2.45) in a real
vector space.
p
Definition 2.9. Let T 2 Tsr .En / be an .r; s/-tensor and let L 2 Tq .En / be a .p; q/-
tensor. Then the tensor product of these two tensors is the .r C p; s C q/-tensor
rCp
T ˝ L 2 TsCq .En / given by
By imposing that the associative property holds, the tensor product can be
extended to any number of factors. Then we introduce the following definition of
the tensor product of vectors and covectors:
x1 ˝ ˝ xr ˝ !1 ˝ ˝ !s . 1 ; : : : ; r ; u1 ; : : : ; us / D
x1 . 1 / xr . r /!1 .u1 / !s .us /: (2.47)
With the procedure revealed in the previous section it is possible to prove the
following theorem.
Theorem 2.5. The dimension of the vector space Tsr .En / of the .r C s/-tensors on
En is nrCs , and
e i1 ˝ ˝ e ir ˝ j 1 ˝ ˝ j s (2.48)
is a basis of it.
In (2.48), .ei / is a basis of En and . j / the dual basis in En . Instead of
Eqs. (2.40)–(2.43), we now obtain
T D Tji11j
ir
e ˝ ˝ e ir ˝ j 1 ˝ ˝ j s ;
s i1
(2.49)
j ;:::;j
Ti 11;:::;i s D T. j1 ; : : : ; js ; ei1 ; : : : ; eir /; (2.50)
n
2.6 Contraction and Contracted Multiplication 25
Tj0i11j
ir
s
D .A1 /ih11 .A1 /ihrr Akj11 Akjss Tkh11k
hr
s
: (2.52)
In the previous sections we defined the addition of tensors belonging to the same
tensor space. On the other hand, the tensor product of two tensors that might
belong to different tensor spaces defines a new tensor that belongs to another tensor
space. In conclusion, the tensor product is not an internal operation. However,
it is possible to introduce a suitable set that, equipped with the aforementioned
operations, becomes an algebra.
Let us consider the infinite direct sum (Sect. 1.4)
In the tensor algebra TEn we can introduce two other internal operations: the
contraction and the contracted product.
Theorem 2.6. Denote by .ei / a basis of the vector space En and by . i / the dual
basis in En . Then, for any pair of integers 1 h; k n, the linear map
Ch;k W T D Tji11kj
hir
ei ˝ ˝ eir ˝ j1 ˝ ˝ js 2 Tsr .En /
s 1
hj
that, under a basis change, the quantities Th are transformed as vector components.
Since in a basis change we have that
0ij p
Th D .A1 /il .A1 /jm Ah Tplm ;
ij
For instance, if T D Th ei ˝ ej ˝ h and L D Llm el ˝ m , then Ci;m .T ˝ L/ D
T hj Llh ej ˝ el .
Theorem 2.7. The nrCs quantities
Tji11j
ir
s
(2.56)
where y i are the components of the vector y D T.x/ in the basis .ei /. Owing to the
previous theorem, the quantities .Tji / are the components of a .1; 1/-tensor relative
to the basis .ei ˝ j /. It is easy to verify that any .1; 1/-tensor T determines a linear
map T W En ! En .
2.7 Exercises 27
2.7 Exercises
1. Determine all the linear maps that can be associated with an .r; s/-tensor, where
r C s 3.
2. Let .ei / be a basis of a vector space En , and let . i / be the dual basis. Verify
that a tensor that has the components .ıji / in the basis .ei ˝ j / has the same
components in any other basis.
3. Let .ei / be a basis of a two-dimensional vector space E2 , and denote by . i / the
dual basis. Determine the value of the covariant 2-tensor 1 ˝ 2 2 ˝ 1 when
it is applied to the pairs of vectors .e1 ; e2 /, .e2 ; e1 /, and .x; y/ and recognize
geometric meaning of each result.
4. The components of a .1; 1/-tensor T of the vector space E3 relative to a basis
.ei ˝ j / are given by the matrix
0 1
121
@2 1 2A:
121
e01 D e1 C e3 ;
e02 D 2e1 e2 ;
e03 D e1 C e2 C e3 :
7. For which .1; 1/-tensor .Tji / of T11 .E3 / does the linear map F defined by the
matrix .Tji / satisfy the condition F .x/ D ax, 8x 2 E3 and 8a 2 <?
8. Given the .1; 1/-tensor Tji ei ˝ j of T2 .E2 /, verify that T11 C T22 and det.Tji /
are invariant with respect to a change of basis.
28 2 Tensor Algebra
9. Prove that if the components of a .0; 2/-tensor T satisfy either of the conditions
in a given basis, then they satisfy the same conditions in any other basis.
10. Given .0; 2/-tensors that in the basis .ei /, i D 1; 2; 3, have the components
0 1
1 0 1
T1 D @ 0 1 2 A ;
1 2 1
0 1
0 1 1
T2 D @ 1 0 2 A ;
1 2 0
determine the covector !u depending on u such that
!u .v/ D T2 .u; v/
This equality is identically satisfied 8x; y 2 En if and only if the components Tij of
T verify the conditions
Exercise 3.1. Prove that if conditions (3.2) hold in a basis, then they hold in any
basis.
If we denote by T the matrix of the components of T, then conditions (3.2)
assume the following matrix form:
T D TT : (3.3)
Definition 3.2. The exterior product of the covectors !; 2 En is the .0; 2/-
tensor such that
! ^ D ! ˝ ˝ !: (3.4)
Theorem 3.1. The exterior product (3.4) has the following properties:
• ! ^ is a skew-symmetric tensor;
• ! ^ D ^ !;
• If D a!, a 2 <, then ! ^ D 0.
Proof. From (3.4), 8x; y 2 En , we have that
and the skew symmetry of ! ^ is proved. Starting from (3.5) it is simple to verify
the other two properties.
The set ƒ2 .En / of the skew-symmetric .0; 2/-tensors is a subspace of the vector
space T2 .En /. The dimension and the bases of ƒ2 .En / are determined by the
following theorem.
Theorem 3.2. Let .ei / be a basis of En and denote by . i / its dual basis in En .
n
The dimension of ƒ2 .En / is , and the vectors
2
. i ^ j /; i < j;
where
T.ij / D Tij ; i < j: (3.7)
Relation (3.6) shows that the set of the .0; 2/-tensors . i ^ j / 2 ƒ2 .En / generates
the whole subspace ƒ2 .En /. Consequently, to verify that they form a basis of
3.1 Skew-Symmetric (0, 2)-Tensors 31
ƒ2 .En /, it is sufficient to verify their linear independence. Now, from any linear
combination
a.ij / i ^ j D 0; i < j;
which, in view of (3.5) and (2.4), implies a.ij / D 0, and the proof is complete.
Definition 3.3. The quantities T.ij / , i < j , are called strict components of T
relative to the basis . i ^ j /.
To determine the transformation formulae of the strict components, we recall that
they are components of a .0; 2/-tensor. Consequently, in view of (2.26), we can write
0 h k
T.ij / D A.i Aj / Thk
X X
D Ah.i Akj / Thk C Ah.i Akj / Thk
h<k h>k
X
D .Ah.i Akj / Ak.i Ahj / /T.hk/ ;
h<k
Example 3.1. Let E2 be a two-dimensional vector space and denote by .e1 ; e2 / and
. 1 ; 2 / a basis of E2 and the dual basis of E2 , respectively. The dimension of
the vector space T2 .E2 / is 4, whereas the subspace ƒ2 .E2 / of the skew-symmetric
.0; 2/-tensors is one-dimensional. Consequently, any .0; 2/-tensor of ƒ2 .E2 / can be
written as follows:
T D T12 1 ^ 2 :
and we can state that 1 ^ 2 .x; y/ measures the area of the parallelogram
determined by the vectors x and y. In particular, from (3.13) we obtain
1 ^ 2 .e1 ; e2 / D 1;
and the parallelogram determined by the vectors e1 and e2 has a unit area. We note
that the area of the parallelogram formed by the vectors .e01 ; e02 / of another basis of
E2 , where
where . i / is the dual basis in E3 of .ei /. Prove that the skew-symmetric
.0; 2/-tensor
T D 1 ^ 2 C 1 ^ 3 C 2 ^ 3
3.2 Skew-Symmetric .0; r/-Tensors 33
associates to any pair of vectors x, y of E3 the sum of the areas of the projections
of the parallelogram formed by the vectors x, y onto the subspaces generated by
.e1 ; e2 /, .e1 ; e3 /, .e2 ; e3 /, respectively.
In the next section, the content of this section will be extended to .0; r/-tensors.
Definition 3.4. Let Sr D f1; 2; : : : ; rg be the set of the first r integer numbers. A
permutation of Sr is any one-to-one map
W Sr ! Sr :
We denote by f.1/; .2/; : : : ; .r/g the set formed by the same numbers of Sr
placed in a different order. It is well known that the set …r of all the permutations
of Sr contains rŠ one-to-one maps. This set can be equipped with the structure of a
group by the usual composition of maps
; 2 …r ! ı 2 …r :
The identity of this group is a map that does not modify the position of the numbers
of Sr . Finally, the opposite of is the inverse map 1 . Let i; j , i < j , be two
numbers of Sr . We say that the permutation contains an inversion with respect to
Sr if
.i / > .j /:
A permutation is said to be even or odd according to whether the total number
of inversions contained in S is even or odd. In the sequel, we denote by m./ the
total inversions of .
Definition 3.5. A tensor T 2 Tr .En /, r > 2, is skew-symmetric or alternating if
8x1 ; : : : ; xr 2 En and 8 2 …r .
The preceding definition implies that the value of T vanishes every time T is
evaluated on a set of vectors containing two equal vectors. In fact, it is sufficient
to consider a permutation that exchanges the position of these two vectors without
modifying the position of the others and then to apply (3.14). From this remark it
easily follows that the value of T vanishes if the vectors fx1 ; : : : ; xr g are linearly
dependent.
If the r vectors fx1 ; : : : ; xr g belong to a basis .ei /, then, in view of (2.50), we
express the skew symmetry of T in terms of its components:
Ti1 ir D .1/m./ T.i1 /.ir / : (3.15)
34 3 Skew-Symmetric Tensors and Exterior Algebra
In particular, this condition implies that all the components of T in which two
indices have the same value vanish. It is quite obvious that the set of all the skew-
symmetric tensors form a subspace ƒr .En / of Tr .En /.
Remark 3.1. Since r vectors, when r > n, cannot be linearly independent, we can
state that
ƒr .En / D f0g; n < r:
i1 ^ ^ ir ; (3.18)
where i1 < < ir is an arbitrary r-tuple of integer numbers chosen in the set of
indices f1; : : : ; ng, form a basis of ƒr .En /.
Proof. First, when r n, any T 2 ƒr .En / can be written as follows:
T D Tj1 jr j1 ˝ ˝ jr :
3.2 Skew-Symmetric .0; r/-Tensors 35
The summation on the right-hand side contains only terms in which all the indices
are different from each other since T is skew-symmetric. This circumstance makes
it possible to group all the terms as follows: first, we consider the terms obtained by
extracting arbitrarilyr different
indices i1 < < ir from the set f1; : : : ; ng. We
n
recall that there are different possible choices of these indices. Then, for each
r
choice, which is characterized by the indices i1 < < ir , we consider all the terms
obtained by permutating in all the possible ways the indices i1 < < ir . In view
of (3.15) and (3.16), we can write
X X
TD T.i1 /.ir / .i1 / ˝ ˝ .ir /
i1 <<ir 2…r
X X
D Ti1 ir .1/m./ .i1 / ˝ ˝ .ir /
i1 <<ir 2…r
X
D Ti1 ir i1 ^ ^ ir ;
i1 <<ir
that is,
T D T.i1 ir / i1 ^ ^ ir : (3.19)
where the indices j1 < < jr are chosen in the set 1; : : : ; n in all possible ways.
Finally, we have that
ˇ ˇ
ˇ Aj1 Aj1 ˇ
ˇ i1 ir ˇ
ˇ ˇ
T.i0 1 ir / D ˇ ˇ T.j1 jr / : (3.20)
ˇ jr jr ˇ
ˇ Ai Ai ˇ
1 r
We can easily verify that the transformation formula of the bases of ƒr .En / [see
(3.11)] is
ˇ ˇ
ˇ .A1 /i1 .A1 /i1 ˇ
ˇ j1 jr ˇ
ˇ ˇ
0i1 ^ ^ 0ir D ˇ ˇ j1 ^ ^ jr : (3.21)
ˇ 1 ir ir ˇ
ˇ .A /j1 Ajr ˇ
36 3 Skew-Symmetric Tensors and Exterior Algebra
where A D det.Aij /.
and
L D L.j1 js / j1 ^ ^ jr 2 ƒs .En /;
(3.24)
where is any permutation of the indices h1 < < hrCs , in which .h1 / <
< .hr /, .hrC1 / < < .hrCs /, and m./ is the number of inversions of
.h1 /; : : : ; .hrCs /.
Example 3.2. The external product of the two skew-symmetric tensors
Exercise 3.3. Prove that the exterior product of T 2 ƒ2 .E5 / and L 2 ƒ2 .E5 / has
the component
T12 L45 T14 L25 C T15 L24 C T24 L15 T25 L14 C T45 L12
It is not difficult to verify that T^L is skew-symmetric and independent of the basis.
It can also be proved that
T ^ L D .1/rs L ^ T: (3.25)
We can define the exterior algebra as we did for the tensor algebra. First, we
introduce the notations
ƒ0 .En / D <; ƒ1 .En / D En :
whose elements are formed by finite sequences .a; !; T; : : :/, where a 2 <, ! 2
En , T 2 ƒ2 .En /, etc., and we recall that ƒr .En / D f0g for r > n. This set,
equipped with multiplication by a scalar, addition, and an exterior product, is the
exterior algebra over En .
It is evident that what we have proved for the skew-symmetric tensors T0s can be
repeated for the skew-symmetric tensors Tr0 . In this way, we can define the exterior
algebra ƒr .En /.
where A D det.Aij / ¤ 0.
Theorem 3.4. R is an equivalence relation partitioning B into two equivalence
classes.
j j
Proof. From the evident conditions ei D ıi ej , det.ıi / D 1 > 0, it follows that any
basis is in the relation R with itself and R is reflexive. If (3.28) holds, then we have
j
ei D .A1 /i e0j ;
j
where det..A1 /i / D 1=A > 0. Therefore, .ei / is in the relation R with .e0i / and
R is symmetric. It remains to prove that R is transitive. To this end, we consider a
third basis .e00i / such that
j
e00h D Bh e0j ; (3.29)
38 3 Skew-Symmetric Tensors and Exterior Algebra
j
where det.Bh / > 0. Then we also have
j j
e00h D Bh e0j D Bh Akj ek Chk ek : (3.30)
j
Since det.Chk / D det.Bh / det.Akj / > 0, R is an equivalence relation. To verify
that R partitions B into two equivalence classes, we first note that these equivalence
classes are at least two since the two bases .ei / and .e0i /, such that
0 1
1 0 0
B 0 1 0 C
e0i D Cih eh ; .Cih / D B
@
C ; det.C h / D 1;
A i
0 0 1
j j
e00i D Ai e0j D Ai Cjh eh ;
1 ^ ^ n .e1 ; : : : ; en / D 1: (3.32)
Further, if .e0i / is another basis of En , related to .ei / by (3.27), then in view of (3.31),
we have that
1 ^ ^ n .e01 ; : : : ; e0n / D det A: (3.33)
3.5 Exercises
˛ D 1 2;
ˇ D 1 2 C 3;
D 3;
the 2-form
D 1 ^ 3 C 2 ^ 3;
˛ ^ ˇ; ˛ ^ ˇ ^ ; ˛ ^ ; ˛ ^ :
2. Evaluate the components of the forms of the preceding exercise under the basis
change
01 D 1 2 2 ;
02 D 1 C 3 ;
03 D 3 :
3. Let .ei / be a basis of the vector space E3 and denote by . i / the dual basis. Given
the skew-symmetric tensors
01 D 1 C 2 2 ;
02 D 2 C 3 ;
03 D 1 2 3 :
Chapter 4
Euclidean and Symplectic Vector Spaces
Tij D Tj i ; (4.2)
where a, b, and c are given real numbers. Now the trinomial in (4.4) vanishes
identically for any if and only if a D b D c D 0. Since b D T.x ; y / ¤ 0, we
have provedp the existence of at least one vector x1 for which T.x1 ; x1 / ¤ 0. Putting
u1 D x1 = j˛j, where ˛ D T.x1 ; x1 /, it follows that T.u1 ; u1 / D ˙1, where the
sign is chosen according to the sign of ˛. We denote by U 1 the one-dimensional
subspace of En generated by u1 and by V 1 the following subspace of En :
V 1 D fv 2 En W T.u1 ; v/ D 0g:
V 1 D fv 2 En ; T.v; u/ D 0; 8u 2 U 1 g
T.v0 v; ui / D T.u0 u; ui / D 0:
Since u0 u 2 U 1 , there are two real numbers a and b such that u0 u D au1 Cbu2 .
Consequently, from the foregoing result we obtain the condition
aT.u1 ; ui / C bT.u2 ; ui / D 0;
ei D u2i 1 ; i D 1; : : : ; r;
erCi D u2i ; i D 1; : : : ; r;
ek D uk ; k D 2r C 1; : : : ; n;
E0 D fx 2 En ; T.x; x/ D 0; 8y 2 En g:
4.2 Degenerate and Nondegenerate .0; 2/-Tensors 45
Tij x i D 0; j D 1; : : : ; n; (4.9)
En D E C ˚ E ˚ E0 : (4.10)
we have that
r
X rCs
X
T.x; x/ D .x i /2 > 0; T.y; y/ D .y i /2 < 0: (4.11)
i D1 i DrC1
r 0 0 Cs 0
rX
X
T.x; x/ D .x 0i /2 > 0; T.y; y/ D .y 0i /2 < 0:
i D1 i Dr 0 C1
We can easily prove that the intersection E C \.E 0 ˚E0 / D f0g. In fact, if x 2 E C
and x ¤ 0, then (4.11)1 is satisfied; further, any y 2 E C , y ¤ 0, verifies (4.11)2.
Consequently, the subspace E C ˚ E 0 ˚ E0 has a dimension r C n r 0 n, so
that r r 0 . Applying this reasoning to E 0C ˚ E ˚ E0 , we obtain r D r 0 .
Definition 4.3. The integer number r is called an index of T, whereas the difference
r s is the signature of T.
Definition 4.4. Let T belong to T2 .En /. The map q W En ! < such that
q.x/ D T.x; x/
is said to be the quadratic form associated with T. Since q.x/ vanishes identically
when T is skew-symmetric, throughout the following sections we refer only to
symmetric tensors.
Definition 4.5. A symmetric tensor T 2 T2 .En / is said to be positive semidefinite
if 8x 2 En the following results are obtained:
It is well known that quadratic forms are positive semidefinite (positive definite) if
and only if all the principal minors T i , i D 1; : : : ; n, of the representative matrix
T D .Tij / of T with respect to the basis .ei / satisfy the following conditions:
Theorem 4.4. Let T 2 T2 .En / be a symmetric and positive semidefinite .0; 2/-
tensor. Then, 8x; y 2 En Schwarz’s inequality
p p
jT.x; y/j T.x; x/ T.y; y (4.15)
4.3 Pseudo-Euclidean Vector Spaces 47
.x; y/ 2 En En ! x y 2 <;
where
x y D g.x; y/: (4.19)
x y D y x;
x .y C z/ D x y C x z;
48 4 Euclidean and Symplectic Vector Spaces
a.x y/ D .ax/ y;
x y D 0; 8y 2 En ) x D 0; (4.20)
8x; y; z 2 En . In fact, the first property follows from the symmetry of g; since g
is bilinear, the second and third properties hold; finally, the fourth property follows
from the positive-definite character of g.
A vector x is said to be a unit vector if
x x D ˙1: (4.21)
Two vectors are orthogonal if
x y D 0: (4.22)
In a basis .ei / of En the scalar product determines a symmetric nonsingular
matrix G D .gij / whose coefficients are
gij D ei ej : (4.23)
x y D gij x i y j (4.24)
in any basis .ei / of En . Theorem 4.1 guarantees the existence of canonical bases .ei /
in which the representative matrix G of the scalar product has the diagonal form
Ir O
GD ; (4.25)
O Is
r
X rCs
X
xyD xi yi xi y i : (4.26)
i D1 i DrC1
Definition 4.7. A Euclidean vector space is a pair .En ; g/, where En is a vector
space and g is a symmetric positive-definite .0; 2/-tensor.
In a Euclidean vector space the scalar product verifies the further condition
x x 0; 8x 2 En ; (4.27)
4.4 Euclidean Vector Spaces 49
where the equality to zero holds if and only if x D 0. Property (4.27) makes it
possible to define the length or modulus of a vector as follows:
p
jxj D x x: (4.28)
Also, in a Euclidean space the Schwarz and Minkowski inequalities (4.15) and
(4.16) hold. With the new notations, they assume the following form:
jx yj jxjjyj; (4.29)
jx C yj jxj C jyj: (4.30)
we can define the angle ' between two vectors x and y by the position
xy
cos ' D : (4.32)
jxjjyj
gij x i y j
cos ' D p p : (4.34)
gij x i x j gij y i y j
Definition 4.8. Let .ei / be a basis of the Euclidean vector space En . We call
covariant components of the vector x relative to the basis .ei / the quantities
xi D x ei D gij x j : (4.35)
When a basis .ei / is given, there is a one-to-one map between vectors and their
contravariant components. The same property holds for the covariant components.
In fact, it is sufficient to note that det.gij / ¤ 0 and refer to linear relations (4.35).
All the preceding formulae assume their simplest form relative to an orthonormal
basis .ei /. In fact, since in such a basis
ei ej D ıij ; (4.36)
50 4 Euclidean and Symplectic Vector Spaces
we obtain also
n
X
xy D xi y j ; (4.37)
i D1
v
u n
uX
jxj D t .x i /2 ; (4.38)
i D1
xi D xi : (4.39)
u1 D e1 : (4.40)
u2 D a21 u1 C e2 ; (4.41)
u1 u2 D 0: (4.42)
a21 u1 u1 C u1 e2 D 0:
Since u1 ¤ 0, the preceding condition allows us to determine a21 and the vector u2
is not zero owing to the linear independence of e1 and e2 . Then, we search for a
vector u3 such that
u3 D a31 u1 C a22 u2 C u3
u1 u3 D 0;
u2 u3 D 0:
a31 u1 u1 C u1 e3 D 0;
a32 u2 u2 C u2 33 D 0;
which determines the unknowns a31 and a32 since the vectors u1 and u2 do not vanish.
Finally, the system .u1 ; u2 ; u3 / is orthogonal. After n steps, an orthogonal system
.u1 ; : : : ; un / is determined. Dividing each vector of this system by its length, we
obtain an orthonormal system.
4.5 Eigenvectors of Euclidean 2-Tensors 51
V? D fx 2 En ; x y D 0; 8y 2 V g ; (4.43)
containing all the vectors that are orthogonal to any vector of V , is said to be the
orthogonal complement of V .
Theorem 4.5. If En is a Euclidean vector space and V any vector subspace of En ,
then V? is a vector subspace of En ; in addition, the following results are obtained:
E n D V ˚ V? : (4.44)
.a1 x1 C a2 x2 / y D a1 x1 y C a2 x2 y D 0;
x0 D .x e1 /e1 C .x em /em ;
x00 D x x0 :
Since the vector x00 2 V? , the decomposition x D x0 C x00 is such that x0 2 V and
x00 2 V? . To prove that this decomposition is unique, we suppose that there is an-
other decomposition y0 Cy00 . Then, it must be that .y0 x0 /C.y00 x00 / D 0, where the
vector inside the first parentheses belongs to V , whereas the vector inside the other
parentheses belongs to V? . Finally, in a Euclidean space the sum of two orthogonal
vectors vanishes if and only if each of them vanishes and the theorem is proved.
The property that T is linear implies that the set V of all the eigenvectors
belonging to the same eigenvalue form a vector subspace of En . In fact, if a; b 2 <
and x, y 2 V , then we have that
so that ax C by 2 V .
Definition 4.11. The dimension of the vector subspace associated with the eigen-
value is called a geometric multiplicity of the eigenvalue ; in particular, an
eigenvalue with multiplicity 1 is also said to be simple. The set of all the eigenvalues
of T is called the spectrum of T. Finally, the eigenvalue problem relative to T
consists in determining the whole spectrum of T.
To find the eigenvalues of T, we start out by noting that in a basis .ei / of En , (4.45)
is written as
Tji ıji x j D 0; i D 1; : : : ; n: (4.46)
and then
Pn0 ./ D Pn ./: (4.48)
Since the polynomial Pn ./ does not depend on the basis .ei /, it is called a
characteristic polynomial of T. Denoting by Ii the coefficient of the power ni
and noting that I0 D .1/n , we can write Pn ./ as follows:
Ii D .1/i Ji ; i D 1; : : : ; n; (4.50)
4.6 Orthogonal Transformations 53
where Ji is the sum of the all determinants of the principal minors of order i of
matrix T. In particular, I1 D T11 C C Tnn and In D det T.
In conclusion, we have proved what follows.
Theorem 4.6. The eigenvalues of a .1; 1/-tensor T are the real roots of the
characteristic polynomial
If the basis .ei / is orthonormal, then the n vectors Q.ei / are independent and
consequently form a basis of En . Therefore, Q is an isomorphism and the matrix
54 4 Euclidean and Symplectic Vector Spaces
Q, representative of Q in any basis .ei /, is not singular. The condition (4.53) can be
written in one of the following forms:
QT GQ D G: (4.55)
QT Q D I , QT D Q1 : (4.56)
.Qji ıji /x j D 0;
3 C I1 2 C I2 C 1 D 0:
Aij
.Q1 /ij D ;
det Q
This result shows that I2 D I1 and the characteristic equation becomes
3 C I1 2 I1 C 1 D 0;
In view of (4.60), the preceding equations imply Q11 D Q22 D Q33 D 1. Taking into
account this result and the orthogonality conditions
These relations imply the existence of an angle ' 2 .0:2/ such that
0 1
cos ' sin ' 0
Q D @ sin ' cos ' 0 A :
0 0 1
Definition 4.14. A symplectic vector space is a pair .E2n ; /, where E2n is a vector
space with even dimension and a skew-symmetric nondegenerate .0; 2/-tensor.
Remark 4.2. It is fundamental to require that the dimension of the vector space be
odd. In fact, owing to Theorem 4.2, in a vector space with even dimension, any
skew-symmetric .0; 2/-tensor is always degenerate.
Definition 4.15. In the symplectic vector space .E2n ; /, the antiscalar product is
the map .x; y/ 2 E2n E2n ! Œx; y 2 < such that
In fact, the first property follows from the skew symmetry of . The second and
third properties follow from the bilinearity of . Finally, the fourth property is due
to the fact that is nondegenerate.
We say that the vectors x and y are antiorthogonal if
Œx; y D 0: (4.67)
In view of (4.66), we can state that the only vector that is antiorthogonal to any other
vector is 0. Further, any vector is antiorthogonal to itself. In any basis .ei / of E2n ,
the representative matrix of is skew-symmetric and the antiscalar product can be
written as
Œx; y D ij x i y j ; (4.68)
where O is a zero nn matrix and I is a unit nn matrix. Moreover, in a symplectic
basis, in view of (4.68) and (4.69), we obtain
ui ; uj D unCi ; unCj ; (4.70)
ui ; unCj D ıij ; (4.71)
n
X
Œx; y D .x i y nCi x nCi y i /: (4.72)
i D1
ST S D : (4.75)
AT B CT A D O; (4.77)
T T
A D C C D I; (4.78)
T T
B B D A D O; (4.79)
T T
B D D C D I: (4.80)
.det S/2 D 1;
4.8 Exercises
1. Let V be a subspace of the Euclidean vector space <4 generated by the vectors
.1; 0; 1; 3/ and .0; 1; 1; 2/. Determine the subspace V? of the vectors that are
orthogonal to V .
4.8 Exercises 59
A B D tr.BT A/
is an orthonormal basis of E. p p
3. Find an orthogonal 2 2 matrix whose first row is either .1= 5; 2= 5/ or .1; 2/.
4. Let En be a vector space, and denote by u a given vector of En . Prove that
•
† D fv 2 En ju v D 0g
is an .n 1/-dimensional subspace of En ;
•
En D † ˚ ‚;
where ‚ D fv D au; a 2 <g;
• The orthogonal projection P† .v/ of v 2 En onto † is given by
P† .v/ D v .v u/u:
5.1 Duality
!x .y/ D g.x; y/ D x y D 0; 8y 2 En :
i.e.,
ei D gij j : (5.2)
In the bases .ei / and . i /, isomorphism (5.1) has the following representation:
!i D gij x j ; (5.3)
In other words, isomorphism (5.1) associates with any vector x a covector whose
components in the dual basis are equal to the covariant components of x in the basis
.ei /. We can also state that (5.1) allows us to identify the vectors of En and the
covectors of En .
Definition 5.1. Any pair .x; !x / in which !x is given by (5.1) is said to be
a Euclidean vector. Further, the real numbers x i and !i satisfying (5.3) are
respectively called contravariant components and covariant components of the
Euclidean vector x.
Henceforth a Euclidean vector will be denoted by the first element of a pair
.x; !x /.
such that
T.x; y/ D T0 .x; !y / D T00 .!x ; !y /; 8x; y 2 En : (5.5)
These linear maps are isomorphisms since, adopting a basis in En and its dual basis
in En , they have the following coordinate representations:
Definition 5.2. The triad .T; T0 ; T00 / is called a Euclidean double tensor. Further,
the components Tij of T in the basis . i ˝ j /, the components Tji of T0 in the
basis .ei ˝ j /, and the components T ij of T00 in the basis .ei ˝ ej / are respectively
called covariant, mixed, and contravariant components of the Euclidean tensor
.T; T0 ; T00 /.
Henceforth we will denote a Euclidean tensor by the first element of a triad. It is
evident how to extend the preceding considerations to any tensor space Tsr .En /.
5.3 The Levi–Civita Tensor 63
In view of the foregoing results, we conclude that the tensor spaces T r .En / and
Tr .En / are isomorphic. We now verify that their subspaces ƒr En and ƒr En are
also isomorphic, and we determine the form of this isomorphism. For the sake of
simplicity, we refer to the case r D 2. If T 2 ƒr En , we have that
XX XX
T.ij / D gih gj k T hk C gih gj k T hk
i <j h<k i <j h>k
XX XX
D gih gj k T hk C gih gj k T kh
i <j h<k i <j h<k
D .gih gj k gi k gj h /T .hk/ :
In conclusion, we have
gih gi k
T.ij / D det T .hk/ : (5.7)
gj h gj k
More generally, we could prove that
0 1
gi1 h1 gi1 hr
T.i1 :::ir / D det @ A T .h1 :::hr / : (5.8)
gir h1 gir hr
In particular, when r D n, (5.8) gives
T1n D det.g/T 1:::n : (5.9)
we obtain
g 0 D A2 g; jg 0 j D A2 jgj;
so that
p p
jg 0 j D ˙A jgj; (5.14)
p the C sign when A > 0 and the sign when A < 0. This formula
where we take
shows that jgj is a pseudoscalar density of weight 1.
Introduce the skew-symmetric Levi–Civita symbol
8
< 0;
i1 :::in D 1; (5.15)
:
1;
Let En be a Euclidean vector space. We prove that the multilinear map , which
in a basis .ei / of En has the following components:
p
i1 :::in D gi1 :::in ; (5.17)
is a .0; n/-pseudotensor. In fact, in view of (5.14) and (5.16), under the basis change
(5.10), we have that
p p
0j1 :::jn D jg 0 jj1 :::jn D ˙A jgjj1 :::jn
p
D ˙Aij11 Aijnn gi1 :::in
where . i / is a basis of the dual vector space En . Taking into account what we have
just proved about the n-form and recalling the results of Sect. 3.4, we can state
that is a volume form that is invariant with respect to congruent basis changes.
We conclude this section with the following definition.
Definition 5.4. The Hodge star operator is a linear mapping
such that
? T.i1 i.nk/ / D i1 i.nk/ j1 jk T .j1 jk / : (5.21)
5.4 Exercises
1. Verify that in an arbitrary base .ei / of the Euclidean vector space En , the
eigenvalue equation (4.47) relative to the Euclidean tensor T assumes the form
where gij D ei ej .
2. Show that the components of the cross-product u v of the vectors u and v of
the three-dimensional Euclidean space E3 are
u .v w/; .u v/ .w x/:
Chapter 6
Differentiable Manifolds
y i D pr i ı f .x 1 ; : : : ; x n / y i .x 1 ; : : : ; x n /
is a C k function.
A homeomorphism f W U ! V , where V is an open set of <n , is a continuous
map with its inverse. Finally, the map f is a diffeomorphism of class C k if both
f and f 1 are C k maps. A diffeomorphism is represented by an invertible system
of functions y i .x 1 ; : : : ; x n /, .x 1 ; : : : ; x n / 2 U , i D 1; : : : ; n, of class C k together
with the inverse functions. It is well known that the condition
i
@y
det ¤0
@x j 0
at the point .x01 ; : : : ; x0n / 2 U is a sufficient condition for the invertibility of these
functions in a neighborhood of the point .x01 ; : : : ; x0n /.
A differentiable manifold can roughly be defined as an n-dimensional surface
embedded in <m , n < m. This approach to the analysis of differentiable manifolds
is more intuitive but not convenient for the following reasons. First, determining the
lowest-dimension n of the space <n in which we can embed the manifold is not an
easy task. For instance, the plane curves can be embedded in <2 , whereas the skew
curves can be embedded in <3 . Further, in this approach the geometric objects on the
X U
O ϕ xn
ϕ(U)
x1
manifold are defined starting from the space <n in which they are embedded. It is
much more interesting to build the geometry of a manifold in an intrinsic way. This
approach makes it possible to answer questions like the following ones: Is it possible
to recognize if a manifold is a sphere staying on it? Is it possible to recognize the
geometric structure of the three-dimensional space in which we live by measures
that are necessarily internal to our space?
Definition 6.1. Let n be a positive integer number and denote by X a Hausdorff1
paracompact topological space.2 X is said to be an n-dimensional manifold if, 8x 2
X , there exist an open neighborhood U of x and a homeomorphism ' W U !
'.U / <n . The pair .U; '/ is called a chart of domain U and coordinate map '.
Finally, the n numbers
.x 1 ; : : : ; x n / D '.x/ 2 '.U /
are the coordinates in the chart .U; '/ (Fig. 6.1).
Definition 6.2. An atlas of class C k on an n-dimensional manifold X is a
collection ˛ of charts on X satisfying the following conditions:
• The collection of the domains of the charts of ˛ is an open covering of X ;
• 8.U; '/; .V; / 2 ˛ the map
ı 1 W '.U \ V / ! .U \ V / (6.1)
is a C k diffeomorphism, called a coordinate transformation.
(See Fig. 6.2.)
1
A topological space X is a Hausdorff
T space if, 8x; y 2 X, x ¤ y, there are neighborhoods U ,
V of x, y, respectively, such that U V D ;.
2
A Hausdorff space is paracompact if every open covering contains a subcovering that is locally
finite.
6.1 Differentiable Manifolds 69
X
U
O ψ ϕ xn
ψ(U) ϕ(U )
x1
ψ ° ϕ−1
A chart .V; / is compatible with the atlas ˛ if, 8.U; '/ 2 ˛, map (6.1) is
of class C k . We call the collection of all the charts that are compatible with ˛ a
maximal atlas ˛ of X .
Definition 6.3. The pair Vn D .X; ˛/ is called an n-dimensional differentiable
manifold of class C k .
A difficult theorem of Whitney proves that any C k manifold Vn , k 1, becomes
an analytic manifold (i.e., the coordinate transformations between charts of an
atlas are analytic diffeomorphisms) by discarding a suitable collection of C k charts
belonging to the original maximal atlas. It is even more difficult to show that a C 0
manifold may fail to become a C 1 manifold.
Now we show how to obtain differentiable manifolds.
• Let U <n be an open set, and denote by .u1 ; : : : ; un / a point of U . Let S be
the locus of the points .x 1 ; : : : ; x l / 2 <l , n < l, given by the set of C 1 functions
x 1 D x 1 .u1 ; : : : ; un /;
::::::::::::::::::::::::::::: (6.2)
x l D x l .u1 ; : : : ; un /;
f1 .x 1 ; : : : ; x l / D 0;
::::::::::::::::::::::::::;
fm .x 1 ; : : : ; x l / D 0; (6.3)
are the parametric equations of in the chart .U; '/. The curve is closed if
.a/ D .b/.
Let .x i / be the coordinates defined by the chart .U; '/. If x0 D .x0i / 2 U is a
point of U , the i th coordinate curve at x0 is a curve with the following parametric
equations:
Vn U xn
ϕ(U )
x1
O t O
a b
F
Vn U V
Wm
ym
O xn O
ϕ(U) ψ(V)
x1 y1
x 1 D x01 ;
:::::::::::::;
x i D t;
::::::::::::::;
x n D x0n : (6.8)
F W Vn ! W m
is of class C k if
y ˛ D y ˛ .x 1 ; : : : ; x n /; ˛ D 1; : : : ; m: (6.10)
where .x 1 .t/; : : : ; x n .t// are the parametric equations of in the chart .U; '/. Then,
(6.12) gives
@ 1 dx i
Xx f D .f ı ' / : (6.13)
@x i '.x/ dt
.Xx C Yx /f D Xx f C Yx f;
.aXx /f D aXx f;
becomes an <-vector space that is called the tangent vector space to the manifold
Vn at point x.
Theorem 6.1. Let .x i / be a coordinate system relative to the chart .U; '/ of the
manifold Vn . Then, the relations
@ @ 1
f D .f ı ' / (6.15)
@x i x0 @x i '.x0 /
Proof. Denoting by i .t/ the i th coordinate curve crossing x0 , that is, the curve
with the following parametric equations,
x 1 D x01 ; : : : ; x i D t; : : : ; x n D x0n ;
and relations (6.15) define n vectors tangent to the coordinate curves. Their linear
independence is proved applying the linear combination
j @
D0
@x j x0
where the real numbers X i are the components of Xx relative to the natural basis.
It is fundamental to determine the transformation formulae of the natural bases
and the components of a tangent vector for a change .x i / ! .x 0j / of the local
coordinates. From (6.15) we obtain that
@ @x j @ j @
D Ai ; (6.17)
@x 0i x @x 0i @x j x @x j x
and, consequently,
@x 0i j
X 0i D X .A1 /ij X j : (6.18)
@x j
6.4 Cotangent Vector Space 75
Definition 6.8. The dual vector space of Tx Vn (Sect. 2.1) is called the cotangent
vector space Tx Vn .
If Tx Vn is referred to as the natural basis .@=@x i /x relative to the coordinates .x i /,
the dual basis . ix / is characterized by the conditions
such that
.df /x Xx D Xx f; Xx 2 T x V n : (6.21)
In a natural basis relative to the coordinates .x i / of the chart .U; '/ of Vn , (6.21)
gives
.df /x Xx D ai Xxi ; (6.22)
where
@
ai D f: (6.23)
@x i x
where
@
!i D !x : (6.26)
@x i x
76 6 Differentiable Manifolds
Owing to the results of Sect. 2.2, we can state that under the coordinate change
.x i / ! .x 0i /, the following transformation formulae of the dual bases and the
components of a covector hold [see (6.17)]:
@x 0i
.dx 0i /x D .dx j /x D .A1 /ij .dx j /x ; (6.27)
@x j
@x j j
!i0 D !j D Ai !j : (6.28)
@x 0i
Starting from Tx Vn and Tx Vn it is possible to build the whole tensor algebra
.Tsr /x Vn as well as the exterior algebra .^s /x Vn at any point x 2 Vn (Chap. 2). In
particular, the transformation formulae under a change of local coordinates .x i / !
.x 0i / of the components Tji11i
ir
s
of any .r; s/-tensor belonging to .Tsr /x Vn ,
@ @
T D Tji11i
ir
˝ ˝ i ˝ dx j1 ˝ ˝ dx js ; (6.29)
s
@x i1 @x 1
are
@x 0i1 @x 0ir @x k1 @x ks h1 hr
Tj0i11iis r D T : (6.30)
@x h1 @x hr @x 0j1 @x 0is k1 ks
@
X D X i .x 1 : : : x n / ; (6.32)
@x i
which differs from (6.16) since the components X i are C k functions of the
coordinates.
Similarly, a C k tensor field is a map
@ @
T D Tji11j
ir
.x 1 ; : : : ; x n / ˝ i ˝ dx j1 ˝ ˝ dx js ; (6.34)
s
@x i1 @x r
W x 2 Vn ! x 2 .^x /s Vn ; (6.35)
6.4 Cotangent Vector Space 77
We conclude this section by introducing the Lie algebra of the vector field on a
manifold Vn .
Let Vn be a C 1 n-dimensional manifold and denote by F 1 Vn the <-vector
space of the C 1 functions on Vn and by 1 Vn the <-vector space of the C 1
vector fields on Vn . Then, to any C 1 vector field
X W x 2 V n ! Xx 2 T x V n
X W f 2 F 1 Vn ! Xf 2 F 1 Vn (6.37)
such that
.Xf /.x/ D Xx f: (6.38)
It can be easily proved that map (6.37) verifies the following derivation property:
From (6.38) and (6.32) we obtain the following coordinate form of (6.40):
@Y i @X i @
ŒX; Y D X j j Y j j : (6.41)
@x @x @x i
F W Vn ! W m
is the linear map
Fx W Xx 2 Tx Vn ! YF .x/ 2 TF .x/Wm (6.43)
such that the image of the tangent vector Xx at x to the curve .t/ is the tangent
vector to the curve F ..t// at the point F .x/. Formally,
d
YF .x/ g D g ı F ı .t/; 8g 2 F .F .x//: (6.44)
dt
To find the coordinate representation of (6.44), we introduce a chart .U; '/ with
coordinates .x i /, i D 1; : : : ; n, in a neighborhood of x 2 Vn and a chart .V; /,
with coordinates .y ˛ /, ˛ D 1; : : : ; m, in neighborhood V of F .x/. We denote by
y ˛ D y ˛ .x 1 ; : : : ; x n / the coordinate form of the map F and by x i .t/ the parametric
equations of the curve .t/ in the coordinates .x i /. Then, (6.44) can be written as
follows:
@ dx i @y ˛ @ i @y
˛
@
Y ˛ .y ˇ .x i // g D g X g; 8f 2 F F .x/:
@x ˛ dt @x i @y ˛ @x i @y ˛
In conclusion, the coordinate form of (6.43) is
i @ @y ˛ i @
Fx W Xx D X 2 T x Vn ! X 2 TF .x/ Wm : (6.45)
@x i x @x i @y ˛ F .x/
Starting from linear map (6.43), we can give the following definition.
Xx V
U F
x
O xn O
y
x
between the dual spaces TF.x/Wm and Tx Vn , defined by the condition
@y ˛ i
!i X i D ˛ X ;
@x i
and, taking into account the arbitrariness of Xx , we obtain
@y ˛
!i D ˛ : (6.48)
@x i
In conclusion, the coordinate form of (6.46) is
@y ˛ i
FF.x/ W ˛ dy ˛ 2 TF.x/ Wm ! ˛ dx 2 Tx Vn : (6.49)
@x i
Now we consider the extension of the differential F of F that is the new linear
map, denoted by the same symbol,
such that
Fx .X1 ˝ : : : ˝ Xr / D Fx X1 ˝ : : : ˝ Fx Xr ; (6.51)
8X1 ; : : : ; Xr 2 Tx Vn . It can be easily verified that map (6.50) transforms the .r; 0/-
tensor
@ @
T D T i1 ir i ˝ : : : ˝ i
@x 1 @x r
into the .r; 0/-tensor
˛1 ˛r
O D @y @y T i1 ir @ ˝ : : : ˝ @ :
T (6.52)
@x i1 @x ir @y ˛1 @y ˛r
such that
FF.x/ . 1 ˝ ˝ s / D FF.x/ 1 ˝ ˝ FF.x/ s (6.54)
@y ˛1 @y ˛s
TO D T˛ ˛ dx i1 ˝ ˝ dx is 2 .Ts0 /x Vn : (6.55)
@x i1 @x is 1 s
Finally, we can define the linear map
O 2 .ƒs /x Vn
Fx W F .x/ 2 .ƒs /x Wm ! (6.56)
such that
FF.x/. 1 ^ ^ s / D FF.x/ 1 ^ ^ FF.x/ s (6.57)
81 ; : : : ; s 2 TF.x/ .
Again it is simple to verify that (6.56) maps
into
˛1 ˛s
O D @.y ; : : : ; y / .˛1 ˛s / dx i1 ^ ^ dx is 2 .ƒs /x Vn :
(6.58)
@.x i1 ; : : : ; x is /
Remark 6.1. It is important to note that, in general, none of the preceding linear
maps can be extended over the entire manifolds Vn or Wm because F W Vn ! Wm
can be neither one-to-one nor onto. For instance, the vector field X W x 2 Vn !
Fx Xx is defined on F .Vn /, and it could assume more values at the same point if F
were not one-to-one.
If F W Vn ! Wm is a diffeomorphism, then n D m, and we can define an
isomorphism
Fx W .Tsr /x Vn ! .Tsr /F .x/ Wn (6.59)
such that
1
Fx .X1 ˝ ˝ Xr ˝ !1 ˝ ˝ !s / D Fx X1 ˝ ˝ Fx !s : (6.60)
Once again, it is simple to verify that the linearity of (6.59) implies that (6.59) maps
the tensor
@ @
T D Tji11j
ir
i
˝ ˝ i ˝ dx j1 ˝ ˝ dx js 2 .Tsr /x Vn
s
@x 1 @x r
6.6 Tangent and Cotangent Fiber Bundles 81
@y ˛1 @y ˛r @x j1 @x js i1 ir @
Fx T D T ˝
@x ii @x ir @y ˇ1 @y ˇs j1 js @y ˛1
@
˝ ˝ dy ˇ1 ˝ ˝ dy ˇs 2 .Tsr /F .x/ Wn : (6.61)
@y ˛r
T Vn D f.x; Xx /; x 2 Vn ; Xx 2 Tx Vn g : (6.62)
The map
W .x; Xx / 2 T Vn ! x 2 Vn (6.63)
x 1 D sin
cos ';
x 2 D sin
sin ';
x 3 D cos
:
x
O
x
Fig. 6.6 Sphere in E3
6.7 Riemannian Manifolds 83
We have already said that any regular surface in Euclidean space is a differen-
tiable manifold. Now we wish to equip a sphere with the metrics induced by the
metrics of E3 . This means that the square distance ds 2 between two points .x i / and
.x i C dx i / of S 2 is assumed to be equal to the Pythagorean distance in E3
3
X
ds 2 D .dx i /2 ; (6.65)
i D1
ds 2 D sin2
d' 2 C d
2 ; (6.66)
x 1 D sin
cos ';
x 2 D sin
sin ';
x 3 D a cos
;
with axes 2, 2, and 2a. Again adopting the foregoing viewpoint, the square distance
between two points .x i / and .x i C dx i / of †2 becomes
The sphere S 2 and the ellipsoid †2 are diffeomorphic so that they are essentially
the same manifold. In particular, the variables ' and
are local coordinates for both
surfaces. These two manifolds become different from each other when we introduce
a metric structure by the square distance ds 2 (Fig. 6.7).
Definition 6.14. A C k -differentiable n-dimensional manifold Vn is a Riemannian
manifold when on Vn a metric tensor, that is, a C k .0; 2/-symmetric, nondegenerate
tensor field g, is given.
Owing to the properties of g, we can introduce into any tangent space Tx Vn a
scalar product of two arbitrary tangent vectors Xx and Yx
Xx Yx D gx .Xx ; Yx /; (6.68)
x
O
In local coordinates .x i / relative to a chart .U; '/ of Vn , the tensor g assumes the
following representation:
Xx Yx D gij X i Y j : (6.70)
@ @
gij D D ˙ıij : (6.71)
@x i @x j
The set f1; : : : ; 1; 1; : : : ; 1g is called the signature of the metric tensor and is
independent of the coordinates.
Finally, we define as the square distance ds 2 between two points .x i / and .x i C
i
dx / the quantity
ds 2 D gij dx i dx j : (6.72)
Definition 6.15. Let Vn be a Riemannian manifold and suppose that the tensor gx
is positive definite at any point x 2 Vn . The length of a C 1 curve .t/ W Œa; b
< ! Vn is the real number
6.8 Geodesics of a Riemannian Manifold 85
Z
l. / D ds: (6.73)
It is evident that l. / depends neither on the choice of the coordinates nor on the
parameterization of the curve.
Let .xai / D .x i .a// and .xbi / D .x i .b// be the initial and final points of .t/, and
consider the one-parameter family of curves
• In addition, they are such that any curve starts from .xai / and ends at .xbi /, i.e.,
Definition 6.16. The curve .t/ between the points .xai / and .xbi / is a geodesic of
the metric g if the function I.s/ is stationary for s D 0 for any family of curves
satisfying (6.76) and (6.77).
To determine the parametric equations of the geodesic between the points .xai /
and .xbi /, we start by analyzing the stationarity condition
dI
.0/ D 0: (6.79)
ds
Introducing the position
q
L.f h ; fPh / D gij fPi fPj ; (6.80)
Since
@L @fPh @ @L @f h @ @L @f h
D ;
@fPh @s @t @fPh @s @t @fPh @s
we have that
Z b b
dI @L @ @L @f h @L @f h
.s/ D .0; t/dt C : (6.81)
ds a @f h @t @fPh @s @fPh @s a
From this formula, when we recall (6.76) and (6.77), we deduce that the condition
dI =ds.0/ D 0 is equivalent to requiring that the equation
Z b
@L j j d @L j j @f h
.x ; x
P / .x ; x
P / .0; t/dt D 0 (6.82)
a @x h dt @xP h @s
@L j j d @L j j
h
.x ; xP / .x ; xP / D 0; h D 1; : : : ; n; (6.83)
@x dt @xP h
and the boundary conditions
Remark 6.2. In the Cauchy problem relative to (6.83), we must assign the initial
conditions x h .0/ D xah and xP h .0/ D X h , h D 1; : : : ; n. It is well known that, under
general hypotheses on the function L.x i ; xP i /, there is only one solution satisfying
the Euler–Lagrange equations and the initial data, that is, there is only one geodesic
starting from the point .xai / and having at this point the tangent vector .X i /. In
contrast, there is no general theorem about the boundary problem (6.83), (6.84), so
perhaps there is one and only one geodesic between the points .xah / and .xbh /, no
geodesic or infinite geodesics. In this last case we say that .xah / and .xbh / are focal
points.
Remark 6.3. A curve .t/ has been defined as a map W t 2 Œa; b ! Vn .
Consequently, a change in the parameter into the parametric equations x h .t/ leads
to a different curve, although the locus of points is the same. On the other hand,
the presence of the square root under integral (6.74) implies that the value of the
length l. / is not modified by a change in the parameter t; in other words, l. / has
the same value for all curves determining the same set of points of Vn . This remark
implies that (6.83) cannot be independent. To show that only n1 of these equations
are independent, it is sufficient to note that from the identity
6.8 Geodesics of a Riemannian Manifold 87
n
X @L
LD xP i (6.85)
i D1
@xP i
it follows that
X n
@L @2 L
h
D xP i i h ; (6.86)
@x i D1
@xP @x
X n
@L @L @2 L
h
D h
C xP i i h ; (6.87)
@xP @xP i D1
@xP @xP
and then
n
X @2 L
xP i D 0: (6.88)
i D1
@xP i @xP h
This relation shows that if the parametric equations x h .t/ satisfy the first n 1
Euler–Lagrange equations, then they satisfy the last one.
We use the arbitrariness of the parameter t to obtain a new form of the equations
of a geodesic. Setting
p
L D '; (6.90)
If we choose a parameter s for which the tangent vector .xP h / to the geodesic has
unit length, i.e., a parameter s such that
dx h
' D gij xP i xP j D 1; xP h D ; (6.92)
ds
88 6 Differentiable Manifolds
1 @ d
h
.gij xP i xP j / .ghj xP j / D 0; h D 1; : : : ; n: (6.93)
2 @x ds
6.9 Exercises
x 1 D sin
cos ';
x 2 D sin
sin ';
x 3 D cos
;
x 1 D cos ';
x 2 D sin ';
x 3 D z;
where 0 ' 2 is the longitude and z 2 < is the ordinate along the axis of
the cylinder.
3. Find an atlas for the torus T 2 in three-dimensional Euclidean space starting
from its parametric equations
x 1 D sin
cos ';
x 2 D sin
sin ';
x 3 D cos
;
B A A’
B’
O x
7. The diffeomorphism
overlaps the open subsets .0;
/ .0; 1/ and .2
; 2/ .0; 1/ of the same
squares (Fig. 6.11).
Show that U , V and , are an atlas for the cylinder S 1 .0; 1/. Essentially,
a cylinder is obtained from a rectangle by identifying the points of two opposite
sides.
6.9 Exercises 91
S r
x’
8. The diffeomorphism
overlaps the open subsets .0;
/ .0; 1/ and .2
; 2/ .0; 1/ of the same
squares (Fig. 6.11). Show that U , V and , are an atlas for the Moebius strip.
Essentially, this surface is obtained by identifying the points of two opposite
sides after a torsion of 180ı.
9. Determine an atlas with four coordinate domains and the relative coordinate
transformations to obtain a torus. Show that the torus is obtained from a
rectangle by identifying the points of the two pairs of opposite sides.
10. This exercise shows how we can obtain a geographic chart using a map between
two manifolds. Let S 2 N be the unit sphere minus the North Pole. The
diffeomorphism F , which is called a stereographic projection, between S 2 N
and the tangent plane at the South Pole S , is shown in Fig. 6.12. F maps the
point x 2 S 2 N into the point x 0 2 corresponding to the intersection of
the straight line N x with . If we adopt spherical coordinates .';
/ on S 2 and
polar coordinates .'; r/ with its center at the South Pole on , the map F has
the following coordinate form (Fig. 6.13):
/2
O
S x
x’
r
S
x’
x
O
z
Determine how F transforms the coordinate curves, the tangent vectors, and
the covectors on S 2 . Analyze how it transforms the distances and the angles
between tangent vectors.
11. As another example of a geographic chart, we consider the diffeomorphism
between F W S 2 fS; N g ! C , where C is the cylinder in Fig. 6.14 and F
maps x 2 S 2 into the point of C belonging to the straight line Ox. Adopting
spherical coordinates .';
/ on S 2 and cylindrical coordinates .'; z/ on C , the
map F assumes the following coordinate form:
Determine how F transforms the coordinate curves, the tangent vectors, the
covectors, the distances, and the angles between tangent vectors.
6.9 Exercises 93
such that
1. 8t 2 < the map t W x 2 Vn ! t .x/ 2 Vn is a C k diffeomorphism of Vn ;
2. 8t; s 2 < 8x 2 Vn , t Cs .x/ D t ı s .x/.
In particular, from the second property we have t .x/ D 0Ct .x/ D 0 ı t .x/,
so that
0 .x/ D x; 8x 2 Vn : (7.2)
Definition 7.2. 8x0 2 Vn , the C k curve t .x0 / W < ! Vn on Vn is called the orbit
of G determined by x0 . In view of (7.2), the orbit contains x0 .
Theorem 7.1. Any point x0 2 Vn belongs to one and only one orbit, i.e., the orbits
determine a partition of Vn .
Proof. First, we show that if x2 2 t .x1 /, then the orbit determined by x2 coincides
with t .x1 / up to a change of the parameter. In other words, an orbit is determined
by any of its points. In fact, if x2 2 t .x1 /, then there exists a value t2 of t such
that x2 D t2 .x1 /. In view of property (7.3), x1 D t2 .x2 /. Consequently, the orbit
t .x1 / can also be written in the form t t2 .x2 /, which, up to the change t ! t t2
of the parameter, gives the orbit determined by x2 . Now it remains to prove that if
x2 does not belong to t .x1 /, then the orbits t .x1 / and t .x2 / do not intersect each
other. In fact, if there is a point x0 belonging to both orbits, then there exist two
values t1 ; t2 2 < such that
such that
x 0 t .x/ D x C tu;
r 0 D r;
˛ 0 D ˛ C t;
z0 D z
@
XD :
@˛
Definition 7.4. Let U be an open region of Vn . A one-parameter local group of
diffeomorphisms is a differential map
W .; / U ! Vn ; (7.4)
where > 0, such that
1. 8t 2 .; /, t W x 2 U ! t .x/ 2 t .U / is a diffeomorphism;
2. If t, s, t C s 2 .; /, then t Cs .x/ D t .s .x//.
Theorem 7.3. Let X be a differentiable vector field on Vn . For any x 2 Vn , there
exist an open region U Vn , a real number > 0, and a one-parameter local
group of diffeomorphisms W .; / U ! Vn whose infinitesimal generator is X.
Proof. Consider the system of ordinary differential equations (ODEs)
dx i
D X i .x 1 ; : : : ; x n /; i D 1; : : : ; n:
dt
If the vector field is differentiable, from known theorems of analysis, 8x 2 Vn there
exist a neighborhood U of x and an interval .; / such that one and only one
solution x i D .t; x0 /, t 2 .; /, of the preceding system exists satisfying the
initial data i .0; x0 / D x0 , 8x0 2 U . For the uniqueness theorem, x i D .t; x0 / is
a diffeomorphism 8t 2 .; /. We omit the proof of the property .t C s; x0 / D
.t; .s; x0 //.
Definition 7.5. The vector field X is said to be complete if it is an infinitesimal
generator of a global one-parameter group of diffeomorphisms.
Theorem 7.4. On a compact manifold Vn , every differential vector field is complete.
Proof. The preceding theorem allows one to state that, 8x 2 Vn , a map W
..x/; .x// Ux ! Vn exists verifying conditions 1 and 2 of Definition 7.4.
Since Vn is compact, the covering .Ux /x2Vn admits a finite subcovering .Uxi /i D1:::;s .
Setting by D min f.x1 /; : : : ; .xs /g, the map is defined on .; / Vn .
LX f D Xf: (7.6)
such that, 8x 2 Vn ,
1
.LX Y/x D lim Œ.t / .t .x//Yt .x/ Yx : (7.7)
t !0 t
Definition 7.8. The Lie derivative on with respect to the vector field X is the
map (Fig. 7.2)
LX W ! 2 ! LX ! 2
7.2 Lie’s Derivative 99
such that, 8x 2 Vn ,
1
.LX !/x D lim Œ.t / .x/!t .x/ !x : (7.8)
t !0 t
It is evident how Lie’s derivative can be extended to the tensor fields of rs .
To find the coordinate expression of Lie’s derivative, we introduce a chart .U; x i /
on Vn and denote by y i D ti .x 1 ; : : : ; x n / the coordinate expression of the one-
parameter group of the diffeomorphisms t .x/ and by X i the components of the
infinitesimal generator of the group X. Then, the maps t .x/ and t .x/ in a
neighborhood of x are given by the expressions
LX Y D ŒX; Y : (7.15)
Starting from (7.9), we derive the coordinate form of the codifferential .t .x//
j
j @X @!j
..t / !t .x/ /i D ıi C t !j .x/ C X h
.x/t CO.t/; (7.16)
@x i x @x h x
It is not a difficult task to prove that for an arbitrary tensor T 2 rs , Lie’s
derivative is expressed by the following formula:
" r
@ i1 ir X i1 ik1 hikC1 ir @X ik
LX T D X h T Tj1 js
@x h j1 js @x h
kD1
s
#
X i1 ir @X h @ @
C Tj1 jk1 hjkC1 js j i
˝˝ i
@x h @x 1 @x r
kD1
˝dx j1 ˝ ˝ dx js : (7.18)
X @ @X h
LX D Xh .j 1 j s / C h.j 2 j s /
i1 <<is
@x h @x j1
@X h
C .j1 js1 /h dx j1 ^ ^ dx js : (7.19)
@x js
From the definition of Lie’s derivative there follows that LX is linear with respect
to X, and the derivation property with respect to the tensorial product and the
exterior product
LX .T ˝ S/ D LX T ˝ S C T ˝ LX S; (7.20)
LX .T ^ S/ D LX T ^ S C T ^ LX S: (7.21)
where .t / denotes the extension of the differential .t / of the diffeomorphisms
of the group to the tensor algebra (Chap. 6).
In particular, a vector field Y and a 1-form ! are invariant under the group
t .x/ if
LX T D 0; (7.25)
! D !.i1 is / dx i1 ^ dx is
iX .! ^ / D iX ! ^ C .1/s ! ^ iX I (7.27)
3. If ! 2 Vn , then
iX ! D !.X/:
7.3 Exercises
Hint: The parametric equations of the group’s orbits are solutions of the system
of ODEs
dx dy
D x; D y:
dt dt
The solutions are
x D x0 et ; y D y0 et ;
@ @
XD C ey :
@x @y
Hint: The parametric equations of the group’s orbits are solutions of the system
of ODEs
dx dy
D 1; D ey :
dt dt
Consequently, the parametric equations of the orbits are
@ @
X D y Cx :
@x @y
Hint: The parametric equations of the group’s orbits are solutions of the system
of ODEs
dx dy
D y; D x;
dt dt
i.e., they are given by the functions
@ @
X D .x y/ C .x C y/
@x @y
is
x.t/ D et .x0 cos t y0 sin t/; y.t/ D et .y0 cos t C x0 sin t/:
x 0 D x C ˛t; y 0 D y C ˇt;
where x and y are Cartesian coordinates, determine the vector fields that are
invariant under the action of the group.
Hint: The infinitesimal generator of the group t .x/ is the vector field X D
.˛; ˇ/. The differential .t / of the diffeomorphisms t .x/ is represented by the
unit matrix. Further, in view of Theorem 7.5, we can determine the invariant
fields Y either by (7.23) or by (7.25). The last condition gives
@Y 1 @Y 1 dY 1
˛ Cˇ D D 0;
@x @y dt
@Y 2 @Y 2 dY 2
˛ Cˇ D D 0;
@x @y dt
Both the results show that the components of Y must be constant along the
group’s orbits.
6. On the unit sphere S referred to the spherical coordinates .'; /, consider the
one-parameter group G of diffeomorphisms
where ˛ and ˇ are constant. Determine the orbits of G and the vector fields that
are invariant under the action of G.
Chapter 8
Exterior Derivative and Integration
If the exterior derivative exists, then, applying properties 2 and 3, we have that
Finally, taking into account property 1, we obtain the coordinate form of the exterior
derivative
that is,
1 @w3 @w2 @w1 @w3 @w2 @w1
rD p 3; 3 1; 1 2 : (8.9)
jgj @x 2 @x @x @x @x @x
is
@23 @31 @12
d D C C dx 1 ^ dx 2 ^ dx 3 : (8.10)
@x 1 @x 2 @x 3
1 1 1
u1 D p 23 ; u2 D p 31 ; u3 D p 12 ; (8.12)
jgj jgj jgj
where
p
jgjdx 1 ^ dx 2 ^ dx 3 (8.15)
df D ! , rf D v; (8.16)
d! D , r v D u; (8.17)
d D 0 , r u D 0: (8.18)
d! D : (8.19)
108 8 Exterior Derivative and Integration
d D 0: (8.20)
It is evident that the exact differential r-forms belong to the vector subspace
[see (1.39)]
Er D d.^r1 Vn / D Im.d/; (8.21)
Cr D ker.d/; (8.22)
Er Cr ^r Vn : (8.23)
8; 0 2 Cr ; 0 if 0 2 Er : (8.24)
.1; x/ D x; .0; x/ D x0 ; 8x 2 U:
br D dim.A/ D 0;
In this section, we prove some relations existing among the exterior derivative, the
differential of a map, the Lie derivative, and the inner product of a differential r-
form by a vector field.
In this regard the following theorem holds.
Theorem 8.2. For ˛ 2 ƒk Vn , f 2 F Vn , and X 2 Vn ,
if X ˛ D f iX ˛; (8.26)
iX df D LX f; (8.27)
LX ˛ D iX d˛ C d.iX ˛/; (8.28)
Lf X ˛ D f LX ˛ C df ^ iX ˛; (8.29)
LX .d˛/ D d.LX ˛/: (8.30)
j @˛i @X j
LX ˛ D X C ˛j dx i ;
@x j @x i
@˛i @˛j
iX d˛ D X j
i dx i ;
@x j @x
@X j i @˛j
d.iX ˛/ D d.X j ˛j / D ˛j dx C X j i dx i ;
@x i @x
and (8.28) is proved for 1-forms. Since any r-form ˛ is a linear combination of
exterior products of 1-forms, and LX , iX , and d are linear maps, to prove the
theorem, it is sufficient to verify that (8.28) is satisfied for the exterior product
˛1 ^ ˛2 of two 1-forms. Now, owing to (7.2) and recalling that (8.28) has been
proved for 1-forms, we have that
LX .˛1 ^ ˛2 / D LX ˛1 ^ ˛2 C ˛1 ^ LX ˛2
D .iX d˛1 C d.iX ˛1 // ^ ˛2 C ˛1 ^ .iX d˛2 C d.iX ˛2 //
D .iX d˛1 / ^ ˛2 C d.iX ˛1 / ^ ˛2
C ˛1 ^ .iX d˛2 / C ˛1 ^ d.iX ˛2 /:
and (8.28) is proved. In the same way, we can prove (8.29) and (8.30). We omit the
proof of (8.31) and (8.32).
In Sect. 3.4, we showed that one of the two possible orientations of <r is determined
by choosing a basis .ei /. Denote by dx 1 ^ ^ dx r the volume r-form of <r .
Definition 8.7. The integral of the r-form
! D !.i1 ir / dx i1 ^ ^ dx ir
where
@x 0i
J D det : (8.35)
@x j
Henceforth, we identify the compact set U with the r-cube defined as follows:
˚
U D .x 1 ; : : : ; x r / 2 <r ; ai x i ai C c i : (8.36)
Starting from the .r 1/-cubes we can define the .r 2/-cubes, which are the faces
of the .r 1/-cubes, and so on. For instance, if r D 2, then the 2-cubes are rectangles
and the 1-cubes are the sides of the rectangles. Further, if r D 3, then a 3-cube is a
parallelepiped, the 2-cubes are its faces, and the 1-cubes are its edges.
Now we consider the vectors ni , i D 1; : : : ; r, D 0; 1, which in the basis .ei /,
chosen to determine an orientation of <r , have the components
j j
ni;0 D ıi ej ; ni;1 D ıi ej : (8.38)
In particular, if r D 2, then
It is evident that the vectors ni are normal to the faces of the r-cube U and outward
oriented (Fig. 8.1 is relevant to the case r D 2).
.1/
On any face Ui of the r-cube U , we consider a vector set †i D .ni ; vi ; : : : ;
.r1/ .1/ .r1/
vi /, where the vectors vi ; : : : ; vi are independent vectors belonging to Ui ,
chosen in such a way that the bases †i are congruent with the basis .ei /. When U
is equipped with these bases on its faces, we say that U is an oriented cube of <r
and we write .U; /, where is the orientation determined by .ei /.
Definition 8.8. We define an oriented r-cube on a differentiable manifold Vn , a
tern C D .U; ; F /, where .U; / is an oriented r-cube of <r and F W U ! Vn is a
differentiable map.
Example 8.1. Given the 2-cube of <2 defined by the inequalities
0 r 1; 0 2
;
112 8 Exterior Derivative and Integration
x 1 D r cos ; x 2 D r sin :
0 ' 2
; 0
=2;
is the ordinary curvilinear integral of ! along the curve with parametric equations
x i .t/.
If s is the curvilinear abscissa along and we denote by t D .dx i =ds/ the unit
tangent to and by v a vector whose covariant components vi are equal to !i , then
the preceding integral assumes the familiar form
Z Z
!D v tds: (8.40)
C
We conclude this section stating without the difficult proof the following
generalized Stokes theorem:
Theorem 8.3. If C is an oriented r-chain of the differentiable manifold Vn and @C
its cochain, then, 8! 2 ƒr1 Vn , we obtain
Z Z
d! D !: (8.43)
C @C
8.5 Exercises
X X @!
.i2 /
.irC1 / i1
d! D .1/m.
/ dx ^ dx irC1 ; (8.44)
i1 <<irC1
2…rC1
@x
.i1 /
where ….r C 1/ is the set of all permutations of the indices i1 < i2 < <
irC1 , for which i2 < < irC1 , and m.
/ is the number of inversions in the
permutation f
.i1 /;
.i2 /; : : : ;
.irC1 /g.
8.5 Exercises 115
on a manifold V3 .
Hint: Since n D 3, the only choice i1 < i2 < i3 is 1; 2; 3. On the other hand, the
possible permutations
.1/;
.2/;
.3/ of these indices for which
.2/ <
.3/
are f1; 2; 3g, f2; 1; 3g, f3; 1; 2g. Therefore, (8.44) gives
@23 @13 @12
d D 1
2
C dx 1 ^ dx 2 ^ dx 3 ;
@x @x @x 3
[see (8.16)] evaluate the area of the triangle and the parallelogram of Figs. 9.1
and 9.2, on the unit sphere referred to the spherical coordinates ', .
5. Integrate on the unit sphere the 1-form
xdx C dy C dz
dx i
D X i .x 1 ; : : : ; x n /; i D 1; : : : ; n: (9.1)
dt
It is evident that X.x i .t// is the tangent vector to the curve x i .t/ at any point. When
a point x0 .x0i / 2 U is given, there is one and only one (local) integral curve of X.x/
containing point x0 . We denote by x i .t; x0 / the general integral of (9.1).
dY dY i dei
D ei C Y i
dt dt dt
i k
@Y h @eh dx
D C Y : (9.2)
@x k @x k dt
To attribute a meaning to the derivatives @ei =@x k that express the variation of the
basis vectors on varying the coordinates, we start by noting that if the functions
y i D y i .x 1 ; : : : ; x n /
@ei @2 y m
D um
@x j @x i @x j
@2 y m @x k
D ek ;
@x i @x j @y m
and introducing the notations
@2 y m @x k
ijk D jki ; (9.3)
@x i @x j @y m
we obtain
@ei
D ijk ek ; (9.4)
@x i
and we can give (9.2) the form
dY @Y i
D C kh Y X k ei :
i h
(9.5)
dt @x k
i
This formula is not satisfactory since the quantities kh are given in terms of the
i
rectilinear coordinates .y /, as is shown by (9.3), not in terms of the coordinates
.x i /. But this further problem can be solved easily because we are operating in a
Euclidean space. In fact, a scalar product
X Y D gij X i Y j
9.1 Preliminary Considerations 119
between two arbitrary vectors X and Y of the same tangent space to En is defined.
In particular, ei ej D gij . Consequently, we have that
@gj k
jni gnk C ink gnj D ; (9.7)
@x i
n @gki
kj gni C jni gnk D : (9.8)
@x j
Adding (9.6) and (9.7) and subtracting (9.8), we express the quantities ijn in terms
of the metric coefficients gij and their first derivatives in the coordinates .x i /:
1 nh @ghj @gj i @gih
ijn D g i
C h
: (9.9)
2 @x @x @x j
for any choice of C 1 real functions f and g and the C 1 vector fields X and Y on En .
We conclude this section by noting that the expression (9.5) of the derivative of
Y.t/ along the curve x i .t/, where the quantities are given by (9.9), was obtained
under the following two conditions:
• There exists a rectilinear system of coordinates.
• In any tangent space to the manifold Vn , there is a scalar product.
In the following sections, we show how to extend the results of this section to
arbitrary manifolds.
120 9 Absolute Differential Calculus
such that
X D X i ei ; Y D Y i ei ; (9.15)
where .ei / are the vector fields defining a natural basis of the tangent space at any
point of the open set U . From the properties (9.13) and (9.14) there follows
i
i @Y
rX Y D rX k ek .Y ei / D ei C Y rek .ei / X k :
i
(9.16)
@x k
h
Introducing the connection coefficients ki by the relations
h
rek .ei / D ki eh ; (9.17)
(9.16) becomes
@Y i i h
rX Y D C kh Y X k ei rk Y i X k ei : (9.18)
@x k
Although (9.18) and (9.15) are formally identical, in (9.18) the n3 C 1 functions kh i
i
are arbitrary. In particular, they could not verify the symmetry conditions kh D
i i
hk . In any case, if we give an affine connection on Vn , then its coefficients kh
are determined in every coordinate domain. Conversely, a connection is defined in
i
every coordinate domain U by T giving the coefficients kh in U . If we introduce a
0i
new chart .V; x / such that U V ¤ ;, then the connection coefficients in the
new chart cannot be arbitrarily assigned. In fact, in the coordinate transformation
.x i / ! .x 0i /, the natural bases are transformed according to the rule
@x i
e0j D ei Aij ei : (9.19)
@x 0j
9.3 Parallel Transport and Autoparallel Curves 121
rX Y D .Y;ik Ckh
i
Y h /X k ei D .Y;0lm Cmn 0l
Y 0n /X 0m e0l
D Akm ..A1 /li Y i /;k Cmn
0l
.A1 /nh Y h .A1 /m p r
p X Al er
h
D Akm .A1 /m r 1 l i k 1 m r
p Al .A /i Y ;k CAm .A /p Al .A /i;k Y
1 l i
i
C .A1 /nh Arl .A1 /m 0l
p mn Y
h
X p er :
k
rX Y D Y;ik C.Ail .A1 /lh C .A1 /m i 1 n 0l
k Al .A /h mn /Y
h
X ei :
Finally, we obtain the following transformation formulae for the connection coeffi-
cients:
i
kh D Ail .A1 /m 1 n 0l i 1 l
k .A /h mn C Al .A /h ;k ; (9.20)
whose inverse formulae are
0i
kh D .A1 /il Am n l 1 i l
k Ah mn C .A /l Ah ;k : (9.21)
These formulae show that the connection coefficients are not transformed like the
components of a .1; 2/-tensor unless the coordinate transformation is linear since,
in this case, Alh ;k D 0.
We do not prove that an affine connection can be assigned on any paracompact
manifold.
Let .U; x i / be a chart on a manifold Vn , equipped with an affine connection, and let
.x i .t// be the parametric equations of a curve contained in region U . Finally, we
denote by
dx i
XD ei (9.22)
dt
the tangent vector to evaluated in the natural bases .ei / along .
Definition 9.2. The vector Y.t/ is said to be parallel along if
dY
rX Y D 0: (9.23)
dt
122 9 Absolute Differential Calculus
dY i i dx k
C kh Yh D 0; i D 1; : : : ; n: (9.24)
dt dt
rX X D 0; (9.27)
2
C kh D 0; i D 1; : : : ; n: (9.28)
ds ds ds
This is a second-order differential system in the unknowns x i .s/. Consequently, if
a point x0 2 Vn is fixed together with a vector .X i .0// 2 Tx0 Vn , then one and only
one autoparallel curve exists that contains x0 and is tangent to .X i .0// at x0 .
Remark 9.1. We recall that a curve of Vn is defined as a C 1 map W s 2 Œa; b !
.s/ 2 Vn . This means that by changing the parameter, we obtain another curve,
9.4 Covariant Differential of Tensor Fields 123
even if the locus of its points does not change. As a consequence, we have that, if
.s/ is an autoparallel curve, .s.t// .t/ could not be an autoparallel curve. To
identify the parameter changes that do not modify the autoparallelism of a curve,
we start by noting that
dx k dx k dt
D ;
ds dt ds
2
d2 x k d dx k d2 x k dt dx k d2 t
D D C :
ds 2 ds ds dt 2 ds dt ds 2
This result allows us to state that the autoparallel character of the curve .s/ is not
modified by the parameter change s D s.t/ if and only if
t D as C b; (9.30)
where a and b are arbitrary constants. Parameters that do not modify the autoparal-
lelism of a curve are called canonical parameters.
We denote by rs .Vn / the set of .r; s/-tensor fields of class C 1 on the manifold Vn
equipped with an affine connection. In particular, we denote by .Vn / D 10 .Vn /
the set of C 1 vector fields on Vn . We remark that rs .Vn / is also a F .Vn / module.
Finally, we denote by FO the set of all the maps rs .Vn / ! rs .Vn /.
Theorem 9.1. On the manifold Vn there is only one <-linear map
rO W X 2 .Vn / ! rO X 2 FO (9.31)
Proof. We prove the local existence of the preceding map in the domain of a chart
.U; x i /. We denote by .ei / the fields of the natural bases in U and by .dx i / the
differential forms of the dual bases. If ! 2 01 .Vn / and Y 2 .Vn /, then C.! ˝ Y/
is a function. From property 1 we obtain
Expanding the left-hand side of (9.33), simplifying the obtained relation and
recalling that it must hold for any vector field Y, we obtain the coordinate expression
of the covariant derivative of a 1-form:
T D Tji11i
ir
e ˝ ˝ eir ˝ dx j1 ˝ ˝ dx js
s i1
and applying properties 1–4, it is easy to verify that the coordinate expression of the
covariant differential of T is
h i
.rO X T/ij11i
is D X
r k
Tji11i
ir
s
;k C i1 hir
kh T j i
1 s
C ir
kh T j
i1 h
i
1 s
h
kj 1
T i1 ir
hi s
h
kj s
T j
i1 ir
1 h :
(9.35)
Henceforth, we use the symbol rX instead of rO X .
f;ij D f;j i :
9.5 Torsion and Curvature Tensors 125
However, the n2 quantities f;ij do not represent the components of a .0; 2/-tensor. In
contrast, the quantities ri f;j can be regarded as the components of a .0; 2/-tensor,
but we cannot change the derivation order since, in view of (9.34), we have that
The left-hand side gives the components of a .0; 2/-tensor and f;j are the compo-
nents of a covector; consequently, owing to the criteria given in Sect. 2.6, we can
state that the quantities
Sjhi D jhi ijh (9.37)
are the components of a .1; 2/-tensor, which is called the torsion tensor of the affine
connection of Vn . This tensor field vanishes if and only if the connection coefficients
are symmetric with respect to the lower indices.
Consider a vector field X, which in the chart .U; x i / on the manifold Vn has
the coordinate representation X D X i ei , where .ei / is the natural basis of the
coordinates x i . Then, with simple calculations, it is possible to verify that
where
Rljk i D ikl;j jkl;i C jkh ihl ih
k h
j l (9.39)
is the Riemann curvature tensor.
Now, we prove the following fundamental theorem.
Theorem 9.2. Let Vn be a manifold equipped with an affine connection. Then the
following statements are equivalent:
(a) There exists an atlas ˇ on Vn in each chart of which the connection coefficients
vanish;
(b) The torsion vanishes and, 8x; y 2 Vn , the parallel transport along any curve
connecting x and y does not depend on the curve;
(c) The torsion vanishes and there are n vector fields u1 ; : : : ; un that are indepen-
dent and uniform;
(d) The curvature and the torsion vanish.
T
Proof. (a)) (b). If .U; x i /; .U ; x i / 2 ˇ, then in the intersection U U of the
i i
domains, we obtain the results kh D kh D 0. Consequently, in view of (9.20),
i
the coefficients Aj of transformation matrix (9.19) verify the conditions
that is,
.A1 /m i 1 l 1 l
i Al .A /h ;k D .A /h ;k D 0;
126 9 Absolute Differential Calculus
and we conclude that the elements Alh of the transformation matrix are constant
and the coordinate transformation x i ! x i is linear. Further, the parallel
transport (9.24) along any curve contained in a domain of a chart of the atlas
ˇ reduces to dX i =dt D 0. Let x; y be two arbitrary points of Vn . If both points
belong to the same domain U of a chart of ˇ, then the parallel transport of a
vector Yx D Yxi ei .x/ along a curve .t/ U , connecting x and y, is obtained
integrating the equation dY i =dt D 0 along .t/, and it is independent of .t/.
Suppose that x belongs
T to the chart .U; x i / of ˇ and y belongs to another chart
i
.U ; x /, with U U ¤ ;. Let 1 .t/ and 2 .t/ be two curves going from T x to y.
The parallel transport of Yx along 1 .t/ up to a point p1 D .t1 / 2 U U gives
the vector Yxi ei .p1 /, whereas the
T parallel transport of the same vector along 2 .t/
up to a point p2 D .t2 / 2 U U gives the vector Yxi ei .p2 / since this transport
for both the curves is expressed by the equation dY i =dt D 0. Now we regard p1
and p2 as points belonging to U . In the coordinates .x i /, the vectors Yxi ei .p1 / and
j j
Yxi ei .p2 / have, respectively, the representations Aij Yx ei .p1 / and Aij Yx ei .p2 /
T
since the quantities Aij are constant all over U U . To prove condition (b) we
j
must verify that the parallel transport of Aij Yx ei .p1 / along 1 .t/ from p1 to y
j
and of the vector Aij Yx ei .p2 / along 2 .t/ from p2 to y leads us to the same
i
vector Yy . But this is evident since we must integrate the equations dY =dt D 0
of the parallel transport along 1 .t/ and 2 .t/ with the same initial conditions.
(b)) (c). Let Y1 ; : : : ; Yn n be independent vectors at the arbitrary point x 2 Vn .
Since condition (b) holds, by a parallel transport of these vectors to any other
point of Vn , we obtain n vector fields u1 ; : : : ; un whose values at a point do not
depend on the curve along which the vectors are transported; in other words, they
verify the condition
rA ui D 0
for any vector field A tangent to the curve along which ui is transported. From
the arbitrariness of the curve, i.e., of A, it follows that the fields ui are uniform.
It remains to prove that they are independent at any point. In this regard, it is
sufficient to note that the parallel transport, when it is independent on the curves,
defines an isomorphism W Tx .Vn / ! Ty .Vn / between the tangent spaces
corresponding to points x and y owing to the linearity of the parallel transport
equations and the existence and uniqueness theorems.
(c))(d). Since the torsion tensor vanishes, condition (9.38) becomes
rj ri Y k ri rj Y k D Rljk i Y l :
Rljk i al D 0;
(d))(a). We must prove that, for any x 2 Vn , there exists a chart .U; x 0i /
such that in these coordinates the connection coefficients vanish. Let .V; x i / be
another chart containing x. Then, in view of (9.21), we must find a coordinate
transformation .x i / ! .x 0i / such that
0p q
j h D .A1 /pq Am n q 1 p
j Ah mn C .A /q Ah;j D 0;
q
where .Aij / D .@x i =@x 0j / and the connection coefficients mn are symmetric
with respect to m and n since the torsion tensor vanishes. Multiplying the
preceding equation by Alp , we obtain the system
Alh;j D mn
l
Am n
j Ah ; (9.40)
@x l
D Alj : (9.41)
@x 0j
To find the n2 Cn unknowns Alj .x/ and x 0i .x/ of the preceding n2 Cn3 equations,
the integrability conditions
@Alh @Alh
0j 0i
D ; (9.42)
@x @x @x 0i @x 0j
@2 x l @2 x l
D (9.43)
@x 0j x 0i @x 0i x 0j
must be satisfied. We write these conditions in the more compact form
where we have used the notation fŒij D fij fj i . The integrability conditions
(9.44)2 are satisfied since, from (9.40) and the hypothesis that the torsion tensor
vanishes, we obtain
AlŒi;j D mn
l
Am n l m n
j Ai C mn Ai Aj
l
D mn Am n l n m n m l
j Ai C nm Ai Aj D Ai Aj Œn;m D 0:
Again for (9.40), the integrability conditions of (9.44)1 can be written as follows:
@ l m n @ l m n
Alh;Œj i D 0i
mn Aj Ah C 0i mn Ai Ah
@x @x
l
@mn
D Am An mn
l
Am n l m n
j;i Ah mn Aj Ah;i
@x 0i j h
l
@mn
C Am An C mn
l
Am n l m n
i;j Ah C mn Ai Ah;j :
@x 0j i h
128 9 Absolute Differential Calculus
When we note that @=@x 0j D Akj @=@x k , take into account (9.40), and recall the
already proven result AlŒi;j D 0, the preceding equations become
p
Alh;Œj i D Am
i An k
A
h j l
mn;k l
k n;m C l
kp mn
p
l
mp k n
Alh;Œj i D Am n k l
i Ah Aj Rnmk D 0; (9.45)
Theorem 9.3. A Riemannian manifold Vn admits one and only one affine connec-
tion r such that
• The torsion tensor vanishes and
• The scalar product is invariant under a parallel transport.
Proof. Let .U; x i / be a chart on Vn , and denote by kh
i
the connection coefficients
whose existence we must prove. If .t/ is any curve connecting two arbitrary points
x; y 2 U and X, Y two vector fields that are parallel along .t/, then we have
dX i i dx h
C kh Xk D 0; (9.46)
dt dt
dY i i dx h
C kh Yk D 0; (9.47)
dt dt
where x h .t/ are the parametric equations of .t/ in the chart .U; x i /. The torsion
tensor vanishes if and only if the connection coefficients are symmetric with respect
to the lower indices. Further, the scalar product is invariant under the parallel
transport if and only if the condition
d
.gij X i Y j / D 0 (9.48)
dt
is satisfied for any pair of vector fields X, Y that verify solutions of Eqs. (9.46) and
(9.47). Expanding (9.48) and taking into account (9.46) and (9.47), we obtain
h
dgij l dx dx h
glj ih gi l jl h X i Y j D 0:
dt dt dt
9.6 Riemannian Connection 129
Owing to the arbitrariness of X and Y and the curve x h .t/, the following conditions
must be verified:
l
gij;h D glj ih C gi l jl h : (9.49)
Cyclically permutating the indices, we obtain the other two equations
Adding (9.49) and (9.50) and subtracting (9.51), we are led to the equations
1
ghl ijl D .ghi;j C gj h;i gij;h / Œj h; i : (9.52)
2
Finally, Eqs. (9.52) supply the following expressions of the connection coefficients:
1 hm
ijm D g .ghi;j C gj h;i gij;h / g mh Œj h; i : (9.53)
2
It is a simple exercise to verify that the connection defined by the preceding
coefficients is without torsion and leaves invariant the scalar product under a parallel
transport.
Definition 9.5. The Riemannian connection is the connection whose coefficients
are given by (9.53).
Theorem 9.4 (Bianchi’s theorem). The metric tensor g is uniform, that is, its
covariant derivative vanishes:
rh gij D 0: (9.54)
Proof. It is sufficient to refer to (9.49) and recall the expression of the covariant
differential of a .0; 2/-tensor.
Theorem 9.5. In a neighborhood of any point x0 of a Riemannian manifold, it is
possible to define a local chart .U ; x i / whose domain contains x0 such that
g ij .x0 / D i ıij ; g ij;h D 0; (9.55)
i i i
kh .x0 / D kh .x0 / kh .x0 / D 0:
Since the connection is Riemannian, (9.50) holds, and then we have gij;h .x0 / D
0. Finally, by a linear transformation .x i / ! .x 0i /, which does not modify the
preceding results, we can reduce the metric coefficients at x0 to the form (9.55).
The Riemannian manifold Vn is said to be locally Euclidean if in the neighbor-
hood of any point there is a system of coordinates .U; x i / in which gij D i ıij ,
8x 2 U .
Consequently, in the natural basis .ei /, relative to the chart .U; x i /, we can write
rf D g ij f;j ei : (9.59)
r rf D 0: (9.61)
r X D X i ;i Cih
i
X h: (9.63)
i 1 1 p
ih D g;h D p . jgj/;h (9.65)
2g jgj
1 p
r X D p . jgjX i /;i : (9.66)
jgj
4f D r rf: (9.67)
In local coordinates .x i /, in view of (9.67) and (9.59), we can write the Laplace
operator in the form
1 p
4f D ri .g ij f;j / D g ij .fij ijk f; k/ D p . jgjg ij f;j /;i : (9.68)
jgj
4f D 0 (9.69)
S C
α
A
B
D
C
A
B
9.8 Exercises
3. On the unit sphere S consider the triangle ABC (Fig. 9.1) in which AC and BC
are arcs of meridians, AB an arc of the equator, and B a pole. Let ˛ be the angle
at the vertex C . Show that the three sides of the triangle are arcs of geodesics and
evaluate the parallel transport of the vector .1; 0/ along the sides of ABC from
A to A.
4. Using the notations of the preceding exercise, evaluate the parallel transport of
the vector .1; 0/ from A to A along the curve ABCA in Fig. 9.2,
where AD and BC are arcs of meridians and AB and CD are arcs of parallels.
Chapter 10
An Overview of Dynamical Systems
1
The topics contained in this chapter can also be found in [5, 6, 9, 25, 28, 29, 35, 40, 55].
why modeling leads us to this object, and what kinds of problems one finds at the
end of this process, we start with a very simple example containing the fundamental
ingredients of the problems we face.
Suppose that one wishes to describe the evolution of a population consisting of
people, animals, bacteria, radioactive atoms, etc. The importance of this problem is
clear: its resolution could allow decisive forecasts on the destiny of the examined
population. For example, the growth velocity of a population of infectious bacteria
in an organism could suggest the appropriate dosage of antibiotic. Similarly, if the
growth law of a population is known, the right amount of food for its survival can
be evaluated.
To formulate a growth model for a population, the factors that positively or
negatively influence the aforementioned process must first be identified and a
mathematical relation supplying the number N.t/ of living individuals at instant
t as a function of those factors must be formulated. Proceeding from the simplest
situation, all external influences on growth are neglected and the food source is
assumed unlimited. In this situation, it is quite reasonable to suppose that the
variation N.t/ of the number N.t/ in the time interval .t; t C t/ is proportional
both to N.t/ and t
N.t/ D ˛N.t/t;
where ˛ is a coefficient depending on the kind of population. Of course, this
coefficient is positive if the population increases and negative in the opposite case.
In the limit t ! 0, the number N.t/ satisfies the equation
dN.t/
D ˛N.t/;
dt
in which N.t/ is the unknown appearing in the equation with its first derivative.
Such a mathematical object is just an ODE of the first order in the unknown N.t/.
We underline that the hypotheses leading us to the previous equation are very
spontaneous, but this is not the case when forecasting the form of its solution2
N.t/ D C e˛t ;
where C is an arbitrary positive constant. Now the main reason why differential
equations frequently appear in modeling natural phenomena can be recognized: It is
much easier to formulate a reasonable relation between the unknown of a given
problem and its variations than to imagine the form of the function itself. The same
kind of population admits infinite evolutions depending on the initial datum or
condition N.0/ that assigns a value to C .
The previous model, where ˛ > 0, implies that the number of individuals goes
to infinity with time, and this is absurd. To improve the model, the habitat in which
2
This solution is obtainable at once by the method of variable separation. However, the reader can
easily verify that it is really a solution for any C .
10.1 Modeling and Dynamical Systems 137
the population lives is not supposed to support a population level greater than M ;
this means that when N.t/ reaches this value, the growth rate becomes zero. The
limit M is called the carrying capacity. To take into account this constraint, the
coefficient ˛ appearing in the preceding equation is assumed depending on the
quantity .M N.t//=M . The simplest dependency on this variable is the direct
proportionality
N.t/
˛ Dˇ 1 ;
M
where ˇ > 0 is constant, and we arrive at the relation
N.t/
NP .t/ D ˇ 1 N.t/;
M
which is called the logistic equation. This nonlinear equation gives a more accurate
description of the evolution of a population since it introduces an upper bound
to its growth. Further, it represents the starting point of many models describing
the competition among species living in the same habitat or the modified prey–
predator model of Lotka–Volterra (see, for instance, [28, 29]). When the variables
are separated, the family of solutions
M
N.t/ D
1 C C eˇt
3
For the application of differential equations to economy, see, for instance, [55].
138 10 An Overview of Dynamical Systems
wide variety of possible motions of material bodies confirm the three fundamental
principles of dynamics (Chap. 13). It is an everyday experience that a body may
fall in many different ways under the influence of its weight. Similarly, the planets
describe different orbits around the Sun and traverse them at different velocities.
Galileo and Newton discovered that a very simple relation of proportionality relates
the acceleration a of a material point P to the acting force F
ma D F;
where m is the mass of the body. If .x.t/; y.t/; z.t// are the coordinates in a
frame of reference Oxyz of the moving point P as a function of time, then the
components of the acceleration vector are .x.t/;
R y.t/;
R zR.t//. On the other hand, the
force depends on the position of P (that is, on .x.t/; y.t/; z.t//) as well as on its
velocity .x.t/;
P y.t/;
P zP.t//. Consequently, the fundamental equation of dynamics is
essentially a system of three ordinary second-order differential equations in the
unknowns .x.t/; y.t/; z.t//. By the auxiliary variables .u; v; w/ D .x; P y;
P zP/, this
reduces to a system of six first-order differential equations.
Let <n be the vector space of the ordered n-tuples of real numbers x D .x1 ; : : : ; xn /
and kxk the Euclidean norm in <n . Moreover, let f W ! <n be a map defined
on the open subset of <n . We call a autonomous first-order (vector) differential
equation in the normal form the following equation
xP D f.x/: (10.1)
xP i D fi .x1 ; : : : ; xn /; i D 1; 2; : : : ; n:
The vector space <n , to which x belongs, is said to be the state space.
A smooth function x W I ! <n , where I is a nonempty interval of <, is a
solution of (10.1) if
The set x.t/; 8t 2 I is called an orbit, and it represents a curve in the state space.
The initial-value problem or Cauchy problem for (10.1) consists in searching
for the solution x.t; t0 ; x0 / of (10.1) that satisfies the following initial condition or
initial datum:
x0 D x.t0 /; x0 2 : (10.2)
10.2 General Definitions and Cauchy’s Problem 139
1
xP D
x
4
For a proof, see, for instance, [40].
140 10 An Overview of Dynamical Systems
does not satisfy a Lipschitz condition in the interval .0; 1. However, the solution
p
xD x0 2 C 2t;
It is evident that ı./ < ; otherwise, condition (10.5) should not be verified at
the initial instant. It is important to note the following. First, the stability property
is equivalent to the continuity of solutions of (10.1) with respect to the initial
datum x0 uniformly in the unbounded time interval Œ0; 1. If (10.1) is interpreted
as a mathematical model of a real system S , its solutions represent the possible
evolutions of the variable x describing the state of S . To have S at the equilibrium
state x.t/ D 0,S must initially be put at 0. But this operation is physically
impossible because it is necessarily realized by procedures that introduce measure
errors, however accurate they may be. This means that the system is put at an initial
state x0 next to 0, and the corresponding solution could lead the state of the system
definitively far from the equilibrium. If this happened for some initial data, the
equilibrium state itself would not be physically observable. In contrast, if 0 is stable,
this situation is not confirmed. In fact, if the notation Uı D fx 2 <n Wk x k< ıg is
used, the stability notion can be formulated in the following terms. If any region
U D is fixed around the origin, it is possible to find a whole neighborhood Uı
U of initial conditions whose corresponding solutions are fully contained in U .
The equilibrium solution x D 0 is as follows:
1. Attractive if
The set 0 of initial data x0 for which (10.6) is satisfied is called the domain
of attraction. In particular, if 0 D D, x D 0 is globally asymptotically
stable. Figures 10.1–10.3 supply a rough representation of an attractive origin,
10.4 Definitions of Stability 143
U
x0 x
O
U
x0 x
O
asymptotically stable origin, and unstable origin, respectively. In Fig. 10.1, the
origin is stable since there are two neighborhoods Uı and U of the origin, where
Uı U , of the origin such that the orbits starting from any point in Uı are
always contained in U . In Fig. 10.2, the origin is attractive since the solutions
corresponding to the initial data belonging to 0 are contained in a neighborhood
of the origin after a suitable value of time. In Fig. 10.3, the origin is asymptotically
stable because the solutions corresponding to the initial data belonging to Uı are
contained in Uı and, moreover, tend to the origin when the variable t goes to
infinity. Finally, in Fig. 10.4, the origin is unstable because there are initial data
whose corresponding solutions go definitively away from the origin.
The previous stability definitions could generate the wrong idea that all solutions
of (10.1), corresponding to initial data in a neighborhood of 0, must be known
in order to recognize the stability of the equilibrium position. It is evident that
the solutions of (10.1) can be exhibited in only a few elementary cases (linear or
special systems) so that an alternative method is needed to check the stability of
the origin. In this section, it is shown that it is possible to recognize the stability
of the origin by analyzing the right-hand side of (10.1). This is realized in the
direct method of Lyapunov, which reduces the stability problem to that of the
existence of suitable functions (Lyapunov functions) having, with their derivatives,
determined the properties on the solutions of (10.5). This method is called direct
because, under the hypothesis that these functions are of class C 1 , the corresponding
derivatives on the solutions of (10.1) can be directly expressed by the right-hand
side of (10.1) without knowing the solutions themselves. The determination of a
10.5 Analysis of Stability: The Direct Method 145
Lyapunov function, in turn, is not easy. Therefore, the original idea was developed
in different directions to make easier the application of the method itself.
In this section, some stability or instability criteria related to the direct method
are discussed.
We extensively use the notation
together with the remark that along the solutions of (10.1) VP .x/ coincides with the
total derivative of V .x.t; x0 // because
This remark implies that the sign of VP .x.t; x0 // along the solutions can be
established without their preliminary knowledge if the sign of VP .x/ around the
origin is known.
Definition 10.1. Let V W U ! < be a continuous function such that V .0/ D 0.
V is said to be positive .negative/ definite on U if V .x/ > 0 .V .x/ < 0/ for x ¤ 0
or positive .negative/ semidefinite if V .x/ 0 .V .x/ 0/.
Theorem 10.2. Let V W U ! < be a class C 1 function that is positive definite
in a neighborhood U of the origin. If VP .x/ is negative semidefinite in U , then
the origin is stable. Moreover, if VP .x/ is negative definite in U , then the origin is
asymptotically stable.
Proof. 8 2 .0; we denote a sphere with its center at the origin and radius
. Since U U and in U f0g the function V .x/ is positive definite and
continuous, there exists a positive minimum of the restriction of V .x/ on @U :
Again for the continuity of V .x/ at the origin, there is a positive real quantity ı <
such that
jxj < ı ) 0 < V .x/ < V :
Now we prove that any solution x.t; x0 / of (10.1), for which jx0 j < ı, remains for
any t > 0 in the sphere S . In fact, if this statement were false, there should be an
instant t in which the orbit x.t; x0 / should cross @S . But initially it is V .x0 / < V ,
so that we should have V .x0 / < V V .x.t ; x0 // against the condition VP 0
along the orbit x.t; x0 /.
The proof of the following theorem is omitted.
Theorem 10.3. Let V W U ! < be a class C 1 function in a neighborhood U
of the origin such that (1) V .0/ D 0 and (2) for any neighborhood U U a point
146 10 An Overview of Dynamical Systems
x 2 U exists at which V .x/ > 0. If VP .x/ is positive definite in U , then the origin
is unstable.
It is possible to recognize the role of the hypotheses contained in Theorems (10.2)
and (10.3) by a simple geometric description.
In fact, let c be a trajectory starting from a point x0 of a level curve of the
Lyapunov function. At this point, c has a tangent vector xP D f.x0 / that points
toward the internal region of , and the angle between rV and xP is greater than
. Consequently, c meets the more internal level curves under the same conditions.
In this way, c approaches the origin (Fig. 10.5).
Similar considerations, applied to the case VP 0, allow us to conclude that the
trajectory does not leave the internal region to the level curve of V containing x0 :
However, the condition VP 0 does not exclude the possibility that the trajectory
will remain on a level curve so that it cannot approach the origin.
In particular, when VP D 0 along the solutions, one has the situation illustrated in
Fig. 10.6.
Before stating the next theorem of Barbashin–Krasovskii, a simple physical
example will be discussed. Let P be a harmonic oscillator of unit mass moving
on the Ox-axis subject to an elastic force kx, k > 0, and to a linear friction hx, P
h > 0. Its motion x.t/ is given by the linear system
xP D v;
vP D kx hv:
It is quite evident from a physical point of view that the origin is an asymp-
totically stable equilibrium position. To verify this property of the origin, the total
10.5 Analysis of Stability: The Direct Method 147
1 2
V .x; v/ D .v C kx 2 /
2
we have
VP .x; v/ D hv2 :
However, VP .x; v/ vanishes not only at the origin but also along the x-axis. From
the previous theorems we can only conclude that the origin is stable. The following
theorems allow us to establish that the origin is asymptotically stable by using the
same Lyapunov function.
Theorem 10.4. (Barbashin–Krasovskii) Let V W P ! < be a class C 1 function
that is positive definite in a neighborhood P of the origin. If VP .x/ is negative
semidefinite in P and if x D 0 is the only solution of (10.1) for which VP .x/ D 0,
then the origin is asymptotically stable.
Theorem 10.5. (Krasovskii) Let V W P ! < be a class C 1 function such that
1. 8 2 .0; /; 9x 2 P W V .x/ > 0I
2. VP .x/ 0; 8x 2 P I and
3. x D 0 is the only solution of (5.1) for which VP .x/ D 0:
Then the origin is unstable.
Returning to the preceding example, we remark that the equation VP D hv2 D 0
implies v D 0 along the solutions. On the other hand, from the differential system it
148 10 An Overview of Dynamical Systems
follows that x D v D 0; then the hypotheses of Theorem 10.4 are satisfied and the
origin is asymptotically stable.
The preceding theorems make possible the analysis of the stability properties
of the equilibrium solution x D 0 given a Lyapunov function possessing suitable
property. We have already remarked that a first integral of (10.1) can be used as a
Lyapunov function. In some cases, polynomial Lyapunov functions can be easily
found (see exercises at end of chapter). However, in general, it is not an easy task to
determine a Lyapunov function. In this regard, the following theorem, which allows
us to state the stability of the equilibrium solution x D 0 by an analysis of the linear
part of (10.1) around 0, could be useful.5
More precisely, since f.0/ D 0, (10.1) can be written as follows:
xP D Ax C ˆ.x/; (10.11)
5
Readers will find in [35] many programs, written using Mathematica, that allow for the analysis
of many stability problems.
10.6 Poincare’s Perturbation Method 149
the solution we are searching for is periodic. However, in this case, by applying
another procedure (Lindstedt, Poincaré), a uniform expansion with respect to t can
be derived (see, for instance, [28, 29, 35]).
Poincaré’s method is applicable to a differential equation, which can be written
in the form
xP D f.x/ C F.x/; (10.12)
where is a suitable “small” dimensionless parameter related to the physical system
modeled by the preceding differential equation.
The method can be developed, provided that the solution of the equation
xP D f.x/ C g.t/;
where g.t/ is a known function belonging to a suitable class, can be computed for
a given initial condition and if the perturbation term F is analytic in its argument.
When both conditions are fulfilled, it is possible to look for the solution in powers
of . More precisely, it can be proved that the solution of (10.12), under suitable
hypotheses on the functions f and F, can be written as a power series of the
parameter ,
x.t/ D x0 .t/ C x1 .t/ C 2 xn .t/ C ;
that converges to the solution of (10.12) uniformly with respect to but usually not
with respect to t. The general term xn .t/ of the preceding expansion is the solution
of the equation
xP n D f.xn/ C gn .t/;
where the function gn .t/ is completely determined by the previous terms x1 .t/,
: : : ; xn1 .t/.
The meaning of the attribute small applied to requires an explanation. The
parameter is small if 1 and the terms xP , f.x/, and F.x/ are all comparable
with unity. However, it is not possible to verify this property because usually there
is no knowledge of the solution. To overcome this difficulty, we can resort to a
dimensionless analysis of the problem modeled by (10.12), provided that we have
a sufficient knowledge of the phenomenon we are analyzing. We make clear this
procedure with some examples.
We analyze in more detail the case where the function f in (10.12) depends
linearly on x, i.e., we consider the following Cauchy problem:
xP D Ax C F.x; /;
(10.13)
x.t0 / D x;
where A is an n n matrix with constant coefficients.
Poincaré proved that if F.x; / is an analytic function of its variables, then the
solution x.t; / of (10.13) is analytic with respect to ; consequently, it can be
expressed by an expansion
To find the terms of the expansion (10.14), we expand F.x; / with respect to
and then introduce (10.14) into (10.13)1. In this way, the following sequence of
Cauchy problems is derived, which determine x0 .t/, x1 .t/, x2 .t/, and so on:
xP 0 D Ax0 ;
(10.16)
x0 .t0 / D xN ;
xP 1 D Ax1 C F.x0 .t/; 0/;
(10.17)
x1 .t0 / D 0;
xP 2 D Ax2 C .F /.x0 .t/; 0/ C x1 .t/ .rx F/.x0 .t/; 0/;
(10.18)
x2 .t0 / D 0;
where .F /.x0 .t/; t; 0/ is the derivative of F with respect to evaluated at .x0 .t/; 0/
and .rx F/.x0 .t/; 0/ denotes the gradient of F with respect to the variable x at
.x0 .t/; 0/.
It is very important to note that the problems (10.16), (10.17), : : : refer to the
same linear differential equation, which is homogeneous at the first step and nonho-
mogeneous at the next steps. However, the terms appearing at the i th step are known
when the previous Cauchy problems have been solved. Although the simplification
reached in solving the original Cauchy problem is clear, the calculations to write
and solve the different systems are very heavy and cumbersome. For these reasons,
the reader can use the notebook Poincare.
It is also very important to recall that expansion (10.14) is usually not uniform
with respect to time. As we shall see, in the series (10.14) some secular terms
n t n sin nt, n t n cos nt can appear. When this happens, one is compelled to accept
into (10.14) time values verifying the inequality t a=b, where a and b denote
the maximum values of the coefficients of, respectively, x0 .t/ and the secular terms
in x1 .t/, to be sure that the second term of the expansion (10.14) is small with
respect to the first one for time values t a=b.
10.7 Introducing the Small Parameter 151
where x0 denotes the abscissa of the initial position of P and T is equal to 2 times
the period of the oscillations one has when only the linear part hx of the elastic
force is acting. Defining the dimensionless quantities
x t
x D ; t D (10.21)
L T
and remarking that
L dx L L d2 x L
xP D D xP ; xR D D 2 xR ;
T dt T T 2 dt 2 T
152 10 An Overview of Dynamical Systems
xP D y ;
kx02 3
yP D x x ;
h
the dimensionless parameter
kx02
D
h
can be identified with the small parameter of Poincaré’s method, provided
that <<1.
Example 10.4. Now we take another mechanical example into account: a moving
point P under the action of its weight and a friction. The motion equation
is written as
vP D g hvv; (10.22)
where g is the gravity acceleration, v D jvj, and h is a positive constant depending
on the medium in which P is moving as well as on its form. Refer the previous
vector equation to a frame Oxy whose origin O is at the initial position of P and
whose Ox-axis and Oy-axis are taken to be horizontal and vertical, respectively,
and such that the (vertical) plane Oxy contains the initial velocity of P . Since the
motion is planar, the whole trajectory is contained in the plane Oxy. To identify the
small parameter, we note that when friction is absent, the motion equation
vP D g
where v0 D jv.0/j and ˛ is the angle between the initial velocity v.0/ and the
horizontal Ox-axis. Since the quantities vx .t/ and vy .t/, in the presence of friction,
are of the same order of magnitude as the previous ones, the height L at which a
heavy body arrives, before inverting its motion, and the time T to reach the soil
again can be taken as reference quantities of length and time, respectively. It is easy
to deduce from (10.23) and (10.24) the formulae
v20 2v0
LD ; T D
2g g
10.8 Weierstrass’ Qualitative Analysis 153
so that
L g
D :
T2 8
With the introduction of these reference quantities, (10.22) assumes the following
dimensionless form:
vP D 8k hLvv; (10.25)
where k is the unit vector along the Oy-axis. If the factor
D hL<<8;
xR D f .x/ (10.26)
xP D v; (10.27)
vP D f .x/; (10.28)
in the unknowns x.t/, v.t/. This pair of functions defines a curve in the state space
whose structure depends on the meaning of the variable x. For instance, if x is the
angle ' varying on a circumference, then
is a cylinder. The equilibrium positions
of (10.27), (10.28) are the solutions of the system
f .x/ D 0; v D 0: (10.29)
1 2
V .x; v/ D v C U.x/; (10.30)
2
154 10 An Overview of Dynamical Systems
where Z
U.x/ D f .x/ dx (10.31)
Equation (10.32) is the explicit form of level curves V D cost. It shows that these
curves are symmetric with respect to the Ox-axis. Moreover, in view of (10.27),
(10.28), when v D 0, we also have xP D 0. Consequently, the tangent straight line
to the level curves at the points in which they intersect Ox is orthogonal to Ox.
Finally, for any fixed value of E, the admissible values of x satisfy the condition
U.x/ E. Equation (10.33) gives the time needed to go from x0 to x.
If we assign an initial datum .x0 ; v0 /, where v0 > 0, then we must choose the
C sign in (10.32), (10.33) up to the instant at which U.x/ D E. Consequently, if
10.8 Weierstrass’ Qualitative Analysis 155
U.x/ < E, 8x > x0 , the function x.t; x0 ;p v0 / goes to infinity in a finite or infinite
time according to whether the function 1=. 2.E U.x/// is integrable or not.
In contrast, if on the right-hand side of x0 there is a point x where U.x/ D E,
then we must consider the following two cases:
Therefore, the function under the integral in (10.33) is integrable in Œx0 ; x/, and
x.t; x0 ; v0 / reaches the point x in the finite time
Z x
ds
tD p : (10.35)
x0 2.E U.x//
Further, U.x/ < E for x < x and U.x/ D E for x D x. Therefore, the function
U.x/ increases at the point x so that f .x/ D U 0 .x/ < 0. Finally, if we take
into account (10.28), then we also have vP .t ; x0 ; v0 / D x.t; R x0 ; v0 / < 0, whereas
(10.27) and (10.32) imply that v.t; x0 ; v0 / D x.t P ; x0 ; v0 / D 0. In conclusion,
x.t
P ; x0 ; v0 / < 0 for t < t and x.t ; x0 ; v0 / comes back toward x0 . In particular, if to
the left of x0 is another point x with the same characteristics of x, then the function
x.t; x0 ; v0 / is periodic with a period
Z x
ds
T D2 p : (10.36)
x 2.E U.x//
In case .b/, Taylor’s expansion of U.x/ at x is
1
U.x/ D E C U 00 .x/.x x/2 C O.jx xj2 /; / (10.37)
2
p
and the approach time to x is infinity since the function 1=. 2.E U.x/// is no
more integrable and x.t ; x0 ; v0 / tends asymptotically to x. We notice that, in this
case, x is an unstable equilibrium position since the conditions U.x/ < E for x < x,
U.x/ D E for x D x, and U 0 .x/ D 0, ensure that x is not a minimum.
The analysis we have just described can be carried out by the notebook
Weierstrass.
Example 10.5. Figure 10.7 shows the qualitative behavior of the potential energy
U.x/ and the corresponding phase portrait of (10.27), (10.28). For E < E1
there is no solution. For E D E1 the level curve reduces to the equilibrium
position .x1 ; 0/. For E1 < E < E , the level curves are closed and the orbits
correspond to periodic motions. For E < E < E2 , the level curves have two
connected components: the first of them corresponds to a periodic orbit, whereas
156 10 An Overview of Dynamical Systems
U (x)
E4
E3
E2
E*
E1
x
x1 x2 x3
v
the second one is an aperiodic orbit going to infinity. For E D E2 we have a level
curve with three connected components corresponding to a bounded open orbit, an
unstable equilibrium position, and an open orbit going to infinity. It is a simple
task to complete the analysis of the phase portrait. We conclude by noting that the
dashed lines represent orbits that tend to an unstable equilibrium position without
reaching it.
10.9 Exercises
1
2
1. Verify that V D 2 x C y 2 is a Lyapunov function for the system
xP D y C ax.x 2 C y 2 /;
yP D x C ay.x 2 C y 2 /;
and determine the stability property of the origin upon varying the constant a.
10.9 Exercises 157
1
2
2. Verify that V D 2 x C y 2 is a Lyapunov function for the system
xP D y C axy 2 ;
yP D x yx 2 ;
and determine the stability property of the origin upon varying the constant a.
3. Determine the periodic orbits of the following system in polar coordinates
.r; '/:
Are there equilibrium positions? Control the results using the notebook
PolarPhase.
4. Determine the equilibrium positions of the system
xP D 2xy;
yP D 1 3x 2 y 2 ;
and analyze their linear stability properties. Control the obtained results using
the notebook Linstab.
5. Determine the equilibrium positions of the system
xP D x C y;
yP D x C 2xy;
and analyze their linear stability properties. Control the obtained results using
the notebook Linstab.
6. Determine the equilibrium positions of the system
and analyze their linear stability properties. Control the obtained results using
the notebook Linstab.
7. Determine an approximate solution of the system
xP D y;
yP D x C .1 x 2 /y
using Poincaré’s method, and control the obtained results using the notebook
Poincare.
158 10 An Overview of Dynamical Systems
xP D y;
yP D x x 3
using Poincaré’s method, and control the obtained results using the notebook
Poincare.
9. Determine an approximate solution of the system
xP D y C .x 2 y 2 /;
yP D x C y 3
using Poincaré’s method, and control the obtained results using the notebook
Poincare.
10. Determine the equilibrium configurations of the potential energy
1 2 1 4
U D x C x ;
2 4
their stability properties, and the phase portrait using a Weierstrass analysis.
Control the obtained results using the notebook Weierstrass.
11. Determine the equilibrium configurations of the potential energy
U.x/ D cos.x/;
their stability properties, and the phase portrait using a Weierstrass analysis.
Control the obtained results using the notebook Weierstrass.
12. Determine the equilibrium configurations of the potential energy
2
U.x/ D ex cos.x/;
their stability properties, and the phase portrait using a Weierstrass analysis.
Control the obtained results using the notebook Weierstrass.
Part II
Mechanics
Chapter 11
Kinematics of a Point Particle
1
The contents of Chapters 11–17 can also be found in [4, 12, 20, 23, 24, 26, 32, 33, 45, 46, 49, 54].
chronology at any point A of S using the local time, at present we have no criterion
to compare the chronology of the events at A with the chronology of the events at
another point B of S . Only after the introduction of such a criterion can we state
when an event at A is simultaneous with an event at B or when an event at A happens
before or after an event at B. Already the measure of the distance d.A; B/ between
two points of S at a given instant requires the definition of simultaneity of events
happening at two different points.
We suppose that there is no problem in synchronizing two clocks at the same
point A of S . To introduce a definition of synchronism between two distant clocks,
we start by placing at the point A many identical and synchronous clocks. Then we
postulate that these clocks remain synchronous when they are placed at different
points of S . In other words, we assume the following axiom.
Axiom 11.1. The behavior of a clock does not depend on its motion. In particular,
it is not influenced by the transport.
This postulate allows us to define a universal time, i.e., a time that does not
depend on a point of S . In fact, it is sufficient to place synchronous clocks at A
and then to transport them to different points of S . Adopting this universal time,
measured by synchronous clocks distributed at different points of S , we can define
a rigid body as a body S for which the distance d.A; B/ between two arbitrary
points A and B of S is constant in time.
Another fundamental axiom of classical kinematics follows.
Axiom 11.2. There exist rigid bodies inside which the geometry of a three-
dimensional Euclidean space E3 holds.
Let R D fS; tg be a frame of reference formed by the rigid body S and universal
time t. Henceforth, we will often identify a frame of reference with an orthonormal
frame .O; .ei //, i D 1; 2; 3, of the three-dimensional Euclidean space describing
the geometry of S (Axiom 11.2). In this section, we analyze the kinematic behavior
of a point particle P moving with respect to .O; .ei //.
The equation of motion of P is a function
giving the position vector r.t/ of P relative to the orthonormal frame .O; .ei // at
any instant t (Fig. 11.1). If .xi / are the Cartesian coordinates of P in .O; .ei //, then
(11.1) can also be written in the following form:2
2
Henceforth we intend the summation from 1 to 3 on repeated indices. We use only covariant
indices since, in Cartesian orthogonal coordinates, there is no difference between covariant and
contravariant components.
11.3 Velocity and Acceleration 163
e r t
x
O
e
e
Let P be a point particle moving relative to the reference .O; .ei //, and let r.t/ be
the equation of motion of P . The vector velocity or simply velocity of P in .O; .ei //
is the vector function dr
rP D (11.5)
dt
and the scalar velocity is the derivative sP .t/.
Differentiating (11.4) with respect to time and recalling the first Frenet formula
dOr
D t;
ds
164 11 Kinematics of a Point Particle
where t is the unit vector tangent to the trajectory, we obtain the following relation
between the vector velocity and the scalar velocity:
rP D sP t: (11.6)
In other words, the vector velocity is directed along the tangent to the trajectory
and its length is equal to the absolute value of the scalar velocity. Finally, it has the
versus of t or the opposite versus depending on the sign of sP .
In the Cartesian frame of reference .O; .ei //, we can write r.t/ D xi .t/ei so that
the velocity has the following Cartesian expression:
rP D xP i ei : (11.7)
dPr d2 r
rR D D 2; (11.8)
dt dt
which, in view of (11.7), in Cartesian coordinates has the form
rR D xR i ei : (11.9)
Recalling the second Frenet formula
dt n
D ;
dt R
where n is the unit principal normal to the trajectory and R the radius of the
osculating circle, we obtain another important expression of acceleration differenti-
ating (11.6):
v2
rR D sR t C
n; .v D jPrj D jPs j/: (11.10)
R
The preceding formula shows that the acceleration
• Lies in the osculating plane of trajectory,
• Is directed along the tangent if the trajectory is a straight line, and
• Is normal to the trajectory when sP D 0 (uniform motion).
Henceforth, we will use the following definitions:
sR D scalar acceleration
sR t D tangential acceleration
v2
n D centripetal acceleration (11.11)
R
11.4 Velocity and Acceleration in Plane Motions 165
dv2 dPs 2
D D 2Ps sR ;
dt dt
the motion is accelerated if sP sR > 0 and decelerated if sP sR < 0.
Let P be a point moving on the plane ˛. Introducing in ˛ polar coordinates .r; '/,
the motion of P is represented by the equations
ds 2 D dr 2 C r 2 d 2 ;
and the vectors
@ @
eO r D ; eO ' D
@r @'
of the holonomic base relative to the polar coordinates verify the conditions
eO r eO r D 1; eO ' eO ' D r 2 ; eO r eO ' D 0:
We denote by .P; er ; e' / the orthonormal frame relative to the polar coordinates at
any point P 2 ˛ (Fig. 11.2), where
1
er D eO r ; e' D eO ' :
r
v
e
er
P
r
j
O i
Then we define the radial velocity vr and transverse velocity v' are the scalar
quantities
vr D rP er ; v' D rP e' ; (11.13)
that is, the components of vP relative to the base .er ; e' / at the point P 2 . To
determine the form of (11.13) in terms of (11.12), we note that, in polar coordinates,
the position vector r is written as
r D rer .r; '/:
Differentiating this expression with respect to time, we obtain
@er @er
rP D rP er C r rP C 'P :
@r @'
On the other hand, in a Cartesian frame of reference .O; i; j/ (Fig. 11.2), this
results in
er D cos 'i C sin 'j; e' D sin 'i C cos 'j;
so that we have
@er @er
D 0; D e' ;
@r @' (11.14)
@e' @e'
D 0; D er ;
@r @'
In view of (1.13) and (1.15), the radial and transverse velocities are given by the
following formulae:
vr D rP ; v' D r ':
P (11.16)
Differentiating (11.15) with respect to time, we obtain the acceleration of P in polar
coordinates:
1 d 2
rR D .Rr r 'P 2 /er C P ';
.r '/e (11.17)
r dt
so that its components along er and e' are
ar D a er D rR r 'P 2 ; (11.18)
1 d 2
a' D a e' D P
.r '/: (11.19)
r dt
Now we introduce a new definition that plays an important role in describing the
motions of planets. Consider the area A spanned by the radius OP .t/, where P
belongs to the trajectory , in the time interval .t; t C t/ (Fig. 11.3). Then the limit
11.4 Velocity and Acceleration in Plane Motions 167
P t+ t
A
r (t+ t )
Pt
r(t)
O t x
A
AP D lim
t !0 t
1 2 1 2
r ' A rmax ';
2 min 2
in the limit t ! 0, we obtain
1
AP D r 2 ':
P (11.20)
2
In some cases, the Cartesian form of the areal velocity is more convenient. It can be
obtained by considering the coordinate transformation from Cartesian coordinates
.x; y/ to polar ones
p
r D x2 C y 2 ;
y
' D arctan :
x
It is easy to verify that, in view of these formulae, (11.20) becomes
1
AP D .x yP xy/:
P (11.21)
2
168 11 Kinematics of a Point Particle
s D r'.t/;
we have
sP D r '.t/;
P sR D r '.t/:
R
Introducing these results into (11.6) and (11.10), we obtain the expressions of
velocity and acceleration in a circular motion
rP D r 't;
P (11.22)
rR D r 't
R C r 'P 2 n; (11.23)
where t is the unit vector tangent to and n is the unit vector orthogonal to and
directed toward O. In particular, if P moves uniformly on , then (11.23) gives
rR D ! 2 r; (11.24)
! D j'j:
P (11.25)
rt s
x
O
X
Let P be a material point that uniformly moves counterclockwise ('P > 0) along
a circumference with its center at O and radius r. We define harmonic motion
with its center at O along the straight line Ox the motion that is obtained projecting
the point P along any diameter Ox of . In the Cartesian coordinates of Fig. 11.4,
we have that the motion of X is represented by the following equation:
In (11.26), r is called the amplitude of the harmonic motion and ! its frequency.
Differentiating (11.26) twice, we obtain
P
x.t/ D r! sin.!t C '0 /; R
x.t/ D r! 2 cos.!t C '0 /;
so that in a harmonic motion the following relation between position and accelera-
tion holds:
R
x.t/ D ! 2 x.t/: (11.27)
It is simple to recognize that the homogeneous second-order linear differential
equation with constant coefficients (11.27) characterizes the harmonic motions, i.e.,
its solutions are only harmonic motions. In fact, D i ! are the roots of the
characteristic equation of (11.27)
2 C ! 2 D 0;
x.t/ D C1 ei !t C C2 ei !t ;
where C1 and C2 are arbitrary constants. Taking into account Euler’s formulae, the
preceding solution can also be written as
where A and B are two arbitrary constants. Finally, we obtain (11.26) when we
write A and B in the form
Let P be a moving point with respect to a frame of reference .O; i; j; k/, where
i, j, and k are unit vectors along the .Ox; Oy; Oz/ axes of a Cartesian system of
coordinates. If the position vector r can be written in the form
X
n
r.t/ D ri .t/; (11.28)
i D1
y
O
Theorem 11.1. Let ABCD be the rectangle Œr1 ; r1 Œr2 ; r2 with vertices
A.r1 ; r2 /, B.r1 ; r2 /, C.r1 ; r2 /, and D.r1 ; r2 /. The compound motion of the
harmonic motions (11.29) has the following properties:
• If '1 D '2 , or '1 D '2 and !1 D !2 , then it is harmonic along one of the
diagonals of the rectangle ABCD;
• For all other values of '1 and '2 and !1 D !2 , the trajectory is an ellipsis
tangent to the sides of the rectangle ABCD every time touches one of them;
further, P moves along with constant areal velocity, and the motion is periodic;
• If !1 ¤ !2 and the ratio !1 =!2 is a rational number, the motion is periodic.
Further, the trajectory may be either an open curve or a closed curve;
• If !1 ¤ !2 and the ratio !1 =!2 is an irrational number, then the motion is
aperiodic and the trajectory is everywhere dense on the rectangle ABCD (i.e.,
the trajectory intersects any neighborhood of any point in the rectangle).
Proof. For the sake of simplicity, we limit ourselves to proving only the first three
items. Introducing the notation ı D '2 '1 and supposing !1 D !2 D !, from
(11.29) we obtain
x
D cos.!t C '2 ı/ D cos.!t C '2 / cos ı C sin.!t C '2 / sin ı
r1
y
D cos ı C sin.!t C '2 / sin ı;
r2
172 11 Kinematics of a Point Particle
so that
x y
cos ı D sin.!t C '2 / sin ı: (11.30)
r1 r2
On the other hand, in view of (11.29)2, we also have that
y
sin ı D cos.!t C '2 / sin ı: (11.31)
r2
x2 y2 2
2
C 2
x y cos ı D sin2 ı: (11.32)
r1 r2 r1 r2
If '1 D '2 , then ı D 0, whereas if '1 D '2 , then ı D . In all these cases,
(11.32) gives
x y 2
D 0; (11.33)
r1 r2
where the sign corresponds to ı D 0, the C sign corresponds to ı D , and
the trajectory coincides with one of the diagonals of the rectangle ABCD. To prove
that the motion along the diagonal y D r2 x=r1 , for instance, is harmonic, we note
that the distance s of P from O along this diagonal is given by
p q
sD x 2 C y 2 D r12 C r22 cos.!t C '1 /:
When ı is different from 0 and , the trajectory (11.32) is an ellipsis. In this case,
the areal velocity A [see (11.21)]
1 1
AD P x y/
.xy P D !r1 r2 sin ı
2 2
is constant and the motion is periodic. Moreover, the ellipsis touches the sides
x D r1 of the rectangle ABCD when cos.!t C '1 / D 1, that is, when
!t C '1 D n, where n is an arbitrary relative integer. It is plain to verify that
the corresponding value of xP vanishes and the ellipsis is tangent to the rectangle any
time it touches the sides x D r1 .
We now determine the conditions under which the motion is periodic when
!1 ¤ !2 . The compound motion is periodic if and only if for any instant t there
exists a value T of time such that
y
1.0
0.5
x
2 1 1 2
0.5
1.0
y
1.0
0.5
x
2 1 1 2
0.5
1.0
!1 T D 2n;
!2 T D 2m;
where n and m are arbitrary relative integers. In conclusion, the motion is periodic
if and only if the ratio
!1 n
D (11.34)
!2 m
is a rational number.
Figures 11.6–11.10 show different cases of compounding two harmonic motions
along orthogonal axes. They were obtained by using the notebook Composition.nb,
which allows one to draw the trajectory obtained by compounding two or three
motions.
174 11 Kinematics of a Point Particle
y
1.0
0.5
x
2 1 1 2
0.5
1.0
y
1.0
0.5
x
2 1 1 2
0.5
1.0
y
1.0
0.5
x
2 1 1 2
0.5
1.0
p
Fig. 11.10 r1 D 2, r2 D 1, !2 D 2, !2 D 1, '1 D 0, '2 D =3
11.7 Exercise
1. Let Oxy be a plane in which we adopt polar coordinates .r; '/, and let r D A'
be the trajectory of a point particle P , where A is a constant. Assuming that
'P D 'P0 is constant, determine the radial and transverse velocity and the radial
and transverse acceleration of P . Verify that the areal velocity is constant.
11.7 Exercise 175
2. In the same plane of the preceding exercise, a point particle P describes the
trajectory r D r0 e' , where r0 and 'P are constant. Determine the radial and
transverse velocity and acceleration.
3. In the plane Oxy, the equations of motion of a point particle P are
x D xP 0 t; y D y0 ˇ.1 e˛t /;
Hint: Evaluate rP taking into account that r 2 'P D c, where c is a constant. Then
recall that the radial velocity is rP and the transverse velocity is r '.
P
10. A particle P moves along a curve y D a sin bx, where a and b are constant.
Determine the acceleration of P assuming that the component of its velocity
along the Ox-axis is constant.
Chapter 12
Kinematics of Rigid Bodies
P D f .P 0 ; t 0 /; (12.1)
t D g.P 0 ; t 0 /; (12.2)
relating the space-time measures that R and R0 associate with the same event.
When we recall the postulate stating that the behavior of a clock is not influenced
by the transport, we can equip R and R0 with synchronized clocks and be sure that
they will remain synchronous. Therefore, (12.2) becomes t D t 0 . More generally,
even if R and R0 use clocks with the same behavior, they could choose a different
time unit and a different origin of time. In this case, instead of t D t 0 , we have
t D at 0 C b; (12.3)
where a and b are constants. We remark that (12.3) still implies that two simultane-
ous events for R are also simultaneous for R0 .
Let EA D .A0 ; t 0 / and EB D .B 0 ; t 0 / be two events happening in two different
points at the same instant for the observer R0 . The same events are also simultaneous
for R but happening at points A and B. To determine the function (12.1), classical
kinematics resorts to another axiom.
Axiom 12.1. At any instant, transformation (12.1) preserves the spatial distance d
between simultaneous events, that is,
In other words, Axiom 12.1 states that, at any instant, (12.1) is an isometry
between the three-dimensional Euclidean spaces E3 and E30 of observers R and R0 ,
respectively. Let us introduce Cartesian axes .O 0 ; .xi0 // in the frame R0 and
Cartesian axes .O; .xi // in R. If .xi0 / and .xi / are the coordinates that R0 and R,
respectively, associate with the points P 0 and P of the same event E D .P 0 ; t 0 / D
.P; t/, then from linear algebra we know that transformation (12.1) is expressed by
the formulae
xi D xi C Qij .t/xj0 ; (12.5)
where is the place in which occurs the arbitrary event E D .; t/, evaluated by
observer R, and .Qij .t// is an arbitrary time-dependent orthogonal matrix, that is,
a matrix satisfying the conditions
xi D Qij .t/xj0 ;
which gives the transformation formulae of the components of the position vector
r0 in passing from the frame .O; .e0i // to the frame .O; .ei //. Consequently, the two
bases .ei / and .e0i / are related by the equations
so that the coefficients .QT /j i are the components of the vectors e0i with respect to
the base ei .
Remark 12.2. In many textbooks, the transformation formulae (12.5) are proved by
noting that from Fig. 12.1 we have that
rP D r C r0 : (12.8)
Therefore, referring this relation to the bases .ei / and .e0i /, we obtain
x3
x’3 S
x’2
e’3 P
r’
e’2
Ω
rP
e3 e’1
rΩ
x’1 x2
O
e2
e1
x1
Now, representing the base .e0i / with respect to the base .ei / by the relations e0j D
Qj i ei , where .Qij / is an orthogonal matrix, we again obtain (12.5). However, this
approach hides a logical mistake. In fact, in both of the preceding relations, the
vector r0 is intended to be a vector belonging to both vector spaces relative to the two
observers R and R0 , that is, it is intended as the same vector. But this assumption is
just equivalent to the condition that the correspondence between the spaces relative
to the two observers is an isometry. In other words, we can accept (12.8) only after
proving (12.5).
In this section we evaluate the velocity field and the acceleration field of any rigid
body S moving relative to a rigid frame of reference R.
Let .O; .ei // be a Cartesian frame in R, and denote by .xi / the coordinates of any
point of S relative to .O; .ei //. Introduce a Cartesian frame .; .e0i // in the body
S (Fig. 12.1) with the origin at an arbitrary point 2 S . Owing to the rigidity of
S , any point P of S has coordinates .xi0 / relative to .; .e0i // that do not depend
on time. For the sake of brevity, we call R the space frame or the lab frame and
.; .e0i // the body frame.
180 12 Kinematics of Rigid Bodies
The relation between the coordinates .xi0 / and .xi / is expressed by (12.5). Since
we are now interested in the velocity field of S with respect to R and S is a rigid
body, we differentiate (12.5) with respect to time:
xP i D xP i C QP ij xj0 : (12.9)
On the other hand, in view of (12.5) and the orthogonality condition (12.6), we have
that
xj0 D .QT /j h .xh xh /: (12.10)
Inserting the preceding expression into (12.9) yields the relation
where
Wih D QP ij .QT /j h : (12.12)
In (12.11), xP i , xP i , and xh xh are components of vectors; therefore, from the
tensorial criteria of Chap. 2, we can state that Wih are the components of a Euclidean
2-tensor W. This tensor is skew-symmetric since (12.6) implies the equality
and consequently
With the notations of Fig. 12.1, we can put (12.16) in the following vector form:
! !
rP P D rP C ! P ; P D rP r : (12.17)
Differentiating the preceding formula with respect to time and noting that
d ! d !
P D .rP r / D rP P rP D ! P ;
dt dt
12.2 Velocity Field of a Rigid Motion 181
1
!i D ij h Wj h : (12.19)
2
Remark 12.3. Equation (12.17) shows that the velocity field of a rigid motion is
completely determined by giving two vector functions of time, that is, the velocity
rP .t/ of an arbitrary point of the rigid body and the vector function !.t/.
We now prove the important Mozzi’s theorem, which supplies both an interesting
interpretation of (12.17) and the physical meaning of the vector !.t/.
Theorem 12.1. At any instant t at which !.t/ ¤ 0, the locus of points A such that
rP .t/ D .t/!.t/, where .t/ is a real function, is a straight line A.t/, parallel to
!.t/, which is called Mozzi’s axis. Moreover, all points belonging to A.t/ move
with the same velocity .t/.
Proof. The points A of S having the property stated by the theorem are character-
ized by the condition
rP A ! D 0;
which, in view of (12.17), becomes
!
.Pr C ! P / ! D 0:
When we take into account the vector identity .a b/ c D .a c/b .b c/a, the
preceding equation can also be written as
! !
rP ! D .A !/! ! 2 A: (12.20)
!
In conclusion, the locus A.t/ coincides with the set of solutions A of (12.20).
Introducing the unit vector k D !=j!j, we have that
! ! !
A D .A k/k C A? ;
!
where A? k D 0, and (12.20) becomes
! ! !
rP ! D .A k/! 2 k ! 2 Œ.A k/k C A? :
182 12 Kinematics of Rigid Bodies
rP P D rP ;
rP P D .t/: (12.23)
0 1
cos ' sin ' 0
.Qij / D @ sin ' cos ' 0 A :
0 0 1
Taking into account (12.7), (12.12), and (12.19), by simple calculations we obtain
the following results:
0 1
0 1 0
Wij D 'P @ 1 0 0 A ;
0 0 0
1
!i D ij h Wj h D .0; 0; '/:
P
2
If the arbitrary point in (12.17) is chosen on the fixed axis a and the preceding
expression of ! is taken into account, then we obtain the velocity field of a rotational
rigid motion with a fixed axis a:
!
rP P D 'e
P 3 P : (12.24)
184 12 Kinematics of Rigid Bodies
Theorem 12.3. Let S be a rigid body rotating about the fixed axis a, and denote
by C the orthogonal projection of any point P 2 S on axis a and by P the plane
orthogonal to a and containing C . Then the following properties hold.
Proof. Mozzi’s axis A.t/ is a set of points whose velocities vanish or are parallel
to !. Since a is fixed, its points must belong to A.t/. Consequently, A.t/ coincides
with the fixed axis a. The second statement follows at once from both (12.25) and
the properties of the cross product.
The vector !.t/ vanishes in a translational rigid motion, whereas in a rotational
motion about a fixed axis a, it is directed along a and its component along a gives
the angular velocity of all points of the rigid body.
Definition 12.3. A rigid motion is helicoidal if it is obtained by compounding a
rotational motion about axis a and a translational motion along a.
In view of (12.23) and (12.25), the velocity field in such a motion yields
!
rP P D C 'k
P OP ; (12.26)
where is the sliding speed of a, k a unit vector along a, and 'P the angular velocity
about a.
Comparing the velocity field of a helicoidal motion (12.26) with (12.22), we
obtain the following equivalent form of Mozzi’s theorem.
Theorem 12.4. At any instant t, the velocity field of a rigid motion M coincides
with the velocity field of a helicoidal motion, called a helicoidal motion tangent to
M at instant t.
At this point, it is quite natural to call ! the instantaneous angular velocity of the
rigid motion.
12.4 Spherical Rigid Motions 185
so that a spherical motion is determined by giving the matrix function .Qij .t//.
Since this matrix is orthogonal, that is, it confirms the six conditions (12.6), we can
say that a spherical motion is determined assigning three independent coefficients
of the orthogonal matrix .Qij / as functions of time.
Further, the velocity field of a spherical motion is given by the equation
!
rP P D !.t/ OP ; (12.28)
in which both the value and direction of the angular velocity ! depend on time.
Instead of giving three independent coefficients of the orthogonal matrix .Qij /,
we can determine the orientation of the Cartesian axes .O; .xi0 // with respect to the
Cartesian axes .O; .xi // using three other independent parameters, called Euler’s
angles (Fig. 12.3). These angles can be defined when the planes Ox1 x2 and Ox10 x20
do not coincide. In fact, in this case, they intersect each other along a straight line,
called a line of nodes or nodal line, and we can define the angles , ', and shown
in Fig. 12.3. The angle between the axis Ox1 and the line of nodes is called the
angle of precession, the angle ' between the line of nodes and the axis Ox10 is the
x3
x’3 x’2
e3
e’2
e’3
N
O x2
e’1
e1 n
x’1
line of nodes
x1
angle of proper rotation, and the angle between the axes Ox3 and Ox30 is called the
angle of nutation. To recognize the independence of these angles, it is sufficient to
verify that they define three independent rotations transforming the axes .Oxi / into
the axes .Ox 0i /. It is easy to understand that the following three rotations have this
property: a rotation through the angle about the axis Ox3 , a rotation through the
angle ' about the axis Ox30 , and a rotation through the angle about the line of
nodes.
We now determine the fundamental Euler kinematical relations relating the an-
gular velocity with the Euler angles. First, we start with the following representation
of ! in the base .e0i /:
! D pe01 C qe02 C re03 : (12.29)
Owing to the independence of the Euler angles, we can write
! D P e3 C 'e P
P 03 C n: (12.30)
Denoting by N the unit vector in the plane .O; e01 ; e02 / and orthogonal to n and
resorting to Fig. 12.3, we can easily prove the following formulae:
P D 1
.p sin ' C q cos '/; (12.32)
sin
'P D .p sin ' C q cos '/ cot C r; (12.33)
P D p cos ' q sin ': (12.34)
12.4 Spherical Rigid Motions 187
0 1
cos sin 0
A D @ sin cos 0 A ;
0 0 1
0 1
1 0 0
B D @ 0 cos sin 0 A ;
0 sin cos
0 1
cos ' sin ' 0
C D @ sin ' cos ' 0 A ;
0 0 1
so that
On the other hand, the components of a vector are transformed according to the
inverse matrix of .QijT /, that is, according to .Qij /, since .QijT / is orthogonal.
Consequently, the matrix giving the transformation .p; q; r/ ! .!i / is
Definition 12.5. A rigid motion is a planar motion if the velocity field is parallel
to a fixed plane .
It is trivial that the rigidity of the planar motion implies that all points that belong
to the same straight line orthogonal to have the same velocity. Consequently,
to characterize the velocity field of a planar motion, it is sufficient to analyze the
velocity field on an arbitrary plane parallel to . We must determine the restrictions
on the vectors and ! of (12.22) due to the hypothesis of planar motion. The answer
to this question is given by the following theorem.
Theorem 12.5. In a planar motion, we have that
1. If .t/ ¤ 0, then necessarily !.t/ D 0 and .t/ is parallel to .
2. If !.t/ ¤ 0, then necessarily .t/ D 0 and !.t/ is orthogonal to .
Proof. At instant t, the vector in (12.26) gives the common velocity of all points
belonging to Mozzi’s axis A.t/. Consequently, A.t/ must be parallel to . But also
!
!.t/ AP must be parallel to for any choice of P , and that is possible if and
!
only if !.t/ D 0. Inversely, if !.t/ ¤ 0, then the term !.t/ AP is parallel to
for any P if and only if !.t/ is orthogonal to . Further, .t/ must also be parallel
to both and ! so that it vanishes.
The preceding theorem states that, at any instant t, the velocity field of a planar
motion coincides either with that of a translational motion parallel to or with the
velocity field of a rotational motion about an axis orthogonal to . In this last case,
since the points of this axis are at rest, it coincides with Mozzi’s axis A.t/. The
intersection point A of A.t/ with the plane is called the instantaneous rotation
center. It is evident that, for any point P , A belongs to a straight line orthogonal to
the velocity rP P at point P .
In this section, we apply the results of the previous sections to two examples. Let
S be a disk and let a be an axis orthogonal to S containing the center C of S . We
suppose that S rotates with angular velocity ! D P k about a, where k is the unit
vector along a. Further, we suppose that C moves with constant angular velocity
'P on a circumference having its center at O and radius R (Fig. 12.4). We take
the position vectors starting from O and denote by rA the unknown position vector
of the instantaneous rotation center A. Since we are considering a nontranslational
planar motion, Mozzi’s axis A.t/ is orthogonal to the plane containing S and,
12.6 Two Examples 189
.
rC
C
R
A
rC
x
O
rP C P k
rA D rC :
P2
j'j
P
rA D rC ˙ R n;
P
j j
where we must choose the C sign when 'P and P have the same sign and the sign
in the opposite case. In conclusion, during motion, the instantaneous rotation center
A describes a circumference with center at O and radius r less or greater than R,
depending on whether or not 'P and P have the same sign.
As a second example, we consider a disk S rotating with constant angular
velocity P about an axis a contained in the plane of S (Fig. 12.5). We also suppose
that the center C of S moves along a straight line Ox orthogonal to a with a velocity
rP C . To determine the evolution of Mozzi’s axis A.t/, we denote by k a unit vector
along a and by A the intersection point of A.t/ with the plane Oxy. Then from
(12.22) we obtain
rP C P k jPrC j
rA D rC D rC j;
P2 j Pj
where j D k i and i is the unite vector directed along Ox.
190 12 Kinematics of Rigid Bodies
k
z
y a
k
j
.
O rC x
i C
Let R and R0 be two observers moving relative to each other. Let the relative
motion be rigid. If point P is moving relative to both R and R0 , what is the relation
between the velocities and the accelerations of P measured by both observers? In
this section, we answer this question.
We denote by .O; ei / and .; e0i / two systems of Cartesian coordinates fixed in R
and R0 , respectively. Further, we call the motion of P relative to R0 relative motion
and the motion of P relative to R absolute motion.
Let xi0 .t/ be the parametric equation of the moving point P in the frame .; xi0 /.
To know the coordinates of P relative to .O; xi /, we introduce the functions xi0 .t/
into (12.5):
xi .t/ D xi .t/ C Qij .t/xj0 .t/: (12.36)
Now, we must recognize the physical meaning of the different terms appearing in
(12.37). First, on the left-hand side are the components relative to the base .ei / of
the velocity rP P of P evaluated by observer R, that is, of the absolute velocity. The
first two terms on the right-hand side, which coincide with (12.9), are obtained by
differentiating (12.36) with respect to time and holding constant the components
xj0 .t/. In other words, these terms give the velocity v of the point at rest relative
to .; e0i / that, at instant t, is occupied by P . Since the motion of R0 relative to R
12.7 Relative Motions 191
is rigid, v is given by (12.17), i.e., by the first two terms on the right-hand side
of (12.37). Finally, the last term of (12.37) is the velocity rP 0P of P evaluated by
observer R0 . In conclusion, we can give (12.37) the following form:
rP P D v C rP 0P ; (12.38)
where
!
v D rP C ! P : (12.39)
Equation (12.38) is the mathematical formulation of the classical principle of
velocity composition.
Let u be a vector varying in time with respect to both observers R and R0 . Due
to the relative motion of R0 relative to R, the dependence of u on time is expressed
by a function f.t/ in R and by a function g.t/ in R0 . We call
da u.t/
D Pf.t/
dt
dr u.t/
D gP .t/
dt
the relative derivative of u.t/. Here, we show that (12.38) allows us to determine
the relation between these two derivatives. In fact, the following two representations
!
of the vector u.t/ AB hold in R and R0 , respectively (Fig. 12.6):
where rB .t/ and rA .t/ are the position vectors of points B and A relative to the
frame .O; ei /, and r0B .t/ and r0A .t/ are the position vectors of the same points in the
frame .; e0i /. It is evident that
da u.t/
D rP B .t/ rP A .t/;
dt
dr u.t/
D rP 0B .t/ rP 0A .t/:
dt
Using (12.38) and (12.39) in the preceding expressions, we obtain the fundamental
formula
da u.t/ dr u.t/
D C ! u.t/: (12.41)
dt dt
We are interested in determining the relation between the absolute acceleration
and the relative acceleration. We reach this result differentiating with respect to time
either (12.37) or (12.38). Here, we adopt the second approach, which is based on
(12.41).
192 12 Kinematics of Rigid Bodies
x3
x’2
x’3 B u
r’B
A
e’3 r’A
rB
e3 e’2
rA Ω x2
O
e2 e’1
e1
x’1
x1
da rP P da
rR P D D .Pr0P C v /
dt dt
0
dr rP P da v
D C ! rP 0P C
dt dt
da v
D rR 0P C ! rP 0P C : (12.42)
dt
On the other hand, from (12.39) we have that
da v !
D rR C !
P P C ! .PrP rP /
dt
!
P P C ! .Pr0P C v P rP /
D rR C !
! !
P P C ! rP 0P C ! .! P /:
D rR C ! (12.43)
rR P D rR 0P C a C ac ; (12.44)
12.7 Relative Motions 193
x2
x’2
.
rP
v x’1
e2
e’2 .
rP’
e’1 P
O x1
e1
where
! !
a D rR C !
P P C ! .! P /;
ac D 2! rP 0P :
In view of (12.18), the term a gives the acceleration of that fixed point of .; e0i /,
which is occupied by P at instant t. The term ac is called Coriolis’ acceleration.
Example 12.1. Let P be a point moving on a straight line Ox1 that rotates
with constant angular velocity 'P0 about a fixed axis a orthogonal to Ox1 x2 and
containing O (Fig. 12.7). Verify that
a C 2! rP 0 D 0
must hold for any vector rP 0 , so that a D ! D 0, and the theorem is proved.
194 12 Kinematics of Rigid Bodies
12.8 Exercises
1. Let !i be the components of the angular velocity in the lab frame .O; xi /.
Determine the evolution equations of the matrix Q D .Qij / relative to .O; xi /.
From (12.12) and (12.15) we obtain
P D WQ;
Q (12.45)
This is a system of nine ODEs in the nine unknowns .Qij .t//. However, we
can only accept solutions for which the matrix Q is orthogonal at any instant.
Now we prove that this property is confirmed by any solution of (12.46)
corresponding to initial data Q.0/ such that
P T D W D WT D QQ
QQ P T;
that is,
P T D 0 , d .QQT / D 0:
P T C QQ
QQ
dt
In other words, for any solution of (12.46), the matrix QQT is constant.
Therefore, if we assign the initial data satisfying (12.47), the corresponding
solution of (12.46) will be an orthogonal matrix.
2. Determine the time evolution of the matrix Q D .Qij / in the lab frame when
the angular velocity ! is given by the functions . P .t/; '.t/;
P P .t//.
When the initial values . 0 ; '0 ; 0 / of Euler’s angles are given, by integrat-
ing . P .t/; '.t/;
P P .t//, we obtain Euler’s angles . .t/; '.t/; .t//. Then, the
time evolution of the matrix Q D .Qij / follows from (12.32)–(12.35).
3. Let S be a rigid homogeneous bar whose end points A and B are constrained
to move, respectively, along the axes Ox and Oy of a Cartesian system of
coordinates. Prove that, during the motion of S , the instantaneous rotation
center C describes a circumference with its center at O and radius equal to the
length 2l of the bar (Fig. 12.8). Justify the result recalling that the instantaneous
rotation center belongs to the intersection of the straight lines orthogonal to the
velocities.
Hint: Point A has coordinates .2l sin '; 0/, where ' is the angle between S
and a downward oriented vertical straight line. Then, the velocity of A has
components .2l 'P cos '; 0/. In view of (12.21) and Theorem 12.4, we have that
12.8 Exercises 195
B
C
O l x
A
B
A
O G
B D x
rP A !
rC D rA :
!2
Since ! D 'k, P where k is a unit vector orthogonal to the plane Oxy, we have
that rC D .2l sin '; 2l cos '/.
4. The center G of a rectangle ABCD (Fig. 12.9) moves along the Ox-axis of a
Cartesian system of coordinates. Further, ABCD rotates about an axis orthog-
onal to Oxy and containing G. Denoting by 2l the diagonal of the rectangle,
determine the velocity of point A and the trajectory of the instantaneous rotation
center.
!
Hint: rP A D xP G i C 'k
P GA. Apply (12.21) to determine C .
5. A bar OC and a disk are constrained to move in the plane Oxy (Fig. 12.10).
The bar has point A moving on the Ox-axis and rotates about O. The disk
rotates about C . Determine the velocity of point P of the disk.
6. A uniform circular hoop of radius r rolls on the outer rim of a fixed wheel of
radius b; the hoop and the wheel are coplanar. If the angular velocity ! of the
hoop is constant, find
196 12 Kinematics of Rigid Bodies
y
P
O
x
l l
O P
x
13.1 Introduction
Dynamics has the aim of determining the motion starting from its causes. If we take
into account all the characteristics of real bodies, such as, for example, extension and
deformability, with the aim of reaching a more accurate description of the real world,
then we reach such a complex mathematical model that it is usually impossible
to extract concrete results to compare with experimental data. Consequently, it is
convenient to start with simplified models that necessarily neglect some aspects
of real bodies. These models will describe real behavior with sufficient accuracy,
provided that the neglected aspects have negligible effects on the motion.
In this chapter, we start with the Newtonian model in which the bodies are
modeled as material points or point masses. This model does not take into account
the extension and form of bodies, which are characterized only by their mass. It
is evident that such a model supplies a reasonable description of the real behavior
of bodies only when their extension can be neglected with respect to the extension
of the region in which the motion takes place. A famous example satisfying this
condition is supplied by the motion of the planets around the Sun. In fact, the
Newtonian model was created simply to describe the behavior of the planetary
system and, more generally, of celestial bodies such as comets, asteroids, and others.
The evolution of systems formed by material points is governed by the Newtonian
laws of dynamics, whose predictions have been confirmed by a large body of
experimental data.
In Chap. 15 we analyze the Eulerian model in which the extension of a body B
becomes a relevant characteristic of the model; the mass distribution of B plays a
crucial role but its deformations are neglected. In other words, the Eulerian model
analyzes the motion of extended rigid bodies. A model to describe a system of
material points or rigid bodies, free or subjected to constraints, is due to Lagrange
and will be analyzed in Chap. 17.
As was remarked in Chap. 11, the description of motion requires the introduction
of a frame of reference equipped with synchronized clocks. All frames of reference
having this characteristic are quite equivalent from a kinematical point of view. In
dynamics we are faced with a completely different situation since the choice of a
frame of reference deeply affects the connection between the motion and its causes.
Then it is fundamental to check the existence of a subset I of the whole class of
frames of reference adopted in kinematics, in which the description between motion
and causes admits the simplest description.
The criterion that helps us in determining set I is based on a fundamental
assumption: any action on a material point P is due to the presence of other bodies
B in the space around it; moreover, this action reduces to zero when the distances
between P and the bodies belonging to B go to infinity. This vague assumption on
the interaction among bodies makes reasonable the following definition: a material
point P is said to be isolated if it is so far from any other body that we can neglect
their action on P . It is natural to presume that in the class I the motion of an isolated
material point P is as simple as possible. In this respect, we have the following
principle.
Axiom 13.1. Principle of inertia (Newton’s First Law) – There is at least a frame
of reference I , called an inertial frame, in which any isolated material point is at
rest or is uniformly moving on a straight line.
An inertial frame I whose existence is stated by the preceding principle can be
localized using the following procedure. First, I is chosen in such a way that any
isolated material point moves on a straight line relative to I. On the straight line r
described by a particular isolated material point P we fix a point O and denote by x
the abscissa of P with respect to O. Then I is equipped with synchronized clocks
(Sect. 11.1) that associate with any position x of P the instant
x
tD ; (13.1)
v
where v is an arbitrary constant. It is evident that, with the time t, point P moves
uniformly on the straight line r with constant speed v. The variable t is defined up
to a linear transformation since we can change the origin of time by varying point O
and the unit of time by changing the value of v. The principle of inertia states that,
with this choice of time, the motion of any isolated material point is still rectilinear
and uniform. Such a time will be called a dynamical time.
In practice, we choose an arbitrary periodic device and then control whether
or not the time that it defines is dynamical. Usually, we adopt the Earth as a
fundamental clock, dividing its period of rotation into hours, primes, and seconds.
A more accurate measure of dynamical time is obtained by fractionizing the period
of revolution of the Earth about the Sun.
13.3 Dynamical Interactions and Force Laws 199
The existence of at least one inertial frame implies the existence of infinite
inertial frames, as is proved by the following theorem, which follows from
Theorem 12.5.
Theorem 13.1. Let I be an inertial frame equipped with the dynamical time t.
Then, a frame of reference I 0 is an inertial frame if and only if its motion relative
to I is translational and uniform and it is equipped with clocks measuring the time
t 0 D at C b, where a and b are arbitrary real constants.
The vector F.Pi ; Pj / is the force that Pj exerts on Pi , the function fij is called
a force law, condition (13.3) is the mathematical version of the action-reaction
principle (Newton’s third law), and, finally, (13.4) postulates the parallelogram
rule for forces.
The preceding axiom lays down the laws that hold for interactions in classical
mechanics. Before introducing the axiom that makes clear the connection between
these actions and the motion they produce, we must deal with another important
aspect of classical interactions. More precisely, we must define the behavior of
forces and force laws under a change of the frame of reference. It is important to
solve this problem for the following reasons. First, we can compare the dynamical
descriptions obtained by two inertial observers analyzing the same phenomenon.
Second, there are some interesting phenomena (for instance, dynamics relative to
the Earth) in which a noninertial frame is spontaneously adopted, so that we are
compelled to decide how the force law is modified by adopting this new frame of
reference.
To solve the preceding problem, we start by introducing some definitions.
Definition 13.1. A tensor is objective if it is invariant under a rigid frame change
R D .O; xi / ! R0 D .0 ; xi0 / given by
where r
r D ri rj ; rP D rP i rP j ; kD ;
jrj
and 'ij is an objective scalar function.
Remark 13.2. The objectivity principle implies that the force fij .ri ; rj ; rP i ; rP i / acts
along the straight line joining points Pi and Pj , in accordance with the action and
reaction principle.
Proof. To simplify the notations, we omit the indices i and j and set ri D u,
rj D v. Further, we notice that the length of any vector r D u v is invariant under
transformation (13.5). If (13.11) holds, then we have that
rP 0 r 0 rh0
fh0 .u0 ; v0 ; uP 0 ; vP 0 / D ' jr0 j; l 0 l
jr j jr0 j
!
.Qlk rPk C QP lk rk /Qlm rm Qhl rl
D ' jrj; :
jrj jrj
and the force law is objective. Conversely, if the force law is objective, i.e., if
must hold for any possible choice of the quantities of Qkl , QP kl , xi , and xP i . Now
we choose
Qkl D ıkl ; QP kl D 0; xk D vk ; xP i D vP k :
This choice corresponds to taking a frame of reference R0 that, at the arbitrary
instant t, moves relative to I with a translational velocity equal to the velocity at
instant t of point Pj , which is at rest at the origin of R0 . For this choice, (13.12)
becomes
fh .rk ; 0; rPk ; 0/ D fh .uk ; vk ; uP k ; vP k /
so that f must depend on r D u v and rP D uP vP . Then, for any orthogonal matrix
.Qkl /, we have that
rP D rP ? C .Pr k/k;
When we note that jrj and rP k are invariant under transformation (13.5), we conclude
that (13.13) assumes the final form
so that f.r; .Pr k/k/ depends isotropically on the variables r and .Pr k/k. Finally,
(13.11) follows from known theorems of algebra about isotropic functions.
The axioms of the preceding sections describe some fundamental aspects of the
classical interactions between material points. The following axiom describes the
connection existing between a force acting upon a material point and a correspond-
ing motion.
Axiom 13.4. Let S D P1 ; : : : ; Pn be an isolated system of n material points, and
.e/
let I be an inertial system of reference. Denote by Pi the set of all points of S
different from Pi . Then, to any material point Pi it is possible to associate a positive
number mi , the mass of Pi , which only depends on the material constitution of Pi .
Further, the acceleration rR i of Pi relative to I satisfies the equation [see (13.4)]
mi rR i C mj rR j D 0: (13.15)
jRri j mj
D : (13.16)
jRrj j mi
This is precisely the result proved in the preceding theorem, which is postulated
in the formulation due to Mach and Kirchhoff. In this approach the force Fi .Pi ; Pj /,
which Pj exerts upon Pi , is defined as the product mi rR i . Postulating that the forces
satisfy the parallelogram rule, the action and reaction principle is derived. We
conclude by noting that (13.16) leads to an operative measure of mass by measures
of accelerations.
204 13 Principles of Dynamics
In this section, we show that the Newtonian laws of dynamics allow one, at least in
principle, to determine the motion relative to an inertial frame I of material points
belonging to an isolated system S D fP1 ; : : : ; Pn g. In fact, applying (13.14) to any
material point Pi 2 S and taking into account (13.3), we obtain the system
mi rR i D fi .r1 ; : : : ; rn ; rP 1 ; : : : rP n /; i D 1; : : : ; n; (13.17)
where
n
X
fi .r1 ; : : : ; rn ; rP 1 ; : : : rP n / D fij .ri ; rj ; rP i ; rP j /
j ¤i;j D1
In other words, when the state of the system S is assigned by giving the position
and velocity of any point of S at the instant t0 , then the evolution of S is uniquely
determined for t t0 . This circumstance, characteristic of classical physics, is
called mechanical determinism.
In our analysis it is proved that the explicit solution of the initial value prob-
lem (13.17) and (13.18) can only be found for some particular force laws. However,
we show throughout the book that we have at our disposal many mathematical
instruments to obtain meaningful information about the solution and its properties.
We are now in a position to state a fundamental principle of dynamics. Let I and
I 0 be two inertial frames, and let S D fP1 ; : : : ; Pn g be an isolated system moving
relative to them. Since the motion of I 0 relative to I is translational and uniform,
the acceleration of any point Pi 2 S is the same in both inertial frames [see (12.44)]
rR 0 D rR :
Further, the general axioms of dynamics state that the mass of Pi is invariant, and
the force Fi acting on it is objective, together with the force law. Consequently, in
the inertial frame I 0 , system (13.17) becomes
we obtain
n
X n
X .e/
mi rR i D Fi : (13.22)
i D1 i D1
1
For a deeper analysis of the relativity principle, see Chap. 24.
206 13 Principles of Dynamics
Theorem 13.5. The time derivative of the linear momentum of the system S of
material points under the action of the external system S e is equal to the total
external force:
P D R.e/ :
Q (13.25)
Equation (13.25) is called a balance equation of linear momentum or first
cardinal equation of dynamics.
Define the center of mass G as the point whose position vector rG with respect
to the axes of the inertial frame I is given by the relation
n
X
mrG D m i ri ; (13.26)
i D1
Pn
where m D i D1 mi is the total mass of S . Then we have that
Q D mPrG ; (13.27)
Remark 13.4. It is important to note that (13.28) does not allow us to find the
motion of G since, in general, the total force depends on the position and velocity
of all material points of S . It is possible to determine the motion of G by (13.28)
only if the total force depends on the position and velocity of G.
Let ri be the position vector of Pi 2 S relative to the origin O of an
inertial frame I . Taking into account the action and reaction principle, after simple
calculations, from (13.20) we derive the condition
13.6 Balance Equations 207
n
X n
X .e/
ri mi rR i D ri F i : (13.29)
i D1 i D1
P O D M.e/ :
K (13.32)
O
Differentiating with respect to time and recalling (13.25), (13.27), and (13.30),
instead of (13.32), we obtain
P A rP A mPrG D M.e/ :
K (13.33)
A
This generalization of the balance equation of angular momentum shows that (13.32)
holds either if A is fixed or if it coincides with the center of mass G.
Remark 13.6. The balance equations of dynamics do not determine the motion of
a system S since they do not contain the internal forces, which are fundamental
in determining the evolution of S . However, they will play a fundamental role in
describing the motion of a rigid system in which the internal interactions do not
affect its evolution.
208 13 Principles of Dynamics
1
T D T 0 C mrPG2 : (13.35)
2
rP i D rP 0i C rP G :
X n
1
D T 0 C mrPG2 C rP G mi rP 0i :
2 i D1
Theorem 13.9. The angular momentum K0 , relative to the motion about the center
of mass, is independent of the pole. Further, if KG is the angular momentum relative
to pole G in any frame of reference, it results that
KG D K0 : (13.37)
Proof. Let O and O 0 be two arbitrary points moving or at rest in RG . Then, taking
into account (13.36), we have
n
X n
X n
X
! 0! !
K0O 0 D O 0 Pi mi rP 0i D O O mi rP 0i C OPi mi rP 0i D K0O :
i D1 i D1 i D1
Integrating (13.38) in the time interval Œt0 ; t1 , we obtain that the variation of kinetic
energy is equal to the work of internal and external forces acting upon the system S :
.i / .e/
T .t1 / T .t0 / D WŒt0 ;t1 C WŒt0 ;t1 : (13.40)
The forces acting on S are said to be positional if they depend only on the
position vectors of the material point belonging to S and S .e/ (Sect. 13.6). In
this case, the work of these forces in the time interval Œt0 ; t1 only depends on the
trajectories of the points of S and S .e/ and is independent of their velocities. More
particularly, a system of positional forces † D f.r1 ; F1 /; : : : ; .r1 ; F1 /g acting upon a
system of material points fP1 ; : : : ; Pn g is conservative if there exists a C 1 function
U.r1 ; : : : ; rn /, called potential energy, such that
Fi D rri U: (13.41)
The work of conservative forces in the time interval Œt0 ; t1 assumes the form
WŒt0 ;t1 D .U.r1 .t1 /; : : : ; rn .t1 // C U.r1 .t0 /; : : : ; rn .t0 //; (13.42)
that is, the work depends not on the trajectories and velocities of the points of S but
only on their initial and final positions.
We conclude this section with the following theorem, which follows immediately
from (13.40) and (13.42).
Theorem 13.10 (Conservation of mechanical energy). If the forces acting on a
system S of material points are conservative and U is the corresponding potential
energy, during the motion the total energy is constant:
T C U D const: (13.43)
In the preceding sections, the fundamental laws of dynamics were stated with
respect to inertial frames. However, in some cases it is necessary to analyze the
motion of a system S D fP1 ; : : : ; Pn g in an arbitrary frame of reference. For
instance, when we study the motion of a material point with respect to the Earth,
we adopt a terrestrial frame of reference that is not inertial.
13.10 Constrained Dynamics 211
Let I and R0 be, respectively, an inertial frame and an arbitrary rigid frame of
reference. To find the form that the Eqs. (13.17) assume in R0 , we first recall that
the masses and the force laws are invariant under an arbitrary change in the rigid
frame of reference. Then, taking into account (12.44), we can give (13.17) the form
where a i is the acceleration of point Pi due to the relative motion of R0 with respect
to I (Sect. 12.7). Relation (13.14) shows that, to describe the evolution of S in the
frame R0 , besides the forces fi , i D 1; : : : ; n, we must consider the forces mi a i
and 2mi ! rP 0i , which are called inertial or fictitious forces. More precisely, the
first of these is the drag force, whereas the second one is the Coriolis force. The
name we have given these forces is due to the fact that the observer R finds no
physical justification of their presence, that is, he cannot find bodies that, interacting
with S , can explain the existence of those forces. For these reasons, the forces due
to the presence of bodies are also called real forces.
We conclude by remarking that all the results we proved for inertial frames can be
extended to noninertial frames, provided we consider both real and fictitious forces.
together with the equation of the surface, at least in principle, makes it possible for
us to determine the unknowns r.t/ and .r.t//.
Let P be a material point moving along the curve obtained by intersecting the
surfaces
f1 .r/ D 0; f2 .r/ D 0 (13.46)
such that the rank of the Jacobian matrix of the functions f1 and f2 is equal to
2. If the surfaces are smooth, then the reactive force of the constraint is a linear
combination of two reactive forces orthogonal to each surface. Consequently, the
equation of motion can be written as
Equations (13.46) and (13.47) allow, at least in principle, to determine the unknowns
r.t/, 1 .r.t//, and 2 .r.t//.
If the curve is given in parametric form
r D r.s/ (13.48)
sP 2
rR D sR t C n; (13.49)
r
where t is the unit vector tangent to , n the principal unit vector, and r the curvature
radius of . The equation of motion
sP 2
m sR t C n D F C ˆ; (13.50)
r
13.11 Exercises 213
when it is projected along t and the condition ˆ t D 0 is taken into account, gives
the scalar differential equation
13.11 Exercises
Definition 14.1. A positional force is said to be central with center O if its force
law is
r
F D f .r/ ; r D jrj; (14.1)
r
where r is the position vector relative to O.
The elementary work dW of F in the elementary displacement dr is
r
dW D F dr D f .r/ dr D f .r/dr:
r
Then, taking into account (13.41), we can state that a central force is conservative
and its potential energy is given by
Z
U D f .r/dr: (14.2)
A first example of central force is the elastic force for which (14.1) and (14.2)
become
r 1
F D kr ; U D kr 2 ; (14.3)
r 2
respectively. Another important example of central force is the Newtonian force for
which
k r k
FD 2 ; U D : (14.4)
r r r
A Newtonian force is said to be attractive if k > 0, repulsive if k < 0.
The torque with respect to the pole O of any central force vanishes. Conse-
quently, the angular momentum KO is constant:
KO D KO .0/; (14.5)
where KO .0/ is the initial value of KO . On the basis of this result we can prove the
following theorem.
Theorem 14.1. The trajectory of a material point P subject to a central force lies
in a plane containing the initial position and velocity of P and orthogonal to the
initial angular momentum KO .0/ (Laplace’s plane). Further, in this plane the areal
velocity of P is constant (Kepler’s second law).
Proof. Since KO is constant, we have that
x yP xy
P D Kz .0/=m; (14.7)
If we introduce polar coordinates .r; '/ with center O into the Laplace plane and
we recall formulae (11.19), which give the components of the acceleration in polar
coordinates, from (14.8) we obtain the following system of ordinary differential
equations:
in the unknowns r.t/ and '.t/. Equation (14.10) again states that the areal velocity
is constant:
1 2 1 1
r 'P D Kz .0/ c: (14.11)
2 2m 2
Introducing (14.11) into (14.9), we obtain the following equation in the unknown
function r.t/:
1 mc 2
rR D f .r/ C 3 : (14.12)
m r
Finally, as the central forces are conservative, the total energy [see (13.43)]
1 2
mPr C U.r/ D E (14.13)
2
14.1 Central Forces 217
2
rP 2 D .E Ueff .r//; (14.15)
m
where we have introduced the effective potential
mc 2
Ueff .r/ D U.r/ C : (14.16)
2r 2
Equation (14.11) shows that, if the initial data are chosen in such a way that
c D 0, then '.t/ D '.0/ for any t > 0 and the trajectory lies on a straight line
containing O. When c ¤ 0, the function '.t/P has a constant sign, so that '.t/ is
always increasing or decreasing, according to the initial data. Consequently, there
exists the inverse function
t D t.'/; (14.17)
and the trajectory, instead of being expressed by the parametric equations r.t/, '.t/,
can be put in the polar form r.'/. To find a differential equation giving the function
r.'/, it is sufficient to note that the formulae
dr c dr d 1
rP D 'P D 2 D c ; (14.18)
d' r d' d' r
drP c drP c 2 d2 1
rR D 'P D 2 D 2 2 ; (14.19)
d' r d' r d' r
where
1
uD :
r.'/
We highlight that (14.12) and (14.20) contain the constant quantity c. In other
words, these equations require a knowledge of the areal velocity or, equivalently,
218 14 Dynamics of a Material Point
of the angular momentum about an axis orthogonal to the plane in which the orbit
lies [see (14.11)] and containing the center of the force. This plane is obtained by
P
giving the initial data r.0/ and '.0/. We can always assume that the initial datum
'.0/ is equal to zero by a convenient choice of the polar axis. On the other hand, to
integrate (14.12), we must also assign the initial datum rP .0/. The data r.0/, '.0/, P
and rP .0/ are also necessary for integrating (14.20) since .dr=d'/.0/ D rP .0/='.0/ P
[see (14.18)].
Weierstrass’ analysis (Chap. 10) can be applied to (14.12) and (14.14) to
determine the qualitative behavior of the function r.t/ as well as the phase portrait
P In contrast, if we are interested in determining the qualitative
in the space .r; r/.
behavior of the function r.'/ and the corresponding phase portrait in the space
.r; dr=d'/, then we apply Weierstrass’s analysis to equations (14.20) and (14.21)
since (14.21) is a first integral of (14.20). We explicitly note that the preceding
phase portrait depends on the choice of the constant c. The different curves of the
phase portrait in the space .r; rP / are obtained by assigning the initial data r.0/, r.0/,
P
whereas the curve of the phase portrait in the space .r; dr=d'/ is determined by
the initial data .r.0/; .dr=d'/.0//. The notebook Weierstrass provides examples of
these phase portraits.
Regarding the central forces we recall the following important theorem.
Theorem 14.2 (Bernard). The elastic forces and the Newtonian forces are the only
central forces for which all bounded orbits are closed.
OP
eD ;
d
is constant. The point O and the straight line a are respectively called the focus
and the directrix of the conic . Applying the preceding definition of a conic to
both cases of Fig. 14.1 and using polar coordinates in the plane , we obtain the
relations
r
eD ;
D r cos.' '0 /
r
eD ;
r cos.' '0 / D
14.2 Newtonian Forces 219
y y
a a d P
d
P
r r
O x O x
eD
rD ; (14.22)
1 C e cos.' '0 /
eD
rD : (14.23)
1 C e cos.' '0 /
d2 u 1
CuD ; (14.24)
d' 2 p
where
mc 2
pD : (14.25)
k
The general integral of the linear equation (14.24), with constant coefficients, is
1
uD C R cos.' '0 /;
p
where R > 0 and '0 are arbitrary constants depending on the initial data. Going
back to the variable r and introducing the quantity
e D jpjR; (14.26)
p s
x
O
Taking into account (14.25)–(14.27), after some calculations, we can prove the
result
k 2 k2
ED .e 1/ D .e 2 1/;
2p 2mc 2
which implies that r
2mc 2
eD 1C E: (14.32)
k2
This formula yields the following table:
Applying Taylor’s expansion to the right-hand side of (14.36) and collecting the
terms that multiply the same power of the eccentricity, we obtain the following
sequence of equations:
'P0 D a; (14.37)
'P1 D 2a cos.at/; (14.38)
'P2 D a cos .at/ 2a sin.at/'1 .t/;
2
(14.39)
Finally, solving the preceding equations with the initial data 'i .0/ D 0, i D
0; 1; 2; : : :, we obtain
3 5
'.t/ D at C 2e sin.at/ C e at C sin.2at/ C :
2
(14.40)
2 4
1 2 k
ED mv 0:
2 0 r
Consequently, the escape velocity is given by
s
2k
v0 D : (14.41)
mr0
Let S be an isolated system of two material points P1 and P2 subject to the action
of the mutual gravitational force. If m1 and m2 denote their masses, the motion of
S is described by the equations
14.3 Two-Body Problem 223
P1
G
r
P2
r1 rG
r2
O
m1 m2
m1 rR 1 D h .r1 r2 /; (14.42)
jr1 r2 j3
m1 m2
m2 rR 2 D h .r2 r1 /; (14.43)
jr2 r1 j3
r D r2 r1 : (14.45)
These equations show that the two-body problem can be solved by determining the
vector functions rG .t/ and r.t/. Since the system S is isolated, we can state that
the center of mass moves uniformly on a straight line; therefore, the function rG .t/
is known when the initial data r1 .0/, rP 1 .0/, r2 .0/, rP 2 .0/ are given [see (14.44)].
Regarding the function r.t/, we note that, dividing (14.42) by m1 and (14.42) by m2
and subtracting the second equation from the first one, we obtain the equation
m2 .m1 C m2 /
m2 rR D h r; (14.48)
jrj3
224 14 Dynamics of a Material Point
Both Eqs. (14.48) and (14.49) show that r.t/ can be determined by solving a
problem of a material point subject to a Newtonian force. In other words, if we
introduce the variables rG and r, the two-body problem is reduced to the problem
of a material point moving under the action of a Newtonian force.
Exercise 14.1. Let P1 x10 x20 x30 be a noninertial frame of reference with the origin at
the point P1 and axes parallel to the axes of an inertial frame Ox1 x2 x3 . Prove that
Newton’s equation relative to the material point P2 in the frame P1 x10 x20 x30 coincides
with (14.48), provided that we take into account the fictitious forces.
In this section, we study the motion of the planets about the Sun. First, we note
that the dimension of the planets is much smaller compared with their distances
from the Sun. Consequently, it is reasonable to model the planetary system with a
system of material points moving under the influence of the mutual gravitational
interaction. Such a problem is too complex to be solved. However, the solar mass
is much greater than the mass of any planet. This circumstance suggests adopting a
simplified model to analyze the motion of the planets. This model is justified by the
following considerations.
The motion of a planet Pi is determined by the gravitational action of the Sun
and the other planets. Consequently, in an inertial frame I , the motion of the planets
Pi of mass mi is a solution of the equation
M mi X mi mj
mi rR i D h r i h rij ; (14.51)
jri j3 jrj3ij
j ¤i
where h is the gravitational constant, M is the solar mass, ri is the position vector
of the planet Pi with respect to the Sun, and rij D ri rj is the position vector of
Pi relative to the planet Pj . Since the solar mass M mi , for any planet, while the
distances jri j and jrij j are of the same order of magnitude, in a first approximation,
we can neglect the action of the other planets on Pi with respect to the prevailing
action of the Sun. In this approximation we are faced with a two-body problem: the
Sun and the single planet. Consequently, to describe the Sun–planet system, we can
14.4 Kepler’s Laws 225
mi M
mi rRi D h ri ; (14.52)
jri j3
according to which, in a first approximation, the planet moves about the Sun as
a material point subject to a Newtonian force. We are in a position to prove the
following Kepler’s laws:
1. The orbit of a planet is an ellipse having a focus occupied by the center of the
Sun.
2. The orbit is described by a constant areal velocity.
3. The ratio between the square of the revolution period Ti and the cube of the
greatest semiaxis ai of the ellipse is given by
Ti2 4 2
3
D ; (14.53)
ai hM
1
ai bi D ci Ti :
2
ci2 Ti2
pi D :
4 2 ai3
Ti2 4 2 mi
3
D
ai ki
which, recalling that for Newtonian forces ki D hmi M , implies the Kepler’s third
law.
The preceding considerations show that Kepler’s laws are not exact laws since
they are obtained by neglecting the mutual actions of the other planets on the motion
of a single planet. Taking into account this influence is a very difficult task since
it requires a more sophisticated formulation of the problem. This more accurate
226 14 Dynamics of a Material Point
analysis shows that the orbits of the planets are more complex since they are open
curves that, roughly, can be regarded as ellipses slowly rotating about the focus
occupied by the Sun.
y
v (i )
s
r
O x
v (e)
and
˛ 1
D p
tan : (14.54)
2 e 1
2
On the other hand, the conservation of total energy and angular momentum is
expressed by the following formulae:
1 2 k 1 2
ED v D m v.i1/ ; (14.55)
2 r 2
p
jKO j D lim jKO j D m s v.i1/ D s 2mE; (14.56)
r>1
k ˛
sD cot : (14.58)
2E 2
The preceding formula is important for the following reasons. Rutherford
proposed the nuclear atomic model according to which an atom is formed by a
small nucleus, containing the entire positive charge of the atom, and electrons
rotating along elliptic orbits with a focus occupied by the nucleus. This model was in
competition with the model proposed by Thomson in which the positive charge was
continuously distributed in a cloud having the dimension of the atom, whereas the
electrons were contained in fixed positions inside this cloud. Rutherford understood
that, to resolve the dispute regarding the correct model, it was sufficient to throw
charged particles (˛ particles) against the atoms contained in a thin gold leaf and to
evaluate the scattering angles of those particles. In fact, Thomson’s model, owing
to the assumed charge distribution inside the atom, implied small scattering angles.
In contrast, the concentrated positive charge of Rutherford’s model implied high
scattering angles, especially for small shock parameters. The experimental results
of Rutherford’s experience were in full agreement with Eq. (14.58), which, in
particular, implies a back scattering for very small shock parameters.
Let P be a mass point acted upon by its weight mg and the air resistance
v
F D R.v/ ; (14.59)
v
228 14 Dynamics of a Material Point
Taking into account the preceding considerations, we can state that the equation
governing the motion of P is
v
mPv D mg R.v/ : (14.62)
v
Since in this section we limit our attention to falls along the vertical a, we begin
by proving that if the initial velocity v0 is directed along a, then the whole trajectory
lies on a vertical straight line. Let Oxyz be a frame of reference with the origin O
at the initial position of P and the axis Oz vertical and downward directed. In this
frame, the components along Ox and Oy of (14.62) are written as
R.v/
xR D P
x; (14.63)
mv
R.v/
yR D P
y: (14.64)
mv
d 2 R.v/ 2
xP D 2 xP 0; (14.65)
dt mv
d 2 R.v/ 2
yP D 2 yP 0; (14.66)
dt mv
so that xP 2 .t/ and yP 2 .t/ are not increasing functions of time. Since initially x.0/
P D
P
y.0/ D 0, we can state that x.t/ P and y.t/
P vanish identically. But in addition, x.0/ D
y.0/ D 0, and then x.t/ D y.t/ D 0, for any value of time, and P moves vertically.
14.6 Vertical Motion of a Heavy Particle in Air 229
It remains to analyze the motion along the vertical axis Oz. To this end, we denote
by k the vertical downward-oriented unit vector and note that
v zP
D k D k; (14.67)
v jPzj
where we must take the sign if zP > 0 (downward motion) and the C sign if zP < 0
(upward motion). Finally, to evaluate the vertical motion of P , we must integrate
the equation
R.jPzj/
zR D mg 1 (14.68)
mg
with the initial conditions
In case (a), we must take the C sign in (14.68) at least up to the instant t for
which zP.t / D 0. In the time interval Œ0; t , we have that zR > 0, zP < 0, so that
dPz2
D 2PzzR < 0;
dt
and the motion is decelerated. Integrating (14.68) in the interval Œ0; t we obtain
the formula Z 0
dPz
D gt ; (14.71)
Pz0 R.jPzj/
1C
mg
and t is finite since the function under the integral is bounded in the interval ŒPz0 ; 0.
In case (b), the motion is initially progressive, so that we take the sign
in (14.68) at least up to an eventual instant t in which zP.t / D 0:
R.Pz/
zR D mg 1 : (14.72)
mg
On the other hand, it is evident that, under hypotheses (14.61) on f .v/, there is one
and only one value V such that
mg D R.V /: (14.73)
Reasoning in the same way, we conclude that in case .i i i / the motion is progressive
and accelerated, and (14.75) still holds.
As an application of the previous analysis, let us consider the case of a parachutist
of mass m for which we require a prefixed limit velocity V , i.e., an impact velocity
with the ground that does not cause damage to the parachutist. From (14.60) we
obtain the cross section A of the parachute in order to attain the requested limit
velocity:
mg
AD :
˛f .V /
In this section, a qualitative analysis of Eq. (14.62) is presented in the general case
in which the direction of the initial velocity v0 is arbitrary. Let Oxyz be a frame
of reference with the origin O at the initial position of P and an upward directed
vertical axis Oz. Further, we suppose that the coordinate plane Oxz contains v0 , and
Ox is directed in such a way as to have xP 0 > 0. Since in this frame yP0 D 0, and the
component of Eq. (14.62) along Oy is still (14.64), we have that y.t/ P D 0, at any
instant, and we can state that the trajectory lies in the vertical plane containing the
initial position and velocity.
P
From (14.63) and the initial condition x.0/ D xP 0 > 0 it follows that the function
jx.t/j
P is not increasing [see (14.65)]. Moreover, there is no instant t such that
P / D 0. In fact, if such an instant existed, we could consider the initial value
x.t
problem given by (14.63) and the initial datum x.t P / D 0. Since the only solution
P
of this problem is given by x.t/ D 0, the motion should take place along the vertical
axis Oz, and this conclusion should be in contradiction with the condition x.0/ P D
xP 0 > 0. In conclusion, x.t/
P is a positive nonincreasing function for any t 0
so that
P
lim x.t/ D xP 1 < xP 0 ; (14.76)
t !1
R
lim x.t/ D 0: (14.77)
t !1
14.7 Curvilinear Motion of a Heavy Particle in Air 231
/2
/2
We have already proved that the trajectory of the material point P is contained
in the vertical plane Oxz. To discover important properties of , we introduce the
unit vector t D v=v tangent to and denote by ˛ the angle that t forms with the
Ox-axis. Then, projecting (14.62) onto the Ox-axis and along t, we have the system
d
m .v cos ˛/ D R.v/ cos ˛; (14.79)
dt
mPv D mg sin ˛ R.v/ (14.80)
in the unknowns ˛.t/ and v.t/. Introducing into (14.79) the value of vP deduced
from (14.80), we obtain the equation
d g
.cos ˛/ D sin 2˛; (14.81)
dt 2v
which can also be written as
g cos ˛
P
˛.t/ D : (14.82)
v
P
Since we refer to the case x.0/ > 0, at the initial instant t D 0 the angle ˛
satisfies the condition
< ˛0 < :
2 2
Then from (14.81) we have that
P
˛.0/ < 0: (14.83)
Consequently, we can state that ˛.t/, starting from any initial value belonging to the
interval .=2; =2/, decreases at least up to an eventual instant t in which the
right-hand side of (14.82) vanishes, that is, when (Fig. 14.5)
232 14 Dynamics of a Material Point
O x
˛.t / D :
2
The value t is infinite since, in the opposite case, if we adopt the initial datum
P / D v.t / cos ˛.t / D 0, the corresponding motion should always take place
x.t
along a vertical line, against the hypothesis that for t D 0 the velocity has a
component along the Ox-axis.
In conclusion, we can state that the trajectory is downward concave [˛.t/
decreases] and has a vertical asymptote (Fig. 14.6).
Finally, we want to prove that, if v.t/ is a regular function of t at infinity, then
where V is the unique solution of (14.73). In fact, the regularity of the positive
function v.t/ implies one of the following possibilities:
In the first case, since v.t/ is positive, either of the following conditions holds:
On the other hand, when limt !1 v.t/ D C1, from (14.80) we obtain limt !1 vP
.t/ D 1, and then (14.85)1 is false.
14.7 Curvilinear Motion of a Heavy Particle in Air 233
P D 0;
lim v.t/
t !1
dv vR.v/
D v tan ˛ C (14.88)
d˛ mg cos ˛
v D F .˛; ˛0 ; v0 /; (14.89)
for given initial data ˛0 and v0 , defines a curve in the plane .˛; v/. When this function
is known, it is possible to integrate the equation of motion. In fact, in view of (14.86),
we have that
dx g cos ˛ dx
v cos ˛ D xP D ˛P D ;
d˛ v d˛
dz g cos ˛ dz
v sin ˛ D zP D ˛P D :
d˛ v d˛
In turn, this system implies
dx v2 F 2 .˛; ˛0 ; v0 /
D D ;
d˛ g g
dz v2 F 2 .˛; ˛0 ; v0 /
D tan ˛ D tan ˛;
d˛ g g
so that
Z ˛
1
xD F 2 .; ˛0 ; v0 /d G.˛; ˛0 ; v0 /; (14.90)
g ˛0
Z ˛
1
zD F 2 .; ˛0 ; v0 / tan ˛ d H.˛; ˛0 ; v0 /: (14.91)
g ˛0
234 14 Dynamics of a Material Point
These relations, which give the parametric equations of the trajectory for any choice
of the initial data, represent the solution of the fundamental ballistic problem since
they make it possible to determine the initial angle ˛0 and the initial velocity v0 to
hit a given target.
R’e
’3 x3
O
x2
O’
’2
Re
’ x1
where
MT m 0
mA D h r (14.94)
jrj03
is the gravitational force acting on P due to the Earth and F is the nongravitational
force.
We define the weight p of P as the opposite of the force Fe we need to apply to
P , so that it remains at rest in the terrestrial frame Re . From (14.93) we have that
!
Fe C mA C m!T2 QP D 0;
where g is the gravitational acceleration. The preceding equation shows that the
weight of P is obtained by adding the gravitational force mA, which is exerted
!
on P by the Earth, and the centrifugal force m!T2 QP . Supposing that the Earth
is spherical and recalling that the first force is always directed toward the center
O 0 of the Earth, whereas the second force is orthogonal to the rotation axis of
the Earth, we can state that the weight reaches its minimum at the equator and its
maximum at the poles. Further, at the poles and at the equator, the direction of p
coincides with the vertical to the surface of the Earth. Finally, we remark that the
direction of g does not depend on the mass of P . This fundamental property follows
from the identification of the masses m appearing in (14.94) and in the centrifugal
!
force m!T2 QP . If we define as gravitational mass mg the mass appearing in the
gravitational force (14.94) and as inertial mass the mass mi appearing in Newton’s
equation of motion, instead of (14.95), we obtain
mi 2 !
gDAC ! QP : (14.96)
mg T
north
y
z
k
west O x east
south
mi and mg , we can assume that mi =mg D 1. This result plays a fundamental role
in general relativity.
With the introduction of the weight (14.95) and in the absence of other forces,
(14.93) becomes
rR 0 D g 2m!T rP 0 : (14.97)
To consider a first application of (14.97), we choose a frame of reference Re as
in Fig. 14.8 and write (14.97) in a convenient nondimensional form.
Let us introduce a reference length L and a reference time interval T . Since the
rotation of the Earth produces much smaller effects on the falls of heavy bodies
than does the weight, we can take L and T in such a way that L=T 2 D g. On the
other hand, we cannot use the same time interval T as a measure of ! since a
turn of the Earth about its axis requires 24 h, whereas a heavy body falls in a few
minutes or seconds depending on the throw height. Consequently, we introduce a
new reference time interval, TT . If we use the same letters to denote dimensional
and nondimensional quantities, (14.97) can also be written as
rR 0 D k C 2 !T rP 0 ; (14.98)
where k is the unit vector along Oz (Fig. 14.8) and is the nondimensional
parameter
T
D :
TT
For a falling time T D 60 s and T D 86; 400 s (24 h) we have ' 0:0007. In other
words, we are faced with a problem to which we can apply Poincaré’s method. Since
in the frame Re k D .0; 0; 1/ and !T D .0; !T cos ; !T sin /, (14.98) gives the
system
14.9 Simple Pendulum 237
mRr D mg C ˆ; (14.99)
where g is the gravitational acceleration and ˆ the reactive force exerted by the
constraint. Projecting (14.99) onto the unit vector t tangent to and recalling that the
curvilinear abscissa s D l', where l is the radius of , and ˆ t D 0 [see (13.51)],
we obtain a second-order differential equation
g
'R D sin ' (14.100)
l
238 14 Dynamics of a Material Point
mg
E
E
E
E
in the unknown '.t/. This equation admits one and only one solution when the
P are given. Since there is no solution in a finite form of the
initial data '.0/ and '.0/
preceding equation, we resort to a qualitative analysis of (14.100). Due to the form
of (14.100), we can resort to the method discussed in Sect. 10.8 to obtain a phase
portrait (Fig. 14.10).
The conservation of the total mechanical energy is written as
1 2 2
ml 'P mgl cos ' D E;
2
14.10 Rotating Simple Pendulum 239
so that we have
2
'P 2 D .E C mgl cos '/: (14.101)
ml 2
Since the potential energy U.'/ D mgl cos ' is periodic, it is sufficient to analyze
the motion for ' 2 Œ; .
The motion is possible if E C mgl cos ' 0. When E D mgl, point P
occupies the position ' D 0 with velocity equal to zero [see (14.101)]. This position
is a stable equilibrium position since the potential energy has a minimum at ' D 0.
For mgl < E < mgl the motion is periodic between the two simple zeros of the
equation
E C mgl cos ' D 0;
z a
Q t y
x
P
mg
m! 2 l 2
U.'/ D mgl cos ' C cos2 ' (14.107)
2
E
E
E
E
and
Therefore, if
g
> 1;
!2l
then U 0 .'/ D 0 if and only if ' D 0; ˙. In this hypothesis, we can state that
the sign of U 0 .'/ [see (14.109)] depends on the sign of sin '. Consequently, U.'/
decreases for ' < 0, increases for ' > 0, and has a minimum when ' D 0 and a
maximum for ' D ˙ (Fig. 14.12).
In contrast, if
g
< 1;
!2l
then U 0 .'/ D 0 when ' D 0; ˙; ˙ arccos !g2 l . Moreover, ' D 0; ˙ are
maxima of U.'/, whereas the angles ˙ arccos.g=! 2 l/ correspond to minima. The
relative phase portrait is shown in Fig. 14.13. This analysis shows that the phase
portrait undergoes a profound change when the ratio g=! 2 l changes in a small
neighborhood of 1. For this reason it is called a bifurcation value.
242 14 Dynamics of a Material Point
Let P be a heavy point moving on the surface S of a smooth sphere with radius l
whose center C is fixed with respect to a terrestrial frame of reference R. We wish
to study the small oscillations of this spherical pendulum relative to a terrestrial
observer R D Oxyz. The axis Oz is chosen to coincide with the vertical of the place
containing the center C of S , and the axes Ox and Oy are horizontal (Figs. 14.14
and 14.15). The equation governing the motion of P relative to a terrestrial observer
R D Oxyz is
rR D g 2!T rP C ˆ; (14.110)
where !T is the angular velocity of the Earth and ˆ is the reactive force exerted on
P by the spherical constraint S with the equation
xR D 2!T yP cos ˛ C 2x
; (14.112)
yR D 2!T zP sin ˛ 2!T xP cos ˛ C 2y
; (14.113)
zR D g 2!T yP sin ˛ 2.l z/
: (14.114)
14.11 Foucault’s Pendulum 243
y
z
C l
O x
We are interested in the small oscillations of the spherical pendulum about the
position ' D 0. This circumstance simplifies the study of Eqs. (14.112)–(14.114),
as we can see by a nondimensional analysis. Introduce as the length of reference
the radius l of the sphere S , and denote by ' the angle corresponding to the largest
oscillation; then we have that (Fig. 14.16)
z
C
l
ϕ
S 2 y
}lϕ
lϕ mg
r D r0 ;
rP D rP 0 ! r0 ;
rR D rR 0 C a 2! rP 0 D rR 0 ! 2 r0 2! rP 0 :
Thisqequation shows that in the frame R0 , the pendulum oscillates with a period
2= gl C ! 2 , whereas its plane of oscillation rotates with angular velocity !
about the vertical axis Oz.
14.12 Exercises
1. Neglecting the resistance of the air, prove that the velocity of a point particle
projected upward in a direction inclined at 60 to the horizontal is half of its
initial velocity when it arrives at its greatest height.
2. Neglecting the resistance of the air, find the greatest distance that a stone can
be thrown with initial velocity v0 .
3. A simple pendulum executes small oscillations in a resisting medium with
damping h'. P Determine the qualitative behavior of the motion and the loss
of energy after n oscillations.
4. Supposing the Earth to be spherical, with what velocity must a projectile be
fired from the Earth’s surface for its trajectory to be an ellipse with major axis
a?
5. A bomb is dropped from an airplane flying horizontally at height h and with
speed v. Assuming a linear law of resistance mgkv, show that, if k is small,
the fall time is approximately
s s !
2h 1 2h
1 C gk :
g 6 g
246 14 Dynamics of a Material Point
b c
f .r/ D 2
C 4;
r r
where a and b are positive constants and r is the distance from the center.
Analyze qualitatively the motion, prove the existence of a circular orbit, and
determine its radius.
7. Repeat the analysis required in the preceding exercise for a particle moving
under the action of the central force
f .r/ D kr 2 er :
2
k
f .r/ D ;
r3
gives the mass of the arbitrary measurable region c C . The function .r0 /, which
is called the mass density of the body B, could exhibit finite discontinuity across
some surfaces contained in B.
We call the point G 2 C whose position vector rG relative to R0 is defined by
the formula Z
m.C /rG D .r0 /r0 dc (15.2)
C
the center of mass.
Exercise 15.1. Prove that the center of mass is a characteristic point of the rigid
body B, i.e., it is independent of the origin of the body frame.
The velocity field, which in a rigid motion has the form (12.17)
can be considered as depending on the time t and the position vector r in the lab
frame or as a function of t and the position vector r0 in the body frame. In all of the
following formulae, it is supposed to be a function of t, r0 .
In this section, we extend to a rigid body the results of Sects. 13.6 and 13.7
relative to systems of material points. To this end, besides the lab and body frames
R and R0 , we need to introduce a third frame of reference RG D .Gx 1 x 2 x 3 / having
a translational motion relative to R with velocity rP G . As in Sect. 13.7, we call the
motion of B relative to RG the motion about the center of mass. Any quantity a
related to RG will be denoted by a.
We call linear momentum the vector
Z
QD Pr dc (15.3)
C
Since the region C does not depend on time owing to the rigidity of B,
differentiating (15.2) with respect to time, we obtain
Q D mPrG : (15.5)
Q D 0: (15.6)
KG D KG : (15.7)
15.3 Tensor of Inertia 249
The angular momentum relative to RG does not depend on the pole; that is,
KO D KG (15.8)
In view of (15.6), the integral on the right-hand side of the preceding equation
vanishes and the theorem is proved. t
u
We conclude this section defining the kinetic energy of the body B as follows:
Z
1
T D Pr2 dc: (15.10)
2 C
1
T D T C mPr2G : (15.11)
2
The quantities we introduced in the preceding section refer to any possible rigid
motion, but we have not yet used the explicit expression of the rigid velocity
field. In this section, we determine useful expressions of angular momentum and
kinetic energy that explicitly take into account the relation existing in a rigid motion
between the angular velocity and the velocity of any point.
250 15 General Principles of Rigid Body Dynamics
(1) Let B be a rigid body with a fixed point O in the frame of reference R. It is
always possible to take O coinciding both with the origin of the lab frame R and the
!
origin of the body frame R0 . With this choice of the frames of reference, OP D r
and the velocity field is written as rP D ! r. Consequently, (15.4) gives
Z
KO D r .! r/ dc
C
Z
D .jrj2 ! .r !/r/ dc:
C
KO D IO !; (15.12)
so that IO is symmetric.
When a rigid motion with a fixed point O and its angular velocity ! are given, it
is possible to evaluate not only the angular momentum KO by (15.12) but also the
kinetic energy. In fact, introducing the velocity rP D ! r into (15.10), we obtain
Z Z
1 1
T D ! r rP dc D r rP ! dc:
2 C 2 C
If we note that the angular velocity does not depend on the point of C and we take
into account (15.4) and (15.12), then the preceding relation can also be written as
1 1
T D KO ! D ! IO !: (15.16)
2 2
The kinetic energy is a positive quantity (T 0) that vanishes if and only if the
velocity rP D ! r vanishes identically. In turn, this condition is equivalent to
requiring that ! D 0. In conclusion, in view of (15.16), we can state that the tensor
of inertia is symmetric and positive definite.
15.3 Tensor of Inertia 251
The quantities KO and ! in relations (15.12) and (15.16) are relative to the
lab frame R. However, since they are vector quantities, we can evaluate their
components in any frame of reference. Consistent advantages are obtained if (15.12)
and (15.16) are projected in the body frame R0 . In fact, with respect to the basis .e0i /
of the unit vectors directed along the axes of R0 , we have
0 1 0 0 0
KOi D Iij0 !ij0 ; T D I !! ; (15.17)
2 ij i j
where the components of the tensor of inertia are
Z
Iij0 D .r 02 ıij xi0 xj0 / dc: (15.18)
C
In the body frame R0 the components Iij0 are independent of time since they only
depend on the geometry and mass distribution of the body B. On the other hand,
we have already noted that the 2-tensor IO is symmetric and positive definite.
Consequently, the eigenvalues A, B, and C of IO are positive, and there exists at
least an orthonormal basis .e0i / formed by eigenvectors of IO . These vectors are at
rest in R0 , i.e., relative to the body B. Any frame of reference that is formed by an
arbitrary point O and axes parallel to eigenvectors of IO is called a principal frame
of inertia relative to O, whereas the eigenvalues A, B, and C of IO are the principal
moments of inertia relative to O. In particular, if O is the center of mass of S , then
we substitute the attribute principal with central.
In a principal frame of inertia, the tensor of inertia is represented by the following
diagonal matrix:
0 1
A 0 0
@ 0 B 0 A; (15.19)
0 0 C
so that (15.12) and (15.16) become
1 2
T D Ap C Bq 2 C C r 2 ; (15.21)
2
where the eigenvalues A, B, and C are constant.
We now determine the expressions of KO nd T when the solid B has a fixed axis
a or is free from any constraint.
(a) In the first case, provided that we choose the lab frame R and the body frame
R0 as in Fig. 12.2, we have that ! D 'eP 03 . Consequently, for any O 2 a, (15.18)
gives
0 0 0 0 0 0 1 0 2
KO D '.IP 13 e1 C I23 e2 C I33 e3 /; T D I33 'P : (15.22)
2
252 15 General Principles of Rigid Body Dynamics
IO e03 D C e03
or, equivalently,
0 0 0 0
10 1 0 1
I11 I12 I13 0 0
@I0 I0 I0 A@0A D C @0A:
12 22 23
0 0
I13 I23 C 1 1
This equation implies that
0 0
I13 D I23 D 0;
and finally we can write (15.22) in the following simple form:
P 03 ;
KO D C 'e (15.23)
At first sight, it seems that we have to evaluate the tensor of inertia IO every time
we change the fixed point O. In this section we show that there exists a simple
relation between the tensor IG and the tensor of inertia IO relative to any point O.
Further, the evaluation of IG becomes simpler when the rigid body B possesses some
symmetries.
Denoting the position vector r of any point P of B in the lab frame R by rG , the
position vector of the center of mass of B in R, and by r0 the position vector of P
0
in the body frame RO , we have that r D rG C r0 , and relation (15.13) can also be
written as follows:
15.4 Some Properties of the Tensor of Inertia 253
Z
IO D Œ.jrG j2 C jrj02 C 2rG r0 /1 .rG C r0 / ˝ .rG C r0 / dc:
C
Since Z Z
IG D .jrG j2 1 r0 ˝ r0 / dc; r0 dc;
C C
we obtain the following formula:
IO D IG C m.jrG j2 1 rG ˝ rG /; (15.27)
where ıi is the distance between Oxi and Gx i ; moreover, the products of inertia
are modified according to the rule
Finally, if .G; x i / is a principal frame of inertia, then any frame .O; xi /, where
O 2 x i and the axes are parallel to the corresponding axes of .G; x i /, is a principal
frame of inertia.
Proof. Relations (15.28) and (15.29) follow at once from (15.27). Equation (15.30)
follows from (15.29) when we note that two coordinates of G vanish when one of
the axes Oxi contains G. Finally, (15.30) follows from (15.29) since, when .G; x i /
is a central frame of inertia, I Gij D 0 for i ¤ j . t
u
To determine IG , we arbitrarily fix a frame of reference .G; x i / and we evaluate
the components of IG in this frame by (15.14) and (15.15). Then we solve the
eigenvalue equation
I Gij uj D ui D ıij uj
to find the eigenvalues and eigenvectors of IG , i.e., the central frames of inertia.
The following theorem shows that the last problem can be simplified if the body
B exhibits some symmetries with respect to the center of mass.
254 15 General Principles of Rigid Body Dynamics
We define the moment of inertia of a rigid body B relative to the straight line a by
the quantity Z
Ia D ı 2 .r/ dc; (15.31)
C
where ı denotes the distance of an arbitrary point r of B from the straight line a
(Fig. 15.1). Denoting by u a unit vector directed along a, we have that
Ia D u IO u: (15.32)
15.5 Ellipsoid of Inertia 255
u
O
r
x x
a
IO D IOij i j D 1: (15.34)
The positive-definite character of the tensor of inertia IO implies that (15.33) defines
an ellipsoid E, which is called the ellipsoid of inertia relative to O. In particular,
if O G, then E is called the central ellipsoid of inertia. In view of (15.32), the
moment of inertia of B relative to any axis a containing the center O of the ellipsoid
is given by the formula
1
Ia D : (15.35)
jj2
In other words, the ellipsoid of inertia relative to O allows one to determine
geometrically the moment of inertia of B with respect to any axis containing O.
Recalling the theorems we have proved regarding the dependence on O of the tensor
IO , we understand that knowledge of the central ellipsoid of inertia is sufficient to
determine the ellipsoid of inertia relative to any other point O. If the unit vectors
.ei / along the axes of the frame of reference .O; xi / are eigenvectors of IO , then
(15.35) becomes
Ax12 C Bx22 C C x32 D 1; (15.36)
where A, B, and C are the corresponding eigenvalues. When A D B, the ellipsoid
E is an ellipsoid of revolution about Ox3 ; finally, it reduces to a sphere when A D
B D C.
256 15 General Principles of Rigid Body Dynamics
where a is the axis containing O and parallel to !. In view of (15.33), the vector
D puI is the position vector of the intersection point P of the ellipsoid E with
a
axis a; therefore, we have that
p
KO D Ia j!jIO :
The forces acting on a rigid body B can be divided into two classes: active forces
and reactive forces. A force belongs to the first class if we can assign a priori the
force law, i.e., the way in which the force depends on the position of B, its velocity
field, and time. These forces can be concentrated forces if they are applied at a
point of the body B and mass forces if they act on the whole volume occupied by B.
Consequently, the total force and the total torque relative to a pole O of the active
force can be written as
Z X
p
.a/
R .a/
D b dc C Fi ; (15.38)
C i D1
Z X ! p
.a/ ! .a/
MO D OP b dc C OP i Fi ; (15.39)
C i D1
.a/
where is the mass density of B, b is the force per unit mass, and Fi , i D 1; : : : ; p,
are the concentrated forces acting at the points P1 ; : : : ; Pn of B.
The reactive forces are due to the contact between B and the external obstacles
(constraints). In turn, these forces are determined by the very small deformations
15.6 Active and Reactive Forces 257
that both B and the constraints undergo in very small regions around the contact
surfaces. The hypothesis that both B and the constraints are rigid eliminates the
preceding small deformations so that it is impossible to determine the force law of
the reactive forces. We suppose that the reactive forces can be distributed on the
surfaces 1 ; : : : ; m of B as well as on very small areas around the points Q1 ; : : : Qs
of contact between B and the constraints. Then we assume that the reactive forces
acting on these small areas can be described by a total force ˆ i and a couple with a
moment i , which is called rolling friction. As a consequence of these assumptions,
we have that the total reactive force and torque can be written as follows:
m Z
X X
s
R .r/
D t d C ˆi ; (15.40)
i D1 i i D1
m Z
X X ! s
.r/ !
MO D OP t dc C .OQi ˆ i C i /; (15.41)
1 i i D1
jˆ k j s jˆ ? j; (15.42)
j k j hs jˆ ? j; (15.43)
j ? j ks jˆ? j; (15.44)
where the constants 0 < s ; hs ; ks < 1, depending on the contact between the body
and the constraints, are called static friction coefficients.
Definition 15.3. Rough constraints in dynamic conditions are such that
rP
jˆ k j D d jˆ ? j ; (15.45)
jPrj
!k
j k j D hd jˆ ? j ; (15.46)
j!k j
!?
j ? j D kd jˆ ? j ; (15.47)
j!? j
where rP is the relative velocity of the contact point of B with respect to the con-
straint, and !k and !? denote respectively the tangential and normal components
of the relative angular velocity ! of B with respect to the constraint. The coefficients
0 < d < s < 1, hd , and kd have the same meaning as in the static case but are
now called the dynamic friction coefficients.
In the preceding section, the characteristics of the forces acting on a rigid body
B were given. In this section, we introduce the balance equations governing its
dynamics.
It is reasonable to assume that, owing to the hypothesis of rigidity, the internal
forces, acting between parts of B, do not influence the motion. Consequently, they
must not appear in the dynamic equations describing the effect of the acting forces
on the motion of B. Now, the equations possessing this characteristic are the balance
equations of linear and angular momentum, (13.28) and (13.32). Although these
equations were proved for a system of mass points, it is quite natural to require
their validity for a rigid body that is a collection of the elementary masses dc.
Therefore, we postulate that the dynamical behavior of a rigid body B is governed
by the following balance equations:
where m is the total mass of B and G its center of mass, O coincides with G or with
an arbitrary fixed pole, and the angular momentum is expressed by (15.25). In the
next chapter, we prove that the preceding balance equations allow us to determine
the motion of a rigid body B when it has a fixed axis or a fixed point or is free of
constraints.
We conclude this section by proving an important theorem that holds during the
motion of any rigid body regardless of the constraints to which it is subjected.
Let us suppose that B moves relative to the frame of reference .O; xi / under the
action of the forces F1 ; : : : ; Fs applied at the points P1 ; : : : ; Ps of B. Then, taking
into account that the motion of B is rigid, the power W of these forces is
X
s X
s
!
W D Fi rP i D Fi .Pr C ! P i /
i D1 i D1
! !
X
s X
s
!
D Fi rP C Pi Fi !;
i D1 i D1
where is an arbitrary point of B and ! is the angular velocity of the rigid motion.
Introducing the total force R and torque MO acting on B, we can put the preceding
relation in the form
W D R rP C MO !;P (15.50)
which shows that the power of the forces acting on B depends only on their total
force and torque.
Denoting by T the kinetic energy of B, we prove the following formula, which
expresses the theorem of kinetic energy:
TP D W: (15.51)
P G D Ape
K P 01 C B qe
P 02 C C rP e03 C ! KG :
P G ! D Ap pP C Bq qP C C r rP D dT ;
K
dt
and (15.51) is proved.
260 15 General Principles of Rigid Body Dynamics
15.8 Exercises
1. A cannon resting on a rough horizontal plane is fired, and the muzzle velocity of
the projectile with respect to the cannon is v. If m1 is the mass of the cannon and
m2 the mass of the projectile, the mass of the powder being negligible, and is
the coefficient of friction, show that the distance of recoil of the cannon is
2
m2 v 1
;
m1 C m2 2 g
3. If a shell at rest explodes and breaks into two fragments, show that the two
fragments move in opposite directions along the same straight line with speeds
that are inversely proportional to their masses.
4. Prove that, if is the coefficient of friction, the couple necessary to set into
rotation a right circular cylinder of radius r and weight mg that is standing on its
base on a rough horizontal plane is 23 mgr .
5. A uniform disk of radius a that is spinning with angular speed ! about a vertical
axis is placed upon a horizontal table. If the coefficient of the friction between
the surface of the disk and the plane is , determine when the disk stops.
6. A heavy bar’s end points A and B are moving, respectively, along the axes Ox
and Oy of a vertical plane . Neglecting the friction of the constraints, determine
the equations of motion of the bar and the reactive forces at A and B using the
balance equations.
Chapter 16
Dynamics of a Rigid Body
Consider a rigid body B with a fixed axis that should be smooth. Figure 16.1 shows
two devices forming a fixed axis r: two spherical hinges and a spherical hinge
coupled with a cylindrical one. If the contact surfaces between the moving parts
of these devices are smooth, then the reactive forces are orthogonal to them and, at
least in the devices shown, their application straight lines intersect the fixed axis r.
.r/
Consequently, the component M3 of the total reactive force vanishes:
.r/
M3 D 0: (16.1)
.a/
I33 'R D M3 ; (16.2)
position of B is determined by the angle ', and the velocity field is given by the
!
relation rP D 'e
P 3 r, where r D OP . Consequently, (16.2) assumes the final form
.a/
I33 'R D M3 .'; ';
P t/; (16.3)
which determines one and only one motion satisfying the initial conditions (t D 0)
'.0/ D '0 ; P
'.0/ D 'P0 : (16.4)
16.2 Compound Pendulum 263
g
'R D sin '; (16.5)
l
where
I33
lD (16.6)
mh
is called the reduced length of the pendulum. Equation (16.6) shows that a
compound pendulum moves as a simple pendulum whose length is equal to the
reduced length of the compound pendulum.
x h
mg
x’
x
Let B be a rigid body constrained to moving around a smooth fixed point O. If the
constraint is realized with a spherical smooth hinge, then the reactive forces are
orthogonal to the contact spherical surface. Consequently, their application straight
lines contain point O and the total reactive torque relative to O vanishes:
.r/
MO D 0: (16.7)
P O D M.a/ :
K (16.8)
O
.a/
The torque MO of the active forces must depend on the position of B, its velocity
!
field, and time. Since the velocity of any point r D OP is given by the formula
rP D ! r, where ! is the angular velocity of the rigid motion, we can state that
.a/ .a/
MO D MO . ; '; ; p; q; r; t/; (16.9)
where p, q, and r are the components of ! in the body frame. On the other hand,
we have proved that the angular momentum relative to the lab frame of a solid with
a fixed point, when it is projected onto the body frame, is given by (15.20). The time
derivative of KO , in view of (12.42), is given by
P O D Ape
K P 01 C B qe
P 02 C C rP e03 C ! KO : (16.10)
P 01 C B qe
P 02 C C rP e03 C ! KO D MO . ; '; ; p; q; r; t/:
.a/
Ape (16.11)
Finally, projecting this equation along the axes of the body frame (which are
principal axes of inertia relative to the fixed point O), we obtain the Euler equations
16.4 Poinsot’s Motions 265
These three equations, since they contain the six unknowns .t/, '.t/, .t/, p.t/,
q.t/, and r.t/, cannot determine the motion of B. However, if we take into account
the Euler kinematic relations (12.32)–(12.34),
P D 1
.p sin ' C q cos '/; (16.15)
sin
'P D .p sin ' C q cos '/ cot C r; (16.16)
P D p cos ' q sin '; (16.17)
we obtain a system that, under general hypotheses of the regularity of the functions
appearing on the right-hand side of (16.12)–(16.14), admits one and only one
solution satisfying the initial data
We call free rotations or Poinsot’s motions all motions of a solid B with a smooth
.a/
fixed point O in the absence of the total active torque MO :
.a/
MO D 0: (16.20)
In other words, these motions correspond to the solutions of Euler’s equations when
their right-hand sides vanish. Also in this case, it is not possible to find the closed
form of solutions. However, Poinsot (1851) proposed a very interesting geometric
qualitative analysis of free rotations. As a result of this analysis, we are able to
determine the configurations that B assumes during the motion, although we are
not able to give the instant at which B occupies one of them.
266 16 Dynamics of a Rigid Body
which follow at once from (15.49) and (15.51) and hypothesis (16.20).
Let EO be the ellipsoid of inertia of B relative to point O, and denote by r.t/ the
instantaneous rotation axis, that is, the straight line parallel to the angular velocity
! j!ju and containing O. Finally, we denote by P .t/ the intersection point of
r.t/ and E (Fig. 16.5).
We have proved [see (15.37)] that the angular momentum KO is orthogonal to the
plane tangent to the ellipsoid of inertia at the point P .t/. Since, in view of (16.21),
KO is a constant vector, we can state that during motion this tangent plane remains
parallel to itself. Now we prove that the conservation of kinetic energy implies that
the distance h of from O is constant during the motion. First, we note that
! KO
h D OP : (16.22)
jKO j
Further, the point P .t/ 2 E confirms the equation of the ellipsoid so that, in view
of (15.33), we can write
u KO ! KO 2T
hD p D p D p : (16.23)
Ir jKO j j!j Ir jKO j j!j Ir jKO j
2T D ! IO ! D j!j2 u IO u D j!j2 Ir ;
p
2T .0/
hD ; (16.24)
jKO .0/j
r(t)
E
KO
O
polhode
h
P( t) herpholode
fixed plane
T
Since P .t/ 2 E r.t/, its velocity is equal to zero. Finally, we can state that during
a Poinsot motion of B about O, the ellipsoid of inertia E rolls, without slipping, on
the fixed plane .
During motion, the point P .t/ describes a curve on E, which is called a polhode,
and a curve on , known as a herpolhode (Fig. 16.4). It is possible to prove that
the polhode is a closed curve on E, which may reduce to a point. In contrast, the
herpolhode could be an open curve. More precisely, consider the arc described by
P .t/ during a complete turn on the ellipsoid E, and let ˛ be the angle subtended by
having the vertex in the orthogonal projection A of O on . If ˛ D 2.m=n/,
where m and n are integers, then the herpolhode is closed; otherwise it is open. It
is also possible to prove that the herpolhode is contained in an annulus whose
center is A, is always concave toward A, and, when it is open, is everywhere dense
in . In particular, if the initial angular velocity !.0/ is parallel to a principal axis
of inertia, then the angular momentum KO is always parallel to !.0/ and h is equal
to the half-length of the axis of E. Consequently, the polhode and the herpolhode
reduce to a point. If the ellipsoid E has a symmetry of revolution around an axis
a, then the polhode and the herpolhode become two circles. Finally, if E reduces
to a sphere, the polhode and the herpolhode reduce to a point and the motion is
always a uniform rotation. With the notebook MotiPoin it is possible to simulate all
the preceding cases. Figure 16.5 shows a herpolhode obtained with this notebook
corresponding to the data A D 1, B D 1:5, C D 0:5, r0 D 3, and .0/ D =4 in an
interval of motion of 15 s.
268 16 Dynamics of a Rigid Body
0.6
0.4
0.2
0.2
0.4
0.6
Let B be a solid with a smooth fixed point O. In this section, we prove that, among
all the possible Poinsot motions, there are uniform rotations about certain fixed axes
containing O. First we note that the condition ! D const: must be considered in
the lab frame in which we are searching for uniform rotations. However, in view of
(12.42), we have
da ! dr ! dr !
D C!!D : (16.25)
dt dt dt
In other words, if a uniform rotation is possible in the lab frame, then this rotation
is also uniform in the body frame, i.e., the components p, q, and r of ! along the
axes of a principal frame of inertia in the body are constant. In view of Euler’s
equations (16.12)–(16.14), these components must satisfy the following algebraic
system:
.B C /qr D 0; (16.26)
.C A/rp D 0; (16.27)
.A B/pq D 0: (16.28)
• If the eigenvalues A, B, and C differ from each other, then system (16.26)–
(16.28) becomes
qr D pr D pq D 0;
and its solutions are
fp D p0 ; q D r D 0g; fp D r D 0; q D q0 g; fp D q D 0; r D r0 g:
pr D qr D 0;
fp D p0 ; q D q0 ; r D 0g; fp D q D 0; r D r0 g:
with a complex structure (cardan joint) that allows for the complete mobility of B
about O.
From the properties we proved about the tensor of inertia in Chap. 15 we know
that a body frame whose axes are principal axes of inertia can be obtained by adding
to the axis Ox30 D a any arbitrary pair of axes Ox10 and Ox20 that are orthogonal
to each other and to Ox30 . Moreover, any other frame .x100 ; x200 ; x300 /, where 2 a
and Ox10 is parallel to x100 , Ox20 is parallel to x200 , and Ox30 x100 , is again a
principal frame of inertia. Finally, since A D B for the central tensor of inertia, the
corresponding ellipsoid of inertia E has a symmetry of revolution about a, and these
properties hold for all the tensors of inertia and corresponding ellipsoids relative to
any other point of a.
After recalling these properties, we are now in a position to evaluate Poinsot’s
motions of a gyroscope with a smooth fixed point O belonging to the gyroscopic
axis a. Since we have A D B and MO D 0, from the third Euler equation (16.14)
we derive the following conservation law:
r D r0 ; (16.29)
which states that in a free rotation of a gyroscope the component of the angular
velocity along the gyroscopic axis is constant. Further, the angular momentum is
constant. Collecting these results, we can write that
KO D A pe01 C qe02 C C r0 e03 D KO .0/; (16.30)
16.7 Heavy Gyroscope 271
where the unit vectors e0i are directed along the principal axes of the body frame. By
the preceding relation, we can write the angular velocity ! D pe01 C qe02 C re03 in
the following form:
! D !1 C !2 ; (16.31)
where
C 1
!1 D 1 r0 e03 ; !2 D KO : (16.32)
A A
The preceding relations show that any Poinsot motion of a gyroscope is a spherical
motion obtained by composing a uniform rotation about a fixed axis containing O
and parallel to KO , and a uniform rotation about the gyroscopic axis.
x
x’
x’
G
h
G
e’
p x
e’ e’
n
x’
First, since A D B and the component Mx30 D hG e03 e03 D 0, the third Euler
equation still implies the first conservation law r D r0 .
Second, the total energy is constant. In fact, in view of (15.50), we have that the
power of the reactive forces vanishes,
.r/
W .r/ D R.r/ rP 0 C MO ! D 0;
.r/
since rP 0 D 0 and MO D 0. On the other hand, the power W .a/ of the active forces
is given by
d d
p rP G D .p rG / D .phG cos /:
dt dt
Taking into account the theorem of kinetic energy (15.51), we obtain conservation
of the total energy E:
T C phG cos D E; (16.34)
which, by expression (15.21) of the kinetic energy of a solid with a fixed point and
the conditions A D B and r D r0 , assumes the form
A p 2 C q 2 C C r02 C 2phG cos D 2E: (16.35)
Finally, the component M3 of the active torque along the vertical axis Ox3
vanishes since M3 D hG e03 p e3 D 0. Consequently, the component KO e3
of the angular momentum along the vertical axis Ox3 is constant. If we denote by
i the components of e3 in the body frame, this last conservation law can be written
as follows:
A.p1 C q2 / C C r0 3 D K3 .0/: (16.36)
The algebraic equations (16.35) and (16.36) in the unknowns p and q can replace
Euler’s equations to great advantage. However, (16.36) contains the components i
of e3 along the axes of the body frame. In terms of Euler’s angles, these components
are given by the following relations:
The presence of Euler’s angles in these relations requires the use of the Euler
kinematic relations (Sect. 12.4):
p 2 C q 2 D P 2 sin2 C P 2 : (16.43)
Introducing this equation into (16.35) and relations (16.37)–(16.41) into (16.36) and
taking into account (16.42), we can write the following system:
where
2E C r02 2phG
˛D ; aD ; (16.47)
A A
K3 .0/ C
ˇD ; bD > 0: (16.48)
A A
The first of the preceding equations can be written in a more convenient form by
introducing the new variable u D cos . In fact, since
f(u)
- u u
u
P D ˇ br0 u ; (16.55)
1 u2
ˇ br0 u
'P D r0 u: (16.56)
1 u2
A complete integration of the preceding system is impossible without resorting
to numerical methods. However, a qualitative analysis of the solution can be
developed.
Consider the unit sphere † (Fig. 16.7) with its center at O and the curve that
the gyroscopic axis Ox30 draws on † during the motion. When we note that both
axes Ox3 and Ox30 are orthogonal to the line of nodes, identified in Fig. 16.7 by
the unit vector n, we deduce that the angle between the plane Ox3 x30 and the fixed
axis Ox1 results in ı D =2. Consequently, in spherical coordinates, the curve
can be described in terms of the angles ı and . Remembering that u D cos ,
(16.54) and (16.55) represent a system of differential equations in the unknown
parametric equations of the curve , i.e., in the functions ı.t/ and .t/. Finally,
(16.56) describes how the gyroscope rotates about its axis.
We do not prove that the relations existing among the quantities ˛, ˇ, a, and b
imply that the three roots of the equation f .u/ D 0 are real. In view of (16.54) and
the meaning of u, we must have
f .u/ 0; 1 u 1: (16.57)
The preceding conditions imply that the two acceptable roots u1 and u2 of the
equation f .u/ D 0 belong to the interval Œ1; 1, that is,
1 u2 u1 1: (16.60)
Equivalently, since u D cos , we can state that during the motion of the gyroscope
the angle satisfies the condition
1 2 : (16.61)
This shows that the curve described on the unit sphere † is always contained
between the parallel D 1 and D 2 . In particular, if the initial data are such
that 1 D 2 , then reduces to the parallel D 1 D 2 and, in view of (16.55)
and (16.56), both P and 'P are constant. In other words, if 1 D 2 , the gyroscopic
axis moves around the vertical with constant slope 1 , constant precession velocity
P , and proper angular velocity '.P
If 1 ¤ 2 , then the motion of the gyroscopic axis is called nutation. Denoting
ˇ
uD ; D arcsin u; (16.62)
br0
• … Œ1 ; 2 ;
then (16.55) shows that P always has the same sign. Moreover, for D 1 and
D 2 , P D 0 and is tangent to the aforementioned parallels.
• 2 .1 ; 2 /;
then P D 0 when D and the curve is perpendicular to the parallel D
. Moreover, P has a definite sign when 2 .1 ; / and an opposite but still
definite sign if 2 .; 2 /. This means that .t/ increases in the first interval
and decreases in the second one or vice versa. Moreover, for D 1 and D 2 ,
we have P D 0 and the curve is tangent to the parallels D 1 and D 2 .
• D ;
for D 2 both P and P vanish, but P has a constant sign at any other point. It
is possible to prove that exhibits cusps when D D 1 .
Fig. 16.9 A D B D 1, C D 0:5, p.0/ D 0:3, q0 D 0:5, r.0/ D 3:5, .0/ D =4
Fig. 16.10 A D B D 1, C D 1:2, p.0/ D 0:3, q0 D 0:5, r.0/ D 3:5, .0/ D =4
In other words, an initial rotation r0 is impressed about the gyroscopic axis that
forms an angle 0 with the vertical. With the preceding initial conditions, taking
into account (16.35), the constants (16.47) and (16.48) become
16.8 Some Remarks on the Heavy Gyroscope 277
2E C r02 2P hG
˛D D cos 0 D au0 ; (16.64)
A A
ˇ D br0 cos 0 D br0 u0 : (16.65)
the motion of the gyroscope corresponds to case .c/ of the previous section. The
other solution u2 in the interval Œ1; 1 is the root of the equation
from which, in view of (16.67), we derive the following expression for the difference
of the two roots:
a.1 u22 /
u1 u2 D : (16.69)
b 2 r02
The preceding difference is positive since u1 ; u2 2 Œ1; 1, so that we have u2 < u1 .
But during the motion, u1 u u2 , and consequently u1 u u1 u2 . Bearing in
mind these remarks, (16.55) supplies
u1 u u1 u2
0 P D br0 br0 :
1u 2 1 u2
278 16 Dynamics of a Rigid Body
Fig. 16.12 A D B D 1, C D 0:5, p.0/ D 0:3, q0 D 0:5, r.0/ D 3:5, .0/ D =4
a.1 u22 /
A.p 2 C q 2 / D 2phG .u0 u/ < 2phG .u1 u2 / :
b 2 r02
The preceding remarks prove that, for the initial conditions (16.63) and high
values of r0 , the motion of a gyroscope is approximatively a slow regular precession
around a vertical. Since jj P increases with r0 , during the low precession, the
gyroscopic axis vibrates with high frequency (Fig. 16.12).
16.8 Some Remarks on the Heavy Gyroscope 279
This result can also be obtained supposing that the motion of the heavy
gyroscope, corresponding to the initial data (16.63) and high values of r0 , is
described by the equation [see (16.71)]
C ro eP 03 D hG e03 p: (16.72)
In fact, denoting by ! the angular velocity of the gyroscope and recalling (12.41),
the preceding equation can also be written as
hG
! p e03 D 0;
C r0
so that
hG
! D e03 C p;
C r0
where the approximate value of is
hG
D r0 C p e03 ' r0 :
C r0
hG
! D r0 e03 p: (16.73)
C r0
In the previous sections, we considered the motion of a solid B with a smooth fixed
point when the torque MO relative to O of the acting forces vanishes or is equal
to the torque of the weight. It is evident that the lab frame Ox1 x2 x3 , in which we
analyze the motion of B, is at rest with respect to the Earth, i.e., Ox1 x2 x3 is a
terrestrial frame. This implies that, in the torque of the acting forces, we should
also include the torque of the fictitious forces due to the noninertial character of the
terrestrial frame.
In this section, we evaluate the torque of these forces with respect to the center
of mass G of a gyroscope B. Let Ox1 x2 x3 be the lab frame, which is at rest relative
to the Earth. If we analyze the motion of B during a time interval which is much
shorter than a month, we can suppose that the motion of the lab frame with respect
to an inertial frame is a uniform rotation with angular velocity !T . Then, if C is the
region occupied by B and
the mass density of B, then the torque relative to G of
the fictitious forces due to the Earth’s rotation is
Z
.t / ! !
MG D !T 2
GP QP dc; (16.75)
C
For the next applications, it will be sufficient to consider the center of mass at rest in
!
a lab frame. In this case, rP P D ! GP and (16.77) can also be written as follows:
Z
.c/ ! h ! i
MG D 2
GP !T .! GP / dc: (16.78)
C
Denoting by !T0 i the components of !T in a central frame of inertia Gx10 x20 x30 , we
have that
Z Z
! !
!T GP GP dcD
!T0 1 x10 C!T0 2 x20 C!T0 3 x30 x10 e01 Cx20 e02 Cx30 e03 dc:
C C
Since the body frame Gx10 x20 x30 is a central frame of inertia, the products of inertia
vanish and the preceding relation reduces to the following one:
Z Z Z
! !
!T GP GP dc D !T0 1 e01
x102 dc C !T0 2 e02
x202 dc
C C C
Z
C!T0 3 e03
x302 dc: (16.80)
C
so that
Z Z
C
x102 dc D
x202 dc D : (16.81)
C C 2
Z
C
x302 dc D A : (16.82)
C 2
In view of (16.81) and (16.82), Eq. (16.80) assumes the final form
Z
! ! C
2
!T GP GP dc D C !T0 1 e01 C !T0 2 e02 C A !T0 3 e03 : (16.83)
C 2
282 16 Dynamics of a Rigid Body
x
G
where e03 is the unit vector along the gyroscopic axis Gx3 and e1 is the unit vector
along the lab axis Gx1 . Moreover (Sect. 12.4)
P
KG D A.cos 'e01 sin 'e02 / C C 'e
P 03 : (16.88)
P G D M.c/ C M.v/ ;
K (16.89)
G G
.c/ .v/
where MG and MG denote the torques of fictitious forces and reactive forces,
respectively. It is possible to prove (see next chapter) that, if the constraints are
.v/
smooth, then MG is orthogonal to the plane G1 x3 , that is, to the unit vectors e1
and e03 . Consequently, if we consider the projections of (16.89) along these vectors,
then we obtain two differential equations without reactive forces. Now, since e1 is
fixed and e01 e1 D cos ', e02 e1 sin ', we have that
P G e1 D d .KG e1 / D A:
K R (16.90)
dt
On the other hand, in view of (16.86) and (16.88), we also have
P G e03 D d KG e03 KG ! e03
K
dt
d
D KG e03 C KG ! D C ':
R (16.91)
dt
284 16 Dynamics of a Rigid Body
It remains to evaluate the components of MG along e1 and e03 . First, from (16.84)
.c/
we obtain
MG e03 D C ! !T e03 :
.c/
(16.92)
On the other hand, the terrestrial angular velocity is contained in the meridian plane
G1 3 (Fig. 16.13), and we can write
.c/
MG e1 D C !T sin 'P sin : (16.95)
P
.0/ D 0 ; .0/ D 0; '.0/ D 0; '.0/
P D r0 : (16.98)
Since !T and sin are constant and P sin D d.cos /=dt, when we recall
(16.98), from (16.96) we derive the following result:
!T
'P D r0 C !T sin .cos cos 0 / D r0 1 C sin .cos cos 0 / : (16.99)
r0
'P D r0 ;
AR D C !T sin r0 sin :
In conclusion, the gyroscopic compass rotates uniformly about its axis, which
oscillates around the initial position. For the presence of friction, after a few
oscillations, the axis stops in the position D 0, i.e., it is directed along the
meridian.
We conclude this chapter considering a free solid B for which we can write the
balance equations of momentum and angular momentum in the form
.a/
where m is the mass of B and R.a/ , MG denote the active total force and total torque
relative to the center of mass G of the active forces acting on B.
We denote by Ox1 x2 x3 a lab frame and by Gx 1 x 2 x 3 a frame of reference with
its origin at G and axes parallel to the corresponding axes of the lab frame. Finally,
Gx10 x20 x30 is the body frame, which is a principal frame of inertia. In view of (15.7)
and taking into account that the motion of Gx 1 x 2 x 3 relative to the lab frame is
translational, we have
KG D KG ; P G D da KG D dr KG ;
K (16.103)
dt dt
the fact that the acting forces, since they depend on the position of B, its velocity
field, and time, are functions of the aforementioned variables.
We explicitly remark that the presence of all the unknowns in all the equations
make it impossible to analyze the motion of the center of mass independently of the
motion about the center of mass. This is possible, for instance, when B is only acted
upon by its weight mg. In such a case, (16.104) and (16.105) become
rR G D g.a/ ; (16.106)
dr KG
D 0: (16.107)
dt
These equations imply that the center of mass G moves along a parabola, whereas
the motion about the center of mass is a free rotation (Sect. 16.4).
Chapter 17
Lagrangian Dynamics
17.1 Introduction
.a/ .r/
mRrGi D Ri C Ri ; (17.1)
P Gi D
K
.a/
MGi C
.r/
MGi ; (17.2)
f1 ./ D 0;
: : : : : : : : : : ::
fm ./ D 0; (17.3)
in an open region of V6N . In fact, under these hypotheses, system (17.3) defines in
a submanifold of class C 1 and dimension n D 6N m. Equivalently, if q 1 ; : : : ; q n
are arbitrary local coordinates on Vn , i.e., arbitrary Lagrangian coordinates of S , this
manifold can be represented in the parametric form
1
More precisely, V6N is an open n-dimensional submanifold of <3N O.3/ since we must exclude
those values of xGi 1 , xGi 2 , xGi 3 , and i , 'i , i , i D 1; : : : ; N , for which parts of two or more bodies
of S occupy the same region of the three-dimensional space.
290 17 Lagrangian Dynamics
D .q 1 ; : : : ; q n /: (17.4)
f1 .x1 ; x2 ; x3 / D 0;
f2 .x1 ; x2 ; x3 / D 0;
or in parametric form
r D r.s/;
where x1 , x2 , and x3 are the Cartesian coordinates of P , r D .x1 ; x2 ; x3 /
the position vector of P , f1 D 0 and f2 D 0 the implicit equations of two
surfaces intersecting each other along , and s the curvilinear abscissa along .
A Lagrangian coordinate is s, and S has one degree of freedom.
• Let S be a rigid body with a fixed axis a. The manifold VL associated with SL
is < O.3/. The configuration space V1 is a circumference C . In fact, with the
notations of Fig. 16.2, the configurations of S are given assigning the angle '
between the fixed plane Ox1 x3 and the body plane Ox10 x30 . In turn, this angle
defines a point belonging to C . ' is a Lagrangian coordinate of S that has one
degree of freedom.
• When S is a rigid body with a fixed point, we have VL D <3 O.3/ and
V3 D O.3/. Euler’s angles are Lagrangian coordinates, and S has three degrees
of freedom.
• Let S be a double pendulum. Then the configuration space is the torus Vn D
S 1 S 1.
• The bar AB of Fig. 17.1 is contained in a plane Oxy, has point A constrained
to move on the straight line Ox, and can rotate about an axis containing A and
orthogonal to Oxy. The configuration space is a cylinder < S 1 , and x and '
are Lagrangian coordinates. The system has two degrees of freedom.
Other examples can be found in the exercises at the end of the chapter.
x A x
O
x
r’
r
We recall that the position vector r0 is constant since it is evaluated in a body frame.
Therefore, differentiating (17.6) with respect to time, we obtain the velocity of
the point r0 in terms of the Lagrangian coordinates and the Lagrangian velocities
qP 1 ; : : : ; qP n
@r
rP D h qP h : (17.7)
@q
We remark that .qP 1 ; : : : ; qP n / is tangent to the dynamical trajectory , whereas the
velocity vector rP is tangent to the trajectory of the point r0 in three-dimensional
space. These vectors are related to each other as it is shown by (17.7).
We define virtual velocity v of P 2 S as any possible velocity compatible with
the constraints of S . It is evident that a virtual velocity is obtained by (17.7) by
292 17 Lagrangian Dynamics
fi .q 1 ; : : : ; q n / D ci ; i D 1; : : : ; r;
where c1 ; : : : ; cr are constants, and then the constraints (17.9) are holonomic.
Example 17.1. Let S be a disk that rolls without slipping on a horizontal plane
Ox1 x2 . If we suppose that the plane containing S remains vertical during motion,
the configuration space of S is <2 S 2 ; moreover, the coordinates x1 and x2 of the
contact point A and the angles ' and in Fig. 17.2 are Lagrangian coordinates for
S . Finally, the angular velocity ! of S about a vertical axis containing C is
! D 'e P 3;
P 03 C e (17.10)
xP 1 C a'P cos D 0;
xP 2 C a'P sin D 0;
17.5 Configuration Space-Time 293
x’
e x’
C
e’
x’ O x
x A
It is evident that these differential forms are nonintegrable, so that they define a
nonholonomic constraint.
.
r x’
w
x
Example 17.3. Let S be a rotating simple pendulum (Sect. 14.10). In this case,
V6N is the three-dimensional Euclidean space and V1 the rotating circumference on
which the pendulum is constrained to remain.
Before introducing the main ideas of the new approach due to D’Alembert and
Lagrange, we start with a simple example that contains all the ingredients of this
theory.
Let S be a material point constrained to move on a smooth surface † having
parametric equations
r D r.q 1 ; q 2 /; (17.14)
where q 1 and q 2 are surface parameters. It is evident that the configuration space of
S is † and q 1 and q 2 are possible Lagrangian coordinates. If we denote by ˆ the
reactive force, by F.r; rP / the active force acting on S , and by m the mass of S , then
the motion of S on † is governed by the fundamental equation of dynamics
Let q 1 .t/ and q 2 .t/ be the parametric equations of the trajectory of S . Starting
from the position occupied by S on at a given instant t, we consider an arbitrary
17.6 Simple Example 295
x
F
e
w
e
virtual velocity w (Fig. 17.4). From (17.15) we obtain that the virtual power of the
force of inertia mRr and active force vanishes,
since the hypothesis that † is smooth implies ˆw D 0. In (17.16) Tr .†/ denotes the
tangent space to † at the point r occupied by S at the considered instant. Introducing
the expression (17.8) of the virtual velocity into (17.16), we obtain the condition
@r h
.mRr C F/ w D 0; 8.w1 ; w2 /: (17.17)
@q h
On the other hand, the vectors eh D @r=@q h are tangent to the coordinate curves,
and therefore rRh D rR eh and Fh D F eh denote the (covariant) components of rR and
F along the vector eh . Finally, from (17.17), taking into account the arbitrariness of
.w1 ; w2 /, we derive the following two scalar equations:
mrRh D Fh ; h D 1; 2: (17.18)
@r h
rP D qP ; (17.19)
@q h
@r h @2 r
rR D h
qR C h k qP h qP k ; (17.20)
@q @q q
we can say that Fh are functions of q h and qP h , whereas rRh depend on qRh , qP h , and qh .
In other words, Eqs. (17.18) are a system of two second-order differential equations
296 17 Lagrangian Dynamics
in the unknown functions q 1 .t/ and q 2 .t/ that completely determine the motion of
S on the surface †.
We can summarize the preceding results as follows.
1. Equation (17.15) implies (17.17).
2. Equation (17.17) does not contain the unknown reactive force.
3. Equation (17.17) leads to equations that contain the unknowns determining the
motion of S .
In the following sections we prove that the foregoing procedure can be extended
to an arbitrary system S of N constrained rigid bodies.
N Z
X
P .m/ D Rr v dc; (17.21)
i D1 Ci
N Z
X
P .a/ D F v dc; (17.22)
i D1 Ci
N Z
X
P .r/
D ˆ v dc: (17.23)
i D1 @Ci
where v is an arbitrary virtual velocity field, i.e., compatible with the constraints to
which the system is subject.
Proof. The virtual velocity field v is compatible with the constraints. In particular,
it must be rigid for each body Bi of S . Consequently, the restriction of the velocity
field v to the region Ci has the form
!
vi D vGi C !i Gi P ; (17.25)
X
N Z X
N Z
!
P .a/ D vGi F dc C !i Gi P F dc
i D1 Ci i D1 Ci
X
N
.a/
X
N
.a/
P .a/ D vGi Ri C !i MGi : (17.26)
i D1 i D1
Similarly, we have
X
N
.r/
X
N
.r/
P .r/ D vGi Ri C !i MGi (17.27)
i D1 i D1
and
X
N Z X
N Z
!
P .m/ D vGi Rr dc !i Gi P rR dc:
i D1 Ci i D1 Ci
X
N X
N
P .m/
D vGi Pi
Q P Gi
!i K (17.28)
i D1 i D1
since
Z Z
d P i;
Rr dc D Pr dc D Q
Ci dt Ci
Z Z
! d ! P Gi ;
Gi P rR dc D Gi P rP dc rP Gi Qi D K
Ci dt Ci
298 17 Lagrangian Dynamics
N h
X
P .m/ C P .a/ C P .r/ D P i C R.a/ C R.r/
vGi Q i i
i D1
i
P Gi C M.a/ C M.r/ ;
C !i K Gi Gi
In this section we prove that, for any system of constrained rigid bodies with
n degrees of freedom, (17.30) leads to a system of n second-order differential
equations in the unknown functions q 1 .t/; : : : ; q n .t/.
In view of (17.21) and (17.22), we write (17.30) in the following form:
N Z
X
.F.a/ Rr/ w dc D 0; (17.31)
i D1 Ci
P t/ h .q; q;
.Qh .q; q; R t//wh D 0; 8.w1 ; : : : ; wn / 2 <n ; (17.32)
17.8 Lagrange’s Equations 299
where
N Z
X @r
P t/ D
Qh .q; q; F.a/ dc; (17.33)
i D1 Ci @q h
N Z
X @r
P q;
h .q; q; R t/ D Rr dc: (17.34)
i D1 Ci @q h
P q;
h .q; q; R t/ D Qh .q; q;
P t/; h D 1; : : : ; n: (17.35)
N Z
X
T D jPrj jPrj dc;
i D1 Ci
which, recalling the Lagrangian expression (17.13) of the velocity field, becomes
1
T D ahk .q; t/qP h qP k C bh .q; t/qP h C T0 .q; t/; (17.36)
2
where
N Z
X @r @r
ahk .q; t/ D dc D akh .q; t/; (17.37)
i D1 Ci @q h @q h
N Z
X @r @r
bh .q; t/ D dc; (17.38)
i D1 Ci @q h @t
Z 2
1X
N
@r
T0 D dc: (17.39)
2 i D1 Ci @t
Remark 17.1. When the constraints do not depend on time, we must use (17.5)
instead of (17.12). Consequently, @r=@t D 0, and the kinetic energy reduces to the
quadratic form
1
T D ahk qP h qP k T2 : (17.40)
2
The kinetic energy is a positive quantity if the velocity field rP does not vanish, and
it is equal to zero if and only if rP D 0. It is also possible to prove that T D 0 if
and only if qP 1 D qP n D 0. In other words, (17.40) is a positive-definite quadratic
form.
300 17 Lagrangian Dynamics
To give (17.35) a more convenient form, we start by noting that from (17.13)
there follows
@Pr @r
D h; (17.41)
@qP h @q
@Pr @ @r @ @r d @r
h
D h h
P
q h
C h
D ; (17.42)
@q @q @q @t @q dt @q h
so that also
@r d @r d @r
rR D rP h rP
@q h dt @q dt @q h
d @Pr @Pr
D rP h rP :
dt @qP @q h
N Z
X @r
h D Rr dc
i D1 Ci @q h
X N Z X N Z
d @Pr @Pr
D Pr h Pr h dc
i D1
dt Ci @qP i D1 Ci
@q
This result allows us to state that, under suitable regularity hypotheses on the
P t/ and Qh .q; q;
functions T .q; q; P t/, the system of Lagrange equations has one and
only one solution satisfying the initial data
We say that the positional forces acting on the system S of rigid bodies, with
constraints depending on time, are conservative if there exists a C 1 function U.q; t/,
called potential energy, such that
@U
Qh .q; t/ D : (17.47)
@q h
Note that in (17.47) the time t appears as a parameter. If the active forces are
conservative, then @U=@qP h D 0. Consequently, introducing the Lagrange function
P t/ D T .q; q;
L.q; q; P t/ U.q; t/; (17.48)
d @L @L
D 0; h D 1; : : : ; n: (17.49)
dt @qP h @q h
P t/ D T .q; q;
L.q; q; P t/ U.q; q;
P t/: (17.51)
P t/,
To recognize the severe restrictions that (17.50) imposes on the function U.q; q;
we start by noting that it can be put in the form
@2 U k
P t/ D
Qh .q; q; P t/;
qR C uh .q; q;
@qP h @qP k
where uh .q; q;P t/ are suitable functions independent of qP h . Since the left-hand sides
of the preceding equations do not depend on qR h , the function U.q; q; P t/ must depend
linearly on qP h , i.e.,
@2 U
D 0;
@qP h @qP k
and we can state that
This formula shows that the forces having a generalized potential energy, apart from
the term @…h =@t, are formed by conservative forces with a usual potential energy
of UO .q; t/ and by gyroscopic forces whose Lagrangian components are
@…h @…k
Qh .q; q;
P t/ D qP k hk qP k : (17.54)
@q k @q h
These forces have the remarkable property that their power vanishes
@A
E D r' ;
@t
B D r A:
304 17 Lagrangian Dynamics
Taking into account these relations, Lorentz’s force (17.58) can be written as
follows:
@
F D r.q'/ C r .qA/ C .qA/: (17.59)
@t
Comparing (17.59) with (17.57), we conclude that Lorentz’s force has the
following generalized potential energy:
P r C ! .! r/ C 2! rP
F D mŒ!
P r C .! ˝ ! ! 2 I/ r C 2! rP :
D mŒ! (17.61)
where
1
UO D mr .! ˝ ! ! 2 I/ r; (17.63)
2
… D m! r: (17.64)
P
f .q.t/q.t/; t/ D f .q0 ; qP 0 ; 0/; (17.65)
where q.t/ is a solution satisfying the initial data q h .0/ D q0h ; qP h .0/ D qP0h , i D
1; : : : ; n.
In other words, a first integral supplies a conservation law that holds in any mo-
tion of S . From a mathematical point of view, it is a first-order differential equation
17.10 First Integrals 305
that may substitute one of the Lagrange equations, simplifying their integration.
Both these remarks show that searching for first integrals is a fundamental problem
of Lagrangian dynamics.
We say that a Lagrangian coordinate q h is cyclic or ignorable if it does not appear
in the Lagrangian function L. If we introduce the generalized momentum conjugate
to q h
@L
ph D h ; (17.66)
@qP
from Lagrange’s equation we obtain that ph is a first integral when L is independent
of q h :
P t/ D ph .q0 ; qP 0 ; 0/:
ph .q; q; (17.67)
This result shows how important is the choice of a suitable system of Lagrangian
coordinates in analyzing the motion of a dynamical system. Now we prove the
existence of a fundamental first integral.
Theorem 17.4. Let S be a system of rigid bodies with smooth and fixed holonomic
constraints. If the active forces derive from a generalized potential independent of
time [see (17.52)]
P D UO .q/ C …h .q/qP h ;
U.q; q/
then the total energy
E D T C UO (17.68)
is constant during any motion of S .
Proof. Along any motion of S we have that
dL @L h @L
D qP C h qR h
dt @q h @qP
@L h d @L h d @L
D qP C P
q qP h :
@q h dt @qP h dt @qP h
Taking into account the Lagrange equations, the preceding relation reduces to the
condition
dL d @L h
D qP ;
dt dt @qP h
which can be written equivalently as
@L h
qP L D const: (17.69)
@qP h
Further, if the constraints do not depend on time, formula (17.40) of the kinetic
energy reduces to the quadratic form T2 , and we have that
L D T U D T2 UO .q/ …h .q/qP h :
306 17 Lagrangian Dynamics
@L h
qP L D ahk qP h qP k …h qP h T2 C UO C …h qP h D T C UO ;
@qP h
In this section we prove a fundamental theorem that relates the first integrals of
Lagrange’s equations to the symmetries of a Lagrangian function.
As usual, we consider a dynamical system S with smooth and fixed holonomic
constraints and acted upon by conservative forces. We denote by Vn the config-
uration space and by T .Vn / its tangent fiber bundle, which is a 2n-dimensional
manifold called the velocity space of S . We recall that any point of T .Vn / is a pair
.q; v/ of a point q 2 Vn and a tangent vector v belonging to the tangent space Tq .Vn /
at q (Chap. 6). The dynamics of S is completely described by a Lagrangian function
L which can be regarded as a C 1 real-valued function on T .Vn /.
Let 's .q/ W < Vn ! Vn be a one-parameter global transformation group of Vn ,
and let X be its infinitesimal generator (Chap. 7). The group 's .q/ is said to be a
symmetry group of L if
Theorem 17.5 (E. Noether). Let S be a system of rigid bodies with smooth, fixed
holonomic constraints and subject to conservative forces. If 's .q/ W < Vn ! Vn
is a symmetry group of the Lagrangian function L W T .Vn / ! <, then the function
f W T .Vn / ! < given by
@L
f .q; v/ h X h (17.72)
@v
17.11 Symmetries and First Integrals 307
orbits
v
q X
trajectories
P D f .q.0/; q.0//;
f .q.t/; q.t// P
@qs0h k
L.q h .t/; vh / D L.qs0h .q.t//; v /: (17.73)
@q k
The theorem is proved since the functions q h .t/ satisfy Lagrange’s equations. t
u
Example 17.4. Consider a dynamical system whose Lagrangian function L does
not depend on the ignorable coordinate q 1 . Then, the one-parameter group of
transformations
q 01 D q 1 C s; q 02 D q 2 ; : : : ; q 0n D q n
308 17 Lagrangian Dynamics
@L
f .q; v/ D D p1 ;
@qP 1
1
LD m.xP 2 C yP 2 / C .˛x C ˇy/;
2
x 0 D x C ˇs; y 0 D y ˛s
@L 1 @L 2
f D X C X D m.ˇ xP ˛ y/:
P
@xP @yP
1X 2
N
LD mi rPi C ri2 Pi2 C zP2i U.r1 ; : : : ; rN ; z1 ; : : : ; zn /:
2 i D1
Since all the coordinates 1 ; : : : ; n are cyclic, the one-parameter group of transfor-
mations
ri0 D ri ;
i0 D i C s;
z0i D zi ; i D 1; : : : ; N;
17.11 Symmetries and First Integrals 309
It is easy to verify that this first integral coincides with components Kz of the total
angular momentum along the Oz-axis:
X
N
Kz D mi .xi yPi yi xP i /:
i D1
Remark 17.3. We underline that the preceding theorem states that to any symmetry
group of L there corresponds a first integral of Lagrange’s equations. It is possible
to prove that also the inverse implication holds provided that we generalize the
definition of symmetry group. We face this problem in the Hamiltonian formalism.
It is also important to note that, although we can prove the equivalence between
symmetry groups and first integrals by extending the definition of symmetry group,
we are no more able, in general, to obtain the very simple expression (17.72) of the
first integral corresponding to this more general symmetry group.
The extension of Noether’s theorem to the case of a Lagrangian function
depending on time can be obtained by including the time among the Lagrangian
coordinates in the following way. Let L.q; v; t/ W T .Vn /< ! < be the Lagrangian
function of a dynamical system S over the .2n C 1/-manifold T .Vn / <. Consider
the extension LO of L to the .2n C 2/ manifold T .Vn </ such that
v
O
L.q; t; v; u/ D L q; t; v; u: (17.74)
u
The following identities can be easily proved:
@LO @L @LO @L
h
D h
u; D h; (17.75)
@q @q @vh @v
@LO @L @LO @L vh
D u; D L h : (17.76)
@t @t @u @v u
We now verify that if the functions .q h .t// are a solution of the Lagrangian
equations relative to L, then .q h .t/; t/ are a solution of the Lagrangian equations
O In fact, in view of (17.75) and (17.76), regarding the first n equations
relative to L.
O
relative to L we can say that
d @LO @LO d @L @L
D D 0:
dt @qP h @q h dt @qP h @q h
310 17 Lagrangian Dynamics
since the first n Lagrange equations are verified by the functions .q h .t//.
We say that the one-parameter group of transformation 's W < .Vn </ !
.Vn </ on the space-time of configurations .Vn </ is a symmetry group of L if
O that is, if
it is a symmetry group of L,
˛
O O ˛ ˇ @'s ˇ
L.q ; v / D L 's .q /; ˇ v ;
˛ ˛
(17.77)
@v
q 0h D q h ; t 0 D t C s:
Since the infinitesimal generator of this group is the vector field X D .0; : : : ; 0; 1/,
we obtain from (17.79) the conservation of the total mechanical energy
@L h
ED v L:
@vh
17.12 Lagrange’s Equations for Nonholonomic Constraints 311
aih wh D 0; i D 1; : : : ; r n; (17.81)
P t/ h .q; q;
Qh .q; q; P q;
R t/ D 1 a1h C C r arh ; h D 1; : : : ; n: (17.82)
d @T @T X r
D Q h i aih ; h D 1; : : : ; n: (17.83)
dt @qP h @q h i D1
These equations, together with the conditions due to the nonholonomic constraints,
aih qP h D 0; i D 1; : : : ; r; (17.84)
define a system of nCr differential equations in the nCr unknowns q 1 .t/; : : : ; q n .t/,
1 ; : : : ; r .
Example 17.7. Consider the homogeneous disk S rolling without slipping on a
plane already considered in Fig. 17.2. Since S is supposed to remain in a vertical
plane, the weight of S has no effect on the motion. Therefore, the Lagrange function
L of S reduces to the kinetic energy, which can be written as
1 2 1
LD xP 1 C xP 22 C A.p 2 C q 2 / C C r 2 ;
2 2
312 17 Lagrangian Dynamics
where A and C are the momenta of inertia of S , and p, q, and r are the components
of the angular velocity ! in the body frame C x10 x20 x30 (Fig. 17.2). Since e3 D
sin 'e01 C cos 'e02 , the angular velocity can be written as
where the meaning of the constants P0 and 0 is evident. Introducing (17.85) and
(17.86) into (17.87), we obtain the equation
.C C ma2 /'R D 0;
so that
'.t/ D 'P0 t C '0 ; (17.92)
where again the meaning of 'P0 and '0 is evident. Finally, from (17.91), (17.92),
(17.89), and (17.90), we have that
a'P
x1 .t/ D sin.P0 t C 0 /;
P0
a'P
x2 .t/ D cos.P0 t C 0 /
P0
17.13 Small Oscillations 313
if P0 ¤ 0 and
if P0 D 0. In conclusion, the disk rotates uniformly about the vertical axis containing
its center C , rolls uniformly about the horizontal axis containing C , and the contact
point A moves uniformly along a straight line if P0 D 0 and moves uniformly on a
circumference of radius ja'= P P0 j if P0 ¤ 0.
Let S be a dynamical system with smooth and fixed constraints. We say that a
configuration C of S is an equilibrium configuration if, placing S at rest in the
configuration C at the instant t D 0, it remains in this configuration at any time.
Let Vn be the configuration space and denote by q the point of Vn representative of
the configuration C . Then, q is an equilibrium configuration if
qP h D vh ; (17.95)
vP h D g h .q; v/: (17.96)
1 h k
T D a qP qP ;
2 hk
2
@U 1 @ U
U D U.0/ C q C
h
qhqk ;
@q h 0 2 @@q h q k 0
1 h k
T D a qP qP ; (17.100)
2 hk
1 h k
U D bhk q q ; (17.101)
2
where
@2 U
bhk D : (17.102)
@q h @q k 0
The quadratic form (17.100) is positive definite since all the principal minors of
the matrix .ahk / have positive determinants (Sect. 17.8). Also, the quadratic form
(17.101) is positive definite since U.q/ has an effective minimum at 0.
The motions corresponding to the approximate Lagrangian function
1 h k 1 h k
L .q; q/
P D a qP qP bhk q q ; (17.103)
2 hk 2
which are called small oscillations about the stable equilibrium configuration 0, are
solutions of the following linear differential equations:
k k
ahk qP D bhk q : (17.104)
Since the matrix .bhk / is symmetric and the quadratic form (17.10) is positive
definite, all its eigenvalues 1 ; : : : ; n are positive, and it is possible to find an
orthonormal basis .uh / of En , formed by eigenvectors of .bhk /. In this basis, the
316 17 Lagrangian Dynamics
matrices .ahk / and .bhk / have a diagonal form
0 1
1 0
.b hk / D @ A ;
0 n
0 1
1 0
.ahk / D @ A ;
0 1
the Lagrangian function (17.103) becomes
X h 2 1 X 2
n n
P D 1
L .q; q/ qP h q h ; (17.107)
2 i D1 2 i D1
A O
xb x
mg B
is constrained to move along the curve to the second point B, with A no lower
than B, under the action of constant gravity and assuming no friction. In the
vertical plane containing , we introduce a frame of reference Oxy as shown in
Fig. 17.6. Let y.x/ be the equation of the curve and denote by s the curvilinear
abscissa from the origin O. With our choice of the origin of coordinates and the
condition sP D 0 at the initial time, the conservation of energy reduces to the
relation
1 2
sP mgy D 0:
2
Further, the length ds of the element of arc along is given by
p
ds D 1 C y 02 dx:
Bearing in mind the preceding two relations, we obtain the time T to cover
from A to B:
Z xb p
1 C y 02 .x/
TŒy.x/ D p dx: (17.109)
0 2gy.x/
By (17.109), a real number T corresponds to any C 1 .0; xb / function y.x/,
assuming the values y.0/ D 0 and y.xb / D yb , provided that the integral on the
right-hand side of (17.109) exists. Our problem will have a solution if we find a
function with the preceding properties that minimizes the value of T.
• As a second example we consider Fermat’s principle. Let S be an optical medium
with refractive index n.r/. This principle states that the path taken between two
points A and B by a ray of light is the path that can be traversed in the least time.
318 17 Lagrangian Dynamics
Denoting by .x1 .t/; x2 .t/; x3 .t// the parametric equations of , the time spent to
cover with speed v D c=n is given by
Z tb q
T D n.x1 .t/; x2 .t/; x3 .t// xP1 2 C xP2 2 C xP3 2 dt: (17.110)
ta
It is well known that this principle includes in a very compact form all the laws
of geometric optics (e.g., reflection, refraction). Once again the path of a ray
minimizes the integral relation (17.110).
• Let be a curve given by the function y.x/, a x b. Denote by S the surface
of revolution obtained rotating about the Ox-axis. Determine the curve for
which the area of S , given by
Z b p
S Œy.x/ D 2y.x/ 1 C y 02 dx; (17.111)
a
F W F ! <;
qn
qt qb
s
qa
t
q
Further, we suppose that f.s; t/ contains the function q.t/ for s D 0. More explicitly,
we suppose that
and we can state that q.t/ is an extremal of functional (17.112) if and only if
dI
D0 (17.116)
ds sD0
for any choice of the family f.s; t/ of curves satisfying properties (17.113) and
(17.114).
Since
@Pf @ @f @ @f
D D ;
@s @s @t @t @s
320 17 Lagrangian Dynamics
we can write
Z " #
dI tb
@L @f @L @Pf
D C dt
ds ta @f @s @Pf @s
Z tb
@L @f tb @L @ @L @f
D C dt: (17.117)
@Pf @s ta ta @f @t @Pf @s
To obtain the preceding relation, we substituted @=@t with d=dt since, for s D 0, all
the functions under the integral depend only on t; further, in view of (17.114), we
have that f.0; t/ D q.t/, Pf.0; t/ D q.t/.
P Comparing (17.116) and (17.118), we see
that a solution q.t/ of the Lagrange equations is an extremal of functional (17.112).
Conversely, if q.t/ is an extremal of functional (17.112), then (17.118) is satisfied
for every f.s; t/. Consequently, the right-hand side of (17.118)) vanishes for every
f.s; t/, and it is possible to prove that the function q.t/ is a solution of the Lagrange
equations.
In conclusion, the extremals of (17.112) are the solutions of the following
boundary value problem:
@L d @L
D 0; q.ta / D qa ; q.tb / D qb : (17.119)
@q dt @qP
No general theorem of existence and uniqueness exists for the preceding boundary
value problem.
The preceding results prove the following theorem.
Theorem 17.8 (Hamilton’s principle). Let S be a dynamical system with La-
grangian function L. Then, an effective motion for which q.ta / D qa and q.tb / D qb
is an extremal of functional (17.112).
Functional (17.112) is called an action functional.
17.15 Geometric Formulations of Lagrangian Dynamics 321
Up to now, our formulation of Lagrangian dynamics has used in a marginal way the
structures of differential geometry. Essentially, we have associated a differentiable
manifold Vn , the configuration space, with any dynamical system S . The dimension
n of Vn gives the degrees of freedom of S , i.e., the number of independent
parameters defining the configuration of S . Finally, the motion of S is represented
by a curve q.t/ of Vn , called the dynamical trajectory of S , satisfying the Lagrange
equations. This geometric formulation of Lagrangian dynamics is local since it uses
local coordinates .q h / on Vn ; in other words, there is no intrinsic formulation of this
theory. In this section, we present two interesting geometric versions of Lagrangian
dynamics in which any mechanical aspect assumes a geometric form.
Let S be a dynamical system with smooth and fixed holonomic constraints acted
upon by conservative active forces so that the dynamics of S is described by a
Lagrangian function L.q; q/. P We denote by Vn the configuration space of S . In
local coordinates .q 1 ; : : : ; q h / of Vn , the kinetic energy of S is
1
T D ahk .q/qP h qP k ; (17.120)
2
where the quadratic form on the right-hand side of (17.120) is positive definite and
the coefficients ahk are symmetric. Further, the functions ahk .q/ define a .0; 2/-
tensor over Vn since the kinetic energy is an invariant scalar with respect to the
coordinate transformations of Vn and qP h are the contravariant components of the
Lagrangian velocity relative to a coordinate basis .@=@q h /. These properties allow
us to equip Vn with the following Riemannian metric (Chap. 6):
With the introduction of this metric, a scalar product is introduced in the tangent
space Tq .Vn / at any point q 2 Vn ; moreover, for any vector v 2 Tq .Vn / the following
relations between contravariant and covariant components hold:
vh D qP h ; vh D ahk vk D ph : (17.124)
322 17 Lagrangian Dynamics
With these notations, the Lagrange equations (17.44) assume the following form:
@aj k j k
vP h vP vP D Qh .q; v/: (17.125)
@x h
In Chap. 9, we proved that the Christhoffel symbols ijh and the metric coefficients
ahk of a Riemannian manifold are related by the equation
1 @aj k j k p p
vP vP D apj hk vj vk D hk vp vk ; (17.126)
2 @x h
in view of which we can write (17.125) in the final form
rvh dvh p
hk vp vk D Qh .q; v/: (17.127)
dt dt
Taking into account that on the left-hand side of (17.127) are the covariant
components of acceleration in Lagrangian coordinates .q h /, we conclude that the
acceleration of the point q.t/, representing in the configuration space Vn the motion
of the material system S , is equal to the Lagrangian force .Qh /. In other words,
the motion of q.t/ in the configuration space Vn is governed by Newton’s law for a
material point with unit mass.
In particular, in the absence of forces, the dynamical trajectory q.t/ is a geodesic
of the metric (17.121).
In the preceding considerations, the constraints to which the dynamic system is
subject and the reactive forces exerted by them were geometrized by introducing a
configuration space equipped with the Riemannian metric (17.121). Now we want
to prove that it is possible to geometrize the active forces, provided that they derive
from a potential energy U.q/.
To prove this statement, we note that, for conservative forces, the total energy E
is constant during motion, i.e, T .q; q/P C U.q/ D E. Consequently, the dynamical
trajectory, corresponding to a fixed value of the total energy, is contained in the
region Wn Vn defined by the inequality
d 1 @ dq 0j
.F ahk q 0k / .F aj k /q 0j q 0k D 0; q 0j D :
d 2 @q h d
d F @aj k 0j 0k 1 @F
.F ahk q 0k / q q D aj k q 0j q 0k : (17.132)
d 2 @q h 2 @q h
On the other hand, from the conservation of the total energy we have that
2T D aij qP j qP k D 2.E U /:
d @T @T @U
D h;
dt @qP h @q h @q
324 17 Lagrangian Dynamics
d 1 @aj k j k @U
.ahk qP k / qP qP D h :
dt 2 @q h @q
Introducing the parameter .t/ and taking into account (17.130), we obtain the
equations
d 1 @aj k 2 0j 0k @U
F .F ahk q 0k / F q q D h;
d 2 @q h @q
which can be put in the final form
d F @aj k 0j 0k 1 @U 1 @F
.F ahk q 0k / q q D D : (17.133)
d 2 @q h F @q h 2F @q h
The theorem is proved comparing (17.33) and (17.32). t
u
Remark 17.4. Given the mechanical system S and a value E of the total energy,
metric (17.129) is determined. Assigning the initial data q h .0/ D q0h and q 0h .0/ D
q00h , that is, the initial position and direction of the initial velocity, we determine
by (17.32) the geodesic starting from this initial position in the given direction. In
this way, we know the dynamical trajectory but we do not know how it is covered.
The condition that during the motion the energy has a given value implies that the
geodesic must be covered with the law (17.130).
In this section, we analyze the path leading from the Lagrangian formulation of
mechanics to the Hamiltonian one. This path, in the case of constraints independent
of time, consists of the following fundamental steps.
1. First, we transform the Lagrange equations into an equivalent system of 2n
first-order differential equations by considering the Lagrangian velocities qP as
auxiliary unknowns. A solution xL .t/ of this system is a curve of the velocity
space, i.e., of the fiber bundle T Vn of the configuration space. In this space, the
Lagrange equations assume the form
xP L D XL .xL /; (17.134)
where XL .x/ is a vector field on T Vn .
2. Then we define the Legendre transformation, i.e., a diffeomorphism L W T Vn !
T Vn between the tangent fiber bundle T Vn and the cotangent fiber bundle
T .Vn /.
3. Finally, by the differential
TVn
XL
XH
q ,v
xL t xH t
xP H D XH .xH /; (17.136)
d @L @L
h D 0; h D 1; : : : ; n;
dt @qP h @q
Remark 17.5. The kinetic energy is a scalar quantity, i.e., it is invariant with
respect to changes in the Lagrangian coordinates. Consequently, the quadratic form
1
P h qP k and the linear form bh qP h of (17.36) are invariant with respect to these
2 ahk q
transformations. Recalling the criteria that give the tensorial character of a quantity
and noting that .qP h / is a vector of Vn , we can state that .ahk / and .bh / are,
326 17 Lagrangian Dynamics
L W M ! M (17.143)
Taking into account (17.144) and (17.146), we conclude that the Legendre transfor-
mation L associates to a point xL D .q h ; vh / 2 M a point x D .q h ; ph / 2 M
and to a tangent vector X D .X h ; Y h / 2 Tx M a tangent vector X 2 Tx M . If
this correspondence is applied to the (17.142), we obtain the transformed equation
in M
xP D X .x /; (17.147)
where x D .q h ; ph /. Further, recalling that Lvh vk D ahk and taking into account
(17.141), we obtain
X D vh ; Lvh q k vk C ahk akl .Lq l Lvl q m vm / D .vh ; Lq h /: (17.148)
qP h D vh ; (17.149)
pPh D Lq h : (17.150)
@vk @vk
Hq h D pk h
Lq h Lvk h D Lq h ;
@q @q
@vk @vk
Hph D vh C pk Lv k D vh ;
@q h @q h
@H
qP h D ; (17.152)
@ph
@H
pPh D ; (17.153)
@q h
qP h D vh ; (17.154)
vP h D a hk
Lq k Lt vk Lvk q l v ;
l
(17.155)
tP D 1: (17.156)
The solutions x.t/ D .q.t/; v.t/; t/ of this system are curves of M < that are the
integral curves of the vector field
XL D .X h ; Y h ; 1/ D vh ; ahk Lq k Lt vk Lvk q l q l ; 1 : (17.157)
is defined as
L.q h ; vh ; t/ D q h ; Lvh .q; v; t/; t ; (17.160)
@H
qP h D ; (17.162)
@ph
@H
pPh D (17.163)
@q h
tP D 1; (17.164)
17.17 Exercises
1 2
T D m rP C r 2 'P 2 C r 2 P 2 sin2 ;
2
h
U D ;
r
where g is the gravity acceleration and r the radius of the sphere S on which
P is constrained to move. Write the Lagrange equations and determine the
configuration space.
3. Using the notations of Sect. 16.2, prove that the kinetic energy and the potential
energy of a compound pendulum are
1
T D C 'P 2 ;
2
U D mgh cos ':
y
B
B
G
l
A
p
j
A x
O
i
1 P2 2
T D A sin C P 2 C C.'P C P cos /2 ;
2
U D mghG cos :
Write the Lagrange equations and the first integrals related to the cyclic
coordinates.
Hint: It is sufficient to recall (15.21)
1 2
T D Ap C Bq 2 C C r 2
2
and the Euler relations of Sect. 12.4
B
y G
l
j A
O i x Q
mxR G D ˆB ;
myRG D p C ˆA ;
I 'R D l.ˆ A sin ' ˆB cos '/:
Introduce into the third equation the expressions of ˆA and ˆB deduced from
the first two equations. Finally, the Lagrange equation is obtained by proving
that
1
T D I C ml 2 'P 2 ;
2
U D pl cos ':
1 1
T D M.xP 2 C yP 2 / C C 'P 2 ;
2 2
1
U D k1 .x l cos '/2 C .y l sin '/2
2
1
D k2 .x C l cos ' L/2 C .y C l sin '/2 ; (17.165)
2
rP G D xi
P C yj;
P
332 17 Lagrangian Dynamics
j B
l G G l
O x
i A C
T D T1 C T2 ;
U D U1 C U2 ;
with
1 1
T1 D .C1 C ml 2 /'P 2 C mxP 2 ml sin ' xP ';
P
2 2
1 1
T2 D C C ml 2 .9 sin2 ' C cos2 '/ 'P 2 C mxP 2 3ml xP 'P sin ';
2 2
k
U1 D x 2 ;
2
k
U2 D .4l cos ' x/2 :
2
Hint: The center of mass G1 has the following coordinates:
j
O x A l
i
G
p B
(cylinder) and the kinetic energy and the potential energy are
1 1
T D .C C ml 2 /'P 2 C mxP 2 C ml xP 'P sin ';
2 2
k 2
U D x pl cos ':
2
Write the Lagrange equations and determine the equilibrium configurations.
Hint: The coordinates of the center of mass are xG D x C l cos ', yG D
l cos '. Use Koenig’s theorem.
9. Let P1 and P2 be two heavy material points, with masses m1 and m2 ,
respectively, moving without friction on a circumference of radius r. Between
the two points acts a spring as shown in Fig. 17.13. Prove that the configuration
space is a torus S 1 S n and the kinetic energy T and the potential energy U
are
1 1
T D m1 r 2 'P12 C m2 r 2 'P22 ;
2 2
k 2
U D r .'2 '1 /2 C m1 gr sin '2 C m2 r sin '2 : (17.166)
2
Determine the Lagrange equations and the equilibrium positions.
10. A homogeneous rigid disk S rotates about an axis orthogonal to the plane of S
and containing its center O. Along a diameter of S a material point P moves
without friction (Fig. 17.14). Denoting by M and m the masses of S and P ,
respectively, prove that the kinetic energy and the potential energy are
1 2 1
T D mPs C .C C M s 2 /'P 2 ; (17.167)
2 2
k 2
U D r .'2 '1 /2 C m1 gr sin '2 C m2 r sin '2 : (17.168)
2
Determine Lagrange’s equations and find a first integral of them.
334 17 Lagrangian Dynamics
r P
j
O i x
P
s
O ϕ
11. Prove that the potential energy of a double pendulum (Fig. 17.15) is
Determine the configuration space, the kinetic energy, the equilibrium configu-
rations, and the small oscillations around the stable equilibrium configurations.
Hint: Applying Koenig’s theorem to both bodies of the double pendulum with
masses m1 and m2 , we obtain
1 1 1 1
T D m1 rPG2 1 C C1 'P 2 C m2 rPG2 2 C C2 P 2 :
2 2 2 2
On the other hand, denoting k D i j, we have that
rP G1 D 'kl
P 1 .sin 'i cos 'j/ D l1 '.cos
P 'i C sin 'j/
17.17 Exercises 335
y
j
O i x
G
p
p l2
B
and
P
rP G2 D rP A C k
!
AG 2
D 2'l P
P 1 .cos 'i C sin 'j/ C k.sin i cos j/
D .2l1 'P cos ' C l2 P cos /i C .2l1 'P sin ' C l2 P sin /j:
12. Write the kinetic energy and the potential energy of a bi-atomic molecule
supposing that the interaction force between the two atoms is elastic.
13. Let O z be an inertial frame of reference and denote by Oxyz another frame
uniformly rotating about Oz with respect to O z. Determine the generalized
potential of the fictitious forces acting on a mass point P moving in the plane
Oxy.
Hint: Apply (17.63) and (17.64).
Chapter 18
Hamiltonian Dynamics
.U <n ; .q; p// are called natural charts. It is evident that an atlas on Vn generates
an atlas of natural charts on M2n , called a natural atlas on M2n .
n
A natural basis relative to a natural chart .U < ; .q; p// is formed by the vectors
.@=@q h ; @=@ph / tangent to the coordinate curves.
In a natural chart .U <n ; .q; p// of M2n , consider the differential form2
! D ph dq h : (18.1)
1
For the contents of Chapters 18–20 see also [1, 3, 19, 30].
2
The most general differential form on M2n is
T
Similarly, in another natural chart U <n ; .qh ; p/ , with U U ¤ ;, we define
a new differential form
! D p h dq h : (18.2)
Recalling that a transformation from natural coordinates to natural coordinates is
expressed by the equations
q h D q h .q/; (18.3)
@q k
ph D .q/pk I (18.4)
@q h
S
in the intersection U U between the two charts, we have that
! D !; (18.5)
and we can state that the local differential forms (18.1) determine a unique
differential form ! on the whole manifold M2n . Together with this global differential
form, we consider on the whole M2n the 2-form
D d!; (18.6)
dq h ^ dph : (18.7)
d D 0: (18.8)
Proof. Equation (18.10)1;2 follows at once from (18.9). Further, we also have that
./ D I, and (18.10)3 is proved. t
u
18.2 Canonical Coordinates 339
Definition 18.1. The pair .M2n ; /, where is a global closed and nondegenerate
2-form, is called a symplectic manifold. It is the fundamental geometric structure of
Hamiltonian mechanics independent of time.
Definition 18.2. A local chart .U; .q; p// on M2n is a canonical or symplectic chart
if in the coordinates .q; p/ the components of are given by matrix (18.9).
In the preceding section, it was proved that natural coordinates are canonical.
To determine other canonical coordinates, denote by .q; p/ natural coordinates and
consider the coordinate transformation
Proof. Introducing the notations .x ˛ / D .q; p/, .x ˛ / D .q; p/, ˛ D 1; : : : ; 2n, the
transformation (18.11), (18.12) can be written as follows:
x ˛ D x ˛ .x ˇ /: (18.17)
@x @x @x @x
˛ˇ D ˛ D ˛ :
@x @x ˇ @x @x ˇ
340 18 Hamiltonian Dynamics
Then transformation (18.13) is canonical if and only if ˛ˇ D ˛ˇ , and (18.14)
is proved. Multiplying (18.14) on the left by ST , we obtain (18.15). Finally,
from (18.14) we have that
1 D S1 ST ;
and condition (18.16) is proved when we recall (18.10)3. t
u
1
Definition 18.3. Let fh .q; p/, gh .q; p/ be h D 1; : : : ; n, 2n C functions on an
open region of M . We call the function
X n
@fj @gj @fj @gj
Œq h ; pk (18.18)
j D1
@q h @pk @pk @q h
X n
@f @g @f @g
ff; gg : (18.19)
j D1
@q j @pj @pj @q j
a Poisson bracket.
It is a simple exercise to prove the following theorem.
Theorem 18.3. Conditions (18.14)–(18.16) are equivalent, respectively, to
@p k @ph @q k @ph
h
D k; h
D ;
@q @q @q @p k
@p k @q h @q k @q h
D k; D I (18.21)
@ph @q @ph @p k
n o n o
q h ; q k D 0; q h ; p k D ıkh ; fp h ; p k g D 0: (18.22)
For now, we do not know if there are canonical charts besides the natural
ones. Further, although the preceding criteria are necessary and sufficient for the
canonicity of a transformation, it is not an easy task to obtain by them a canonical
chart. In this section, we show a different approach to overcoming this difficulty.
We have already stated that the charts .U; .q; p// and .U ; .q; p// of M2n are
symplectic if and only if [see (18.7) and (18.9)]
dq k ^ dph D dq k ^ dp h (18.23)
T
in U U . This condition is equivalent to
d.ph dq h / D d.p h dq h /;
p h dq h D ph dq h df; (18.24)
T
where f is a C 1 function in U U . We show that this condition, which is
equivalent to (18.23), allows us to generate a symplectic transformation for a
convenient choice of the function f . For this reason, f is called a generating
function of the canonical transformation.
First, we note that system (18.11), (18.12) defines a canonical transformation if
it is invertible, i.e., if
0 1
@q @q
B @q @p C
det B C
@ @p @p A ¤ 0; (18.25)
@q @p
@q h
det ¤ 0: (18.26)
@pk
342 18 Hamiltonian Dynamics
@fO
ph D .q; q/; (18.30)
@q h
@fO
ph D .q; q/: (18.31)
@q h
To put the canonical transformation (18.30), (18.31) into the form (18.11), (18.12),
we must suppose that fO.q; q/ satisfies the following condition:
!
@2 fO
det ¤ 0: (18.32)
@q h @q k
which is equivalent to
where
fQ.q; p/ D fO.q; p/ C q h .q; p/p h :
Expanding the differential of fQ.q; p/ and recalling the arbitrariness of dq h and dp h ,
we obtain that (18.29) is equivalent to the system
@fQ
ph D .q; p/; (18.36)
@q h
@fQ
qh D .q; p/: (18.37)
@p h
To put the canonical transformation (18.36), (18.37) into the form (18.11), (18.12),
we now suppose that fQ.q; p/ satisfies the condition
!
@2 fQ
det ¤ 0: (18.38)
@q h @p k
In conclusion, there are infinite canonical charts on M besides the natural ones.
such that
!.Y/ D x .X; Y/; 8 Y 2 Tx M2n : (18.39)
Let .U; x ˛ /, ˛ D 1; : : : ; 2n, be any local chart on M in a neighborhood of x. If we
denote by .@=@x h / the corresponding natural basis at x and by dx ˛ the dual basis,
then (18.39) has the following coordinate representation:
so that
!h D Xh ; (18.41)
h h
! DX : (18.42)
We conclude this section by noting that, since is a closed 2-form, from (8.28)
we obtain
LX D d.iX /: (18.44)
so that
i X D X[ : (18.45)
LX D d.X[ /;
8 X 2 M2n : (18.46)
18.5 Global and Local Hamiltonian Vector Fields 345
Definition 18.5. Let M2n be the phase state. A vector field X 2 M2n is globally
Hamiltonian if there exists a differentiable function H.x/ 2 F M2n such that
XH D .dH /] : (18.47)
The integral curves .t/ of a globally Hamiltonian vector field satisfy the
equation
P .t/ D XH ..t// D .dH /] ..t//; (18.49)
@H
qP h D ;
@ph
@H
pP h D :
@q h
LX D 0; (18.50)
i.e., if is invariant along the integral curves of the vector field X or, equivalently,
if it is invariant with respect to the one-parameter group of diffeomorphisms
generated by X.
346 18 Hamiltonian Dynamics
X D .dH /] ; 8 x 2 U: (18.51)
Proof. The hypothesis that X is globally Hamiltonian implies X D .dH /] , and then
X[ D dH . From (18.46) it follows that LX D 0. Conversely, in view of (18.46),
the condition LX D 0 is equivalent to d.X[ / D 0, and this condition, in turn,
implies the existence of a function H for which X[ D dH only locally. t
u
Definition 18.8. A one-parameter group of diffeomorphisms 't of M2n is said to be
symplectic if is invariant with respect to any diffeomorphism 't , that is, if (18.50)
holds.
Definition 18.9. 8f , g 2 F M2n , the function
Proof. In fact,
@f @f
df D dq h C h dph ;
@q h @q
@g h @g
dg D h
dq C h dph :
@q @q
@f @ @f @
df ] D ;
@ph @q h @q h @ph
@g @ @g @
dg ] D h
h ;
@ph @q @q @ph
18.6 Poisson Brackets 347
From the skew symmetry of follows (18.55). We omit the proof of (18.56). t
u
Theorem 18.8. 8f; g; h 2 F M2n and 8a; b 2 <, the following identities hold:
ff; f g D 0; (18.57)
ff; gg D fg; f g ; (18.58)
ff; ag C bhg D a ff; gg C b ff; hg ; (18.59)
ff; fg; hgg C fg; fh; f gg C fh; ff; ggg D 0: (18.60)
gives the < vector space M2n a Lie algebra structure. Further, in view of (18.57),
this operation associates a globally Hamiltonian field to a pair of globally Hamil-
tonian fields. In other words, these fields form a Lie subalgebra with respect to the
bracket operation. On the other hand, Theorem 18.8 states that the < vector space
F M2n is also a Lie algebra with respect to the Poisson bracket
.f; g/ 2 F M2n F M2n ! ff; gg 2 F M2n :
.f / D .df /] D 0
implies df D 0.
Definition 18.10. Let f 2 F M2n be a differentiable function such that df ¤ 0,
8x 2 M2n . Then, f is said to be a first integral of the dynamical system
.M2n ; ; H / if
f ..t// D cos t (18.63)
along the integral curves of the Hamiltonian field XH .
In other words, f is a first integral if any integral curve .t/ of the Hamiltonian
vector field XH that crosses the level surface f .x/ D cos t is fully contained in this
surface S . Equivalently, the restriction to S of the vector field XH is tangent to S .
From a physical point of view, a first integral gives a conservation law.
Condition (18.63) can be equivalently expressed by the conditions
fP D XH f D LXH f D 0: (18.64)
2. If f is a first integral of .M2n ; ; H /, then H is a first integral of .M2n ; ; f /;
3. If f and g are first integrals of .M2n ; ; H /, then ff; gg is also a first integral
of .M2n ; ; H /.
Proof. Since fH; H g D fH; H g, we have that fH; H g D 0, and the first
statement is proved. Again, property 2 follows from the skew symmetry of the
Poisson brackets. Finally, the third statement follows at once from (18.60) and the
conditions ff; H g D fg; H g D 0. t
u
Remark 18.2. We underline that the first integral ff; gg could be dependent on the
first integrals f and g.
Remark 18.3. The first integral H.x/ D const states the conservation of the me-
chanical energy of the mechanical system .M2n ; ; H / with forces and constraints
independent of time. In fact, for such a system we have that [see (17.151)]
H.q h ; ph / D ph vh L.q; v/ vDv.qp/
1
LDT U D ahk vh vk U.q/;
2
where U.q/ is the potential energy of the acting forces. Further, we have that ph D
@L=@vh D ahk vk . If we denote by .ahk / the inverse matrix of .ahk /, then vh D
ahk pk , and the Hamiltonian function assumes the form
1 hk
H DT CU D a ph pk C U.q/:
2
or, equivalently, if
LXf H D 0: (18.67)
The following theorem extends Noether’s theorem to the Hamiltonian formalism.
350 18 Hamiltonian Dynamics
Theorem 18.11. The function f is a first integral of the dynamical system .M2n ; ;
H / if and only if H admits a symmetry group generated by the complete globally
Hamiltonian field Xf .
Proof. It is sufficient to note that (18.64), (18.65), and (18.67) imply the following
chain of identities:
t
u
Remark 18.5. The last identity holds if Xf is complete. If M2n is compact, then any
Hamiltonian vector field is complete. In general, all Hamiltonian fields that are not
complete can be only locally associated with a symmetry of H.
Remark 18.6. In Theorem 18.11, it is fundamental to suppose that the infinitesimal
generator X of a symplectic group is Hamiltonian. In fact, if X generates a
symplectic group of symmetry of H but it is not Hamiltonian, then we have that
LX H D 0; LX D 0:
.df /] D Xf ;
@f @f
Xh D ; Xh D :
@ph @q h
18.8 Poincaré’s Absolute and Relative Integral Invariants 351
s F U
s F
uk
s
M*n
U F
u
Definition 18.12. A k-form ˛ 2 ƒk M2n is an absolute integral invariant of the
complete vector field X on M2n if
LX ˛ D 0: (18.69)
Proof. To simplify the proof, we suppose that † is a k-cube .U; ; F / (Sect. 8.4),
so that † D F .U /. From the definition of the integral of a k-form ˛ we obtain
Z Z Z
˛D .'s ı F / ˛ D F ' ˛:
's ı† U U
and the first part of the theorem is proved. Conversely, if (18.71) holds for any U ,
then Z Z
F ˛ D F 's ˛:
U U
From the arbitrariness of U and the continuity of the functions under the integral
we obtain F ı 's ˛ D F ˛, that is, 's ˛ D ˛. t
u
Definition 18.13. The .k 1/-form ˛ is a relative integral invariant of the
complete vector field X on M2n if
d.LX ˛/ D 0: (18.72)
LX .d˛/ D 0; (18.73)
and, in view of (18.69), we can say that ˛ is a relative integral invariant if and only
if d˛ is an absolute integral invariant.
Theorem 18.13. The .k 1/-form ˛ on M2n is a relative integral invariant of the
complete vector field X if and only if
Z Z
˛D ˛; (18.74)
@† 's .@†/
where † is an arbitrary k-dimensional submanifold of M2n and @† is its boundary.
Proof. Again, we suppose that † is a k-cube .U; ; F /. Then, from (18.73) and
Theorem 18.12, we have the condition
Z Z
d˛ D d˛;
† 's .†/
where † is an arbitrary two-dimensional submanifold of M2n and 's is the
(symplectic) one-parameter group of diffeomorphisms generated by X.
In many books, the Hamiltonian fields are introduced starting with one of the
foregoing properties. This is possible only if we refer to a coordinate neighborhood,
not to the whole manifold M2n .
Theorem 18.15 (Liouville’s Theorem). Let .M2n ; ; H / be a Hamiltonian dy-
namical system, and suppose that the Hamiltonian vector field XH is complete. Then
the 2n-form
n.n1/
O D .1/
2
n (18.77)
nŠ
where W is an arbitrary 2n-dimensional submanifold of M2n and 's a one-
parameter symplectic group of diffeomorphisms generated by XH . Further, in
symplectic coordinates,
O D dq 1 ^ ^ dq n ^ dp1 ^ ^ dpn :
(18.79)
Finally, regard W as a set of initial data .q0h ; p0h / of the Hamilton equations of
the system .M2n ; ; H /. Since 's is generated by XH , its orbits coincide with
the dynamical trajectories of the Hamilton equations. Therefore, 's ı W is the
submanifold formed by the configurations at the instant s of all the motions starting
from the points of W . In conclusion, the preceding theorem states that the volume
of this region does not vary over time.
Proof. Since LXH is a derivation (Chap. 8) and (18.50) holds, we have that
n.n1/
O D .1/
2
LXH LXH n
nŠ
n.n1/
.1/ 2
D .LXH ^ ^ C C ^ ^ LXH / D 0;
nŠ
The only nonvanishing terms of the preceding product have the form dq i1 ^ dpi1 ^
^ dq in ^ dpin , where each of the indices i1 ; : : : ; in is different from the others.
Consequently, the right-hand side of (18.81) contains nŠ terms. Consider the term
dq 1 ^ dp1 ^ ^ dq n ^ dpn . It assumes the form dq 1 ^ dq n ^ ^ dp1 ^ dpn
by 1 C 2 C C .n 1/ D n.n 1/=2 inversions in each of which the term
changes its sign. On the other hand, any other term can be reduced to the form
dq 1 ^ dp1 ^ ^ dq n ^ dpn by suitable exchanges of the factors dq i ^ dpi that do
not modify the sign of the considered term. In conclusion, (18.81) can be put in the
form
n.n1/
n D nŠ.1/ 2 dq 1 ^ ^ dq n ^ dp1 ^ ^ dpn ;
and (18.80) is proved. t
u
Theorem 18.16. The equilibrium positions of a Hamiltonian system cannot be
asymptotically stable.
Proof. In fact, if x 2 M2n
is an asymptotically stable position of a Hamiltonian
system, then there exists a neighborhood I of x such that the dynamical trajectories
starting from the points x0 2 I tend to x when t ! 1. In other words, the volume
of 's .I / goes to zero when t ! 1, against Liouville’s theorem. t
u
18.9 Two Fundamental Theorems 355
s W
s W
Example 18.1. The phase portrait of a simple pendulum is represented in Fig. 18.2
(Sect. 14.9). Since the lower the pendulum period the higher the amplitude of
its oscillations, the dynamical trajectories starting from initial data contained in
region W and closer to the origin describe a greater fraction of the oscillation
than trajectories starting from points of W farther from the origin. Therefore, the
evolution of the set of initial data W is that sketched in Fig. 18.2 for the time instants
s1 < s2 .
Definition 18.14. Let .M2n ; ; H / be a Hamiltonian dynamical system, and de-
note by 't a symplectic group generated by the complete Hamiltonian vector field
XH . We say that the submanifold W of M2n is invariant with respect to 't if
S
W
t W
n W
M*n
Theorem 18.18. All trajectories starting from the points of the region S0 W
(except for the points of a subset with measure equal to zero), where W is an
invariant region of the Hamiltonian vector field, go back to S0 an infinite number of
times.
Proof. In fact, if a set U0 S0 exists for which this property is not verified, we
obtain an absurd result in view of the previous theorem. u
t
The preceding theorems show that, when orbits are not periodic, the phase por-
trait is very complex, especially if we take into account that, applying the uniqueness
theorem to Hamiltonian equations, trajectories corresponding to different initial data
cannot intersect each other.
Let .M2n ; ; H / be a Hamiltonian system, and denote by †E the level .2n 1/
submanifold of M2n defined by the equation
H.x/ D E: (18.83)
We have already said that all dynamical trajectories with initial data belonging to †E
lie on †E . Equivalently, the restriction X†E of XH to the points of †E is tangent
to †E .
18.10 Volume Form on Invariant Submanifolds 357
t E+ E
X E E
Theorem 18.19. If †E is a regular submanifold of M2n , then there exists a .2n
1/-form on †E that is an invariant volume form of the vector field X†E .
Proof. Introduce into a neighborhood U of a point x 2 †E the system of coordi-
nates .x 1 ; : : : ; x 2n1 ; H / such that .x 1 ; : : : ; x 2n1 / are coordinates in U \ †E and
the value H D E corresponds to U ˚\†E . Denoting by 0 a region contained in U\
†E , we consider the domain 0 D xj.x 1 ; : : : ; x 2n1 / 2 0 ; E H E C E
M2n (Fig. 18.4). Liouville’s theorem states that the volume V0 of 0 is equal to the
volume Vt of t D 't .0 /, where 't is the one-parameter group generated by the
Hamiltonian field XH . If we denote by .q; p/ canonical coordinates in U , we can
write
Z Z ECE Z
Vt D dq 1 dq n dp1 dpn D dH J dx 1 dx 2n1 ;
t E 't . 0 /
where J denotes the value of J at a suitable point of the interval .E; E C E/.
Finally, the invariant volume of 't . 0 / is
Z
Vt
!.'t . 0 // D lim D JH DE dx 1 dx 2n1 : (18.84)
E!0 E 't . 0 /
In conclusion, the invariant volume .2n 1/-form of the vector field X†E is
E D JH DE dx 1 ^ ^ dx 2n1 : (18.85)
t
u
358 18 Hamiltonian Dynamics
q h D q h ; h D 1; : : : ; n;
p˛ D p˛ ; ˛ D 1; : : : ; n 1; (18.86)
pn D pn .q h ; p˛ ; E/:
q h D q h ; h D 1; : : : ; n;
p˛ D p˛ ; ˛ D 1; : : : ; n 1; (18.87)
pn D pn .q h ; p˛ ; H /:
@pn
J D :
@H
In conclusion, in these coordinates E assumes the form
@pn
E D dq 1 ^ ^ dq n ^ dp1 ^ dpn1 : (18.88)
@H H DE
Example 18.2. The Hamiltonian function of the simple pendulum with mass and
length equal to 1, is
1
H D p 2 g cos q:
2
Then, (18.86) assume the form
p
q D q; p D 2.E C g cos q/
and E becomes
1
E D p dq:
2.E C g cos q/
Remark 18.9. In many books (e.g., [31]), instead of (18.84), the following alterna-
tive invariant volume form E on the submanifold H.q; p/ D E is proposed:
Z
d
!. / D ; (18.89)
jrH j
v
u n " 2 2 #
uX @H @H
jrH j D t C :
@q h @ph
hD1
p
d D gdq 1 dq n dp1 dpn1 : (18.91)
On the other hand, the relations between the partial derivatives appearing
in (18.90) and the implicit form H.q h ; ph / D E of †E are
and we find again the invariant volume form (18.88). In conclusion, although (18.89)
is based on wrong and useless hypotheses, it leads to the right definition of volume
form.
18.11 Exercises
@f @g
ff; gg D ˛ˇ ˇ : (18.94)
@x ˛ @x
3. Verify that if the forces acting on a mechanical system H.q h ; ph / derive from
a generalized potential, then H D const reduces to T C UO .q h ; ph / D const
[see (17.68)].
4. Show Noether’s theorem starting from (18.66) and adopting arbitrary coordinates
.x ˛ / on M2n
.
Hint: Differentiating (18.66) with respect to s we obtain (see preceding exercises)
@f @H
fP D ˛ ˛ˇ ˛ D ff; H g :
@x @x
5. Suppose that the Hamiltonian function H.q h ; ph / does not depend on the
symplectic coordinate q 1 . Verify, by Noether’s theorem, that the corresponding
conjugate momentum is constant.
Hint: The one-parameter group of diffeomorphisms
q 0h D q h C sı 1h ; ph0 D ph ; h D 1; : : : ; n;
X D .1; 0; : : : ; 0/:
@f @f
ı 1h D ; 0D
@ph @q h
1 2
1 1 2
H D p C p22 C .q / C .q 2 /2
2 1 2
the volume 3-form on the invariant submanifold H D E is
1
E D q dq 1 ^ dq 2 ^ dp1 :
2E .p12 C .q 1 /2 C .q 2 /2 /
Chapter 19
The Hamilton-Jacobi Theory
@H @H
qP h D ; pPh D h ; h D 1; : : : ; n; (19.1)
@ph @q
in the symplectic coordinates .q; p/ of M2n .
We wish to determine a symplectic transformation of coordinates
such that, in the new coordinates, the Hamiltonian function depends only on the
coordinates p. Recalling that, under a transformation of symplectic coordinates
(19.2), the new Hamiltonian function is obtained by applying transformation (19.2)
to the variables q; p in the old Hamiltonian function H.q; p/, we are searching for
a transformation of symplectic coordinates for which
@H
qP D .p/; pP D 0; (19.4)
@p
@W .q; p/ @W .q; p/
pD ; qD ; (19.7)
@q @p
Introducing (19.7)1 into (19.3), we can state that the generating function W .q; p/
must be a solution of the Hamilton–Jacobi partial differential equation
@W .q; p/
H q; D H .p/: (19.9)
@q
Remark 19.1. The first-order partial differential equation (19.9) is nonlinear since it
depends quadratically on the first derivatives @W=@q h . Consequently, determining
its solution is more difficult than solving Hamiltonian equations. Further, before
Jacobi, the method proposed by Cauchy, Monge, and Ampére to determine the
solution of a first-order partial equation required the integration of a system of
ordinary differential equations, the characteristic system, which can be associated
with the starting partial differential equation (Appendix A). Now, in the case of
(19.9), this characteristic system coincides with the Hamiltonian equations we wish
19.1 The Hamilton–Jacobi Equation 363
to solve. In other words, we are faced with a vicious circle. However, in the next
section, we present an alternative method proposed by Jacobi that allows us to
overcome this gap.
Remark 19.2. On the right-hand side of (19.9) is a Hamiltonian function in the
new symplectic coordinates we are searching for. It is unknown since its form
depends on the transformation itself. In other words, there are infinite symplectic
transformations of coordinates satisfying (19.9), so we can arbitrarily choose the
form of H .p/, and our choice will influence the transformation of coordinates. In
particular, Jacobi adopted the following choice of this function:
H .p/ D p n E; (19.10)
@u
D xy (19.11)
@x
1 2
u.x; y/ D x y C F .y/;
2
1 2
u.x; y/ D x y C u0 .y/:
2
More generally, we consider the equation
@u @u
C D 0;
@x @y
which, introducing the constant vector field X D .1; 1/, can also be written in the
form
X ru D 0: (19.12)
Consider the system of ordinary differential equations (Appendix A)
xP D 1; yP D 1
364 19 The Hamilton-Jacobi Theory
du
D 0:
dt
Let .x0 .s/; y0 .s// be a curve that intersects at any point one and only one
straight line of . Then, if the values u0 .s/ of u.x; y/ are given along the curve
.x0 .s/; y0 .s//, the function u.x; y/ is completely determined. Once again, the
solution of (19.12) depends on an arbitrary function u0 .s/.
The preceding considerations show that we are interested not in finding the
general integral of the Hamilton–Jacobi equation but in determining a family of
its solutions, depending on n arbitrary constants. In the next section, we show that
such a family of solutions, called a complete integral, exists for a wide class of
Hamiltonian functions. We do not prove that, in general, the complete integral of a
partial differential equation exists and that the general integral is the envelope of the
family of solutions contained in the complete integral.
It is evident that, in general, the solution of (19.9) cannot be given in the form
(19.13) for any Hamiltonian function. However, we can prove that for a wide class of
Hamiltonian functions it is possible to obtain the complete integral of (19.9) starting
from (19.13).
Theorem 19.1. If the Hamiltonian function has the form
Proof. If (19.14) holds, then the Hamilton–Jacobi equation (19.9) can be written as
follows:
1 @W 2 @W 3 @W n @W
gn : : : g3 g2 g1 q ; 1 ; q ; 2 ; q ; 3 : : : ; q ; n D p n :
@q @q @q @q
(19.15)
This equation is satisfied if
@W
g1 q 1 ; 1 D p 1 ;
@q
@W
g2 q 2 ; 2 ; p 1 ; p 2 D p 2 ;
@q
: : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : ::
n @W
gn q ; n ; p1 ; : : : ; p n1 D p n :
@q
@gi
¤ 0; i D 1; : : : ; n; (19.16)
@pi
@W
D G1 .q 1 ; p 1 /;
@q 1
@W
D G2 .q 2 ; p 1 ; p 2 /;
@q 2
: : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : ::
@W
D Gn .q n ; p 1 ; : : : ; p n /: (19.17)
@q n
W D W1 .q 1 ; p 1 / C WO 1 .q 2 ; : : : ; q n ; p 1 ; : : : ; p n /; (19.18)
where Z
W1 .q ; p 1 / D
1
G1 .q 1 ; p 1 /dq 1 ;
W D W1 .q 1 ; p 1 / C W2 .q 2 ; p 1 ; p 2 / C WO 2 .q 3 ; : : : ; q n ; p 1 ; : : : ; p n /; (19.19)
366 19 The Hamilton-Jacobi Theory
where Z
W2 .q 2 ; p 1 ; p 2 / D G2 .q 2 ; p 1 ; p 2 /dq 2 : (19.20)
W D W1 .q 1 ; p 1 / C w2 .q 2 ; p 1 ; p 2 / C C Wn .q n ; p 1 ; : : : ; p h /; (19.21)
where
Z
Wh .q ; p 1 ; : : : ; p h / D
h
Gh .q h ; p 1 ; : : : ; p h /dq h ; h D 1; : : : ; n; (19.22)
@Wh h @Wh h
ph D .q ; p 1 ; : : : ; p h /; qh D .q ; p 1 ; : : : ; p h /; h D 1; : : : ; n:
@q h @p h
(19.23)
p2 kq 2
H.q; p/ D C ; (19.24)
2m 2m
where m is the mass of S and k is the elastic constant of the force acting on S .
Then, the Hamilton–Jacobi equation is
2
1 @W kq 2
C D p E: (19.25)
2m @q 2
@W p
pD D 2mE mkq 2 ;
@q
Z
@W mdq
qD D p : (19.26)
@p 2mE mkq 2
19.3 Examples of Separable Systems 367
On the other hand, in the coordinates .q; p/ the equations of motion are [see
(19.5)]
q D t C ˛; p D E: (19.27)
Introducing these functions into (19.26), we obtain
Z r Z
m m dq
t C˛ D p dq D q ;
2mE mkq 2 k 2E
q2 k
so that r r
2E k
qD sin .t C ˛/:
k m
2. Let P be a material point of mass m moving in a plane in which we adopt
polar coordinates .r; '/ under the action of a Newtonian force with potential
energy U.r/. Then the Lagrangian function and the Hamiltonian function of P
are given by
1
LD m.Pr 2 C r 2 'P 2 / U.r/;
2
!
1 p'2
H D pr C 2 C U.r/:
2
(19.28)
2m r
Z
@W p1
q D
1
D s dr C '; (19.32)
@p 1
p2
r2 2m.E U.r// 21
r
Z
@W m
q2 D D s dr: (19.33)
@p 2
p2
2m.E U.r// 21
r
On the other hand, in the coordinates .q; p/ the equations of motion have the
form
q 1 D ˛ 1 ; q 2 D t C ˛;
(19.34)
p 1 D ˇ1 ; p 2 D E:
Introducing (19.34) into (19.32) and (19.33), we obtain the implicit form of the
orbit r D r.'/ and of the function r D r.t/.
3. Consider a material point P with mass m that is constrained to move on a surface
of revolution S :
x D r cos ';
y D r sin ';
z D f .r/;
where x, y, and z are the Cartesian coordinates of P with respect to a frame of
reference Oxyz, ' is the angle between the plane , containing P and the axis
Oz, and the plane Oxz, and, finally r is the distance of P from Oz. If no active
force is acting on P , then the square of velocity is v2 D xP 2 C yP 2 C zP2 , and the
Lagrangian function of P becomes
1
LD m .1 C f 02 .r//Pr 2 C r 2 'P 2 : (19.35)
2
Since
@L
pr D D m.1 C f 02 .r//2 ;
@Pr
@L
p' D D mr 2 ';
P
@'P
Z s
p p1
W D p1' C 2mE 2 .1 C f 02 /dr:
r
19.4 Hamilton’s Principal Function 369
depend on s. Then, instead of (17.115), we take into account the other integral:
Z t2 .s/
If .s/ D L.f.s; t/; Pf.s; t// dt: (19.37)
t1 .s/
where
q.t/ D f.0; t/; t i D ti .0/; i D 1; 2: (19.39)
This expression, introducing the Hamiltonian function [see (17.151)]
@f @f dt i
D .0; t i / C .0; t i / .0; t i /: (19.42)
@s @t ds
370 19 The Hamilton-Jacobi Theory
In Sect. 17.14, we remarked that the boundary value problem of finding a solution
q.t/ of the Lagrange equations, satisfying the boundary conditions q.t1 / D q1 and
q.t2 / D q2 , is a problem profoundly different from the initial value problem relative
to the same equations. In fact, for the former a general theorem insures the existence
and uniqueness of the solution, whereas for the latter no general theorem holds.
Consequently, the boundary value problem could have no solution, one solution,
or infinite solutions (focal points). Regarding this problem, we assume that for any
choice of q1 2 D1 Vn , q2 2 D2 Vn , there is one and only one solution of
Lagrange’s equations starting from q1 at the instant t 1 and arriving at q2 at the
instant t 2 . If this hypothesis is verified, then for any choice of q1 2 D1 , q2 2 D2 ,
we can evaluate the integral (19.37) along the unique solution q.t/ of Lagrange’s
equations such that q.t 1 / D q1 and q.t 2 / D q2 . In this way, the integral (19.37)
defines a function
Z t2
S W .t 1 ; t 2 ; q1 ; q2 / 2 < D1 D2 !
2
P
L.q.t/; q.t/; t/ dt; (19.43)
t1
@f @f
dq D ds D .0; t i /ds C .0; t i /dt i ; (19.45)
@s @t
dt i
dt i D .0; t i /ds: (19.46)
ds
The initial and final points of the motions f.s; t/ are given by qi .s/ D f.s; ti .s//.
Consequently, the differential (19.45) gives the variations of these points in going
from a given motion to a close one. Finally, changing our notations as follows
t 1 D t0 ; t 2 D t; q1 D q0 ; q2 D q; p1 D p0 ; p2 D p; (19.47)
@S
.t0 ; t; q0 ; q/ D p; (19.48)
@q
@S
.t0 ; t; q0 ; q/ D p0 ; (19.49)
@q0
19.4 Hamilton’s Principal Function 371
@S
.t0 ; t; q0 ; q/ D H.q; p; t/; (19.50)
@t
@S
.t0 ; t; q0 ; q/ D H.q0 ; p0 ; t0 /: (19.51)
@t0
q D q.t0 ; t; q0 ; p0 /: (19.53)
Now we suppose that the Hamiltonian function H does not depend on time. If we
denote by E D H.q0 ; p0 / the value of the total energy corresponding to the initial
data and we recall that the total energy is constant during motion, then (19.50) and
(19.51) yield
S.t0 ; t; q0 ; q/ D E.t t0 / C W .q0 ; q/ (19.55)
where, in view of (19.54), the function W is a complete integral of the Hamilton–
Jacobi equation
@W
H q; .q0 ; q/ D E: (19.56)
@q
We conclude this section with a geometric description of the Hamilton–Jacobi
equation. Let Vn be the configuration space of the dynamical system S described by
the Lagrangian function
1
P D
L.q; q/ ahk qP h qP k U.q/; (19.57)
2
372 19 The Hamilton-Jacobi Theory
N
t
q
Vn
where U.q/ is the potential energy of the forces acting on S , and let Vn be equipped
with the Riemannian metric (Sect. 17.15)
Before proceeding, we need to make clear the meaning of the normal speed cn
of a moving surface †.t/ in the Riemannian space Vn . Denoting by f .q; t/ D 0
the implicit equation of †.t/, the unit normal vector N to †.t/ has covariant
components given by
f;h f;h
Ni D D ; (19.59)
hk
a f;h f;k jrf j
where f;h D @f =@q h and .ahk / is the inverse matrix of .ahk /. Now, at the arbitrary
point q 2 †.t/ we consider an arbitrary curve ./ such that (Fig. 19.1)
.t/ D q; P
.t/ D N.q/: (19.60)
The intersection points of the curve ./ with †./, > t, satisfy the equation
f ../; / D 0. Differentiating this equation with respect to , setting D t, and
recalling (19.60), we obtain the condition
f;h P h C f;t D 0;
ft
cn N D : (19.61)
jrf j
It is evident that cn is the speed of †.t/ along the normal N at q, i.e., it is the normal
velocity of †.t/. Since the surface is moving, we necessarily have @f =@t ¤ 0.
Consequently, we can locally put the equation of †.t/ in the form
tD .q/; (19.62)
19.4 Hamilton’s Principal Function 373
qt
t
q
Vn
where, in view of (19.61), the function .q/ is a solution of the eikonal equation
1
jr j2 D ahk ;h ;k D : (19.63)
cn2 .q/
1
tD W .q0 ; q/ .q0 ; q/: (19.64)
E
It is evident that † .t/ reduces to the point q0 for t D 0. Further, omitting the
dependency on q0 , from (16.63) it follows that its normal speed cn is given by
p p
E E
cn D D p :
jr .q/j a hk ;h ;
k
@
pD ; (19.65)
@q
qt
q t
Vn
Let q.t/ be a dynamical trajectory starting from the point q0 . The contravariant
components of the vector tangent to this trajectory are .qP h /, whereas in metric
(19.58), the covariant components of this vector are ph D ahk qP h . Comparing this
result with (19.65), we can state that the surfaces † .t/ are orthogonal to the
trajectories sorting from q0 .
Remark 19.4. We have proved that the normal speed cn of a point q 2 † .t/ is
parallel to a dynamical trajectory that, starting from a point q0 , intersects † .t/ at
that point q. However, cn is not equal to the velocity of the dynamical system along
the dynamical trajectory. In fact, the velocity v of a point moving along a dynamical
trajectory has contravariant components .qP h /, so that
O .u; q/ D
.q0 .u/; q/:
19.4 Hamilton’s Principal Function 375
To prove the preceding statement, it is sufficient to recall that the envelope of the
surfaces t O .u; q/ upon varying the parameters .u1 ; : : : ; un1 / can be obtained by
eliminating these parameters in the system
Supposing that (16.69) can be solved with respect to the parameters u1 ; : : : ; un1
u˛ D u˛ .q/; ˛ D 1; : : : ; n 1;
To prove that the function .q/ is a solution of the eikonal equation, we note that
from (19.69) we obtain
@ @ O @u˛ @O @O
h
D ˛ h
C h D h:
@q @u @q @q @q
This result states that, at the point q, the surface t D .q/ has the same tangent
plane as the surface t O .u; q/, which originates from a point q0 2 †0 . Further,
this surface satisfies the eikonal equation, and this is true for .q/.
Chapter 20
Completely Integrable Systems
This chapter contains some of the most advanced topics in analytical mechanics.
Due to the introductory character of the book, the theorems requiring a deep
knowledge of geometry and algebra are partially proved or only stated.
Definition 20.1. Let .M2n ; ; H / be a Hamiltonian system that admits p n
first integrals f1 .x/; : : : ; fp .x/ of class C 1 . We say that these first integrals are
independent if the differentials df1 ; : : : ; dfp are linearly independent at any point
x 2 M2n .
Theorem 20.1. Let M2n be of class C k . If the independent first integrals f1 ; : : : ; fp ,
k
p n, are of class C , k 1, then the subset of M2n
Mf .c/ D fx 2 M2n ; fh .x/ D ch ; h D 1; : : : ; pg; (20.1)
ffh ; fk g D 0; h; k D 1; : : : ; p; h ¤ k: (20.2)
Definition 20.4. A Hamiltonian system .M2n ; ; H /
is a completely integrable
system if it admits n independent first integrals in involution.
We have already discussed (Sect. 18.7) the importance of first integrals in
localizing the dynamical trajectories of a Hamiltonian system .M2n ; ; H /. In fact,
if f .x/ is a first integral of .M2n ; ; H /, then a dynamical trajectory starting from
a point of the manifold f .x/ D c is fully contained in this manifold. This result,
which still holds when the Hamiltonian system possesses p first integrals, pushes
us to determine the structure of Mf .c/ in order to localize the region of M2n in
which the dynamical trajectories of .M2n ; ; H / are contained. In this regard, the
following theorem is fundamental.
Theorem 20.2 (Arnold–Liouville). Let .M2n ; ; H / be a completely integrable
]
Hamiltonian system, and denote by Xfh the Hamiltonian vector fields .df /h , h D
1; : : : ; n. If the level manifold
˚
Mf .c0 / D x 2 M2n ; fh .x/ D ch0 ; h D 1; : : : ; p ; (20.3)
into the Lie algebra of vector fields equipped with bracket of the
corresponding
vector fields. Therefore, from ffh ; fk g D 0 there follows Xfh ; Xfk D 0. Let ht .x/
be a one-parameter group of diffeomorphisms generated by Xfh . Since Mf .c0 / is
compact, the vector fields Xfh are complete and the corresponding groups ht .x/ are
global. Along any orbit of kt .x/ we have that
dfh t
. .x// D LXfk fh D ffh ; fk g D 0;
dt k
so that fh .ht .x// D const. Consequently, if for t D 0 we have fh .k0 .x// D
fh .x/ D ch , then, 8t > 0, we also have fh .kt .x// D ch and the orbit starting
from x 2 Mf .c0 / belongs to Mf .c0 / . In other words, Mf .c0 / is invariant with respect
to all the global groups ht .x/, h D 1; : : : ; n, and the infinitesimal generators Xfh ,
h D 1; : : : ; n, of these groups are tangent to Mf .c0 / .
At this point, we can state that on Mf .c0 / there exist n independent tangent vector
fields Xfh satisfying the conditions Xfh ; Xfk D 0. It is possible to prove that
the only compact n-dimensional manifold on which there are n vector fields with
the foregoing properties is an n-dimensional torus T .c0 / D S 1 S 1 . In the
angular coordinates ' 1 ; : : : ; ' n on T n .c0 / the vector fields Xfh have the coordinate
representation
@
Xfh D Xfkh .c0 / ; (20.6)
@' k
where the components Xfhh are constant. Further, the linear independence of the dif-
ferentials df1 ; : : : ; dfn , after a suitable renumbering of the symplectic coordinates,
leads to put the system
fh .q; p/ D ch ; h D 1; : : : ; n; (20.7)
in the form
q D q;
p D p.q; c/;
at least for c belonging to a suitable interval I.c0 /. Then it is possible to prove that,
in this interval, the preceding system defines compact manifolds T n .c/ that inherit
all the properties of T n .c0 /. In particular, on each torus T n .c0 /, the vector fields Xfh
have the coordinate representation
@
Xfh D Xfkh .c0 / ; (20.8)
@' k
and the dynamical trajectories have the form (20.5). t
u
Remark 20.1. Any Hamiltonian system with one degree of freedom (n D 1) is
completely integrable since the energy integral H.q; p/ D E holds. If n D 2, then
a Hamiltonian system is completely integrable provided that it admits a first integral
f .q; p/ D c independent of the energy integral H.q; p/ D E. In fact, since f is
380 20 Completely Integrable Systems
O
O
c’
Uq
p E
E
E
O E q
q* E
q
E q*
CP z0G cos ;
where P is the weight of S , z0G the abscissa of the center of mass of S along
the gyroscopic axis, and A and C are the moments of inertia of S . From the
independence of the Hamiltonian function of and ' follow the first integrals
p Dc ; p' D c' ;
Generally, the coordinates .'; c/, defined by the Arnold–Liouville theorem, are not
symplectic; in other words, functions (20.5) are not solutions of a Hamiltonian
system. It must also be remarked that the aforementioned theorem proves the
existence of these variables, but it does not specify their relation with the coordinates
.q; p/ in which the dynamical problem is initially formulated. In this section, we
prove the existence in a neighborhood V M2n of the level manifold Mf .c0 / of a
new set of coordinates .; I/, called angle-action variables, such that the following
statements hold.
• Are symplectic coordinates.
• The action variables define the tori.
• The variables are angular variables on the tori.
• The relation between the initial coordinates .q; p/ and the new angle-action
coordinates is known.
To prove the existence of these coordinates, we consider the immersion map
i W T n .c/ ! M2n
: (20.9)
If .q; p/ are local symplectic coordinates in M2n , then the parametric equations of
n
T .c/, i.e., the coordinate formulation of the mapping (20.9), has the form
Theorem 20.3. Let ! be the Liouville 1-form. Then, the restriction i ! to T n .c/
of the differential 1-form ! is closed on T n .c/, i.e.,
di !.u; v/ D 0; (20.12)
u D uh Xfh ; v D vh Xfh :
i .u; v/ D uh vk ffh ; fk g D 0
since the first integrals are in involution, and the theorem is proved. t
u
The results (20.12) and (20.13) imply that
Z
i ! D 0 (20.15)
@
along any closed curve @, which is the boundary of an oriented 2-chain on
a torus.
In the set of curves on T n .c/ we introduce the following equivalence relation:
two curves and 1 are equivalent if they are homotopic.1 We denote by Œ the
equivalence class of all the curves homotopic to .
Let h be a closed curve with parametric equations ' h D s, ' k D ak , if h ¤ k,
where 0 s 2 and 0 ak 2 are constant, and denote by Œh the
equivalence class of all the curves homotopic to h . It is evident that none of the
curves of Œh can be reduced to a point by a continuous transformation.
Theorem 20.4. Let and 1 be two arbitrary curves of Œh . Then
Z Z Z
!D ! !: (20.16)
1 Œh
1
A curve 1 D x1 .s/, s 2 Œa; b, on a manifold Vn is homotopic to a curve D x.s/, s 2 Œa; b,
of Vn if there exists a continuous mapping F W Œa; b Œ0; 1 ! Vn such that F .0; s/ D x.s/ and
F .s; 1/ D x1 .s/, in other words, if 1 reduces to by a continuous transformation.
384 20 Completely Integrable Systems
n n
Taking into account the meaning of the variables .'; c/, the parametric equations of
a torus T n .c/ can be written as
where the variables c determine the torus T n .c/ and the angles ' locate a point of
it. Consequently, a curve 2 Œh will have parametric equations
! D ph dq h , and the action variables will only depend on the quantities c, that is,
on the torus. Formally, we can write that
I D J.c/: (20.20)
20.2 Angle-Action Variables 385
c D c.I/; (20.21)
which allows us to use the action variables to define the torus, instead of the
quantities c, as well as to write the parametric equations of the torus in the equivalent
form
O
q D q.'; I/; (20.22)
O
p D p.'; I/: (20.23)
where x0 ; x 2 T n .c/ and is an arbitrary curve on the torus starting from x0 and
ending at x. Due to Theorem 20.3, integral (20.7), that is, the value of the function
W , does not depend on the curve , provided that can be reduced to a point
by a continuous transformation. In contrast, if contains a curve belonging to the
equivalence class Œh , then we obtain a variation W of the value of W given by
W D Ih : (20.28)
In other words, the generating function W assumes infinite values at any point
of the torus. As a consequence of this property of W , the values of the variables
corresponding to a point of the torus T n .c/ are infinite. The difference k h
between two of these values is
@W @ @Ik
k h D k D k W D D ıkh (20.29)
@Ih @Ih @Ih
386 20 Completely Integrable Systems
On the other hand, a torus is determined by giving the action variables I and the
total energy is constant on a torus. Consequently,
H D H .I/; (20.31)
and the Hamilton equations in the symplectic coordinates .; I/ assume the form
@H
P D ; (20.32)
@I
IP D 0: (20.33)
D .I0 /t C 0 ; (20.34)
I D I0 ; (20.35)
Definition 20.6. Let S D .M2n ; ; H / be a completely integrable Hamiltonian
n
system, and denote by T .I/ a torus corresponding to given values of the action
variables. The resonance module of the vector .I/ 2 <n of the fundamental
frequencies of S is the subset M of Zn such that
Theorem 20.6. If the function F .; I/ is measurable with respect to the Lebesgue
measure, then, for almost all the initial data 0 2 T n .I/,2 the temporal mean exists
and does not depend on t0 .
Definition 20.8. Let F .; I/ be a function on T n .I/ that is measurable with respect
to the Lebesgue measure. Then the spatial mean of F .; I/ is defined as follows:
Z
1
F D F .; I/ d; (20.39)
mis.T n .I// T n .I/
2
That is, except for the points of T n .I/ belonging to a subset with a Lebesgue measure equal to
zero.
388 20 Completely Integrable Systems
frequencies. Suppose that dimM D l < n, i.e., that the fundamental frequencies
confirm l independent linear relations
m˛h
h D 0; ˛ D 1; : : : ; l; h D 1; : : : ; n; (20.40)
where the coefficients m˛h are integer numbers. Then there exists a canonical
transformation .; I/ ! . 0 ; I0 /, to new angle-action variables, in which the
Hamiltonian function is
0
h0 D H 0 IlC1 ; : : : ; In0 : (20.41)
@W 0
0˛ D ; (20.42)
@I˛0
@W 0
0lCˇ D 0 ; (20.43)
@IlCˇ
@W 0
Ih D ; (20.44)
@ h
lCˇ
0˛ D m˛h h ; 0lCˇ D lCˇ ; Ih D m˛h I˛0 C IlCˇ
0
ıh : (20.46)
P 0˛ D m˛h P ˛ D m˛h
h D 0; (20.47)
P 0lCˇ D P lCˇ D
lCˇ : (20.48)
On the other hand, recalling that the new coordinates are symplectic and taking into
account (20.46), we have in these new coordinates the Hamiltonian function
In view of (20.47) and (20.48), the Hamilton equations relative to this Hamiltonian
function become
20.4 Angle-Action Variables of the Harmonic Oscillator 389
@H 0
P 0˛ D ;
@I˛0 D 0
@H 0
P 0lCˇ D 0 D
0lCˇ ;
@IlCˇ
0
@H
IPh0 D 0h D 0;
@
and the theorem is proved. t
u
1 2
H.q; p/ D .p C q 2 /; (20.50)
2
Z Z p Z p
E p E p
I.E/ D pdq D p 2E q 2 p 2E q 2 D 2E: (20.52)
E E
Z Z r
q p q
I
W .q; I / D 2E u2 du D u2 du: (20.53)
0 0
We have already shown that this Hamiltonian system is completely integrable and
that the first integrals fh D ph2 C .h q h /2 D ch define a torus with parametric
equations given by
q p p
c c
ph D ch .h q h /2 ; h q h h ; h D 1; : : : ; n: (20.57)
Consequently, the action variables are defined by the relations
p h
p
Zc= q Z c=h
q
Ih .ch / D ch .h q h /2 ch .h q h /2 ; (20.58)
p p
c=h c=h
which yield
ch
Ih .ch / D ; h D 1; : : : ; n: (20.59)
h
Finally, we have that
s
Zq X
n
h Ih
W .q; I/ D .h uh /2 duh : (20.60)
q0 hD1
Let P be a material point with mass m moving in the space under the action of a
Newtonian force. If in the space we adopt spherical coordinates .r; '; / with its
origin at the center of force, then the Lagrange function of P is
1 2
k
LD rP 3 C r 2 P 2 C r 2 sin2 'P 2 C :
2 r
20.5 Angle-Action Variables for a Material Point in Newtonian Fields 391
Since the coordinate ' is cyclic, the conjugate momentum p' is constant. To prove
that P is a completely integrable Hamiltonian system, we must verify that there is a
third integral that is in involution with p' (Remark 20.1). We can easily obtain this
first integral recalling that in a central force field the total angular momentum KO
with respect to the center of force O vanishes. But KO D r mPr, where r is the
position vector; further, in the spherical coordinates .r; '; /, we have r D rer , rP D
rP er C r sin 'e P , where er , e' , and e are unit vectors along the coordinate
P ' C r e
curves, and we can write the angular momentum as
P ' sin e /:
KO D mr 2 .e (20.63)
It is not difficult to verify that the preceding first integrals are independent;
further, ff1 ; f3 g D ff2 ; f3 g D 0 since f1 and f2 are first integrals and
( )
2
p'2
ff1 ; f2 g D p' ; p C
sin2
3
" !
X @p' @ p'2
2
D p C
hD1
@q h @ph sin2
392 20 Completely Integrable Systems
! #
@ p'2 @p'
h p2 C
@q sin2 @ph
!
@ p'2 @p'
D p2 C D 0:
@ sin2 @p
mk 2
˛r ; (20.73)
2˛2
then the roots of the trinomial 2m˛r r 2 2mkr ˛2 are real and are equal to
2 s 3
k 4 4˛r ˛2
r1;2 D 1 1C 5: (20.74)
2˛r mk 2
Applying the Cartesian rule to the trinomial, we recognize that, if ˛r 0, then one
of the roots (20.74) is positive and the other one is negative. Since r is a positive
quantity, only the root r2 is acceptable. Moreover, the trinomial is positive for r > r2
and the orbit is unbounded. In contrast, if ˛r < 0, then both roots are positive, and
the trinomial is not negative when
0 < r1 r r2 : (20.75)
20.5 Angle-Action Variables for a Material Point in Newtonian Fields 393
mk 2
˛r < 0: (20.76)
2˛2
All the preceding considerations allow us to put the level manifold in the parametric
form
l' D ';
D ;
r D r;
p' D ˛' ;
q
1
p D ˛2 sin ˛'2 ;
j sin j
q
1
pr D 2m˛r r 2 2mkr ˛2 ; (20.77)
r
where
0 ' 2; 0 0 ; 0 < r1 r r2 : (20.78)
3
The preceding formulae show that the level manifold T .˛r ; ˛' ; ˛ / is compact so
that it is a torus. The coordinate curves obtained by varying one of the variables r,
', and are closed curves defining three fundamental cycles Œr , Œ' , and Œ .
The action variables are
Z Z 2
I' D p' d' D ˛' d' D 2 ˛' ;
Œ' 0
s
Z
˛'2
I D p0d D2 R ˛2 d;
Œ Œ 0 sin2
s
Z Z r2
k ˛2
Ir D pr dr D 2 2m ˛r C 2 : (20.79)
Œr r1 r r
It is possible to verify that the preceding relations yield the following expressions
of the action variables:
s
2m
I' D 2 ˛' ; I D 2.˛ ˛' /; Ir D .I C I' / C k ; (20.80)
˛r
I' I C I' 2 2 mk 2
˛' D ; ˛ D ; ˛r D : (20.81)
2 2 .I C I' C Ir /2
394 20 Completely Integrable Systems
4 2 mk 2
P' D P D Pr D ;
.I C I' C Ir /2
IP' D IP D IPr D 0: (20.83)
Since all the fundamental frequencies are equal, from Theorem 20.5 we obtain that
the orbits are closed on the torus and the motions are periodic.
where e
is the wave number of the radiation,
its frequency, c the light velocity in
a vacuum, and
RH D 109;677:576 cm1
is the Rydberg constant. If in (20.84) we set p D 1, we obtain the frequencies of the
Lyman series
1
D RH 1 2 ; q D 2; 3; : : : :
c q
For p D 2 we obtain the Balmer series
1 1
D RH 2 ; q D 3; 4; : : : :
c 2 q
justification of the experimental results, Bohr and Sommerfeld made the following
assumptions:
• An electron rotating about the nucleus of a hydrogen atom cannot move along
an arbitrary orbit compatible with classical mechanics. Further, the electron does
not radiate electromagnetic waves during its accelerated motion on an allowed
orbit, in disagreement with classical electrodynamics.
• Electromagnetic radiation is emitted or adsorbed by an electron only when it
goes from an allowed orbit to another one. More precisely, let E be the energy
difference between the values of energy corresponding to two allowed orbits.
Then, in going from one to the other, the electron radiates electromagnetic waves
with frequency
j Ej
D ; (20.85)
h
where h is Planck’s constant.
At this point it is fundamental to assign a selection rule or a quantization rule
to determine the allowed orbits and then to evaluate the possible values of E.
The rule proposed by Bohr and Sommerfeld refers to completely integrable systems.
Therefore, it can be applied to a hydrogen atom.
Denote by I1 ; : : : ; In the action variables of a completely integrable Hamiltonian
system S . The Bohr–Sommerfeld rule states that the allowed orbits satisfy the
following conditions:
Ih D nh „; nh D 1; 2; : : : ; „ D h=2; (20.86)
Introducing these relations into (20.82), we obtain the possible values of the energy
2 2 mk 2
En D ; n D n' C n C nr : (20.88)
n2 „2
On the other hand, if we denote by e the charge of the electron, we have
e2
kD ;
40
where 0 is the dielectric constant of the vacuum, and (20.88) becomes
2 2 me 4 1
En D : (20.89)
02 h2 n2
396 20 Completely Integrable Systems
In the last section of this chapter, we present a brief introduction to Hamiltonian per-
turbation theory. Our approach does not analyze the correctness of the mathematical
procedures that are used. In this regard, we recall, without proof, the fundamental
KAM theorem (Kolmogoroff–Arnold–Moser), which makes clear the hypotheses
under which the results here presented hold.
The Hamiltonian theory of perturbation has as its aim the analysis of a Hamiltonian
system S that is close to a completely integrable Hamiltonian system S0 . It is based
on the assumption that the motions of S are close to the motions of S0 . In other
words, it is based on the preconception that small causes produce small effects.
Let S0 be a completely integrable Hamiltonian system, and denote by H0 .I0 / its
Hamiltonian function depending only on the action variables I0 of S0 . Henceforth
S0 will be called an unperturbed system. Then we consider another Hamiltonian
system S whose Hamiltonian function has the form
Example 20.6. Let P be a planet (e.g., the Earth) moving under the action both of
the Sun S and a massive planet Pm (e.g., Jupiter). If we suppose that the motion of
Pm is not influenced by P , then the position vector rJ of Pm is a known function of
time. Let q, p be the position vector of P and its momentum, and denote by m D 1,
mJ , and mS the masses of P , Pm , and the Sun, respectively. Then the Hamiltonian
function of P is
1 2 K 1
H.q; p; t/ D jpj K ; (20.93)
2 jqj jq rJ j
We can consider applying this procedure again searching for a new coordinate
transformation .; I/ ! . 0 ; I0 / in which the Hamiltonian function becomes
In other words, the idea underlying the Hamiltonian perturbation theory consists
in presupposing the existence of a sequence of completely integrable systems
whose motions increasingly approximate the motions of the perturbed nonintegrable
system S . After making clear the aim of the theory, we show how to determine
the first transformation . 0 ; I0 / ! .; I/ without analyzing all the mathematical
difficulties encountered. First, we search for a generating function S. 0 ; I/ of
such a coordinate transformation. Since this function must generate an identity
transformation when D 0, we set
n
X
S. 0 ; I/ D 0h Ih C S1 . 0 ; I/ C : (20.96)
hD0
@S1
I0h D Ih C C ; (20.97)
@0h
@S1
h D 0h C C ; (20.98)
@Ih
398 20 Completely Integrable Systems
H1 . 0 ; I0 / D H1 . 0 ; I/ C O. 2 /: (20.100)
Introducing these relations into (20.94) we obtain the Hamiltonian function in terms
of the new variables
n
!
X @S1
H0 .I/ C
h .I/ C H1 . 0 ; I/ C O. 2 /; (20.101)
@0h
hD1
where we have introduced the fundamental frequencies [see (20.35)] of the unper-
turbed system S0
@H0
h .I/ D : (20.102)
@I0h I0 DI
In view of this condition, we see that (20.94) is confirmed if S1 . 0 ; I/ is a solution
of the fundamental equation of the first-order perturbation theory:
n
X @S1
h .I/ . 0 ; I/ D H1 .I/ H1 . 0 ; I/: (20.103)
@0h
hD1
This equation contains two unknowns: the generating function S1 . 0 ; I/ and the
function H1 .I/. To determine the former function, we start by noting that to the
values . 0 ; I/ and . 0 C 2; I/ corresponds the same point of the phase state M2n .
Consequently, all the real functions on M2n are periodic functions of the variables
0 with the same period 2. In turn, this property implies that any function on
M2n admits a Fourier expansion in the variables 0 (Appendix B). In particular, the
Fourier expansion of S1 starts with the constant mean value. Since the derivatives of
this value with respect to the variables 0 are zero, the mean value of the left-hand
side of (20.103) vanishes and we have that
where < H1 . 0 ; I/ > is the mean value of the perturbation. Introducing the new
function
HO . 0 ; I/ D H1 . 0 ; I/ < H1 . 0 ; I/ >; (20.105)
Eq. (20.103) becomes
n
X @S1
h .I/ . 0 ; I/ D HO 1 . 0 ; I/: (20.106)
@0h
hD1
20.7 A Sketch of the Hamiltonian Perturbation Theory 399
From these relations we derive the following expressions of the coefficients sk .I/ of
expansion (20.108):
ihk .I/
sk .I/ D : (20.109)
.I/ k
This expression generates some perplexities regarding the convergence of
series (20.108). In fact, if the fundamental frequencies are linearly dependent, then
there exists at least one vector k such that
k D
1 k1 C C
n kn D 0;
and some denominators of (20.109) may vanish. Even if the fundamental frequencies
are linearly independent, some denominators of (20.109) may assume values arbi-
trarily close to zero. For instance, suppose that we are concerned with a Hamiltonian
system with two degrees of freedom
p such that the fundamental frequencies assume
the values
1 D 1 and
2 D 2. Setting
p
ak1 D 2k1 C k2
we obtain
p k2
2 D C a:
k1
400 20 Completely Integrable Systems
1 2 1
H.q; p/ D .p C q 2 / C q 4 ; (20.110)
2 4
H.0 ; I0 / D
I0 C
2 I02 sin4 0 H0 .I0 / C H1 .0 ; I0 /; (20.111)
where 0 .
The mean value of the perturbation term is [see (20.104)]
Z
1 2 2 3 2 2
< H.0 ; I / >D
I sin4 0 d0 D
I ; (20.112)
2 8
Let us return to action-angle coordinates. Theorem 20.5 shows that the flow on each
torus crucially depends on the arithmetical properties of the fundamental frequen-
cies . There are essentially two cases: (1) If the frequencies are nonresonant,
or rationally independent on a torus, then, on this torus, each orbit is dense and
the flow is ergodic. (2) If the frequencies are resonant, or rationally dependent,
then the torus decomposes into an l-parameter family of invariant tori. Each orbit
is dense on such a lower dimensional torus, but not in T n .I/. A special case arises
when there exist l D n 1 independent resonant relations. Then each frequency
1 ; : : : ;
n is an integer multiple of a fixed nonzero frequency, and the whole torus is
filled by periodic orbits with the same period. Can we state that a small perturbation
of an integrable system does not substantially modify the preceding description of
an integrable system? The remarks at the end of the preceding section engender
suspicions about the possibility of approximating the behavior of a perturbed system
by the behavior of suitable completely integrable systems. These suspicions are
strengthened by the following considerations.
20.8 Overview of KAM Theorem 401
hWD! (20.115)
where
˚
A;
D h1 .C;
/ D I0 W .I0 / 2 C;
: (20.118)
It can be proved that the set A;
has a very complex structure: it is closed,
perfect, nowhere dense, and a Cantor set with positive measure.3
Now we can state the KAM theorem.
Theorem 20.10 (KAM). Let S be a perturbed Hamiltonian system with
Hamiltonian function H.I; ; / analytic and nondegenerate. Denote by
> n 1
e given constants. Then a positive constant c . / exists such that, 8 I 2 A;
, all
the tori T n .I/ are invariant, though slightly
p deformed. Moreover, they fill the region
D T n .I/ up to a set of measure O. /.
Remark 20.3. Since A;
has no interior points, it is impossible to state whether an
initial position falls onto an invariant torus or into a gap between such tori. In other
words, theorem KAM states that p the probability that a randomly chosen orbit lies
on an invariant torus is 1 O. /.
3
An example of Cantor’s set can be obtained as follows. Start with the unit interval Œ0; 1 and choose
0 < k < 1. Remove the middle interval of length k=2. The length of the two remaining intervals is
1k=2. From each of them remove the middle interval of length k=8. The four remaining intervals
will have a total length of 1 k=2 k=4. From each of them remove the middle interval of length
k=32. The remaining length will be 1 k=2 k=4 k=8, and so on. After infinitely many steps,
the remaining set will have the length
21.1 Introduction
p D p.; V /: (21.1)
In a more complex system, for instance a mixture of two different gases S1 and S2 ,
(21.1) is substituted by the more general state equation
where c is the concentration of one of the gases S1 constituting the mixture, i.e.,
the ratio between the mass m1 of S1 and the total mass m of the mixture. We recall
that the concentrations c1 and c2 of the two gases confirm the condition c1 C c2 D
1. In any case, we can state that, from a macroscopic point of view, the uniform
1
For the contents of this chapter see [10, 31, 50, 53].
In the kinetic theory of the equilibrium of gases, the microscopic system S is formed
by N molecules incessantly moving inside a container having a volume V . Each
molecule, with f degrees of freedom, freely moves through the volume V until
it encounters another molecule or hits the walls of the container. Any collision is
supposed to be elastic, and the mean path between two subsequent collisions is
called a free mean path. Further, a collision among more than two molecules
is considered to be highly improbable, and the number of collisions per unit of time
is supposed to be very large. The preceding considerations lead us to formulate the
hypothesis of molecular chaos:
• At macroscopic equilibrium, all molecules of S have the same probability of
moving with a given velocity and of occupying an elementary volume dV V .
• All possible velocities are equiprobable for a given molecule.
For the sake of simplicity we consider the case f D 3, i.e., we suppose that the
molecules are material points. From the hypothesis that, at equilibrium, the density
of the gas S is uniform there follows that the number dN.x/ of molecules contained
in an elementary volume having a vertex at x 2 V and edges dx, dy, and dz is given
by the formula
N
dN.x/ D dx dy dz: (21.3)
V
Besides the physical space V , we consider a three-dimensional Euclidean space
<3 in which we introduce a Cartesian frame of reference Ovx vy vz . A possible
velocity of a molecule is given by the position vector v of a point P of <3 relative
to O. It is evident that the state of a single particle is given by a point V <3 .
!
Let dN.v/ be the number of molecules with velocity OQ D v, where Q belongs
21.2 Kinetic Theory of Gases 405
to the parallelepiped having a vertex at P and edges dvx , dvy , and dvz . Then, we call
the function g.v/ a velocity distribution function such that
u C v D u0 C v0 ; (21.5)
2 2 02 02
u Cv D u Cv : (21.6)
The total number of collision per unit time is proportional to the product g.u2 /g.v2 /.
Similarly, the inverse collision process, in which two molecules, after colliding with
speeds u0 and v0 , have speeds u and v, has a probability proportional to the product
g.u02 /g.v02 /. But at equilibrium the number of the first kind of collision must be
equal to the number of the second type of collision to obtain a complete mixing as
required by the hypothesis of molecular chaos. Therefore, we have that
After differentiating this relation with respect to u2 , we put u2 D 0 into the result
and obtain the condition
g 0 .v2 / g 0 .0/
D ˛: (21.8)
g.v2 / g.0/
An elementary integration of this equation leads to the formula
2
g.v2 / D Ae˛v ; (21.9)
where A and ˛ are integration constants. To determine these constants, we note that
(21.4), in view of (21.9), assumes the form
2
dN.v/ D NAe˛v dvx dvy dvz :
We note that the integral on the right-hand side of (21.10) is convergent when ˛ > 0.
Further, from the well-known formulae
Z 1=2 Z
1
˛x 2
1
2 1 1=2
e dx D ; x 2 e˛x dx D (21.11)
1 ˛ 1 2˛ ˛
we obtain
˛ 3=2
AD ;
so that we can write
˛ 3=2 2
dN.v/ D N e˛v dvx dvy dvz ; (21.12)
where the constant ˛ is still to be determined.
Owing to (21.12), we can state that the mean value f .v/ of any function f .v/ in
the velocity space is given by the formula
Z ˛ 3=2 Z
1 2
f .v/ D f .v/dN.v/ D f .v/e˛v dvx dvy dvz : (21.13)
N <3 <3
z
δ P
x
vx
p
y
On the other hand, from the macroscopic theory of perfect gases, we know that
the total energy E is given by the formula
3
ED N k; (21.15)
2
where k is the Boltzmann constant and the absolute temperature of the gas.
Since D E=N , by comparing (21.15) and (21.14), we determine the value of
the constant ˛:
m
˛D ; (21.16)
2k
and (21.12) assumes the final form
m 3=2 m 2
dN.v/ D N e 2k v dvx dvy dvz : (21.17)
2k
Now we prove that (21.17) leads to the state equation of perfect gases. To prove
this statement, it is sufficient to evaluate the pressure of the gas on the walls of
the container. Microscopically, this pressure is due to the momentum adsorbed
by the walls during the many collisions of molecules with the walls themselves.
We suppose that these collisions are elastic and consider a plane wall s orthogonal
to the Ox-axis. Let dZ.v/ be the number of collisions per unit time and unit surface
!
of s of the molecules with speed v D OP , with vx > 0 and P belonging to the
parallelepiped ı, which has a unit face on s and height vx (Fig. 21.1). It is evident
that all the molecules contained in ı at instant t reach the wall s in unit time so that
dZ.v/ is equal to this number. Since the volume of ı is vx , we obtain
vx
dZ.v/ D dN.v/: (21.18)
V
408 21 Elements of Statistical Mechanics of Equilibrium
On the other hand, in each collision a particle gives the wall s the momentum 2mvx ,
and then the total momentum transferred to the wall by all the molecules contained
in ı with vx > 0 is
dN.v/
2mvx dZ.v/ D 2mv2x : (21.19)
V
Consequently, the total momentum transferred to the wall s by all the particles
contained in ı with any possible velocity v for which vx > 0 is obtained by
integrating (21.19):
˛ 3=2 2mN Z 1 2
Z 1
2
Z 1
2
pD e˛vy dvy e˛vz dvz v2x e˛vx dvx : (21.20)
V 1 1 0
Since S is contained in a reservoir with adiabatic walls, its energy H.q; p/ has
a constant value E. Then, the admissible states of S are represented by the points
of the level manifold †E , defined by the equation H.q; p/ D E, which is supposed
to be compact. Owing to the large value of N , we have no possibility either of
determining the initial data of the particles of S or of integrating the Hamiltonian
equations of S. Once again, we resort to statistical assumptions to have information
about the evolution of S.
In view of our choice of the symplectic coordinates of M2nN , to any point
.q; p/ 2 M2nN we can associate N points .qh ; : : : ; qhCn ; ph ; : : : ; phCn /, h D
1; : : : ; N , belonging to the phase space 2n . These points represent the states of
each particle when the whole system is in the state .q; p/. When this state varies on
†E , the corresponding points of 2n vary in a set 2n , which is bounded since
†E is compact. Let B be a cube containing the set and consider a partition P of
B into elementary cells Ci , i D 1; : : : ; L, having edges ! and volume ! 2n . We
suppose that
1. Any molecule of S can be singled out with respect to the others;
2. The length ! of the edge of any cell Ci is small with respect to the edge of
the cube B, so that we can suppose that all the particles inside Ci have the same
energy, but it so large that any cell contains many points of .
The first assumption states that it is possible to attribute a target to any
molecule so that we can count them, distinguishing each particle from the others.
The following considerations show the relevance of the second assumption.
Suppose that TNi particles of S are in the states represented by Ni points
belonging
T to C i , i D 1; : : : ; L. Remembering the meaning of the points of
Ci , the numbers Ni must satisfy the following relations:
N1 C C NL D N; (21.23)
N1 1 C C NL L D E: (21.24)
Equating to zero the partial derivatives of (21.29) with respect to the variables Ni ,
we obtain the system
ln Ni C 1 C ˛ C ˇi D 0; i D 1; : : : ; L; (21.30)
where the constants ˛ and ˇ can be obtained from (21.23) and (21.24). We omit the
proof that the extremum (21.29) is a maximum.
Following Boltzmann and Gibbs, we state that S evolves in such a way
that its states are always very close to the state of S determined by the most
probable distribution N1 ; : : : ; NL . This assumption is supported by the following
considerations.
Suppose that a distribution N1 ; : : : ; NL of particles into the cells C1 ; : : : ; CL is
given, satisfying conditions (21.23) and (21.24). Then, since the coordinates of all
the particles are assigned, a point of †E is determined that belongs to the e cell
of M2nN intersecting †E and having a volume ! 2nN . It is important to remark
21.3 Boltzmann–Gibbs Distribution 411
that by changing the particles into the cells C1 ; : : : ; CL without changing their
number, we change their coordinates obtaining a different cell Ci of volume ! 2nN .
In conclusion, to all distributions corresponding to N1 particles into C1 , N2 particles
in C2 , and so on, we can associate a family of cells of M2nN with a volume
.N1 ; : : : ; NL / D.N1 ; : : : ; NL /
D : (21.34)
.N1 ; : : : ; NL / D.N1 ; : : : ; NL /
Considering the logarithm of both sides and the Taylor expansion of the right-hand
side, we obtain
X L
@
ln D D ln D C ln D .Ni Ni /
i D1
@H i
L
1X @2
C ln D .Ni Ni /.Nh Nh /: (21.35)
2 i D1 @Ni @Nh
Finally, taking into account conditions (21.23) and (21.24), we obtain the condition
2
PL PL
12 1 2
i D1 N .Ni Ni / 12 i D1 Ni
Ni
Ni
1
D e i De ; (21.36)
412 21 Elements of Statistical Mechanics of Equilibrium
which shows that the ratio = is very close to zero for any distribution
N1 ; : : : ; NL different from N1 ; : : : ; NL . The following remark shows the interest
of the preceding result. We know that the orbit of the system S lies on the manifold
†E M2nN . In view of (21.36), we can state that this orbit is almost contained in
those cells Ci corresponding to distribution (21.33).
Multiplying (21.33) by ! 2n , in the limit N ! 1, we obtain
which allows us to evaluate the mean value f of any function f .q; p/:
Z
1
f DR ˇ.q;p/ dqdp
f .q; p/eˇ.q;p/ dqdp: (21.40)
2n e 2n
[see (17.107)], where the quantities q i are the coefficients of the kinetic energy
evaluated in the equilibrium configuration and i > 0 denotes the principal
frequencies. Consequently, the Hamiltonian function is
n
1X 1 i 2 i 2
P D
H.q; p/ .pP / C i .q / : (21.42)
2 i D1 ai
D nk: (21.43)
The first integral on the right-hand side of the preceding equation can be transformed
as follows:
2
Z n 1 Pn
1 X ph2 2k
pi i 2
i D1 ai C i .q /
A e dqdp
2n 2 ah
hD1
n Z
X 2
Y Z n Z
Y
1 ph2 2ka
ph
ip2
2ka i .qi /2
D e h dph e i dpi e 2k dqi : (21.46)
2 < ah < i D1 <
hD1 i ¤h
Finally, by (21.11), we can give the preceding equation the final form
A n
D k:
Pn pi2 2
R 1
2k i D1 ai C i .q i /2
2n e dqdp
The theorem is proved repeating the same calculations for the second integral on the
right-hand side of (21.45). t
u
The preceding theorem allows us to calculate the specific heat c of gases. In fact,
in view of (21.43), the total energy E of a volume of gas containing N molecules
with n degrees of freedom is given by
E D N nk; (21.47)
s Z t2
X
KO D .Pi O/ Fi dt; (22.3)
i D1 t1
2˛ I
xP D D ; x D 0: (22.9)
m m
22.2 Fundamental Problem of Impulsive Dynamics 417
Impulsive forces, as ordinary forces, can be active and reactive. In the first case,
they are real impulsive forces applied to B at an exceptional instant. In the second
case, they are produced by the introduction of new constraints at an exceptional
instant. An example of a reactive impulsive force is a ball bouncing on the floor.
From (22.2) and (22.3) and assumptions 1 and 2 we can write the balance
equations of impulsive dynamics of a rigid body B in the following form:
s
X
mPrG D Ii ; (22.10)
i D1
s
X
KO D .Pi O/ Ii ; (22.11)
i D1
where Ii , i D 1; : : : ; s, are the active and reactive impulses acting on the body B at
an exceptional instant.
For a system S of N rigid bodies Bi , i D 1; : : : ; N , we must apply the preceding
equations to any solid Bi , recalling that the active and reactive impulses acting on
Bi are external to Bi . The reactive impulses can be external to S or determined by
other solids of S. In this last case, they satisfy the action and reaction principle.
In conclusion, if S is a constrained system of N rigid bodies B1 ; : : : ; BN , then
for each body we can write
si
X
mPrGi D Iij Ii ; (22.12)
j D1
si
X .I/
KOi D .Pj Oi / Iij MOi ; (22.13)
j D1
.I/
i D 1; : : : ; N , where Ii and MOi denote, respectively, the total impulse and the total
impulse torque of the active and reactive impulses acting on Bi .
The first assumption simplifies the analysis of the problem we are faced with.
The second assumption implies that the reactive impulse acting at the point Ph 2 Bi
is orthogonal to the boundary of the region Ci occupied by Bi at the exceptional
instant. Regarding the third assumption we note that an example of a unilateral
constraint is given by the collision of a body of S with an external obstacle or with
another body of S. A bilateral constraint can be introduced by fixing one or two
points of a body of S.
With the preceding assumptions we can formulate the fundamental problem of
impulsive dynamics of constrained rigid bodies as follows:
Given the position of S , its anterior velocity field of S, the active impulses acting
on S, and the smooth constraints at an exceptional instant, determine the posterior
velocity field of S.
To solve this problem we have at our disposal:
1. Equations (22.12) and (22.13) applied to each body Bi of S;
2. The restrictions on the reactive impulses deriving from the hypothesis that the
constraints are smooth.
Simple examples (see Exercises 1–3 at the end of the chapter) show that the
preceding data are not sufficient to determine the posterior velocity field when
unilateral constraints, i.e., when there are collisions between bodies of S or between
a body of S and an external obstacle, are present at an exceptional instant. To solve
the fundamental problem of the impulsive dynamics of rigid bodies, we need to
add Newton’s laws on collisions between solids. To formulate these laws, consider
two solids Bi and Bj of S and denote by Pi and Pj the contact points of the
bodies Bi and Bj colliding at an exceptional instant. Further, we denote by ni the
internal normal vector to the boundary of Bi at the point Pi and by nj the internal
normal vector to the boundary of Bj at the point Pj (Fig. 22.1). It is evident that
ni D nj . Finally, we denote by rP i and r PC
i the anterior and posterior speeds of
Pi , respectively, that is, the velocities of Pi before and after the exceptional instant.
Similarly, we denote by rP PC
j and r j the speeds of Pj before and after the collision.
The two quantities
w r
ij D ni .P P
j r i /; wC rC
ij D ni .P PC
i r j / (22.14)
22.3 Lagrangian Formulation of Impulsive Dynamics 419
are called the speed of approach and the speed of separation of the two solids Bi
and Bj , respectively. Then, Newton’s law states that:
There exists a positive number eij 2 Œ0; 1, called the coefficient of restitution
and depending on the physical constitution of Bi and Bj , such that
wC
ij D eij wij : (22.15)
Let u.r/ be an arbitrary vector field depending on the position vector r of a point
belonging to region C . Denote by
where R is the resultant vector of the field u.r/ and MO its momentum with respect
to O. It is evident that if u.r/ is only defined at a finite set of points P1 ; : : : ; Ps of C ,
then the preceding formula is valid provided that the integral is substituted with the
summation over index i of Pi . To prove (22.15), it is sufficient to introduce (22.16)
into the left-hand side of (22.17) to obtain
Z Z
ırO u.r/ dC C ı' .r rO / u.r/ dC;
C C
This result allows us to conclude that (22.12) and (22.13) are equivalent to the
condition
XN Z N
X
i Pr ır dC Ii ıri D 0; (22.20)
i D1 Ci i D1
1
We explicitly note that these displacements could not satisfy the constraints of S.
22.4 Analysis of the Lagrange Equations 421
where
N
X @r
Ih D Ii (22.23)
i D1
@q h
are the Lagrangian components of the impulses.
Since with (17.41) we proved that
@r @Pr
D h;
@q h @qP
we can write
N Z
X @Pr @T
i Pr dC D h D ph ; (22.24)
i D1 Ci @qP h @qP
The impulses acting on the system S at the exceptional instant t are active and
reactive. Further, the reactive impulses can be determined by bilateral or unilateral
constraints. In other words, we can decompose Ih into the summation
.rb/ .ru/
Ih D Iha C Ih C Ih ; (22.26)
.a/
where Ih , h D 1; : : : ; n, are the Lagrangian components of the active impulses,
.rb/
Ih are the Lagrangian components of the reactive impulses produced by bilateral
.ru/
constraints, and Ih are the Lagrangian components of the reactive impulses
generated by unilateral constraints. On the other hand, if the constraints are smooth,
.rb/
Ii is orthogonal to ıri , i D 1; : : : ; N , and all the corresponding Lagrangian
components vanish. Consequently, system (22.25) reduces to
.a/ .ru/
ph D Ih C Ih ; h D 1; : : : ; n: (22.27)
we can conclude that (22.27) solve the fundamental problem of impulsive dynamics,
at least in the absence of unilateral constraints, i.e., in the absence of collisions.
We want to show that we can solve this problem also in the presence of unilateral
smooth constraints, provided that we add Newton’s laws of collision to the Eqs.
(22.27). For the sake of simplicity, we suppose that only the bodies Bi and Bj of the
system S collide at the instant t, when they occupy regions Ci and Cj , respectively.
Denote by Pi 2 Ci and Pj 2 Cj the collision points. Then, since the constraints are
smooth, the reactive impulses at the contact points are parallel to the internal unit
normal vectors ni and nj to the surfaces of Ci , respectively. Further, these impulses
verify the action and reaction principle and ni D nj . In view of these remarks, we
can write that
.ru/ .ru/ @ri .ru/ @rj .ru/ @ri @rj
Ih D Ii ni h C Ij nj h D Ii ni : (22.29)
@q @q @q h @q h
Introducing (22.29) into (22.27), we obtain the system
.a/ .ru/ @ri @rj
phC D ph C Ih C Ii ni ; h D 1; : : : ; n: (22.30)
@q h @q h
System (22.30) shows that the collision between Bi and Bj introduces the only
.rv/
unknown Ii . Therefore, we need an equation to equate the equations and the
unknowns. In view of the Lagrangian formula of the Lagrangian velocity,
@r h
rP D qP ; (22.31)
@q h
(22.14) and (22.28), Newton’s law of collision (22.15) leads to the following
equation:
@ri @rj hk @rj @ri
ni a pkC D cij ni ahk pk ; (22.32)
@q h @q h @q h @q h
22.5 Exercises
G O P x
We choose the Cartesian axes as in Fig. 22.2. First, we solve the problem by
cardinal equations that, with our notations, give
mxP G D Ix ;
myPG D Iy ;
C'P D rIy ;
where m is the mass of S , C the moment of inertia relative to the Oz-axis, and
r the radius of S . Since xP G D yPG D 'P D 0, from the preceding equations we
obtain
Ix
xP GC D ;
m
Iy
yPGC D ;
m
rIy
'P C D :
C
2. Solve the preceding problem using the Lagrangian formalism.
We choose as Lagrangian coordinates xG , yG , and ', where xG and yG are
Cartesian coordinates with the origin at a point O ¤ G. Then the position vector
of any point of S is
where i and j are the unit vectors along the Cartesian axes and 0 r.
Further, the kinetic energy of S can be written as
1 2 1
T D m xP G C yPG2 C C 'P 2 : (22.34)
2 2
First, verify that
y
l
S
.
rP-
r
n x
G O P
ru
I
Applying (22.27), we obtain the same equations we derived from the balance
equations.
3. Let S be a homogeneous circular disk of radius r and mass m moving on a
horizontal smooth plane . Suppose that at instant t, disk S collides with a
vertical plane whose intersection with is the straight line l. Determine the
posterior velocity field knowing the velocity field before the collision.
We refer to Fig. 22.3. The balance equations of impulsive dynamics give
I .ru/
xP GC D xP G ; yPGC D yPG ; 'P C D 'P : (22.35)
m
On the other hand, Newton’s collision law (22.15) gives
C
n r
P D cn rP ; (22.36)
where c is the coefficient of restitution relative to the collision between S and the
straight line l. Denoting by i and j the unit vectors along the Ox- and Oy-axes,
respectively, and recalling that
22.5 Exercises 425
rP P D xP G i C yPG j C 'k
P .xG i C yG j/;
xP GC D c xPG : (22.37)
By this equation and (22.35), we solve the problem of determining the posterior
speed of S .
4. Solve Problem 3 adopting the Lagrangian formalism.
Introduce a new Cartesian frame Oxy, O ¤ G, with axes parallel to the axes
adopted in Fig. 22.4. Then, xG , yG , and ' are Lagrangian coordinates for S and
the position vector r of the points of S and the kinetic energy T are given by
(22.33) and (22.34), respectively. Consequently, we have that
On the other hand, since n D i and the straight line l is at rest, the Lagrangian
components of the reactive impulse I.ru/ D I .ru/ n, due to the unilateral
constraint, are
.ru/ .ru/ @r
Ix G D I i D Ix ;
@xG P
.ru/ .ru/ @r
IyG D I i D 0;
@yG P
.ru/ .ru/ @r
I' D I i
@' P
D I .ru/ i . sin 'i C cos 'j/P D 0:
y
.
S rP-
r .
rP-
n x
G O
I ru G
P P
1
For the contents of this chapter see [43, 44, 51].
@x @2 x
v De
v.X; t/ D ; a De
a.X; t/ D : (23.4)
@t @t 2
Because of the invertibility assumption of (23.1), the preceding fields can be
represented in the Eulerian form
v D v.x; t/ D e
v.x.X; t/; t/; a D a.x; t/ D e
a.x.X; t/; t/: (23.5)
P D @ Cvr ; (23.8)
@t
and we can say that the material derivative in the spatial representation contains two
contributions: the first one is a local change expressed by the partial time derivative
and the second one is the convective derivative.
Relevant features of motion are often highlighted by referring to particle paths
or to streamlines. For this reason we introduce the following definitions.
The vector field v.x; t/, at a fixed time instant t, is called a kinetic field.
A particle path is the trajectory of an individual particle of S . In Lagrangian
terms, particle paths can be obtained by integration of (23.4):
Z t
x D x0 C vQ .X; t/dt: (23.9)
0
If the velocity field is expressed in the Eulerian form v D v.x; t/, the
determination of particle paths requires the integration of a nonautonomous system
of first-order differential equations:
dx
D v.x; t/: (23.10)
dt
A streamline is defined as the continuous line, at a fixed instant t, whose tangent
at any point is in the direction of velocity at that point.
Based on this definition, streamlines represent the integral curves of the kinetic
field, i.e., the solution of the autonomous system
dx
D v.x; t/; t D const; (23.11)
ds
23.2 Velocity Gradient 429
where s is a parameter along the curve. We note that two streamlines cannot
intersect; otherwise, one would face the paradoxical situation of a velocity with
two directions at the intersection point.
The motion is defined as stationary if
@
v.x; t/ D 0; (23.12)
@t
which is equivalent to saying that all particles of S that during the time evolution
cross the position x 2 C have the same velocity.
Since in the stationary motion the right-hand sides of both (23.11) and (23.12)
are independent of t, the two systems of differential equations are equivalent to each
other, so that the particle paths and streamlines coincide.
Two relevant kinematic tensors will be used extensively. The first one is a symmetric
tensor, defined as the rate of deformation or stretching
1 @vi @vj
Dij D C D Dj i ; (23.13)
2 @xj @xi
and the second one is a skew-symmetric tensor, defined as a spin or vorticity tensor
1 @vi @vj
Wij D D Wj i : (23.14)
2 @xj @xi
By means of these two tensors, the spatial gradient of velocity can be conveniently
additively decomposed into
rv D D C W: (23.15)
By recalling the definition of differential
@vi 1 @v2
ai D C 2ihj !h vj C ;
@t 2 @xj
and, referring to (23.17), we obtain the relation
@v 1
aD C .r v/ vC rv2 : (23.19)
@t 2
Moreover,
@vh @vh @xj
D
@XL @xj @XL
and, since each determinant can be written as J @vi =@xi (no summation on i ), (23.19)
is proved.
23.3 Rigid, Irrotational, and Isochoric Motions 431
D D 0: (23.21)
Proof. To prove that (23.20) is a necessary condition, we start from the velocity
field of a rigid motion vi .x; t/ D vi .x0 ; t/ C ij l !j .t/.xl x0l /. Then the velocity
gradient
@vi
D ij k !j
@xk
is skew-symmetric, and we obtain that
@vi @vk
2Dik D C D ij k C kj i !j D 0:
@xk @xi
To prove that D D 0 is a sufficient condition, by (23.16) and (23.17), we have that
@vi
D Wij ; Wij D Wj i : (23.22)
@xj
System (23.21) of nine differential equations with the three unknown functions vi .x/
can be written in the equivalent form
so that (23.21) have a solution if and only if we can integrate the differential
forms (23.22). If region C of the kinetic field is simply linearly connected, a
necessary and sufficient condition for (23.22) to be integrable is
@Wij @Wih
D :
@xh @xj
By cyclic permutation of indices, two additional conditions follow:
@Whi @Whj @Wj h @Wj i
D ; D :
@xj @xi @xi @xh
Summing up the first two, subtracting the third one, and taking into account (23.21)2,
we obtain the condition
@Wij
D 0;
@xh
which shows that the skew-symmetric tensor Wij does not depend on spatial
variables and eventually depends on time. Then, integration of (23.22) gives
vi .x; t/ D v0i .t/ C Wij .t/.xj x0j /;
v D r'; (23.25)
and, since the volume c.0/ is fixed, differentiation and integration can be exchanged
and, because of (23.19), it holds that
Z Z
J r v dc D r v dc D 0; 8 c.t/ C.t/:
c.0/ c.t /
r v D 0: (23.26)
To derive the mass conservation law for a continuous system S , the basic assumption
is introduced that the mass of S is continuously distributed over the region C.t/ it
23.4 Mass Conservation 433
occupies at instant t. In other words, we assume the existence of a function .x; t/,
called the mass density, which is supposed to be of class C 1 on C.t/.
According to this assumption, if any arbitrary region c.0/ 2 C.0/ of S is mapped
onto c.t/ during motion x.X; t/, then the mass of c.0/ at instant t is given by
Z
m.c.0// D .x; t/ dc; (23.28)
c.t /
and the mass conservation principle postulates that, during motion, the mass of any
material region does not change over time
Z
d
.x; t/ dc D 0: (23.29)
dt c.t /
The mass conservation law for an arbitrary fixed volume v assumes the form
Z Z
d
.x; t/ dc D .x; t/ v Ndc: (23.30)
dt v @v
Starting both from (23.29) and (23.30), when the density function is continuous
in the region C.t/, we obtain that mass conservation is expressed by the following
local equation:
@
C r .v/ D P C r v D 0: (23.31)
@t
Before formulating the dynamical laws of a perfect fluid according to Euler, we
prove the fundamental transport theorem:
Theorem 23.2. If .x; t/ is a C 1 Eulerian field defined on the material region C.t/
occupied by the continuous system S , then the following result holds:
Z Z
d
dc D P dc; (23.32)
dt c.t / c.t /
Taking into account (23.19) and (23.31), the preceding equation gives
Z Z Z Z
d
dc D .J r v C J P C J P /dc D J P dc D P dc;
dt c.t / c.0/ c.0/ c.t /
In this section we introduce a model, due to Euler, that gives a good description
of the behavior of gases and liquids, at least under suitable kinematic conditions
clarified in the sequel. We do not present this model starting from the modern
general principles of continuum mechanics since we prefer to follow a historical
approach.
We start with the first fundamental assumption on which Euler’s model of a
perfect fluid is based. Let @c.t/ be the boundary of an arbitrary region c.t/ C.t/,
where C.t/ is the region occupied by the fluid S at instant t, and let n be the external
unit normal to @c.t/. Then the total force F exerted by the fluid external to c.t/ on
the fluid contained in c.t/ is given by
Z
F@c.t / D p.x/nd; (23.33)
@c.t /
where the pressure p depends only on the point x 2 @c.t/. In other words, (23.33)
states that Pascal’s law also holds in dynamical conditions. Besides the surface
force (23.33), it is assumed that there exist forces called body forces that are
distributed over the volume C.t/. Denoting by b the force per unit mass, the total
volume force acting on c.t/ is expressed by the integral
Z
Fc.t / D b dc: (23.34)
c.t /
In particular, if the only volume force is the weight, then b D g, where g is the
gravity acceleration.
The second fundamental assumption of fluid dynamics states that the balance
equations of mechanics hold for any material volume c of the fluid.
Taking into account (23.33), we can write the balance equations of a fluid in the
form
Z Z Z
d
v dc D pn d C b dc; (23.35)
dt c.t / @c.t / c.t /
Z Z
d
.x x0 / v dc D .x x0 / pn d
dt c.t / @c.t /
Z
C .x x0 / b dc; (23.36)
c.t /
Similarly, assuming that the pole x0 is fixed in the frame of reference R relative
to which we evaluate the motion of the fluid S , the i th component of (23.36) gives
the local condition
@p
ij h .xj x0j /ah D ij h ıj h p ij h .xj x0j /
@xh
Cij h .xj x0j /bh ; (23.38)
P D r v; (23.39)
vP D rp C b: (23.40)
r v D 0; (23.41)
1
vP D rp C b: (23.42)
In this case, we obtain a system of two nonlinear partial differential equations in the
unknowns v.x; t/, and p.x; t/. In other words, one liquid differs from another only
in its density.
When the fluid S is compressible, i.e., if S is a gas, then the balancing between
the equations and unknowns is obtained giving a constitutive equation relating the
pressure to the density of the gas:
p D p./: (23.43)
where R is the universal gas constant and the absolute temperature. For a van der
Waals gas S
R a
pD ; (23.45)
M v b v2
where M is the molar mass of S , v D 1= is the specific volume, and a and b are
two constants depending on the gas.
In this section, we consider conservative body forces, i.e., forces for which a
potential energy U.x/ exists such that
b D rU.x/: (23.46)
rp D rU.x/; (23.47)
where is constant for a liquid and a given function p D p./ for a gas. Since
experimental evidence leads us to assume that dp=d > 0, the function p D p./
can be inverted.
If we introduce the notation
Z
dp
h./ D ; (23.48)
then
dh 1
rh.p/ D rp D rp;
d
and (23.47) becomes
r.h.p/ C U.x// D 0: (23.49)
This equation shows that in any connected region occupied by fluids, the following
condition holds:
h.p/ C U.x/ D const: (23.50)
In particular, for a liquid, (23.50) becomes [see (23.48)]
p
C U.x/ D const: (23.51)
• In a fluid S , the level surfaces of the potential energy coincide with the isobars.
• Let S be a liquid, subject to its weight, on whose boundary acts a uniform
pressure p0 (for instance, atmospheric pressure). If the vertical axis Oz is
downward oriented, then the relation U.x/ D gz C const implies that the free
surface of S is a horizontal plane.
• Let S be a liquid in a cylinder subject to its weight and to a uniform pressure on
its boundary. If the cylinder containing S is rotating about a vertical axis a with
uniform angular velocity !, then its free surface is a paraboloid of rotation about
!
a. In fact, the force acting on the arbitrary particle P 2 S is given by g C ! 2 QP ,
where g is the gravity acceleration and Q the projection of P on a. It follows that
!2 2
U D gz .x C y 2 /;
2
and (23.51) proves that the free surface is a paraboloid.
• Let S be a liquid in a container subject to its weight and to atmospheric pressure
p0 on its free surface. Assume that the vertical axis Oz is downward oriented,
and choose the origin in such a way that the arbitrary constant of the potential
energy is equal to zero when z D 0. Then from (23.51) it follows that
p.z/ p0
gz D ;
i.e.,
p.z/ D p0 C gz; z > 0; (23.52)
and Stevin’s law is obtained: Pressure increases linearly with depth by an amount
equal to the weight of the liquid column acting on the unit surface.
• Let S be an ideal gas at equilibrium at constant and uniform temperature when
subject to its weight. Assuming that the vertical axis Oz is upward oriented, we
get U.z/ D gz C const, and, taking into account (23.44), (23.50) becomes
Z p
dp
R D gz;
p0 p
so that
gz
p.z/ D p0 exp : (23.53)
R
Finally, we prove an important consequence of (23.51).
Theorem 23.3 (Archimedes’ principle). The buoyant force on a body submerged
in a liquid is equal to the weight of the liquid displaced by the body.
Ce
Ci
where N is the outward unit vector normal to the body surface , i is the submerged
portion of , and e is the portion above the waterline. By adding and subtracting
on the right-hand side of (23.54) the integral of p0 N over (Fig. 23.1), (23.54)
becomes
Z Z
FD p0 N d .p0 C gz/N d; (23.55)
@Ce @Ci
Finally,
F D gVi k; (23.56)
where k is the unit vector associated with Oz. t
u
Equation (23.56) gives the resultant force acting on the body S . To complete
the equilibrium analysis, the momentum MO of pressure forces with respect to an
!
arbitrary pole O must be explored. If r D OP , then
23.7 Fundamental Theorems of Fluid Dynamics 439
Z Z
MO D p0 r N d r .p0 C gz/N d: (23.57)
e i
Again, by adding and subtracting on the right-hand side of (23.57) the integral of
p0 r N over , (23.57) becomes
Z Z
MO D p0 r N d r .p0 C gz/N d:
@Ce @Ci
Finally,
M0 D g Œx2C i x1C j Vi ; (23.58)
where i and j are orthonormal base vectors on the horizontal plane and
Z Z
x1C Vi D x1 dV; x2C Vi D x2 dV: (23.59)
Ci Ci
Expression (23.58) shows that the momentum of the pressure forces vanishes
if the line of action of the buoyant force passes through the centroid of the body.
The centroid of the displaced liquid volume is called the center of buoyancy.
In summary: A body floating in a liquid is at equilibrium if the buoyant force is
equal to its weight and the line of action of the buoyant force passes through the
centroid of the body. It can be proved that the equilibrium is stable if the center of
buoyancy is above the centroid and is unstable if the center of buoyancy is below
the centroid.
The momentum balance equation (23.40), when applied to a perfect fluid subjected
to conservative body forces, is written as
vP D r.h.p/ C U /: (23.61)
Proof. If is a material closed curve, then there exists a closed curve in C such
that is the image of under the motion equation D x. ; t/. It follows that
Z Z Z
d d @xi d @xi
vi dxi D vi dXj D vi dXj
dt dt @Xj dt @Xj
Z Z
@xi @xP i @vi
D vP i C vi dXj D vP i C vi dxj ;
@Xj @Xj @xj
t
u
Theorem 23.5 (Lagrange). If at a given instant t0 a motion is irrotational, then it
continues to be irrotational at any t > t0 , or, equivalently, vortices cannot form.
Proof. This can be regarded as a special case of Thomson’s theorem. Suppose that
in the region C0 occupied by fluid at instant t0 the condition ! D 0 holds. Stokes’s
theorem requires that
Z
0 D v ds D 0
0
for any material closed curve 0 . But Thomson’s theorem states that
D 0 for all
t > t0 , so that also !.t/ D 0 holds at any instant. t
u
Theorem 23.6 (Bernoulli). In a steady flow, along any particle path, i.e., along the
trajectory of an individual element of fluid, the quantity
1 2
H D v C h.p/ C U (23.64)
2
23.7 Fundamental Theorems of Fluid Dynamics 441
is constant. In general, the constant H changes from one streamline to another, but
if the motion is irrotational, then H is constant in time and over the whole space of
the flow field.
Proof. Recalling (23.19) and the time independence of the flow, we can write
(23.61) as
1
vP D .r v/ v C rv2 D r.h.p/ C U /: (23.65)
2
A scalar multiplication by v gives the relation
1 2
vP r v C h.p/ C U D 0;
2
which proves that (23.64) is constant along any particle path. If the steady flow is
irrotational, then (23.65) implies H D const through the flow field at any time. ut
In particular, if the fluid is incompressible, then Bernoulli’s theorem states that
in a steady flow along any particle path (or through the flow field if the flow is
irrotational) the quantity H is preserved, i.e.,
1 2 p
H D v C C U D const: (23.66)
2
The Bernoulli equation is often used in another form, obtained by dividing (23.66)
by the gravitational acceleration
hz C hp C hv D const;
where hz D U=g is the gravity head or potential head, hp D p=g is the pressure
head, and hv D v2 =2g is the velocity head.
In a steady flow, a stream tube is a tubular region † within a fluid bounded by
streamlines. We note that streamlines cannot intersect each other. Because @=@t D
0, the mass conservation (23.31) gives r .v/ D 0, and by integrating over a
volume V defined by the sections 1 and 2 of a stream tube (Fig. 23.2), we obtain
Z Z
QD v n d D v n d: (23.67)
1 2
This relation proves that the flux is constant across any section of the stream tube.
If the fluid is incompressible, then (23.67) reduces to
Z Z
v n d D v n d: (23.68)
1 2
The local angular speed ! is also called the vortex vector, and the related integral
curves are vortex lines; furthermore, a vortex tube is a surface represented by all
442 23 Introduction to Fluid Mechanics
N
v
1
vortex lines passing through the points of a (nonvortex) closed curve. Recalling
that a vector field w satisfying the condition r w D 0 is termed solenoidal and that
2r ! D r r v D 0, we conclude that the field ! is solenoidal. Therefore, vortex
lines are closed if they are limited, and they are open if unconfined. We observe that
Fig. 23.3 can also be used to represent a vortex tube if the vector v is replaced by !.
The following examples illustrate some relevant applications of Bernoulli’s
equation.
1. Consider an open vessel with an orifice at depth h from the free surface of the
fluid. Suppose that fluid is added on the top to keep the height h constant. Under
these circumstances, it can be proved that the velocity of the fluid leaving the
vessel through the orifice is equal to that of a body falling from the elevation h
with initial velocity equal to zero (this result is known as Torricelli’s theorem
because it was found long before Bernoulli’s work). Assuming that at the
free surface we have v D 0 and z D 0, it follows that H D p0 =, and
applying (23.66) we derive the relation
p0 v2 p0
H D D gh C ;
2
p
so that v D 2gh.
2. In a horizontal pipe of variable cross section, the pressure of an incompressible
fluid in steady motion decreases in a converging section.
First, the mass conservation (23.68) requires that
v1 1 D v2 2 ; (23.69)
so that the fluid velocity increases in a converging section and decreases in the
diverging section.
23.7 Fundamental Theorems of Fluid Dynamics 443
where N is the unit outward vector normal to †. The definition of a vortex tube
implies that ! is tangent to † at any point, so that the theorem is proved since
Z Z Z
! N d D ! N1 d D ! N2 d D 0;
† 1 2
where N1 is the unit vector normal to 1 , pointing toward the interior of the tube,
and N2 is the outward unit vector normal to 2 . t
u
From this theorem it also follows that the particle vorticity increases if the vortex
curves are converging.
Theorem 23.8 (Helmoltz’s second theorem). Vortex lines are material lines.
Proof. At the instant t0 D 0, the vector ! is supposed to be tangent to the surface
0 . Denote by .t/ the material surface defined by the particles lying upon 0 at the
instant t0 . We must prove that .t/ is a vortex surface at any arbitrary instant. First,
we verify that the circulation
along any closed line 0 on 0 vanishes. In fact, if
A is the portion of 0 contained in 0 , it holds that
Z Z Z
D v ds D r v N d D 2 ! N d D 0
0 A A
The theorem can also be stated by saying that vortex lines are constituted by the
same fluid particles and are transported during motion. Examples include smoke
rings, whirlwinds, and so on.
This boundary condition states that the fluid can perform any tangential motion
on a fixed surface whose unit normal is N.
The problem is then to find in C Œ0; t the fields v.x; t/ and .x; t/ that satisfy
the balance equations (23.70) and (23.71), the initial conditions (23.72), and the
boundary condition (23.73).
If the fluid is incompressible . D const/, then Eq. (23.70) becomes
1
vP D rp C b; (23.74)
while the mass conservation (23.71) leads us to the condition
r v D 0: (23.75)
The unknowns of system (23.74) and (23.75) are given by the fields v.x; t/ and
p.x; t/, and the appropriate initial and boundary conditions are
v.x; 0/ D v0 .x/ 8 x 2 C;
vN D 0 8 x 2 @C; t > 0: (23.76)
23.9 Two-Dimensional Steady Flow of a Perfect Fluid 445
v D r'; (23.79)
R
2
If C is not a simply connected region, then the condition r v D 0 does not imply that vds D
0 on any closed curve since this curve could not be the boundary of a surface contained in C .
In this case, Stokes’s theorem cannot be applied.
446 23 Introduction to Fluid Mechanics
where u and v are the components of v on x and y, and i and j are the unit vectors
of these axes.
If now C is a simply connected region of the plane Oxy, then conditions (23.78)
become
@u @v @u @v
C D 0; C D 0: (23.81)
@y @x @x @y
These conditions allow us to state that the two differential forms !1 D udx C vdy
and !2 D vdx C udy are integrable, i.e., there is a function ', called the velocity
potential or the kinetic potential, and a function , called the stream potential or
the Stokes potential, such that
From (23.78)2 it follows that the curves ' D const are at any point normal to the
velocity field. Furthermore, since r' r D 0, the curves D const are flow
lines.
It is relevant to observe that (23.82) suggest that the functions ' and satisfy
the Cauchy–Riemann conditions
@' @ @' @
D ; D ; (23.83)
@x @y @y @x
so that the complex function
is holomorphic and represents a complex potential. Then the complex potential can
be defined as a holomorphic function whose real and imaginary parts are the velocity
potential ' and the stream potential , respectively. The two functions ' and are
harmonic and the derivative of F .z/,
@' @
V F 0 .z/ D Ci D u i v D jV j ei ; (23.85)
@x @x
represents the complex velocity, with jV j being the modulus of the velocity vector
and the angle that this vector makes with the x-axis.
23.9 Two-Dimensional Steady Flow of a Perfect Fluid 447
level lines of
level lines of
v D U0 i (23.87)
is defined, where i is the unit vector of the Ox-axis. The kinetic and Stokes
potentials are ' D U0 x and D U0 y, and the curves ' D const and D const
are parallel to Oy and Ox (Fig. 23.3), respectively. This example shows that the
complex potential (23.86) can be introduced to describe a 2D uniform flow, parallel
to the wall y D 0.
Example 23.2 (Vortex potential). Let a 2D flow be defined by the complex potential
level lines of
level lines of
D i ln r D ln.x 2 C y 2 /;
2 2
and the curves ' D const are straight lines through the origin, while the curves
D const are circles whose center is the origin (Fig. 23.4).
Accordingly, the velocity components become
@'
y
sin
uD D D ;
@x 2 x 2 C y 2 2 r
@'
x
cos
vD D D :
@y 2 x 2 C y 2 2 r
It is relevant to observe that the circulation around a path bordering the origin is
given by
Z
v ds D
;
y
Q >0
level curves of
level curves of
The complex potential (23.88) can then be used with advantage to describe the
uniform 2D flow of particles rotating around an axis through the origin and normal
to the plane Oxy.
Example 23.3 (Sources and sinks). For pure radial flow in the horizontal plane, the
complex potential is taken to be
Q Q Q
F .z/ D ln z D ln rei D .ln r C i/; Q > 0;
2 2 2
so that
Q Q Q Q y
'D ln r D log.x 2 C y 2 /; D D arctan :
2 2 2 2 x
The curves ' D const are circles, whereas the curves D const are straight lines
through the origin. The velocity field is given by
Q
vD er ;
2 r
where er is the unit radial vector (Fig. 23.5). Given an arbitrary closed path around
the origin, the radial flow pattern associated with the preceding field is said to be
either a source, if
Z
v n d D Q > 0; (23.89)
or a sink, if Q < 0. The quantity Q=2 is called the strength of a source or sink.
450 23 Introduction to Fluid Mechanics
Q >0
Q <0 v
level curves of
level curves of
U0 a 2
F .z/ D U0 z C ; (23.93)
z
i.e., the flow is obtained as the superposition of a uniform flow parallel to the x-axis
(in the positive direction) and the flow due to a dipole (source–sink) system of
strength m D U0 a2 . Assuming z D x C iy, then from (23.93) it follows that
a2 a2
' D U0 x 1 C 2 ; D U 0 y 1 :
x C y2 x2 C y 2
The flow lines D const are represented in Fig. 23.7 (for U0 D a D 1/.
Note that the condition x 2 Cy 2 D a2 corresponds to the line D 0. Furthermore,
if the region internal to this circle is substituted by the cross section of a cylinder,
then (23.93) can be assumed to be the complex potential of the velocity field around
a cylinder. The components of the velocity field are given by
2 2
@' 2x y a2
uD D U0 1 a D U0 1 2 cos 2 ;
@x r4 r
2
@' a
vD D U0 2 sin 2:
@y r
It should be noted that, although singular points associated with a doublet do not
actually occur in real fluids, they are interesting because the flow pattern associated
with a doublet is a useful approximation far from singular points, and it can be
combined to advantage with other nonsingular complex potentials.
Example 23.6. All the previous examples, with the exception of Example 23.2,
refer to 2D irrotational flows whose circulation along an arbitrary closed path is
vanishing. We now consider the complex potential
U0 a 2
z
F .z/ D U0 z C Ci ln : (23.94)
z 2 a
452 23 Introduction to Fluid Mechanics
y
1.5
1
0.5
0
x
−0.5
−1
−1.5
−2 −1 0 1 2
V D D U0 .1 2 / C i :
dz z 2z
To find stagnation points (V D 0), we must solve the equation dF=dz D 0. Its roots
are given by
q
1
zvD0 D i
˙ 16U02 2 a2
2 : (23.95)
4U0
Three cases can be distinguished:
2 < 16 2 a2 U 2 ,
2 D 16 2 a2 U 2 , and
2 >
16 2 a2 U 2 , whose corresponding patterns are shown in Figs. 23.8–23.10 (assuming
U0 D a D 1/.
At this stage the reader should be aware that these examples, although of practical
relevance, are based on equations of steady-state hydrodynamics, which in some
cases give rise to inaccurate or paradoxical results. As an example, in the next
section we will analyze D’Alembert’s paradox, according to which the drag of a
fluid on an obstacle is zero. To remove this paradox, it will be necessary to improve
the model of a fluid taking into account the effects of viscosity.
23.10 D’Alembert’s Paradox and the Kutta–Joukowky Theorem 453
y
1.5
1
0.5
0
x
−0.5
−1
−1.5
−2 −1 0 1 2
y
1.5
1
0.5
0
x
−0.5
−1
−1.5
−2 −1 0 1 2
In this section, we prove that the model of a perfect liquid leads to the paradox
that a body experiences no resistance while moving through a liquid (D’Alembert’s
paradox). This paradox shows that, although the model of a perfect liquid leads
to results that are in agreement with many experimental results, in some cases it
fails. This is due to the fact that this model does not take into account the viscosity
of liquids. In other words, there are some cases where it is possible to neglect the
effects of viscosity on motion and the model of a perfect liquid can be used. In other
cases, it is fundamental to take into account these effects to obtain again agreement
between theory and experimental results.
454 23 Introduction to Fluid Mechanics
N
Σ
V
C
But the first integral is the opposite of the force F acting on S , so that the preceding
condition gives
Z
FD .pI C v ˝ v/ N d: (23.98)
@V
On the other hand, Bernoulli’s theorem (23.66), in the absence of body forces,
states that
1
p D p0 C .V 2 v2 /; (23.99)
2
where p0 and V denote, respectively, the pressure and the uniform velocity in
a liquid at infinity. We do not prove that in our hypothesis, liquid flow has the
following asymptotic behavior:
v D V C O.r 3 /; (23.100)
23.10 D’Alembert’s Paradox and the Kutta–Joukowky Theorem 455
F D 0; (23.102)
p D p0 C yV C O.r 2 /; (23.104)
2 2 r 2
where i and j are unit vectors along the Ox- and Oy-axes in the plane of the flow,
the speed V at infinity of the flow is parallel to i, and
I
D v ds
is the circulation along any curve surrounding body C . Then it is possible to prove
that, instead of (23.103), we obtain the Blausius formula
F D
V j: (23.105)
This formula shows that although a steady flow of an inviscid fluid predicts no
drag on an obstacle in the direction of relative velocity in an unperturbed region, it
can predict a force normal to this direction. This is a result obtained independently
by W.M. Kutta in 1902 and N.E. Joukowski (sometimes referred as Zoukowskii) in
1906, known as the Kutta–Joukowski theorem. Such a force is called lift, and it is
important for understanding why an airplane can fly.
Before closing this section, we observe that our inability to predict the drag of an
inviscid fluid in the direction of relative velocity does not mean we should abandon
the perfect fluid model. Viscosity plays an important role around an obstacle, but,
far from the obstacle, the motion can still be conveniently described according to
the assumption of an inviscid fluid.
456 23 Introduction to Fluid Mechanics
This section is devoted to wave propagation in perfect fluids. Waves are considered
to be small perturbations of an undisturbed state, corresponding to a homogeneous
fluid at rest. This assumption allows us to linearize the motion equations and to
apply elementary methods.3
Consider a compressible perfect fluid at rest, with uniform mass density 0 , in the
absence of body forces. Assume that motion is produced by a small perturbation, so
that it is characterized by a velocity v and a density that are only slightly different
from 0 and 0 , respectively. More precisely, such an assumption states that v and
D 0 are first-order quantities together with their first-order derivatives.
In this case, the motion equations
can be linearized. In fact, neglecting the first-order terms of jvj and , we obtain
2 2
v D a sin .n x U t/; D b sin .n x U t/:
3
For an analysis of wave propagation in nonlinear media, see [44].
23.11 Waves in Perfect Gases 457
We want a solution to the previous system with this form. To find this solution, we
note that
@v 2 2 @ 2 2
D U a cos .n x U t/; D U b cos .n x U t/;
@t @t
2 2 2 2
rv D n a cos .n x U t/; r D nb cos .n x U t/;
so that system (23.107) becomes
0 U a D p 0 .0 /bn;
U b C 0 a n D 0;
i.e.,
.p 0 .0 /n ˝ n U 2 /Ia D 0;
.p 0 .0 / U 2 b D 0:
Assuming a reference frame whose axis Ox is oriented along n, we obtain
U D 0; a ? n;
p
U D ˙ p 0 .0 /; a D an; (23.108)
The velocity U is called the sound velocity, and the ratio m D v=U is known
as the Mach number. In particular, the motion is called subsonic or supersonic,
depending on whether m < 1 or m > 1.
Chapter 24
An Introduction to Special Relativity
A relativistic theory has as its aim to show that it is possible to give the general
laws of physics a form independent of any observer, that is, of the adopted frame
of reference. Up to now, this objective has not been completely realized, even from
the general theory of relativity.1 In this chapter, a relativistic theory is intended as a
theory for which we have defined:
1. The class R of the frames of reference in which the physical laws have the same
form,
2. The transformations of space and time coordinates in going from a frame of
reference belonging to R to another one in the same class,
3. The physical laws relative to any frame of reference of R,
4. The transformation formulae of the physical quantities appearing in physical
laws upon changing the frame of reference in the class R.
For instance, in Chap. 13, we showed that Galilean relativity assumes that R is
the class of inertial frames of reference, the space-time coordinate transformations
are the Galilean ones, i.e., formulae (12.5) with a constant matrix Qij , the physical
laws are Newtonian laws, and the mass and forces are objective.
Following discovery of the Maxwell equations governing all optical and electro-
magnetic phenomena, it was quite natural to try to verify if they could be accepted
within the framework of Galilean relativity. We now verify that the Maxwell
equations do not satisfy the principle of Galilean relativity.2
1
See, for instance, [37].
2
For the topics of this chapter see [15, 22, 27, 36, 38, 39, 41, 42, 47, 48, 56].
r E D 0; r H D 0; (24.4)
provided that we suppose that the electromagnetic fields vanish before a given
instant in the past.
It is a very simple exercise to verify that, under a Galilean transformation (12.5)
(with a constant matrix Qij ), leading from inertial frame I to another inertial frame
I 0 , we have that
r D r0; (24.5)
@ @
D 0 u r 0; (24.6)
@t @t
where u is the uniform velocity of I 0 relative to I . Then, in view of (24.5) and (24.6),
under a Galilean transformation I ! I 0 , (24.1) becomes
@
C r 0 .J u/ D 0:
@t 0
0
This equation again assumes the form (24.1) in the inertial frame I if and only if
and J are transformed according to the formulae
0 D ; J0 D J u: (24.7)
@E
r 0 H D 0 0 u rE;
@t 0
@H
r 0 E D 0 0 C 0 u rH:
@t
Taking into account the vector identity
r .a b/ D b ra a rb C ar b br a;
where a and b are arbitrary regular fields, and recalling (24.7) and that the relative
velocity u is constant, the preceding equations become
@E
r 0 .H 0 u E/ D 0 ;
@t 0
@H
r 0 .E C 0 u H/ D 0 0 :
@t
It is evident that there is no transformation formula of the fields E and H for
which these equations assume the same form in the inertial frame I 0 . In conclusion,
Maxwell’s equations do not agree with Galilean relativity.
We now derive a further disconcerting consequence of Maxwell’s equations.
Applying the operator r to both sides of (24.2) and (24.3), recalling the vector
identity
r .r a/ D r.r a/ a;
where is the Laplace operator, and taking into account (24.4), we obtain the
following equations:
@2 H
H D 0 0 ;
@t 2
@2 E
E D 0 0 ;
@t 2
showing that the electric and magnetic fields propagate in a vacuum with a speed
1
cD p ;
0 0
The wave character of light propagation was established during the eighteenth
century, when scientists became convinced that all physical phenomena could
be described by mechanical models. Consequently, it appeared natural to the
researchers of that time to suppose that empty space was filled by an isotropic
and transparent medium, the ether, which supported light waves. This hypothesis
seemed to be confirmed by the circumstance that the forces acting on charges and
currents could be evaluated by supposing that electromagnetic fields generated a
deformation state of the ether, which was described by the Maxwell stress tensor.
However, this description was unacceptable for the following reasons:
• The high value of light speed required a very high density of ether.
• Such a high density implied the existence of both longitudinal and transverse
waves, whereas Maxwell’s equations showed the transverse character of electro-
magnetic waves.
Also damaging to the hypothesis of the ether as a material medium was the fact
that Maxwell’s equations were not covariant, i.e., invariant in form, under Galilean
transformations, as we saw in the previous section. Consequently, electromagnetic
phenomena did not confirm a relativity principle, so they were assumed to hold
only in one frame of reference, the optically isotropic frame. It is well known
that any attempt to localize this frame of reference failed (e.g., Michelson, Morley,
Kennedy, Fitzgerald), so that researchers were faced with a profound physical
inconsistency.
Einstein gave a brilliant and revolutionary solution to the preceding problem
by accepting the existence of an optically isotropic system and exploiting the
consequences of this assumption. More precisely, he postulated that:
There exists at least an optically isotropic frame of reference.
Owing to the preceding postulate, it is possible to find a frame of reference I in
which light propagates in empty space with constant speed along straight lines in
any direction. In particular, in this frame I it is possible to define a global time t
by choosing an arbitrary value c of the speed of light in empty space. In fact, the
time measured by a clock located at a point O 2 I will be accepted if a light signal
that is sent from O at t D t0 toward an arbitrary point P , where it is again reflected
by a mirror toward O, reaches O at the instant t D t0 C 2OP =c, for any t0 . Let us
suppose that a set of identical clocks is distributed among different points in space.
A clock at any point P can be synchronized with a clock located at the fixed point
O by sending a light signal at the instant t0 from O and requiring that it arrive at P
at the instant t D t0 C OP =c. We do not prove that the time variable defined by the
foregoing procedure is independent of the initial point O and the initial instant t0 .
In view of what was stated in Sect. 24.1, if we accept the Galilean transformations
in passing from a frame of reference I to another frame of reference I 0 , which
moves with respect to I with constant velocity u, then there exists at most one
24.2 Optical Isotropy Principle 463
Remark 24.1. It is not an easy task to prove (24.8) and (24.9) starting from the
preceding postulate. However, if we add the hypothesis that the required transfor-
mation is linear, then it becomes very simple to derive the Lorentz transformations,
as is proved in most books on special relativity.3 This assumption is equivalent to
requiring that optically isotropic frames also be inertial frames.
Remark 24.2. Let us consider any event that has constant space-time coordinates in
the frame I 0 . Then, differentiating (10.1), we obtain
j
ui dx
ıji C . 1/ 2 uj D ui : (24.11)
u dt
It is an easy exercise to verify that the preceding system admits the following
solution:
dx i
D ui ; (24.12)
dt
so that we can say that any point of I 0 moves with respect to I with constant velocity
.ui /. Therefore, the relative motion of I 0 with respect to I is a uniform translatory
motion with velocity .ui /. Similarly, we can prove that any point of I moves with
respect to I 0 with constant velocity u. In particular, xO0i denotes the coordinates of
the origin of I with respect to I 0 , when t D 0.
Remark 24.3. In the limit c ! 1, the Lorentz transformations reduce to the
Galilean ones.
3
A simple proof is shown at the end of this section.
464 24 An Introduction to Special Relativity
x 01 B
0
D D u: (24.25)
t D
Comparing (24.24) and (24.25), we have that
B D uA; D D A;
.x 1 /2 c 2 t 2 D 0; .x 01 /2 c 2 .t 0 /2 D 0 (24.28)
in the frames I and I 0 , respectively. Inserting (24.26) and (24.27) into (24.28)2, we
obtain
A2 c 2 C 2 D 1; Au C c 2 C D 0; A2 .u2 c 2 / D c 2 ;
466 24 An Introduction to Special Relativity
i.e., if
1 u
AD r ; C D ;
u2 c2
1
c2
and (24.19) and (24.20) are proved.
In this section, we show that the Lorentz transformations lead to the conclusion that
the concepts of space and time intervals are relative to the observer. Let xA01 and
xB01 > xA01 be the abscissas of two points A0 and B 0 belonging to the axis O 0 x 01 of
the inertial frame I 0 , and denote by x 01 D xB01 xA01 the distance between these
points, measured by rulers at rest relative to I 0 . The length x 01 will be called the
rest or proper length of the segment A0 B 0 . Now we define the length x 1 of the
segment A0 B 0 with respect to an inertial frame I , relative to which this segment
moves with uniform velocity u along the axis Ox 1 O 0 x 01 . The quantity x 1 is
identified by the distance between the two points A and B of the Ox 1 -axis that are
occupied by A0 and B 0 , respectively, at the same instant t in the inertial frame I .
For t constant, (24.18) gives
x 01 D x 1 > x 1 ; (24.30)
0 0
so that the length of A B relative to I is 1= times smaller than the length relative
to I . Since the dimensions of the lengths in the direction perpendicular to the
velocity do not change, a volume V is connected with the proper volume V 0 by
the relation
V 0 D V: (24.31)
Consider now a rest or proper time interval t 0 D t20 t10 evaluated by a clock C
at rest at the point xA01 of the inertial frame I 0 . To obtain the corresponding instants t1
and t2 evaluated in the inertial frame I , it is sufficient to apply the inverse of (24.21)
and remember that I moves relative to I 0 with uniform velocity u. In this way we
obtain
u2
t1 D t10 2 xA01 ;
c
u2
t2 D t20 2 xA01 :
c
Consequently, we obtain
1
t 0 D t < t; (24.32)
where t D t2 t1 . In other words, a moving clock goes slower than a clock at rest.
24.3 Simple Consequences of Lorentz’s Transformations 467
Now we present two simple but interesting applications of (24.31) and (24.32).
• Let P be an unstable particle produced in the atmosphere at a distance l from
the terrestrial surface. Denote by I 0 and I the rest inertial frame of P and a
terrestrial frame, respectively. Suppose that the mean life t 0 (evaluated in I 0 )
and the speed v of P (evaluated in I ) are such that l > vt 0 . If we assume
classical kinematics, then it is impossible for P to reach the surface of the Earth.
Conversely, resorting to (24.31) and (24.32), we can justify the arrival of P at the
terrestrial surface. In fact, for the observer at rest relative to P , the particle must
cover a shorter distance l 0 D l= < l in the time t 0 . In contrast, for an observer
at rest on the Earth’s surface, the particle P must cover the distance l but lives
longer, i.e., t= .
• Suppose that at the points xA1 < xB1 of the Ox 1 -axis of the inertial frame I
two events happen at the instants tA and tB , respectively. For an observer in the
inertial frame I 0 moving relative to I with velocity u, the same events happen at
the instants tA0 and tB0 , given by
u u
tA0 D tA 2 xA ; tB0 D tB 2 xB ;
c c
so that
xB xA
tB0 tA0 D tB tA u : (24.33)
c2
This condition shows that two events that are simultaneous for I are not
simultaneous for I 0 if xB ¤ xA . The following question arises: supposing that
tB > tA , is it possible to find an observer I 0 for which the order of the events is
inverted? This circumstance is confirmed if and only if the following condition
is satisfied:
u
tB tA < 2 .xB xA /: (24.34)
c
Because u=c 1, then also u=c 2 1=c, and the preceding condition gives
xB xA
tB tA < : (24.35)
c
This necessary condition for an inversion of the time order of two events allows
us to state that if in an inertial frame I 0 we have an inversion of the time
order, then the time interval between the two events is necessarily less than
the time taken by light to cover the distance xB xA . Conversely, from (24.35)
follows (24.34), at least for a certain value of u. We remark that (24.35) cannot
be satisfied for two events that are related to each other by a cause-and-effect
relationship. In other words, in an inertial frame, no physical perturbation
propagates with a speed greater than the light speed c.
468 24 An Introduction to Special Relativity
xP 1 u
xP 01 D ; (24.37)
uxP 1
1 2
c
1 xP 2
xP 02 D ; (24.38)
uxP 1
1 2
c
1 xP 3
xP 03 D : (24.39)
uxP 1
1 2
c
By a further derivation with respect to time, we can derive the following formulae
for acceleration in going from inertial frame I to inertial frame I 0 :
3=2
u2
1 2
c
xR 01 D 1
3 xR ; (24.40)
xP 1 u
1 2
c
0 1
u2 xP 2 u
1 2 B 2
xR 02 D c BxR C c 2 xR 1 CC; (24.41)
2 @
xP 1 u A
xP 1 u 1 2
1 2
c c
0 1
u2 xP 3 u
1 2 B 3
xR 03 D c BxR C c 2 xR 1 CC: (24.42)
2 @ xP 1 u A
xP 1 u 1 2
1 2
c c
Remark 24.4. It is evident that the preceding formulae reduce to Galilean ones
when c ! 1.
Now we present some simple consequences of the preceding formulae.
24.4 Relativistic Composition of Velocities and Accelerations 469
x x’
c
a
I I’ x x’
x x’
O O’ u
.x 1 /2 C .x 2 /2 C .x 3 /2 c 2 t 2 D 0;
.x 01 /2 C .x 02 /2 C .x 03 /2 c 2 t 02 D 0;
xP 1 D xP 2 D 0; xP 3 D c:
• Let S be a homogeneous and isotropic optical medium, and let n be its refractive
index. Suppose that S moves uniformly with a speed u along the Ox 1 -axis of the
inertial frame I and denote by I 0 the rest frame of S , with O 0 x 01 parallel to Ox 1 .
Consider a light ray a propagating along O 0 x 01 with speed c=n. Then, applying
the inverse formulae of (24.37)–(24.39), the speed of a along Ox 1 is given by
c
Cu c u
1
xP D n
u ' n C u 1 cn ;
1C
cn
that is,
c 1
xP 1 D Cu 1 2 :
n n
This result represents a simple explanation of Fizeau’s experiment, which, before
special relativity, required very difficult theoretical interpretations that had not
been experimentally confirmed. One of these attempts at interpretation required
the partial drag of the ether by a moving medium (Sect. 24.2).
same form in any inertial frame. As a consequence, two inertial observers who
repeat an experiment in the same initial and boundary conditions arrive at the same
results.
We note that the possibility to verify the covariance of the physical law (24.43) is
related to knowledge of the transformation law of the quantities A, B; : : : in going
from inertial frame I to inertial frame I 0 . In other words, from a mathematical
viewpoint, a physical law could be covariant with respect to more transformation
groups, provided that the transformed quantities A0 , B 0 ; : : : are suitably defined.
Therefore, we can establish the covariance of a physical law only by verifying
that the assumed transformation laws for the quantities A, B; : : : are experimentally
confirmed.
To make clear this aspect of the theory, we recall that in Sect. 24.1 we proved
that the continuity equation is covariant with respect to Galilean transformations
provided that the the charge density and the current density J are transformed
according to (24.7). In contrast, if we start from the special Lorentz transforma-
tions (24.18)–(24.21), instead of (24.5) and (24.6), then we derive the formulae
@ @ u @
D ; (24.45)
@x 1 @x 01 c 2 @t 0
@ @
2
D ; (24.46)
@x @x 02
@ @
3
D ; (24.47)
@x @x 03
@ @ @
D u : (24.48)
@t @t 01 @x 01
Then, under a Lorentz transformation, (24.1) becomes
@ h u 1 i @
0
2
J C 01 J 1 u
@t c @x
@ 2 @ 3
C 02 J C 03 J D 0: (24.49)
@x @x
We see that the continuity equation is covariant under Lorentz transformations if
and only if
u
0 D 2 J 1 ; J 01 D .J 1 u/; J 02 D J 2 ; J 03 D J 3 : (24.50)
c
Denoting by ak the component of vector a parallel to the relative velocity u and by
a? the component orthogonal to u, the preceding relations can also be written as
follows:
Ju
0 D 2 ; J0k D .J0k u/; J0? D J? : (24.51)
c
472 24 An Introduction to Special Relativity
We conclude that, a priori, i.e., from a mathematical point of view, the continuity
equation of charge could be covariant under both Galilean and Lorentz transforma-
tions, provided that we assume different transformation properties for the charge
density and the current vector. However, the experiment will compel us to decide
which of (24.7) and (24.50) we must choose. We emphasize that it could happen
that, still from a mathematical point of view, the considered equations would be
covariant under only one of the two transformation groups. In fact, in Sect. 24.1, we
showed that Maxwell’s equations in a vacuum, (24.2) and (24.3), are not covariant
under Galilean transformations. However, in the next section, we verify that they are
covariant under Lorentz transformations, provided that the electromagnetic fields
are transformed in a suitable way in going from an inertial frame to another inertial
frame.
@E 3 @E 2 @H 1
2
3
D 0 ;
@x @x @t
@E 1 @E 3 @H 2
D 0 ;
@x 3 @x 1 @t
@E 2 @E 1 @H 3
D 0 ;
@x 1 @x 2 @t
@H 1 @H 2 @H 3
C C D 0:
@x 1 @x 2 @x 3
Inserting (24.45)–(24.48) into the preceding system, we obtain
@E 3 @E 2 @H 1 @H 1
c D 0 C 0 u ;
@x 02 @x 03 @t 0 @x 01
@E 1 @
3 3
@ 2 uE 3
.E C 0 uH / D 0 H C ;
@x 03 @x 01 @t 0 mu0 c 2
@
2 3
@E 1 @ 3 uE 2
E 0 uH 02 D 0 0 H ;
@x 01 @x @t mu0 c 2
@H 1 u @H 2 @H 3
01
2 C 02 C 03 D 0;
@x c @x @x
24.6 Covariance of Maxwell’s Equations 473
p
where c D 1= 0 0 is the light velocity in a vacuum. Solving the fourth of
the preceding equations with respect to @H 1 =@x 01 and inserting the obtained
expression into the first equation, we have the following system:
@ h 3 i @ h i @H 1
c 02
E C 0 uH 2 03 E 2 0 uH 3 D 0 0 ;
@x @x @t
" !#
@E 1 @ h i @ uE 3
3 3 2
.E C 0 uH / D 0 0 H C ;
@x 03 @x 01 @t mu0 c 2
" !#
@ h 2 3
i @E 1 @ 3 uE 2
E 0 uH 02 D 0 0 H ;
@x 01 @x @t mu0 c 2
@H 1 u @H 2 @H 3
01
2 C 02 C 03 D 0:
@x c @x @x
By adopting the same notations of (24.51), we can state that Maxwell’s equations
in a vacuum are covariant under Lorentz transformations if the electromagnetic field
is transformed according to the following formulae:
and we can state that the total charge is invariant with respect to Lorentz transfor-
mations.
4
For the difficult topic of electromagnetism in matter, see, for instance, [27, 39, 44].
474 24 An Introduction to Special Relativity
Denote by I0 the proper frame of P at an arbitrary instant t, that is, the inertial
frame that moves relative to I with a uniform velocity equal to the velocity rP at
instant t. In the proper frame I0 , the Lorentz force reduces to the electrostatic force
F0 D qE0 (24.56)
F01 D F 1 ; (24.57)
F02 D F 2 ; (24.58)
F03 D F ;3
(24.59)
xR 01 D 3 xR 1 ; (24.61)
xR 02 2 2
D xR ; (24.62)
xR 01 2 3
D xR ; (24.63)
rR 0k D 3 rR k ; rR 0? D 2 rR ? : (24.64)
24.7 Relativistic Dynamics 475
The second assumption on which Einstein bases his relativistic dynamics is:
2. The Newtonian equation of motion is valid in the proper frame I0 , that is, we
have that
m0 rR 0 D F0 : (24.65)
Applying (24.60) and (24.64)–(24.65), we obtain the following equations in
any inertial frame I :
m0 3 rR k D Fk ; m0 rR ? D F? : (24.66)
Now we prove that the preceding equations are the projections parallel and
orthogonal to rP , respectively, of the equation
d
.mPr/ D F; (24.67)
dt
where
1
m D m0 D m0 q (24.68)
jPrj2
1 c2
Introducing the unit vector u along rP we have that rP D jPrju, rR k D jRrk ju, and
recalling (24.70), we can write (24.72) as follows:
2
d 2
. rP / D 2
jP
r j C 1 rR k D 3 rR k : (24.74)
dt k c
This equation, together with (24.73), proves the equivalence between (24.66)
and (24.67).
476 24 An Introduction to Special Relativity
d
rP .mPr/ D rP F:
dt
Taking into account (24.74), the preceding equation becomes
d d
rP .mPr/ D rP .mPr/ D m0 3 rP rR k D rP F: (24.75)
dt dt k
relating the coordinates .x ˛ / of any event in the Lorentz frame .O; e˛ / to the
coordinates .O 0 ; e0˛ / of the same event in the Lorentz frame .O 0 ; e0˛ /. The definition
of Lorentz frame requires that the matrix A˛ˇ must satisfy the condition
˛ˇ D A ˛ Aˇ : (24.81)
˛
Since in any Lorentz frame the square of the distance s between two events x.1/
˛
and x.2/ assumes the form
ˇ ˇ
s 2 D ˛ˇ x.2/
˛ ˛
x.1/ x.2/ x.1/
3
X 2 2
i i 4 4
D x.2/ x.1/ x.2/ x.1/ ; (24.82)
i D1
4-vector. At any point O 2 V4 , the set of the events P 2 V4 such that the 4-vectors
!
OP D v are null, i.e., the set of the 4-vectors for which
! !
OP OP D 0; (24.83)
!
The 4-vectors v D OP , corresponding to events P , which are internal to CO , are
timelike 4-vectors, whereas those corresponding to events P , which are external to
CO , are spacelike 4-vectors.
Let us fix an arbitrary timelike 4-vector eO , and let .O; eO / be a uniform vector field
of V4 . We say that, at any O 2 V4 , the 4-vector eO defines the direction of the future
at O. Moreover, the internal region COC of CO , to which eO belongs, is said to be
the future of O, whereas the remaining internal region CO of CO is the past of O.
Finally, the set of events external to CO is defined as the present of O.
The first three axes of a Lorentz frame .O; e/ are spacelike 4-vectors, whereas
the fourth axis e4 is a timelike 4-vector. Moreover, let ¢O;e4 be the three-dimensional
space formed by all the 4-vectors generated by linear combinations of the three
4-vectors e1 ; e2 ; e3 . It is easy to verify that ¢O;e4 is a properly Euclidean space, with
respect to the scalar product g, and that its elements are spacelike 4-vectors.
Before making clear the physical meaning of all the preceding definitions, we
must prove the following propositions.
Proposition 24.1. Let u be any timelike 4-vector, and let us denote by ¢O;u the
three-dimensional space of all 4-vectors that are orthogonal to u. Then, any v 2 ¢O;u
is a spacelike 4-vector.
Proof. Let .O; e˛ / be a Lorentz frame, and let u D ui ei C u4 e4 u? C 4
Pu3 e4i be the
decomposition of the 4-vector u in the basis .e˛ /. Since u? u? D 1 .u /2 , the
4-vector u? is spacelike. With the same notation, we set v D v? C v4 e4 , so that the
following relations hold:
u u D u? u? .u4 /2 < 0;
u v D u? v? u4 v4 D 0;
v v D v? v? .v4 /2 :
Consequently,
.u? v? /2
v v D v? v? ;
.u4 /2
24.8 Minkowski’s Space-Time 479
.u? u? /.v? v? /
v v v? v? :
.u4 /2
We must now prove that there exists at least a 4-vector u such that
.u? v? /2
u u D u? u? :
.v4 /2
Again by applying Schwartz’s inequality, we can write the preceding equation as
v? v?
u u D u? u? 1 : (24.88)
.v4 /2
Finally, since u? u? > 0, from (24.86) and (24.88) we have that u u < 0. t
u
480 24 An Introduction to Special Relativity
Remark 24.6. For any spacelike vector v at event O, we can find infinite Lorentz
frames .O; e˛ / for which e1 D v=jvj. In fact, it is sufficient to take one of the
Ptimelike 4-vectors e4 orthogonal to v and choose in the three-dimensional
existing
space ¢O;e4 to which v belongs two other unit vectors that are orthogonal to each
other and orthogonal to v.
Let us suppose that we have introduced a direction of the future by the uniform
O so that it is possible to define the cones CO and COC at any event O.
4-vector u,
Proposition 24.3. Let u 2 COC . Then, v 2 COC if and only if
u v < 0: (24.89)
Proof. In fact, uu < 0 since u 2 COC . But uw is a continuous function of w 2 COC ,
which is a connected set. Consequently, if, for a 4-vector v 2 COC , we had u v > 0,
then there would exist a timelike 4-vector u? 2 COC such that u u? D 0. But this is
impossible for Proposition 24.1.
Conversely, owing to the remark about Proposition 24.1, we can find a Lorentz
frame at O having e4 D u=juj as a timelike axis. In this frame, we have that u u D
u4 v4 . On the other hand, since u; u 2 COC , we have u4 > 0 and v4 > 0, and the
proposition is proved. t
u
We denote by LC O the set of Lorentz frames .O; e˛ / at O whose axes e4
belong to the positive cone COC . The Lorentz transformation between two Lorentz
frames in LCO is said to be orthochronous. It is evident that the totality of these
transformations is a group.
Proposition 24.4. Let .O; e˛ / 2 LC O be a Lorentz frame. Then the Lorentz frame
.O; e0˛ / belongs to LC
O if and only if
Proof. Since e04 D A˛4 e˛ , we have that e04 e4 D A44 . The proposition is proved when
we take into account Proposition 24.3. t
u
3
X
s2 .xAi xBi /2 c 2 .tA tB /2
i D1
3
X
D .xA0i xB0i /2 c 2 .tA0 tB0 /2 ; (24.91)
i D1
relating the space-time coordinates .xAi ; tA / and .xBi ; tB / of event A in the Lorentz
frame I and the corresponding coordinates .xAi 0 ; tA0 /, .xBi 0 ; tB0 / of the same events
in I 0
Moreover, from (24.21) we obtain that
dt 0
D > 0: (24.92)
dt
Since
A44 D > 0; (24.95)
the Lorentz frame .O 0
; e0˛ /
belongs to LC
O.
Conversely, let us assign the Lorentz frame .O 0 ; e0˛ / 2 LCO by (24.80), where
A44 > 0. To determine the corresponding inertial frame I 0 , we must determine
the right-hand sides of (24.93) and (24.94), i.e., the quantities t0 ; ui ; Qji , and xOi 0 .
Owing to the orthogonality of the matrix .Qji /, we must evaluate ten quantities
starting from the four coordinates .xO˛ 0 / and the six independent coefficients A˛ˇ
[see (24.81)].
482 24 An Introduction to Special Relativity
The preceding considerations allow us to make clear the physical meaning of the
definitions we gave in the preceding section. In fact, to any point P 2 V4 , with
coordinates .x ˛ / in a Lorentz frame .O; e˛ / 2 LC O , there corresponds an event
that has coordinates .x i ; x 4 =c/ in the inertial frame I . In particular, all the events
belonging to the light cone CO have coordinates verifying (24.84). To these points
of V4 correspond all the events whose coordinates in the inertial frame I satisfy the
equation
3
X
.x i /2 c 2 t 2 r 2 c 2 t 2 D 0 (24.98)
i D1
of this approach stating that the Newtonian equations must be substituted by the
following ones:
d
.mv/ D F; (24.99)
dt
d
.mc 2 / D F rP ; (24.100)
dt
where
m0
mD q (24.101)
jPrj2
1 c2
is the relativistic mass of P , m0 its rest mass, and rP the velocity of P . In this section,
we recall the four-dimensional formulation of (24.99), (24.100) in the space-time V4 .
Let
x i D x i .t/ (24.102)
be the equation of motion of a particle with respect to an inertial frame I . To the
trajectory (24.102) we can associate a curve of V4 , which in the Lorentz frame
T .O; e˛ / corresponding to I has equations
and it is negative since the velocity of P in any inertial frame is less than the velocity
c of light in a vacuum. In other words, the curve is timelike and its tangent vector
at any point P is contained in the map COC of the light cone at P .
Let x i .t / be the position of P at the instant t in the inertial frame I . We recall
that the rest frame or proper frame I of P at the instant t is an inertial frame that has
its origin at x i .t/ and moves with velocity v with respect to I . The corresponding
Lorentz frame ..x i .t /; ct /; e0 / T has a time axis e4 tangent to at the point x i .t /
since in T the vector e4 must have components .0; 0; 0; 1/. In going from T to T ,
the infinitesimal distance between two events on is invariant, so that we have
3
!
X
2
ds D .xP / c dt 2 D c 2 d 2 ;
i 2 2
(24.105)
i D1
484 24 An Introduction to Special Relativity
where is the proper time, i.e., the time evaluated by observer I.t/. From (24.105)
we derive the relation
dt
D : (24.106)
d
If we adopt this time along , then the parametric equations (24.103) become
x ˛ D x ˛ ./: (24.107)
dx ˛
U˛ D ; (24.108)
d
which, in view of (24.106), has the following components in the Lorentz frame I :
U ˛ D . rP ; c/: (24.109)
Moreover,
U ˛ U˛ D 2 .jPrj2 c 2 / D c 2 < 0: (24.110)
It is a simple exercise to verify that we can write (24.99) and (24.100) in the
covariant form
dU ˛
m0 D ¥˛ ; (24.111)
d
where the 4-force is given by
˛ F rP
.¥ / D F; : (24.112)
c
Let S be a continuous system of moving charges, and denote by and rP the charge
density and the velocity field of S , respectively. If S moves in a vacuum, the
continuity equation (24.1) becomes
@
C r .Pr/ D 0: (24.113)
@t
Consider a transformation (24.80) from a Lorentz frame I to another one I 0 , and
suppose that the quantities
˛ Pr
J D ; (24.114)
c
24.11 Tensor Formulation of Electromagnetism in Vacuum 485
@F ˛ˇ
D 0; (24.117)
@x ˇ
where
0 1
0 H 3 H 2 cE 1
B H 3 0 H 1 cE 2 C
F ˛ˇ DB C
@ H 2 H 1 0 cE 3 A : (24.118)
cE 1 cE 2 cE 3 0
Consequently, if F ˛ˇ are supposed to be the components of a (skew-symmetric)
2-tensor, then (24.117) has the same form in any Lorentz frame and the relativity
principle is satisfied by (24.1) and (24.4)2. F ˛ˇ is called an electromagnetic tensor.
If we introduce the adjoint tensor of F ˛ˇ
1
F ?˛ˇ D ˛ˇ F ; (24.119)
2
which is obtained from F ˛ˇ with the substitution cE ! H, then Maxwell’s
equations (24.2) and (24.4)1 assume the covariant form
@F ?˛ˇ
D 0: (24.120)
@x ˇ
F 0˛ˇ D A˛ Aˇ F
and (24.97) we obtain (24.52) and (24.53).
Exercise 24.3. Verify that the components of the Lorentz force (24.55) are equal to
the first three components of the 4-vector
q ˛ˇ
F˛ D F Uˇ :
c
Appendix A
First-Order PDE
In this appendix, we give a sketch of the method proposed by Monge, Ampere, and
Cauchy to reduce the integration of a first-order PDE to the integration of a system
of ordinary equations.
Let F .x; u; p/ be a function of class C 2 .<2nC1 / verifying the following condi-
tions:
˚
Pn set F2 D .x; u; p/ 2 < ; F .x; u; p/ D 0 is not empty.
2nC1
1. The
2. i D1 Fpi ¤ 0; 8.x; u; p/ 2 F:
Under these hypotheses, the following general first-order PDE:
defines a set …0 of planes containing .x0 ; u0 / and tangent to the integral surfaces of
(A.1), to which .x0 ; u0 / belongs. Owing to conditions 1 and 2, …0 is not empty, and
we can suppose that Fpn .x0 ; u0 ; p0 / ¤ 0. In turn, this result implies that (A.2), at
least locally, can be written in the form
pn D pn .x0 ; u0 ; p1 ; : : : ; pn1 /:
X
n1
f .X; U; p1 ; : : : ; pn1 / p˛ .X˛ x0˛ / C pn .p˛ /.U u0 / D 0; (A.3)
˛D1
Fpi
Xi x0i D .U u0 /; i D 1; : : : ; n: (A.6)
p Fp
charact. direction
xn
u
charact. strip
xn
charact. curve
dx
D Fp ; (A.9)
ds
dp
D .Fx C Fu p/; (A.10)
ds
du
D p Fp ; (A.11)
ds
Theorem A.2.
F .x.s/; u.s/; p.s// D const: (A.12)
Remark A.1. If Eq. (A.1) is quasilinear
Denoting by t the variable xn , by p the derivative Fut , by x the vector .x1 ; : : : ; xn1 /,
and by p the vector .p1 ; : : : ; pn1 /, the preceding equation becomes
F .x; t; p; p/ D 0: (A.17)
p C H.x; t; p/ D 0; (A.18)
dx
D Hp ; (A.19)
ds
dp
D Hx ; (A.20)
ds
du
D p Hp C p; (A.21)
ds
dt
D 1; (A.22)
ds
dp
D Ht : (A.23)
ds
492 A First-Order PDE
Owing to (A.22), we can identify s with t and the preceding system becomes
dx
D Hp ; (A.24)
dt
dp
D Hx ; (A.25)
dt
du
D p Hp H; (A.26)
dt
dp
D Ht : (A.27)
ds
We note that (A.24) and (A.25) are Hamiltonian equations that can be solved
independently of the remaining ones.
u
charact. strip
xn
@x0 @u0
p0 D 0; (A.30)
@v˛ @v˛
F .x0 .v˛ /; u0 .v˛ /; p0 .v˛ // D 0; (A.31)
˛ D 1; : : : ; n 1.
If .x0 .v˛ /; u.v˛ /; p0 .v˛ // 2 verifies the preceding system and the condition
@x0
J D det Fp ¤ 0; (A.32)
@v˛
is satisfied at this point, then system (A.30), (A.31) can be solved with respect to
p0 .v˛ / in a neighborhood of the point .x0 .v˛ /; u.v˛ /; p0 .v˛ //. In this way we obtain
the initial data for (A.9)–(A.11).
The proof of the following theorem can be found in any book on PDEs:
Theorem A.3. If it is possible to complete the initial datum (A.29) by solving, with
respect to p0 .v˛ /, system (A.30), (A.31); then one and only one solution exists of the
Cauchy problem.
Appendix B
Fourier Series
where
Z T
1
a0 D f .f / dt; (B.3)
T T
Z T
1 k t
ak D f .t/ cos dt; (B.4)
T T T
Z T
1 k t
bk D f .t/ sin dt (B.5)
T T T
f .c C / f .c /
(B.6)
2
at any discontinuity point c.
F t
ft
t
T T
F t
ft
T T
and it is odd if
f .t/ D f .t/; (B.8)
8t 2 <.
Remark B.1. If the function f .t/ is even, then the Fourier coefficients bk vanish for
any integer k. In fact, it is sufficient to decompose integral (B.5) into the sum of
the two integrals over the intervals .T; 0/ and .0; T / and recall that sin t is an odd
function. Similarly, if f .t/ is odd, then all the Fourier coefficients ak vanish.
Remark B.2. A Fourier series can be extended to any function f .t/ that in a
bounded interval Œ0; T is continuous with its first derivative almost everywhere in
Œ0; T , provided that the discontinuities of f .t/ and f 0 .t/ are finite. In fact, it is
sufficient to apply series (B.2) to one of the following periodic functions that extend
the function f .t/ in the interval ŒT; T as follows (Figs. B.1 and B.2):
B Fourier Series 497
f .t/; t 2 Œ0; T ;
F1 .t/ D
f .t/; t 2 ŒT; 0;
f .t/; t 2 Œ0; T ;
F2 .t/ D
f .t/; t 2 ŒT; 0:
there follows
1
X 1
X
k t k t k t
Ak ei T D A0 C Ak ei T C Ak ei T
kD1 kD1
1
X
k t k t
D A0 C Ak cos C i sin
T T
kD1
k t k t
C Ak cos i sin
T T
1
X k t
D A0 C .Ak C Ak / cos
T
kD1
k t
C i .Ak Ak / sin :
T
This result is identical to (B.2) if we set
Ak C Ak D ak ; (B.10)
Ak iAk D i bk ; (B.11)
where
Z T
1 k t
Ak D f .t/ei T dt; (B.16)
2T T
Z T
1 k t
Ak D f .t/ei T dt D Ak : (B.17)
2T T
The complex form (B.15) can be at once extended to the case of a function
f .t1 ; : : : ; tn / of n real variables .t1 ; : : : ; tn /, which is periodic with the same
period T with respect to each variable. In fact, since f .t1 ; : : : ; tn / is periodic with
respect to t1 , we have that
1
X k1 t1
.1/
f .t1 ; : : : ; tn / D Ak1 .t2 ; : : : ; tn /ei T ;
k1 D1
.1/
where, in view of (B.16) and (B.17), the functions Ak1 .t2 ; : : : ; tn / are periodic in
the variables t2 ; : : : ; tn . Applying (B.15) to these functions, we obtain
1
X 1
X k1 t1 k2 t2
.2/
f .t1 ; : : : ; tn / D Ak1 k2 .t3 ; : : : ; tn /ei T ei T :
k2 D1 k1 D1
1. Abraham, R., Marsden, J.E.: Foundation of Mechanics. Benjamin Cummings, Reading, Mass.
(1978)
2. Arnold, V.L.: Equazioni Differenziali Ordinarie. Mir, Moscow (1979)
3. Arnold, V.L.: Metodi Matematici della Meccanica Classica. Editori Riuniti, Rome (1979)
4. Barger, V., Olsson, M.: Classical Mechanics: A Modern Perspective. McGraw-Hill, New York
(1995)
5. Bellomo, N., Preziosi, L., Romano, A.: Mechanics and Dynamical Systems with Mathematica.
Birkhäuser, Basel (2000)
6. Birkhoff, G., Rota, G.C.: Ordinary Differential Equations. Wiley, New York (1989)
7. Bishop, R.L., Goldberg, S.I.: Tensor Analysis on Manifolds. MacMillan, New York (1968)
8. Boothby, W.M.: An Introduction to Differential Manifolds and Riemannian Geometry.
Academic, San Diego (1975)
9. Borrelli, R.L., Coleman, C.S.: Differential Equations: A Modeling Approach. Prentice-Hall,
Englewood Cliffs, NJ (1987)
10. Cercignani, C., Illner, R., Pulvirenti, M.: The Mathematical Theory of Diluite gases. Springer,
Berlin Heidelberg New York (1994)
11. Choquet-Bruhat, Y.: Gèometrie Diffèrentielle et Systèmes Extèrieurs. Dunod, Paris (1968)
12. Chow, T.L.: Classical Mechanics. Wiley, New York (1995)
13. Crampin, M., Pirani, F.: Applicable Differential Geometry. Cambridge University Press,
Cambridge (1968)
14. de Carmo, Manfredo P.: Differential Geometry of Curves and Surfaces. Prentice-Hall, Engle-
wood Cliffs, NJ (1976)
15. Dixon, W.G.: Special Relativity: The Foundations of Macroscopic Physics. Cambridge
University Press, Cambridge (1978)
16. Dubrovin, B.A., Novikov, S.P., Fomenko, A.T.: Geometria delle Superfici, dei Gruppi di
Trasformazioni e dei Campi. Editori Riuniti, Rome (1987)
17. Dubrovin, B.A., Novikov, S.P., Fomenko, A.T.: Geometria e Topologia delle Varietà. Editori
Riuniti, Rome (1988)
18. Dubrovin, B.A., Novikov, S.P., Fomenko, A.T.: Geometria Contemporanea. Editori Riuniti,
Rome (1989)
19. Fasano, A., Marmi, S.: Meccanica Analitica. Bollati Boringhieri, Turin (1994)
20. Fawles, G., Cassidy, G.: Analytical Dynamics. Saunders, Ft. Worth, TX (1999)
21. Flanders, H.: Differential Forms with Applications to the Physical Sciences. Academic,
New York (1963)
22. Fock, V.: The Theory of Space, Time and Gravitation, 2nd edn. Pergamon, Oxford (1964)
23. Goldstein, H.: Classical Mechanics, 2nd edn. Addison-Wesley, Reading, MA (1981)
24. Goldstein, H., Poole, C., Safko, J.: Classical Mechanics, 3rd edn. Addison Wesley, Reading,
MA (2002)
25. Hale, J., Koçak, H.: Dynamics and Bifurcations. Springer, Berlin Heidelberg New York (1991)
26. Hill, R.: Principles of Dynamics. Pergamon, Oxford (1964)
27. Hutter, K., van der Ven, A.A., Ursescu, A.: Electromagnetic Field Matter Interactions in
Thermoelastic Solids and Viscous Fluids. Springer, Berlin Heidelberg New York (2006)
28. Jackson, E.A.: Perspectives of Nonlinear Dynamic, vol. I. Cambrige University Press,
Cambridge (1990)
29. Jackson, E.A:, Perspectives of Nonlinear Dynamic, vol. II. Cambrige University Press,
Cambridge (1990)
30. Josè, J.V., Saletan, E.: Classical Dynamics: A Contemporary Approach. Cambridge University
Press, Cambridge (1998)
31. Khinchin, A.I.: Mathematical Foundations of Statistical Mechanics. Dover, New York (1949)
32. Landau, L., Lifshitz, E.: Mechanics, 3rd edn. Pergamon, Oxford (1976)
33. MacMillan, W.D.: Dynamics of Rigid Bodies. Dover, New York (1936)
34. Marasco, A.: Lindstedt-Poincarè method and mathematica applied to the motion of a solid with
a fixed point. Int. J. Comput. Math. Appl. 40, 333 (2000)
35. Marasco, A., Romano, A.: Scientific Computing with Mathematica. Birkhauser, Basel (2001)
36. Möller, C.: The Theory of Relativity, 2nd edn. Clarendon, Gloucestershire, UK (1972)
37. Norton, J.D.: General covariance and the foundations of general relativity: eight decades of
dispute. Rep. Prog. Phys. 56, 791–858 (1993)
38. Panowsky, W., Phyllips, M.: Classical Electricity and Magnetism. Addison-Wesley, Reading,
MA (1962)
39. Penfield, P., Haus, H.: Electrodynamics of Moving Media. MIT Press, Cambridge, MA (1967)
40. Petrovski, I.G.: Ordinary Differential Equations. Prentice-Hall, Englewood Cliffs, NJ (1966)
41. Resnick, R.: Introduction to Special Relativity. Wiley, New York (1971)
42. Rindler, W.: Introduction to Special Relativity. Clarendon Press, Gloucestershire, UK (1991)
43. Romano, A., Lancellotta, R., Marasco, A.: Continuum Mechanics using Mathematica,
Fundamentals, Applications, and Scientific Computing. Birkhauser, Basel (2006)
44. Romano, A., Marasco, A.: Continuum Mechanics, Advanced Topics and Research Trends.
Birkhauser, Basel (2010)
45. Saletan, E.J., Cromer, A.H.: Theoretical Mechanics. Wiley, New York (1971)
46. Scheck, F.: Mechanics: From Newton’s Laws to Deterministic Chaos. Springer, Berlin
Heidelberg New York (1990)
47. Stratton, J.A.: Electromagnetic Theory. McGraw-Hill, New York (1952)
48. Synge, J.: Relativity: The Special Theory, 2nd edn. North-Holland, Amsterdam (1964) (Reprint
1972)
49. Synge, J.L., Griffith, B.A.: Principles of Mechanics. McGraw-Hill, New York (1959)
50. Thomson, C.J.: Mathematical Statistical Mechanics. Princeton University Press, Princeton,
NJ (1972)
51. Truesdell, C., Noll, W.: The Nonlinear Field Theories of Mechanics. Handbuch der Physik,
vol. III/3. Springer, Berlin Heidelberg New York (1965)
52. von Westenholz, C.: Differential Forms in Mathematical Physics. North-Holland, New York
(1981)
53. Wang, K.: Statistical Mechanics, 2nd edn. Wiley, New York (1987)
54. Whittaker, E.T.: A Treatise on the Analytical Dynamics of Particles and Rigid Bodies.
Cambridge University Press, Cambridge (1989)
55. Zhang, W-B.: Differential Equations, Bifurcations, and Chaos in Economy. World Scientific,
Singapore (2005)
56. Zhong Zhang, Y.: Special Relativity and Its Experimental Foundations. Advanced Series on
Theoretical Physical Science, vol. 4. World Scientific, Singapore (1997)
Index
K M
König’s theorem, 208 Mach number, 457
Kepler’s laws, 225 manifold, 68, 69
kernel, 12 mass, 203
kinetic energy, 208, 249 mass conservation principle, 433
kinetic field, 428 mass density, 247,
kinetic potential, 432 433
kinetic theory, 404 mass forces, 256
Kutta–Joukowsky theorem, 455 material point, 197
material volume, 432
matrix of the linear map, 11
L maximal atlas, 69
lab frame, 179 maximal solution, 139
Lagrange bracket, 340 mechanical determinism, 204
Lagrange equations, 300 metric tensor, 83
Lagrange function, 301 Minkowski’s inequality, 47
Lagrange’s multipliers, 311 mixed 2-tensor, 23
Lagrange’s theorem, 440 modulus, 49
Lagrangian components, 300 molecular chaos, 404
Lagrangian components of the impulses, 421 moment of inertia, 254
Lagrangian coordinates, 289 Monge’s cone, 488
Lagrangian equations of impulsive dynamics, monomorphism, 12
421 morphism, 11
Lagrangian form, 427 Mozzi’s axis, 181
Lagrangian velocities, 291 Mozzi’s theorem, 181
Laplace operator, 131
Laplace’s equation, 432, 446
Legendre transformation, 324, 326 N
length, 49, 84 natural atlas, 337
level manifold, 377 natural basis, 74
Levi–Civita symbol, 64 natural charts, 337
Lie derivative, 98 natural coordinates, 81
Lie’s algebra, 9 Newtonian force, 215
lift, 455 Newtonian model, 197
light cone, 478 nondegenerate .0; 2/-tensor, 45
line of nodes, 185 nondegenerate Hamiltonian, 401
linear form, 17 nonholonomic constraints, 292
linear map, 11 normal frequencies, 316
linear momentum, 205, 248 normal modes, 316
linearly dependent, 5 normal speed, 372
linearly independent, 5 null four-vector, 478
Lipschitz condition, 139 nutation, 275
liquid, 435
local time, 161
locally Hamiltonian vector field, 345 O
Lorentz frame, 477 objective function, 201
Lorentz transformation, 477 objective tensor, 200
Lorentz transformation without rotation, odd function, 496
464 odd permutation, 33
Lorentz’s force, 474 one-parameter group of diffeomorphisms, 95
Index 505