VP LectureNotes 2010
VP LectureNotes 2010
, given by
, and the minimum and maximum values of .
In problem 2 the area is
and the length is
is a functional, i.e. a function on a space of functions.
This class of problems is called the calculus of variations.
0.2 Direct method
Problem 3
Show that there exists such that
, for any
.
Find such that
. By the
intermediate value theorem, there exists a solution .
Alternative method;
Define
Claim; such that
.
Proof of claim.
Step 1;
. On the
other hand, let
; then
But
So if . This shows the minimum value is attained inside
if anywhere.
Step 2; By Analysis I, with
.
This is called the direct method for variational problems.
1. Functions on
1.1 Functions
Let
Then
is linear if for ,
. Also
where
.
Definition 1.1.1. A function
.
i.e. , such that
Relation with difference quotients (
,
In Definition 1.1.1, put
exists and is finite, and is equal to
because linear.
Proposition 1.1.2. (i) If is differentiable at , then the partial derivatives
exist and the linear map (in definition 1.1.1) is given by
(ii) If all partial derivatives exist
, then is differentiable
at each
and is given by
We will define
.
The definition of differentiability can now be written as
In Analysis II, youll see a proof that
.
Lemma 1.1.3. If
, then . Also, if
, then
. This is a 1
st
order necessary condition for max or min.
Remark. This is also true if it holds only for in some ball around , i.e.
.
Terminology. If then is a stationary or critical point. If is a maximum or minimum
then it is an extremum (or extreme value).
1.2 Second-order conditions for extreme values
Definition. A symmetric matrix
(positive) if
for all
non-
zero. Similarly,
(non-negative) if
for all
non-zero.
Exercise. Show that if a symmetric matrix is , then all its eigenvalues are , and the same
with signs.
Theorem 1.2.1. If
and , then
(i) is a local minimum
(ii)
is a strict local
maximum. Strict means that, around the point which is a minimum, the function is strictly greater
than at (or smaller, if is a maximum).
Imagine
. Then
and
.
Assume
. Is
a global minimum?
Yes - by Rolles theorem, because otherwise,
with
with
, , there are functions with only one stationary point which is a strict local minimum
but not a global minimum.
Exercise. Find such a function for .
1.3 Convexity
Definition. A set
, we have
.
A function
is convex if
i.e. the line between two points on the curve lies above the curve.
Remark. Define the epigraph
Then
convex convex.
Proof (). Let
. Let
and
be in
, so
and
. For
Hence,
as was to be proved.
Proof (). Exercise.
Remark. Since the definition of convexity involves only line segment joining two points, to check
a function is convex you can check the functions are convex .
Proposition 1.3.1. If
By the chain rule,
, which is (ii).
Proof that (ii) (i). Let and .
Then
Choosing gives the definition of convexity.
Proof that (i) (iii). We have and
. Adding, as
required.
Proof that (iii) (i). Exercise.
To visualise the second condition, work in
.
is graph of
Normal vector
Tangent plane at is
To have a picture for (iii) consider ;
This says that
is strictly convex if
whenever and .
Proposition 1.3.2. If
, the
following are equivalent;
(i) strictly convex
(ii) for
(iii) for
Corollary to 1.3.1. If
then
(i)
(ii)
Exercise. Think of an example to show that the converse of (ii) is false.
Proof of (i) ().
Note this is using the chain rule in the form
Warning when we say a symmetric matrix
is , we mean
Example. Show that
is a concave function on the set of all
with
and
(a
probability distribution on ).
Note that if are both probability distributions on this set of points, then so is
, since
.
Calculate the matrix of second partial derivatives;
since
This is a negative definite matrix. QED.
1.4 Constraints and Lagrange multipliers
Example. Maximise
Answer;
, attained at .
But notice is perpendicular to (unit circle) at these points. To see why, solve the problem
by parametrising as . Then
At maximum values we must have . Apply the chain rule;
Since
. If we find a point
where a
function has
a maximum/minimum on , then
Then
, so
Alternatively,
the Lagrange multiplier. Use function .
Example. Find the rectangle in the plane inscribed in the unit circle
which has the largest possible area.
Area .
Augmented function
Assume neither nor are , then
i.e. stationary point is a square.
If say , we just have a line inside the circle, so and this is a minimum value of
amongst inscribed rectangles.
For
, recall that
Example.
To search for stationary points of a function
constrained to , we look at
written as
.
First-order necessay condition for stationary point;
where the
Assume
. Then
But
using the summation convention. Thus
Theorem 1.4.1. Let be
has maximum at
, then
If
has minimum, then the second inequality has a . Here we use the notion of a symmetric
matrix being defined above.
Moral do usual calculus on .
To solve problems with two or more constraints,
Examples. (i) Find the probability distribution
on which maximises
Constraint is
, so
, so
(ii) For
. satisfies
, so
Then
.
Using
, we get
As always,
.
In vector form we have
Notice
But . Since
and
. As we proved, we need to look at
We want to check this is a negative definite matrix at the solution
so the matrix of 2
nd
partial derivatives is
Find eigenvalues of , i.e.
which satisfies the necessary 2
nd
order condition for being a maximum.
1.5 The Legendre transform
For , given a
by
This is only defined for such that this is finite.
Examples. (i) Let
, , then
At that point
Now define
If is convex, then
.
(ii)
, , then
So
So if
, and
Theorem 1.5.2. If is convex, then
.
Proof. Prove for
with
. Then
is attained for all at unique
such that
i.e.
. QED.
Definition. An affine function is one of the form
Corollary. If convex, then it is a supremum of a family of affine
functions.
Example. Consider
. Then
Domain of is ; supremum is attained when
Then
(Youngs inequality). In fact we always have .
For
by
Exercise. Calculate the Legendre transform of
where
Hamiltonian
To calculate; supremum attained when
Exercise. Newton equations are equivalent to
(ii) Economics. A company buys imports
be the price of a unit of the th good. Then the total cost is and the profit
is . To maximise profit, they try to choose to get
(iii) Thermodynamics. An engine has internal energy where is the volume and
is the entropy. In idealised reversible change, the heat flow . Also
where is the temperature and the pressure. Then
(Maxwell relations).
Suppose the system is immersed in a constant temperature reservoir. In this case the system is
described by a function . This is called the Helmholtz free energy, and is defined by
The supremum is attained at such that
This defines , and we can substitute in to get
In Helmholtz description
(Maxwell relation).
Remark. The entropy is determined implicitly by
This determines uniquely as long as
Constant volume heat capacity
So is a convex function of you need heat to raise the temperature.
2. Calculus of variations
2.1 Differentiating functionals
A functional is a function any space of functions or .
Examples. (i) Let and consider
Recall. If
We will apply this to the functionals;
To define something analogous to , we use
or for complex functions we will often use
Now
when such a function exists. Lets try to apply this to
In Methods we will write
Example.
because
Therefore
Example 3. Let . Then
Now . When we calculate a derivative, we consider smooth with ,
and look at
since vanishes at the end points of the integral; . So
using the notation e.g.
Problem. Let . Minimise
amongst all smooth -periodic functions .
Solution. Assume such a minimising function does exist. Obtain a condition which determines it.
Well then show that there exists a function satisfying this condition, then finally well show that this
function minimises .
Assume
is a solution, then
, i.e.
.
Exercise. Show that
So if
Lemma 2.1.1. If (i)
for all smooth functions , with if is outside some sub-
interval ;
(ii)
then in .
Proof. Assume, to the contrary, that
with
.
Without loss of generality, take ; since is continuous,
such that on
.
Consider the function
Now
, because
.
Consider
Then
which is a contradiction.
Exercise. Find
.
Jargon.
We define closure of set where .
We say is properly supported in or if .
Problem 1. Minimise
over
, then
.
Therefore
for all and
By general formula ( ),
Therefore
By Lemma 2.1.1,
, i.e. if
periodic,
Remark. To prove there is only one -periodic solution of
, imagine is
another, where
with
. Subtract
.
Multiply by and integrate;
by parts. Middle term is by periodicity and then we get , i.e.
.
Final stage of minimisation problem; consider
Integrating the middle term by parts we get
for any
, . So
We need to minimise amongst such curves.
Assume
. Now
The Euler-Lagrange equation is therefore
constantm so
. Fix so that
and
. This is our
candidate minimising curve.
Claim that
is strictly convex
By lemma 1.3.2,
. Now
which if and . So if is another -curve joining to it has
greater length than the straight line.
2.3 Multi-dimensional integrals
Moral. Integration by parts in one dimension converts to using Green identities in more than
one dimension.
Consider
where some region in
. Then
To achieve this we recall
Assume vanishes on , then
i.e.
but so we get Poissons equation;
A minimising will be a stationary point, so .
Note that the integrand
already, and then choose such that .
Notice that if is independent of then is constant, by the Euler-Lagrange equation.
Concepts in theoretical physics; an action in
with
and
. Writing so that
, the Euler-
Lagrange equation becomes
using the summation convention.
Derivation. Let be a smooth variation function, with near the boundary of . Then
since near the boundary of . Alternative form;
Example. Let an action be
so that and
. Then
Euler-Lagrange equation;
the wave equation.
2.4 Lagrange multipliers
DIDO problem; find a curve such that the area under the curve and above the -axis is
maximised, for a fixed length. I.e. maximise
subject to
Consider augmented function where is the Lagrange multiplier. Then
Substitute so that
, i.e.
which is a
circle. Therefore the solution is is arc of circle passing through of length .
Remark. The formulation of the problem disallows curves that go back on themselves in the -
direction, because this is not a graph ! To get over this, we can instead consider
parametrised curves
.
Euler-Lagrange equations;
for .
Exercise. Find closed curve in the plane such that to maximise
the area inside the curve, with
Finite number of constraints
with . Consider
as
for finite-dimensional problems.
Infinite number of constraints need Lagrange multiplier function. E.g. in fluid mechanics. A
velocity vector field on
subject to (incompressible). Then
Exercise. Work out .
Remark. For
the notation
and
Constraint for each means there is a Lagrange multiplier function .
Green identities
by Green, assuming for large
so 2 lines above
1
st
term =
using the second Green identity;
Therefore
So the Euler-Lagrange equation is
. Now
Final construction; stationary for with if
Static Navier-Stokes equation, then approximate by putting all
.
This can be explained by postulating that
light rays take the shortest time possible
to get from A to B.
Time as path through
where speed of light.
This implies
and
Typical problem; imagine the speed of light
is . Then consider a light path
.
Time is
This is the type of integral
2. Mechanics
A particle of mass has position , moving in a potential . The force so
This equation is the Euler-Lagrange equation for the action
where
Most fundamental equations of classical physics come in this form, eg. Maxwells.
3. Geodesics
A geodesic is a curve of shortest length, or more generally a stationary point for the length
.
(i) In the plane, curves from to ,
The only solution is the straight line. We can also write the curve and then
Since
by the change of variables and
Simplest to choose so that
For
or the integral
The length is independent of parametrisation;
The second definition gives curves parametrised so that is constant. This is usually easier to
work with.
Problem. Find geodesic curves on a cylinder
Solution 1. Use cylindrical coordinates; , , . Then
For a curve on , is fixed, and , . Therefore
A geodesic curve on is a curve which makes stationary
The Euler-Lagrange equations are
Conclusion; geodesic curves are helicoidal curves. (See example sheet 2,
question 9 geodesics are great circles.)
Solution 2. Introduce
The Euler-Lagrange equations are
To find and we must use constraints;
Find ; But by Euler-Lagrange,
Write
, then we have
and
. But
constant constant.
Solutions are therefore , . Again, we get helicoidal
curves.
Problem. (i) Consider
(no -dependence!) and show that a curve which makes stationary satisfies
(ii) Find the curve such that a bead moving without friction on the curve takes the shortest
possible time to go from to .
Solution. (i) Since makes stationary,
The last two terms come about by the chain rule, since . This is equal to
(ii) The bead moves under gravity, so the speed
satisfies
Euler-Lagrange;
This is very hard to solve! Instead we use (i), to say
Choose
; then
The curve is a cycloid.
4. Conservation laws (Noether theorem)
Let be a smooth solution of
Theorem 4.1. (i) If
constant.
(ii) If
constant.
Proof. (i)
Newtons law is Euler-Lagrange equation for
Conservation laws; (i) if does not depend on then
constant. We sometimes call a cyclic coordinate.
(ii)
since does not depend explicitly on .
5. The second variation
When is a solution of the Euler-Lagrange equation
a (local) minimiser of
Possible methods; (i) use convexity. See 2.2 for example (shortest curve between two points).
(ii) look for condition analogous to the 2
nd
order condition in calculus.
If
This tells us that (i)
for all sufficiently small (strict local minimum).
(Positive symmetric means
(ii) a local minimum and
is non-negative (i.e.
).
To extend this to a functional , let be a smooth function (or with .
( is analogous to .) Assume is smooth, then;
For all , such that
then
In this case we get
Terminology.
Definition. is a weak local minimum for if for
sufficiently
small. Call it strict if inequality is strict for such .
Theorem 5.1. (i) If
, and
for some , then is a strict weak local minimum for .
(ii) If is a weak local minimum for , then
and
.
Remark. A space of functions is an infinite-dimensional vector space, so we have to be careful
which norm we use on a space of functions!
Example.
Euler-Lagrange equation;
There is a solution
Now
satisfies the criterion with . Therefore for
Recall . Try , then
, so
since
. Conclusion;
.
Remark. In general the second variation can be written
Associated to this is the Sturm-Liouville operator
This has an infinite sequence of eigenvalues,
where
is an eigenfunction. So if
then