Long - History of Monge Problem 11p
Long - History of Monge Problem 11p
Transportation Problem
A. M. Vershik
ISSN 0343-6993
Math Intelligencer
DOI 10.1007/s00283-013-9380-x
1 23
Your article is protected by copyright and all
rights are held exclusively by Springer Science
+Business Media New York. This e-offprint is
for personal use only and shall not be self-
archived in electronic repositories. If you wish
to self-archive your article, please use the
accepted manuscript version for posting on
your own website. You may further deposit
the accepted manuscript version in any
repository, provided it is only made publicly
available 12 months after official publication
or later and provided acknowledgement is
given to the original source of publication
and a link is inserted to the published article
on Springer's website. The link must be
accompanied by the following text: "The final
publication is available at link.springer.com”.
1 23
Author's personal copy
A. M. VERSHIK
eonid Vital’evich Kantorovich (1912–1986) was one reception was quite different. The ideas of Kantorovich, and
disciple M. K. Gavurin on a special method for solving a linear not translated into English then, and there were only a few in
transport problem—the potential method [4]. It is an imple- the library.) The following circumstance, important in itself,
mentation of the general method of duality, and the visual helped to speed up the process.
interpretation leads immediately to the analogy with the the- In 1946, the world marked the bicentenary of the birth of
ory of fluid dynamics and flows in networks, which was later the French mathematician and physicist Gaspard Monge
much developed. The article with Gavurin was addressed to (1746–1818), belonging to the brilliant constellation of foun-
transport engineers and planners, but it was rejected by sev- ders of modern mathematics. He was involved not only in
eral serious journals in the field and remained unpublished for mathematics, but also its many applications; he was the
almost 10 years. Not waiting for its publication, LV wrote his founder and head of the famous E´cole Polytechnique, and he
‘‘On mass transfer’’ [3]. I want to say about this work the same was responsible for many inventions. In particular, he for-
that I said about the booklet. This is a classic in all respects: it mulated in 1781 a mathematical problem of ‘‘Excavation and
contains a profound idea that goes beyond those examples embankments’’ (Les de´blais et les remblais)—how to transport
studied previously, it is brief and self-contained, one feels that the soil during the construction of the building of forts and
there is nothing more to be said, just as there is nothing to be roads with minimal transport expenses. His hypothesis of
added to the second law of Newton, and finally, it includes a normality of optimal traffic paths to some surfaces was later
program of future research, one that was followed at first very proved in a 200-page work by Appel (1884).
slowly but proceeds especially quickly today. In Leningrad in 1947 the Commission on History of Mathe-
Before I go into the content, let me say a few words about matical and Physical Sciences of the USSR Academy, headed by
the intellectual mystery story of how this work, followed by a Academician V. I. Smirnov, held a public session dedicated to G.
number of other works by LV and his school, became known Monge, where the well-known Moscow geometer B. N. Dela-
around the world. This tale is told in more detail in the intro- unay spoke on the geometric works of Monge and drew
ductory article to the new edition of the booklet by the son of attention in particular to this problem. The proceedings of the
LV, V. L. Kantorovich [2]. session were published in 1947 [5]. LV, apparently on seeing this
It is clear from the year of publication that in wartime the book, drew attention to this problem of Monge and saw how its
Doklady note could not be immediately known either in the solution was connected with his work. He gave a talk to the
Soviet Union or in the West. (Doklady and other journals were Moscow Mathematical Society (22 December 1947), details of
tions (see [16]). The main result is that the simplex of where L is an appropriate class of Lipschitz functions on
probability measures on a compact Riemannian manifold the metric space (X, r).
equipped with the quadratic Kantorovich metric is a (possibly
infinite) Riemannian manifold. TH E O R E M 5 ([11]) The class of Lipschitz-type admissible
The interrelation of all admissible metrics, in particular of all metrics on S(X) coincides with the class of translation-
Kantorovich power metrics, is more complicated. Power invariant metrics, and consequently with the class of metrics
metrics, as Lp-norms of difference of measures, increase admitting extension to admissible norms on V0(X).
monotonically with increasing p, so the Kantorovich metric
(p = 1) is the smallest in the class. Correspondingly, in the
If L is the space of all Lipschitz functions, we obtain the
class of all admissible metrics, the Kantorovich metric is a sort
Kantorovich metric, which is the largest metric in this class.
of saddle point (see below).
It seems likely that the Kantorovich metric is the unique
The fact that the power rp (p [ 1) of a metric, is, gen-
metric that belongs to both classes of defined metrics. This
erally speaking, not a metric (the triangle inequality may
gives it a sort of ‘‘saddle point position’’ in the set of all
fail) explains the failure of the extension property. If for a
admissible metrics.
metric r and a number p [ 1 it happens that r p is also a
metric, then the study of the metric Kr,p is reduced to the
study of the Kantorovich metric Kpr for a different metric
The Monge Problem (Paths of Transportation)
Actually the original Monge problem was not only to find a
space, in which extension to the Kantorovich–Rubinstein
plan of mass transportation in the sense we have been
norm is possible.
considering, but to choose a transport routing. This ques-
It is interesting that in the case p ¼ 1, the power Kan-
tion is more delicate: in contrast to LV’s problem, here,
torovich metric coincides with the well-known Lévy-
generally speaking, there need not be a unique solution for
Prokhorov metric–see [10], [19], [20].
best transportation plan, as is shown by trivial examples
already in the finite case.
Two Classes of General Transport Metrics
One of the first accurate statements of the problem, to my
Let us look more closely at two kinds of definition of admis- knowledge, was in my work [15] (see also [16]): to find a
sible metrics that motivated the Kantorovich duality theorem. measurable transformation T of a metric space X into itself,
1. Direct posing of the transport problems (nonlinear in taking the initial probability Borel measure l1 to the final
general) and the corresponding class of admissible metrics. one l2 and minimizing the single integral:
For two Borel probability measures l1, l2 on the metric Z
space (X, r), define the class Wl1 ;l2 of Borel probability
inf rðx; TxÞdl1 ðxÞ : T l1 ¼ l2 :
measures on the space X 9 X (transport plans) whose mar- X
ginal projections are the given l1, l2. Choose a norm N(.) on
This problem was attacked by V. Sudakov, who solved it
the space of measurable functions with respect to the measure
for Euclidean compact sets and Lebesgue measures—see later
space ðX X; WÞ for all W 2 Wl1 ;l2 : The transportation
corrections in [12, 18] and references therein.
problem is to find Here there is a connection to variational problems on
inf N ðrð:; :ÞÞ RN ðl1 ; l2 Þ: infinite groups, in this case the group of measure-preserving
w2Wl1 ;l2
transformations (see [16]). The following variational problem
This infimum RN(., .) clearly defines an admissible metric on is closer in nature to the original formulation of Monge: to find
the simplex S(X). a one-parameter group (or semigroup) Tt ; t 2 RðRþ Þ of
If N is the Lp norm, we have the previous Kantorovich measurable transformations of the metric space X minimizing
power metrics. In particular for p = 1 we obtain the Kant- the (time average) integral:
Z t Z
orovich metric, which is minimal in this family of metrics.
The general linear problem of transportation is the problem inf rðx; Ts xÞdl1 ðxÞds : Ts l1 ¼ ls ; s 2 ½0; t; t 2 ½0; T :
0 X
of minimizing the (double) integral of the metric (or minimum
of expectation) such as the L1-norm over a variety of plans, Variational problems on infinite groups (in particular Lie
subject to linear constraints: groups) have still not been studied enough. However,
Z Z striking achievements in the problem of optimal transpor-
inf rðx; yÞdwðdx; dyÞ : w 2 L ; tation, applications to the Monge-Ampère problem in
X X
partial-differential equations, the theory of the Ricci flow,
where L is the set of plans with given marginals and sat- and others testify to the flourishing of ideas in this theory,
isfying the given constraints. which was first stated in the proper generality 70 years ago.
Figure 5. Communication was not completely broken between the East and the West in the mid-30’s! Here are letters to
Kantorovich from Garrett Birkhoff and John von Neumann.
What Name to Give These Things occurs primarily in Western literature. It so happened that my
Previously I used what I regard as the only correct attribution colleague and friend R. L. Dobrushin, head of the laboratory
of concepts and theorems, namely ‘‘Monge–Kantorovich where L. N. Vasershtein worked, with understandable
problem,’’ ‘‘Kantorovich metric,’’ etc. There is no doubt that enthusiasm spread the word mostly among probabilists and
the Kantorovich metric was defined by L. V. Kantorovich, and statistical physicists about the ‘‘new’’ metric and its spectacular
his name should be on it. Similarly for the Kantorovich– applications. I spoke to Dobrushin in 1975 and told him that
Rubinstein norm. what he called the Vasershtein metric in the report is the
Scientific cooperation between the USSR and the Kantorovich metric. After some discussion, he agreed fully
Western allies was inadequate (we could almost say non- and even said so in one of his later works. But it was too late,
existent) in those days; even within a country there was the wrong name stuck.
poor awareness of developments—all of this meant that the Vasershtein’s interesting article [13] was very brief (it seems
metric was repeatedly rediscovered by other authors, but to me that few people who refer to it have looked at it), it does
not in such a fundamental way as in LV’s work, and, most contain in passing a definition of the Kantorovich metric and
importantly, without his main result on duality. In the list of applies it to the behavior of Markov fields. But there is no
those who have used similar ideas, some stand out for their definition of power metrics, although in the literature [14]
importance: the d-metric of D. Ornstein in ergodic theory; those are also called Vasershtein metrics. Undoubtedly, the
the method of coupling in probability, statistics, and sta- work of Vasershtein is worthy of mention in this connection,
tistical physics; the theory of polymorphisms; and much, but I think we should restore the correct terminology out of
much more. The importance and popularity of the topic respect for L. V. Kantorovich, to the teachers and pioneers in
first studied by LV in the late 1930s led to the reappearance our science.
of the metric under new names.
It is especially ironic to find the Kantorovich metric called ACKNOWLEDGMENTS
the Vasershtein metric. Leonid Vasershtein is a famous Supported by the grants RFBR 11-01-12092-ofi-m and RFBR
mathematician specializing in algebraic K-theory and other 11-01-00677-a. The author expresses his gratitude to Joseph
areas of algebra and analysis, and my good friend—and he is Romanovsky for his help in translation of the article and to
absolutely not guilty of this distortion of terminology, which Vl. Kantorovich for photos from the family archive.