An Algorithm For Minimax Solution of Overdetennined Systems of Non-Linear Equations
An Algorithm For Minimax Solution of Overdetennined Systems of Non-Linear Equations
1. Introduction
THE PROBLEM of finding a solution to an overdetermined set of non-linear algebraic
equations
fj(x) == / , ( * ! , . . . , xn) = 0, j=\,2,...,m, m> n, (1)
in the minimax sense, i.e. to find a vector x that minimizes the maximum residual
F(x) s max|//x)|, (2)
j
Therefore, in the method we present in this paper, the minimax solution to (4) is
found subject to a restriction
||hj| = max|/y < Xk, (5)
i
which is similar to the bounds used by Griffith & Stewart (1961). The value of Xk
is calculated to try to provide the inequality
F(xk+hk) < F(xk) (6)
Thus the value of kk should depend on the goodness of the linear approximation to (1)
at x = xk, and should be chosen as large as possible subject to a certain measure of
agreement between each/} and its linearization being maintained.
Throughout this paper we suppose that the problem under consideration is well
scaled in the variables. It would have been desirable if the method was independent of
2. The Algorithm
We will use the notation
and
F(x, h) = max|//x)+(V/}(x), h)| (8)
j
where (V/}(x), h) is an inner product.
Now suppose that we have found an estimate xk to the solution to our problem, and
a bound on the step length, Xk. Then we calculate a displacement hk to add to xk as a
solution to the linear problem
F(xk,bk) = min {F(x,,h)} (9)
||fl
(hk may be found as the solution to a linear programming problem with (2m+2n)
constraints or more efficiently by a method which is an extension of the exchange
algorithm, Powell (1974)).
In order to accept (xk+hfc) as the next point in the iteration it is desirable to ensure
that the function value will decrease. However, this criterion is not always sufficient to
guarantee convergence, as shown by an example by Fletcher (1972). Therefore we use
the stronger condition
F(xk)-F(xk+hk) > Pl(F(xk)-F(xk, hk)) (10)
where pj is a small, positive number, i.e. we test if the decrease in F(x) exceeds a
small multiple of the decrease predicted by the linear approximation. If (10) is
satisfied we choose x t+1 = xk + hk, otherwise we let xk+l = xk.
The value of Xk+l is defined in the following way: if the decrease in the function
value is small compared to the predicted decrease, we wish to use a smaller bound.
Therefore, if
F(xk) - F(xk+bk) < P2(F(xk) -F(xk, bk)) (11)
ALGORITHM FOR MINIMAX SOLUTION 323
where p t < p2 < 1, we choose At+1 = <r1 ||hA||, 0 < al < 1. As a consequence of this
we will have Ai+1 ^ atXk in the case where xk+1 = xk. If (11) is not fulfilled, and the
agreement between each/; and its linearization is very good, we will allow the value
of the bound to increase. So we use the following test: if
max\fJ{xk+bk)- {fj(xk)+(Vfj(xk), hk)}\ < p3(F(xk)-F(xk+hk)) (12)
0 < p 3 < 1, we choose Xk+l = o2\^k II. where a2 > 1. If neither (11) nor (12) is
satisfied we let At+1 = ||hj|.
Our numerical experience with the algorithm indicates that the choice of the
constants is not critical. For 8 test examples we tried different values (0-20 and 0-33)
°f P2> PZ an£ l au while p t = 001 and a2 = 20, but we did not find any significant
and consequently
/ / x * ) . (V//x*), h*) < 0, 7 = 1 , . . . , t. (21)
It follows from the continuity of/} and V/} that there exists e2 > 0 such that for
||x—x*|| < e2 we have
f/x). (V/}(x), h*) < 0, y = 1,...,/, (22)
and
|(V/}(x),h*)|^£ 2 , y=l,...,r. (23)
Now suppose that ||xt—x*|| < e = min{e1, e 2 }. Let yk be the vector in the direction
of h* with the length min{||hj, e},
(28)
... t
F(xk)-F(xk,hk)
= 1+O(||h t ||). (29)
Now the rules of the algorithm for adjusting and using kk, and the fact that the
sequence {xk} converges, imply that \\hk \\ tends to zero. It follows from equation (29)
that for all sufficiently large k inequality (11) is never satisfied, in which case we have
the bound
> U > MM- (30)
ALGORITHM FOR M1N1MAX SOLUTION 325
Further, since the function values F(xk) must decrease monotonically to F(x*), it also
follows that
lim {F(xk)-F(xk,bk)} = 0. (31)
*-»oo
JeD,
>Pi.8.n (39)
where C, = {je N\(lt ^j^ ut-1) A (X, # x 7+1 )}, and D, = {je C,\Xj e K*}.
Hence F(xk) -* — oo for k -* oo, which is a contradiction.
4. Examples
As one would expect the final rate of convergence depends on whether the linear
systems (4) are bounded away from singularity. We illustrate the two situations by
326 K. MADSEN
two examples. The calculations were performed in double precision on the IBM 370/165
computer.
First we consider an example given by Barrodale, Powell & Roberts (1972). Find
the parameters xh i = 1, 2,.. ., 5 such that the rational expression
(40)
is the best fit to ey at the points y = yi = —1(0.1)1. Thus the functions are
f / \ ^1 **2yj VI • •% *\ *>A /A1\
JTj \J XVt , . . ., XYS ) I ^=
— , , _ . . , _. __2 . 3 — 0*3
^ ' J1 —
~~ 1I Z/
> Zl/ I 1 4L1 1
V* )
k llXfc—X9II I k II h F(xk)
Finally in Table 3 we give the number of iterations used to find the solution (with
4 decimals accuracy) of two types of test examples, well known from work on least
squares. These are the Rosenbrock (1960) function,
A(x) = mx2-x\)
f2(x)=l-Xl (45)
and the "Chebyquad" functions as defined by Fletcher (1965). For comparison we
quote the number of iterations used to find the least squares solution of the problem
(1) by the method of Fletcher (1971), which is similar to our method. In two cases
different solutions were found because the objective functions were different from 0 at
the minimum. For the functions (41) the solutions found agreed in the first two
decimals.
TABLE 3
Functions (41) 5 21 38 8
(43) 2 3 19t 20f
(45) 2 2 17 20
2 4 4
Chebyquad
f- 4
6
8
t Different solutions.
6
8
22f
5
5
14t
22
328 K. MADSEN
Five other types of examples, with the number of unknowns ranging from 2 to 20,
have been used for test purposes. The computational experience in these cases has
been very similar to that shown in the examples above.
The author is greatly indebted to M. J. D. Powell for many helpful comments and
suggestions.
REFERENCES
BARRODALE, I., POWELL, M. J. D. & ROBERTS, F. D. K. 1972 S.I.A.M. J. Num. Anal. 9,493.
FLETCHER, R. 1965 Comput. J. 8, 33-41.
FLETCHER, R. 1971 Report R. 6799, AERE Harwell, Didcot, Berks, England.