CS 726: Nonlinear Optimization 1 Lecture 3: Di Erentiability
CS 726: Nonlinear Optimization 1 Lecture 3: Di Erentiability
Lecture 3 : Di↵erentiability
Michael C. Ferris
January 29 2021
15
Note local solution and global solution
Convention
f : Rn ! R.
I Df (x) is a 1 ⇥ n row vector.
I rf (x) = [Df (x)]T (column vector).
g : R n ! Rm .
I Dg (x) 2 Rm⇥n
I [Dg (x)]T = rg (x) 2 Rn⇥m .
o(t)
lim = 0.
t#0 t
R1
3 f (x + p) = f (x) + 0 rf (x + p)T pd .
f (x̄) f (x ) = f ( x̂ + (1 )x̂)
Proof.
Let x 1 ,x 2 be distinct global minimizers. For 0 < < 1, because f is
strictly convex,
f ( x 1 + (1 )x 2 ) < f (x 1 ) + (1 )f (x 2 ).
f ( x 1 + (1 )x 2 ) < f (x 1 ) + (1 )f (x 1 ) = f (x 1 ).
By substituting this bound into the previous integral, we obtain the following
result.
Definition
From now on, x1 , xn , etc, will be used to refer to components of vectors
and x 1 , x n , etc, will be used to refer to distinct points.
Definition
Strictly Convex: A function f : R ! R̄ is strictly convex if 8x,y such
that x 6= y and 8↵ 2 [0, 1]
Definition
h is strongly monotone (⇢) on ⌦ if 9⇢ > 0 such that 8x, y 2 ⌦
hh(x) h(y ), x yi ⇢ kx y k2
f (x + (y x)) f (x) ⇢
f (y ) f (x) (1 ) kx y k2
2
so taking the limit as !0
⇢
hrf (x), y xi f (y ) f (x) kx y k2
2
⇢ 2
f (y ) f (x) + hrf (x), y xi + kx yk
2
⇢ 2
f (x) f (y ) + hrf (y ), x yi + kx yk
2
2
f (y ) + f (x) f (x) + f (y ) + hrf (x) rf (y ), y xi + ⇢ kx yk
⇢ 2
f (u) f ((1 )u + v ) + hrf ((1 )u + v ), (u v )i + k (u v )k (1)
2
⇢ 2
f (v ) f ((1 )u + v ) + hrf ((1 )u + v ), (1 )(v u)i + k(1 )(v u)k (2)
2
⌧Z 1
2
hrf (x) rf (y ), x yi = r f (y + t(x y ))(x y )dt, x y
0
2
⇢ kx yk
D E hx y , rf (z + (x y )) rf (z)i
2
x y , r f (z + (x y ))(x y) = + o(1)
2
⇢ kx y k + o(1)
then
f (y ) > f (x) + rf (x)T (y x)