Slides Concepts 1 Differentiability
Slides Concepts 1 Differentiability
Mathematical Concepts:
Differentiation and Derivatives
Learning goals
Definition of smoothness
Uni- & multivariate differentiation
Gradient, partial derivatives
Jacobian matrix
Hessian matrix
Lipschitz continuity
UNIVARIATE DIFFERENTIABILITY
Definition: A function f : S ⊆ R → R is said to be differentiable for
each inner point x ∈ S if the following limit exists:
f (x + h) − f (x )
f ′ (x ) := lim
h →0 h
Intuitively: f can be approxed locally by a lin. fun. with slope m = f ′ (x ).
Left: Function is differentiable everywhere. Right: Not differentiable at the red point.
3 fun
f1
2
y
f2
1 f3
0
−2 −1 0 1 2
x
f (x + h) − f (x) − ∇f (x)T · h
lim =0
h →0 ||h||
f (x + hv ) − f (x)
Dv f (x) := lim = ∇f (x)T · v .
h→0 h
Example: The directional derivative for v = (1, 1) is:
T 1 ∂f ∂f
Dv f (x) = ∇f (x) · = +
1 ∂ x1 ∂ x2
NB: Some people require that ||v || = 1. Then, we can identify Dv f (x)
with the instantaneous rate of change√in direction v – and in our
example we would have to divide by 2.
Proof: Let v be a vector with ∥v ∥ = 1 and θ the angle between v and ∇f (x).
T
Dv f (x) = ∇f (x) v = ∥∇f (x)∥ ∥v ∥ cos(θ) = ∥∇f (x)∥ cos(θ)
Source: Wikipedia
∂ 2 f (x)
2
H (x) = ∇ f (x) =
∂ xi ∂ xj i ,j =1,...,d
If f ∈ C 2 , then H is symmetric
Many local properties (geometry, convexity, critical points) are
encoded by the Hessian and its spectrum (→ later)
[Proof of “⇐” by mean value theorem: Show that λmax (Jh ) ≤ L.]