Wolfe Conditions
Recap
In the last class we saw an approximate line search algorithm where you pick an approximate step size rather than solving for the optimum one. We also saw the Wolfe conditions for checking whether a step size is appropriate.
Sufficient decrease: The step size selected ensures a certain amount of reduction in the function value, i.e. the function value at the new location is less than the one at the previous location by at least "some amount". Mathematically,

$$f(x_k + \alpha_k d_k) \le f(x_k) + c_1\,\alpha_k \nabla f(x_k)^T d_k \tag{1}$$

If you compare with the sheet I shared last week, the terms have been rearranged; the inequality remains the same. Here the reduction is at least $|c_1\,\alpha_k \nabla f(x_k)^T d_k|$. $c_1 \in (0, 1)$ and is usually chosen close to zero.
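As a quick check in code, here is a minimal sketch of condition (1); the names (f, x_k, d_k, grad_k) are illustrative, not from the notes:

```python
import numpy as np

def sufficient_decrease(f, x_k, d_k, grad_k, alpha, c1=0.1):
    """First Wolfe condition (1): f(x_k + a*d_k) <= f(x_k) + c1*a*grad_k^T d_k."""
    return f(x_k + alpha * d_k) <= f(x_k) + c1 * alpha * np.dot(grad_k, d_k)
```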
Curvature condition: The main purpose of this is to prevent step sizes which are too small. In other words, suppose at the $k$th iteration the gradient is $\nabla f_k$, the direction is $d_k = -\nabla f_k$, and the gradient at the new point is such that the function can still reduce further in the direction $d_k$.
An example: say your function is $f(x, y) = x^2 + y^2$. Starting from any point, one can move directly to the minimum at $(0, 0)$. But if you choose a step size which stops at a location other than $(0, 0)$, you have stopped too soon. This second condition prevents such a situation.
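Worked out explicitly: for $f(x, y) = x^2 + y^2$ the gradient is $\nabla f = (2x, 2y)$, so moving along $d = -\nabla f$ from a point $x$ gives $x + \alpha d = (1 - 2\alpha)\,x$. The exact line search gives $\alpha = 1/2$, which lands precisely on $(0, 0)$; any $\alpha < 1/2$ stops short, and that is exactly the situation the curvature condition rules out.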
The algorithm ignores points where the function can decrease further and selects only a step size at which the directional derivative in the $d_k$ direction starts becoming less negative or positive (refer back to the previous document). Mathematically,

$$\nabla f(x_k + \alpha_k d_k)^T d_k \ge c_2\,\nabla f(x_k)^T d_k \tag{2}$$

We will use $\phi(\alpha_k) = f(x_k + \alpha_k d_k)$. Note that $\alpha_k$ is held fixed within an iteration, and our objective is to arrive at an acceptable $\alpha_k$.
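Condition (2) in code, again a minimal sketch with illustrative names (grad_f is assumed to return $\nabla f$ at a point):

```python
import numpy as np

def curvature_condition(grad_f, x_k, d_k, alpha, c2=0.4):
    """Second Wolfe condition (2): phi'(alpha) >= c2 * phi'(0)."""
    phi_prime_alpha = np.dot(grad_f(x_k + alpha * d_k), d_k)  # directional derivative at the new point
    phi_prime_zero = np.dot(grad_f(x_k), d_k)                 # phi'(0), negative for a descent direction
    return phi_prime_alpha >= c2 * phi_prime_zero
```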
Algorithm
This algorithm is from Numerical Optimization by Nocedal and Wright. I have added commentary and examples to illustrate the process. The example function is

$$f(x, y) = (1 - x)^2 + (y - x^2)^2.$$

This function goes by the name Rosenbrock function, also known as the banana function, and is used for testing optimization algorithms. It has only one minimum, but convergence to that minimum is difficult.
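In code (note that the textbook Rosenbrock usually carries a factor of 100 on the second term; the form below follows these notes):

```python
import numpy as np

def rosenbrock(p):
    """The variant used in these notes: f(x, y) = (1 - x)^2 + (y - x^2)^2."""
    x, y = p
    return (1 - x)**2 + (y - x**2)**2

def rosenbrock_grad(p):
    """Analytical gradient of the variant above."""
    x, y = p
    return np.array([-2*(1 - x) - 4*x*(y - x**2), 2*(y - x**2)])
```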
Figure 1: Plot of $\phi(\alpha_k)$ over $\alpha_k \in [0, 0.3]$, with three reference lines described below.
Remember that $\phi(\alpha_k) = f(x_k + \alpha_k d_k)$. Also, from the chain rule, $\phi'(\alpha_k) = \nabla f(x_k + \alpha_k d_k)^T d_k$; note that $\phi'(0) = \nabla f(x_k)^T d_k$. The first step in selecting the step size ($\alpha$) is to check whether the sufficient decrease condition is satisfied. The graph in Fig. 1 is different from what I had shown in the last class (that one was a plot of $\phi(\alpha_k) - \phi(0)$ together with the line $c_1\,\alpha_k \nabla f(x_k)^T d_k$); this one is the plot of $\phi(\alpha_k)$ itself. The description of the lines is as follows:
Red line is the RHS of inequality (1). Its slope is $c_1 \phi'(0)$, which is negative since $\phi'(0)$ is negative.
Yellow line is the tangent of $\phi(\alpha_k)$ at $\alpha_k = 0$. It has slope $\phi'(0)$.
Magenta line has slope $c_2 \phi'(0)$, where $c_2 \in (c_1, 1)$. It is used in the second Wolfe condition.
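A small helper capturing this restriction of $f$ to the ray $x_k + \alpha d_k$, assuming the f/grad pair from the previous snippet (names again illustrative):

```python
import numpy as np

def make_phi(f, grad_f, x_k, d_k):
    """Restrict f to the ray through x_k along d_k."""
    def phi(alpha):
        return f(x_k + alpha * d_k)                     # phi(alpha) = f(x_k + alpha*d_k)
    def phi_prime(alpha):
        return np.dot(grad_f(x_k + alpha * d_k), d_k)   # chain rule: grad f . d_k
    return phi, phi_prime
```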
The algorithm starts by fixing $\alpha_0 = 0$ and a user-defined maximum step size $\alpha_{\max}$. The first trial step size is $\alpha_1 \in (0, \alpha_{\max})$. For the curve drawn, $c_1 = 0.1$, $c_2 = 0.4$, and $\phi'(0) = -18.5$. The algorithm proceeds as follows:
1. Check if the sufficient decrease criterion is satisfied. In this case, yes, since at $\alpha_1 = 0.05$ the curve is below the red line.
2. Next we check whether the step size is too short. Visualizing the tangent at $\alpha_1$, you can see that it is steeper (more negative) than $c_2 \phi'(0)$: $\phi'(\alpha_1) = -15.56$, whereas $c_2 \phi'(0) = -7.4$. This violates the curvature condition.
3. This step size is not acceptable.
4. Using an interpolation method (linear, quadratic, or cubic), pick the next step size from the interval $(\alpha_1, \alpha_{\max})$; $\alpha_{\max} = 0.3$ for this example.
5. Using linear interpolation, $\alpha_2 = 0.175$. The new point is shown in Fig. 2.
Figure 2: $\alpha_2$ is still not acceptable because we are looking for a point where the derivative is still negative; a positive derivative only tells us that an acceptable step lies somewhere in the bracketed range. We want a point roughly at the boundary where the slope changes from negative to positive, while still having a negative slope.
6. Note that at $\alpha_2$ the tangent has a positive slope ($\phi'(\alpha_2) = 4.39$).
7. Though this satisfies the inequality in (2), since the directional derivative of $f(x_k + \alpha_k d_k)$ is positive, we will continue the search until we reach a point where the directional derivative, while still being negative, satisfies the curvature condition.
8. We pick $\alpha_3 = (\alpha_1 + \alpha_2)/2$. Note that the range has been reduced from $(0, \alpha_{\max})$ to $(\alpha_1, \alpha_2)$, and from this range we pick $\alpha_3$.
9. Here $\alpha_3 = 0.1125$ and $\phi'(\alpha_3) = -8.1$; this still does not satisfy the curvature condition.
10. Choose $\alpha_4 = (\alpha_2 + \alpha_3)/2 = 0.1437$. $\phi'(\alpha_4) = -2.54$ and we are done! This point satisfies the curvature condition and still has a negative directional derivative. Fig. 3 shows the additional points; a code sketch of this whole loop appears after the figure.
Figure 3: $\alpha_3$ is not acceptable since the curve is still too steep there, whereas $\alpha_4$ is acceptable: the slope is negative and satisfies the second Wolfe condition.
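Putting steps 1-10 into code: a minimal sketch of the loop, with my own naming. Plain bisection stands in for the linear/quadratic/cubic interpolation mentioned in step 4, and the first trial is simply the midpoint rather than a user-picked $\alpha_1$:

```python
def wolfe_line_search(phi, phi_prime, alpha_max, c1=0.1, c2=0.4, tol=1e-8):
    """Narrow (0, alpha_max) until a step satisfies both Wolfe conditions
    with a still-negative slope, as in steps 1-10 above."""
    phi0, dphi0 = phi(0.0), phi_prime(0.0)   # dphi0 < 0 for a descent direction
    lo, hi = 0.0, alpha_max
    alpha = 0.5 * (lo + hi)                  # first trial step
    while hi - lo > tol:
        if phi(alpha) > phi0 + c1 * alpha * dphi0:
            hi = alpha        # sufficient decrease fails: step too long, shrink toward 0
        elif phi_prime(alpha) < c2 * dphi0:
            lo = alpha        # tangent still too steep (curvature fails): step too short
        elif phi_prime(alpha) > 0:
            hi = alpha        # slope already positive: we have overshot, pull back
        else:
            return alpha      # negative slope and both conditions hold: accept
        alpha = 0.5 * (lo + hi)   # bisection stands in for interpolation
    return alpha              # fall-back if the bracket collapses
```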
In case the initial point $\alpha_1$ fails to satisfy the sufficient decrease condition, the next point $\alpha_2$ is picked from the new interval $(0, \alpha_1)$. Once a point satisfying the sufficient decrease condition is found, the algorithm proceeds as given in the steps above. An example of this case is given in Fig. 4. Here $\alpha_{\max} = 0.325$ and $\alpha_1 = 0.28$. I changed $\alpha_{\max}$ just for the heck of it; 0.3 is also perfectly fine.
Figure 4: $\alpha_1$ fails the first Wolfe condition, whereas $\alpha_2 = (\alpha_0 + \alpha_1)/2 = 0.14$ satisfies both conditions; $\phi'(\alpha_2) = -3.27$.
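To see both branches in action (the sufficient-decrease failure of Fig. 4 is the first branch of the sketch above), here is a quick end-to-end run using the earlier snippets. The starting point below is arbitrary; the notes do not state the point behind the figures:

```python
import numpy as np

x_k = np.array([0.0, 0.0])                 # arbitrary starting point, not the figures' one
d_k = -rosenbrock_grad(x_k)                # steepest-descent direction, here (2, 0)
phi, phi_prime = make_phi(rosenbrock, rosenbrock_grad, x_k, d_k)
alpha = wolfe_line_search(phi, phi_prime, alpha_max=1.0, c1=0.1, c2=0.4)
print(alpha, phi_prime(alpha))             # accepts alpha = 0.25; slope -1.0, within c2*phi'(0) = -1.6
```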
The sheets from the book containing the algorithm are appended. You need to implement this and mail it to me or upload it on Moodle by 31 March 2020. The problem statement is:
TODO
Implement the approximate line search algorithm and use it to minimize a function. You should write code that can accept any function as input and return a minimum. Implement the optimization using (1) the optimal step size and (2) the approximate step size. Note down the time to converge in both cases.
Upload the code and a report. The report should contain your observations and results for minimizing the Rosenbrock function given above.
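If it helps to get started, one possible skeleton for the descent loop, reusing the snippets above (only the approximate-step variant; the exact-step version and the timing comparison are left to you):

```python
import numpy as np

def steepest_descent(f, grad_f, x0, alpha_max=1.0, max_iter=5000, gtol=1e-6):
    """Steepest descent driven by the approximate (Wolfe) line search sketched above."""
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        g = grad_f(x)
        if np.linalg.norm(g) < gtol:       # gradient small enough: converged
            break
        d = -g                             # steepest-descent direction
        phi, phi_prime = make_phi(f, grad_f, x, d)
        x = x + wolfe_line_search(phi, phi_prime, alpha_max) * d
    return x

# e.g. steepest_descent(rosenbrock, rosenbrock_grad, [-1.0, 1.0]) should approach (1, 1)
```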