ORF363 COS323 F14 Lec3

This lecture covers optimization problems, including unconstrained optimization problems like the Fermat-Weber problem and least squares. It introduces basic optimization terminology like decision variables, objective function, constraints, optimal solution, and optimal value. It presents first and second order necessary conditions for optimality, including that the gradient of the objective function must be zero at a local minimum. It also discusses positive semidefinite and positive definite matrices in the context of second order conditions.


Lecture 3, ORF363/COS323, Fall 2014
Instructor: Amir Ali Ahmadi
TAs: Y. Chen, G. Hall, J. Ye

This lecture:
• Optimization problems - basic notation and terminology
• Unconstrained optimization
  ○ The Fermat-Weber problem
  ○ Least squares
• First and second order necessary conditions for optimality
• Second order sufficient condition for optimality
• Solution to least squares

• An optimization problem in general (or abstract) form:

    min  f(x)
    s.t. x ∈ Ω

  Here f is the objective function, the entries of x are the decision variables, and Ω is the constraint set (or feasible set). "min" is short for minimize and "s.t." is short for subject to.

In this class (unless otherwise stated), we have f: R^n → R and Ω ⊆ R^n.

Typically, some description of f and Ω is given as input to us.

Optimal solution x* (also called the "solution" or the "global solution"):

A point of Ω that minimizes f over Ω; i.e., f(x*) ≤ f(x) for all x ∈ Ω.

• May not exist.

• May not be unique.


[Four illustrative plots: (1) x* exists and is unique; (2) x* does not exist because the problem is "unbounded"; (3) x* exists, but is not unique; (4) x* does not exist even though the optimal value is finite.]

Optimal value: f(x*) (if x* exists).

But the optimal value can be well-defined even if x* doesn't exist.


• See the lower right picture above.
• In such a scenario, the term "minimum" is often replaced by "infimum".

• An important case where x* is guaranteed to exist:

  ○ f continuous and Ω compact (i.e., closed and bounded).

• This is known as the Weierstrass theorem.

• What if we want to maximize an objective function instead?

  ○ Just multiply f by a minus sign: maximizing f over Ω is the same as minimizing -f over Ω.

  The optimal solution doesn't change.

  The optimal value only changes sign.

Unconstrained optimization: Ω = R^n.

The decision variables are not constrained at all. The goal is only to minimize the objective function.
Example 1: The Fermat-Weber problem.

[Portraits: Fermat (1607-1665), Weber (1868-1958)]

You have a list of loved ones who live in given locations in the US. You would like to decide where to live so you are as close to them all as possible; say, you want to minimize the sum of distances to each person.

[Figure: a map of scattered points labeled cousin 1, grandma, mom, dad, sister, cousin 3, brother, lover 1, best friend, lover 2, with "you?" marking the location to be chosen.]

Location of person i: x_i ∈ R^2 (given).

Your location: y ∈ R^2 (the decision variable).

The problem: minimize over y the sum of distances Σ_i ||y - x_i||.

• Variant: we are also given weights w_i for each person and minimize Σ_i w_i ||y - x_i||

  (your mom says you should care more about her than lover 1); a small numerical sketch of this weighted version appears after the bullets below.

• Many other applications: e.g., Princeton is deciding on the location of a new gym and wants to minimize distance to dormitories, giving priority to undergrads,…
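
As a small illustration (not part of the original notes): a minimal MATLAB sketch of the weighted Fermat-Weber problem. The locations, the weights, and the use of fminsearch as a generic local search are all assumptions made for this example only.

    % Weighted Fermat-Weber: minimize over y the sum of w_i * ||y - x_i||.
    X = [0 0; 4 1; 2 5; 6 3; 1 7];     % each row is the (made-up) location x_i of person i
    w = [3; 1; 1; 2; 1];               % made-up weights (mom gets weight 3)
    f = @(y) sum(w .* sqrt(sum((X - repmat(y(:)', size(X,1), 1)).^2, 2)));
    y0 = mean(X, 1);                   % start from the centroid of the locations
    ystar = fminsearch(f, y0);         % generic derivative-free local search
    fprintf('best location (%.3f, %.3f), weighted distance %.3f\n', ...
            ystar(1), ystar(2), f(ystar));

Because the objective is a sum of weighted Euclidean norms, it is convex; this is the structure that will later let us conclude that a point found this way is in fact globally optimal.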


• As we'll see later, this optimization problem is "easy" to solve, not because it is unconstrained (as there are many terribly hard unconstrained problems!), but because it has a nice structure (called convexity).
• If at the same time you wanted to be "far" from some subset of
your friends and family, this would have been a very hard problem
to solve!
• Optimization theory is full of instances where a tiny variation in the
problem formulation changes the problem completely from being
very easy to being very hard. It takes a trained eye to detect this.
Hopefully by the end of the class you will develop an appreciation
for this type of phenomenon.
• But we are getting way ahead of ourselves. For one thing, we
haven't even formalized what it means for an optimization problem
to be "easy" or "hard". Let's forget this for now and move on to
another unconstrained optimization problem---one of the most
widely-encountered in science and engineering.

Example 2: Least squares.

[Portraits: Legendre (1752-1833), Gauss (1777-1855)]
Given: an m×n matrix A
       an m×1 vector b

Solve: min over x ∈ R^n of ||Ax - b||^2.

By default, ||.|| always represents the 2-norm; i.e., ||.||2.

In expanded notation, we are solving: min over x of Σ_{i=1..m} ( Σ_{j=1..n} a_ij x_j - b_i )^2.


Some applications of least squares:

Data fitting.

Given: data points (t_1, y_1), …, (t_m, y_m).

Fit a (say, cubic) polynomial p(t) = c_3 t^3 + c_2 t^2 + c_1 t + c_0 to the data, e.g., by minimizing the sum of squared errors Σ_i (p(t_i) - y_i)^2.

• Quick notation exercise: convince yourself that this is a least squares problem.
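
One way to see this (a sketch added for illustration; the data and variable names are made up): each data point contributes one row of a linear system in the unknown polynomial coefficients, and we minimize the sum of squared residuals.

    % Fit p(t) = c(1)*t^3 + c(2)*t^2 + c(3)*t + c(4) to data points (t_i, y_i).
    t = linspace(-1, 2, 30)';                 % made-up sample points
    y = t.^3 - 2*t + 0.1*randn(size(t));      % made-up noisy observations
    A = [t.^3, t.^2, t, ones(size(t))];       % row i is [t_i^3  t_i^2  t_i  1]
    c = A \ y;                                % least squares: minimizes ||A*c - y||
    % polyfit(t, y, 3) returns the same coefficients (as a row, highest power first).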

Overdetermined system of linear equations.

A simple linear predictor for the stock price of a company: the stock price at day t is modeled as a fixed linear combination of the prices on the few preceding days (one 5-day window of data per training example).

We have three months of daily stock price data to train our model (lots of 5-day windows). How do we find the best coefficients for future prediction?

Each window contributes one linear equation in the unknown coefficients; stacking them gives an overdetermined system whose matrix and right-hand side are given from the data (m would be about 90), and the best coefficients in the least squares sense again solve a problem of the form min ||Ax - b||^2.


Optimality conditions

Unconstrained local and global minima.

Consider a function f: R^n → R.

A point x* ∈ R^n is said to be a:

Local minimum: if there exists ε > 0 such that f(x*) ≤ f(x) for all x with ||x - x*|| < ε.

Strict local minimum: if there exists ε > 0 such that f(x*) < f(x) for all x ≠ x* with ||x - x*|| < ε.

Global minimum: if f(x*) ≤ f(x) for all x ∈ R^n.

Strict global minimum: if f(x*) < f(x) for all x ≠ x*.

• Local/global maxima defined analogously.


• A (strict) global minimum is of course also a (strict) local minimum.

[Figure: the graph of a function of one variable with labeled points: a strict global min, a strict local max, a strict local min, a flat stretch of local minima, and a point that is both a local max and a local min. There is no global max in this case; the problem is unbounded above.]


• In general, finding local minima is a less ambitious goal than finding global minima.
• Luckily, there are important problems where we can find global
minima efficiently.
• On the other hand, there are problems where finding even a local
minimum is intractable.
• These statements should become more concrete as the course
progresses.

First and second order conditions for local optimality

Optimality conditions are results that give us some structural information about the properties of optimal solutions. To understand the proofs that follow, make sure you are comfortable with the following notions:
• The gradient vector
• The chain rule
• The Hessian matrix
• Taylor series approximation
See lecture notes of the previous lecture or Sections 5.3-5.6 of [CZ13].

Notation reminder:

The gradient vector ∇f(x): the n×1 vector of partial derivatives ∂f/∂x_i.

The Hessian matrix ∇²f(x): the n×n symmetric matrix of second partial derivatives ∂²f/∂x_i∂x_j.


Notation of [CZ13]:


Theorem. (First Order Necessary Condition for (Local) Optimality)

If x* is an unconstrained local minimizer of a differentiable function f: R^n → R, then we must have:

    ∇f(x*) = 0.

[Portrait: Fermat (1607-1665)]

Proof.


Remarks:
• This condition is necessary but not sufficient for local
optimality.
• Nevertheless, it is useful because any local minimum
must satisfy this condition. So, we can look for local (or
global) minima only among points that make the
gradient of the objective function vanish.
• We will see later that in the presence of an important concept called convexity, this condition is in fact sufficient for local (and global!) optimality.

Second order conditions.

The statements of our second order optimality conditions involve the notions of psd and pd matrices. Let's recap these concepts.

Linear algebra interlude.


(See the last lecture if you need more review.)

Symmetric matrix: A = A^T (A^T denotes the transpose of A).

For example, [1 2; 2 3] is symmetric; [1 2; 3 4] is not.

Theorem. Eigenvalues of a real symmetric matrix are real.

Proof. See, e.g., Theorem 3.2 in Section 3.2 of [CZ13].

A square matrix A is said to be:

• Positive semidefinite (psd) if: x^T A x ≥ 0 for all x ∈ R^n.

• Positive definite (pd) if: x^T A x > 0 for all x ∈ R^n with x ≠ 0.

Notation: A ⪰ 0 for psd, A ≻ 0 for pd.


Recall that when we talk of positive semidefiniteness (or positive definiteness), we assume with no loss of generality that our matrix is symmetric: if A were not symmetric, we could take its "symmetric part" (A + A^T)/2, which satisfies x^T A x = x^T ((A + A^T)/2) x for all x.

Theorem. A matrix is positive semidefinite if and only if all its eigenvalues are nonnegative. A matrix is positive definite if and only if all its eigenvalues are positive.
Proof. See, e.g., Theorem 3.7 in Section 3.4 of [CZ13].

Examples:

MATLAB: eig([2 4;4 5]) returns approximately -0.77 and 7.77; since one eigenvalue is negative, this matrix is neither psd nor pd.

Recall our easy test in dimension 2: a symmetric matrix [a b; b c] is positive definite if and only if a > 0 and ac - b^2 > 0, and positive semidefinite if and only if a ≥ 0, c ≥ 0, and ac - b^2 ≥ 0.

This generalizes to n dimensions using the concepts of principal minors and leading principal minors; see Section 3.4 of [CZ13].
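
A small MATLAB sketch of the eigenvalue test (added here for illustration; the tolerance is an arbitrary choice to absorb rounding error):

    A   = [2 4; 4 5];
    As  = (A + A')/2;              % symmetric part (here A is already symmetric)
    lam = eig(As);                 % real eigenvalues, since As is symmetric
    tol = 1e-10;
    is_psd = all(lam >= -tol);     % psd  <=>  all eigenvalues nonnegative
    is_pd  = all(lam >   tol);     % pd   <=>  all eigenvalues positive
    % For this A, lam is approximately [-0.77; 7.77], so neither test passes.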


Theorem. (Second Order Necessary Condition for (Local) Optimality)

If x* is an unconstrained local minimizer of a twice differentiable function f: R^n → R, then, in addition to ∇f(x*) = 0, we must have:

    ∇²f(x*) ⪰ 0

(i.e., the Hessian at x* is positive semidefinite.)

Proof.

"Little o" notation: see [CZ13], Section 5.6 or our previous lecture.


Theorem. (Second Order Sufficient Condition for (Local) Optimality)

Suppose f: R^n → R is twice differentiable, ∇f(x*) = 0, and ∇²f(x*) ≻ 0 (i.e., the Hessian at x* is positive definite). Then x* is a strict local minimum of f.

Proof.


Remarks.
• ∇f(x*) = 0 (even together with ∇²f(x*) ⪰ 0) is not sufficient for local optimality; e.g., f(x) = x^3 satisfies both at x* = 0 but has no local minimum there.

• ∇²f(x*) ≻ 0 is not necessary for (even strict global) optimality; e.g., f(x) = x^4 has a strict global minimum at x* = 0 even though ∇²f(0) = 0.

Questions to keep at the back of your mind:

• How would we use all these optimality conditions to find local solutions and certify their optimality? (A small numerical sketch follows this list.)
• Is it easy to find points satisfying these conditions? E.g., is it easy to solve ∇f(x) = 0?
• Suppose you have certified that a given point is locally optimal; how would you go about checking whether it is also globally optimal?
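
As a small numerical sketch (added for illustration; the test function below is made up and not from the notes), here is how the two conditions can be checked on f(x) = (x1 - 1)^2 + x1*x2 + x2^2:

    % Gradient of f is [2*(x1 - 1) + x2 ; x1 + 2*x2]; the Hessian is constant.
    grad  = @(x) [2*(x(1) - 1) + x(2); x(1) + 2*x(2)];
    H     = [2 1; 1 2];            % Hessian of f (same at every x)
    xstar = H \ [2; 0];            % solve grad = 0, i.e., H*x = [2; 0]; gives (4/3, -2/3)
    disp(grad(xstar))              % first order condition: (numerically) zero
    disp(eig(H))                   % eigenvalues 1 and 3 > 0, so the Hessian is pd

By the second order sufficient condition, xstar is a strict local minimum; here f is a convex quadratic, so it is in fact the global minimum.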

Exercise. State (and prove) the analogues of our three theorems for
local maxima.

Now that we have a better understanding of the structure of optimal solutions for unconstrained optimization problems, let's revisit our least squares problem…


Least squares, revisited.

Given: an m×n matrix A (assume the columns of A are linearly independent)
       an m×1 vector b

Solve: min over x ∈ R^n of f(x) = ||Ax - b||^2.

Setting the gradient to zero gives ∇f(x) = 2A^T(Ax - b) = 0, i.e., the normal equations A^T A x = A^T b. Because the columns of A are linearly independent, A^T A is positive definite (hence invertible), so the unique point satisfying the first order condition is x* = (A^T A)^{-1} A^T b. Moreover, the Hessian ∇²f(x) = 2A^T A is positive definite everywhere, so by the second order sufficient condition x* is a strict local minimum (and, as convexity will show later, the global minimum).
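
A short MATLAB sketch of this solution (the data below are randomly generated for illustration only):

    % Solve min ||A*x - b||^2 via the normal equations A'*A*x = A'*b.
    m = 20; n = 3;
    A = randn(m, n);               % random data; columns are (almost surely) independent
    b = randn(m, 1);
    x_ne = (A'*A) \ (A'*b);        % x* = (A'*A)^{-1} A'*b
    x_bs = A \ b;                  % backslash also returns the least squares solution
    norm(x_ne - x_bs)              % tiny; the two differ only by rounding error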


Exercise with optimality conditions

Find all the local minima and maxima of the following function:


Notes:
• Optimality conditions are covered in Chapter 6 of [CZ13] in a more general setting where one also has a general constraint set Ω. The unconstrained optimality conditions that we presented here are stated in Chapter 6 as corollaries (called the "interior case"). You are only responsible for what was covered in class.
• Least squares is covered in Section 12.1 of [CZ13]. But again, this is for further reading and my notes should have everything that I expect you to know.

References:
- [CZ13] E.K.P. Chong and S.H. Zak. An Introduction to Optimization. Fourth edition. Wiley, 2013.
- [Bert04] D.P. Bertsekas. Nonlinear Programming. Second edition. Athena Scientific, 2004.
