Optimal Control Theory (Part I) : Dr. A. M. Tanvir Hussain
Optimal Control Theory (Part I) : Dr. A. M. Tanvir Hussain
Introduction
2
25.1 Introduction
25.1.1 Optimization: static and dynamic
3
25.1 Introduction
25.1.2 Optimization: static and dynamic
I More recent development during 1950s and 1960s based mainly on work
done by Bellman and Pontryagin led to optimal control theory
optimal control more powerful than variational calculus, e.g., dealing with
corner solutions
In optimal control, aim is to find the optimal time path for a “control”
variable which determines optimal time path for the state variable
4
25.1 Introduction
25.1.3 Planning or time horizon
5
oM@~iOKMoaeno~ geOco~5o~ oM@~@c?~ eD~ np5K@~ 4M@~eoM@i~geOcon~ o *~ х ~!xUx
nMew~ oM@~ v5iOesn~ Oco@ia@?O5o@~ np5o@n~ Ocoe~ wMO9M~ oM@~ ns7no6d9@~ _5y~ 7@~
oi5cnDei_@?~ ?sjOcK~ oM@~ gie9@nn~ 4M@n@~ geOcon~'~ o 7
7&7 5i@~ i@D@ii@?~ O੩
25.1 Introduction
5n~ ̭7Ē1 7z@͌ 4e~ Oc?Q95o@~ oM@~ gennO7OYOoy~ eD~ pi5cnDeia5oOec~ Diea~ no5o@~ '~ oe~
25.1.4 Multistage
no5o@~decisionmaking
o w@~?i5w~5c~ Ē͌ Die_~geOco~'~oe~geOco~ o 4M@~eoM@i~6k9~'*~ nMewn~
!qČ
:੩ ɣɤɥɦɧɨɩɪɫɬɭ ߇ߧۏ੩
%p ੩ . . rǶ
ͽÓÔ×
ÕÖ ØÙÚÛÜÝÞ;
!Č %p !Č ੩ !Č. !ĀČ. !Č0
9?;G8 ((
Figure 25.1: A discrete problem
6
-Ķ,ĶĶ24Ķ k ͌Ķ
ī%ĶĶĶ
Ķ Ķ$
6ĶĶ
Ķ$1ĶV'Ķ
25.1 Introduction Ķ Ķ$
ĶĶĶ Ķ 6
Ķ ĶĶĶî Ķ :Ķ ̹<'͌o 1 k 7&7JĶ Ķ$
Ķ
Ķ
˅ѵ
%r,s6
ط੩
੩
੩
੩
੩
੩
੩
੩
ظ੩
࣎ਖ਼ਗ਼ਜ਼ੜਫ਼πρԎ੩
Lƪ ¢Ň¯੩Ăė੩
7
ĶĶ͌ Ķ]%ĶĶ%
ĶĶ
#Ķ/ĶĶĶì,%MĶ%Ķ#Ķ Ȭࡪ
'Ķ a
͌#ĶĶĶĶ#Ķ
͌#ĶĶ#ĶEĶ5Ķ
#MĶ Ķ
25.1 Introduction Ķ Ķ
/Ķ Ķ ͌ ěĶ
͌Ķ Ķ %Ķ Ķ
0
Ķ $1Ķ ÕĶĶ
'Ķ Ķ Ķ Ķ Ķ #Ķ Ķ
éþ
Ķ Ķ #$Ķ $$%Ķ Ķ Ķ
25.1.5 Objective functional
ŃŎŇ¯੩ ēуԍ੩
8
25.1 Introduction
25.1.5 Objective functional
choose the path (formally: time path) that gives maximum V value
9
25.2 The undiscounted problem
25.2.1 Problem statement and preliminaries
ẋ(t) is the equation of motion of state variable (or the state equation)
finite end period T , free endpoint (or vertical terminal line) problem
⇒ x(T ) is free to be chosen optimally
10
25.2 The undiscounted problem
25.2.2 Solution process
11
25.2 The undiscounted problem
25.2.2 Solution process
∂H
= 0 (25.3)
∂y
1
These conditions were developed by L. S. Pontryagin and his associates
2
See Theorem 25.1 (H-L) for maximum principle and boundary conditions
12
25.2 The undiscounted problem
25.2.2 Solution process
1&dŤ ִڠ੩Ğԭ੩Şڧ੩ٰږڕ੩ۛŧ੩
`੩
14
25.2 The undiscounted problem
25.2.2 Solution process
Q2
I Solutions: combine (25.3) and (25.4) to find y ? (t), x? (t) and λ? (t); use
(25.5) to find definite solutions Overall efficien
2
M RS A = M RS B = M
x*(T)
x*(t)
PPF
x0
Figure 25.5:
Hypothetical solution
λ*(t) paths for our free E
endpoint (or vertical
terminal line) problem
y*(t)
λ*(T) = 0
0 T time, t
15
25.2 The undiscounted problem
25.2.3 Interpreting the λ? (t) function
λ? (0) is the shadow value (or marginal willingness to pay) for a marginal
addition to x(0)
λ? (T ) gives the loss (or sacrifice) of maximum function value incurred due
to having the marginal unit of x(T ) remaining at the end
16
25.2 The undiscounted problem
25.2.4 An example
17
25.3 The discounted problem
25.3.1 Fundamentals
people are impatient and prefer the present over the future
I Notation: ρ will be the the “discount rate” and e−ρt will be the the
“discount factor”
discount rate remains constant while discount factor increases over [0, T ]
18
25.3 The discounted problem
25.3.2 The general problem statement
19
25.3 The discounted problem
25.3.3 Solution process
where
Hc = He−ρt (25.10)
−ρt
µ = λe (25.11)
∂Hc
= 0 (25.12)
∂y
20
25.3 The discounted problem
25.3.3 Solution process
∂Hc
µ̇ − ρµ = − and ẋ = g[x(t), y(t), t] (25.13)
∂x
21
25.3 The discounted problem
25.3.4 An example
22
Overall effic
2
I In our original fixed terminal line problem, we would choose x(T ) optimally,
i.e., any change in x(T ) will not change maximum value of the objective
In this fixed terminal point problem, we are not free to do that and therefore
it’s not certain that λ(T ) = 0 will apply; see example 25.3 (H-L)
23
Q2
Overall efficienc
25.4 Alternative terminal conditions 2
x1(t) PPF
Figure 25.7: An
optimization problem
x2(t) with given terminal
x0 condition x(T ) ≥ b
E
x3(t) x(T) = b divides all x(t) paths into
permissible (those with
x(T ) ≥ b) and not
x4(t) permissible (those with
λ*(t)
x(T ) < b)
0 T time, t λ*(T) = 0
24
x3(t)
Q2
x4(t)
x(T) = b PP
Figure 25.8: An
optimization problem E
with given terminal
condition x(T ) = b where
T is finite but free
x0
λ*(t)
0 T1 T2 T3 time, t λ*(T) =
I Terminal point is fixed at b, but, the time horizon (or “speed”) of reaching
that target is free; in this case, transversality condition is H(T ) = 0
25