
EE363 Winter 2008-09

Lecture 10
Linear Quadratic Stochastic Control with
Partial State Observation

• partially observed linear-quadratic stochastic control problem

• estimation-control separation principle

• solution via dynamic programming

Linear stochastic system

• linear dynamical system, over finite time horizon:

x_{t+1} = A x_t + B u_t + w_t,   t = 0, . . . , N − 1

with state x_t, input u_t, and process noise w_t

• linear noise corrupted observations:

y_t = C x_t + v_t,   t = 0, . . . , N

y_t is output, v_t is measurement noise

• x_0 ∼ N(0, X), w_t ∼ N(0, W), v_t ∼ N(0, V), all independent
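As a concrete illustration, here is a minimal NumPy sketch of simulating one trajectory of this system (the dimensions, seed, and zero input are arbitrary choices for illustration, not from the slides):

```python
import numpy as np

rng = np.random.default_rng(0)
n, m, p, N = 5, 2, 3, 50                      # illustrative dimensions/horizon
A = rng.standard_normal((n, n)) / np.sqrt(n)  # keeps eigenvalue moduli near 1
B = rng.standard_normal((n, m))
C = rng.standard_normal((p, n))
X, W, V = np.eye(n), 0.5 * np.eye(n), 0.5 * np.eye(p)

x = rng.multivariate_normal(np.zeros(n), X)   # x_0 ~ N(0, X)
for t in range(N):
    y = C @ x + rng.multivariate_normal(np.zeros(p), V)  # y_t = C x_t + v_t
    u = np.zeros(m)                           # placeholder input for now
    x = A @ x + B @ u + rng.multivariate_normal(np.zeros(n), W)
```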



Causal output feedback control policies

• causal feedback policies:
– input must be function of past and present outputs
– roughly speaking: current state x_t is not known

• u_t = φ_t(Y_t), t = 0, . . . , N − 1
– Y_t = (y_0, . . . , y_t) is output history at time t
– φ_t : R^{p(t+1)} → R^m called the control policy at time t

• closed-loop system is

x_{t+1} = A x_t + B φ_t(Y_t) + w_t,   y_t = C x_t + v_t

• x_0, . . . , x_N, y_0, . . . , y_N, u_0, . . . , u_{N−1} are all random



Stochastic control with partial observations

• objective:

J = E( Σ_{t=0}^{N−1} ( x_t^T Q x_t + u_t^T R u_t ) + x_N^T Q x_N )

with Q ≥ 0, R > 0

• partially observed linear quadratic stochastic control problem (a.k.a. LQG problem): choose output feedback policies φ_0, . . . , φ_{N−1} to minimize J
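For a given set of policies, J can be estimated by Monte Carlo simulation; a sketch (the function name and interface are mine, not the slides'):

```python
import numpy as np

def lqg_cost_mc(A, B, C, Q, R, X, W, V, policies, N, n_runs=1000, seed=0):
    """Monte Carlo estimate of J; policies[t] maps the output
    history [y_0, ..., y_t] to the input u_t."""
    rng = np.random.default_rng(seed)
    n, p = A.shape[0], C.shape[0]
    total = 0.0
    for _ in range(n_runs):
        x = rng.multivariate_normal(np.zeros(n), X)
        Y = []
        for t in range(N):
            Y.append(C @ x + rng.multivariate_normal(np.zeros(p), V))
            u = policies[t](Y)
            total += x @ Q @ x + u @ R @ u     # stage cost
            x = A @ x + B @ u + rng.multivariate_normal(np.zeros(n), W)
        total += x @ Q @ x                     # terminal cost x_N^T Q x_N
    return total / n_runs
```

For example, `policies = [lambda Y: np.zeros(B.shape[1])] * N` estimates the cost of applying zero input.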



Solution

• optimal policies are φ_t(Y_t) = K_t E(x_t | Y_t)
– K_t is optimal feedback gain matrix for associated LQR problem
– E(x_t | Y_t) is the MMSE estimate of x_t given measurements Y_t (can be computed using Kalman filter)

• called separation principle: optimal policy consists of
– estimating state via MMSE (ignoring the control problem)
– using estimated state as if it were the actual state, for purposes of control



LQR control gain computation

• define P_N = Q, and for t = N, . . . , 1,

P_{t−1} = A^T P_t A + Q − A^T P_t B (R + B^T P_t B)^{−1} B^T P_t A

• set K_t = −(R + B^T P_{t+1} B)^{−1} B^T P_{t+1} A, t = 0, . . . , N − 1

• K_t does not depend on data C, X, W, V
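The backward recursion translates directly into code; a minimal sketch, assuming NumPy (the function name is mine):

```python
import numpy as np

def lqr_gains(A, B, Q, R, N):
    """Backward Riccati recursion from the slides:
    returns P_0, ..., P_N and K_0, ..., K_{N-1}."""
    P = [None] * (N + 1)
    P[N] = Q
    for t in range(N, 0, -1):                  # compute P_{t-1} from P_t
        G = np.linalg.solve(R + B.T @ P[t] @ B, B.T @ P[t] @ A)
        P[t - 1] = A.T @ P[t] @ A + Q - A.T @ P[t] @ B @ G
    K = [-np.linalg.solve(R + B.T @ P[t + 1] @ B, B.T @ P[t + 1] @ A)
         for t in range(N)]
    return P, K
```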



Kalman filter current state estimate

• define
– x̂_t = E(x_t | Y_t) (current state estimate)
– Σ_t = E( (x_t − x̂_t)(x_t − x̂_t)^T ) (current state estimate covariance)
– Σ_{t+1|t} = A Σ_t A^T + W (next state estimate covariance)

• start with Σ_{0|−1} = X; for t = 0, . . . , N,

Σ_t = Σ_{t|t−1} − Σ_{t|t−1} C^T (C Σ_{t|t−1} C^T + V)^{−1} C Σ_{t|t−1},

Σ_{t+1|t} = A Σ_t A^T + W

• define L_t = Σ_{t|t−1} C^T (C Σ_{t|t−1} C^T + V)^{−1}, t = 0, . . . , N



• set x̂_0 = L_0 y_0; for t = 0, . . . , N − 1,

x̂_{t+1} = A x̂_t + B u_t + L_{t+1} e_{t+1},   e_{t+1} = y_{t+1} − C(A x̂_t + B u_t)

– e_{t+1} is next output prediction error
– e_{t+1} ∼ N(0, C Σ_{t+1|t} C^T + V), independent of Y_t

• Kalman filter gains L_t do not depend on data B, Q, R
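The covariance recursions and the estimator update can also be coded directly; a sketch, assuming NumPy (function names are mine):

```python
import numpy as np

def kalman_gains(A, C, X, W, V, N):
    """Returns Sigma_t, Sigma_{t|t-1}, and L_t for t = 0, ..., N."""
    Sig_pred = X                               # Sigma_{0|-1} = X
    Sigs, Sig_preds, L = [], [], []
    for t in range(N + 1):
        S = C @ Sig_pred @ C.T + V             # innovation covariance
        L.append(Sig_pred @ C.T @ np.linalg.inv(S))
        Sig = Sig_pred - Sig_pred @ C.T @ np.linalg.solve(S, C @ Sig_pred)
        Sigs.append(Sig)
        Sig_preds.append(Sig_pred)
        Sig_pred = A @ Sig @ A.T + W           # Sigma_{t+1|t}
    return Sigs, Sig_preds, L

def kf_step(xhat, u, y_next, A, B, C, L_next):
    """x-hat_{t+1} from x-hat_t, u_t, and the new output y_{t+1}."""
    pred = A @ xhat + B @ u                    # predicted next state
    return pred + L_next @ (y_next - C @ pred)
```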



Solution via dynamic programming

• let V_t(Y_t) be optimal value of LQG problem, from t on, conditioned on the output history Y_t:

V_t(Y_t) = min_{φ_t, . . . , φ_{N−1}} E( Σ_{τ=t}^{N−1} ( x_τ^T Q x_τ + u_τ^T R u_τ ) + x_N^T Q x_N | Y_t )

• we'll show that V_t is a quadratic function plus a constant, in fact,

V_t(Y_t) = x̂_t^T P_t x̂_t + q_t,   t = 0, . . . , N,

where P_t is the LQR cost-to-go matrix (x̂_t is a linear function of Y_t)



• we have

V_N(Y_N) = E(x_N^T Q x_N | Y_N) = x̂_N^T Q x̂_N + Tr(Q Σ_N)

(using x_N | Y_N ∼ N(x̂_N, Σ_N)) so P_N = Q, q_N = Tr(Q Σ_N)

• dynamic programming (DP) equation is

V_t(Y_t) = min_{u_t} E( x_t^T Q x_t + u_t^T R u_t + V_{t+1}(Y_{t+1}) | Y_t )

(and argmin, which is a function of Y_t, is optimal input)

• with V_{t+1}(Y_{t+1}) = x̂_{t+1}^T P_{t+1} x̂_{t+1} + q_{t+1}, DP equation becomes

V_t(Y_t) = min_{u_t} E( x_t^T Q x_t + u_t^T R u_t + x̂_{t+1}^T P_{t+1} x̂_{t+1} + q_{t+1} | Y_t )
         = E(x_t^T Q x_t | Y_t) + q_{t+1} + min_{u_t} ( u_t^T R u_t + E(x̂_{t+1}^T P_{t+1} x̂_{t+1} | Y_t) )



• using x_t | Y_t ∼ N(x̂_t, Σ_t), the first term is

E(x_t^T Q x_t | Y_t) = x̂_t^T Q x̂_t + Tr(Q Σ_t)

• using
x̂_{t+1} = A x̂_t + B u_t + L_{t+1} e_{t+1},
with e_{t+1} ∼ N(0, C Σ_{t+1|t} C^T + V), independent of Y_t, we get

E(x̂_{t+1}^T P_{t+1} x̂_{t+1} | Y_t) = x̂_t^T A^T P_{t+1} A x̂_t + u_t^T B^T P_{t+1} B u_t + 2 x̂_t^T A^T P_{t+1} B u_t
                                    + Tr( (L_{t+1}^T P_{t+1} L_{t+1}) (C Σ_{t+1|t} C^T + V) )

• using L_{t+1} = Σ_{t+1|t} C^T (C Σ_{t+1|t} C^T + V)^{−1}, last term becomes

Tr( P_{t+1} Σ_{t+1|t} C^T (C Σ_{t+1|t} C^T + V)^{−1} C Σ_{t+1|t} ) = Tr( P_{t+1} (Σ_{t+1|t} − Σ_{t+1}) )



• combining all terms we get

V_t(Y_t) = x̂_t^T (Q + A^T P_{t+1} A) x̂_t + q_{t+1} + Tr(Q Σ_t) + Tr( P_{t+1} (Σ_{t+1|t} − Σ_{t+1}) )
           + min_{u_t} ( u_t^T (R + B^T P_{t+1} B) u_t + 2 x̂_t^T A^T P_{t+1} B u_t )

• minimization same as in deterministic LQR problem (completing the square; spelled out below)

• thus optimal policy is φ_t^⋆(Y_t) = K_t x̂_t, with

K_t = −(R + B^T P_{t+1} B)^{−1} B^T P_{t+1} A

• plugging in optimal u_t we get V_t(Y_t) = x̂_t^T P_t x̂_t + q_t, where

P_t = A^T P_{t+1} A + Q − A^T P_{t+1} B (R + B^T P_{t+1} B)^{−1} B^T P_{t+1} A

q_t = q_{t+1} + Tr(Q Σ_t) + Tr( P_{t+1} (Σ_{t+1|t} − Σ_{t+1}) )

• recursion for P_t is exactly the same as for deterministic LQR
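The completing-the-square step referred to above, written out (the shorthand S and b is introduced here, not in the slides): with S = R + B^T P_{t+1} B and b = B^T P_{t+1} A x̂_t,

u_t^T S u_t + 2 x̂_t^T A^T P_{t+1} B u_t = (u_t + S^{−1} b)^T S (u_t + S^{−1} b) − b^T S^{−1} b

and S is positive definite (since R > 0), so the minimum value is −b^T S^{−1} b, attained at u_t^⋆ = −S^{−1} b = K_t x̂_t; substituting gives the −A^T P_{t+1} B (R + B^T P_{t+1} B)^{−1} B^T P_{t+1} A term in the P_t recursion.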



Optimal objective

• optimal LQG cost is

J^⋆ = E V_0(y_0) = q_0 + E( x̂_0^T P_0 x̂_0 ) = q_0 + Tr( P_0 (X − Σ_0) )

using x̂_0 ∼ N(0, X − Σ_0)

• using q_N = Tr(Q Σ_N) and

q_t = q_{t+1} + Tr(Q Σ_t) + Tr( P_{t+1} (Σ_{t+1|t} − Σ_{t+1}) )

we get

J^⋆ = Σ_{t=0}^{N} Tr(Q Σ_t) + Σ_{t=0}^{N} Tr( P_t (Σ_{t|t−1} − Σ_t) )

using Σ_{0|−1} = X



• we can write this as

J^⋆ = Σ_{t=0}^{N} Tr(Q Σ_t) + Σ_{t=1}^{N} Tr( P_t (A Σ_{t−1} A^T + W − Σ_t) ) + Tr( P_0 (X − Σ_0) )

which simplifies to

J^⋆ = J_lqr + J_est

where

J_lqr = Tr(P_0 X) + Σ_{t=1}^{N} Tr(P_t W),

J_est = Tr((Q − P_0) Σ_0) + Σ_{t=1}^{N} ( Tr((Q − P_t) Σ_t) + Tr(P_t A Σ_{t−1} A^T) )

– J_lqr is the stochastic LQR cost, i.e., the optimal objective if you knew the state
– J_est is the cost of not knowing (i.e., estimating) the state
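These formulas for J^⋆, J_lqr, and J_est can be evaluated numerically; a sketch reusing lqr_gains and kalman_gains from the earlier sketches (the function name is mine):

```python
import numpy as np

def lqg_objective(A, B, C, Q, R, X, W, V, N):
    """Optimal finite-horizon LQG cost and its J_lqr / J_est split,
    using lqr_gains and kalman_gains defined above."""
    P, _ = lqr_gains(A, B, Q, R, N)
    Sig, Sig_pred, _ = kalman_gains(A, C, X, W, V, N)
    # J* = sum Tr(Q Sigma_t) + sum Tr(P_t (Sigma_{t|t-1} - Sigma_t))
    J = (sum(np.trace(Q @ Sig[t]) for t in range(N + 1))
         + sum(np.trace(P[t] @ (Sig_pred[t] - Sig[t])) for t in range(N + 1)))
    J_lqr = np.trace(P[0] @ X) + sum(np.trace(P[t] @ W) for t in range(1, N + 1))
    return J, J_lqr, J - J_lqr   # J - J_lqr equals the J_est expression above
```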



• when state measurements are exact (C = I, V = 0), we have Σ_t = 0, so we get

J^⋆ = J_lqr = Tr(P_0 X) + Σ_{t=1}^{N} Tr(P_t W)



Infinite horizon LQG

• choose policies to minimize infinite horizon average stage cost

J = lim_{N→∞} (1/N) E Σ_{t=0}^{N−1} ( x_t^T Q x_t + u_t^T R u_t )

• optimal average stage cost is

J^⋆ = Tr(Q Σ) + Tr( P (Σ̃ − Σ) )

where P and Σ̃ are PSD solutions of AREs

P = Q + A^T P A − A^T P B (R + B^T P B)^{−1} B^T P A,
Σ̃ = A Σ̃ A^T + W − A Σ̃ C^T (C Σ̃ C^T + V)^{−1} C Σ̃ A^T

and Σ = Σ̃ − Σ̃ C^T (C Σ̃ C^T + V)^{−1} C Σ̃



• optimal average stage cost doesn’t depend on X

• (an) optimal policy is

u_t = K x̂_t,   x̂_{t+1} = A x̂_t + B u_t + L( y_{t+1} − C(A x̂_t + B u_t) )

where

K = −(R + B^T P B)^{−1} B^T P A,   L = Σ̃ C^T (C Σ̃ C^T + V)^{−1}

• K is steady-state LQR feedback gain

• L is steady-state Kalman filter gain
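In code, both AREs can be solved with SciPy's solve_discrete_are; the filter ARE is the control ARE applied to the pair (A^T, C^T) with weights (W, V). A sketch (the wrapper function name is mine):

```python
import numpy as np
from scipy.linalg import solve_discrete_are

def lqg_steady_state(A, B, C, Q, R, W, V):
    """Steady-state gains K, L and optimal average stage cost J*."""
    P = solve_discrete_are(A, B, Q, R)               # control ARE for P
    Sig_pred = solve_discrete_are(A.T, C.T, W, V)    # filter ARE for Sigma-tilde
    L = Sig_pred @ C.T @ np.linalg.inv(C @ Sig_pred @ C.T + V)
    Sig = Sig_pred - L @ C @ Sig_pred                # Sigma from Sigma-tilde
    K = -np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
    J = np.trace(Q @ Sig) + np.trace(P @ (Sig_pred - Sig))
    return K, L, J
```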



Example

• system with n = 5 states, m = 2 inputs, p = 3 outputs; infinite horizon

• A, B, C chosen randomly; A scaled so max_i |λ_i(A)| = 1

• Q = I, R = I, X = I, W = 0.5I, V = 0.5I

• we compare LQG with the case where state is known (stochastic LQR)
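The exact random instance used in the slides isn't reproducible, but a system of this form can be generated as follows (arbitrary seed; lqg_steady_state is the sketch above):

```python
import numpy as np

rng = np.random.default_rng(1)                  # arbitrary seed
n, m, p = 5, 2, 3
A = rng.standard_normal((n, n))
A /= np.max(np.abs(np.linalg.eigvals(A)))       # scale so max_i |lambda_i(A)| = 1
B = rng.standard_normal((n, m))
C = rng.standard_normal((p, n))
Q, R = np.eye(n), np.eye(m)
X, W, V = np.eye(n), 0.5 * np.eye(n), 0.5 * np.eye(p)

K, L, Jstar = lqg_steady_state(A, B, C, Q, R, W, V)
```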



Sample trajectories

sample traces of (x_t)_1 and (u_t)_1 in steady state

[two stacked plots: (x_t)_1 (top, range −2 to 2) and (u_t)_1 (bottom, range −1 to 1) versus t = 0 to 50; blue: LQG, red: stochastic LQR]



Cost histogram

histogram of stage costs for 5000 steps in steady state

[two histograms of stage cost (range 0 to 30, counts up to 500): top panel labeled J^⋆ (LQG), bottom panel labeled J_lqr (stochastic LQR)]
