Professional Documents
Culture Documents
AA203 Optimal and Learning-Based Control: Proof of NOC, Pontryagin's Minimum Principle
AA203 Optimal and Learning-Based Control: Proof of NOC, Pontryagin's Minimum Principle
Linear Non-linear
methods methods
Optimal control
Indirect Direct
DP HJB / HJI
methods methods
Reachability
CoV NOC PMP LQR iLQR DDP LQG analysis
4/28/19 AA 203 | Lecture 9 2
Necessary conditions for optimal control
(with unbounded controls)
We want to prove that, with unbounded controls, the necessary
optimality conditions are (𝐻 is the Hamiltonian)
23
𝐱̇ ∗ 𝑡 = 2𝐩
𝐱 ∗
𝑡 , 𝐮∗ 𝑡 , 𝐩∗ 𝑡 , 𝑡
23
𝐩̇ ∗ 𝑡 = − 2𝐱 𝐱 ∗ 𝑡 , 𝐮∗ 𝑡 , 𝐩∗ 𝑡 , 𝑡 for all 𝑡 ∈ [𝑡7 , 𝑡' ]
23
𝟎 = 𝐱∗ 𝑡 , 𝐮∗ 𝑡 , 𝐩∗ 𝑡 , 𝑡
2𝐮
EG +
𝜕𝑔< ∗ 𝑑 𝜕𝑔 <
0 = 𝛿𝐽< 𝐮 = D I 𝐱 𝑡 , 𝐱̇ ∗ 𝑡 , 𝐮∗ 𝑡 , 𝐩∗ 𝑡 , 𝑡 − 𝐱 ∗ 𝑡 , 𝐱̇ ∗ 𝑡 , 𝐮∗ 𝑡 , 𝐩∗ 𝑡 , 𝑡 𝛿𝐱 𝑡
EF 𝜕𝐱 𝑑𝑡 𝜕𝐱̇
+ +
𝜕𝑔< ∗ 𝜕𝑔<
+ 𝐱 𝑡 , 𝐱̇ ∗ 𝑡 , 𝐮∗ 𝑡 , 𝐩∗ 𝑡 , 𝑡 𝛿𝐮 𝑡 + 𝐱 ∗ 𝑡 , 𝐱̇ ∗ 𝑡 , 𝐮∗ 𝑡 , 𝐩∗ 𝑡 , 𝑡 𝛿𝐩(𝑡)J 𝑑𝑡
𝜕𝐮 𝜕𝐩
= 𝐟 𝐱 ∗ 𝑡 , 𝐮∗ 𝑡 , 𝑡 − 𝐱̇ ∗ 𝑡
2. If the final time is free and the Hamiltonian does not depend
explicitly on time, then
𝐻 𝐱 ∗ 𝑡 , 𝐮∗ 𝑡 , 𝐩∗ 𝑡 =0 for all 𝑡 ∈ [𝑡7 , 𝑡' ]
= 1 + 𝐩 𝑡 + 𝐴 𝐱, 𝑡 + j 𝐩 𝑡 + 𝐛A 𝐱, 𝑡 𝑢A (𝑡)
AkQ