Professional Documents
Culture Documents
Proof: Substituting
1
v = u + R−1 (x)(Lg V (x))T
2
into (3.3.2) and using the HJB-identity we get the following chain of equalities:
Z ∞ 1
J = (l + v T Rv − v T (Lg V )T + Lg V R−1 (Lg V )T ) dt
Z0 ∞ 4 Z ∞
1
= (−Lf V + Lg V R (Lg V )T − Lg V v) dt +
−1
v T R(x)v dt
0Z 2 Z ∞ Z ∞0 Z ∞
∞ ∂V dV
T
=− (f + gu) dt + v R(x)v dt = − + v T R(x)v dt
0 ∂x 0 Z
∞
0 dt 0
= V (x(0)) − lim V (x(T )) + v T R(x)v dt
T →∞ 0
Because we minimize (3.3.2) only over those u which achieve limt→∞ x(t) = 0,
the above limit of V (x(T )) is zero and we obtain
Z ∞
J = V (x(0)) + v T R(x)v dt
0
ẋ = x2 + u
It can be checked that V (x) is positive definite and radially unbounded. The
control law
1 ∂V √
u∗ (x) = − = −x2 − x x2 + 1 (3.3.7)
2 ∂x
achieves GAS of the resulting feedback system
√
ẋ = −x x2 + 1
ẋ = Ax + Bu (3.3.8)
P A + AT P − P BR−1 B T P + C T C = 0 (3.3.9)
u∗ (x) = −R−1 B T P ∗ x
A proof of this result can be found in any standard text, such as [1]. For our
further discussion, the semidefiniteness of l(x) = xT C T Cx is of interest because
it shows the significance of an observability property. It is intuitive that “the
detectability in the cost” of the unstable part of the system is necessary for
an optimal control to be stabilizing. A scalar example will illustrate some of
the issues involved.
94 CHAPTER 3. STABILITY MARGINS AND OPTIMALITY
in which the state is observable. The corresponding Ricatti equation, and its
solutions P1² and P2² are
√ √
2P − P 2 + ²2 = 0, P1² = 1 − 1 + ², P2² = 1 + 1 + ²
√ √
The HJB solutions V1² (x) = (1 − 1 + ²)x2 and V2² (x) = (1 + 1 + ²)x2
converge, as ² → 0, to V1 (x) = 0 and V2 (x) = 2x2 , respectively. This reveals
that V1 (x) = 0 is the limit of V1² (x) which, for ² > 0, is negative definite while
J ² must be nonnegative. Hence V1² (x) cannot be a value function, let alone
an optimal value function. The optimal value function for J ² is V2² (x) and
Theorem 3.19 identifies its limit V2 (x) as the optimal value for J. 2
In our presentation thus far we have not stated the most detailed conditions
for optimality, because our approach will be to avoid the often intractable task
of solving the HJB equation (3.3.3). Instead, we will employ Theorem 3.19
only as a test of optimality for an already designed stabilizing control law.