Professional Documents
Culture Documents
1, 1972
1. I n t r o d u c t i o n
t Paper received May 12, 1971. The research reported in this document was done while
the author was a Guggenheim Fellow at the Imperial CoUege, London, England, and
was supported in part by the U.S. Army Research Office, the U.S. Air Force Office of
Scientific Research, and the U.S. Office of Naval Research under the Joint Services
Electronics Program, Contracts Nos. N00014-67-A-0298-0006, 0005, and 0008. The
author is indebted to various discussions with D. Q. Ma3aae and J. M. C. Clark of
Imperial College which clarified several points.
z Professor, Division of Engineering and Applied Physics, Harvard University, Cambridge,
Massachusetts.
24
© I972PlenumPublishing Corporation,227 West 17th Street, New York,N.Y. 10011.
JOTA: VOL. 9, NO. 1, 1972 25
and x(0) is N(2, Po). Here, F and G are given n × n and n × r matrices,
respectively, and constitute a controllable pair. We are also given the
measurements
z(t) = H~(t) + ~(t), (3)
where H is p x n and (F, H) is observable, with v(t) another gaussian
white-noise process independent of w and
E(v(t)) = O, E(~.(t) v(z)~') =: RSt,, R > 0. (4)
I [xl'~'~J~lr[AllA*2][xl]b't~2~lLx~] !
where the expectation is taken w.r.t, the basic random variables x(O),
g0(t), and v(t), with the control law u taken to be a function of the past
measurement history z(r), 0 <~ r <~ t. It can be shown (Ref. 3) that
any linear terms in u in (7) can always be eliminated by appropriate
linear transformations and modifications on the problem. T h e objective
of the problem is to find the particular control law which minimizes (7).
3 For notational slmplh~carion, we use w~'te noiseformally here. However, the development
to follow can also be carried out by considering the discrete equivalent version of the
problem where no intricacies of the stochastic process need arise. A rigorous treatment
of the continuous time version of the problem to be stated below can also be carried
oat essentially along the lines established by Wonham (Ref. 4, pp. 178-180).
26 JOTA: VOL. 9, NO. 1, 1972
2. D e t e r m i n i s t i c Solution
= + , (8)
] = fro
1 fr
'
r&~&~]
IAI %J
[~q dr. (9)
that is,
3+ Stochastic Solution
J = 2E [4,, 4,,]
~ . LA~2A2e] t&J dt ~-' o [e, ' e2]T t a ~ 2 e J t d dt , (14)
and
rn -- ml m .trq + p'q _ .%
(16)
where K r = [ K 1 , K~] r is the well-known Kalman-Bucy filter gain.
In particular, E ( e ( t ) ~(t) T) = 0 for all t, which enables the decomposition
of (14). Now, since the control u cannot effect e, then the best that can
be done is to require that
&(t) = c~(t) 4~(t) (17)
by virtue of the optimality of the deterministic trajectory and the fact
that the correction term K l ( Z - - t t ~ 1 - - H ~ z ) is another independent
white noise in the first of Eqs. (15). Note that it does not imply that
u(t) = B ( t ) ~ l ( t ) , which is what is expected from the certainty-equi-
valence principle. In fact, we have on the stochastic singular surface
with
r(t) = c,~ + C~[(F~ -- g d J ~ ) + (F~ -- IqH~) C~]
-- (& -- K#,) -- (F~ -- K#~) C~1 (19)
and
o(t) = C.~K~ - K~.
Again, impulsive control is used at t = 0, that is,
T h e control lag, defined by (18) and (20) is realizable and minimizes (14)
by virtue of k~ = C21~1, which is known to be optimal from Section 2.
4~6. Case T--+ co, This solution also shed additional light on the
classical minimum-error servo-regulator problem, where we let T --+ co.
In this case, provided the Riccati equations involved have steady-state
solutions, the linear controller will be constant and will be of lower
order than the full-scale filter-controller solution called for in con-
ventional stochastic optimal control theory. The result of this paper is
thus applicable to the regulation of nonlinear systems about an operating
point.
References