Professional Documents
Culture Documents
The purpose of this paper is to formulate, study, and (in certain cases) resolve the
R. E. K A L M A N Inverse Problem of Optimal Control Theory, which is the following: Given a control
Research Institute for Advanced Studies law, find all performance indices for which this control law is optimal.
(RIAS), Baltimore, M d .
Under the assumptions of (a) linear constant plant, (b) linear constant control law,
(c) measurable state variables, (d) quadratic loss functions with constant coefficients,
(e) single control variable, we give a complete analysis of this problem and obtain various
explicit conditions for the optimality of a given control law. An interestingfeature of the
analysis is the central role of frequency-domain concepts, which have been ignored in
optimal control theory until very recently. The discussion is presented in rigorous math-
ematical form.
On the other hand, the assumption that k is a completely ob- Review of the Optimization Problem
servable control law, i.e., that [11]
N o w we turn to Problem B. W e seek an explicit expression for
rank [k, F'k, . . ., (F'^-'k] = the minimum of ( 9 ) and the corresponding optimal control law.
As the general theory of minimizing (7) is quite well known [3-4],
is not needed until much later (see Theorem 6). we shall stress those aspects of the problem which result from
letting ti->-o. This problem was solved in [3].
Mathematical Formulation of the Optimization Problem The loss function L cannot be completely arbitrary. W e want
Our first objective ought to be to find all optimization problems to investigate what restrictions must be imposed on L for purely
which result in a constant, linear control law (Problem A). How- mathematical reasons. W e shall also examine the physical sig-
ever, the general solution of this problem is not known at present. nificance of L.
Therefore we shall merely give some sufficient conditions. In the The first restriction on L is a consequence of Pontryagin's
next section we will see that these conditions imply that the control "Maximum Principle" [4, 12], According to this principle, the
law is constant and linear. variational problem (7) has a solution only if the so-called pre-
Let us recall the mathematical definition of the dynamic op- Hamiltonian function [4]
timization problem in control theory j 9 } . W e denote by x<>) H(t, x,p,p) = L(t, x, p) + (p, Fx + gp)
the unique solution or motion of (1) corresponding to some fixed,
continuous control function p.(t) and the initial state x0 — xM(0; (where p is an «-vector, the costate of ( 1 ) ) has an absolute
Xo). The loss function (or Lagrangian) L is an arbitrary smooth minimum with respect to p. for every fixed value of (t, x, p).
function of t, x, and p. Suppose xo is a fixed initial state and <1 > 0 Substituting (10) into H, it follows that II will have a minimum
a fixed value of the time. Then the integral only if cr & 0. If cr > 0, then H has a unique minimum for all t,
x, p. This is the so-called regular case. If 0 = 0, then H can have
a minimum if and only if r'x = 0 and p'g = 0, in which case H is
V(xo, tx; p) = f W, x„(t; x 0 ), p(t))dt (7) independent of p. This is the singular case. In the latter event,
Jo
the Maximum Principle furnishes very little information about
is a continuous functional of the control function p(t). As usual, the proper choice of p { 1 2 } .
V is called the performance index of the control system (1-2). T o avoid complications which are of little interest in this paper,
We ask: For what continuous control function p*(t) (if any) is we shall consider only the regular case. (The same assumption is
V(x0, t\\ p.) a minimum? made in most of the classical literature on the calculus of varia-
The minimum value of V(xo, h; p) will be denoted by V*(xo, h). tions.) Hence cr must be positive. W e may set cr = 1 without
The Principle of Optimality of the calculus of variations shows loss of generality. Then (10) takes the form:
[3-4] that the function p*(t) (if it exists at all) may always be
2L(x, p) = x'(Q - rr')x + (p + r'x)* (Q = Q')-
generated by a control law of the type
A second restriction is required to assure that the minimum
P*(t) = -X(t, x(t)). (8) value of (7) does not diverge to — «> as h approaches some
Thus the precise formulation of Problem A is the following: finite value. (In other words, we want to rule out the possibility of
Let us pause to examine the physical significance of (11). V(xo, pT) - (1/2)||zo||V„
Clearly L is nonnegative. If L is positive, we incur a loss. The
object of optimal control is to minimize this loss. The loss is zero = lim f L(x°(t), if(t))dt < co.
if and only if both terms in L vanish separately. Therefore H is
chosen in such a way that the state of the plant is "satisfactory"
Since L is nonnegative, the matrix P a is nonnegative definite.
if and only if Hx — 0 { 1 3 } . The choice of r in L fixes the desired
(iii) Of course, ixw(t) is not necessarily optimal and therefore
level p. = — r'x of the control variable.
Therefore P „ may be defined equivalently as As far as initial states in X i are concerned, the outcome of the
optimization problem is determined a priori b y the choice of r.
lim H « ; 0, 0) = P „ . (14)
(l—eo I t is clear from these observations that there is no loss of
generality in requiring
4 We use the notation || x where A is any symmetric matrix, for
the quadratic form x'Ax. dimension Xi = 0. (Aj)
(26)
Implications of Optimality
where det (si — F) = \J/(s) = s" + a „ s " - 1 + . . . + ai. From Theorem 5 we can deduce a number of interesting rela-
Moreover, if tions between optimality and frequency-domain concepts. In
this way we obtain a fully satisfactory solution of the Inverse
Y^t'co)"-1 + . . . + 7i
q'$(iw)g = — (27) Problem of Linear Optimal Control Theory.
(wo)" + a„(i'co)" _ I + . . . + «,' From Condition ( I ) it is clear that a stable control law may be
optimal only if the return difference Tk(iu) satisfies the condition
6 The quadratic form || x for x complex and A real is defined, as
usual, by |n(ico)| = |1 + k'$(s)g\2 > 1. (II)
IA =2
XiaijXj, This condition may well fail to hold { 1 7 } . If it does hold, then
H may be obtained according to Lemma 3 b y factoring the non-
where Xi is the complex conjugate of Xi. negative polynomial T(co 3 ):
determining k an optimal compromise must be made between reducing cussion at the 1962 Joint Automatic Control Conference.
4
1
(44) is equivalent to Case 2 . Si = 0 , S2 = 1. Then co* > 0 and f* &
2 + 4co*2
&(»)**< ~s) = t(s)t(-s) + pf(s)f(-s). (45) Subcase. If h'$(s)g = l / s ( s — 1) then f* attains its lower
As p —• <» precisely 2m zeros of (45) tend to the zeros of bound, while p = co*4.
f ( s ) f ( — s )- The remaining 2(n — m) zeros tend to co and are
asymptotic to the zeros of the equation Case 3. Si = s 2 = —1. Then co* g 1 and f* S: \ l — I -
1 2 ^ 2co*2
s2(n-m) = p (46)
but both equal signs cannot hold simultaneously.
Since all zeros of the polynomial i/'*(s) must be in the lefWialf Subcase (a). If /t'($)(s)(7 = l / ( s 2 + 1), then p = co*4 - 1,
plane, it follows that m zeros of 1pk(s) tend to the corresponding co* > 1, and f* attains its lower bound.
zeros of f ( s ) (which are in the left-half plane, or on the imaginary
Subcase (b). If h'$(s)g = s/(s + l) 2 , then p = f* 2 - 1, f* > 1,
axis by the definition of h) while the remaining n — m zeros of
and co* = 1.
\pk{s) tend asymptotically to a Butterworth pattern [9, 10, 19] of
radius p 1 / 2 ( n ~ m ) . Case 4. si = i and s2 = — i. Then co* ^ 1 and f* §
These observations may be summarized in the following form:
THEOREM I I . (Solution of Problem F). Consider a completely
trollable plant and the optimization problem (9) corresponding to thetaneously.
con-
{ * — ——, but again both equal signs cannot hold simul-
measured output and the transfer function of the controller from and if .r0 = £ j J , then \\HeF'xa\\ == 0.
plant output to control input. Given the first transfer function,
by the methods of this paper it is possible to obtain implicit
conditions which must be satisfied by the second transfer func- {15} The idea of using frequency-domain concepts in the
tion if it is to be optimal. No explicit form of these conditions present context is due to V. M . Popov [31]. B y an extension of
is known (at present). Popov's ideas, the writer has succeeded in obtaining a solution of
the celebrated problem of Lur'e [17]. In fact, Theorem 4 is a
{7} The reader is reminded that in most of control theory (and
variant of the Main Lemma used in [17], which has also other