Professional Documents
Culture Documents
9, SEPTEMBER 1991
Abstract-The initial state of an unforced linear system is fied, it is desired to obtain a safe set of initial conditions, i.e.,
output admissible with respect to a constraint set Y if the a set Z such that x(0) E Z implies (1.2) is satisfied for all
resulting output function satisfies the pointwise-in-time condi- integers t L 0.
tion A t ) E Y , t 2 0. The set of all possible such initial condi-
tions is the maximal output admissible set 0,. The properties of The problem just cited, as well as many others of practical
0, and its characterization are investigated. In the discrete-time interest, can be stated equivalently as a problem involving an
case, it is generally possible to represent O,, or a close approxi- unforced linear system with pointwise in time output con-
mation of it, by a finite number of functional inequalities. straints. Specifically, we are given a triple, A E &? n x n ,
Practical algorithms for generating the functions are described.
In the continuous-time case simple representations of the maxi-
C E &? p x ', and Y C 9? p , and are to determine if the motion
mal output admissible set are not available; however, it is shown of the discrete-time system
that the discrete-time results may be used to obtain approximate
representations. Maximal output admissible sets have important
x ( t + 1) = A x ( t ) , x ( t ) E g n , y ( t ) = C x ( t ) (1.3)
applications in the analysis and design of closed-loop systems satisfies the output constraint
with state and control constraints. To illustrate this point, a
modification of the error governor control scheme proposed by Y ( t )E y (1.4)
Kapasouris, Athans, and Stein [6] is presented. It works as well
as their implementation but reduces the computational load on for all t E ,a+, where 9+ is the set of nonnegative integers.
the controller by several orders of magnitude. Problem (1.1)-( 1.2) fits this format by making the assign-
ments A + B K + A , C + D K + C , a n d Y + Y .
To assist our subsequent discussions, we introduce some
I. INTRODUCTION
terminology. The state constraint set associated with A ,
x(0) E Z, the subsequent motion x( t ) , t > 0 belongs to Z. If We have introduced 0, and 0: in the context of output
Z is positively invariant and Z C X ( C , Y), it is obvious thatadmissibility because of the close connection between output
Z is output admissible. This observation has been used in admissibility and the way in which physical constraints are
several different contexts to produce output admissible sets expressed. It is also possible to introduce 0, and 0: in the
[3], [9]-[13]. Assume, as is common in these papers, that S context of positively invariant sets. From (1.7) and (1.8) it
is the discrete-time system x ( t + 1) = Ax( t ) , A is asymp-may be verified that if 0, and 0: are not empty they are
totically stable, and X is a polyhedron containing the origin positively invariant. Thus, over the class of positively invari-
in its interior. Use ' to denote matrix transpose. Let V(x) = ant subsets of X ( C , Y) they are maximal. To our knowl-
x' Px be a Lyapunov function for the system (1.3), generated edge, such maximal positively invariant sets have not been
by solving A'PA - P = - Q, where Q' = Q > 0. Then studied previously.
P' = P > 0 and it is clear that [ 141 the set W = { x E 2 n : The results of this paper are organized in the following
V (X) 5 c } , c > 0, is positively invariant. Clearly, c may be way. Section I1 deals with basic issues concerning the dis-
selected so that W C X and W is output admissible. Even if crete-time case: Y-dependent properties of Om, the determi-
c is maximized subject to W C X , this approach is likely to nation of 0, by a finite number of operations, the characteri-
be very conservative in the sense that much larger output zation of Y and 0, by functional inequalities, simplifica-
admissible sets exist. The difficulty is that the ellipsoid W tions which occur when C, A is unobservable and/or A is
may not fit tightly into the polyhedron X [3]. This has led to unstable. A condition for finite determinability, which is
the consideration of conditions which imply that a specified given in Section 11, leads to algorithmic procedures for the
polyhedral set W is positively invariant [9]-[13]. The condi- computation of 0,. These procedures are presented along
tions have been applied with special assumptions on C , D, with some simple examples in Section 111. In Section IV it is
and Y to closed-loop systems of the form (1. l ) , (1.2). This shown that 0, is finitely determined i f A is asymptotically
leads to techniques for finding K such that (1.1) is asymptot- stable, the pair C , A is observable, Y contains the origin in
ically stable and (1.2) is satisfied for all initial conditions in
its interior and is bounded. If A is only Lyapunov stable, it
W . For the techniques to work the prespecified W must be often follows that 0, is not finitely determined. However, if
sufficiently small and, as is the case with ellipsoidal W , muchthe only characteristic roots of A which have unit magnitude
larger sets of safe initial conditions may exist. are at X = 1, an approximation of 0, is finitely determined.
It is worth noting that the approaches taken in [9]-[13] These matters are taken up in Section V. Section VI treats
apply to discrete-time systems, but not to continuous-time continuous-time systems and proves the approximation result
systems. The reason is apparent: for discrete-time systems, mentioned above. In Section VI1 some additional conditions
positive invariance of Z is equivalent to A-invariance of Z. on Y , called the Minkowski assumptions, are considtred.
Thus, the positive invariance required in the various develop- With these assumptions some of the results of the previous
ments of the papers is expressed simply in terms of the sections are strengthened. In Section VI11 we introduce a
algebraic properties of A . The situation for continuous-time modification of the error governor control scheme of [6]
systems is certainly more complex: Z is positively invariant which is based on 0, rather than 0:. An example applica-
if and only if Z c eA'Z for all t E gR+. tion shows that it is simple, fast, and effective. Section IX
As a final point of interest, it is easy to give examples of contains some final remarks.
output admissible sets which are not positively invariant; The following notations and terminology will be used. The
identitymatrixin 2"" is I,,. Let YE 9?, r e @, X E 9?',
such sets have received little if any attention in the literature.
This paper departs from past lines of research in that it is A E 9?m x n , and Z, Z,, Z, c 2 ". Then: xi is the ith com-
concerned with the investigation of maximal output admissi-
ble sets. For the discrete-time and continuous-time systems
ponent of x , I I X I J= Jm, I I A I=I rnax{IIAxII:IlxIl
= l } , S(r) = { x : JIxI(Ir}, a Z = { a x : X E Z } , - Z =
these sets are defined, respectively, by (-l)Z, Z, + +
Z, = { x l x,: xl E Z , , x,EZ,}. Z is
O , ( A , C , Y) = { X E g': CA'XE Y V t E J + ) (1.7) symmetric if Z = - Z . The boundary and interior of Z are
and denoted, respectively, by b d Z and int Z. The product set of
Z with itself k times is Z k .
O : ( A , C , Y ) = { X E g': CeA'xE Y VtE 2 + } . (1.8)
We assume hereafter that 0 E Y. This assumption is satis-
Our emphasis will be on the discrete-time case because of its fied in any reasonable application and has nice consequences.
relative simplicity and the susceptibility of 0, to algorithmic In particular, 0, and 0: are nonempty and contain the
determination. However, we shall also examine the relation- origin.
ship between O,(eAT, C , Y) and O;(A, C , Y) where T E
&+. As T 0 the first set approximates the second set, a
+
result which allows us to apply discrete-time computational 11. BASICRESULTS FOR THE DISCRETE-TIME CASE
techniques to continuous-time systems. References to maxi- Imposing special conditions on Y often imposes corre-
mal output admissible sets have appeared previously in con- sponding conditions on 0,. Some implications of this type
nection with several nonlinear feedback control scheme which are summarized in the following theorem.
take into account control and state constraints; see [15] for Theorem 2.1:
the discrete-time case and [6]- [8] for the continuous-time i) Each of the following properties of Y are inherited by
case. 0,: closure, convexity, symmetry.
1010 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 36, NO. 9 , SEPTEMBER 1991
ii) Suppose C , A is observable and Y is bounded. Then, satisfy f j ( 0 )I0. Then: i) the functions g i : g " --* g given
0, is bounded. bY
iii) Suppose A is Lyapunov stable (the characteristic roots g i ( x ) = sup { f , ( C A ' x ) :t E Y + } (2.4)
of A satisfy the following conditions: I hi(A ) I I 1, i =
1; n, and I h i ( A )I = 1 implies h i ( A ) is simple) and
a ,
are defined, ii) 0 E 0, and iii)
0 E int Y . Then, 0 E int 0,.
iv) Let cy^ g .Then O , ( A , C , a Y ) = a O , ( A , C , Y ) .
Proof: The results in i) are easily verified from the Proof: Let x E .!2 '. Using the notation defined in the
definition of 0,. To prove ii) define proof of Theorem 2.1, it follows that for all t E Y'+, CA'X E
H' = [C' A'C' ... (A')"-'C'] E g n x n p S ( y l (1 X I ( ) . Thus, by the compactness of S and the continuity
of the f j , the sequence f i ( C A ' x ) , t E 9+, is bounded from
The observability assumption implies that H has rank n. above. Therefore, the supremum in (2.4) is defined for all
Thus, if Hx = z has a solution, the solution is unique and is X E g".Results ii) and iii) are a direct consequence of
given by x = H t z where Ht = ( H ' H ) - ' H ' . Define f j ( 0 ) I0 and the definition of 0,. 0
When Y is given by (2.3) and 0, is finitely determined
O , ( A , C , Y )= { x E & " n : C A k x E Y f o r k = O ; . . , t } .
(2.1) 0, = { X E g":fi(CA'X) I O ,
i ~ { l , - * * , t ~E }{ O
, ; * - , t * } } . (2.6)
Clearly, 0, C O N - , = { x : H X EY " } = H t Y " . Since Y
is bounded, so is H i Y " . Thus, ii) follows. The assumption Not all of the s * (t* + 1) inequalities in (2.6) may be active
of Lyapunov stability in iii) implies that there exists a con- in the sense that there exists and x E 0, such that f j ( C A ' x )
stant y t > 0 such that for all x E LJ? and t E y+, I( CA'x \( = 0. The active inequalities have a special structure which is
5 y,I( xII. Choose y2 > 0 so that S(y2) C Y . Then, y I 11 x I I given in the following theorem.
5 y2 implies CA'x E Y for all t E 9+. Hence, S(y, /y,)C Theorem 2.4: Suppose 0, is given by (2.6). Then there
0, and 0 ~ i n 0,.t Result iv) is obvious for cy = 0, so exists a nonempty set of integers S* c { 1 , . . * , s} and in-
assume cy # 0. Then, O,(A, C , a y ) = { x : C A ' c y - ' x ~Y dexes t', ~ E S * such
, that i) t* = max { t,: i E S * } ,
v t E 9'} = { c y ( a - ' x ) :cA'(cy-'x) E Y V t E S'} = ii)
cyO,(A, c, Y ) . 0
Remark 2.1: The identity in iv) shows that 0, is scaled 0, = { X E g ' " : f i ( c ~ ' Ix )0 , i E S * , tE {o;.., t ? } }
in direct proportion to the "size" of the output constraint set.
There is no change in its shape. (2.7)
Obviously, the set Of( A , C , Y ) satisfies the following
iii) for all iES* and t E ( 0 ; ti*} there exists and
e ,
condition:
0, such that fi(CA'x) = 0.
X E
0, C Of2C 0,, v t , , t , E 9+ such that t , It , . (2.2) -
Proof: Let S* = { i : 3 t E { 0 , -,t*} and x E 0, such
that f j ( C A ' x ) = 0 ) . If i # S* it follows that f j is inactive in
We say 0, is finitely determined if for some t EY', (2.6), i.e., fi(CA'x) < 0 for all t E ( 0 ,* * , t * } and x E 0,.
0, = 0,. Let t* be the smallest element in 9+ such that For i E S * , define t? = rnax{tE{O;..,t*}: 3 ~ ~ such 0 ,
0, = Ot*.We call t* the output admissibility index. From that f,(CA'x) = 0} . Then it is obvious that (2.7) holds.
(2.2) it follows that 0, = 0, for all t 1 t*. Result iii) follows from (2.7) and the definition of t'.
Theorem 2.2: 0, is finitely determined if and only if Clearly, there exists X E 0, such that f i ( C A t f x )= 0. This
,
0, = O,+ for some t E 9+. implies f i ( C A t ? - ' x ( t ) )= 0, t = O,..., t?, where x ( t ) =
Proof: If 0, is finitely determined the equality holds A'x. By (1.7), x ( t ) E 0, for all t ~ 9 + . 0
for and t 1 t*. From 0, = 0,+1 it is easily confirmed that Remark 2.2: Suppose there exists an i E { 1, * s} such a ,
x E 0, implies that Ax E 0,; thus, 0, is A-invariant. Also, that f i ( C x ) < 0 for all x EX(C, Y ) . Then it is obvious that
0, C X . Hence, 0, is output admissible and 0, C 0,. i # S * . Even when such trivial situations are excluded, it is
Property (2.2) completes the proof. -
0 possible that S* # { 1, * s}. Consider, for instance, the
a ,
Finitely determined maximal output admissible sets gener- example: n = p = 1, A = [-0.81, C = [l], f l = y - 1,
ally have a simpler structure than those which are not. f, = - y - 2 then S* = (1) and t: = t* = 1 .
Moreover, they can be obtained by finite recursive proce- Remark 2.3: Result iii) states that all the inequalities in
dures. These matters, and conditions which imply finite (2.7) are active. This does not exclude the possibility that
determinability, are discussed in subsequent sections. some of the inequalities may be redundant in the sense that
Next, consider the situation where Y is defined by func- they can be removed from (2.7) without destroying the
tional inequalities: characterization of 0,. For example, redundant inequalities
Y = { y ~ g ~ : f 1~ 0(, iy= ) l ; . * , s } . (2.3) in (2.3) lead to redundant inequalities in (2.7). If such
redundant inequalities are excluded, it is still possible to give
Under what circumstances is 0, defined in a similar way? examples where there are redundant inequalities in (2.7).
Theorem 2.3: Suppose A is Lyapunov stable and that for However, these examples appear to be structurally unstable,
i = 1 , * * s the functions f i : %!
e , +9 are continuous and i.e., small changes in A , C or the f i eliminate redundant
GILBERT AND TAN: LINEAR SYSTEMS WITH STATE AND CONTROL CONSTRAINTS 1011
inequalities. Thus, from a practical point of view it appears where A , E 9m x m is Lyapunov stable and CA'x diverges
that (2.7) offers the most economical representation of 0,. on Y+ unless the last n - m components of x are zero.
Remark 2.4: Because of Theorem 2.4, (2.4) and (2.5) can Now assume Y is bounded. Then it is obvious that
be replaced by
O , ( A , C , Y ) = O m ( A l , C ,Y, ) x ( 0 ) . (2.14)
g i ( x ) = max{f,(CA'x): tE { O ; . . , t:}} (2.8) A coordinate-free interpretation of (2.14) is that Om C L .
and Remark 2.5: With respect to the determination of 0, the
consequences of the preceding two paragraphs are clear. It is
0, = { X E g n :g i ( x ) 5 0 , ~ E S * } . (2.9)
possible to successively eliminate from the representation of
These simplification in (2.4) and (2.5) are valuable in control C , A the "unobservable" and "divergent motion" sub-
applications where it is necessary to have an efficient compu- spaces. Hence there is no loss of generality in restricting our
tational procedure for testing whether or not x E 0,. attention to systems which are observable and (if Y is
A change in state coordinates for the system (1.3) produces bounded) Lyapunov stable. This observation is important
a simple transformation of 0,. To see this let U E &? be because several of our key results depend on the assumptions
the nonsingular matrix which describes the change in coordi- of observability and Lyapunov stability.
nates. Then it can be verified from (1.7) that
111. THEALGORITHMIC OF 0,
DETERMINATION
Om(A , C , Y ) = UOm( a,e, Y ) Theorem 2.2 suggests the following conceptual algorithm
where 2. = CU, A" = U - ' A U . (2.10) for determining t* and 0,.
Algorithm 3.1:
Because of this relationship, we shall feel free in our subse- Step 1: Set t = 0.
quent derivations to express the pair C , A in whatever Step 2: If O,,, = 0,, stop. Set t* = t and 0, = 0,. If
,
coordinate system seems most natural. This path is accept- 0,+ # O,, continue.
able for both theoretical developments and the computation +
Step 3: Replace t by t 1 and return to Step 2. 0
of 0,. Clearly, the algorithm will produce t* and 0, if and only
The most obvious application of this idea is to unobserv- if 0, is finitely determined. There appears to be no finite
able systems. Choose the coordinate system in the usual way algorithmic procedure for showing that 0, is not finitely
[16] so that C , A has the form
:;1 12]
- determined. Fortunately, as subsequent developments show,
(2.11) it is often possible to resolve the issue of finite determination
C = [C1 01, A =
by other means. Algorithm 3.1 is not practical because it
does not describe how the test O,,, = 0, is implemented.
where C , E 9p x m , A , E 9 m x m , A , E 9 ( n - m ) x ( n A3- mThe
7 ) difficulty can be overcome if Y is defined by (2.3) and
E g ( n - m ) x m and the pair C , , A , is observable. The appli-
the hypotheses of Theorem 2.3 are satisfied, for then Step 2
cation of (1.7) then shows that
leads to a set of mathematical programming problems.
O , ( A , C , Y ) = O , ( A , , C , , Y ) x gn-'".(2.12) Algorithm3.2:
Step 1: Set t = 0.
Thus, 0, is a cylinder set whose determination depends only
Step 2: Solve the following optimization problems for
on the triple A , , C , , Y . Stated in a coordinate-free way, 0,
i = l ; . . , s:
is a cylinder set which has infinite extent in those directions
which belong to the unobservable subspace. Another conse- maximize Ji( x) = fi(CA'+'x) ( 3 4
quence of (2.12) is the necessity of the observability condi-
tion in result ii) of Theorem 2.1 subject to the constraints
Finally, we consider systems (1.3) which are observable
f j ( C A k x )5 0 , j = l ; . . , S , k = O ; . . , t . (3.2)
but have divergent motions. Define L = { X E g n :the se-
quence A'x, t E Y + , is bounded}. Clearly, L is a linear
Let J: be the maximum value of Ji(x). If J: 5 0 for
space. It is determined by the direct sum of the invariant
i = 1 ; * - , s, stop. Set t* = t and define 0, by (2.6).
subspaces of A which are associated with those eigenvalues
Otherwise, continue.
that satisfy I hi(A ) 1 < 1, together with the span of the
eigenvectors of A which correspond to I hi( A ) I = 1. We
+
Step 3: Replace t by t 1 and return to Step 2. 0
Remark 3.1: After Algorithm 3.2 has terminated, it is
will not dwell on the details, but it is possible to determine a
possible to obtain S* and the indexes ti* by solving a
basis for L numerically from the real Schur form [17] of A .
sequence of mathematical programming problems. Let J z =
Let m = dimension of L. Choose a basis for g n in the
max f i ( C A ' x )subject to the constraints f j ( C A k x )5 0, j =
following way: let the first m basis vectors be a basis for L ,
l ; . . , s , k = O ; . * , t * . For each i ~ { l ; . * , s } determine
let the remaining n - m basis vectors be a basis for the
? E Y + sothat J z < 0 for t = i + l ; . . , t* and Jji= 0. If
orthogonal complement of L. In this basis it is easy to see
this is possible, i E S * and ti* = f. If J,,< 0 for t =
that the pair C , A has the form
O ; . . , t*, i # S * .
The success of Algorithm 3.2 depends on the existence of
effective algorithms for solving the rather large mathematical
1012 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 36, NO. 9, SEPTEMBER 1991
2
programming problems which arise. This presents some dif-
ficulty because global optima are needed. Even when the fi,
i = 1; a , s, are convex, the difficulty remains because the
60'o 2
programming problems require the maximum of a convex
function subject to convex constraints. When Y is a polyhe-
dron, the difficulty disappears. Then, the programming prob-
lems are linear and efficient computer codes for obtaining the
global maxima abound.
Remark 3.2: Suppose the optimization algorithm used in
Step 2 is not guaranteed to provide a global minimum.
Algorithm 3.2 may still produce a useful result. If it has been
determined that for some x and t in Step 2 that (3.2) is
satisfied and Ji(x ) I0, i = 1 , . * s, then 0, = 0,. Of
e ,
TABLE I
DATAFOR EXAMPLES
Example A C Y Xi t*
3.1
0.5
[ -1.0 0.21 [-0.5,5.0] 0.6,0.3
3.2 1.o
[ ::: - 1.0 l.OI W) 0.76,0.6
3.3
0.3 [ ::: 1 .o 1 O.O
[-1.0,
x(--,
+m)
1.01
0.5,0.3
3.4
0.3
[ 1.0 0.01 [-1.0, +m) 0.5,0.3
3.5
-0.3
[ 1.0 1.01 [O.O, 1.01 0.58, - 0.28
3.6
0.2
[ 1.0 1.01 [o.o, 1.o] 0.5,0.2
3.7
3.8
i[; i]
cos -
[sin
-sin -
cos
-sin11
[ 1.0
[ 1.0
1.01
1.01
[ - 1.0,1.0]
[ - 1.0, 1.01
3
00
cos 1
3.9 1.0,0.5 0
of such assumptions here. Instead, because it has greater that C, x # Y . Since Y is closed, this implies the existence of
practical interest, we will investigate the relaxation of i). an r > 0 such that for all w E S ( r ) , C , x +
In particular, we consider systems where A is Lyapunov w # Y . Because x E Om(A , C , Y ) , it follows that y( t ) =
stable. By Remark 2.3 this represents no loss of generality +
CA'x = [ C , O]x [0 C,A',]xe Y for all ~ E Y +From .
when Y is bounded. The most common situation, and the the asymptotic stability of A , there exists a k E Y+ such
one which we treat here, is where the only characteristic that 11 [O C, A:] x I( < r . Thus, y( k ) # Y and this contradic-
roots of A that are on the unit circle are at X = 1. The tion completes the proof. 0
Lyapunov stability of A implies that these roots are all Consider the application of the theorem to Example 3.10.
simple. Thus, by doing a block diagonalization of A , where The matrices C and A already have the form (4.1) with
each block corresponds to a distinct eigenvalue of A [16], it d = 1, so no transformation of coordinates is necessary. It is
is possible to find a choice of coordinates which puts A , and easily confirmed that while Om(A , C , Y ) is not finitely
consequently C , into the form determined, Om(A , C , Y x Y ) is. Unfortunately, the trick
of replacing A , C , Y by A , C ,Y x Y does not work gener-
ally. See Example 4.1 in Table I. Again, this system satisfies
(4.1) with d = 1. But now neither Om(A , C , Y ) nor
Here, the partitioning of C and A is dimensionally consis- Om(A , C , Y x Y ) is finitely determined.
tent, d is the number of characteristic roots at X = 1 , and To see why, examine Fig. 2 , which shows x2-x3 sections
g ( n - d ) x ( n - d ) is asymptotically stable. The representa- of Om(A , C, Y ) taken for several values of x' E [0, 11.
tion (4.1) simplifies our developments. Because Om C { x : I x' 1 Il } = X , and is symmetric,
Theorem 4.2: Suppose A , C has the form (4.1) and Y is there is no need to consider values of x' outside [0, I]. Each
closed. Define of the sections is a polygon. At x' = 1, the section of
Om(A,C , Y ) is a polygon with only 6 vertices; but as X'
approaches 1 from below, the number of vertices in the
polygon near the origin increases. Note the differences in
X, = { x :[C, O]XEY} C g n . (4.2) scale for Fig. 2(a) and 2(b). A detailed analysis shows that if
x' is sufficiently close to 1, the number of vertices may be
Then: i) 0, C X , , ii) Om(A,C , Y ) = Om(A,C, Y x Y ) arbitrarily large. Thus, Example 4.1 is vastly more com-
Proof: Result i) implies ii) because if it holds, the plicated than Example 3.10. The characterization of
additional condition [ CL 01x E Y , which is contained in the Om(A , C, Y ) requires the intersection of an infinite number
augmented triple A , C , Y x Y , is satisfied automatically. of half spaces and there is no way that Om(A , C , Y ) can be
Suppose, contrary to i), there exists an x E Om(A , C , Y ) such finitely determined.
1014 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 36,NO. 9, SEPTEMBER 1991
TABLE I1
EXAMPLE
4.1: t* VERSUS
E 1.0 0.5 0.25 0.10 0.05 I x 1x 1x 10-~ 1x I x IO-^
t* 2 3 3 4 5 I 10 13 16 19
worth noting that some of our prior results extend, with little approximation of 0: exists, it does not provide a practical
modification, to continuous-time systems. scheme for obtaining T ( E ) Verifying
. the left inclusion of
Remark 6.1: Theorem 2.1 applies to O:( A , C , Y) with (6.3) by a series of guesses for T ( E is
) a workable idea, but it
no change except for the obvious modification in the defini- presents serious computational difficulties. In particular, the
tion of Lyapunov stability (Re Xi( A ) 5 0, i = 1 , . . ., n and testing of the left inclusion of (6.3) for each T requires the
Re Xi( A ) = 0 implies Xi( A ) is simple). The line of reason- solution of the following optimization problems: maximize
ing is exactly the same as before except for part ii) where the F;.(x)subject to X E Om(eAT, C , Y(E)) where
following steps are used: there exists a T > 0 such that
C , e A T is observable [16], by Theorem 2.1 Om(eAT, C , Y) c ( x ) = sup { f i ( C e A ' x ) t~: g+), (6.5)
is bounded, (6.1) implies O:(A, C , Y) is bounded. Simi- The inclusion holds if and only if for i = 1, * * * , s the
larly, Theorem 2.3 and its proof go through with minor resulting maxima satisfy F;* I0. While the constraint X E
changes. Specifically, 0: replaced Om in (2.5) and gi is Om(eAT, C , Y(E)) can be represented easily by using an
defined by obvious modification of (2.5)-(2.7), the evaluation of x) e.(
g i ( x ) = sup { f i ( C e A ' x ) :t~ &"}. (6.2) is computationally troublesome because it involves a supre-
mum over 9?+.Certain simplifications occur when Y satis-
Finally, (2.10) extends to 0:. Therefore, (2.11)-(2.14), and
fies the Minkowski assumptions (see the next section). At the
Remark 2.5 are also valid for 0:.
very least, the proof of the theorem supports an obvious
Theorem 6.1: Assume C , A is observable, A is Lya-
intuitive criterion for the choice of T . In particular, (6.4) is
punov stable and assumptions i)-iii) of Theorem 5.1 hold.
satisfiedif ITAi(A)Ia l f o r i = l ; * . , n .
Then for each E E (0, eo] there exists a T ( E )> 0 such that
VII. THEMINKOWSKI
ASSUMPTIONS
Om(eAT(E ),
c,Y(E)) c o:( A , C , Y ) c o m ( e A T ( 'C) , , Y ) .
Many of the prior results are strengthened when Y satisfies
(6.3)
the following Minkowski assumptions: a) 0 E int Y, b) Y is
Proof: The right inclusion follows from (6.1). We be-
gin the proof of the left inclusion by establishing several compact, c) Y is convex. Under these assumptions there is a
technical facts. By the Lyapunov stability, there exists a corresponding Minkowski distance function, defined by
y > 0 such that 11 CeA'((Iy, t E &?. By the observability p y ( y ) = i n f { a : CY > 0, ~ E C Y Y } . (7.1)
of C , A there exists a To > 0 such that for all T E
(0, To], C , eAT is observable [16]. Hence, by Theorem 2.1 The key properties of the distance function are summarized
there exists an r > 0 such that Om(eATo, C , Y(E)) c S ( r ) . in the following theorem. (See, e.g., [19].)
Let '
= j - To where j E .Y+ and j > 0. It is then clear
Theorem 7.1: Suppose YE i?
assumptions. Then i) p , : !2
satisfies the Minkowski
is defined and p y ( 0 ) =
that for all such j , Om(eAq,C , Y(E)) c Om(eATo,
+
C , Y(E))
0; ii) p y is convex; iii) for all y E 9 and CY E .@, p y ( c r y )
C S ( r ) . Let 6, > 0. By the continuity of the f i , Y is
+
compact and therefore Y S(6,) is compact. Because of the = W A Y ) ; iv) for all Y1, Y2 E gp9 P Y ( Y I + Y 2 ) 5 P Y ( Y A )
+ p y ( y 2 ) ;v) there exist r l , r2 > 0 such that for all y E &? ,
compactness the f , are uniformly continuous on Y S(6,) +
and, in fact, there is a single modulus of continuity & ( E ) r,Ilyll Ip Y ( y )Ir2lIy/l; vi) Y = { y : p Y ( y )5 11; vii)
) - E , and applying our above Most practical physical constraints lead to constraint sets Y
1,Jx. Noting that f i ( C e A k T x5
series of facts, leads to the desired result. Choose T = T, = which satisfy the Minkowski assumptions and whose distance
T ( E )so that functions are readily determined. Key consequences of the
Minkowski assumptions are contained in the following theo-
ylleAu--I,l(rlmax{Go,6(E)} V U E [0, T I . (6.4) rem.
The analyticity of e A u shows that (6.3) is satisfied if j is Theorem 7.2: Assume C , A is observable, A is Lya-
sufficiently large. 0 punov stable, and Y satisfies the Minkowski assumptions.
Remark 6.2: Of course, Om(eAT('), C , Y(E)) is an output Then Om(A , C , Y) and O:( A , C , Y) satisfy the Minkowski
admissible set for the continuous-time system and is eAT(')- assumptions and their distance functions are given by
invariant. In general, it is not A-invariant or positively po,(x) = Sup(pr(CA'x): t € J + } , (7.2)
invariant for (1.6).
While the theorem shows that a sensible discrete-time poL(x) = s u p { p y ( C e A ' x ) : t~ g+}, (7.3)
~
1016 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 36, NO. 9, SEPTEMBER 1991
Proof: It is an immediate consequence of Theorem 2.1 problems: maximize py(CeA'u,)over t E 9+. This can be
and Remark 6.1 that Om(A , C , Y ) and O:( A , C , Y ) satisfy done efficiently by mapping @ onto [0, 11 and applying
the Minkowski assumptions. By the definitions of the one of the many available algorithms for maximizing a
Minkowski distance function and Omand parts vi) and iii) of function on an interval. Let the maximum values be denoted
Theorem 7.1, po_(x)= inf{a > 0: a - ' x ~ O , }= inf{a by p:. Then, p* = max {p:, i = 1; e , N } . In general, the
> 0: CA'a-Ix E Y , V t E J+) = inf { a > 0: py(CAt number of vertices N grows very rapidly with n. Thus, the
a - l x ) I1, v t E Y + } = inf{a > 0: pT(CA'x)Ia, vtE approach is by no means a panacea.
Y'}. From this, (7.2) is evident. The proof of (7.3) is Remark 7.5: It is easy to see from the A-invariance of
essentially the same. U Om = { x: po,Cx) 5 l} and the fact that porn is a Minkowski
Remark 7.1: Result (7.2) may be interpreted as a special distance that the sequence pow( x( t ) ) , t E J+,is nonincreas-
case of Theorem 2.3 where s = 1, f, = p y - 1, and g , = ing on any solution of x ( t + 1) = A x ( t ) . This and property
Pom - 1. v) of Theorem 7.1 make porn useful as a Lyapunov function.
Remark 7.2: The characterization of Y by f , ( y ) = For example, if A is asymptotically stable, Om is a domain
p L y ( y )- 1 I0, may be used to define Y ( E in
) Theorem 5.1 of attraction such that the output constraint is not violated.
and 6.1. Then, by part vii) of Theorem 7.1
=
0.2642
[
diag[ -0.3223
0.3223
0.26421' [ -0.4709
'.l4l7
1
0*4709], 0.2167,0.9993,1,1 ,
0.1417
BT= [ -0.0792
-2.6526 - 1.0503
-2.3646
1S410
-6.5588
-4.3306
0.5859
0.5860
-0.2589
-2.3701
0.9062
0.0318
-0.0319 -0.5221
0.4376 1
'
-0.1946 -0.1451 - 1.3393 0.2037 0
1.2752 -0.0352 -0.2546 -0.3914 -0.0002
Om(A,, C, Y )by an approximation Om(A,, Cc, Y ( E )x Y ) The characteristic roots of A , are evident from its real block
as described in Section V. Set diagonal form.
The compensator is Lyapunov stable with two characteris-
K(k)=max{KE[o~ll~Acxc(k)+BcKe(k)EOm}. tic roots of A , on the unit circle at h = 1. Thus, an
( 8 . 2 ) approximation of Om(A,, C, Y ) was considered. The sys-
tem equations already have the form (4. l), so Algorithm 3.2
Then it is obvious that x,( k ) E 0, implies xc(k + 1) E Om. was applied to the determination of Om = Om(A,, C, Y ( E )
Thus, if x,(O) E Om,which is a reasonable assumption for a x Y ) . The set Y was defined by (2.3) with f i ( y ) =
compensator, 11 u ( k )11 I1, k E 9+. Moreover, it is certain 0.04 1 yi I - 1 and s = 2. Note that the Minkowski assump-
that the maximization problem has a solution because Om is tions are satisfied and p y ( y ) = 0.04)lyllm. The required
closed and A , x , ( k ) + B , ~ e ( k )E Om for K = 0. It should optimization problems were solved by linear programming
be noted that the on-line implementation of (8.2) is straight- for several values of E,. Table I11 summarizes the results. It
forward. Since 0, is defined by a system of linear inequali-
appears that Om(A,, C, Y x Y ) is not finitely determined.
ties, the upper limit imposed on K by each inequality is given However, E = 0.004 gives a very good approximation; by
by a simple formula. The least of these limits is ~ ( k )The . Remark 7.4, it follows that the error of the approximation is
real-time implementation of the overall control strategy re- at most 0.4 % . The computational time required by Algorithm
quires that the computation of ~ ( kand ) x,(k + 1) in (8.1) 3.2 is not large. For E = 0.004 and an Apollo DN 3500, it is
be done in less than T seconds.
6.1 s; for 6 = 0.00004, it is 11.7 s.
The intuitive basis for the operation of the error governor
The actual characterization of O,(A,, C,, Y ( E )x Y ) is
is clear. If x,(k) E int Om and e ( k ) is sufficiently small,
quite simple. For E = 0.004, it is given by the representation
~ ( k =) 1. Consequently, when both r ( t ) and e ( k ) are rea-
sonably small, the closed-loop system satisfies the linear (2.7)
equations of motion. When e ( k ) is large, or when A , x , ( k ) Om = { x , : 1 xzl I24.9, I xtl I 24.9,
is near the boundary of Om and B,Ke( k ) points toward the
boundary, ~ ( k<) 1 and the compensator action is reduced ) C , , A { x , ) 125, )C,,AEx,I 5 2 5 ,
to avoid saturation. As noted in [6], the basis for the error j = 0 ; * , 7, k = 0;*.,9} (8.3)
governor is entirely intuitive. There is no theory which
actually proves that the error governor has better response where C, and C, are the rows of C,. Suppose that the
characteristics. Because of the asymptotic stability of the C,,Af and C,,A: are precomputed. Then the testing of
plant it is easy to show that the closed-loop system is x E Om requires 18 inner product evaluations and 20 compar-
GILBERT AND TAN: LINEAR SYSTEMS WITH STATE AND CONTROL CONSTRAINTS 1019
1018 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 36, NO. 9, SEPTEMBER 1991
TABLE III
AIRCRAFT SYSTEM:t? AND t,* VERSUSE
CONTROL
E 4 x IO-^ 4 x 4~ 1 0 - ~ 4~ 1 0 - ~ 4~ IO-^ 4~ io-' 4~ i o - 8 4 x 10-~
t: 5 7 9 11 13 14 16 17
t2* 8 9 11 12 14 15 17 19
The closed-loop system was simulated with the step input ... . .... .. .. .. .. . .
,
.._.._....._.."
r , ( t ) = r z ( t ) = 10. Fig. 4 shows the output response c,(t)
and c z ( t ) for three different situations: i) K = 1 and no
saturation, ii) K = 1 and saturation, iii) ~ ( kdetermined
) by
the error governor ( E = 0.004). As expected in i), the result-
ing linear sampled-data feedback system performs well and
has essentially the same response as the corresponding con-
tinuous-time system in [6]. The response ii) shows the effects
0 1 2 3 4 5
,
of adding saturation to the same system. Both overshoot and Time (Secs)
settling time are badly degraded. With the error governor in (a)
place saturation does not occur and the response iii) is much
improved. Fig. 5 shows ~ ( k )For . k 2 15(t 2 0.75), ~ ( k )
= 1 and the system behaves as a linear system.
15, i' Error Governor
L'!!.ar-Syrllrn.
Sat.Conlrol!+.r..
....._..'_..
in [6]. However, there is a substantial saving in the computa-
tional time required for the controller. On a Macintosh 512K
the simulation of our implementation takes 13.1 s; the corre-
sponding time reported in [8] is approximately 29,000 s. The
reason for the large time is not clear from the discussion in
[8], but it is certainly related to the complexity of testing
whether or not a point x belongs to OL(A,, C,, Y). On an
0 1 2 Time (Secs) 3 4 5
Apollo DN 3500 the simulation takes 1.24 s, which is about
four times faster than real time. Hence, for systems of (b).
significant complexity, it appears that the implementation of Fig. 4. Output responses for aircraft control system with step input rl = r2
practical on-line controllers is feasible. = 10.
It is perhaps worthwhile to compare briefly the error
governor approach to some others which have been proposed
for treating the control of linear plants with state and control
constraints. The papers [3], [9] - [ 131 are concerned with state
regulator problems similar to the one described by ( l . l ) ,
(1.2) where Y is a polyhedron. While they do give attention
to desirable choices for the state feedback matrix K, the
positively invariant set of allowed initial conditions must be
specified a priori and it is, in general, not maximal. Other
researchers formulate the state regulator problem, with its
state and control constraints, as an optimal control problem.
See, e.g., [23]-[25] and the many papers cited in these
references. The optimal control generated by such an ap-
proach allows the capture of the largest possible set of initial Fig. 5. Gain produced by error governor for aircraft control system with
conditions. A serious disadvantage is the difficulty in imple- step input r , = r, = 10.
menting the on-line computation of the resulting nonlinear a specified linear design which can be carried out using
feedback control. It appears that feasible implementations are dynamic compensation and the most advanced design meth-
only possible for systems of very low order ( n I3). The ods, the nonlinear control strategy is implemented by a
error governor has a number of important advantages: it can structurally simple modification of the linear control system,
be applied to systems with time-varying inputs, it begins with it appears feasible to implement the nonlinear control for
GILBERT AND TAN: LINEAR SYSTEMS WITH STATE AND CONTROL CONSTRAINTS 1019
Elmer G. Gilbert (S’51-A’52-M’57-SM’78- Hopkins University Society of Scholars. He has published numerous papers
F’79) received the Ph.D. degree in instrumentation and holds eight patents.
engineering from the University of Michigan, Ann
Arbor. Kok Tin Tan (S’88) was born in Singapore, on
Since 1957, he has been a faculty member in the October 3, 1958. He received the B.Eng(Honors)
Department of Aerospace Engineering at the Uni- and M.Eng in mechanical engineering from the
versity of Michigan, where he is currently a Pro- National University of Singapore, in 1983 and
fessor. He is also a member of the Department of 1986 and received the M.Sc in aerospace engineer-
Electrical Engineering and Computer Science and a ing from University of Michigan, Ann Arbor, in
participant in the Center for Research on Integrated 1989.
Manufacturing at the University. His current re- He is currently a candidate for the Ph.D degree
search interests include optimization, nonlinear control, and robotics. in the Aerospace Engineering Department, the
Dr. Gilbert’s special awards include the O.H. Schuck Award for best University of Michigan. His research interests in-
paper at the 1978 Joint Automatic Control Conference and a Control clude nonlinear control, optimal control, and flight
Systems Society best paper award in 1979. He is a member of the Johns control.