Professional Documents
Culture Documents
net/publication/242014364
CITATIONS READS
984 3,841
2 authors, including:
Tingshu Hu
University of Massachusetts Lowell
147 PUBLICATIONS 5,907 CITATIONS
SEE PROFILE
All content following this page was uploaded by Tingshu Hu on 09 May 2016.
Control Systems
with Actuator Saturation:
Analysis and Design
ISBN: 0-8176-4219-6
Charlottesville, Virginia
November, 2000
To Jianping and Sylvia
T.H.
Preface xi
1 Introduction 1
1.1 Linear Systems with Actuator Saturation . . . . . . . . . . 1
1.2 Notation, Acronyms, and Terminology . . . . . . . . . . . . 3
vii
viii CONTENTS
4.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
4.2 Domain of Attraction – Planar System under
Saturated Linear Feedback . . . . . . . . . . . . . . . . . . 57
4.3 Semi-Global Stabilization – Planar Systems . . . . . . . . . 67
4.4 Semi-Global Stabilization – Higher Order Systems . . . . . . 74
4.5 Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
Bibliography 371
Index 386
xii CONTENTS
Preface
xiii
xiv Preface
Introduction
Every physical actuator is subject to saturation. For this reason, the orig-
inal formulations of many fundamental control problems, including con-
trollability and time optimal control, all reflect the constraints imposed by
actuator saturation. Control problems that involve hard nonlinearities such
as actuator saturation, however, turned out to be difficult to deal with. As
a result, even though there have been continual efforts in addressing actua-
tor saturation (see [4] for a chronological bibliography on this subject), its
effect has been ignored in most of the modern control literature.
On the other hand, it has been well known that, when the actuator
saturates, the performance of the closed-loop system designed without con-
sidering actuator saturation may seriously deteriorate. In the extreme case,
even the stability may be lost. A well-known example of performance degra-
dation (e.g., large overshoot and large settling time) occurs when a linear
compensator with integrators, say a PID compensator, is used in a closed-
loop system. During the time when the actuator saturates, the error is
continuously integrated even though the control is not what it should be,
and hence, the states of the compensator attain values that lead to larger
controls than the actuator limit. This phenomenon is called integrator
windup [23]. In the absence of integrators, a given reference setpoint might
result in a different steady state response, causing the need to reset the
reference to a value different from the desired setpoint. With integral con-
trol, the controllers automatically bring the output to the given reference
1
2 Chapter 1. Introduction
setpoint and hence the integrator does the reset. For this reason, integrator
windup is sometimes referred to as reset windup.
A practical approach to compensating this performance degradation due
to actuator saturation is to add some problem-specific anti-windup schemes
to deal with the adverse effects caused by saturation. These schemes are
typically introduced using ad hoc modifications and extensive simulations.
The basic idea behind these schemes is to introduce additional feedbacks in
such a way that the actuator stays properly within its limit. For example,
several schemes have been proposed to solve the reset windup problem when
integrators are present in the forward loop. Most of these schemes lead to
improved performance but poorly understood stability properties. More
recently, some researchers have attempted to provide more systematic and
more general schemes to deal with the problem (see, e.g., [11,12,30,53,60]).
In this book, we will take the approach of considering actuator satura-
tion at the outset of control design. As seen in the recent literature, there
has been a resurge of interest in this approach, possibly owing to its system-
atic nature. A fundamental issue is the identification of the class of linear
systems that are (globally) asymptotically null controllable by bounded
controls. A system is said to be globally asymptotically null controllable
by bounded controls if, for a given bound on the controls, every state in its
state space can be driven to the origin either in a finite time or asymptot-
ically by a bounded control. In particular, it was established in [76,89,90]
that a linear stabilizable system having all its poles in the closed left-half
plane is globally asymptotically null controllable. For this reason, a linear
stabilizable system with all its poles in the closed left-half plane is com-
monly said to be asymptotically null controllable with bounded controls,
or ANCBC, and most of the recent work has been focused on ANCBC sys-
tems. For such systems, various types of feedback laws have been proposed
that work globally (on the entire state space) or semi-globally (on any a
priori given arbitrarily large bounded set in the state space). We refer the
reader to [65] and the references therein for some accounts of these results.
It is clear that a linear system having poles in the open right-half plane
is not globally asymptotically null controllable with bounded controls. Any
feedback laws designed for such a system would not work globally. Two
natural questions to ask are:
set of all states that can be driven to the origin in a finite time or
asymptotically by a bounded control?
This book is intended to answer these two questions, for both continuous-
time and discrete-time systems. We will start with explicit analytical de-
scriptions of the asymptotically null controllable region. Once we have
identified this region, we will design feedback laws that achieve various
closed-loop performance specifications on the entire asymptotically null
controllable region or a large portion of it. These performances range from
the basic control problem of stabilization to those beyond large domain of
attraction such as transient properties, disturbance rejection and output
regulation. A full list of the topics to be addressed in this book can be
found in the preface to this book, or in the table of contents.
|u(k)|∞ ≤ 1, ∀ k ∈ N.
In the case that ρ = 1, we will simply use E(P ) to denote E(P, 1).
Also, for a continuous function V : Rn → R+ , a level set LV (c) is
defined as
LV (c) := x ∈ Rn : V (x) ≤ c .
ẋ = Ax + Bu,
is said to be
In each of these situations, we will also say that the matrix A is sta-
ble (or Hurwitz), neutrally stable, semi-stable, exponentially unstable
and anti-stable, respectively.
is said to be
In each of these situations, we will also say that the matrix A is stable
(or Schur), neutrally stable, semi-stable, exponentially unstable and
anti-stable, respectively.
8 Chapter 1. Introduction
where
1 , X 2 ) = sup
d(X inf |x1 − x2 |.
x 1 ∈X 1 x 2 ∈X 2
λx1 + (1 − λ)x2 ∈ X ,
for all 0 ≤ λ ≤ 1.
λx1 + (1 − λ)x2 ∈
/ ∂X
for all λ ∈ (0, 1). For example, an ellipsoid is strictly convex while a
cube is not.
1.2. Notation, Acronyms, and Terminology 9
• The convex hull of a set X is the minimal convex set that contains X .
For a group of points x1 , x2 , · · · , xI ∈ Rn , the convex hull of these
points is
I
I
co x1 , x2 , · · · , xI = αi xi : αi = 1, αi ≥ 0 .
i=1 i=1
If F is the feedback matrix, then L(F ) is the region where the feedback
control u = sat(F x) is linear in x. We call L(F ) the linear region of
the saturated feedback sat(F x), or simply, the linear region of the
saturation function.
10 Chapter 1. Introduction
Chapter 2
Null Controllability –
Continuous-Time
Systems
2.1. Introduction
11
12 Chapter 2. Null Controllability – Continuous-Time Systems
In the earlier literature, the null controllable region, also called the
controllable set, was closely related to the time optimal control (see, e.g.,
[18,52,63,76]). For a given initial state x0 , the time optimal control problem
has a solution if and only if x0 ∈ C. If x0 is on the boundary of C(T ), then
the minimal time to steer x0 to the origin is T . The corresponding time
optimal control is a bang-bang control. In recent literature on control of
linear systems with saturating actuators, the characterization of the null
controllable region forms the guideline for searching feedback laws (see, e.g.,
[65,95]).
Because of the fundamental role the null controllable region plays in con-
trol theory for linear systems with bounded controls, there have been contin-
ual efforts towards its characterization (see, e.g., [3,4,18,21,52,54,62,89,90]
and the references therein). In this chapter, we will present simple and ex-
plicit descriptions of the null controllable region. Our presentation is based
on both the earlier understanding of the null controllable region and our
recent work on the topic [41].
Section 2.2 contains some preliminaries from which a complete descrip-
tion of the null controllable region can be made. Section 2.3 presents a
general description of the null controllable region. Although explicit and
complete, this general description can be drastically simplified by utiliz-
ing the eigenstructure of the given system. Sections 2.4 and 2.5 show how
such simplification is made. In particular, Section 2.4 deals with the case
that all eigenvalues are real, and Section 2.5 deals with the case of com-
plex eigenvalues. Section 2.6 explains how C(T ) can be described. Section
2.7 addresses the situation where the linear system contains some uncon-
trollable, but stable, poles. In this case, the notion of asymptotically null
controllability is introduced. Finally, a brief concluding remark is made in
Section 2.8.
Definition 2.2.1.
With the above definition, we see that x0 ∈ C(T ) if and only if there
exists an admissible control u such that
T
AT
0 = e x0 + eA(T −τ ) Bu(τ )dτ
0
T
= eAT x0 + e−Aτ Bu(τ )dτ .
0
It follows that
T
−Aτ
C(T ) = x=− e Bu(τ )dτ : u ∈ Ua . (2.2.3)
0
The minus sign “−” before the integration can be removed since Ua is
symmetric. Also, we have
C= C(T ). (2.2.4)
T ∈[0,∞)
a) If A is semi-stable, then C = Rn ;
14 Chapter 2. Null Controllability – Continuous-Time Systems
c) If
A1 0
A= ,
0 A2
with A1 ∈ Rn1 ×n1 anti-stable and A2 ∈ Rn2 ×n2 semi-stable, and B
is partitioned accordingly as
B1
B= ,
B2
then,
C = C 1 × Rn2 ,
where C 1 is the null controllable region of the anti-stable system
ẋ1 = A1 x1 + B1 u.
where C denotes the closure of C. We will also use “∂” to denote the
boundary of a set. In this chapter, we will derive a method for explicitly
describing ∂C.
Let
B = b1 b2 · · · bm ,
and, for each i = 1 to m, let the null controllable region of the system
ẋ = Ax + bi ui
In view of (2.2.6) and Proposition 2.2.1, in the study of null controllable re-
gions we will assume, without loss of generality, that (A, B) is controllable,
A is anti-stable, and m = 1. For clarity, we rename B as b.
2.3. General Description of Null Controllable Region 15
It is easy to see that x(t) solves (2.2.7) with x(0) = x0 , x(t1 ) = x1 and
certain u if and only if z(t) = x(t1 − t) solves (2.2.8) with z(0) = x1 ,
z(t1 ) = x0 and v(t) = u(t1 − t). The two systems have the same curves as
trajectories, but traverse in opposite directions.
Consider the time-reversed system of (2.2.1),
Definition 2.2.2.
It is known that C(T ) and C of (2.2.1) are the same as R(T ) and R
of (2.2.9) (see, e.g., [76]). To avoid confusion, we will continue to use the
notation x, u, C(T ) and C for the original system (2.2.1), and z, v, R(T )
and R for the time-reversed system (2.2.9).
In this section, we will show that the boundary of the null controllable
region of a general anti-stable linear system with saturating actuators is
composed of a set of extremal trajectories of its time-reversed system. The
description of this set will be further simplified for systems with only real
16 Chapter 2. Null Controllability – Continuous-Time Systems
poles and for systems with complex poles in Sections 2.4 and 2.5, respec-
tively.
We will characterize the null controllable region C of the system (2.2.1)
through studying the reachable region R of its time-reversed system (2.2.9).
Since A is anti-stable, we have
∞
R= z= e−Aτ bv(τ )dτ : v ∈ Ua
0
0
= z= eAτ bv(τ )dτ : v ∈ Ua .
−∞
Theorem 2.3.1.
0
∂R = z = eAτ b sign cT eAτ b dτ : c
= 0 . (2.3.1)
−∞
Since c
= 0 and (A, b) is controllable, cT eAτ b
≡ 0. Since cT eAt b has a finite
number of zeros in any finite interval,
µ t : cT eAt b = 0 = 0, (2.3.4)
maximizes the right hand side of (2.3.3). We maintain that v ∗ is the unique
optimal solution of (2.3.3). To verify this, we need to show that for any
v ∈ Ua , v
= v ∗ ,
0 0
cT eAτ bv ∗ (τ )dτ > cT eAτ bv(τ )dτ. (2.3.5)
−∞ −∞
Since v
= v ∗ , there are a set E1 ⊂ [−∞, 0] with nonzero measure, i.e.,
µ(E1 ) = δ1 > 0, and a number ε1 > 0 such that
|v(t) − v ∗ (t)| ≥ ε1 , ∀ t ∈ E1 .
This shows that v ∗ (t) is the unique optimal solution of (2.3.3) and hence
the unique admissible control satisfying
0
∗
z = eAτ bv ∗ (τ )dτ. (2.3.6)
−∞
cT z ∗ = max cT z.
z∈R
18 Chapter 2. Null Controllability – Continuous-Time Systems
for some c
= 0 (such c, |c| = 1, may be non-unique). So, if v is an admissible
control and there is no c such that v(t) = sign(cT eAt b) for all t ≤ 0, then
0
eAτ bv(τ )dτ ∈
/ ∂R
−∞
for all t ∈ R, i.e., Φ(t, v) lies entirely on ∂R. An admissible control v such
that Φ(t, v) lies entirely on ∂R is said to be an extremal control and such
Φ(t, v) an extremal trajectory. On the other hand, given an admissible
control v(t), if there exists no c such that v(t) = sign(cT eAt b) for all t ≤ 0,
then by Theorem 2.3.1, Φ(0, v) ∈ / ∂R must be in the interior of R. By
the time invariance property of the system, if there exists no c such that
v(t) = sign(cT eAt b) for all t ≤ t0 , Φ(t, v) must be in the interior of R for
all t ≥ t0 . Consequently, E c is the set of extremal controls. The following
lemma shows that ∂R is covered by the set of extremal trajectories.
Lemma 2.3.1.
∂R = Φ(t, v) : t ∈ R, v ∈ E c . (2.3.11)
i.e.,
∂R = Φ(t, v) : v ∈ E c ,
for any fixed t ∈ R. Hence ∂R can be viewed as the set of extremal
trajectories at any frozen time. Now let t vary, then each point on ∂R
moves along a trajectory but the whole set is invariant. So we can also
write
∂R = Φ(t, v) : v ∈ E c , t ∈ R ,
Noting that a shift in time of the control corresponds to the same shift
of the state trajectory, we see that, if v1 ∼ v2 , then
Φ(t, v1 ) : t ∈ R = Φ(t, v2 ) : t ∈ R ;
and if c1 ∼ c2 , then
Definition 2.3.2.
Theorem 2.3.2. If E m
c is a minimal representative of E c , then
∂R = Φ(t, v) : t ∈ R, v ∈ E m c .
Theorem 2.4.1. For the system (2.2.9), assume that A has only real eigen-
values, then,
i=1
N
Then g(t) ∈ GN has at most i=1 ki − 1 zeros.
N −1
has at least i=1 ki zeros, which is a contradiction. 2
Proof of Theorem 2.4.1. The proof of a) was sketched in [76]. To show
b), assume that A has N distinct real eigenvalues λi , i = 1, 2, · · · , N , each
N
with a multiplicity of ki ( i=1 ki = n). It is well-known that
N
cT eAt b = eλi t fi (t),
i=1
for some fi ∈ P ki . If c
= 0, then cT eAt b
≡ 0 by the controllability of
(A, b). (Thus a) follows from Lemma 2.4.1). To complete the proof of
b), we first show that any bang-bang control v with n − 1 switches is an
extremal control.
Let t1 , t2 , · · · , tn−1 ∈ R be the distinct switching times of v. From the
following n − 1 linear equations
cT eAti b = 0, i = 1, 2, · · · , n − 1,
g̃(ti ) = 0, ˙ i )
= 0,
g̃(t i = 2, 3, · · · , n − 1.
2.4. Systems with Only Real Eigenvalues 23
N −1
g̃(t) − = e(λi −λN )t fi (t) + fN (t) −
i=1
A= , b= ,
0 A1 b1
where A1 is of size n − j. It is easy to see that A1 is also of the Jordan
canonical form and (A1 , b1 ) is controllable. Furthermore,
eAt = .
0 e A1 t
Accordingly, consider c of the form,
0
c= ,
c1
then
cT eAt b = cT1 eA1 t b1 .
By the forgoing proof for the full dimensional case, we see that there exists
c1 such that v(t) = sign(cT1 eA1 t b1 ) is a bang-bang control with switching
times exactly at ti , i = 1, 2, · · · , n − 1 − j.
Therefore, we conclude that any bang-bang control with less than n − 1
switches is also extremal. 2
By Theorem 2.4.1, the set of extremal controls can be described as
follows,
1, −∞ ≤ t < t1 ,
E c = ± v : v(t) = (−1) ,
i
ti ≤ t < ti+1 , t1 < t2 ≤ · · · ≤ tn−1
(−1)n−1 , tn−1 ≤ t < ∞,
v(t) ≡ ±1 ,
24 Chapter 2. Null Controllability – Continuous-Time Systems
For each v ∈ E mc , we have v(t) = 1 (or −1) for all t < 0. Hence, for
t ≤ 0, t
Φ(t, v) = − e−A(t−τ ) bdτ = −A−1 b (or A−1 b).
−∞
Afterwards, v(t) is a bang-bang control with n − 2 or less switches. Denote
Hence,
∂R = Φ(t, v) : t ∈ R, v ∈ E m
c
n−1
i −A(t−ti )
= ± 2(−1) e + (−1) I A−1 b :
n
i=1
0 = t1 ≤ t2 · · · ≤ tn−1 ≤t≤∞ . (2.4.1)
0.8
v=+1
0.6
0.4
0.2
−
0 z
e
+
ze
−0.2
−0.4
−0.6
v=−1
−0.8
−1
−1 −0.8 −0.6 −0.4 −0.2 0 0.2 0.4 0.6 0.8 1
Plotted in Figs. 2.4.1 and 2.4.2 are respectively the ∂R of a second order
system with
0 −0.5 0
A= , B= ,
1 1.5 −1
and that of a third order system with
0.2 1 0 1
A = 0 0.2 0 , B = 1 .
0 0 0.4 1
Since the trajectories of the original system and those of the time-reversed
system are the same but traverse in opposite directions, we can also say
that ∂R = ∂C is covered by a set of trajectories of the original system.
While all the trajectories of the time-reversed system start at ze+ or ze−
and are very easy to generate by simulation, it is impossible to get the
same trajectories from the original system. For example, when n = 2, one
half of ∂R is formed by the trajectory of the time-reversed system that
starts at ze− under a constant control v = +1. The trajectory goes from ze−
toward ze+ asymptotically but never reaches ze+ at a finite time. It seems
that if we apply u = +1 at ze+ to the original system, the trajectory will
2.5. Systems with Complex Eigenvalues 27
2 −
z
e
1
−1
−2
−3 z+
e
10
5 30
20
0 10
0
−5 −10
−20
−10 −30
go from ze+ to ze− along the same trajectory of the time-reversed system.
However, this is not the case. The trajectory of the original system will
stay at ze+ under the constant control u = +1. The boundary ∂R can only
be partially generated from the original system if we know one point on it
other than ±ze+ . But this point is not easy to get from the original system
by simulation.
and let
c1 b1
= V c,T
= V −1 b,
c2 b2
28 Chapter 2. Null Controllability – Continuous-Time Systems
then
At
cos(βt) − sin(βt) b1
T
c e b= c1 c2 eαt
sin(βt) cos(βt) b2
b1 b2 c1
= cos(βt) sin(βt) eαt .
−b2 b1 c2
b1 b2
Since is nonsingular, it follows that
−b2 b1
b1 b2 c1 sin(θ)
: c
= 0 = r : r
= 0, θ ∈ [0, 2π) .
−b2 b1 c2 cos(θ)
Hence
sign cT eAt b : c
= 0 = sign(sin(βt + θ)) : θ ∈ [0, 2π) ,
v(t) = sign(sin(βt))
3
v=−1
z−e
1 z−
s
0
+
zs
−1 +
ze
−2
−3
v=1
−4
−5
−6 −4 −2 0 2 4 6
Since sign(k + sin(βt + θ)) is the same for all k ≥ 1 (or k ≤ −1), we have
Emc = v(t) = sign(k + sin(βt)), t ∈ R : k ∈ [−1, 1] .
30 Chapter 2. Null Controllability – Continuous-Time Systems
0.5
−0.5
−1
−1.5
1
0.5 1.5
1
0 0.5
0
−0.5 −0.5
−1
−1 −1.5
Each v ∈ E mc is periodic with period 2Tp , but the length of v(t) being 1 and
that of v(t) being −1 vary with k. Φ(t, v) can be easily determined from
simulation or direct computation. Plotted in Fig. 2.5.2 are some extremal
trajectories on ∂R of the time-reversed system (2.2.9) with
0.8 0 0 1
A = 0 0.8 −2 , B = 1 .
0 2 0.8 1
b) α
= α1 . In this case,
E c = v(t) = sign k1 e(α1 −α)t + k2 sin(βt + θ) , t ∈ R :
(k1 , k2 )
= (0, 0), θ ∈ [0, 2π) .
E c = E c1 ∪ E c2 ∪ E c3 ,
where
E c1 = v(t) ≡ ±1 , (k2 = 0),
2.5. Systems with Complex Eigenvalues 31
E c 2 = v(t) = ±sign(sin(βt + θ)) : θ ∈ [0, 2π) , (k1 = 0),
and,
E c3 = v(t) = ±sign ke(α1 −α)t + sin(βt + θ) : k > 0, θ ∈ [0, 2π) .
Let
v(t) = sign ke(α1 −α)t + sin(βt + θ) ∈ E c 3 .
suppose that
v1 (t) = sign e(α1 −α)t + sin(βt + θ1 ) ,
v2 (t) = sign e(α1 −α)t + sin(βt + θ2 ) ,
When α1 < α (or α1 > α), both e(α1 −α)t and e(α1 −α)(t−h) go to zero as t
goes to ∞ (or −∞). For v1 (t) and v2 (t − h) to change signs at the same
time, we must have βt + θ1 = β(t − h) + θ2 + lπ, for some integer l. Since
at any switching time of v1 (t) and v2 (t),
we conclude that
sin(βt + θ1 ) = sin(β(t − h) + θ2 )
and hence
e(α1 −α)t = e(α1 −α)(t−h) .
32 Chapter 2. Null Controllability – Continuous-Time Systems
0.5
−0.5
−1
−1.5
−2
1
0.5 1.5
1
0 0.5
0
−0.5 −0.5
−1
−1 −1.5
When α1 < α, for each v ∈ E m c3 , v(t) = 1 (or −1) for all t ≤ 0, so the
corresponding extremal trajectory stays at ze+ = −A−1 b or ze− before t = 0.
And after some time, it goes toward a periodic trajectory since as t goes to
infinity, v(t) becomes periodic. When α1 > α, for each v ∈ E m c3 , v(t) = 1(or
−1) for all t ≥ 0, and the corresponding extremal trajectory starts from
near periodic and goes toward ze+ or ze− .
Plotted in Fig. 2.5.3 are some extremal trajectories on ∂R of the time-
reversed system (2.2.9) with
0.5 0 0 1
A= 0 0.8 −2 , B = 1 .
0 2 0.8 1
2.6. Some Remarks on the Description of C(T ) 33
The equation (2.6.1) was presented in [47] and the other properties
in Theorem 2.6.1 can be obtained similarly as those in Theorem 2.3.1.
From Theorem 2.6.1, we see that each point on ∂R(T ) can be obtained by
applying a control
v(t) = −sign cT eAt b
to the time-reversed system under zero initial condition. In the case that
A has only real eigenvalues, it was shown in [47] that the set of controls
v(t) = sign cT eAt b , t ∈ [0, T ] : c
= 0
34 Chapter 2. Null Controllability – Continuous-Time Systems
is also the set of bang-bang controls with n − 1 or less switches. That is,
T
∂R(T ) = z= e−A(T −τ ) b v(τ )dτ : v ∈ E c (T ) , (2.6.3)
0
lim x(t) = 0.
t→∞
The set of all states that are asymptotically null controllable, denoted by
C a , is called the asymptotically null controllable region.
Theorem 2.7.1. Consider the system (2.2.1). Assume that (A, B) is sta-
bilizable and is given in the following form,
A1 0 B1
A= , B= ,
0 A2 B2
2.8. Conclusions 35
C a = C 1 × Rn2 ,
ẋ1 = A1 x1 + B1 u.
where A20 ∈ Rn20 ×n20 has all its eigenvalues on the imaginary axis and
A2− ∈ Rn2− ×n2− is Hurwitz. Let the state be partitioned accordingly as
x = [xT1 xT20 xT2− ]T , with x20 ∈ Rn20 , x2− ∈ Rn2− . Then,
%
&
A1 0 B1
,
0 A20 B20
2.8. Conclusions
In this chapter, we have obtained explicit descriptions of the boundary of
the null controllable region of a continuous-time linear system with bounded
controls. In the next chapter, the discrete-time counterparts of the results
of this chapter will be presented. Throughout the book we will be concerned
with feedback laws that are valid on the entire null controllable region or a
large portion of it.
36 Chapter 2. Null Controllability – Continuous-Time Systems
Chapter 3
Null Controllability –
Discrete-Time
Systems
3.1. Introduction
37
38 Chapter 3. Null Controllability – Discrete-Time Systems
Our first concern is the set of states that can be steered to the origin by
admissible controls.
Definition 3.2.1.
2) The set of all states that are null controllable in K steps is called the
null controllable region of the system at step K and is denoted by
C(K).
Definition 3.2.2.
2) The set of all null controllable states is called the null controllable
region of the system and is denoted by C.
a) If A is semi-stable, then C = Rn .
c) If
A1 0
A= ,
0 A2
with A1 ∈ Rn1 ×n1 anti-stable and A2 ∈ Rn2 ×n2 semi-stable, and B
is partitioned accordingly as
B1
B= ,
B2
then
C = C 1 × Rn2 ,
where C 1 is the null controllable region of the anti-stable sub-system
x1 (k + 1) = A1 x1 (k) + B1 u(k).
40 Chapter 3. Null Controllability – Discrete-Time Systems
is C i , then
m
C= Ci = x1 + x2 + · · · + xm : xi ∈ C i , i = 1, 2, · · · , m .
i=1
Hence we can begin our study of null controllable regions with systems
having only one input.
In summary, we will assume in the study of null controllable regions
that (A, B) is controllable, A is anti-stable, and m = 1.
In many situations, it may be more convenient to study the null con-
trollability of a system through the reachability of its time-reversed system.
The time-reversed system of (3.2.1) is
2) The set of all states that are reachable in K steps is called the reach-
able region of the system (3.2.2) at step K and is denoted by R(K).
2) The set of all reachable states is called the reachable region of the
system (3.2.2) and is denoted by R.
It is easy to verify that C(K) and C of (3.2.1) are the same as R(K) and
R of (3.2.2). To avoid confusion, we will reserve the notation x, u, C(K),
and C for the original system (3.2.1), and use z, v, R(K), and R for the
time-reversed system (3.2.2).
To proceed, we need more notation. With K1 , K2 integers, for con-
venience, we will use [K1 , K2 ] to denote the set of integers {K1 , K1 +
1, · · · , K2 }. We will also use [a1 , a2 ] to denote the usual closed interval
of real numbers. The situation will be clear from the context.
and
∞
−(K−)
C=R= − A bv() : |v()| ≤ 1, ∀ ≥ 0 .
=0
It is easy to see that C(K), R(K), C and R are all convex and that C(K)
and R(K) are polytopes. An extremal point of a polytope is usually called
a vertex. In some special cases, R (or C) could also be a polytope of finite
many extremal points. But in general, R has infinitely many extremal
points. In any case, R is the convex hull of Ext(R), the set of all the
extremal points of R. In view of this, it suffices to characterize Ext(R).
Also, it can be shown with standard analysis that C(K), R(K), C and
R depend on A and b continuously in the Housdorff metric, even if (A, b)
is not controllable in the usual linear sense. For technical reasons, we first
consider the reachable region R(K).
42 Chapter 3. Null Controllability – Discrete-Time Systems
cT Ak b
= 0, ∀ k ∈ [0, K − 1],
and
v ∗ (k) = sign(cT Ak b), ∀ k ∈ [0, K − 1].
and
hT z ∗ < hT z, ∀ z ∈ R(K) \ {z ∗ }. (3.3.3)
3.3. General Description of Null Controllable Region 43
hT A−(K−) b
= 0, ∀ ∈ [0, K − 1]
and
v ∗ () = sign hT A−(K−) b , ∈ [0, K − 1].
=0
(3.3.5)
Writing cT A as cT AK A−(K−) and replacing cT AK with cT and K −
with , we have
K
− − −
Ext(R(K)) = − A b sign(c A b) : c A b
= 0, ∀ ∈ [1, K] .
T T
=1
Theorem 3.3.1 can also be proven with a similar procedure as the proof
of Theorem 2.3.1. It should be noted that for the continuous-time case,
every point in ∂R is an extremal point. This is implied by the fact that R
is strictly convex for a continuous time system.
We note that in the above theorem, the infinite summation always exists
since A is anti-stable.
44 Chapter 3. Null Controllability – Discrete-Time Systems
Since
sign cT A− b = sign γcT A− b
for any positive number γ, this formula shows that Ext(R) can be deter-
mined from the surface of a unit ball. It should be noted that each extreme
corresponds to a region in the surface of the unit ball rather than just
one point. This formula provides a straightforward method for computing
the extremal points of the null controllable region and no optimization is
involved. In the following, we will give a more attractive formula for com-
puting the extremal points of the null controllable region by exploring the
eigenstructure of the matrix A.
Lemma 3.4.1. For the system (3.2.2), suppose that A has only positive
real eigenvalues. Then,
Proof. Since A has only positive real eigenvalues, systems (3.2.1) and
(3.2.2) can be considered as the discretized systems resulting from
and
ż(t) = −Ac z(t) − bc v(t) (3.4.3)
with sampling period h, where Ac has only positive real eigenvalues. Thus,
Ac h
A = e Ac h , b = A−1
c e − I bc
and
Ac h
cT Ak b = cT A−1
c e − I eAc hk bc .
It follows from the above lemma that the set of extremal controls on
[0, K] can be described as follows,
1, 0 ≤ k < k1 ,
E c (K) = ± v : v(k) = (−1)i , ki ≤ k < ki+1 ,
(−1)n−1 , kn−1 ≤ k ≤ K − 1,
0 ≤ k1 ≤ · · · ≤ kn−1 ≤ K − 1 .
Notice that we allow ki = ki+1 in the above expression to include all the
bang-bang controls with n − 1 or less switches.
For a square matrix X, it can be easily verified that
k −1
2
X k
(I − X) = X k1 − X k2 .
k=k1
i=1
0 ≤ k1 ≤ · · · kn−1 ≤ K − 1
n−1
= ± A−K + 2 (−1)i A−i + (−1)n I (I − A)−1 b :
i=1
K ≥ 1 ≥ · · · ≥ n−1 ≥ 1 .
Ext(C) = Ext(R)
n−1
i −i
= ± 2 (−1) A + (−1) I (I − A)−1 b :
n
i=1
∞ ≥ 1 ≥ · · · ≥ n−1 ≥ 1 .
Hence, there are exactly 2K extremal points (vertices), versus the upper
bound of 2K extremal points which emerges from a superficial analysis of
R(K). Furthermore, notice that
Ext(R) = ± 2A− − I (I − A)−1 b : 1 ≤ ≤ ∞ . (3.4.4)
which has K(K −1) extremal points, versus the upper bound of 2K extremal
points which emerges from a superficial analysis of R(K). As expected,
Ext(R) = ± 2A−1 − 2A−2 + I (I − A)−1 b : 1 ≤ 2 ≤ 1 ≤ ∞ .
Ext(R(2q))
2q−1 % &
π i
= − A−(2q−l) b sign sin β+ + π : i ∈ [0, 2q−1] , (3.5.1)
2q q
=0
and
r2q
Ext(R) = Ext(R(2q)). (3.5.2)
r2q − 1
In view of this theorem, we can compute the extremal points of R(2q)
using (3.5.1), and then scale them by
r2q
r2q − 1
to obtain the extremal points of R.
The set of extremal points Ext(R) also coincides with the steady state
trajectory of the time-reversed system (3.2.2) to a particular periodic bang-
bang control. Let the control be
% &
∗ π
v (k) = sign sin βk +
2q
and z ∗ (k) be the zero initial state response,
k−1 % &
∗ −(k−) π
z (k) = − A b sign sin β + .
2q
=0
Denote Γ(K) as
Γ(K) := ± z ∗ (K + k) : k ∈ [0, 2q − 1] , (3.5.3)
then the limit limK→∞ Γ(K) exists, and this limit is the union of the steady
state trajectories of (3.2.2) under v ∗ (k) and −v ∗ (k). Let
Γ = lim Γ(K).
K→∞
Ext(C) = Ext(R) = Γ.
50 Chapter 3. Null Controllability – Discrete-Time Systems
If
1
β= π,
N
where N is a positive integer, then v ∗ (k) has a period of 2N and
AN = −rN I.
If we denote
rN + 1 rN + 1 +
x+
s = (I − A)−1
b = x ,
rN − 1 rN − 1 e
and
x− +
s = −xs ,
then we can obtain an explicit expression for the steady state trajectory Γ,
i−1
−i − −(i−−1) −1
Γ = ± A xs + A (−A b)(1) : 1 ≤ i ≤ N . (3.5.4)
=0
v ∗ (k) = sign(sin(βk))
and z ∗ (k) be the zero initial state response to the time-reversed system.
Although the control and the time response are not exactly periodic, there
exist a set of limit points of the state trajectories, which also form the
extremal points of R and C.
3.6. An Example
Consider the continuous-time system
0 1 0
ẋ = Ac x + bc u = x+ u, (3.6.1)
−0.5 1.5 1
ż = −Ac z − bc v. (3.6.2)
3.7. Asymptotically Null Controllable Region 51
where x+ −1
e = −Ac bc is the equilibrium point of (3.6.2) under the constant
control u = 1. The second equality shows that ∂C c is formed by the trajec-
tories of (3.6.2) starting from ±x+
e under the control v = ∓1.
With h the sampling period and
Ac h
A = e Ac h , b = A−1
c e − I bc ,
0.8
h=1
0.6 h=2 h=0.1,0.2
0.4
h=4
0.2
h=8
0
-0.2
-0.4
-0.6
-0.8
-1
-2.5 -2 -1.5 -1 -0.5 0 0.5 1 1.5 2 2.5
lim x(k) = 0.
k→∞
The set of all states that are asymptotically null controllable, denoted by
C a , is called the asymptotically null controllable region.
Theorem 3.7.1. Consider the system (3.2.1). Assume that (A, B) is sta-
bilizable and is given in the following form,
A1 0 B1
A= , B= ,
0 A2 B2
3.8. Conclusions 53
C a = C 1 × Rn2 ,
x1 (k + 1) = A1 x1 (k) + B1 u(k).
3.8. Conclusions
In this chapter, we have presented an explicit description of the extremal
points of the null controllable region of a discrete-time linear system with
bounded controls. The continuous-time counterparts of the results pre-
sented here was given in the previous chapter. Throughout the book we
will be concerned with feedback laws that are valid on the entire null con-
trollable region or a large portion of it.
54 Chapter 3. Null Controllability – Discrete-Time Systems
Chapter 4
Stabilization on Null
Controllable Region –
Continuous-Time Systems
4.1. Introduction
In this chapter, we will study the problem of stablizing a linear system with
saturating actuators. The key issue involved here is the size of the result-
ing domain of attraction of the equilibrium. Indeed, local stabilization, for
which the size of the domain of attraction is not a design specification, is
trivial. It is straightforward to see that any linear feedback that stabilizes
the system in the absence of actuator saturation would also locally sta-
bilize the system in the presence of actuator saturation. In fact, with the
given stabilizing linear feedback law, actuator saturation can be completely
avoided by restricting the initial states to a small neighborhood of the equi-
librium. Our focus in this chapter is on the construction of feedback laws
that would lead to a domain of attraction that contains any a priori given
bounded subset of the asymptotically null controllable region in its interior.
We refer to such a problem as semi-global stabilization on the asymptoti-
cally null controllable region, or simply, semi-global stabilization. We recall
from Chapter 2 that the null controllable region of a linear system subject
to actuator saturation is the set of all the states that can be steered to
the origin in a finite time by an admissible control, and the asymptotically
55
56 Chapter 4. Stabilization – Continuous-Time Systems
null controllable region is the set of all the states that can be driven to the
origin asymptotically by an admissible control.
Recent literature has witnessed a surge of interest in the stabilization of
linear systems subject to actuator saturation. Most of the existing results,
however, pertain to systems that are asymptotically null controllable (AN-
CBC). We recall from Section 2.7 that the asymptotically null controllable
region of an ANCBC system is the entire state space. For these systems,
it was shown in [93] that global stabilization is possible. A nested feed-
back design technique for constructing nonlinear globally asymptotically
stabilizing feedback laws was proposed in [98] for a chain of integrators and
was fully generalized in [95]. Alternative solutions to the global stabiliza-
tion problem consisting of scheduling a parameter in an algebraic Riccati
equation according to the size of the state vector were later proposed in
[94,99]. The question of whether or not a general linear ANCBC system
subject to actuator saturation can be globally asymptotically stabilized by
a linear feedback was answered in [24,96], where it was shown that a chain
of integrators of length greater than two cannot be globally asymptotically
stabilized by any saturated linear feedback. In a search for simple control
strategies, we have earlier constructed linear feedback laws ([67,65]) that
achieve semi-global stabilization of these systems.
The objective of this chapter is to construct feedback laws that would
achieve semi-global stabilization of linear systems that have one or two
exponentially unstable poles and are subject to actuator saturation. Semi-
global stabilization for general systems will be treated in Chapter 9. Our
motivation for considering separately systems with one or two exponen-
tially unstable poles here is twofold. First, for such systems, we are able to
construct simple feedback laws. For a planar system with both poles expo-
nentially unstable or systems having no exponentially unstable poles, linear
feedback is sufficient. For systems having both polynomially unstable poles
and one or two exponentially unstable poles, two linear feedback laws, one
for inside a certain invariant set and the other for the outside, would be
sufficient. Second, in establishing these stabilization results, we will arrive
at some results that are interesting in their own right. For example, we
show that, for a planar anti-stable linear system under any saturated linear
stabilizing feedback law, the boundary of the domain of attraction is formed
by the unique limit cycle of the closed-loop system. Our presentation in
this chapter is based on our recent work [42].
4.2. Domain of Attraction under Saturated Linear Feedback 57
In Section 4.2, we establish the just mentioned fact that, for a planar
anti-stable linear system under any stabilizing saturated linear feedback,
the boundary of the domain of attraction is formed by the unique limit
cycle of the closed-loop system. In Section 4.3, we will show how to appro-
priately choose the linear feedback gain such that this unique limit cycle
will approach the boundary of the null controllable region of the system.
We will thus establish semi-global stabilization of such systems. In Section
4.4 we will establish semi-global stabilization for higher order systems with
one or two exponentially unstable poles. In Section 4.5, we will draw a brief
conclusion to the chapter.
ẋ = Ax + bu, (4.2.1)
Denote the state transition map of (4.2.4) by φ : (t, x0 ) → x(t). The domain
of attraction S of the equilibrium x = 0 of (4.2.4) is defined by
S := x0 ∈ Rn : lim φ(t, x0 ) = 0 .
t→∞
58 Chapter 4. Stabilization – Continuous-Time Systems
(A + bf )T P + P (A + bf ) < 0.
Lemma 4.2.1. The origin is the unique equilibrium point of the system
(4.2.4).
Proof. This is a result from [2]. A simpler proof goes as follows. The other
two candidate equilibrium points are x+ e = −A
−1
b and x− + +
e = −xe . For xe
to be an equilibrium, we must have f x+ +
e ≥ 1 so that u = sat(f xe ) = 1.
Since A is anti-stable and (A, b) is controllable, we can assume, without
loss of generality, that
0 1
A= , a1 , a2 > 0,
−a1 a2
and
0
b= .
1
This implies that if f = f1 f2 is stabilizing, then f1 < a1 . It can be
verified that
f1
f x+
e = −f A
−1
b= < 1.
a1
This rules out x+ −
e . Similarly, we can rule out xe . 2
Let us introduce the time-reversed system of (4.2.4),
Clearly (4.2.6) also has only one equilibrium point, an unstable one, at the
origin. Denote the state transition map of (4.2.6) as ψ : (t, z0 ) → z(t).
Theorem 4.2.1. ∂S is the unique limit cycle of the planar systems (4.2.4)
and (4.2.6). Furthermore, ∂S is the positive limit set of ψ(·, z0 ) for all
z0
= 0.
This theorem says that ∂S is the unique limit cycle of (4.2.4) and (4.2.6).
This limit cycle is a stable one for (4.2.6) (in a global sense) but an unstable
one for (4.2.4). Therefore, it is easy to determine ∂S by simulating the
time-reversed system (4.2.6). Shown in Fig. 4.2.1 is a typical result, where
two trajectories, one starting from outside, the solid curve, and the other
starting from inside, the dashed curve, both converge to the unique limit
cycle. The straight lines in Fig. 4.2.1 are f z = 1 and f z = −1.
-2
-4
-6
-8
-10 -8 -6 -4 -2 0 2 4 6 8 10
For an illustration of Lemma 4.2.2, see Fig. 4.2.2. In Fig. 4.2.2, the
curve from xi to yi is x(t) = eAt xi , t ∈ [0, Ti ], a segment of a trajectory
of the autonomous system ẋ = Ax. Lemma 4.2.2 indicates that if any two
different trajectories leave certain straight line on the same side, they will
be further apart when they return to it.
2.5
1.5
y y x x
2 1 1 2
1
0.5
−0.5
−1
−10 −5 0 5
y1 = eAT1 x1 , y2 = eAT2 x2
4.2. Domain of Attraction under Saturated Linear Feedback 61
For an illustration of Lemma 4.2.3, see Fig. 4.2.3. It says that if two
different trajectories of the autonomous system ẋ = Ax enter the region
between f x = c and f x = −c, they will be further apart when they leave the
region. Notice that in Lemma 4.2.2, A is anti-stable, and in Lemma 4.2.3,
A is asymptotically stable.
1.5
x x
2 1
1
0.5
−0.5
y y
2 1
−1
−1.5
−2
−10 −5 0 5
It should be noted that in Lemmas 6.3.1 and 6.3.2, special forms of A and
f are assumed. Since (f, A) is observable, they can always be transformed
into the special form. Lemmas 6.3.1 and 6.3.2 are applicable to the general
form of (f, A) since linear transformation does not change the ratio between
the lengths of the two aligned vectors, y1 − y2 and x1 − x2 .
Proof of Theorem 4.2.1. We first prove that, for the system (4.2.6), every
trajectory ψ(t, z0 ), z0
= 0, converges to a periodic orbit as t → ∞. Recall
that E(P, ρ0 ) (defined in (4.2.5)) lies within the domain of attraction of the
62 Chapter 4. Stabilization – Continuous-Time Systems
the second term belongs to C, the null controllable region of (4.2.1), for all
t. It follows that there exists a ρ1 > ρ0 such that
Let
Q = z ∈ R 2 : ρ0 ≤ z T P z ≤ ρ1 .
ξ̇2 > 0,
ξ̇2 < 0,
i.e., the trajectories go downwards. This implies that every periodic trajec-
tory crosses f x = 1 exactly twice. Similarly, every periodic trajectory also
crosses f x = −1 exactly twice. It also implies that a periodic trajectory
goes counterclockwise.
Now suppose on the contrary that (4.2.4) has two different periodic tra-
jectories Γ1 and Γ2 , with Γ1 enclosed by Γ2 , as illustrated in Fig. 4.2.4.
Since any periodic trajectory must enclose the origin and any two trajec-
tories cannot intersect, all the periodic trajectories must be ordered by
enclosement. Let x1 and y1 be the two intersections of Γ1 with f x = 1,
and x2 , y2 be the two intersections of Γ2 with f x = 1. Then along Γ1 , the
trajectory goes from x1 to y1 , −x1 , −y1 and returns to x1 ; and along Γ2 ,
the trajectory goes from x2 to y2 , −x2 , −y2 and returns to x2 .
Let
x+
e = −A
−1
b.
we have
y1 − x+
e =e
AT1
(x1 − x+
e ), y2 − x+
e = e
AT2
(x2 − x+
e )
64 Chapter 4. Stabilization – Continuous-Time Systems
2.5
2 Γ2
Γ1
1.5
y y x1 x
2 1 2
1
0.5 +
xe
−0.5
−x −x −y −y
2 1 1 2
−1
−1.5
−2
−2.5
−10 −8 −6 −4 −2 0 2 4 6 8 10
f (x1 − x+ + + + +
e ) = f (x2 − xe ) = f (y1 − xe ) = f (y2 − xe ) = 1 − f xe > 0,
f (x − x+ +
e ) ≥ 1 − f xe .
Remark 4.2.1.
1) In the above proof, we also showed that ∂S is symmetric and has two
intersections with f x = 1 and two with f x = −1;
Since
0 1 cos θ
cos θ sin θ = 0,
−1 0 sin θ
66 Chapter 4. Stabilization – Continuous-Time Systems
equation (4.2.10) has at most four equilibria on [0, 2π), which correspond to
the directions of the real eigenvectors of A. If θ(0) is an equilibrium, then
θ(t) is a constant, and x(t) is a straight line. If θ(0) is not an equilibrium,
then θ(t) will never reach an equilibrium at finite time, otherwise the tra-
jectory x(t) would intersect a straight line trajectory, which is impossible.
Hence, if x0 is not an eigenvector of A, θ̇(t) will never be equal to zero, thus
θ̇(t) will be sign definite. This shows that x(t) is strictly monotonically
increasing (or decreasing).
Let us now consider the direction angle of the trajectory, ẋ =: γ. Since
ẍ(t) = Aẋ(t), by the same argument, γ(t) also increases (or decreases)
monotonically. We claim that if A is asymptotically stable or anti-stable
(det (A) > 0), then θ̇(t) and γ̇(t) have the same sign, i.e., the trajectories
bend toward the origin; and if the signs of the two eigenvalues of A are
different (det (A) < 0), then θ̇(t) and γ̇(t) have opposite signs. This can be
simply shown as follows. Rewrite (4.2.10) as,
1 T 0 1
θ̇ = x Ax. (4.2.11)
|x|2 −1 0
Similarly,
1 T 0 1 1 T T 0 1
γ̇ = ẋ Aẋ = x A AAx.
|ẋ|2 −1 0 |ẋ|2 −1 0
Hence
det (A) T 0 1 |x|2 det (A)
γ̇ = x Ax = θ̇. (4.2.12)
|ẋ|2 −1 0 |ẋ|2
This shows that the claim is true.
Now we can apply the above claim to the limit cycle of the system
(4.2.4) (refer to Fig. 4.2.4). From x1 to y1 , [x(t) − x+
e ] increases, so ẋ(t)
increases. From y1 to −x1 , x(t) and ẋ(t) also increase. Similarly, ẋ(t)
increases from −x1 to −y1 , and from −y1 to x1 . It is straightforward to
verify that ẋ(t) is continuous at x1 , y1 , −x1 and −y1 . So ẋ(t), the direction
angle, is monotonically increasing along the limit cycle. This implies that
the region enclosed by the limit cycle, S, is convex. 2
4.3. Semi-Global Stabilization – Planar Systems 67
AT P + P A − P bbT P = 0. (4.3.1)
Note that this equation is associated with the minimum energy regulation,
i.e., an LQR problem with cost
∞
J= uT (t)u(t)dt.
0
f0 = −bT P.
By the infinite gain margin and 50% gain reduction margin property of
LQR regulators, the origin is a stable equilibrium of the system
ẋ = Ax + b sat(kf0 x) (4.3.2)
for all k > 0.5. Let S(k) be the domain of attraction of the equilibrium x =
0 of (4.3.2). Then, the following result establishes semi-global stabilizability
by linear feedback.
Theorem 4.3.1.
lim dist(S(k), C) = 0.
k→∞
68 Chapter 4. Stabilization – Continuous-Time Systems
and
0
b= .
−1
Since A is anti-stable and (A, b) is controllable, A and b can always be
transformed into this form. Suppose that A has already taken this form
and
b1
b= .
b2
Let
V = −A−1 b −b ,
then, V is nonsingular and it can be verified that
0
V −1 AV = A, V −1 b = .
−1
and
1 −1
ze+ = −A−1
b= , ze− = −ze+ = .
0 0
We also have f0 A−1 b = 0.
For a given k > 0.5, by Theorem 4.2.1, the system (4.3.2) has a unique
limit cycle which is the boundary of S(k). To visualize the proof, ∂C and
∂S(k) for some k are plotted in Fig. 4.3.1, where the inner closed curve is
∂S(k), and the outer one is ∂C.
We recall that when the eigenvalues of A are real (see (2.4.2)),
t
−At − −A(t−τ )
∂C = ± e ze − e bdτ : t ∈ [0, ∞] , (4.3.3)
0
0.6
0.4
0.2
x1 −y1
−
ze
0
y1 −x1
−0.2
−0.4
−0.6
−0.8
−1.5 −1 −0.5 0 0.5 1 1.5
Figure 4.3.1: The domain of attraction and the null controllable region.
On the other hand, ∂S(k) is the limit cycle of the time-reversed system of
(4.3.2),
ż = −Az − b sat(kf0 z). (4.3.5)
Here the limit cycle as a trajectory goes clockwise. From Remark 4.2.1,
we know that the limit cycle is symmetric and has two intersections with
kf0 z = 1 and two with kf0 z = −1 (see Fig. 4.3.1). Let T be the time
required for the limit cycle trajectory to go from y1 to x1 , and T2 the time
from x1 to −y1 , then
∂S(k) = ±e−(A+kbf0 )t y1 : t ∈ [0, T ]
t
∪ ± e−At x1 − e−A(t−τ ) bdτ : t ∈ [0, T2 ] . (4.3.6)
0
lim T = 0,
k→∞
70 Chapter 4. Stabilization – Continuous-Time Systems
and
lim T2 = ∞ (or Tp ).
k→∞
If these are true, the lengths of the parts of the limit cycle between the
lines kf0 z = 1 and kf0 z = −1 will tend to zero. We will first show that
lim T = 0.
k→∞
Let
x11 y11
x1 = 1 , y1 = 1 ,
2ka2 − 2ka 2
and
−(A+kbf0 )t
y11
kf0 e 1 ≤ 1, ∀ t ∈ [0, T ].
− 2ka 2
We also note that the upward movement of the trajectory at x1 and y1
implies that
2k − 1 1 − 2k
x11 < , y11 < .
2k 2k
As k → ∞, the matrix
0 −a1
A + kbf0 =
1 a2 (1 − 2k)
has two distinct real eigenvalues −λ1 and −λ2 (Their dependence on k is
also omitted). Assume λ2 > λ1 . Since λ1 λ2 = a1 and λ1 + λ2 = a2 (2k − 1),
we have
lim λ1 = 0, lim λ2 = +∞.
k→∞ k→∞
Let
λ2 λ1
V = .
1 1
It can be verified that
−λ1 0
A + kbf0 = V V −1 .
0 −λ2
4.3. Semi-Global Stabilization – Planar Systems 71
It follows that
−1
(A+kbf0 )T λ2 λ1 e−λ1 T 0 λ2 λ1
e = ,
1 1 0 e−λ2 T 1 1
1 λ2 − λ1 + λ2 e−λ2 T − λ1 e−λ1 T
x11 = ,
2ka2 e−λ2 T − e−λ1 T
and,
1 λ2 − λ1 + λ2 eλ2 T − λ1 eλ1 T
y11 = .
2ka2 eλ1 T − eλ2 T
Since
1 − 2k λ1 + λ2
y11 < =−
2k 2ka2
and
eλ1 T − eλ2 T < 0,
we have
λ1 eλ2 T < λ2 − λ1 + λ2 eλ1 T < 2λ2 eλ1 T
and
ln 2λ 2
1 2λ2
T < λ1
= ln 2 ,
λ2 − λ1 λ2 − λ1 a1
where we note that
a1
λ1 = .
λ2
Since
lim λ2 = ∞, lim λ1 = 0,
k→∞ k→∞
we obtain
lim T = 0.
k→∞
It follows that
y11 λ2 − λ1 + λ2 eλ2 T − λ1 eλ1 T
lim = lim
k→∞ x11 k→∞ (λ2 − λ1 )e(λ1 +λ2 )T + λ2 eλ1 T − λ1 eλ2 T
λ2 − λ1 1+e
λ1 T
1+eλ2 T
= lim λ2 T (1+eλ1 T ) = 1,
k→∞ λ2 eλ1 T − λ1 e 1+eλ2 T
lim λ1 = 0.
k→∞
72 Chapter 4. Stabilization – Continuous-Time Systems
Since x1 and y1 are bounded by the null controllable region, x11 and y11
are finite numbers. Hence,
i.e.,
y11 −AT2
x11
1 = −e 1 + I − e−AT2 A−1 b,
− 2ka2 2ka2
y 11 y 11 − x11 0
I + e−AT2 = I − e−AT2 A−1 b+e−AT2 1 + 1 .
0 − 2ka 2 2ka2
The matrix A has two real eigenvalues at 0.5 and 1. Following the design
procedure, we obtain
f0 = 0 3 .
0.8
0.6
0.4
0.2
−0.2
−0.4
−0.6
−0.8
−1
−1 −0.8 −0.6 −0.4 −0.2 0 0.2 0.4 0.6 0.8 1
−1
−2
−3
−4
−5
−6 −4 −2 0 2 4 6
ẋa = A1 xa + b1 u
C a = C 1 × Rn .
and
Ω2 (γ2 ) := xs ∈ Rn : |xs | ≤ γ2 . (4.4.3)
AT1 P 1 + P 1 A1 − P 1 b1 bT1 P 1 = 0.
76 Chapter 4. Stabilization – Continuous-Time Systems
Moreover,
lim r2 (ε) = ∞,
ε→0
3|P1 (ε)| 1
r2 (ε) = ' · 1 .
|P2 (ε)| + |P2 (ε)| + 3|P1 (ε)| |P3 (ε)| 2 P (ε) bT P 12 (ε)
2 2
1
Theorem 4.4.1. Let f0 = −bT1 P1 . For any γ1 < 1 and γ2 > 0, there exist
a k > 0.5 and an ε > 0 such that Ω1 (γ1 )×Ω2 (γ2 ) is contained in the domain
of attraction of the equilibrium x = 0 of the closed-loop system
sat(kf0 xa ), x ∈/ D(ε),
ẋ = Ax + bu, u = (4.4.7)
sat(f (ε)x), x ∈ D(ε).
Proof. Since γ1 < 1, by Theorem 4.3.1, there exists a k > 0.5 such that
Ω1 (γ1 ) lies in the interior of the domain of attraction of the equilibrium
xa = 0 of
ẋa = A1 xa + b1 sat(kf0 xa ). (4.4.8)
Let ε0 > 0 be given. For an initial state xa0 ∈ Ω1 (γ1 ), denote the trajectory
of (4.4.8) as ψ(t, xa0 ). Define
T (xa0 ) := min t ≥ 0 : |ψ(t, xa0 )| ≤ r1 (ε0 ) ,
then T (xa0 ) is the time when ψ(t, xa0 ) first enters the ball
B1 = xa ∈ R2 : |xa | ≤ r1 (ε0 ) .
Let
TM = max T (xa0 ) : xa0 ∈ ∂Ω1 (γ1 ) (4.4.9)
and
TM
A2 (TM −τ )
γ = max eA2 t γ2 + e b2 dτ, (4.4.10)
t∈[0,TM ] 0
78 Chapter 4. Stabilization – Continuous-Time Systems
then, by Lemma 4.4.1, there exists an ε < ε0 such that r1 (ε) ≥ r1 (ε0 ),
r2 (ε) ≥ γ and
D1 (ε) = x ∈ R2+n : |xa | ≤ r1 (ε), |xs | ≤ r2 (ε) ⊂ D(ε)
Remark 4.4.1.
1) For the case that the open-loop system has a single exponentially
unstable pole, Theorem 4.4.1 is also true. Let x, A, b and P (ε) be
partitioned similarly to the two exponentially unstable pole case, with
xa , A1 , b1 , P1 (ε) ∈ R. Then, it is easy to verify that
% &
|b1 | |b1 |
C1 = − , ,
A1 A1
and
2A1 2A1
P1 = , f0 = − .
b21 b1
Also, for all k > 0.5, the origin of the system
ẋa = A1 xa + b1 sat(kf0 xa )
2) For the case that the open-loop system has no exponentially unstable
poles (ANCBC), we have C a = Rn , and
lim P (ε) = 0.
ε→∞
Therefore, the set D(ε), which is in the domain of attraction for the
system under the control u = sat(f (ε)x), can be made arbitrarily
large to include any compact subset of Rn . This means that the
control u = sat(f (ε)x) can achieve semi-global stabilization and no
switching is necessary.
Step 1. We obtain
kf0 = 0.1360 −0.748 .
This is searched by comparing the boundary of Ω1 (γ1 ) with the limit
cycle of (4.4.8), which can always be made large enough to enclose
Ω1 (γ1 ).
Step 3. Search ε such that r1 (ε) > r1 (ε0 ) and r2 (ε) > γ. One such ε is
given by ε = 0.45ε0, with r1 (ε) = 0.5887 > r1 (ε0 ), r2 (ε) = 17.7138 >
γ, and
f (ε) = 0.120097 −0.660525 0 0.000949 0 .
80 Chapter 4. Stabilization – Continuous-Time Systems
−2
−4
−6
0 100 200 300 400 500 600 700
Applying the feedback matrices kf0 and f (ε) to the system (4.4.7), we
get the desired domain of attraction. Shown in Figs. 4.4.1 and 4.4.2 are the
time responses x1 (t) and x3 (t) of (4.4.7) with an initial state
4.7005
0.70001
x0 =
10 ,
0
0
15
10
−5
−10
−15
0 100 200 300 400 500 600 700
the convergence rate is increased. Applying the above control law, with
N = 20, to the system with the same initial state, the time response of x3
is plotted in Fig. 4.4.3. Fig. 4.4.4 is the plot of |x(t)|. In the figures, we see
that the convergence rate is indeed increased.
82 Chapter 4. Stabilization – Continuous-Time Systems
15
10
−5
−10
−15
0 100 200 300 400 500 600 700
15
10
0
0 100 200 300 400 500 600 700
4.5. Conclusions
In this chapter, we provided a simple semi-global stabilization strategy
for exponentially unstable linear systems with saturating actuators. For a
planar anti-stable system, the resulting feedback laws are saturated linear
state feedbacks and for higher order systems with one or two anti-stable
modes, they are piecewise linear state feedbacks with only one switch. As
a corollary, we also recovered the known result that an ANCBC system
can be semi-globally stabilized with a saturated linear feedback. The semi-
global stabilizability for general systems will be established in Chapter 9,
where more complicated feedback laws will be used.
84 Chapter 4. Stabilization – Continuous-Time Systems
Chapter 5
Stabilization on Null
Controllable Region –
Discrete-Time Systems
5.1. Introduction
85
86 Chapter 5. Stabilization – Discrete-Time Systems
is the simpler controller structure and that the controller is less sensitive
to system model uncertainties and disturbances.
Finally, as in the continuous-time case, we will untilize the above design
technique to construct feedback laws that achieve semi-global stabilization
on the null controllable region for higher order systems with no more than
two exponentially unstable poles. Our presentation in this chapter draws
on materials from [45].
In Sections 5.2 and 5.3, we will establish global stabilization on the
null controllable region for second order anti-stable linear systems subject
to actuator saturation. In particular, in Section 5.2, a saturated linear
feedback law is constructed that causes all the states in the null controllable
region to reach a set of equilibrium points in a finite number of steps. In
Section 5.3, based on this saturated linear feedback law, an overall nonlinear
feedback law is constructed that first causes all states in the null controllable
region to reach the set of equilibrium points in a finite number of steps and
then forces them from there to the origin, also in a finite number of steps.
Section 5.4 presents a semi-global stabilization strategy for second order
anti-stable linear systems. Section 5.5 deals with higher order systems with
one or two anti-stable poles. Finally, a brief conclusion to the chapter is
drawn in Section 5.6.
Denote the equilibrium point of (5.2.1) under the constant control u(k) = 1
as x+
e := (I − A)
−1
b. This is also the equilibrium point of (5.2.2) under the
constant control v(k) = 1. Define x− +
e := −xe .
The study of stabilization of the above system depends on the eigen-
structure of the system matrix A. For simplicity, we will restrict our in-
vestigation on two cases, although Example 5.3.1 shows that the results
5.2. Global Stabilization at Set of Equilibria 87
developed for these two cases are also valid for the general case. These two
cases are:
with G defined by
G = (I − Ā)−1 b̄ (−2Ā−1 + I)b̄ ,
we have
−1 1
G = TG = .
0 1
Hence, for simplicity, we make the following assumption in this chapter.
Assumption 5.2.1. For the second order system (5.2.1), assume that A
and b satisfy
−1
−1
−1 1
(I − A) b (−2A + I)b = . (5.2.3)
0 1
88 Chapter 5. Stabilization – Discrete-Time Systems
f x+
e = 1, f (A−1 x−
e −A
−1
b) = 1, (5.2.4)
and
f A−1 x+
e = 0, f A−1 b = −1. (5.2.5)
Proof. Multiplying both sides of (5.2.3) from the left with f yields
By Assumption 5.2.1,
−1
x+
e = (I − A)
−1
b= ,
0
so we have f x+
e = 1 and
If we combine this with (5.2.4), we get f A−1 b = −1. Hence, (5.2.5) holds.
Now we have
−1 + + + −1
(A + bf ) = (A + bf )xe = Axe + b = xe = .
0 0
T
Multiplying both sides of (5.2.3) from right with 1 1 , we obtain
−1 0 0 0
−2A b = =⇒ 2b + A = . (5.2.7)
1 1 0
5.2. Global Stabilization at Set of Equilibria 89
Therefore,
0 0 0
(A + bf ) =A + 2b = .
1 1 0
It then follows that
−1 0 −1 0 1 0
(A + bf ) = =⇒ (A + bf ) = .
0 1 0 0 0 0
2
Using this f = −1 2 , we construct a saturated linear feedback
For any initial state x(0) ∈ C, the trajectory of (5.2.9) will reach the line
*
set E = E {x+ −
e , xe } in a finite number of steps and then remain in E.
Furthermore, E is the set of stable equilibrium points of (5.2.9).
This theorem is illustrated in Fig. 5.2.1 for Case 2, where ±x+ e are in
the interior of C. The two parallel lines are f x = ±1 and E is the horizontal
line segment between the two parallel lines.
Proof of Theorem 5.2.1. We will prove this theorem for Case 1 and
Case 2 separately.
1.5
0.5
−0.5
−1
−1.5
−1.5 −1 −0.5 0 0.5 1 1.5
Here we use the integer “i” instead of “” in (3.4.4) and “k” in (3.4.5). For
convenience, we have also replaced x+ −
e in (3.4.5) with xe and −1 in the
sum with 1 accordingly. We consider half of the set Ext(C),
i−1
A−i x−
e + A−(i−−1) (−A−1 b) : 1 ≤ i ≤ ∞ ,
=0
i−1
pi := A−i x−
e + A−(i−−1) −A−1 b .
=0
pi p4
p3
p2
p1 fx=1
x+ fx=-1
e
o xe- ( po )
-p
1
-p 2
-p
3
The null controllable region C is cut into two symmetric parts by the
straight line passing through x+ −
e and xe . Denote the upper part with
+
pi , i ≥ 1, at its extremal points as C . This part is composed of the
triangles pi x+
e pi+1 , denoted by ∆i , i ≥ 0:
∆i := γ1 x+ e + γ2 pi + γ3 pi+1 : γ1 + γ2 + γ3 = 1, γ1 ∈ (0, 1), γ2 , γ3 ≥ 0 ,
We first show that any state in ∆0 will reach E at the next step. For
x(k) ∈ ∆0 , there exist γ1 ∈ (0, 1), γ2 , γ3 ≥ 0, γ1 + γ2 + γ3 = 1, such that
x(k) = γ1 x+ + −
e + γ2 p0 + γ3 p1 = γ1 xe + γ2 xe + γ3 p1 .
Recalling that
−1 1 0
x+
e = , x− +
e = −xe , A + bf = ,
0 0 0
we have
(A + bf )x+ +
e = xe , (A + bf )x− −
e = xe .
(A + bf )p1 = Ap1 + b = x−
e .
It follows that
x(k + 1) = (A + bf )x(k) = γ1 x+ −
e + (γ2 + γ3 )xe ∈ E.
x(k + 1) = Ax(k) + b.
Hence,
x(k + 1) = Ax(k) + b
= A(γ1 x+
e + γ2 pi + γ3 pi+1 ) + (γ1 + γ2 + γ3 )b
= γ1 (Ax+
e + b) + γ2 (Api + b) + γ3 (Api+1 + b)
= γ1 x+
e + γ2 pi−1 + γ3 pi
∈ ∆i−1 ,
5.2. Global Stabilization at Set of Equilibria 93
A∆i + b = ∆i−1 , i ≥ 1.
rN + 1 −
x−
s = x .
rN − 1 e
Let p0 = x−
s and
i−1
pi = A−i p0 + A−(i−−1) (−A−1 b)(1), i ∈ [1, N ].
=0
Then,
Api + b = pi−1 , i ∈ [1, N ], (5.2.12)
and pN = x+
s . Since f A
−1 −
xe = 0 and f A−1 b = −1 (by (5.2.5)), we have
f p1 = f A−1 p0 − A−1 b = f A−1 x−
s − fA
−1
b = 1.
p3
p2
p22
pi p21
p1 fx=1
1 p12
p 1
fx=-1
x+ ) pN pN
2 1
x -e ( po1) 11111
00000
s( pN11111111
00000000 o 00000p 2 x -s ( po )
11111
00000000
11111111
2 p N+1
1 x+e o
pN+1 pN+1
-p 1
-p
2
There are N −1 triangles above the line f x = 1, which are, ∆i , i ∈ [1, N −1].
In addition, there is a small triangle beside ∆N −1 , lying within −∆0 , which
is also above the line f x = 1 (the left gridded triangle in Fig. 5.2.3). We
need to deal with this triangle separately. The line f x = 1 intersects with
the boundary of C at two points − − one is p1 , and we label the other as
pN +1 . It can be shown that pN +1 lies on the line between x+ s (pN ) and −p1 .
So there exists a γ ∈ (0, 1) such that
pN +1 = γx+
s − (1 − γ)p1 , f pN +1 = 1. (5.2.13)
Since f x+
e = 1, we have
rN + 1
f x+
s = .
rN − 1
It follows that
rN − 1
γ= .
rN
5.2. Global Stabilization at Set of Equilibria 95
From (5.2.13),
rN − 1 + 1
ApN +1 = Axs − N Ap1 . (5.2.14)
rN r
And from (5.2.12),
Ap1 = p0 − b = −x+
s − b. (5.2.15)
Noting that Ax+ +
e + b = xe , we have
rN + 1 + rN + 1 +
Ax+
s = Axe = N (x − b),
r −1
N r −1 e
and
rN + 1
Ax+ +
s = xs − b. (5.2.16)
rN − 1
Putting (5.2.15) and (5.2.16) into (5.2.14), we obtain
% &
rN − 1 + rN + 1 1
ApN +1 = x − b − N (−x+ +
s − b) = xs − b.
r N s
r −1
N r
This shows that
ApN +1 + b = x+
s = pN ,
Because ∆i , i ∈ [1, N ], are above the line f x = 1 and the extremal points
pi , i ∈ [1, N + 1], satisfy (5.2.12), similar to Case 1, the relation between
∆i s can be simply put as follows,
i−1
p1i = A−i p10 + A−(i−−1) (−A−1 b)(1), i ∈ [0, N + 1].
=0
Hence,
Api + b = pi−1 , Ap1i + b = p1i−1 .
It follows from the equality Ax+ +
e + b = xe that
A(pi − x+ +
e ) = pi−1 − xe ,
A(p1i − x+ 1 +
e ) = pi−1 − xe ,
By induction, we have
Ai (pi − x+ +
e ) = p0 − xe ,
Ai (p1i − x+ 1 +
e ) = p0 − xe ,
Recalling that
rN + 1 −
p10 = x−
e , p0 = x−
s = x ,
rN − 1 e
we can verify that
rN − 1
p10 − x+
e = p 0 − x+
e = 2x−
e , (5.2.17)
rN
and
1 2
p0 − p10 = p0 − x+
e = N x− . (5.2.18)
r N r −1 e
We thus have, by linearity,
rN − 1
p1i − x+
e = N
pi − x+
e
r
and
1
pi − p1i =
N
pi − x+ e .
r
From the above equalities, we see that pi , p1i and x+ e are on the same line.
Inductively, let pj+1
0 be a point between p j
0 and p 0 such that
1 1 +
p0 − pj+1
0 = N
p 0 − p j
0 = N (j+1) p0 − xe , (5.2.19)
r r
and
i−1
pj+1
i = A−i pj+1
0 + A−(i−−1) −A−1 b (1), i ∈ [0, N + 1]. (5.2.20)
=0
5.2. Global Stabilization at Set of Equilibria 97
From (5.2.20), we see that pj0 , pj1 , · · · , pjN +1 is a trajectory of (5.2.2) under
the constant control v(k) ≡ 1. Since pji , i ≥ 1, are all above the line f x = 1,
it follows that
sat f pji = 1.
This shows that the closed-loop system (5.2.9) has a reversed trajectory,
pjN +1 , pjN , · · · , pj0 . We will show that pj0 is closer to the origin than pjN .
Note that p10 = x− e . It follows from (5.2.17), (5.2.18) and (5.2.19) that
1 1
p0 − pj+1
0 = (p0 − x+ −
e ) = N j (p0 − xe ).
rN (j+1) r
Recalling that p0 = −pN , we have,
1
−pN − pj+1
0 = (−pN − x−
e ),
rN j
i.e.,
1
pN + pj+1
0 = (pN − x+
e ). (5.2.22)
rN j
From (5.2.21), we have,
1
pN − pjN = (pN − x+
e ).
rN j
Subtracting the above equation from (5.2.22), we get
pj+1
0 = −pjN . (5.2.23)
p
p 3i+1 i+1
2
pi+1
p 1i+1
x+
e p1 p 2 p 3 pi
i i i
be compressed to the line set E in the next step and will stay there.
We now need to know how the other trajectories travel. If x(0) ∈
∆N \ ∆1N (the quadrilateral p1N p2N p2N +1 p1N +1 ), then at the N th step, the
2
trajectory will reach ∆20 \ ∆10 (the quadrilateral p10 p20 p21 p11 ). Since p20 = −p1N ,
∆20 \ ∆10 is the union of a parallelogram and −∆1N (the right grided small
triangle in Fig. 5.2.3). The states in the parallelogram will reach the side
of −∆1N on the line between x− −
e and xs at the next step because the
parallelogram is in the linear region |f x| ≤ 1. No matter at the N th or the
(N + 1)th step, the set ∆2N \ ∆1N will be transfered to −∆1N . Similarly, the
set ∆jN \ ∆j−1
N will be transfered to −∆j−1 N \ (−∆j−2 N ) at the N th or the
(N + 1)th step (refer to (5.2.23)).
Now suppose x(0) ∈ ∆N , then there always exists a j ≥ 1 such that
x(0) ∈ ∆jN \ ∆j−1N . By the foregoing argument, the trajectory will reach
−∆j−1
N \ (−∆ j−2
N ) at the N th or the (N + 1)th step. By continuing this
5.3. Global Stabilization – Planar Systems 99
process, the trajectory will first reach ∆1N (or −∆1N ), then ∆10 (or −∆10 ) and
finally E and remain in E.
Since any state in C can be traced back to ∆N , we know that all the
trajectories starting from C will reach E in a finite number of steps. 2
was constructed that drives all the states in the null controllable region C
to the set E. In what follows, we will construct a feedback law that first
drives all states in C to E in a finite number of steps, and then from there
to the origin, also in a finite number of steps.
With
x1 (k)
x(k) = ∈ E,
0
we might be tempted to look for a control that keeps the state in E, while
forcing it closer to the origin, i.e., to find a control |u(k)| ≤ 1 such that
x1 (k) x1 (k)
A + bu(k) = ρ,
0 0
with |ρ| < 1. Unfortunately, this is impossible because for each x(k) ∈ E,
there is only one control to keep it in E and this control can only keep the
state stationary. One solution is to use two controls u(k) and u(k + 1) in a
sequence. We use u(k) to drive the state away from E and use u(k + 1) to
drive it back to E but at a point closer to the origin.
A point in E can be expressed as
x1 (k)
x(k) = , x1 (k) ∈ [−1, 1].
0
with |ρ| < 1. A smaller ρ indicates that x(k + 2) is closer to the origin and
implies a faster convergence rate. Therefore, we would like to minimize |ρ|.
The problem can be formulated as:
Lemma 5.3.1. For |x1 (k)| ≤ 1, the optimization problem (5.3.2) has a
solution: with
1
ρ1 (x1 (k)) = − (det A − trA + 1) + trA − det A,
|x1 (k)|
1 det A − trA + 1 det A
ρ2 (x1 (k)) = − + ,
|x1 (k)| trA − 1 trA − 1
Furthermore, when |x1 (k)| = 1, we have ρ∗ (x1 (k)) = 1 and when |x1 (k)| <
1, we have ρ∗ (x1 (k)) < 1. Also, ρ∗ is an increasing function of |x1 (k)|.
Proof. It is easy to see that if x1 (k) = 0, then inf |ρ| = 0. We assume that
x1 (k) > 0. The case where x1 (k) < 0 can be dealt with in a similar way.
Recall from (5.2.3) that
1
b = (A − I) ,
0
5.3. Global Stabilization – Planar Systems 101
where
1
ρ1 = − (det A − trA + 1) + trA − det A,
x1 (k)
1 (det A − trA + 1) det A
ρ2 = − + ,
x1 (k) trA − 1 trA − 1
1
ρ3 = (det A − trA + 1) + trA − det A,
x1 (k)
1 (det A − trA + 1) det A
ρ4 = + .
x1 (k) trA − 1 trA − 1
When x1 (k) = 1,
ρ2 = ρ3 = 1, ρ1 < 1, ρ4 > 1,
ρ∗ = max(0, ρ1 , ρ2 ) < 1.
102 Chapter 5. Stabilization – Discrete-Time Systems
with
u(k) −1 ∗ x1 (k)
= Ab b ρ (x1 (k))I − A2 (5.3.7)
u(k + 1) 0
and ρ∗ (x1 (k)) given by (5.3.3). Then the solution of (5.3.6) lies in E for
even k, the sequence {|x1 (2)|, |x1 (4)|, · · ·} is monotonically decreasing and
there exists a k < ∞ such that x(k) = 0.
Since 0 < x1 (0) < 1, we have ρ∗ (x1 (0)) < 1 and x1 (2) < x1 (0). Because ρ∗
is an increasing function of |x1 (k)|, we have that
we see that there exists some xm > 0 such that ρ1 , ρ2 < 0 for all |x1 (k)| ≤
xm . From (5.3.8), there exists a k < ∞ such that |x1 (k)| ≤ xm , so
ρ∗ (x1 (k)) = 0 and we get x1 (k + 2) = 0 and x(k + 2) = 0. 2
Now we know that under the saturated linear feedback (5.2.8), all the
states in C will be steered to the line segment E (or E in Case 2) in a finite
5.3. Global Stabilization – Planar Systems 103
number of steps, and under the two step controller (5.3.7), any state in E
will be driven to the origin in finite number of steps. What remains is to
combine these two controllers in a simple way.
The question is, given a state x(k) ∈ C, which controller should we use?
If x(k) ∈ E, we have to use the two step controller, which generates two
controls u(k) and u(k + 1) at a time. After u(k) is applied, the state reaches
x(k + 1) which may not be in E. This same x(k + 1), however, might arise
from using the saturated linear feedback (5.2.8). One approach would be
to add some memory in the control law which keeps track of where the
previous state was, namely in E or not in E. There is a simpler approach,
as is shown below.
Lemma 5.3.3. Let x(k) ∈ E and u(k), u(k + 1) be the two controls gen-
erated by the controller (5.3.7). Then,
|f x(k + 1)| ≤ 1.
Proof. After applying the two controls u(k) and u(k + 1), we have
x1 (k + 2)
x(k + 2) = Ax(k + 1) + bu(k + 1) = , x1 (k + 2) ∈ (−1, 1).
0
It follows that
−1 x1 (k + 2)
x(k + 1) = A − A−1 bu(k + 1).
0
Therefore,
f x(k + 1) = u(k + 1).
This proves a). Item b) follows from the above equation and the fact that
|u(k + 1)| ≤ 1. 2
This result says that the second control u(k + 1) coincides with the
saturated linear feedback control u = sat(f x). So we don’t need to know
104 Chapter 5. Stabilization – Discrete-Time Systems
Theorem 5.3.1. Consider the system (5.3.10). For every x(0) ∈ C, there
exists a k < ∞ such that x(k) = 0.
and
√
2
eig(A) = r(cos β ± sin β), r = 1.3538, β = π.
11
Two trajectories of the system are plotted in Fig. 5.3.1. The initial points
are marked with “∗”. They are very close to the boundary of C. The
extremal points of C are marked with “◦”. We see that the two trajectories
reach the origin in finite number of steps.
1.5
0.5
−0.5
−1
−1.5
−2
−2 −1.5 −1 −0.5 0 0.5 1 1.5 2
We need to determine an invariant set in the linear region |f¯x| ≤ 1 for this
system. Let
1 0 b1 δ(2 − b1 δ)
P = , p2 = .
0 p2 2(b2 δ)2
Then,
(A + bf¯)T P (A + bf¯) − P < 0.
Define a level set
E P, c2 := x : xT P x ≤ c2
and
c0 = max c > 0 : |f¯x| ≤ 1, ∀ x ∈ E(P, c2 ) .
Then, E(P, c20 ) is the largest level set that lies entirely in the linear region.
We see that both P and E(P, c20 ) depend on δ. When δ is very small, p2
will be very big and E(P, c20 ) will be a flat ellipsoid that encloses almost
all points in E (Recall that E is a horizontal line between f x = 1 and
f x = −1). As δ → 0, p2 → ∞ and E(P, c20 ) → E (see Fig. 5.4.1, where the
inner ellipsoid is E(P, c20 )).
Now consider the switching control law,
sat(f x), x ∈ / E(P, c20 ),
u(x) = (5.4.2)
sat(f¯x), x ∈ E(P, c20 ).
Note that E(P, c20 ) is a contractive invariant set: once the trajectory enters
E(P, c20 ), it will remain in E(P, c20 ) and then converge to the origin under
5.5. Semi-Global Stabilization – Higher Order Systems 107
the control law u = sat(f¯x) = f¯x. Because any closed interval in E can be
enclosed in E(P, c20 ) by decreasing δ, the domain of attraction of the origin
of the closed-loop system can be made arbitrarily close to C for the case
that A has two real eigenvalues.
and c0 = 0.9747. The simulation result is shown in Fig. 5.4.1, where “∗”
is the initial state of the trajectory. The trajectory reaches the set E(P, c20 )
in 5 steps, then remains in E(P, c20 ) and converges to the origin.
1.5
0.5
−0.5
−1
−1.5
−2
−1.5 −1 −0.5 0 0.5 1 1.5
xa (k + 1) = A1 xa (k) + b1 u(k)
and
Ω2 (γ2 ) := xs ∈ Rn : |xs | ≤ γ2 . (5.5.3)
As ε ↓ 0, P (ε) decreases.
Let P 1 be the unique positive definite solution to the DARE
Then by the continuity property of the solution of the DARE (see [86,
Theorem 4.2.7], we have
P1 0
lim P (ε) = .
ε→0 0 0
Let
f1 (ε) := −(bT P (ε)b + 1)−1 bT P (ε)A.
First, consider the domain of attraction of the origin of the following closed-
loop system
x(k + 1) = Ax(k) + b sat(f1 (ε)x(k)). (5.5.5)
It can be shown by a similar method of [65, Lemma 2.3.6] that there exist
constants c > 0 and ε∗ > 0 such that
1 1
f1 (ε)P − 2 (ε) = (bT P (ε)b + 1)−1 bT P (ε)AP − 2 (ε) < c, ∀ ε ∈ (0, ε∗ ].
It follows that
1
D(ε) := x ∈ R2+n : xT P (ε)x ≤
c2
Then,
D1 (ε) := x ∈ R2+n : |xa | ≤ r1 (ε), |xs | ≤ r2 (ε) ⊂ D(ε).
Moreover,
lim r2 (ε) = ∞,
ε→0
Theorem 5.5.1. Let h(·) be defined in (5.3.9). For any γ1 < 1 and γ2 > 0,
there exists an ε > 0 such that Ω1 (γ1 )×Ω2 (γ2 ) is in the domain of attraction
of the origin for the following closed-loop system
h(xa ), x∈
/ D(ε),
x(k + 1) = Ax(k) + bu(k), u(k) = (5.5.7)
sat(f1 (ε)x), x ∈ D(ε).
Proof. The proof is analogous to that of Theorem 4.4.1. 2
Remark 5.5.1.
1) For the case that the open loop system has a single exponentially
unstable pole, there exists a simpler nonlinear function h(·) such that
Theorem 5.5.1 is true. Let x, A and b be partitioned similarly to the
two exponentially unstable pole case, with xa , A1 , b1 ∈ R. Then, it is
easy to verify that
% &
|b1 | |b1 |
C1 = − , .
|A1 | − 1 |A1 | − 1
Also, the origin of the system
% &
A1
xa (k + 1) = A1 xa (k) − b1 sat xa (k)
b1
has a domain of attraction equal to C 1 . Hence, if we let
% &
A1
h(xa ) = −sat xa ,
b1
then Theorem 5.5.1 is also true for this case.
5.6. Conclusions 111
2) For the case that the open loop system has no exponentially unstable
poles (ANCBC), we have C = Rn , and
lim P (ε) = 0.
ε→∞
Therefore, the set D(ε), which is in the domain of attraction for the
system under the control u = sat(f1 (ε)x), can be made arbitrarily
large to include any compact subset of Rn . This means that the
control u = sat(f1 (ε)x) can achieve semi-global stabilization and no
switching is necessary.
5.6. Conclusions
In this chapter, we established global and semi-global stabilizability for
exponentially unstable discrete-time linear systems with saturating actu-
ators. For a planar anti-stable system, controllers are constructed that
achieve global and semi-global stabilization, respectively. For higher order
systems with one or two anti-stable modes, semi-global stabilizability was
established also by explicit construction of feedback laws. The semi-global
stabilizability for general systems will be established in Chapter 9.
112 Chapter 5. Stabilization – Discrete-Time Systems
Chapter 6
Practical Stabilization on
Null Controllable Region
6.1. Introduction
113
114 Chapter 6. Practical Stabilization
Given any (arbitrarily small) set that contains the origin in its interior, we
will show that its domain of attraction can be made to include any compact
subset of C a in the presence of disturbances bounded by an (arbitrarily
6.3. Proof of Main Results 115
large) given number. More specifically, we will establish the following result
on semi-global practical stabilization on the null controllable region for the
system (6.2.1).
Theorem 6.2.1. Consider the system (6.2.1) with A having two exponen-
tially unstable eigenvalues. Given any set X 0 ⊂ int(C a ), any set X ∞ such
that 0 ∈ int(X ∞ ), and any positive number D, there is a feedback law
u = F (x) such that any trajectory of the closed-loop system enters the set
X ∞ in a finite time an remains there as long as it starts from within the
set X 0 .
2.5
1.5
1 , , ,
p3 p2 p1 p1 p2 p3
0.5
0
−14 −12 −10 −8 −6 −4 −2 0 2 4 6
and p1 , p2 and p3 are the first intersections of the trajectories that start
from p1 , p2 and p3 . Then,
It follows that
|p2 −p1 |
|p2 − p1 | |p2 − p1 | 1+ |p3 −p2 |
< =⇒ < 1.
|p3 − p2 | |p3 − p2 | 1+ |p2 −p1 |
|p3 −p2 |
Hence,
|p2 −p1 |
|p3 − p1 | |p − p2 | + |p2 − p1 | |p − p2 | 1 + |p3 −p2 | |p3 − p2 |
= 3 = 3 < .
|p3 − p1 | |p3 − p2 | + |p2 − p1 | |p3 − p2 | 1 + |p2 −p1 | |p3 − p2 |
|p3 −p2 |
(6.3.4)
Also from (6.3.3), we have
|p3 −p2 |
|p3 − p2 | |p3 − p2 | 1+ |p2 −p1 |
> =⇒ > 1.
|p2 − p1 | |p2 − p1 | 1+ |p3 −p2 |
|p2 −p1 |
Hence,
|p3 −p2 |
|p3 − p1 | |p − p2 | + |p2 − p1 | |p − p1 | 1 + |p2 −p1 | |p2 − p1 |
= 3 = 2 > .
|p3 − p1 | |p3 − p2 | + |p2 − p1 | |p2 − p1 | 1 + |p3 −p2 | |p2 − p1 |
|p2 −p1 |
(6.3.5)
Combining (6.3.4) and (6.3.5), we obtain
We will study the trajectories of (6.3.7) with respect to two horizontal lines
kf x = 1 and kf x = −1 where
f= 0 1
and k > 0. For some points on the line kf x = 1, the trajectories of (6.3.7)
starting from these points will enter the region
x ∈ R2 : |kf x| < 1
and then intersect the line kf x = −1. We will define some functions to
relate these points on kf x = 1 to the first intersections of the trajectories
with kf x = −1. The functions will be defined in terms of the point
− ak2
p0 =
− k1
on kf x = −1. It is easy to see that, if a point x is on kf x = −1 and is to
the left of p0 , then the vector ẋ points downward; if x is to the right of p0 ,
then the vector ẋ points upward (see Fig. 6.3.2).
This defines two functions x11 → y11 , and x11 → T with x11 ∈ (−∞, xm ],
y11 ∈ (−∞, − ak2 ] and T ∈ (0, Td ].
6.3. Proof of Main Results 119
Lemma 6.3.2. For all x11 ∈ (−∞, xm ), we have x11 < y11 and
dy11 d2 y11 dT
> 1, > 0, > 0.
dx11 dx211 dx11
1.5
, , , ,
p p p p
3 2 1 0
1
0.5
−0.5
p p p p
3 2 1 0
−1
−1.5
−2
−2.5
−12 −10 −8 −6 −4 −2 0 2 4 6
where
0 −a1 −b1
A= , b= ,
1 a2 −b2
120 Chapter 6. Practical Stabilization
a1 , a2 , b2 > 0, b1 ≥ 0, and
f= 0 1 .
If k > a2 /b2 , then A + kbf is Hurwitz and the origin is the unique equi-
librium point of (6.3.9) and it is stable. Denote the domain of attraction
of the origin as S(k), then by Theorem 4.2.1, ∂S(k) is the unique limit
cycle of (6.3.9). We will further show that the domain of attraction S(k)
increases as k increases.
Consider k0 > a2 /b2 . Denote the increment of k as δk . Remark 4.2.1
says that ∂S(k0 ) is symmetric with respect to the origin and has two inter-
sections with each of the lines k0 f x = 1 and k0 f x = −1. In Fig. 6.3.3, the
closed curve is ∂S(k0 ) and p1 , p2 , p3 (= −p1 ) and p4 (= −p2 ) are the four
intersections.
In the sequel, for a vector p ∈ R2 , we use (p)i to denote the ith coordi-
nate of p.
Since at p2 , the trajectory goes downward, i.e., ẋ2 < 0, so,
a2 − k0 b2
(p2 )1 < < 0.
k0
From Lemma 6.3.2, we have (p1 )1 < (p2 )1 < 0 (x11 < y11 in the lemma).
Hence both p1 and p2 are on the left half plane. Define
(p2 )1 a2
∆(k0 ) = − k0 + − k0 .
b2 b2
Then ∆(k0 ) > 0 due to the fact that the trajectory goes downward at p2 .
Proposition 6.3.1. Suppose k0 > a2 /b2 . Then, for all δk ∈ (0, ∆(k0 )),
S(k0 ) ⊂ S(k0 + δk ).
Proof. Since δk > 0, the two lines (k0 + δk )f x = ±1 are between the lines
k0 f x = 1 and k0 f x = −1. It follows that the vector field above k0 f x = 1
and that below k0 f x = −1 are the same for
ẋ = Ax + b sat(k0 f x) (6.3.10)
and
ẋ = Ax + b sat((k0 + δk )f x). (6.3.11)
Hence, if a trajectory of (6.3.11) starts at p4 (or p2 ), it will go along ∂S(k0 )
to p1 (or p3 ).
6.3. Proof of Main Results 121
1.5
p p
1 1 4
s s
0.5 1 4
0
s
0
−0.5 ,, ,
s s s s
2 2 2 3
−1 p p
2 3
−1.5
−2
−8 −6 −4 −2 0 2 4 6 8
It follows from the claim that any trajectory of (6.3.11) that starts from
∂S(k0 ) will stay inside of S(k0 ) when it returns to the lines k0 f x = ±1. So
it is bounded and hence belongs to S(k0 + δk ). Note that any trajectory
outside of S(k0 + δk ) will diverge because the system has a unique unstable
limit cycle as the boundary of the domain of attraction. Since the two sets
are convex and open, we will have
S(k0 ) ⊂ S(k0 + δk ).
k0 f x ≤ sat((k0 + δk )f x).
k0 f x ≥ sat((k0 + δk )f x).
Let
x11
s1 = , h>0
h
be a point on ∂S(k0 ) between p1 and s0 such that ẋ of (6.3.11) at s1 directs
outward of ∂S(k0 ). Let
y11
s2 =
−h
be the intersection of ∂S(k0 ) with x2 = −h. Then by 1) the trajectory of
(6.3.11) starting at s1 will remain outside of ∂S(k0 ) above the abscissa. We
will show that when the trajectory reaches the line x2 = −h at s2 , it must
be inside ∂S(k0 ).
Let
0 0
s3 = , s4 =
−h h
(see Fig. 6.3.3). Denote the region enclosed by s1 s2 s3 s4 s1 as G0 , where the
part s1 s2 is on ∂S(k0 ) and the other parts are straight lines. Since this
region lies between k0 f x = ±1, the vector field of (6.3.10) on this region is,
1
for all δk ∈ [0, ∆(k0 )). Since y11 < (p2 )1 , h < k0 and −(a2 −k0 b2 −δk b2 ) > 0,
it follows that
Suppose that δk ∈ [0, ∆(k0 )). If δy11 < 0, then s2 is outside of ∂S(k0 )
and we must have δQ > 0. In this case, the left hand side of (6.3.16) is
negative and the righthand side is positive. This is a contradiction. Hence
δy11 must be positive and s2 must be inside ∂S(k0 ). By 2), the vector ẋ of
the system (6.3.11) directs inward of ∂S(k0 ) from s2 to p2 , therefore, when
the trajectory reaches k0 f x = −1, it must be to the right of p2 , i.e., still
inside ∂S(k0 ).
Now suppose s1 lies between (k0 + δk )f x = 1 and k0 f x = 1. Then, by
applying Green’s Theorem, we get exactly the same equation as (6.3.16),
although we need to partition the region enclosed by s1 s2 s3 s4 s1 into three
parts. And similar arguments apply. Thus we conclude that for all δk ∈
[0, ∆(k0 )), S(k0 ) ⊂ S(k0 + δk ). 2
and
f V −1 (f ) = 0 1 =: fˆ. (6.3.20)
Such a transformation always exists. To see this, assume that A is already
in this form. Since A is anti-stable and A + bf is stable, (f, A) must be
observable. So,
f A − a2 f
V (f ) =
f
is nonsingular and it can be verified that this V (f ) is the desired transfor-
mation matrix. Moreover, V (f ), V −1 (f ), b1 (f ) and b2 (f ) are all analytic
in f . Now consider the transformed system
Note that  and fˆ are both independent of f . Under the state transforma-
tion, S(f ) is transformed into
the domain of attraction for (6.3.21) and ∂ Ŝ(f ) is its unique limit cycle.
Let
x̂11
p1 =
1
be a point on the line fˆx̂ = 1 such that a trajectory starting at p1 will go
above the line and return to the line (for the first time) at
ŷ11
p1 = .
1
or equivalently,
x̂11 +(Â−1 b̂(f ))1 ŷ11 +(Â−1 b̂(f ))1
ÂT1 1+(Â−1 b̂(f ))2 1+(Â−1 b̂(f ))2
e = ,
1 1
be the first intersection of the trajectory with fˆx̂ = −1. Note that between
fˆx̂ = 1 and fˆx̂ = −1, the vector field of (6.3.21) is
˙x̂ = (Â + b̂(f )fˆ)x̂ = 0 −a1 + b1 (f ) x̂
1 a2 + b2 (f )
and that Â+b̂(f )fˆ is Hurwitz. Hence Lemma 6.3.2 applies and we know that
ẑ11 is continuously differentiable in ŷ11 . To see that ẑ11 is also continuously
differentiable in f , recall that we have assumed that A + bf has distinct
eigenvalues, so the eigenvalues are analytic in f . From (6.B.1) in the proof
of Lemma 6.3.2, we see that T is continuously differentiable in λ1 and λ2
and hence in f for T < Td . Thus, ẑ11 is also continuously differentiable
in f . Note here that ẑ11 corresponds to y11 in (6.B.2) and ŷ11 to x11 in
(6.B.1). In summary, we can write
is a point on the limit cycle ∂ Ŝ(f ), then, we must have ẑ11 = −x̂11 , i.e.,
hence,
∂g
= 0,
∂ x̂11
and by the implicit function theorem, x̂11 is differentiable in f . Recall that
x̂11
p1 =
1
is a point in the vector field of (6.3.21). The corresponding intersection in
the original system (6.3.17) is
−1 x̂11
V (f ) ,
1
clearly, it is also differentiable in f . 2
Combining Propositions 6.3.1 and 6.3.2, we have
Corollary 6.3.1. Consider the system (6.3.9) with A, b and f in the spec-
ified form. Given k1 and k2 , k2 > k1 > ab22 , suppose that A + kbf has
distinct eigenvalues for all k ∈ [k1 , k2 ], then
S(k) ⊂ S(k + δk ),
Clearly, A + kbf (ε) is Hurwitz for all k ≥ 0.5. It can also be shown that
there exists some k0 > 0 such that A + kbf (ε) has distinct eigenvalues for
all k > k0 and ε ∈ [0, 1].
For x(0) = x0 and w ∈ W, denote the state trajectory of (6.3.24) as
ψ(t, x0 , w).
Lemma 6.3.3. Consider the system (6.3.24). Let ε > 0 be given. Denote
σmax (P (ε))D2 4
c∞ = , c0 = .
ε(2k − 1) bT P (ε)b
and
S ∞ (ε, k) := E(P (ε), c∞ ) = x : xT P (ε)x ≤ c∞ .
Then, S p (ε) and S ∞ (ε, k) are invariant sets, and, for any w ∈ W, x0 ∈
S p (ε), ψ(t, x0 , w) will enter S ∞ (ε, k) in a finite time and remain in it.
Proof. Let V (x) = xT P (ε)x. It suffices to show that V̇ < 0 for all
x ∈ S p (ε) \ S ∞ (ε, k) and for all |w| ≤ D. In what follows, we simply write
P (ε) as P and f (ε) as f , since in this lemma, ε is fixed. Note that
Since
4
xT P x ≤ c0 = ,
bT P b
we have 1
1
bT P x ≤ bT P 2 P 2 x ≤ 2,
sat(kf x + w) ≤ kf x + w,
and
where dist (ψ(t, x0 , w), S ∞ (ε, k)) is the distance from the point
ψ(t, x0 , w) to the set S ∞ (ε, k). Our objective is to choose ε and k such
that
Let’s now assume that the condition c∞ < c0 in Lemma 6.3.3 is satisfied.
Then we have S p (ε) ⊂ S D (ε, k) ⊂ S(ε, k). By using the Lyapunov function
V (x) = xT P (ε)x, we can only determine a subset S p (ε) of S D (ε, k). As
ε decreases, P (ε) decreases. It was shown in [107] that if ε1 < ε2 , then
S p (ε2 ) ⊂ S p (ε1 ). Hence by decreasing ε, we can enlarge S p (ε). However,
since limε→0 S p (ε) can be much smaller than C a , we are unable to prove
that S D (ε, k) is close to C a by simply enlarging S p (ε) as was done in [85].
For this reason, we will resort to the detailed investigation on the vector
field of (6.3.24) in the presence of the disturbance.
We now continue with the proof of the theorem and focus on the second
order systems. Also assume that A is anti-stable. In this case, C a = C.
We will prove the theorem by showing that, given any X 0 ⊂ int(C), any
(arbitrarily small) X ∞ such that 0 ∈ int(X ∞ ), and any D > 0, there exist
an ε > 0 and a k ≥ 0.5 such that X 0 ⊂ S D (ε, k) and S ∞ (ε, k) ⊂ X ∞ .
Theorem 4.3.1 applies to the case where ε = 0. It means that
X 0 ⊂ int(S(ε, k)), ∀ k ≥ k0 .
6.3. Proof of Main Results 131
Let
P̂ (ε) = (V −1 )T P (ε)V −1 .
Since ε is now fixed, we denote P̂ (ε), Ŝ p (ε), Ŝ D (ε, k), Ŝ(ε, k) and Ŝ ∞ (ε, k)
as P̂ , Ŝ p , Ŝ D (k), Ŝ(k) and Ŝ ∞ (k), respectively.
Now we consider
This standard form fits just right into Corollary 6.3.1 , so we can be sure
that Ŝ(k) increases as k increases. It follows that
k fˆx̂+
e + w ≥ 1, k fˆx̂−
e + w ≤ −1, or |k fˆx̂w
e + w| ≤ 1,
respectively.
132 Chapter 6. Practical Stabilization
Here we have
1 a2 b 1 + a1 b 2
x̂+
e = , x̂− +
e = −x̂e ,
a1 −b1
and
1 a2 b 1 + a1 b 2
x̂w = w.
e
a1 + b 1 k −b1
If  has no complex eigenvalues, then x̂+ −
e , x̂e ∈ ∂ Ĉ (see Chapter 2), so
x̂+ −
e , x̂e ∈
/ X̂ 0 for any X̂ 0 ⊂ int(Ĉ). But if  has a pair of complex eigen-
values, x̂+ −
e , x̂e ∈ int(Ĉ) and will be in X̂ 0 if X̂ 0 is close enough to Ĉ. So,
it is desirable that x̂+ −
e and x̂e cannot be made stationary by any |w| ≤ D.
This requires
k fˆx̂+
e + w < 1, k fˆx̂−
e + w > −1, ∀ |w| ≤ D,
which is equivalent to
b1
k + w > −1, ∀ |w| ≤ D.
a1
If D < 1, this is satisfied for all k. If D > 1, we need to choose k such that
a1
k> (D − 1).
b1
Note that this will be impossible if b1 =0, which corresponds to the case
where ε = 0. This is one reason that ε should be nonzero.
Finally, as k → ∞, x̂w
e → 0 for all w, |w| ≤ D. Hence k can be chosen
e
∈ X̂ 0 \ X̂ ∞ .
large enough such that x̂w
In summary, from the above analysis, we will restrict ourselves to k such
that
a1 D a2 b 1 + a1 b 2
k > (D − 1), ∈ X ∞. (6.3.28)
b1 a1 + b 1 k −b1
• In the ellipsoid Ŝ p , we have shown in Lemma 6.3.3 that all the tra-
jectories will converge to Ŝ ∞ (k), which can be made arbitrarily small
by increasing the value of k.
Note that xm (k) increases as k increases Then, the vector field in Q(k) has
the following property:
This implies that for any straight line E with slope bb21 , if x̂ ∈ E ∩ Q(k),
then the vector x̂˙ points to the right of E for all w, |w| ≤ D.
Proof. Between the lines k fˆx̂ = D + 1 and k fˆx̂ = −(D + 1), sat(k fˆx̂ + w)
takes value in [−1, 1] and hence,
˙x̂ ∈ −a1 x̂2 b1
+λ : λ ∈ [−1, 1] . (6.3.31)
x̂1 + a2 x̂2 b2
For x̂ ∈ Q(k), if
% & % &
−1 b2 −1 b2
tan − π < Âx̂ < tan
,
b1 b1
134 Chapter 6. Practical Stabilization
0.8
0.6
E
0.4
0.2
^ ^^
k f x=D+1
Sp
Q(k)
0
−0.2
^^
k f x=−(D+1)
−0.4
−0.6
−0.8
−1
−10 −8 −6 −4 −2 0 2 4 6 8 10
then,
% & % &
−1 b2 −1 b2
tan − π < (Âx̂ + λb̂) < tan
, ∀ λ ∈ [−1, 1]. (6.3.32)
b1 b1
Since xm (k) is increasing with k, we see from (6.3.29) that for all k > K,
b2
tan−1 Â D+1
b1 k
% &
−a D+1
=
1 k π −1 b2
< − < tan .
−xm (k) + a2 D+1
k 2 b 1
for all x̂ ∈ Q(k) and |w| ≤ D. With similar arguments, it can be further
verified that
%
&
−xm (k)
min Âx̂ + λb̂ : x̂ ∈ Q(k), λ ∈ [−1, 1] ≥
 D+1 +b
% &k
b2
> tan−1 − π.
b1
b) Denote the region enclosed by the trajectories from ±p1 to ±p2 , and
the straight lines from ±p2 to ∓p1 as Ŝ I (k) (in Fig. 6.3.5, the region
enclosed by the inner closed curve). Then,
lim dist Ŝ I (k), Ŝ(k) = 0.
k→∞
136 Chapter 6. Practical Stabilization
2.5
1.5
0.5
s2 p2 p0 p1 s1
−s −p (p ) −p −s
1 1 3 2 2
−0.5
−1
−1.5
−2
−2.5
−10 −8 −6 −4 −2 0 2 4 6 8 10
then x̂˙ 2 = 0 at p0 and to the left (right) of p0 , x̂˙ 2 < 0 (> 0). Let p1 be a
point on k fˆx̂ = D + 1 between p0 and s1 , then a trajectory starting at p1
goes upward and will return to k fˆx̂ = D + 1 at some p2 between p0 and s2 .
The point p2 is uniquely determined by p1 . We then draw a straight line
from p2 with slope bb21 . Let the intersection of the line with k fˆx̂ = −(D + 1)
be p3 . Clearly, p2 and p3 depend on p1 continuously. And the quantity
(p3 − (−s1 ))1
r(p1 ) :=
(s1 − p1 )1
also depends on p1 continuously. If p1 = s1 , then p2 = s2 . Note that the
trajectories above the line k fˆx̂ = D + 1 are independent of w and hence
6.3. Proof of Main Results 137
are the same as those with w = 0. Since s2 and −s1 are on a trajectory of
(6.3.27) with w = 0, so by Lemma 6.3.4, −s1 must be to the right of the
straight line with slope bb21 that passes s2 . This shows that −s1 is to the
right of p3 (with p1 = s1 ) and hence
If p1 = p0 , then p2 = p0 and
a2 (D+1)
b2 − k − 2(D+1)b
kb2
1
p3 = .
− D+1
k
So,
(s1 )1 + b2 − a2 (D+1) − 2(D+1)b1
r(p1 = p0 ) = k
kb2
.
(s1 )1 − b2 − a2 (D+1)
k
Since s1 and s2 are restricted to the null controllable region Ĉ, there exist
some K1 > 0 and γ > 0 such that, for all k > K1 ,
|p0 − s2 | |p2 − s2 |
≥ 1 + γ =⇒ > 1 + γ. (6.3.35)
|p0 − s1 | |p1 − s1 |
From Fig. 6.3.5 , we see that
2(D+1)
As k → ∞, k → 0, so (p2 − (−p1 ))1 → 0. Since s1 + s2 → 0, we have
|p2 − s2 | − |p1 − s1 | → 0.
From (6.3.35),
|p2 − s2 | − |p1 − s1 | > γ|p1 − s1 |.
c) First we show that Ŝ I (k) is an invariant set. Note that the part of
∂ Ŝ I (k) from p1 to p2 and that from −p1 to −p2 are trajectories of (6.3.27)
under any |w| ≤ D. At any point on the line from p2 to −p1 , Lemma 6.3.4
says that x̂˙ directs to the right side of the line, i.e., no trajectory can cross
the line between p2 and −p1 leftward. By symmetry, no trajectory can cross
the line between −p2 and p1 rightward. These show that no trajectory can
cross ∂ Ŝ I (k) outward, thus Ŝ I (k) is an invariant set. Since Ŝ p is also an
invariant set and any trajectory that starts from inside of it will converge
to Ŝ ∞ (k), it suffices to show that any trajectory that starts from inside of
Ŝ I (k) will enter Ŝ p . We will do this by contradiction.
Suppose that there exist an x̂0 ∈ Ŝ I (k) \ Ŝ p and a w ∈ W such that
ψ(t, x̂0 , w) ∈ Ŝ I (k) \ Ŝ p for all t > 0, then there must be a point x̂∗ ∈
Ŝ I (k) \ Ŝ p such that either
and there is an ε > 0 such that for any T > 0, there exists a t > T
satisfying |ψ(t, x̂0 , w) − x̂∗ | > ε.
2.5
1.5
0.5 ^
s2 p2 q2 Sp q1 p1 s
1
−1
−1.5
−2
−2.5
−10 −8 −6 −4 −2 0 2 4 6 8 10
So we have
and
Ω2 (γ2 ) := xs ∈ Rn : |xs | ≤ γ2 .
For any compact subset X 0 of C a = C 1 × Rn , there exist a γ1 ∈ (0, 1) and
a γ2 > 0 such that X 0 ⊂ Ω1 (γ1 ) × Ω2 (γ2 ). For this reason, we assume,
without loss of generality, that X 0 = Ω1 (γ1 ) × Ω2 (γ2 ).
For an ε > 0, let
P1 (ε) P2 (ε)
P (ε) = ∈ R(2+n)×(2+n)
P2T (ε) P3 (ε)
be the unique positive definite solution to the ARE
AT P + P A − P bbT P + εI = 0. (6.3.36)
142 Chapter 6. Practical Stabilization
AT1 P 1 + P 1 A1 − P 1 b1 bT1 P 1 = 0.
Then,
P1 0
lim P (ε) = .
ε→0 0 0
Let
f (ε) := −bT P (ε).
We first study the following closed-loop system
Recall from Lemma 6.3.3 that the invariant set S p (ε) is in the domain
of attraction of the set S ∞ (ε, k) for sufficiently large k’s.
Moreover,
lim r2 (ε) = ∞,
ε→0
that starts from within X a0 will converge to X a∞ in a finite time and stay
there. Denote the trajectory of (6.3.38) that starts at xa0 as ψ1 (t, xa0 , w)
and define
TM := max min t > 0 : ψ1 (t, xa0 , w) ∈ X a∞ .
xa0 ∈∂ X a0 ,w∈W
then, by Lemma 6.3.6, there exists an ε < ε0 such that r1 (ε) ≥ r1 (ε0 ),
r2 (ε) ≥ γ and
D1 (ε) = x ∈ R2+n : |xa | ≤ r1 (ε), |xs | ≤ r2 (ε) ⊂ S p (ε)
Remark 6.3.1.
1) Similar to Remark 4.4.1, Theorem 6.2.1 is also true for the case that
the open loop system has a single exponentially unstable pole. Let
x, A, b and P (ε) be partitioned similarly to the two exponentially
unstable pole case, with xa , A1 , b1 , P1 (ε) ∈ R. For the anti-stable
sub-system, the controller has the form
2A1 1
u = f1 xa = −k xa , k> .
b1 2
144 Chapter 6. Practical Stabilization
2) For the case that the open-loop system has no exponentially unstable
poles (ANCBC), the controller has the form of u = kf (ε)x and no
switching is necessary. But unlike Remark 4.4.1, we have to increase
k if X 0 is increased or X ∞ is decreased.
6.4. An Example
In this section, we will use an aircraft model to demonstrate the results ob-
tained in this chapter. Consider the longitudinal dynamics of the TRANS3
aircraft under certain flight condition [51],
ż1 0 14.3877 0 −31.5311 z1
ż2 −0.0012 −0.4217 1.0000 −0.0284
z2
ż3 = 0.0002 −0.3816 −0.4658 0 z3
ż4 0 0 1.0000 0 z4
4.526
−0.0337
+
−1.4566 v.
0
The states z1 , z2 , z3 and z4 are the velocity, the angle of attack, the pitch
rate and the Euler angle rotation of aircraft about the inertial y-axis, re-
spectively. The control v is the elevator input, which is bounded by 10
6.4. An Example 145
where
0.0212 0.1670 −0.4650 0.6247
A1 = , A2 = ,
−0.1670 0.0212 −0.6247 −0.4650
and
8.2856 0.7584
b1 = , b2 = .
−2.4303 −1.8562
The system has two stable modes −0.4650 ± j0.6247 and two anti-stable
ones, 0.0212 ± j0.1670. Suppose that w is bounded by |w| ≤ D = 2.
For the anti-stable xa -subsystem, we take γ1 = 0.9. With the technique
used in the proof of Theorem 6.2.1 for second order systems, we obtain a
feedback u = f1 xa , where
f1 = −0.4335 0.2952 ,
such that Ω1 (γ1 ) is inside some invariant set S I . Moreover, for all initial
xa0 ∈ S I , under the control u = f1 xa , xa (t) will enter the ball
X a = xa ∈ R2 : |xa | ≤ 29.8501 .
In Fig. 6.4.1, the outermost closed curve is the boundary of the null con-
trollable region ∂C 1 , the inner dash-dotted closed curve is ∂S I , the dashed
closed curve is ∂Ω1 (γ1 ), and the innermost solid closed curve is ∂X a .
The xs -subsystem is exponentially stable. Under the saturated control,
it can be shown that for any initial value xs0 ∈ R2 , there exists a T > 0
such that xs (t) will enter a bounded ball at time T and remain there. The
bounded ball is computed as
X s = xs ∈ R2 : |xs | ≤ 4 .
We see that, for any (xa0 , xs0 ) ∈ S I × R2 , under the partial feedback
control u = f1 xa , the state (xa , xs ) will enter the set X a × X s in a finite
time and remain there. The next step is to design a full state feedback
to make the set X a × X s inside the domain of attraction of an arbitrarily
small set.
146 Chapter 6. Practical Stabilization
300
200
100
−100
−200
−300
−300 −200 −100 0 100 200 300
and
S p (ε) = x ∈ R4 : xT P (ε)x ≤ 10.3561 .
300
x12
k=2.5
200
100
−100
−200
x
11
−300
−300 −200 −100 0 100 200 300
we choose w(t) = 2 sin(0.1t) and xa0 to be a point very close to the bound-
ary of S I (see the point marked with “◦” in Fig. 6.4.2 and Fig. 6.4.4). We
also set
1000
xs0 = ,
1000
which is very far away from the origin. When k = 2.5, the disturbance is
not satisfactorily rejected (see Fig. 6.4.2 for a trajectory of xa and Fig. 6.4.3
for the time response of |x(t)|). When k = 30, the disturbance is rejected
to a much higher level of accuracy (see Fig. 6.4.4 and Fig. 6.4.5).
6.5. Conclusions
500
450
k=2.5
400
350
300
2
||x(t)||
250
200
150
100
50
0
0 50 100 150 200 250 300 350
time t
300
x12
k=30
200
100
−100
−200
x
11
−300
−300 −200 −100 0 100 200 300
500
450
k=30
400
350
300
2
||x(t)||
250
200
150
100
50
0
0 50 100 150 200 250 300 350
time t
in a direct way to systems with more than two exponentially unstable open
loop poles.
1
f p = f p =
k
and
p = eAT p,
we have,
x11
0 k eAT 1 = 1, (6.A.1)
k
−AT y11
0 k e 1 = 1. (6.A.2)
k
150 Chapter 6. Practical Stabilization
From (6.A.1) and (6.A.2), we can also express x11 and y11 as functions of T .
In other words, x11 and y11 are related to each other through the parameter
T . Since the valid domain of x11 can be finite or infinite depending on
the location of the eigenvalues of A, it is necessary to break the proof for
different cases. We will see soon that the relation among x11 , y11 and T is
quite different for different cases.
1 λ1 − λ2 + λ2 eλ2 T − λ1 eλ1 T
x11 (T ) = · , (6.A.3)
k eλ1 T − eλ2 T
1 λ1 − λ2 + λ2 e−λ2 T − λ1 e−λ1 T
y11 (T ) = · . (6.A.4)
k e−λ1 T − e−λ2 T
Due to the uniqueness of the trajectory, T is also uniquely determined by
x11 . So, x11 ↔ T , x11 ↔ y11 , y11 ↔ T are all one to one maps. From
the above two equations, we know that x11 (T ) and y11 (T ) are analytic on
(0, ∞). It can be verified from (6.A.3) that
λ1 + λ2 a2 λ1
lim x11 = − =− , lim x11 = − = am .
T →0 k k T →∞ k
So we know the valid domain of x11 is (− ak2 , am ). It can also be verified
that
dx11
> 0,
dT
or
dT
> 0.
dx11
Denote
dy11
g(T ) := − ,
dx11
6.A. Proof of Lemma 6.3.1 151
then,
λ1 − λ2 + λ2 eλ1 T − λ1 eλ2 T
g(T ) = .
λ1 − λ2 + λ2 e−λ1 T − λ1 e−λ2 T
It can be verified that
lim g(T ) = 1,
T →0
and
dg λ1 λ2 (eλ1 T − eλ2 T )
= h(T ),
dT (λ1 − λ2 + λ2 e−λ1 T − λ1 e−λ2 T )2
where
h(T ) = (λ1 − λ2 ) 1 − e−(λ1 +λ2 )T + (λ1 + λ2 ) e−λ1 T − e−λ2 T .
dh
= (λ1 + λ2 )e−(λ1 +λ2 ) λ1 − λ2 + λ2 eλ1 T − λ1 eλ2 T > 0, ∀ T > 0,
dT
we have h(T ) > 0, hence
dg
> 0.
dT
This shows that g(T ) > 1 for all T > 0, i.e.,
dy11
< −1.
dx11
Since
dg dg(T ) dx11 d2 y11 dx11
= · =− 2 · ,
dT dx11 dT dx11 dT
and
dg dx11
> 0, > 0,
dT dT
it follows that
d2 y11
< 0.
dx211
then,
1 T
eAT = V V −1 eλT .
0 1
In this case,
1
x11 (T ) = − 1 + λT − e−λT ,
kT
1
y11 (T ) = 1 − λT − eλT .
kT
Similar to Case 1, it can be shown that
2λ a2 λ
lim x11 = − =− , lim x11 = − = am .
T →0 k k T →∞ k
So the valid domain of x11 is (− ak2 , am ). It can also be verified that
dx11
> 0.
dT
Denote
dy11
g(T ) := − ,
dx11
then,
1 − eλT + λT eλT
g(T ) = , lim g(T ) = 1,
1 − e−λT − λT e−λT T →0
and
dg λ2 T
= 2 h(T ),
dT (1 − e−λT − λT e−λT )
where
h(T ) = eλT − e−λT − 2λT.
It can be shown that h(T ) > 0, hence
dg
> 0.
dT
The remaining part is similar to Case 1.
Case 3. The matrix
0 −(α2 + β 2 )
A=
1 2α
has two complex eigenvalues α ± jβ, α, β > 0.
Let
β −α
V = ,
0 1
6.B. Proof of Lemma 6.3.2 153
then,
cos βT − sin βT
eAT = V V −1 eαT .
sin βT cos βT
From (6.A.1) and (6.A.2) we have,
1
x11 (T ) = −β cos βT − α sin βT + βe−αT ,
k sin βT
1
y11 (T ) = β cos βT − α sin βT − βeαT .
k sin βT
The valid domain of T is (0, πβ ), this can be obtained directly from the
vector field and also from the above equations. Notice that
2α a2
lim x11 (T ) = − =− , lim x11 (T ) = ∞.
T →0 k k T→π
β
and
dg (α2 + β 2 ) sin βT
= 2 h(T ),
dT (β − (α sin βT + β cos βT )e−αT )
where
h(T ) = βeαT − βe−αT − 2α sin βT.
It can be verified that h(0) = 0 and
dh
> 0,
dT
thus, % &
dg π
> 0, ∀ T ∈ 0, .
dT β
The remaining part is similar to that in Case 1. 2
1 λ2 − λ1 + λ2 e−λ2 T − λ1 e−λ1 T
x11 (T ) = , (6.B.1)
k e−λ2 T − e−λ1 T
1 λ2 − λ1 + λ2 eλ2 T − λ1 eλ1 T
y11 (T ) = , (6.B.2)
k eλ1 T − eλ2 T
and,
λ2 − λ1 + λ2 e−λ1 T − λ1 e−λ2 T
g(T ) = .
λ2 − λ1 + λ2 eλ1 T − λ1 eλ2 T
By the definition of Td ,
a2 λ1 + λ2
y11 (Td ) = =− .
k k
It can be shown that as T → Td , g(T ) → ∞. Since g(0) = 1 and
dg 2λ1 λ2
= (λ1 + λ2 )[ch(λ1 T − λ2 T ) − 1]
dT (λ2 − λ1 + λ2 eλ1 T − λ1 eλ2 T )2
+ (λ2 − λ1 )[ch(λ2 T ) − ch(λ1 T )] > 0,
where
ea + e−a
ch(a) = ≥1
2
is monotonously increasing, we have that
then,
cos βT − sin βT
eAT = V V −1 e−αT .
sin βT cos βT
π
In this case, Td < β,
1
x11 (T ) = − β cos βT − α sin βT + βeαT ,
k sin βT
1
y11 (T ) = − β cos βT + α sin βT + βe−αT ,
k sin βT
and
β + (β cos βT + α sin βT )e−αT
g(T ) = , T ∈ (0, Td ).
β + (β cos βT − α sin βT )eαT
Since g(0) = 1 and
dg (α2 + β 2 ) sin βT
= 2
dT (β + (β cos βT − α sin βT )eαT )
× 2α sin βT + β(eαT − e−αT ) > 0
we have g(T ) > 1 for all T ∈ (0, Td ). It can also be verified that
dx11
> 0.
dT
For all the above three cases, since g(T ) > 1, i.e.,
dy11 dx11
> , ∀ T ∈ (0, Td )
dT dT
and
x11 (T )
lim = 1.
T →0 y11 (T )
7.1. Introduction
and
x(k + 1) = Ax(k) + Bsat(F x(k)),
157
158 Chapter 7. Estimation of Domain of Attraction
Proof. Since u ∈ co{ui : i ∈ [1, I]} and v ∈ co{v j : j ∈ [1, J ]}, there exist
αi , βj ≥ 0, i = 1, 2, · · · , I, j = 1, 2, · · · , J , such that
I
J
αi = βj = 1,
i=1 j=1
7.3. Some Facts about Convex Hulls 161
and
I
J
u= αi ui , v= βj v j .
i=1 j=1
Therefore,
I I i J
αi u i α
i=1 i u β
j=1 j
u i=1
= J =
v βj v j J j I
j=1 j=1 βj v i=1 αi
J
I
αi βj ui I
J
i=1 j=1 ui
= J = αi βj .
I
αi βj v j vj
i=1 j=1 i=1 j=1
Noting that
I
J I
J
αi βj = αi βj = 1,
i=1 j=1 i=1 j=1
we obtain (7.3.1). 2
is the set of vectors formed by choosing some elements from u and the rest
from v. Given two matrices F, H ∈ Rm×n ,
Di F + Di− H : i ∈ [1, 2m ]
is the set of matrices formed by choosing some rows from F and the rest
from H.
162 Chapter 7. Estimation of Domain of Attraction
%
&
u1 u1 u1 v1 v1
sat ∈ co , , , ;
u2 u2 v2 u2 v2
u1 u1 u1 u1 u1
sat u2 ∈ co u2 , u2 , v2 , v2 ,
u3 u3 v3 u3 v3
v1 v1 v1 v1
u2 , u2 , v2 , v2 ;
u3 v3 u3 v3
..
.
and finally,
sat(u) ∈ co Di u + Di− v : i ∈ [1, 2m ] .
2
In this way, we have placed sat(F x) into the convex hull of a group of linear
feedbacks.
7.4. Continuous-Time Systems under State Feedback 163
2
v u
[ u1] [ u1]
2 2
sat(u)
1
v u
[ v1] [ v 1]
0 2 2
−1
−2
−3
−3 −2 −1 0 1 2 3
ẋ = Ax + Bu.
Under the saturated linear state feedback u = sat(F x), the closed-loop
system is
ẋ = Ax + Bsat(F x). (7.4.1)
For a matrix F ∈ Rm×n , denote the ith row of F as fi and define
L(F ) := x ∈ Rn : |fi x| ≤ 1, i = 1, 2, · · · , m .
If F is the feedback matrix, then L(F ) is the region where the feedback
control u = sat(F x) is linear in x. We call L(F ) the linear region of the
saturated feedback sat(F x), or simply, the linear region of saturation.
For x(0) = x0 ∈ Rn , denote the state trajectory of the system (7.4.1)
as ψ(t, x0 ). The domain of attraction of the origin is then given by
S := x0 ∈ Rn : lim ψ(t, x0 ) = 0 .
t→∞
164 Chapter 7. Estimation of Domain of Attraction
(A + BK1 F )T P + P (A + BK1 F )
1
+ [F T (K2 − K1 ) + P B)][(K2 − K1 )F + B T P ] < 0, (7.4.2)
2
and E(P, ρ) ⊂ L(K1 F ). Then E(P, ρ) is a contractive invariant set and
hence inside the domain of attraction.
7.4.1. To estimate the domain of attraction, we may choose from all these
E(P, ρ) the “largest” one (the one with the largest volume or contains the
largest set with a fixed shape, etc.). A similar condition based on circle
criterion is given in [32]. These conditions are then used for stability and
performance analysis with LMI software in [32,81]. Since the inequality
(7.4.2) is not jointly convex in K1 and P , the two parameters need to be
optimized separately and iteratively. However, it is not guaranteed that
the global optimum can be obtained.
For a given state feedback law u = sat(F x), we will develop a less con-
servative set invariance condition by exploring the special property of the
saturation nonlinearity. It is based on direct Lyapunov function analysis
in terms of an auxiliary feedback matrix H ∈ Rm×n . This condition turns
out to be equivalent to some LMIs. Denote the ith row of H as hi .
then E(P, ρ) is a contractive invariant set and hence inside the domain of
attraction.
It follows that
Ax + Bsat(F x) ∈ co Ax + B(Di F + Di− H)x : i ∈ [1, 2m ] .
166 Chapter 7. Estimation of Domain of Attraction
Therefore,
for all x
= 0. Therefore, for every x ∈ E(P, ρ) \ {0},
Corollary 7.4.1. Given an ellipsoid E(P, ρ), if there exists a positive di-
agonal matrix K1 ∈ Rn×n , 0 < K1 < I, such that
(A + BF )T P + P (A + BF ) < 0,
(A + BF )T P + P (A + BF ) < 0. (7.4.6)
Proof. The “only if” part is obvious. Now we show the “if” part. Here we
have |h(x)| ≤ 1 for all x ∈ E(P, ρ). It follows from Lemma 7.3.2 that for
every x ∈ E(P, ρ),
sat(F x) ∈ co F x, h(x) .
Hence,
xT P (Ax + Bsat(F x)) ≤ max xT P (Ax + Bh(x)), xT P (Ax + BF x) .
Theorem 7.4.2 implies that, for a single input system, the invariance of
an ellipsoid E(P, ρ) under a saturated linear control u = sat(F x) is in some
sense independent of F as long as the condition
(A + BF )T P + P (A + BF ) < 0
is satisfied. In other words, suppose that both F1 and F2 satisfy the condi-
tion
(A + BFi )T P + P (A + BFi ) < 0, i = 1, 2,
then, the maximal invariant ellipsoid E(P, ρ) (with ρ maximized) is the
same under either u = sat(F1 x) or u = sat(F2 x). This means that the
invariance of an ellipsoid depends on its shape rather than a particular
feedback. For a given P > 0, Theorem 11.2.1 of Chapter 11 will give a
way of determining the largest ρ such that E(P, ρ) can be made contractive
invariant with some control u = h(x), |h(x)| ≤ 1.
F F2
3
−1 F1
−2
T
u=−sgn(B Px)
dV/dt
−3
−4
−5
−6
1.5 2 2.5 3 3.5 4 4.5 5
θ
With all the ellipsoids satisfying the set invariance condition in Theo-
rem 7.4.1, we would like to choose from among them the “largest” one as
the least conservative estimate of the domain of attraction, i.e., we would
like to choose from all the E(P, ρ)’s that satisfy the set invariance condition
such that the quantity αR (E(P, ρ)) is maximized, where αR (E(P, ρ)) is the
size of E(P, ρ) with respect to some shape reference set X R . This problem
can be formulated as:
sup α (7.4.8)
P >0,ρ,H
s.t. a) αX R ⊂ E(P, ρ),
b) (A + B(Di F + Di− H))T P + P (A + B(Di F + Di− H)) < 0,
i ∈ [1, 2m ],
c) E(P, ρ) ⊂ L(H).
log det(P/ρ)−1 and remove constraint a), then we obtain the problem of
maximizing the volume of E(P, ρ). The reason for using log det(P/ρ)−1
instead of det(P/ρ)−1 is that the former is a convex function of P . Similar
modification can be made to other optimization problems to be formulated
in this chapter and Chapters 9 and 10. Moreover, the following proce-
dure to transform (7.4.8) into a convex optimization problem with LMI
constraints can be adapted to the corresponding volume maximization (or
minimization) problems.
To solve the optimization problem (7.4.8), we need to transform the set
inclusion constraints a) and c) into inequalities. If X R is a polyhedron,
X R = co x1 , x2 , · · · , xl ,
α2 xTi P xi ≤ ρ, i = 1, 2, · · · , l. (7.4.9)
If X R is an ellipsoid
X R = x ∈ Rn : xT Rx ≤ 1 ,
α2 P ≤ ρR. (7.4.10)
To see this, note that E(P, ρ) ⊂ L(H) if and only if all the hyperplanes
hi x = ±1, i = 1, 2, · · · , m, lie completely outside of the ellipsoid
E(P, ρ) = x ∈ Rn : xT P x ≤ ρ ,
sup α (7.4.13)
P >0,ρ,H
Constraint b) is equivalent to
% &−1 % &−1
P T P
A + B(Di F + Di− H) + A + B(Di F + Di− H) < 0,
ρ ρ
i ∈ [1, 2m ]. (7.4.16)
inf γ (7.4.18)
Q>0,Z
γ xTi
s.t. a1) ≥ 0, i = 1, 2, · · · , l,
xi Q
b) QAT + AQ + (Di F Q + Di− Z)T B T + B(Di F Q + Di− Z) < 0,
i ∈ [1, 2m ],
1 zi
c) ≥ 0, i = 1, 2, · · · , m,
ziT Q
1
α∗ = (γ ∗ )− 2 ,
with H ∗ = Z ∗ (Q∗ )−1 and the resulting invariant set is E((Q∗ )−1 , 1). (Here,
we have, without loss of generality, let ρ = 1.)
We note that if E(Pi , ρi ), i ∈ [1, N ], are all contractive invariant, then
the union of these ellipsoids is also in the domain of attraction. Hence we
can obtain a better estimate by choosing different shape reference sets and
obtaining the union of the resulting invariant ellipsoids, as will be illustrated
in the following example.
7.5. Discrete-Time Systems under State Feedback 173
Let
X R = co 0, x1
−1
with x1 = . We solve (7.4.18) and get α∗ = 4.3711. The maximal
0.8
ellipsoid is E(P ∗ , 1) with
0.1170 0.0627
P∗ = .
0.0627 0.0558
This ellipsoid is plotted in Fig. 7.4.2 in solid curve. The inner dashed
ellipsoid is an invariant set obtained by the circle criterion method in [81]
and the region bounded by the dash-dotted curve is obtained by the Popov
method, also in [81]. We see that both the regions obtained by the circle
criterion and by the Popov method can be actually enclosed in a single
invariant ellipsoid.
To arrive at an even better estimate, we vary x1 over a unit circle, and
solve (7.4.18) for each x1 . Let the optimal α be α∗ (x1 ). The union of all the
resulting ellipsoids gives a better estimate of the domain of attraction. The
outermost dotted boundary in Fig. 7.4.2 is formed by the points α∗ (x1 )x1
as x1 varies along the unit circle.
−2
−4
−6
−8
−5 −4 −3 −2 −1 0 1 2 3 4 5
where
sat(fi x(k))
γi (k) =
fi x(k)
denotes the saturation level. Here, we view γi (k) as the varying gain of each
control channel that takes value between K(i, i) and 1. Then, the quadratic
stability (within E(P, ρ)) of the system is guaranteed by the quadratic sta-
bility of the linear systems corresponding to the 2m vertices of the box of
varying gains, which are
γi = K(i, i) or 1, i = 1, 2, · · · , m.
And each vertex in turn specifies a linear feedback matrix in the set
Di F + Di− KF : i ∈ [1, 2m ] .
Using this idea, Proposition 7.5.1 can be easily shown with some standard
techniques in robustness analysis.
Since K < I, we see that L(F ) is in the interior of L(KF ). Hence, the
condition E(P, ρ) ⊂ L(KF ) allows E(P, ρ) to go beyond the linear region
of the saturation function sat(F x).
Similar to the continuous-time case, we have the following less conser-
vative criterion for an ellipsoid to be contractive invariant.
It follows that
Ax + Bsat(F x) ∈ co Ax + B(Di F + Di− H)x : i ∈ [1, 2m ] .
for all x
= 0. Therefore, for every x ∈ E(P, ρ) \ {0},
(A + BF )T P (A + BF ) − P < 0,
(A + BF )T P (A + BF ) − P < 0. (7.5.5)
and
F = −0.7651 −2.0299 .
Given
5.0127 −0.6475
P = ,
−0.6475 4.2135
178 Chapter 7. Estimation of Domain of Attraction
we would like to find the maximal ρ such that E(P, ρ) is contractive in-
variant. By applying Theorem 7.5.2 and Theorem 11.3.1 with a bisection
search, the maximal E(P, ρ) is E(P, ρ∗ ) with ρ∗ = 2.349.
Let us compare the largest invariant ellipsoid, E(P, ρ∗ ), with estimates
obtained by other methods.
Hx=−1
0.8
Fx=−1
0.6
0.4
0.2
0
x’Px=ρ
Fx=1 1
−0.2
−0.6
Hx=1
−0.8
−1
−1 −0.8 −0.6 −0.4 −0.2 0 0.2 0.4 0.6 0.8 1
sup α (7.5.7)
P >0,ρ,H
s.t. a) αX R ⊂ E(P, ρ),
b) (A + B(Di F + Di− H))T P (A + B(Di F + Di− H)) − P ≤ 0,
i ∈ [1, 2m ],
c) E(P, ρ) ⊂ L(H).
we have
−1 −1
P P − T
ρ ρ (A+B(Di F +Di H))
−1 −1 ≥ 0,
(A+B(Di F +Di− H)) P
ρ
P
ρ
i ∈ [1, 2m ]. (7.5.8)
inf γ (7.5.9)
Q>0,Z
γ xTi
s.t. a1) ≥ 0, i = 1, 2, · · · , l,
xi Q
Q (AQ + B(Di F Q + Di− Z))T
b) ≥ 0,
AQ + B(Di F Q + Di− Z) Q
i ∈ [1, 2m ],
1 zi
c) ≥ 0, i = 1, 2, · · · , m.
ziT Q
ẋ = Ax + B sat(u),
y = Cx,
ż = Ac z + Bc y,
u = Cc z + Dc y.
where
A 0 B
A= , B= , F = Dc C Cc .
Bc C Ac 0
We see that the system (7.6.1) has the form of (7.4.1). Hence all the
methods developed in the previous sections can be utilized to estimate
7.7. Conclusions 181
the domain of attraction of (7.6.1). Very often, the initial condition z(0)
of the output feedback controller is set to 0. This information can be
utilized in choosing the shape reference set to reduce the conservatism of
the estimation. For example, if X R is a polyhedron, its vertices can be
chosen of the form (xi , 0). In this way, X R is in a subspace corresponding
to the x state. If X R is an ellipsoid, then we can choose
R1 0
R= ,
0 R2
with the elements of R2 much larger than those of R1 so that the ellipsoid
is very thin along the direction of the z state.
When an invariant ellipsoid E(P, ρ) is obtained as an estimation of the
domain of attraction, we may restrict out attention to only a subset of
E(P, ρ), namely, its intersection with the subspace z = 0. Suppose that
P1 P2
P = ,
P2T P3
7.7. Conclusions
In this chapter, we have considered the problem of estimating the domain of
attraction for a linear system under a pre-designed saturated linear feedback
law. We used ellipsoids as our estimates of the domain of attraction. A
simple condition was derived in terms of an auxiliary feedback matrix for
determining if a given ellipsoid is contractive invariant. This condition was
shown to be less conservative than the existing conditions which are based
on the circle criterion or the vertex analysis. An important feature of this
new condition is that it can be expressed as LMIs in terms of all the varying
parameters. This makes the problem of maximizing the invariant ellipsoid
with respect to some shape reference set or the volume of the invariant
ellipsoid very easy. With a little modification, this analysis problem can
be turned into a controller design problem, as will be seen in the next
chapter. Moreover, the results in this chapter will be further extended
182 Chapter 7. Estimation of Domain of Attraction
8.1. Introduction
In this chapter, we will present a method for designing feedback gains that
result in large domains of attraction. Our approach is to formulate the
problem into a constrained optimization problem. Since the precise do-
main of attraction under a feedback law is hard to identify, we will first
obtain invariant ellipsoids as estimates of the domain of attraction and
then maximize the estimate over stabilizing feedback laws.
In solving the optimization problem, we will also reveal a surprising
aspect of the design for large domain of attraction. If our purpose is solely
to enlarge the domain of attraction, we might as well restrict the invariant
ellipsoid (an estimate of the domain of attraction) in the linear region of
the saturation function, although allowing saturation will provide us more
freedom in choosing controllers. Another interesting aspect is that, for a
discrete-time system, the domain of attraction can be further enlarged if
the design is performed on its lifted system.
ẋ = Ax + Bu, x ∈ Rn , u ∈ Rm . (8.2.1)
183
184 Chapter 8. On Enlarging the Domain of Attraction
Under the saturated linear state feedback u = sat(F x), the closed-loop
system is given by
ẋ = Ax + Bsat(F x). (8.2.2)
For a fixed F , Chapter 7 provides a method for estimating the domain of
attraction of the origin for the system (8.2.2) by searching for the largest
invariant ellipsoid. In this section, we will design a feedback matrix F
such that this estimate is maximized with respect to a given reference set
X R . By applying Theorem 7.4.1, this optimization problem can be readily
formulated as follows,
sup α (8.2.3)
P >0,ρ,F,H
s.t. a) αX R ⊂ E(P, ρ),
b) (A + B(Di F + Di− H))T P + P (A + B(Di F + Di− H)) < 0,
i ∈ [1, 2m ],
c) E(P, ρ) ⊂ L(H).
The only difference of the optimization problem (8.2.3) from (7.4.8) is
that (8.2.3) has an extra optimization parameter F . Denote the supremum
of α as α∗1 .
Let us consider a simpler optimization problem
sup α (8.2.4)
P >0,ρ,F
s.t. a) αX R ⊂ E(P, ρ),
b) (A + BF )T P + P (A + BF ) < 0,
c) E(P, ρ) ⊂ L(F ).
Denote the supremum of α in (8.2.4) as α∗2 . Recalling that Di + Di− = I,
we can view (8.2.4) as a result from forcing F = H in (8.2.3). It follows
that
α∗2 ≤ α∗1 .
What is somewhat surprising is that,
α∗1 = α∗2 .
This can be seen as follows. Suppose that (α1 , ρ1 , P1 , F1 , H1 ) satisfies the
constraints of (8.2.3). By choosing Di = 0, then we obtain from b) of
(8.2.3) the following inequality,
(A + BH1 )T P + P (A + BH1 ) < 0.
8.3. Discrete-Time Systems 185
Under the saturated linear state feedback u = sat(F x), the closed-loop
system is given by
sup α (8.3.3)
P >0,ρ,F
s.t. a) αX R ⊂ E(P, ρ),
b) (A + BF )T P (A + BF ) − P < 0,
c) E(P, ρ) ⊂ L(F ).
and
u(kL)
u(kL + 1)
xL (k) = x(kL), uL (k) = .. ,
.
u(kL + L − 1)
we obtain the lifted L-step system
Let
uL (k) = sat(FL xL (k)), FL ∈ RmL×n ,
be a stabilizing feedback. Under this feedback law, the closed-loop system
is given by
xL (k + 1) = AL xL (k) + BL sat(FL xL (k)). (8.3.5)
Note that the control
is periodic for the original system (8.3.1) with period L. Under this control,
if E(P, ρ) is an invariant set for the original unlifted system, then it is also
invariant for the lifted system. But an invariant set E(P, ρ) for the lifted
system need not be invariant for the state of the original system. However,
the domain of attraction is the same for both the lifted system and the
original system. This can be seen as follows. Clearly, x(k) → 0 implies
xL (k) → 0. On the other hand, if xL (k) → 0, then uL (k) → 0 and x(kL+j),
j = 1, 2, · · · , L, will be arbitrarily close to x(kL) = xL (k). Hence, we also
have x(k) → 0. Because of this, we can enlarge the domain of attraction
by enlarging the invariant ellipsoid for the lifted closed-loop system.
Similar to the one-step case, the problem of maximizing the invariant
ellipsoid can be described as
sup α (8.3.6)
P >0,FL
8.3. Discrete-Time Systems 187
Proof.
Case 1. p = 1
Denote the set of feasible (α, P ) satisfying constraints a), b) and c) as
Φ(L) = (α, P ) : there exists an FL s.t. a), b) and c) are satisfied .
and
(A + BF )T P (A + BF ) − P < 0, (8.3.8)
which is equivalent to
P (A + BF )T
> 0,
A + BF P −1
and to
(A + BF )P −1 (A + BF )T − P −1 < 0. (8.3.9)
Let
F
F (A + BF )
FL = .. ,
.
F (A + BF )L−1
then,
AL + BL FL = AL + AL−1 BF + AL−2 BF (A + BF ) + · · ·
+BF (A + BF )L−1
= (A + BF )L .
188 Chapter 8. On Enlarging the Domain of Attraction
fL j P −1 fL Tj = fi (A + BF )q P −1 ((A + BF )T )q fiT .
which shows that P and FL also satisfy constraint c). Hence, (α, P ) ∈ Φ(L).
Case 2. p > 1
Let
Ap = Ap , Bp = [ Ap−1 B Ap−2 B ···B]
and
Ap L = ApL , Bp L = [ ApL−1 B ApL−2 B · · · B ],
then,
Ap L = (Ap )L , Bp L = [ AL−1
p Bp AL−2
p Bp · · · Bp ].
Suppose we first lift the system (8.2.1) with step p to get xp (k) = x(kp)
and
xp (k + 1) = Ap xp (k) + Bp sat(up (k)),
and then lift the above system with step L to get xp L (k) = xp (kL) = x(kLp)
and
xp L (k + 1) = Ap L xp L (k) + Bp L sat(up L (k)).
Applying the result in Case 1, we immediately have α∗ (p) ≤ α∗ (pL). 2
8.3. Discrete-Time Systems 189
Remark 8.3.1. The equality α∗ (p) = α∗ (pL) with L > 1 can occur in
some special cases. For example, let A = a > 1, B = 1, and X R = [−1, 1].
It can be verified that
1
α∗ (L) = , ∀ L ≥ 1.
a−1
From the above theorem, we see that
and
α∗ (1) ≤ α∗ (3) ≤ α∗ (6) ≤ α∗ (12) · · · .
But α∗ (L1 ) ≤ α∗ (L2 ) does not necessarily hold for all L1 < L2 .
L 1 2 4 8 16 32
α∗ (L) 1.0650 1.0930 1.1896 1.4017 1.5164 1.5426
0.8
0.6
0.4
0.2
−0.2
−0.4
−0.6
−0.8
−1
−2 −1.5 −1 −0.5 0 0.5 1 1.5 2
The eigenvalues of (Ā + B̄ F̄o )T Po (Ā + B̄ F̄o ) − Po are −0.0006 and −0.4293.
The eigenvalues of Ā + B̄ F̄o are 0.2758 ± j0.8814. These show that the
convergence rate is slow.
vertices of X R (see Fig. 8.3.2, where the vertices of X R are marked with
“∗” and the vertices of the null controllable region are marked with “·”).
5 3
x2
x3
0 0
−1
−2
−5 −3
−40 −20 0 20 40 −5 0 5
x x
1 2
2 5
1
0
x3
x3
−1
−5
5
−2 50
0
0
−3
−40 −20 0 20 40 x −5 −50 x
2 1
x1
L 1 2 4 8 16 32
α∗ (L) 0.4274 0.4382 0.4593 0.4868 0.5564 0.6041
8.4. Conclusions
5 3
x2
x3
0 0
−1
−2
−5 −3
−40 −20 0 20 40 −5 0 5
x x
1 2
2 5
1
0
x3
x3
−1
−5
5
−2 50
0
0
−3
−40 −20 0 20 40 x2 −5 −50 x1
x1
5 3
1
x2
x3
0 0
−1
−2
−5 −3
−40 −20 0 20 40 −5 0 5
x1 x2
2 5
1
0
x3
x3
−1
−5
5
−2 50
0
0
−3
−40 −20 0 20 40 x2 −5 −50 x1
x1
trate the effectiveness of the design method. Also shown in the examples is,
however, the slow convergence rate of the resulting feedback system. This
issue will be discussed in detail in Chapter 11.
194 Chapter 8. On Enlarging the Domain of Attraction
Chapter 9
Semi-Global Stabilization
with Guaranteed
Regional Performance
9.1. Introduction
195
196 Chapter 9. Semi-Global Stabilization with Regional Performance
has the desired performance. We would like to study the stability and
performance of the system in the presence of actuator saturation,
Let D0 be an invariant set of the closed-loop system and be inside the linear
region of the saturation function, i.e.,
D 0 ⊂ x ∈ Rn : |F0 (x)|∞ ≤ 1 .
Suppose that D 0 is in the linear region, then it is an invariant set and within
D0 , the saturation function does not have an effect and hence the desired
closed-loop performance is preserved.
The objective of this chapter is to construct feedback laws that semi-
globally stabilize the system (9.1.1) on its null controllable region and in
the mean time preserve the desired closed-loop performance in the region
D0 . We will design our controllers by combining a sequence of feedback
laws
u = Fi (x), i = 0, 1, · · · , M,
in such a way that the union of the invariant sets corresponding to each
of the feedback laws is also an invariant set, which is shown to be in the
domain of attraction. By appropriately selecting this sequence of feedback
laws, the union of these invariant sets can then be made large enough to
include any subset in the interior of the null controllable region. This idea
will be made feasible by the use of lifting technique.
This chapter is organized as follows. In Section 9.2, we propose a method
for expanding the domain of attraction by switching among a finite sequence
of feedback laws. This switching design is then used in Section 9.3 to show
that the domain of attraction can be enlarged to include any subset in the
interior of the null controllable region. Section 9.4 extends the results of
9.2. Expansion of the Domain of Attraction 197
Denote
i
Ωi = Dj , i = 0, 1, · · · M.
j=0
Then, Ω0 ⊂ Ω1 ⊂ · · · ⊂ ΩM .
where
F0 (x), if x ∈ Ω0 ,
F (x), if x ∈ Ω1 \ Ω0 ,
1
Gi (x) := (9.2.3)
.. ..
. .
Fi (x), if x ∈ Ωi \ Ωi−1 .
Here we note that, for each i = 1, 2, · · · , M ,
i−1
Ωi \ Ωi−1 = Di \ Dj .
j=0
198 Chapter 9. Semi-Global Stabilization with Regional Performance
If x(0) ∈ Ωi , then under the feedback u = Gi (x), x(k) ∈ Ωi for all k and
lim x(k) = 0,
k→∞
lim x(k) = 0.
k→∞
ẋ = f (x, u),
in particular,
ẋ = Ax + Bsat(u), (9.2.5)
with a sequence of stabilizing feedback laws
u = Fi (x), i = 0, 1, · · · , M.
9.3. Semi-Globalization – Discrete-Time Systems 199
The null controllable region, denoted as C, is the set of states that can be
steered to the origin in a finite number of steps. We also have
C= C(K).
K≥0
where
u(kL)
xL (k) = x(kL), uL (k) = u(kL + 1) ,
u(kL + L − 1)
and
AL = AL , BL = [AL−1 B AL−2 B · · · AB B]. (9.3.3)
We have more flexibility in the design of a system by using the lifting
technique because it allows us to see further the effect of a control law and
to consider the combined effect of the control actions at several steps.
For a feedback matrix F ∈ RmL×n , denote the unsaturated region (lin-
ear region) of the closed-loop system
as
L(F ) := x ∈ Rn : |fj x| ≤ 1, j = 1, 2, · · · , mL ,
where fj is the jth row of F . If L ≥ n0 , then there exists an F such that
AL + BL F = 0.
Hence L(F ) is an invariant set of the lifted system (9.3.4) and is inside the
domain of attraction.
For a positive definite matrix P ∈ Rn×n , denote
E(P ) = x ∈ Rn : xT P x ≤ 1 .
Suppose that E(P ) ⊂ L(F ), then under the feedback law uL = F xL , E(P ) is
also an invariant set inside the domain of attraction. Here we are interested
in the ellipsoids because they can be generalized to the Lyapunov level sets
for the case AL + BL F
= 0. We will show that any compact subset of the
null controllable region can be covered by the union of a finite sequence of
such ellipsoids.
E(Pi ) ⊂ L(Fi ), i = 1, 2, · · · , M,
9.3. Semi-Globalization – Discrete-Time Systems 201
and
M
βC(L) ⊂ E(Pi ),
i=1
where
βC(L) = βx : x ∈ C(L) .
Proof. Let ∂(βC(L)) be the boundary of βC(L). First, we show that, there
exists an ε > 0 such that, for any x1 ∈ ∂(βC(L)), there exist an F ∈ RmL×n
and a P > 0 that satisfy
AL + BL F = 0,
and
B(x1 , ε) ⊂ E(P ) ⊂ L(F ),
where
B(x1 , ε) = x ∈ Rn : |x − x1 | ≤ ε .
Let e be the unit vector in Rn whose th element is 1 and other elements
are zeros. For simplicity, assume x1 = γe1 , otherwise we can use a unitary
transformation x → V x, V T V = I, to satisfy this. Note that a unitary
transformation is equivalent to rotating the state space and does not change
the shapes of B(x1 , ε), E(P ) and C(L).
Since x1 = γe1 ∈ βC(L), it follows from (9.3.1) and (9.3.3) that there
exists a uL 1 ∈ RmL , |uL 1 |∞ ≤ β, such that
AL γe1 + BL uL 1 = 0. (9.3.5)
Define
max{|x| : x ∈ ∂C(L)}
µ= .
min{|x| : x ∈ ∂C(L)}
Since L ≥ n0 , C(L) includes the origin in its interior and hence µ < ∞. It
follows that γe ∈ µβC(L) for all ≥ 2. Therefore, for each ≥ 2, there
exists a uL ∈ RmL , |uL |∞ ≤ µβ, such that
AL γe + BL uL = 0. (9.3.6)
1
F = uL 1 uL 2 · · · uL n ,
γ
202 Chapter 9. Semi-Global Stabilization with Regional Performance
then,
β µβ
|fj1 | ≤ , |fj | ≤ , ∀ = 2, · · · , n, j = 1, 2, · · · , mL.
γ γ
From (9.3.5) and (9.3.6), we have
1
(AL + BL F )e = AL e + BL uL = 0, = 1, 2, · · · , n.
γ
This shows that AL + BL F = 0.
Let
p1 0
P = ,
0 p2 In−1
where
% &2 % &2 % &−1
1 2β βµ (β + 1)2
p1 = , p2 = (n − 1) 1− .
γ2 β+1 γ 4
Let
γmin = min |x| : x ∈ ∂(βC(L))
and
% & '
−1
2β 2β 2 n − 1)βµ
ε= 1− γmin max ,' .
β+1 β+1 4 − (β + 1)2
Then,
1
2 2β
P ε ≤ 1 − .
β+1
Note that ε is independent of γ and a particular x1 .
We also have
% &2 % &2
1 2 1 1 β n − 1 βµ
fj P −1 fjT = fj1 + Σn=2 fj2
≤ + = 1, (9.3.7)
p1 p2 p1 γ p2 γ
which implies that E(P ) ⊂ L(F ). To see this, we verify that, for any
x ∈ E(P ),
1
1 1 1
|fj x| = fj P − 2 P 2 x ≤ fj P −1 fjT 2 (xT P x) 2 ≤ 1.
M
∂(βC(L)) ⊂ B(xi , ε).
i=1
By the foregoing proof, we know that for each xi ∈ ∂(βC(L)), there exist
an Fi and a Pi such that AL + BL Fi = 0 and
Hence,
M
∂(βC(L)) ⊂ E(Pi ).
i=1
To see this, for any x ∈ βC(L), let y be the intersection of ∂(βC(L)) with
the straight line passing through the origin and x. Then, y ∈ E(Pi0 ) for
some i0 . Since E(Pi0 ) is convex and contains the origin, x ∈ E(Pi0 ). 2
Remark 9.3.1. We would like to point out that, the family of Fi ’s may
contain repeated members with different Pi ’s. This is the case, for example,
when the system has a single input (m = 1) and the lifting step L is the
same as n, the dimension of the state space. In this case, we have only a
−1
unique Fi = −BL AL with C(L) ⊂ L(Fi ).
cover βC(L). Actually, the lifting step can be reduced if we replace the
dead-beat condition AL + BL F = 0 with a less restrictive one:
(AL + BL F )T P (AL + BL F ) − cP ≤ 0,
Because of (9.3.8) and (9.3.9), E(Pi ) is an invariant set inside the domain
of attraction for the closed-loop system
The null controllable region at time T , denoted as C(T ), is the set of states
that can be steered to the origin in time T by an admissible control u ∈ Ua .
The null controllable region, denoted as C, is given by
C(T ).
T ≥0
and
h
xh (k + 1) = Ah + eA(h−τ ) BF (τ )dτ xh (k). (9.4.4)
0
A(h−τ ) A(h−τ )
Ah + e BF (τ )dτ P Ah + e BF (τ )dτ − P < 0.
0 0
Note that P can be scaled such that E(P ) ⊂ L(F ). In this case, E(P ) is an
invariant set inside the domain of attraction for the system (9.4.3). Since
for all xh (k) ∈ E(P ), the control is linear in xh (k), so, when xh (k) tends
to the origin, the control uh (k, τ ) = F (τ )xh (k) will gets smaller and hence
the state of the original system (9.4.1) between t = kh and t = (k + 1)h will
stay close to xh (k). Similar to the discrete-time case, we have the following
lemma.
Lemma 9.4.1. Given h > 0 and a positive number β < 1, there exists a
family of Fi ∈ F m×n , i = 1, 2, · · · , M , with corresponding positive definite
matrices Pi ’s, such that
h
Ah + eA(h−τ )BFi (τ )dτ = 0,
0
E(Pi ) ⊂ L(Fi ), i = 1, 2, · · · , M,
and
M
βC(h) ⊂ E(Pi ).
i=1
Proof. The idea of the proof is the same as that of Lemma 9.3.1. Here we
just show how to construct ε, F and P for a given x1 ∈ ∂(βC(h)). We also
assume that x1 = γe1 . Since γe1 ∈ ∂(βC(h)), there exists a u1 ∈ F m×1 ,
|u1 (τ )|∞ ≤ β for all τ ∈ [0, h), such that
h
Ah γe1 + eA(h−τ )Bu1 (τ )dτ = 0,
0
9.5. An Example 207
and for ≥ 2, there exists a u ∈ F m×1 , |u (τ )|∞ ≤ βµ for all τ ∈ [0, h),
such that h
Ah γe + eA(h−τ )Bu (τ )dτ = 0.
0
Let
1
u1 u2 · · · un ,
F =
γ
and P and ε be the same as those in the proof of Lemma 9.3.1, the remaining
part of the proof will be the same as that of Lemma 9.3.1 except that (9.3.7)
is replaced by
2
The following is the counterpart of Theorem 9.3.1 for the discrete-time
system (9.1.1).
h h
Ah + eA(h−τ ) BFi (τ )dτ Pi Ah + eA(h−τ ) BFi (τ )dτ − cPi ≤ 0,
0 0
E(Pi ) ⊂ L(Fi ), i = 1, 2, · · · , M,
and
M
X0 ⊂ E(Pi ).
i=1
9.5. An Example
Consider the system (9.1.1) with
0.8876 −0.5555 −0.1124
A= , B= .
0.5555 1.5542 0.5555
208 Chapter 9. Semi-Global Stabilization with Regional Performance
with Q = I and R = 1 is
u = F0 (x) = −0.2630 −2.1501 x.
9.6. Conclusions
In this chapter, we proposed a control design method for semi-global sta-
bilization on the null controllable region with guaranteed regional perfor-
mance. This design method applies to general (possibly exponentially un-
stable) systems in either continuous-time or discrete-time. The resulting
feedback laws expand the domain of attraction achieved by an a priori de-
signed feedback law to include any bounded set in the interior of the null
controllable region, while preserving the desired performance of the original
feedback law in a fixed region.
9.6. Conclusions 209
1.5
0.5
−0.5
−1
−1.5
−1.5 −1 −0.5 0 0.5 1 1.5
1.5
0.5
−0.5
−1
−1.5
−1.5 −1 −0.5 0 0.5 1 1.5
0.5
−0.5
−1
−1.5
0 10 20 30 40 50 60 70 80 90
0.8
0.6
0.4
0.2
−0.2
−0.4
−0.6
−0.8
−1
0 10 20 30 40 50 60 70 80
Disturbance Rejection
with Stability
10.1. Introduction
In this chapter, we will study the following linear systems subject to actu-
ator saturation and persistent disturbances,
and
x(k + 1) = Ax(k) + Bsat(u(k)) + Ew(k), (10.1.2)
211
212 Chapter 10. Disturbance Rejection with Stability
for example, for the class of disturbances with finite energy in [32]. In this
chapter, we will deal with persistent disturbances.
In addition, we are also interested in the problem of asymptotic distur-
bance rejection with nonzero initial states. This is the problem of designing
a feedback law that, in the presence of disturbances, causes closed-loop sys-
tem trajectories starting from a set to converge to a smaller set. A related
problem was addressed in [85] and Chapter 6, where the disturbances are
input additive and enter the system before the saturating actuators, i.e.,
the system has the following state equation,
ẋ = Ax + Bsat(u + w).
Reference [85] deals with the class of systems with A having all its eigen-
values in the closed left-half plane, and Chapter 6 deals with exponentially
unstable systems with A having two or fewer eigenvalues in the open right-
half plane. It is shown in [85] and Chapter 6 that there exist a family
of linear feedbacks that achieve asymptotic disturbance decoupling. This
is in the sense that, given any positive number D, any compact subset
X 0 of the asymptotically null controllable region and any arbitrarily small
neighborhood X ∞ of the origin, there is a feedback control from this fam-
ily such that any trajectory starting from within X 0 will enter X ∞ in a
finite time for all the disturbance w, w∞ ≤ D. We, however, could not
expect to have this nice result for the systems (10.1.1) and (10.1.2), where
the disturbance enters the system after the saturating actuator. If w or E
is sufficiently large, it may even be impossible to keep the state bounded.
What we expect is to have a set X 0 (as large as we can get) and a set X ∞
(as small as we can get) such that all the trajectories starting from within
X 0 will enter X ∞ in a finite time and remain in it thereafter.
This chapter is organized as follows. Section 10.2 deals with continuous-
time systems. Several problems related to set invariance in the presence
of disturbances and disturbance rejection are formulated in Section 10.2.1.
The solutions to these problems are given in Section 10.2.2 and 10.2.3. In
particular, Section 10.2.2 gives a condition for set invariance in the presence
of disturbances. Section 10.2.3 solves the problem of disturbance rejection
with guaranteed domain of attraction. An example is presented in Sec-
tion 10.2.4 to illustrate the effectiveness of the design methods. Section 10.3
deals with discrete-time systems and parallels Section 10.2. Section 10.4
draws a brief conclusion to the chapter.
10.2. Continuous-Time Systems 213
where, for simplicity and without loss of generality, we assume that the
bounded disturbance w belongs to the set
W := w : w(t)T w(t) ≤ 1, ∀ t ≥ 0 .
For an initial state x(0) = x0 , denote the state trajectory of the closed-loop
system in the presence of w as ψ(t, x0 , w). A set in Rn is said to be invariant
if all the trajectories starting from within it will remain in it regardless of
w ∈ W. Let P ∈ Rn×n be a positive definite matrix and let V (x) = xT P x.
An ellipsoid E(P, ρ) is said to be strictly invariant if
for all x ∈ ∂E(P, ρ) and all w, wT w ≤ 1. The notion of invariant set plays
an important role in studying the stability and other performances of a
system (see [7,9] and the references therein). If an invariant set is bounded,
then all the trajectory starting from within it will be bounded.
Our primary concern is the boundedness of the trajectories for some (as
large as possible) set of initial states. This requires a large invariant set.
On the other hand, for the purpose of disturbance rejection, we would also
like to have a small invariant set containing the origin in its interior so that
a trajectory starting from the origin will stay close to the origin.
To formally state the objectives of this section, we need to extend the
notion of the domain of attraction as follows.
where
dist(ψ(t, x0 , w), X ) = inf |ψ(t, x0 , w) − x|
x∈X
is the distance from ψ(t, x0 , w) to X .
In the above definition, | · | can be any vector norm. The problems we
are to address in this section can be formulated as follows.
Problem 10.2.1 (Set invariance analysis). Let F be known. Determine if
a given ellipsoid E(P, ρ) is (strictly) invariant.
Problem 10.2.2 (Invariant set enlargement). Given a shape reference set
X 0 ⊂ Rn , design an F such that the closed-loop system has a bounded
invariant set E(P, ρ) which contains α2 X 0 with α2 maximized.
Problem 10.2.3 (Disturbance rejection). Given a shape reference set
X ∞ ⊂ Rn , design an F such that the closed-loop system has an invari-
ant set E(P, ρ) ⊂ α3 X ∞ with α3 minimized. Here we can also take X ∞ to
be the (possibly unbounded) polyhedron
x ∈ Rn : |ci x| ≤ 1, i = 1, 2, · · · , p .
and E(P, ρ) ⊂ L(H), then E(P, ρ) is a (strictly) invariant set for the system
(10.2.2).
Proof. We will prove the strict invariance. That is, for V (x) = xT P x, we
will show that
Following the procedure of the proof of Theorem 7.4.1, we can show that
for each x ∈ E(P, ρ),
sup α2 (10.2.4)
P >0,ρ,η>0,F,H
s.t. a) α2 X 0 ⊂ E(P, ρ),
b) (A + B(Di F + Di− H))T P + P (A + B(Di F + Di− H))
1 η
+ P EE T P + P < 0, i ∈ [1, 2m ],
η ρ
c) |hi x| ≤ 1, ∀ x ∈ E(P, ρ), i ∈ [1, m].
216 Chapter 10. Disturbance Rejection with Stability
Let % &−1
P
Q= , Y = F Q, Z = HQ,
ρ
then, constraint b) is equivalent to
inf α3 (10.2.5)
P >0,ρ,η>0,F,H
s.t. a) E(P, ρ) ⊂ α3 X ∞ ,
b) (A + B(Di F + Di− H))T P + P (A + B(Di F + Di− H))
1 η
+ P EE T P + P < 0, i ∈ [1, 2m ],
η ρ
c) |hi x| ≤ 1, ∀ x ∈ E(P, ρ), i ∈ [1, m],
trajectories starting from within E(P, ρ2 ) will enter E(P, ρ1 ) and remain in-
side it. This can be seen as follows. If all the ellipsoids E(P, ρ), ρ ∈ [ρ1 , ρ2 ],
are strictly invariant, then we have
It follows that all the trajectories starting from within E(P, ρ2 ) will enter
E(P, ρ1 ) and remain inside it.
Theorem 10.2.2. Given two ellipsoids, E(P, ρ1 ) and E(P, ρ2 ), ρ2 > ρ1 >
0, if there exist H1 , H2 ∈ Rm×n and a positive η such that
Proof. Let h1,i and h2,i be the ith row of H1 and H2 respectively. The
conditions E(P, ρ1 ) ⊂ L(H1 ) and E(P, ρ2 ) ⊂ L(H2 ) are equivalent to
1 1
ρ1 h1,i ρ2 h2,i
≥ 0, ≥ 0, i = 1, 2, · · · , m.
hT1,i P hT2,i P
Let
H = λH1 + (1 − λ)H2 .
Clearly
1
ρ hi
≥ 0.
hTi P
From (10.2.6) and (10.2.7), and by convexity, we have (10.2.8). 2
In view of Theorem 10.2.2, to solve Problem 10.2.4, we only need to
construct two invariant ellipsoids E(P, ρ1 ) and E(P, ρ2 ) satisfying the con-
dition of Theorem 10.2.2 such that X 0 ⊂ E(P, ρ2 ) and E(P, ρ1 ) ⊂ α4 X ∞
with α4 minimized. Since ρ2 can be absorbed into other parameters, we
assume for simplicity that ρ2 = 1 and ρ1 < 1. Problem 10.2.4 can then be
formulated as
inf α4 (10.2.9)
P >0,0<ρ1 <1,η>0,F,H1 ,H2 ,
s.t. a) X 0 ⊂ E(P, 1), E(P, ρ1 ) ⊂ α4 X ∞ ,
b) (A + B(Di F + Di− H1 ))T P + P (A + B(Di F + Di− H1 ))
1 η
+ P EE T P + P < 0, i ∈ [1, 2m ],
η ρ1
c) (A + B(Di F + Di− H2 ))T P + P (A + B(Di F + Di− H2 ))
1
+ P EE T P + ηP < 0, i ∈ [1, 2m ],
η
d) |h1,i x| ≤ 1, ∀ x ∈ E(P, ρ1 ), i ∈ [1, m],
e) |h2,i x| ≤ 1, ∀ x ∈ E(P, 1), i ∈ [1, m].
γ = α24 , Q = P −1 , Y = F Q, Z1 = H1 Q, Z2 = H2 Q,
and denote the ith row of Z1 and Z2 as z1,i , z2,i , respectively. Then, the
above problem is equivalent to
inf γ (10.2.10)
Q>0,0<ρ1 <1,η>0,Y,Z1 ,Z2 ,
10.2.4. An Example
The system has a pair of unstable complex poles. We first ignore the
disturbance and use the method of Chapter 8 (by solving (8.2.3)) to find
a feedback with the objective of maximizing the domain of attraction with
respect to the unit ball, X R = E(I, 1). The result is,
α∗1 = 2.4417,
0.0752 −0.0566
P1∗ = ,
−0.0566 0.1331
and
F1∗ = −0.0025 −0.2987 .
The invariant ellipsoid E(P1∗ , 1) is plotted in Fig. 10.2.1 as the larger ellip-
soid. As a comparison, we also plotted the boundary of the null controllable
region of the open loop system (see the dashed outer curve).
We next deal with Problem 10.2.2. By solving (10.2.4) with X 0 being
a unit ball, we obtain α∗2 = 2.3195, with η2∗ = 0.019. The corresponding
feedback matrix is
F2∗ = −0.0009 −0.3110 .
220 Chapter 10. Disturbance Rejection with Stability
−1
−2
−3
−4
−5
−6 −4 −2 0 2 4 6
Figure 10.2.1: The invariant ellipsoids and the null controllable region.
which is very large as compared with F2∗ , the one that achieves large domain
of attraction.
Now we turn to Problem 10.2.4. The optimization result from solving
Problem 10.2.2 gives us some guide in choosing X 0 . If X 0 is a disk, we
know that X 0 must be smaller than the maximal disk contained in E(P2∗ , 1),
which is, E I, (α∗2 )2 = E(I, 2.31952). Here we choose X 0 = E(I, 22 ) and
X ∞ = E(I, 1). To obtain the optimal solution, we vary η from 0 to ∞ and
10.3. Discrete-Time Systems 221
and
0.1145 −0.0922
P4∗ = .
−0.0922 0.1872
In Fig. 10.2.2, the larger ellipsoid is E(P4∗ , 1), the smaller ellipsoid is
E(P4∗ , ρ1 ) and the outermost dashed closed-curve is the boundary of the
null controllable region. A trajectory is plotted from x0 ∈ ∂E(P4∗ , 1) with
w =sign(sin(0.3t)).
−1
−2
−3
−4
−5
−6 −4 −2 0 2 4 6
where
dist(ψ(k, x0 , w), X ) = inf |ψ(k, x0 , w) − x|
x∈X
and E(P, ρ) ⊂ L(H), then E(P, ρ) is a (strictly) invariant set for the system
(10.3.2).
Proof. We prove the strict invariance. That is, we will show that
sup α2 (10.3.5)
P >0,ρ,η>0,F,H
s.t. a) α2 X 0 ⊂ E(P, ρ),
b) (1 + η)(A + B(Di F + Di− H))T P (A + B(Di F + Di− H))
% % T & &
1+η E PE
+ λmax − 1 P ≤ 0, i ∈ [1, 2m ],
η ρ
c) |hi x| ≤ 1, ∀ x ∈ E(P, ρ), i ∈ [1, m].
Let % &−1
P
Q= , Y = F Q, Z = HQ,
ρ
10.3. Discrete-Time Systems 225
such that
1+η
1− η λ Q (AQ + BDi Y + BDi− Z)T
≥ 0,
AQ + BDi Y + BDi− Z 1
1+η Q
and
λ ET
≥ 0.
E Q
If we fix η and λ, then the original optimization constraints can be
transformed into LMIs as with (7.4.8). The global maximum of α2 will be
obtained by running η from 0 to ∞ and λ from 0 to 1+ηη
. Since ρ can be
absorbed into other parameters, we simply set ρ = 1.
For Problem 10.3.3, we have
inf α3 (10.3.6)
P >0,ρ,η>0,F,H
s.t. a) E(P, ρ) ⊂ α3 X ∞ ,
b) (1 + η)(A + B(Di F + Di− H))T P (A + B(Di F + Di− H))
% % T & &
1+η E PE
+ λmax − 1 P ≤ 0, i ∈ [1, 2m ],
η ρ
c) |hi x| ≤ 1, ∀ x ∈ E(P, ρ), i ∈ [1, m],
Theorem 10.3.2. Given two ellipsoids, E(P, ρ1 ) and E(P, ρ2 ), ρ2 > ρ1 > 0,
if there exist H1 , H2 ∈ Rm×n and a positive η such that
Proof. Let h1,i and h2,i be the ith rows of H1 and H2 respectively. The
conditions E(P, ρ1 ) ⊂ L(H1 ) and E(P, ρ2 ) ⊂ L(H2 ) are equivalent to
1 1
ρ1 h1,i ρ2 h2,i
≥ 0, ≥ 0, i = 1, 2, · · · , m.
hT1,i P hT2,i P
and
1+η
1− T
ρ2 η λmax (E P E) P (A + B(Di F + Di− H2 ))T
> 0,
A + B(Di F + Di− H2 ) P −1
by convexity, we have
1 − 1+η T
ρη λmax (E P E) P (A + B(Di F + Di− H))T
> 0,
A + B(Di F + Di− H) P −1
inf α4 (10.3.10)
P >0,0<ρ1 <1,η>0,F,H1 ,H2 ,
s.t. a) X 0 ⊂ E(P, 1), E(P, ρ1 ) ⊂ α4 X ∞ ,
1 − 1+η
ρ η λmax (E T
P E) P (A + B(D i F + D −
i H 1 ))T
b) 1 > 0,
A + B(Di F + Di− H1 ) P −1
i ∈ [1, 2m ],
1 − 1+η
η λmax (E T
P E) P (A + B(Di F + Di− H2 ))T
c) > 0,
A + B(Di F + Di− H2 ) P −1
i ∈ [1, 2m ],
d) |h1,i x| ≤ 1, ∀ x ∈ E(P, ρ1 ), i ∈ [1, m],
e) |h2,i x| ≤ 1, ∀ x ∈ E(P, 1), i ∈ [1, m].
γ = α24 , Q = P −1 , Y = F Q, Z1 = H1 Q, Z2 = H2 Q,
228 Chapter 10. Disturbance Rejection with Stability
inf γ (10.3.11)
Q>0,0<ρ1 <1,η>0,λ,Y,Z1 ,Z2 ,
10.4. Conclusions
In this chapter, we have considered linear systems subject to actuator sat-
uration and disturbances. Conditions for determining if a given ellipsoid
is strictly invariant were derived. With the aid of these conditions, we
developed analysis and design methods, both for enlarging the invariant el-
lipsoid and for disturbance rejection. Examples were used to demonstrate
the effectiveness of these methods.
Chapter 11
On Maximizing the
Convergence Rate
11.1. Introduction
Fast response is always a desired property for control systems. The time
optimal control problem was formulated for this purpose. Although it is
well known that the time optimal control is a bang-bang control, this control
strategy is rarely implemented in real systems. The main reason is that it
is generally impossible to characterize the switching surface. For discrete-
time systems, online computation has been proposed in the literature, but
the computation burden is very heavy since linear programming has to be
solved recursively with increasing time-horizon. Also, as the time-horizon
is extended, numerical problems become more severe. Another reason is
that even if the optimal control can be obtained exactly and efficiently, it
results in open-loop control.
Another notion related to fast response is the convergence rate. For
linear systems, the overall convergence rate is measured by the maximal
real part of the closed-loop eigenvalues. Consider the linear system
ẋ = Ax, x ∈ Rn . (11.1.1)
229
230 Chapter 11. On Maximizing the Convergence Rate
where Re(λi (A)) is the real part of the ith eigenvalue of A, then α > 0.
For simplicity, assume that A + αI is neutrally stable, then there exists a
positive definite matrix P such that
AT P + P A ≤ −2αP.
Let
V (x) = xT P x.
Then,
V̇ (x) = xT (AT P + P A)x ≤ −2αxT P x = −2αV (x).
Hence,
V̇ (x)
≤ −2α, ∀ x ∈ Rn \ {0}. (11.1.3)
V (x)
Furthermore, because of (11.1.2), we have
1 V̇ (x)
α = min − : x ∈ R \ {0} .
n
(11.1.4)
2 V (x)
Therefore, we call α the overall convergence rate of the system (11.1.1). For
a general nonlinear system, the convergence rate can be defined similarly
as in (11.1.4). Since local stability is a more general property than global
stability, we would like to define the term convergence rate on some subset
of the domain of attraction. Consider a nonlinear system
ẋ = f (x).
Suppose that V̇ (x) < 0 for all x ∈ LV (ρ) \ {0}. Then, the overall conver-
gence rate of the system on LV (ρ) can be defined as
1 V̇ (x)
α := inf − : x ∈ LV (ρ) \ {0} . (11.1.5)
2 V (x)
x(k + 1) = f (x(k)),
where ∆V (x) is the increment of V (x) along the trajectory of the system.
We can also define the convergence rate at each point x in the state space
as % &
V̇ (x) ∆V (x)
− or − .
V (x) V (x)
For linear systems subject to actuator saturation, the problem of max-
imizing the convergence rate at each x ∈ Rn is well defined. The objective
of this chapter is to find a control law constrained by the actuator satura-
tion such that −V̇ (x) is maximized at each x. It turns out that the optimal
control law is a bang-bang type control with a simple switching scheme.
Since the discontinuity might be undesirable, for example, causing chatter-
ing around the switching surface, we will also derive a continuous control
law with some loss of optimality in the convergence rate. The proposed
continuous control law is a saturated high gain linear feedback. As the gain
goes to infinity, the saturated high gain feedback approaches the optimal
bang-bang control law.
For a discrete-time system, the control law that maximizes the conver-
gence rate is a coupled saturated linear feedback. If the system has one or
two inputs, the control law can be put into a simple formula. If the system
has more than two inputs, the controller is more complicated. It is linear
inside some polyhedron. Outside of this polyhedron, we need to solve a
simple convex optimization problem.
We should note that under the maximal convergence control, the de-
crease of V (x) over a fixed time interval needn’t be maximal. This is be-
cause we don’t take the path of a trajectory into consideration. However, a
232 Chapter 11. On Maximizing the Convergence Rate
Section 11.2.3 reveals some properties and limitations about the overall
convergence rate and provides methods to deal with these limitations. Sec-
tion 11.2.4 shows that the maximal convergence control also achieves both
the maximal and the minimal invariant ellipsoids in the presence of distur-
bances. A brief concluding remark is made in Section 11.4.
Assume that the system is stabilizable and that B has full column rank.
Let
B = b1 b2 · · · bm .
In Chapter 8, we developed a design method for enlarging the domain
of attraction under a state feedback. The approach was to maximize the
invariant ellipsoid with repect to some shape reference set. The optimized
controller, however, results in very slow convergence rate of the closed-
loop system. The objective of this chapter is to design a controller that
maximizes the convergence rate inside a given ellipsoid. Given a positive
definite matrix P , let
V (x) = xT P x.
For a positive number ρ, the level set associated with V (x) is the ellipsoid,
E(P, ρ) = x ∈ Rn : xT P x ≤ ρ .
Under the constraint that |u|∞ ≤ 1, the control that maximizes the con-
vergence rate, or minimizes V̇ (x, u), is simply
m
ẋ = Ax − bi sign(bTi P x). (11.2.3)
i=1
Because of the discontinuity of the sign function, equation (11.2.3) may have
no solution for some x(0) or have solution only in a finite time interval. For
example, for the single input case m = 1, equation (11.2.3) will have no
solution if B T P x(0) = 0 and ẋ at each side of the switching plane B T P x = 0
points to the other side. We will use a continuous feedback law,
ui = −sat(kbTi P x),
in the next section. In what follows, we use the bang-bang control law to
investigate the possibility that an ellipsoid can be made invariant with a
bounded control |u|∞ ≤ 1.
Recall that an ellipsoid E(P, ρ) is invariant for a system ẋ = f (x) if
all the trajectories starting from it will stay inside of it. It is contractive
invariant if
ui = −sign(bT P x)
b) The ellipsoid E(P, ρ) is contractive invariant for (11.2.3), i.e., the fol-
lowing condition is satisfied,
m
V̇ (x) = xT (AT P + P A)x − 2 xT P bi sign(bTi P x) < 0,
i=1
∀ x ∈ E(P, ρ) \ {0}. (11.2.4)
It is clear from Fact 11.2.1 that the maximal convergence control pro-
duces the maximal invariant ellipsoid of a given shape. We will see in
the next section that if (11.2.4) is satisfied, there also exists a continuous
feedback law such that E(P, ρ) is contractive invariant. In this case, all the
trajectories starting from E(P, ρ) will converge to the origin asymptotically.
For an arbitrary positive definite matrix P , there may exists no ρ such
that E(P, ρ) can be made invariant. In what follows we give condition for P
such that E(P, ρ) can be made invariant for some ρ and provide a method
for finding the largest ρ.
(A + BF )T P + P (A + BF ) < 0; (11.2.5)
Proof. b)→ a). If (11.2.5) is satisfied, then there exists a ρ > 0 such that
E(P, ρ) ⊂ x ∈ Rn : |F x|∞ ≤ 1 .
If x0 ∈ E(P, ρ), then under the control u = F x, x(t) will stay in E(P, ρ) and
we also have |u|∞ ≤ 1 for all t ≥ 0. This means that E(P, ρ) can be made
contractive invariant with a bounded control. Hence by the equivalence of
the statements a) and b) in Fact 11.2.1, we have (11.2.4).
c) → b). It is obvious.
236 Chapter 11. On Maximizing the Convergence Rate
P → P̄ = (T −1 )T P T −1 ,
B → B̄ = T B
and
0
P B → P̄ B̄ = (T −1 )T P B = .
R
Recall that we have assumed that B has full column rank. Also, let us
accordingly partition x as
x1
,
x2
and AT P + P A and P as
Q1 Q12 P1 P12
AT P + P A = , P = .
QT12 Q2 T
P12 P2
For all
x1
∈ ∂E(P, ρ),
0
we have xT P B = 0. So, if a) is true, then (11.2.4) holds for some ρ > 0,
which implies that
xT1 Q1 x1 < 0,
for all x1 such that xT1 P1 x1 = ρ. It follows that Q1 < 0. Hence there exists
a k > 0 such that
Q1 Q12
(A − kBB T P )T P + P (A − kBB T P ) = < 0.
QT12 Q2 − kRRT
Chapter 8. The shape of the resulting invariant ellipsoid E(P, ρ) will be the
closest to that of E(P0 , 1).
Now assume that we have a P > 0 such that the conditions in Proposi-
tion 11.2.1 are satisfied. Given ρ > 0, we would like to determine if E(P, ρ)
is contractive invariant for the closed-loop system (11.2.3). Let’s start with
the single input case. In this case, the condition (11.2.4) simplifies to
for all k ∈ (0, 1]. This shows that the condition (11.2.7) is equivalent
to (11.2.8). Based on this equivalence property, we have the following
necessary and sufficient condition for the contractive invariance of a given
ellipsoid.
and
B T P (AT P + P A − λj P )−1 P B > 0. (11.2.10)
Then, E(P, ρ) is contractive invariant for the system (11.2.3) if and only if
Since E(P, ρ) can be made contractive invariant for some ρ > 0, we must
have g(x) < 0 for all B T P x = 0. In this case, the contractive invariance of
E(P, ρ) is equivalent to that all the extrema of g(x) in the surface xT P x = ρ,
B T P x > 0, if any, are less than zero.
By the Lagrange multiplier method, an extremum of g(x) in the surface
x P x = ρ, B T P x > 0, must satisfy
T
g(x) = λρ − xT P B.
11.2. Continuous-Time Systems 239
(AT P + P A − λP )x = P B,
we conclude that
det (AT P + P A − λP )
= 0.
To show this, we assume, without loss of generality, that
Q1 Q12 P1 P12 0
AT P + P A = , P = , P B = ,
QT12 q2 T
P12 p2 r
(AT P + P A − λP )x = P B,
then,
x1 = −(Q1 − λP1 )−1 (Q12 − λP12 )x2 ,
and
− (QT12 − λP12
T
)(Q1 − λP1 )−1 (Q12 − λP12 ) + q2 − λp2 x2 = r.
Multiplying both sides with det (Q1 − λP1 ) and applying (11.2.11), we ob-
tain
det (AT P + P A − λP )x2 = det (Q1 − λP1 )r.
Since r
= 0 and det (Q1 − λP1 )
= 0, we must have
det (AT P + P A − λP )
= 0.
x = (AT P + P A − λP )−1 P B,
Denote
Φ = λP − AT P − P A,
then the equation (11.2.15) can be written as,
B T P Φ−1 P Φ−1 P B = ρ.
%
&
ρ 0 T
B P 0 Φ−1 0 0 I
det − =0
0 P −1 0 I 0 Φ−1 PB 0
%
&
Φ 0 0 I ρ−1 0 BTP 0
det − =0
0 Φ PB 0 0 P 0 I
λP − AT P − P A P
det = 0.
ρ−1 P BB T P λP − AT P − P A
This last equation is (11.2.9).
Also, at the extremum, we have xT P B > 0. This is equivalent to
(11.2.10),
B T P (AT P + P A − λP )−1 P B > 0.
Finally, at the extremum
within each cone. For example, consider m = 2, the surface of the ellipsoid
E(P, ρ) can be divided into the following subsets:
S1 = x ∈ Rn : bT1 P x = 0, bT2 P x ≥ 0, xT P x = ρ , −S1 ,
S2 = x ∈ Rn : bT1 P x ≥ 0, bT2 P x = 0, xT P x = ρ , −S2 ,
S3 = x ∈ Rn : bT1 P x > 0, bT2 P x > 0, xT P x = ρ , −S3 ,
S4 = x ∈ Rn : bT1 P x > 0, bT2 P x < 0, xT P x = ρ , −S4 .
With the bang-bang type control law (11.2.2), V (x) of the closed-loop sys-
tem will decrease with a maximal convergence rate. As we have noted in the
last section, the discontinuity of the bang-bang control law may cause the
state equation to have no solution. When the control law is implemented
on a sampled-data system, it may cause frequent switching (chattering)
around the switching surface. In this section, we will replace the bang-
bang control law with a saturated high gain linear feedback at the cost of
a slight reduction in convergence rate. We also start with the single input
case.
(A − k0 BB T P )T P + P (A − k0 BB T P ) < 0, (11.2.17)
u = −sat(kB T P x),
for all k ≥ k0 .
If |B T P x| ≥ k1 , then
sat(kB T P x) = kB T P x.
u = −sat(kB T P x).
u = −sat(kB T P x)
and
u = −sign(B T P x).
The difference between the two V̇ (x)’s under these two controls is
1
= 0, if |B T P x| > ,
T k
2x P B sign(B P x) − sat(kB P x)
T T
≤ 2 , if |B T P x| ≤ 1
.
k k
Note that |sign(B T P x) − sat(kB T P x)| ≤ 1. By letting k → ∞, the differ-
ence between the two V̇ (x)’s will go to zero uniformly on E(P, ρ). Thus we
can say that high gain saturated linear feedback will produce sub-optimal
convergence rate.
We now consider the multiple input case. For v ∈ Rm , denote
T
sat(v) = sat(v1 ) sat(v2 ) · · · sat(vm ) .
Here, as usual, we have slightly abused the notation by using sat to denote
both the scalar and the vector saturation functions. We have the following
result.
244 Chapter 11. On Maximizing the Convergence Rate
Theorem 11.2.3. For a multiple input system, suppose that E(P, ρ) can
be made contractive invariant with a bounded control, then there exists a
k > 0 such that E(P, ρ) is contractive invariant under the control
u = −sat(kB T P x).
Proof. The condition that E(P, ρ) can be made contractive invariant im-
plies that
m
xT (AT P + P A)x − 2 xT P bi sign(bTi P x) < 0,
i=1
∀ x ∈ E(P, ρ) \ {0}. (11.2.20)
m
ε = − max x (A P + P A)x − 2
T T T T
x P bi sign(bi P x) .
x∈∂ E (P,ρ)
i=1
Since
1
= 0, if |bTi P x| > ,
T k
x P bi sign(bi P x) − sat(kbi P x)
T T
≤ 1 , if |bT P x| ≤ 1
i ,
k k
we have m
2m
2 x P bi sign(bi P x) − sat(kbi P x) ≤
T T T
.
k
i=1
2m
If we choose k > ε , then from (11.2.23),
m
xT (AT P + P A)x − 2 xT P bi sat(kbTi P x) < 0, ∀ x ∈ ∂E(P, ρ),
i=1
u = −sat(kB T P x).
2
246 Chapter 11. On Maximizing the Convergence Rate
Let
0.0836 −0.0639
P = .
−0.0639 0.1460
By checking the condition of Theorem 11.2.1 bisectionally, the largest el-
lipsoid that can be made contractive invariant with a bounded control is
E(P, ρ∗ ) with ρ∗ = 1.059. By using the design method in Chapter 8, a
feedback
u = sat(F0 x), F0 = 0.0036 −0.3057
is found such that
with E(P, ρ), ρ = 1.058 inside the linear region of the saturation
L(F0 ) = x ∈ R2 : |F0 x|∞ ≤ 1
u = −sat(5B T P x).
This is illustrated in Figs. 11.2.1 and 11.2.2. In Fig. 11.2.1, “∗” represents
the initial state, the solid trajectory is under the control of u = sat(F0 x) and
the dash-dotted one is under the control of u = −sat(5B T P x). Fig. 11.2.2
shows V (x) = xT P x as a function of time. Also, the solid one is under the
control of u = sat(F0 x) and the dash-dotted one is under the control of
u = −sat(5B T P x).
From Fig. 11.2.2, we see that the decrease of xT P x becomes slower and
slower under the control
u = −sat(5B T P x).
u = −sat(kB T P x)
11.2. Continuous-Time Systems 247
F0x=−1
0
T
5B Px=1
−1
T
x Px=1.058
−2
−3
−4
−8 −6 −4 −2 0 2 4 6 8
is limited by the shape of the ellipsoid or the matrix P . This problem will
be discussed in the next section.
We also consider the system (11.2.3) under the maximal convergence con-
trol,
m
ẋ = Ax − bi sign(bTi P x). (11.2.24)
i=1
1.4
1.2
1
u=sat(F0x)
xTPx 0.8
0.6
0.4
u=−sat(5BTPx)
0.2
0
0 2 4 6 8 10 12 14 16 18 20
t
Theorem 11.2.4.
a)
1 V̇ (x)
α(ρ) = min − T
: x Px = ρ ; (11.2.25)
2 ρ
c) Let
β0 = min − xT (AT P + P A)x : xT P x = 1, xT P B = 0 ,
then,
β0
lim α(ρ) = . (11.2.26)
ρ→0 2
11.2. Continuous-Time Systems 249
Proof.
a) Consider x ∈ ∂E(P, ρ) and k ∈ (0, 1],
V̇ (kx) k 2 xT (AT P + P A)x − 2k m T T
i=1 x P bi sign(bi P kx)
− =−
V (kx) k 2 xT P x
2
m
−x (A P + P A)x + k i=1 xT P bi sign(bTi P x)
T T
= . (11.2.27)
xT P x
Since
m
xT P bi sign(bTi P x) ≥ 0,
i=1
(kx) (x)
− VV̇ (kx) increases as k decreases. It follows that the minimal value of − VV̇ (x)
is obtained on the boundary of E(P, ρ), which implies (11.2.25).
2α(ρ) ≥ β0 − ε.
Denote
X 0 = x ∈ Rn : xT P x = 1, xT P B = 0 ,
250 Chapter 11. On Maximizing the Convergence Rate
and
X (δ) = x ∈ Rn : xT P x = 1, |xT P B|∞ ≤ δ .
It is clear that
lim dist(X (δ), X 0 ) = 0,
δ→0
we have,
2 T
m
min −xT (AT P + P A)x + √ x P bi sign(bTi P x) :
ρ i=1
xT P x = 1, |xT P B|∞ ≤ δ ≥ β0 − ε, (11.2.29)
2α(ρ) ≥ β0 − ε, ∀ρ > 0.
2δ
≥ β1 + √
ρ
> β0 − ε.
2α(ρ) ≥ β0 − ε
for % &2
2δ
ρ< .
−β1 + β0 − ε
This completes the proof. 2
Theorem 11.2.4 says that α(ρ) can be obtained by computing the max-
imum of V̇ (x) over ∂E(P, ρ). For the single input case, Theorem 11.2.1
provides a method for determining if this maximum is negative. The exact
value of α(ρ) can be computed with a procedure similar to the proof of
Theorem 11.2.1. To avoid too much technical detail, we assume for sim-
plicity that, for any real eigenvalue λj of A + P −1 AT P , there exists no x
satisfying
(P A + AT P − λj P )x = P B.
This is the case if (A + P −1 AT P, B) is controllable, thus the assumption is
generally true. Let λ1 , λ2 , · · · , λJ be real numbers satisfying (11.2.9) and
(11.2.10). Denote
1
βj = −λj + B T P (AT P + P A − λj P )−1 P B,
ρ
252 Chapter 11. On Maximizing the Convergence Rate
then,
1
α(ρ) = min βj : j = 0, 1, · · · , J ,
2
where β0 is as defined in Theorem 11.2.4.
Since the overall convergence rate is limited by β0 /2 and it approaches
this limit as ρ goes to 0, we would like β0 not to be too small. For a fixed
P , a formula for computing β0 can be derived directly from the definition.
Let the kernel of B T P be {N y : y ∈ Rn−m }. Then
β0 = −λmax (N T P N )−1 N T (AT P + P A)N . (11.2.31)
β0 =sup λ (11.2.32)
F
s.t. (A + BF )T P + P (A + BF ) ≤ −λP.
It follows that
β0 = min − xT ((A + BF )T P + P (A + BF ))x : xT P x = 1, xT P B = 0
≥ min − xT ((A + BF )T P + P (A + BF ))x : xT P x = 1 ,
and hence,
β0 ≥ sup min − xT ((A + BF )T P + P (A + BF ))x : xT P x = 1 .
F
(11.2.33)
In what follows, we will prove that
β0 = sup min − xT ((A + BF )T P + P (A + BF ))x : xT P x = 1 .
F
(11.2.34)
In view of (11.2.33), it suffices to show that for any ε > 0, there exists an
F = −kB T P , with k > 0, such that
min − xT ((A + BF )T P + P (A + BF ))x : xT P x = 1 ≥ β0 − ε.
(11.2.35)
11.2. Continuous-Time Systems 253
From the definition of β0 , we see that there exists a δ > 0 such that
min − xT (AT P + P A)x : xT P x = 1, |xT P B|2 ≤ δ ≥ β0 − ε
sup α (11.2.38)
P >0,ρ,F,H
s.t. a) αX R ⊂ E(P, ρ),
b) (A + BF )T P + P (A + BF ) < 0,
c) E(P, ρ) ⊂ L(F ),
d) (A + BH)T P + P (A + BH) ≤ −βP.
The final outcome is that under the control of (11.2.39) or (11.2.40), the
closed-loop system will have a contractive invariant set E(P, ρ) and a guar-
anteed limit of the convergence rate
β0 β
≥ .
2 2
11.2. Continuous-Time Systems 255
ẋ = Ax + Bu + Ew, x ∈ Rn , u ∈ Rm , w ∈ Rq , (11.2.41)
No matter what w is, the control that maximizes the convergence rate, or
minimizes V̇ (x, u, w), is also
wi = sign(eTi P x), i = 1, 2, · · · , q.
m
x (A P + P A)x − 2
T T
xT P bi sign(bTi P x)
i=1
q
+ xT P ei sign(eTi P x) ≤ 0, ∀ x ∈ ∂E(P, ρ). (11.2.45)
i=1
Denote
m
V̇ (x) = x (A P + P A)x − 2
T T
xT P bi sign(bTi P x)
i=1
q
+2 xT P ei sign(eTi P x).
i=1
Fact 11.2.2 says that E(P, ρ) is invariant for (11.2.44) if and only if V̇ (x) ≤ 0
for all x ∈ ∂E(P, ρ). In the following, when we say that E(P, ρ) is invariant,
we mean it is invariant for the system (11.2.44), or for the system (11.2.41)
under the control
ui = −sign(bTi P x).
From Proposition 11.2.3, we see that all the ρ such that E(P, ρ) is in-
variant form an interval. By Fact 11.2.2, this is the largest interval we
can achieve with any feedback control constrained by the input saturation.
11.2. Continuous-Time Systems 257
In other words, the maximal convergence control produces both the max-
imal invariant ellipsoid and the minimal invariant ellipsoid. As we have
mentioned in Chapter 10, a large invariant ellipsoid is desired for a large
domain of attraction, and a small invariant ellipsoid indicates a good ca-
pability of disturbance rejection. In summary, the maximal convergence
control is ideal for dealing with disturbances.
Similar to the system in the absence of disturbance, it can be shown
that the saturated high gain feedback
u = −sat(kB T P x)
ui = −sign(bTi P x), i = 1, 2, · · · , m,
Similar to the two-input case in Section 11.2.1, we can divide the surface
∂E(P, ρ) into 8 subsets:
S1 = x ∈ R n : B T P x = 0, E T P x ≥ 0, xT P x = ρ , −S1 ,
S2 = x ∈ R n : B T P x ≥ 0, E T P x = 0, xT P x = ρ , −S2 ,
S3 = x ∈ R n : B T P x > 0, E T P x > 0, xT P x = ρ , −S3 ,
S4 = x ∈ R n : B T P x > 0, E T P x < 0, xT P x = ρ , −S4 .
and all the local extrema of V̇ (x) in S3 and S4 are non-positive. These
optimization problems can be handled similarly as those in Section 11.2.1.
258 Chapter 11. On Maximizing the Convergence Rate
Assume that the system is stabilizable and that B has full column rank.
Given V (x) = xT P x, we would like to maximize the convergence rate within
E(P, ρ).
Define
∆V (x, u) = uT B T P Bu + 2uT B T P Ax + xT AT P Ax − xT P x
T
= u + (B T P B)−1 B T P Ax B T P B u + (B T P B)−1 B T P Ax
−xT AT P B(B T P B)−1 B T P Ax + xT AT P Ax − xT P x.
Let
F0 = −(B T P B)−1 B T P A.
It is clear that the convergence maximization problem is equivalent to
Let us first consider the single input case. In this case, the control that
maximizes the convergence rate is simply,
i.e.,
T
A − B(B T P B)−1 B T P A P A − B(B T P B)−1 B T P A − P < 0.
(11.3.5)
As in the continuous-time case, we also have
Fact 11.3.1. For the single input case, the following two statements are
equivalent:
a) The ellipsoid E(P, ρ) can be made contractive invariant for the system
(11.3.1) with a bounded control;
b) The ellipsoid E(P, ρ) is contractive invariant for (11.3.4), i.e., the fol-
lowing condition is satisfied,
Proposition 11.3.1. For the single input case, the ellipsoid E(P, ρ) can be
made contractive invariant for some ρ > 0 if and only if (11.3.5) is satisfied.
for all
x ∈ x ∈ Rn : |F0 x| ≤ 1, x
= 0
= x ∈ Rn : |B T P Ax| ≤ B T P B, x
= 0 .
260 Chapter 11. On Maximizing the Convergence Rate
Like the equivalence of (11.2.7) and (11.2.8), it can be shown that (11.3.7)
is equivalent to
And we have
Theorem 11.3.1. For the single input case, assume that (11.3.5) is satis-
fied. Let λ1 , λ2 , · · · , λJ > 1 be real numbers such that
λj P − AT P A P
det =0 (11.3.9)
ρ−1 AT P BB T P A λj P − AT P A
and
B T P A(AT P A − λj P )−1 AT P B ≥ B T P B. (11.3.10)
Then, E(P, ρ) is contractive invariant for the system (11.3.4) if and only if
Proof. Denote
Ax − B(B T P B)−1 B T P Ax = Ax − B.
Since P satisfies (11.3.5), we have g(x) < 0 for all x on the plane B T P Ax =
B T P B. So the invariance of E(P, ρ) is equivalent to that all the extrema in
the surface xT P x = ρ, B T P Ax > B T P B, if any, are less than zero.
11.3. Discrete-Time Systems 261
g(x) = (λ − 1)ρ − B T P Ax + B T P B.
It follows from (11.3.5) that Q1 − P1 < 0 and Q1 − λP1 < 0 for all λ > 1.
It can be shown as in the proof of Theorem 11.2.1 that (AT P A − λP )x =
AT P B and λ > 1 imply that AT P A − λP is nonsingular. Hence
x = (AT P A − λP )−1 AT P B,
and from xT P x = ρ,
u = sat(F0 x),
as the optimal control. But for the general case, the optimal u cannot be
put into this simple form. Actually, (11.3.2) is a minimal distance problem.
Let
v = F0 x = −(B T P B)−1 B T P Ax,
262 Chapter 11. On Maximizing the Convergence Rate
then the optimization problem is to find a point in the box |u|∞ ≤ 1 that
is closest to v, with the distance induced by the weighted 2-norm, i.e.,
1
|u − v|B T P B = ((u − v)T B T P B(u − v)) 2 .
This is illustrated in Fig. 11.3.1. In Fig. 11.3.1, v is marked with “◦” and
the optimal u is marked with “∗”. Suppose that v ∈ R2 is outside of the
3
R4
2 R
3
R
2
v v
v
1
u u u
v
u
0 −R R0
1
R1
−1
−3
−2.5 −2 −1.5 −1 −0.5 0 0.5 1 1.5 2
Figure 11.3.1: Illustration for the relation between v and the optimal u.
min γ (11.3.14)
u,γ
γ (u − v)T
s.t. ≥ 0, |u|∞ ≤ 1.
u−v (B T P B)−1
This optimization problem can be efficiently solved so that the control can
be computed and implemented on line. It can be shown that the optimal
control u resulting from (11.3.14) is continuous in v and hence continuous
in x.
We have some remarks for the case that B has full row rank. It is known
that there exists a linear dead-beat control for this case. Let B + be the
right inverse of B, then
It follows that
T
V (Ax + Bu) = u + (B T P B)−1 B T P Ax B T P B
× u + (B T P B)−1 B T P Ax .
11.4. Conclusions
We have shown in this chapter that, for a continuous-time system subject
to input saturation and persistent disturbance, the maximal convergence
control is a bang-bang type control with a simple switching strategy. A
saturated high gain linear feedback is developed to avoid the discontinuity
of the bang-bang control. For a discrete-time system, the maximal con-
vergence control is a coupled saturated linear feedback. We also provided
methods for determining the largest ellipsoid that can be made invariant
with a bounded control for both continuous-time and discrete-time systems.
Chapter 12
Output Regulation –
Continuous-Time
Systems
12.1. Introduction
265
266 Chapter 12. Output Regulation – Continuous-Time Systems
our results on an aircraft model. Finally, Section 12.7 draws a brief con-
clusion to this chapter.
u = F x + Gw, (12.2.2)
1) the system
ẋ = (A + BF )x
is asymptotically stable;
2) for all (x0 , w0 ) ∈ Rn+r , the interconnection of (12.2.1) and the feed-
back law (12.2.2) results in
lim e(t) = 0.
t→∞
1) the system
ẋ = Ax + BCc ξ + BDc Cx,
ξ˙ = Ac ξ + Bc Cx
is asymptotically stable;
Assumption 12.2.1.
The latter can be interpreted as the system obtained from (12.2.1) by cut-
ting the connections between the exosystem and the plant. More specifi-
cally, the following results were proven in [31].
270 Chapter 12. Output Regulation – Continuous-Time Systems
ẋ = Ax + Bφ(x, 0)
lim e(t) = 0.
t→∞
12.3. The Regulatable Region 271
ξ̇ = ψ(ξ, e), ξ0 ∈ Y0 ,
u = φ(ξ),
ẋ = Ax + Bφ(ξ),
(12.2.7)
ξ˙ = ψ(ξ, Cx)
lim e(t) = 0.
t→∞
exist matrices Π ∈ Rn×r and Γ ∈ Rm×r that solve the following linear
matrix equations,
ΠS = AΠ + BΓ + P,
(12.3.1)
0 = CΠ + Q.
Given the matrices Π and Γ, we define a new state z = x − Πw and
rewrite the system equations as
ż = Az + Bu − BΓw,
ẇ = Sw, (12.3.2)
e = Cz.
From these new equations, it is clear that e(t) goes to zero asymptotically
if z(t) goes to zero asymptotically. This is possible only if (see [71])
sup ΓeSt w0 ∞ < 1. (12.3.3)
t≥0
For this reason, we will restrict our attention to exosystem initial conditions
in the following compact set
W 0 = w0 ∈ Rr : |Γw(t)|∞ = ΓeSt w0 ∞ ≤ ρ, ∀ t ≥ 0 , (12.3.4)
for some ρ ∈ [0, 1). For later use, we also denote δ = 1 − ρ. We note that
the compactness of W 0 can be guaranteed by the observability of (Γ, S).
Indeed, if (Γ, S) is not observable, the exosystem can be reduced to make
it so.
We can now precisely define the notion of asymptotically regulatable
region as follows.
Definition 12.3.1.
1) Given T > 0, a pair (z0 , w0 ) ∈ Rn × W 0 is regulatable in time T if
there exists an admissible control u, such that the response of (12.3.2)
satisfies z(T ) = 0. The set of all (z0 , w0 ) regulatable in time T is
denoted as Rg (T ).
3) The set of all (z0 , w0 ) for which there exist admissible controls such
that the response of (12.3.2) satisfies
lim z(t) = 0
t→∞
12.3. The Regulatable Region 273
Requiring the former instead of the latter will also guarantee the closed-loop
stability in the absence of w. As will be explained in detail in Remark 12.3.2,
this will result in essentially the same description of the asymptotically
regulatable region.
v̇ = Av + Bu, u ∈ Ua . (12.3.5)
The asymptotically null controllable region C a and the related C and C(T ),
were defined in Chapter 2. Since our description of Rg (T ), Rg and Rag
relies heavily on them, we briefly recall these definitions as follows.
Clearly,
C= C(T ), (12.3.6)
T ∈[0,∞)
and
T
−Aτ
C(T ) = e Bu(τ )dτ : u ∈ Ua . (12.3.7)
0
It is also clear that the null controllable region and the asymptotically null
controllable region are identical if the pair (A, B) is controllable. Some
274 Chapter 12. Output Regulation – Continuous-Time Systems
and
A1 0 B1
A= , B= , (12.3.8)
0 A2 B2
where A1 ∈ Rn1 ×n1 is semi-stable (i.e., all its eigenvalues are in the closed
left-half plane) and A2 ∈ Rn2 ×n2 is anti-stable (i.e., all its eigenvalues are
in the open right-half plane). The anti-stable subsystem,
ż2 = A2 z2 + B2 u − B2 Γw,
(12.3.9)
ẇ = Sw,
v̇2 = A2 v2 + B2 u
and let
V (T ) = V2 − e−A2 T V2 eST .
12.3. The Regulatable Region 275
Then,
a) Rg 2 (T ) = (z2 , w) ∈ Rn2 × W 0 : z2 − V (T )w ∈ C 2 (T ) ; (12.3.11)
b) Rg 2 = (z2 , w) ∈ Rn2 × W 0 : z2 − V2 w ∈ C 2 ; (12.3.12)
c) Rag = Rn1 × Rg 2 . (12.3.13)
Proof.
a) Given (z20 , w0 ) ∈ Rn2 × W 0 and an admissible control u, the solution
of (12.3.9) at t = T is,
T
z2 (T ) = eA2 T z20 + e−A2 τ B2 u(τ )dτ
0
T
−A2 τ
− e B2 Γe Sτ
w0 dτ . (12.3.14)
0
Thus,
T
−A2 T
e z2 (T ) = z20 − V (T )w0 + e−A2 τ B2 u(τ )dτ.
0
lim V (T ) = V2 .
T →∞
z20 − V2 w0 ∈ C 2 =⇒ (z20 , w0 ) ∈ Rg 2 .
276 Chapter 12. Output Regulation – Continuous-Time Systems
Since
lim V (T ) = V2 ,
T →∞
Hence,
(z20 , w0 ) ∈ Rg 2 =⇒ z20 − V2 w0 ∈ C 2 .
Denote
Z2 = δe−A2 T1 v2 : v2 ∈ C 2 .
For each z2 ∈ Z2 , there is an admissible control u2 such that
∞ ∞
z2 = δe−A2 T1 e−A2 τ B2 u2 (τ )dτ = e−A2 τ B2 δu2 (τ − T1 )dτ.
0 T1
Hence,
∞
z20 − V2 w0 + z2 = z20 − e−A2 τ B2 ΓeSτ w0 dτ
0
∞
+ e−A2 τ B2 δu2 (τ − T1 )dτ
T1
12.3. The Regulatable Region 277
T1
= z20 − e−A2 τ B2 ΓeSτ w0 dτ
∞0
+ e−A2 τ B2 δu2 (τ − T1 ) − ΓeSτ w0 dτ
T1
T1
=− e−A2 τ B2 u1 (τ )dτ
0
∞
+ e−A2 τ B2 δu2 (τ − T1 ) − ΓeSτ w0 dτ
T1
∈ C2,
Since e−A2 T1 is nonsingular, the set Z2 contains the origin in its interior.
It follows that
z20 − V2 w0 ∈ C 2 .
Since
z1 (T )
0
is inside the asymptotically null controllable region of the above system
under the constraint uδ ∞ ≤ δ (see Chapter 2), there exists a uδ such
that
lim z(t) = 0.
t→∞
Hence (z0 , w0 ) ∈ Rag . This establishes that Rn1 × Rg 2 ⊂ Rag and hence
Rn1 × Rg 2 = Rag . 2
278 Chapter 12. Output Regulation – Continuous-Time Systems
Since
lim V (T ) = V2 ,
T →∞
that
T T
z20 − e−A2 τ B2 ΓeSτ w0 dτ + e−A2 τ B2 u(τ )dτ
0 0
T
= z20 − V (T )w0 + e−A2 τ B2 u(τ )dτ
0
ε ε
>ε− = ,
2 2
for all T > T0 and u ∈ Ua . Since the smallest singular value of eA2 T
increases exponentially with T , it follows from (12.3.14) that z2 (T ) will grow
unbounded. Recall from the definition of Problems 12.2.1 and 12.2.2 that
the state x(t) and hence z(t) has to be bounded, so even if the requirement
is replaced with
lim e(t) = 0, (12.3.18)
t→∞
12.4. State Feedback Controllers 279
if and only if
x20 − (Π2 + V2 )w0 ∈ C 2 ,
where
Π2 = 0 In2 Π.
In this section, we will construct a feedback law that solves the problem of
output regulation by state feedback for linear systems subject to actuator
saturation. We will assume that a stabilizing state feedback law
u = f (v), |f (v)|∞ ≤ 1, v ∈ Rn ,
v̇ = Av + Bf (v) (12.4.1)
with
|g(z, w)|∞ ≤ 1, ∀ (z, w) ∈ Rn × W 0 ,
the closed-loop system is given by,
Denote the time response of z(t) to the initial state (z0 , w0 ) as z(t, z0 , w0 )
and define
S zw := (z0 , w0 ) ∈ Rn × W 0 : lim z(t, z0 , w0 ) = 0 .
t→∞
Since Rag is the set of all (z0 , w0 ) for which z(t) can be driven to the origin
asymptotically, we must have S zw ⊂ Rag . Our objective is to design a
control law u = g(z, w) such that S zw is as large as possible, or as close to
Rag as possible.
First we need a mild assumption which will be removed later. Assume
that there exists a matrix V ∈ Rn×r such that
This will be the case if A and S have no common eigenvalues. With the
decomposition in (12.3.8), if we partition V accordingly as
V1
V = ,
V2
then V2 satisfies
−A2 V2 + V2 S = −B2 Γ.
Denote
Dzw := (z, w) ∈ Rn × W 0 : z − V w ∈ S , (12.4.4)
on which the following observation can be made.
Observation 12.4.1.
b) In the absence of w,
x0 ∈ S =⇒ (z0 , 0) ∈ Dzw .
12.4. State Feedback Controllers 281
Proof. The fact that Dzw increases as S increases is easy to see. To see
the rest of a), we note that, for a general plant, C a = Rn1 × C 2 . If S = C a ,
then S = Rn1 × C 2 , and
Dzw = (z, w) ∈ Rn × W 0 : z − V w ∈ Rn1 × C 2
= (z1 , z2 , w) ∈ Rn1 × Rn2 × W 0 : z1 − V1 w ∈ Rn1 , z2 − V2 w ∈ C 2
= Rn1 × Rg 2 = Rag .
ż = Az + Bf (z − V w) − BΓw,
(12.4.5)
ẇ = Sw.
For this system, Dzw is an invariant set and for all (z0 , w0 ) ∈ Dzw ,
ż = Az + Bf (z − V w) − AV w + V S w
= A(z − V w) + Bf (z − V w) + V ẇ.
v̇ = Av + Bf (v),
and
lim (z(t) − V w(t)) = lim v(t) = 0.
t→∞ t→∞
2
282 Chapter 12. Output Regulation – Continuous-Time Systems
(A + BF )T X + X(A + BF ) < 0
|F v|∞ ≤ 1, ∀ v ∈ E(X).
v̇ = Av + Bf (v). (12.4.7)
Then, for any α > 0, αD is an invariant set in the domain of attraction for
the system v
v̇ = Av + αBf . (12.4.8)
α
12.4. State Feedback Controllers 283
v̇ v v
= A + Bf ,
α α α
v v0
and replace with v, then we get (12.4.7). If v0 ∈ αD, i.e., ∈ D, then
α α
v(t)
∈ D, ∀t > 0
α
and
lim v(t) = 0.
t→∞
For any α ∈ (0, 1), since W 0 is a compact set, there exists a positive
integer N such that
1
αN X 2 V w < δ, ∀ w ∈ W 0 . (12.4.9)
That is,
αN V w ∈ δE(X), ∀ w ∈ W 0.
k
For this system, Dzw is an invariant set. Moreover, if k = 0, 1, · · · , N , then,
lim z(t) − αk V w(t) = 0, ∀ (z0 , w0 ) ∈ Dzw
k
,
t→∞
and if k = N + 1, then,
N +1
lim z(t) = 0, ∀ (z0 , w0 ) ∈ Dzw .
t→∞
Proof. With
u = fk (z, w, α), k = 0, 1, · · · , N,
we have
% &
F (z − αk V w)
ż = Az + 1 − αk BΓw + αk Bsat − BΓw
αk
% &
F (z − αk V w)
= Az + αk Bsat − αk BΓw. (12.4.11)
αk
It follows from Lemma 12.4.2 that αk E(X) is an invariant set in the domain
k
of attraction for the vk -system. Hence Dzw is invariant for the system
(12.4.10) and if (z0 , w0 ) ∈ Dzw
k
, i.e.,
vk0 = z0 − αk V w0 ∈ αk E(X),
then,
lim z(t) − αk V w(t) = lim vk (t) = 0.
t→∞ t→∞
For % &
Fz
u = fN +1 (z, w) = Γw + δsat ,
δ
we have % &
Fz
ż = Az + δBsat
δ
and the same argument applies. 2
Based on the technical lemmas established above, we construct our final
state feedback law as follows,
u = g(z, w, α, N )
12.4. State Feedback Controllers 285
fN +1 (z, w), if (z, w) ∈ ΩN +1 := Dzw
N +1
,
f (z, w, α), if (z, w) ∈ Ωk := Dzw
k
\ ∪N +1 j
k j=k+1 Dzw ,
=
k = 0, 1, · · · , N,
+1 j
f (z − V w), if (z, w) ∈ Ω := Rn × W 0 \ ∪N
j=0 Dzw .
satisfies
lim z(t) = 0,
t→∞
i.e., Dzw ⊂ S zw .
Hence there is a finite time t0 ≥ 0 such that z(t0 ) − V w(t0 ) ∈ E(X), i.e.,
0
(z(t0 ), w(t0 )) ∈ Dzw . The condition that (z(t), w(t)) ∈ Dzw
k
, k > 0, might
be satisfied at an earlier time t1 ≤ t0 . In any case, there is a finite time
t1 ≥ 0 such that
+1
(z(t1 ), w(t1 )) ∈ Ωk = Dzw
k
\ ∪N j
j=k+1 Dzw ,
286 Chapter 12. Output Regulation – Continuous-Time Systems
u = fk (z, w, α),
lim z(t) − αk V w(t) = 0.
t→∞
Since α ∈ (α0 , 1), we have
1
(1 − α) X 2 V w < α, ∀ w ∈ W 0.
Since the first term goes to zero asymptotically, there exists a finite time
t2 > t1 such that
1
2
X (z(t2 ) − αk+1 V w(t2 )) ≤ αk+1 .
i.e.,
(z(t2 ), w(t2 )) ∈ Dzw
k+1
.
If k = N , then, by (12.4.9),
1 1 1
2 2
X z ≤ X (z − αN V w) + αN X 2 V w
1
< X 2 (z − αN V w) + δ.
Also, the first term goes to zero asymptotically, so there exists a finite time
t2 > t1 such that 1
2
X z(t2 ) ≤ δ,
N +1
i.e., (z(t2 ), w(t2 )) ∈ Dzw .
k+l
Just as before, (z, w) might have entered Dzw , l > 1, before it enters
k+1
Dzw . In any case, there is a finite time t such that
+1
(z(t), w(t)) ∈ Ωk+l = Dzw
k+l
\ ∪N j
j=k+l+1 Dzw ,
12.4. State Feedback Controllers 287
for some l ≥ 1. After that, the controller will be switched to fk+l (z, w, α).
k
It is also important to note that, by Lemma 12.4.3, Dzw is invariant
under the control u = fk (z, w, α). Once (z, w) ∈ Ω ⊂ Dzw , it will never go
k k
back to Ωl , l < k (or Ω) since Ωl , l < k and Ω have no intersection with Dzw k
l k
(But Ω , l > k, might have intersection with Dzw ). In summary, for any
(z0 , w0 ) ∈ Dzw , suppose (z0 , w0 ) ∈ Ωk , the control will first be fk (z, w, α)
and then switch successively to fk1 , fk2 , · · ·, with k1 , k2 , · · ·, strictly increas-
N +1
ing until (z, w) enters Dzw and remains there. Hence, by Lemma 12.4.3,
we have
lim z(t) = 0.
t→∞
From the proof of Theorem 12.4.1, we see that for all (z0 , w0 ) ∈ Dzw ,
the number of switches is at most N + 2.
For a better understanding, we illustrate the proof of Theorem 12.4.1
k
with Fig. 12.4.1. The sets Dzw for the simplest case where both z and w are
one dimensional are plotted. Here we have X = V = 1, α = 0.6, δ = 0.2
and N = 3. The parallelogram bounded by the straight lines Lk (along
with the two vertical lines w = ±1) is
k
Dzw = (z, w) : z − αk w ≤ αk , |w| ≤ 1 , k = 0, 1, 3, 4.
The dotted line passing through the origin is in parallel to the line L0 .
Suppose that (z0 , w0 ) ∈ Ω0 ⊂ Dzw0
, then under the control u = f0 (z, w, α),
(z, w) will converge to the dotted line. Since α is chosen such that this
1 1
dotted line is inside Dzw (see (12.4.14)), (z, w) will enter Dzw in a finite
time. After that, the controller will be switched to f1 (z, w, α) and so on.
4
Finally (z, w) will enter Dzw and z(t) will go to zero. If z is two dimensional,
k
we will have cylinders as Dzw instead of parallelograms.
In what follows, we will deal with the case that there is no V satisfying
−AV + V S = −BΓ.
This will occur if A and S have some same eigenvalues on the imaginary
axis. Another case is that some eigenvalues of A and S on the imaginary
axis are very close. This will result in large elements of V , so α could be
very close to 1 and N could be very large. The following method is derived
to deal with these two cases.
288 Chapter 12. Output Regulation – Continuous-Time Systems
2.5
z
2 L0
1.5
L
1
1
0.5 L
3
L
4
0
L
4
−0.5 L3
−1
L1
−1.5
−2 L
0
w
−2.5
−1 −0.8 −0.6 −0.4 −0.2 0 0.2 0.4 0.6 0.8 1
with A1 ∈ Rn1 ×n1 semi-stable and A2 ∈ Rn2 ×n2 anti-stable. Also suppose
that there is a known function f (v2 ), |f (v2 )|∞ ≤ 1 for all v2 ∈ Rn2 , such
that the origin of the following system
v̇2 = A2 v2 + B2 f (v2 )
v̇ = Av + δBsat(h(v))
−A2 V2 + V2 S = −B2 Γ,
z20 − V2 w0 ∈ S 2 , w0 ∈ W 0 ,
lim z2 (t) = 0.
t→∞
Hence there is a finite time t1 > 0 such that z(t1 ) ∈ Rn1 × δS 2 . After that
if we switch to the control u = Γw + δ sat(h(z), we will have
ż = Az + δBsat(h(z))
and z will stay in S δ = Rn1 ×δS 2 . To avoid the possibility that the state z2
might be trapped on the boundary of δS 2 , we need to modify the controller
further such that it will not switch before z2 has entered γδS 2 for some
γ ∈ (0, 1). For this purpose, we introduce a switching state variable s(t):
0, if z2 ∈ Rn2 \ δS 2 ,
s(t) = 1, if z2 ∈ γδS 2 ,
−
s(t ), if z2 ∈ δS 2 \ γδS 2 .
We may simply assume that s(0) = 0. Now the modified controller is,
g(z2 , w, α, N ), if z2 ∈ Rn2 \ γδS 2 and s = 0,
u=
Γw + δ sat(h(z)), if z2 ∈ δS 2 and s = 1.
lim z2 (t) = 0.
t→∞
290 Chapter 12. Output Regulation – Continuous-Time Systems
Letting
z̃ = z − z̄, w̃ = w − w̄,
we can write the composite system as
ż = Az + Bu − BΓw, (12.5.2)
ẇ = Sw,
z̃˙ A + L1 C −BΓ z̃
= .
w̃˙ L2 C S w̃
Now we have to use (z̄, w̄) instead of (z, w) to construct a feedback
controller. Since (C, A) is observable, we can choose
L1
L=
L2
appropriately such that the estimation error (z̃, w̃) decays arbitrarily fast.
Moreover, the following fact is easy to establish,
12.5. Error Feedback Controllers 291
v̇ = Av + Bf (v + η),
where η stands for the disturbance arising from, for example, the observer
error. Assume that |f (v)|∞ ≤ 1 for all v ∈ Rn and that there exist a
set D0 ⊂ Rn and positive numbers γ and d0 such that the solution of the
system satisfies
where
va = lim sup |v(t)|∞ .
t→∞
v̇ = Av + Bu.
Lemma 12.5.2. There exists an L ∈ R(n+r)×q such that, under the con-
trol
u1 (t), t < T0 ,
u=
f (z̄ − V w̄), t ≥ T0 ,
where u1 is any admissible control, the solution of the system (12.5.2)
satisfies
lim (z(t) − V w(t)) = 0, ∀ (z0 , w0 , z̃ 0 , w̃ 0 ) ∈ DM .
t→∞
lim ṽ(t) = 0,
t→∞
12.5. Error Feedback Controllers 293
that
lim (z(t) − V w(t)) = lim v(t) = 0.
t→∞ t→∞
2
Lemma 12.5.2 means that we can keep z(t) bounded if (z0 , w0 , z̃ 0 , w̃ 0 ) ∈
DM . Just as the state feedback case, we want to move z(t) to the origin
gradually by making z(t) − αk V w(t) small with increased k. Due to the
switching nature of the final controller and that the feedback has to be based
on (z̄, w̄), we need to construct a sequence of sets which are invariant with
respect to (z̄, w̄) rather than (z, w) under the corresponding controllers.
Using linear system theory, it is easy to design an F ∈ Rm×n , along
with a matrix X > 0 such that A + BF is Hurwitz and the set
E(X) = v ∈ Rn : v T Xv ≤ 1
is inside the linear region of the saturation function sat(F v) and that for
some positive number d1 , E(X) is invariant for the system
Let α and N be determined from X in the same way as with the state
feedback controller. With F ∈ Rm×n , we form a sequence of controllers,
% &
F (z̄ − αk V w̄)
u = fk (z̄, w̄, α) = (1 − α )Γw̄ + α sat
k k
, k = 0, 1, · · · , N,
αk
and % &
F z̄
u = fN +1 (z̄, w̄) = Γw̄ + δ sat .
δ
Under the control u = fk (z̄, w̄, α), consider vk = z̄ − αk V w̄, then we get
% &
F vk
v̇k = Avk + αk B sat − (L1 − αk V L2 )C z̃. (12.5.7)
αk
Note the difference between this equation and the corresponding (12.4.12)
in the state feedback case. Here we need to take into account the extra
term (L1 − αk V L2 )C z̃. For clarity, we split the discussion into three cases.
Case 1. k = 0
Let v = z̄ − V w̄. Then the system
Since
lim z̃(t) = 0,
t→∞
we also have
lim v(t) = 0, ∀ v0 ∈ E(X).
t→∞
Case 2. 0 < k ≤ N
Similar to Lemma 12.4.3, we have an invariant set αk E(X) for the system
(12.5.7) under the restriction
(L1 − αk V L2 )C z̃ ≤ αk d1 . (12.5.9)
∞
Also, because
lim z̃(t) = 0,
t→∞
we have
lim vk (t) = 0, ∀ v0 ∈ αk E(X).
t→∞
Case 3. k = N + 1
and
Dz̄Nw̄+1 = (z̄, w̄) ∈ Rn × Rr : z̄ ∈ δE(X) .
Lemma 12.5.3. Suppose that F is chosen such that E(X) is invariant for
the system (12.5.6) and that z̃ satisfies all the conditions (12.5.8), (12.5.9)
and (12.5.10). Then, under the control u = fk (z̄, w̄, α), the set Dz̄k w̄ is
invariant for the system (12.5.1) and
lim z̄(t) − αk V w̄(t) = 0.
t→∞
where
fN +1 (z̄, w̄), if (z̄, w̄) ∈ ΩN +1 := Dz̄Nw̄+1 ,
+1 j
fk (z̄, w̄, α), if (z̄, w̄) ∈ Ωk := Dz̄k w̄ \ ∪N j=k+1 Dz̄ w̄ ,
g(z̄, w̄, α, N ) =
k = 0, 1, . . . , N,
+1 j
f (z̄ − V w̄), if (z̄, w̄) ∈ Ω := Rn × Rr \ ∪N j=0 Dz̄ w̄ .
Note that
% &
F (z̄ − αk V w̄)
|fk (z̄, w̄, α)|∞
= (1 − α )Γw̄ + α sat
k k
≤1
αk
∞
can be satisfied if
|Γw̄|∞ ≤ |Γw|∞ + |Γw̃|∞ ≤ 1. (12.5.13)
Let d0 and d1 be given, then by Lemma 12.5.1, all the conditions
(12.5.5), (12.5.8), (12.5.9), (12.5.10) and (12.5.13) can be satisfied for t ≥ T0
by suitably choosing L. Note that, in (12.5.13), |Γw|∞ ≤ ρ < 1.
Then u = f (z̄ − V w̄) will be in effect. This is just the situation described
in Lemma 12.5.2. So we have
and similar to the proof of Theorem 12.4.1, it can be shown that (z̄, w̄) will
enter Ω0 , or some other Ωk at a finite time.
lim z̃(t) = 0,
t→∞
Similar to the proof of Theorem 12.4.1, it can be shown that (z̄, w̄) will
enter Ωk1 for some k1 > k.
By repeating this procedure, we will have (z̄, w̄) ∈ ΩN +1 at some finite
time, and hence
lim z̄(t) = 0.
t→∞
Also, since
lim z̃(t) = 0,
t→∞
we have
lim z(t) = 0.
t→∞
Remark 12.5.1. It is difficult to tell which set will (z̄, w̄) belong for the
time interval [0, T0 ] because of the observation error. It might switch among
the sets Ω, Ωk , k = 0, · · · , N + 1, frequently. To avoid frequent switching of
the controller, we may simply set u = 0 for t < T0 .
12.6. An Example 297
12.6. An Example
In this section, we will apply the results developed above to the control of
an aircraft. Consider the longitudinal dynamics of the TRANS3 aircraft
under certain flight condition [51],
ẋ = Ax + Bu,
(12.6.1)
y = Cx,
with
0 14.3877 0 −31.5311
−0.0012 −0.4217 1.0000 −0.0284
A=
0.0002 −0.3816 −0.4658
,
0
0 0 1.0000 0
4.526
−0.0337
B = 0.1745 × ,
−1.4566
0
and
C= 1 0 0 0 ,
where the state consists of the velocity x1 , the angle of attack x2 , the pitch
rate x3 and the Euler angle rotation of aircraft about the inertial y-axis x4 ,
the control u is the elevator input, whose value is scaled between +1 and −1
(corresponding to ±10o). The design objective is to reject the disturbance
P w, where
−0.6526 −0.3350 0.4637 0.9185
0.0049 0.0025 −0.0035 −0.0068
P = 0.2100
,
0.1078 −0.1492 −0.2956
0 0 0 0
and w contains the frequencies of 0.1 rad/s and 0.3 rad/s. Clearly, this
problem can be cast into an output regulation problem for the system
(12.2.6) with
0 −0.1 0 0
0.1 0 0 0
S= 0
,
0 0 −0.3
0 0 0.3 0
and Q = 0. A solution to the linear matrix equations (12.3.1) is
Π = 0, Γ = 0.8263 0.4242 −0.5871 −1.1630 .
298 Chapter 12. Output Regulation – Continuous-Time Systems
300
200
100
−100
−200
−300
−300 −200 −100 0 100 200 300
u = fk (z2 , w, α), k = 0, 1, · · · , 4,
is observable.
With the design method proposed in Chapter 6, we obtain a controller
u = f (v2 ) = sat(F0 v2 ), F0 = −0.0175 0.0103 ,
v̇2 = A2 v2 + B2 f (v2 + η)
300 Chapter 12. Output Regulation – Continuous-Time Systems
1500
1000
500
error e
−500
−1000
0 50 100 150 200 250 300 350
time t
1.5
1
−: control u; −.: the switching history
0.5
−0.5
−1
−1.5
0 50 100 150 200 250 300 350
time t
Figure 12.6.3: The control and the switching history – state feedback.
12.7. Conclusions 301
satisfies an asymptotic bound from D0 with a finite gain and nonzero re-
striction, where D0 is also very close to the null controllable region (see
Fig. 12.6.4, where the outermost solid closed curve is the boundary of C 2
and the dash-dotted closed curve is the boundary of D0 ).
With the error feedback design method proposed in this chapter, we
obtain for the anti-stable subsystem
F = 0.0378 0.0357
12.7. Conclusions
In this chapter, we have systematically studied the problem of output regu-
lation for linear systems subject to actuator saturation. The plants consid-
ered here are general and can be exponentially unstable. We first charac-
terized the asymptotically regulatable region, the set of initial conditions of
the plant and the exosystem for which output regulation can be achieved.
We then constructed feedback laws, of both state feedback and error feed-
back type, that achieve output regulation on the asymptotically regulatable
region.
302 Chapter 12. Output Regulation – Continuous-Time Systems
300
200
100
−100
−200
−300
−300 −200 −100 0 100 200 300
1500
1000
500
error e
−500
−1000
−1500
0 50 100 150 200 250 300 350
time t
6
x 10
5
−1
−2
−3
−4
−5
0 5 10 15 20 25 30
time t
1.5
1
−: control u; −.: the switching history
0.5
−0.5
−1
−1.5
0 50 100 150 200 250 300 350
time t
Figure 12.6.7: The control and the switching history – error feedback.
304 Chapter 12. Output Regulation – Continuous-Time Systems
Chapter 13
Output Regulation –
Discrete-Time
Systems
13.1. Introduction
305
306 Chapter 13. Output Regulation – Discrete-Time Systems
we will also briefly state the discrete-time version of the classical results on
the problem of output regulation for continuous-time linear systems of [22].
Section 13.3 characterizes the regulatable region. Sections 13.4 and 13.5
respectively construct state feedback and error feedback laws that achieve
output regulation on the regulatable region. Finally, Section 13.6 draws a
brief conclusion to the chapter.
Assumption 13.2.1.
A1. The eigenvalues of S are on or outside of the unit circle.
lim e(k) = 0.
k→∞
For this reason, we will restrict our attention to exosystem initial conditions
in the following compact set
W 0 = w0 ∈ Rr : |Γw(k)|∞ = ΓS k w0 ∞ ≤ ρ, ∀ k ≥ 0 , (13.3.4)
for some ρ ∈ [0, 1). For later use, we also denote δ = 1 − ρ. We note that
the compactness of W 0 can be guaranteed by the observability of (Γ, S).
Indeed, if (Γ, S) is not observable, then the exosystem can be reduced to
make it so.
We can now precisely define the notion of asymptotically regulatable
region as follows.
Definition 13.3.1.
1) Given K > 0, a pair (z0 , w0 ) ∈ Rn × W 0 is regulatable in K steps if
there exists an admissible control u, such that the response of (13.3.2)
satisfies z(K) = 0. The set of all (z0 , w0 ) regulatable in K steps is
denoted as Rg (K).
310 Chapter 13. Output Regulation – Discrete-Time Systems
3) The set of all (z0 , w0 ) for which there exist admissible controls such
that the response of (13.3.2) satisfies
lim z(k) = 0
k→∞
Remark 13.3.1. We would like to note that the regulatable region is de-
fined in terms of
lim z(k) = 0
k→∞
rather than
lim e(k) = 0.
k→∞
Requiring the former instead of the latter will also guarantee the closed-
loop stability in the absence of w. As in the continuous-time case (Chapter
12), this will result in essentially the same description of the regulatable
region.
Clearly,
C= C(K)
K∈[0,∞)
and K−1
−i−1
C(K) = A Bu(i) : u∞ ≤ 1 . (13.3.5)
i=0
13.3. The Regulatable Region 311
It is also clear that the null controllable region and the asymptotically null
controllable region are identical if the pair (A, B) is controllable. Some
simple methods to describe C and C a were developed in Chapter 3.
To simplify the characterization of Rg and Rag and without loss of gen-
erality, let us assume that
z1
z= , z1 ∈ Rn1 , z2 ∈ Rn2 ,
z2
and
A1 0 B1
A= , B= , (13.3.6)
0 A2 B2
where A1 ∈ Rn1 ×n1 is semi-stable (i.e., all its eigenvalues are on or inside
the unit circle) and A2 ∈ Rn2 ×n2 is anti-stable (i.e., all its eigenvalues are
outside of the unit circle). The anti-stable subsystem
v2 (k + 1) = A2 v2 (k) + B2 u(k)
Then,
a) Rg 2 (K) = (z2 , w) ∈ Rn2 × W 0 : z2 − V (K)w ∈ C 2 (K) ; (13.3.9)
b) Rg 2 = (z2 , w) ∈ Rn2 × W 0 : z2 − V2 w ∈ C 2 ; (13.3.10)
c) Rag = Rn1 × Rg 2 . (13.3.11)
Proof.
a) Given (z20 , w0 ) ∈ Rn2 × W 0 and an admissible control u, the solution
of (13.3.7) at k = K is,
K−1
K−1
z2 (K) = AK z20 + A−i−1
2 B2 u(i) − A−i−1
2 B2 ΓS i w0 . (13.3.12)
i=0 i=0
K−1
K−1
− A−i−1
2 B2 ΓS i = A−i−1
2 (−A2 V2 + V2 S)S i
i=0 i=0
K−1
= −A−i i −i−1
2 V2 S + A2 V2 S i+1
i=0
= −V2 + A−K
2 V2 S
K
= −V (K), (13.3.13)
where the third “=” is simply obtained by expanding the terms in the
summation and cancelling all the middle terms. Thus,
K−1
A−K z2 (K) = z20 − V (K)w0 + A−i−1 B2 u(i).
i=0
lim V (K) = V2 .
K→∞
z20 − V2 w0 ∈ C 2 =⇒ (z20 , w0 ) ∈ Rg 2 .
Since
lim V (K) = V2 ,
K→∞
there is a K2 > K1 such that z20 − V (K2 )w0 ∈ C 2 (K1 ) ⊂ C 2 (K2 ). It follows
from a) that (z20 , w0 ) ∈ Rg 2 (K2 ) ⊂ Rg 2 .
Next we show that
(z20 , w0 ) ∈ Rg 2 =⇒ z20 − V2 w0 ∈ C 2 .
Denote
Z2 = δA−K
2
1
v2 : v2 ∈ C 2 .
For each z2 ∈ Z2 , there is an admissible control u2 such that
∞
∞
z2 = δA−K
2
1
A−i−1 B2 u2 (i) = A−i−1 B2 δu2 (i − K1 ). (13.3.16)
i=0 i=K1
∞
+ A−i−1 B2 δu2 (i − K1 ) − ΓS i w0
i=K1
K 1 −1
=− A−i−1 B2 u1 (i)
i=0
∞
+ A−i−1 B2 δu2 (i − K1 ) − ΓS i w0
i=K1
∈ C2.
The last step follows from the fact that |ΓS i w0 |∞ ≤ ρ = 1 − δ for all i ≥ 0
and thus |δu2 (i − K1 ) − ΓS i w0 |∞ ≤ 1. This implies that
z20 − V2 w0 + z2 : z2 ∈ Z2 ⊂ C 2 .
Since A−K
2
1
is nonsingular, the set Z2 contains the origin in its interior. It
follows that z20 − V2 w0 ∈ C 2 .
c) It is easy to see that Rag ⊂ Rn1 × Rg 2 . We need to show that
R × Rg 2 ⊂ Rag .
n1
Since
z1 (K)
0
is inside the asymptotically null controllable region of the above system
under the constraint uδ ∞ ≤ δ (see Chapter 3), there exists a uδ such
that
lim z(k) = 0.
k→∞
Hence (z0 , w0 ) ∈ Rag . This establishes that Rn1 × Rg 2 ⊂ Rag and hence
Rn1 × Rg 2 = Rag . 2
if and only if
x20 − (Π2 + V2 )w0 ∈ C 2 ,
where
Π2 = 0 In2 Π.
In this section, we will construct a feedback law that solves the problem of
output regulation by state feedback for linear systems subject to actuator
saturation. We assume that a stabilizing state feedback law u = f (v),
|f (v)|∞ ≤ 1 for all v ∈ Rn , has been designed and the equilibrium v = 0 of
the closed-loop system
Denote the time response of z(k) to the initial state (z0 , w0 ) as z(k, z0 , w0 )
and define
S zw := (z0 , w0 ) ∈ R × W 0 : lim z(k, z0 , w0 ) = 0 .
n
k→∞
Since Rag is the set of all (z0 , w0 ) for which z(k) can be driven to the origin
asymptotically, we must have S zw ⊂ Rag . Our objective is to design a
316 Chapter 13. Output Regulation – Discrete-Time Systems
This will be the case if A and S have no common eigenvalues. With the
decomposition in (13.3.6), if we partition V accordingly as
V1
V = ,
V2
then V2 satisfies
−A2 V2 + V2 S = −B2 Γ.
Denote
Dzw := (z, w) ∈ Rn × W 0 : z − V w ∈ S . (13.4.4)
Observation 13.4.1.
b) In the absence of w,
x0 ∈ S =⇒ (z0 , 0) ∈ Dzw .
For this system, Dzw is an invariant set and for all (z0 , w0 ) ∈ Dzw ,
and
lim (z(k) − V w(k)) = lim v(k) = 0.
k→∞ k→∞
(A + BF )T X(A + BF ) − X < 0
318 Chapter 13. Output Regulation – Discrete-Time Systems
|F v|∞ ≤ 1, ∀ v ∈ E(X).
Then E(X) is an invariant set and is in the domain of attraction for the
closed-loop system (13.4.6).
then for any α > 0, αD is an invariant set in the domain of attraction for
the system % &
v(k)
v(k + 1) = Av(k) + αBf . (13.4.8)
α
Proof. Write (13.4.8) as
% &
v(k + 1) v(k) v(k)
=A + Bf ,
α α α
v(k)
and replace α with v(k), then we get (13.4.7). If v0 ∈ αD, i.e.,
v0
∈ D,
α
then,
v(k)
∈ D, ∀k > 0
α
and
lim v(k) = 0.
k→∞
2
For any α ∈ (0, 1), there exists a positive integer N such that
1
αN X 2 V w < δ, ∀ w ∈ W 0 , (13.4.9)
and if = N + 1, then,
N +1
lim z(k) = 0, ∀ (z0 , w0 ) ∈ Dzw .
k→∞
It follows from Lemma 13.4.2 that α E(X) is an invariant set in the domain
of attraction for the v -system. Hence Dzw is invariant for the system
(13.4.10) and if (z0 , w0 ) ∈ Dzw , i.e.,
v0 = z0 − α V w0 ∈ α E(X),
320 Chapter 13. Output Regulation – Discrete-Time Systems
then,
lim (z(k) − α V w(k)) = lim v (k) = 0.
k→∞ k→∞
For % &
Fz
u = fN +1 (z, w) = Γw + δsat ,
δ
we have % &
F z(k)
z(k + 1) = Az(k) + δBsat
δ
and the same argument applies. 2
u = g(z, w, α, N )
fN +1 (z, w), if (z, w) ∈ ΩN +1 := Dzw
N +1
,
f (z, w, α), if (z, w) ∈ Ω := D \ ∪N +1 Dj ,
zw j=+1 zw
=
= 0, 1, · · · , N,
+1 j
f (z − V w), if (z, w) ∈ Ω := R × W 0 \ ∪N
n
j=0 Dzw .
satisfies
lim z(k) = 0,
k→∞
i.e., Dzw ⊂ S zw .
13.4. State Feedback Controllers 321
Hence there is a finite step k0 ≥ 0 such that z(k0 ) − V w(k0 ) ∈ E(X), i.e.,
0
(z(k0 ), w(k0 )) ∈ Dzw . The condition (z(k), w(k)) ∈ Dzw
, > 0, might be
satisfied at a smaller step k1 ≤ k0 . In any case, there is a finite step k1 ≥ 0
such that
+1
(z(k1 ), w(k1 )) ∈ Ω = Dzw
\ ∪N j
j=+1 Dzw
u = f (z, w, α),
lim (z(k) − α V w(k)) = 0.
k→∞
Since the first term on the right-hand side goes to zero asymptotically, there
exists a finite k2 > k1 such that
1
2
X (z(k2 ) − α+1 V w(k2 )) ≤ α+1 .
i.e.,
(z(k2 ), w(k2 )) ∈ Dzw
+1
.
If = N , then, by (13.4.9),
1 1 1
2 2
X z ≤ X (z − αN V w) + αN X 2 V w
1
< X 2 (z − αN V w) + δ.
Also, the first term goes to zero asymptotically, so there exists a finite
integer k2 such that 1
2
X z(k2 ) ≤ δ,
N +1
i.e., (z(k2 ), w(k2 )) ∈ Dzw .
+q
Just as before, (z, w) might have entered Dzw , q > 1, before it enters
+1
Dzw . In any case, there is a finite k such that
+1
(z(k), w(k)) ∈ Ω+q = Dzw
+q
\ ∪N j
j=+q+1 Dzw ,
for some q ≥ 1. After that, the controller will be switched to f+q (z, w, α).
It is also important to note that, by Lemma 13.4.3, Dzw is invariant
under the control u = f (z, w, α). Once (z, w) ∈ Ω ⊂ Dzw , it will never
lim z(k) = 0.
k→∞
From the proof of Theorem 13.4.1, we see that for all (z0 , w0 ) ∈ Dzw ,
the number of switches is at most N + 2.
In what follows, we will deal with the case that there is no V that
satisfies
−AV + V S = −BΓ.
This will occur if A and S have some same eigenvalues on the unit circle.
Another case is that some eigenvalues of A and S on the unit circle are very
close. This will result in large elements of V , so α could be very close to 1
13.4. State Feedback Controllers 323
and N could be very large. The following method is derived to deal with
these two cases.
Suppose that the z-system (13.3.2) has the following form,
z1 (k + 1) A1 0 z1 (k) B1 B1 Γ
= + u(k) − w(k)
z2 (k + 1) 0 A2 z2 (k) B2 B2 Γ
(13.4.15)
n1 ×n1 n2 ×n2
with A1 ∈ R semi-stable and A2 ∈ R anti-stable. Also suppose
that there is a known function f (v2 ), |f (v2 )|∞ ≤ 1 for all v2 ∈ Rn2 , such
that the origin of the following system
−A2 V2 + V2 S = −B2 Γ,
z20 − V2 w0 ∈ S 2 , ∀ w0 ∈ W 0 ,
lim z2 (k) = 0.
k→∞
Hence, there is a finite integer k1 > 0 such that z(k1 ) ∈ Rn1 × δS 2 . After
that if we switch to the control u = Γw + δ sat(h(z), we have
and z will stay in S δ = Rn1 ×δS 2 and converge to the origin asymptotically.
In summary, we have the controller,
g(z2 , w, α, N ), if z2 ∈ Rn2 \ δS 2 ,
u=
Γw + δ sat(h(z)), if z2 ∈ δS 2 ,
and under this control, the following set
(z, w) ∈ Rn × W 0 : z2 − V2 w ∈ S 2
will be a subset of S zw .
lim z(k) = 0
k→∞
Hence,
(z − V w)(n0 ) = An0 (z − V w)(0) = An0 (z0 − V w0 ).
For (z0 , w0 ) to be in S zw , it suffices to have
i.e.,
z(n0 ) − V w(n0 ) ∈ S.
This is in turn equivalent to
(z0 , w0 ) ∈ Dzw := (z, w) ∈ R × W 0 : An0 (z − V w) ∈ S .
13.6. Conclusions
In this chapter, we have systematically studied the problem of output reg-
ulation for linear discrete-time systems subject to actuator saturation. The
plants considered here are general and can be exponentially unstable. We
first characterized the regulatable region, the set of initial conditions of the
plant and the exosystem for which output regulation can be achieved. We
then constructed feedback laws, of both state feedback and error feedback
type, that achieve output regulation on the regulatable region.
326 Chapter 13. Output Regulation – Discrete-Time Systems
Chapter 14
14.1. Introduction
327
328 Chapter 14. Non-Actuator Saturation
linear systems under state saturation. Section 14.2 deals with continuous-
time systems and Section 14.3 deals with discrete-time systems. Finally, in
Section 14.4, we present a semi-global stabilization result for linear systems
subject to sensor saturation.
lim φ(t, x0 ) = 0, ∀ x0 ∈ U0 ,
t→∞
We see that the dynamics of the system (14.2.1) and hence its stability
properties are equivalent to those of the system
ẋ = A sat(x). (14.2.2)
For this reason, we will focus on (14.2.2) in this section. This is also the rea-
son why we refer to system (14.2.1) as linear system under state saturation.
to make a21 ≥ 0.
Our main result in this section, presented in the following theorem, gives
a complete description of the stability properties of the system (14.2.2) with
A given in (14.2.3). As explained above, any system of the form (14.2.1)
with Hurwitz A can be transformed into the form of (14.2.3).
a) a22 < 0;
On the other hand, if none of a) and b) is satisfied, the system will have
diverging trajectories and there will also be a closed trajectory.
i) a22 < 0;
The facts that a22 ≥ 0 and A is Hurwitz imply that a12 > 0.
The proof can be divided into three parts. In the first part, the vector field
is studied in detail and some constants are captured to characterize the
vector field. In the second part, we show that it is these constants, rather
than the stability of the A matrix, that determine the global boundedness
14.2. State Saturation – Continuous-Time Systems 331
ẋ = Ax.
In the region
U := x : |x1 | ≤ 1, x2 ≥ 1 ,
we have
Since a11 ≤ a12 , ẋ1 ≥ 0 in this region and the trajectories go rightward.
Also note that ẋ is independent of x2 , so for all the points on a vertical
line x1 = c in this region, ẋ is the same. Because of this, if x0 ∈ U and
ψ(t, x0 ) ∈ U for all t ∈ [0, t1 ], then with ∆ > 0,
%
&
0 0
ψ t, x0 + = ψ(t, x0 ) + , ∀ t ∈ [0, t1 ]. (14.2.5)
∆ ∆
332 Chapter 14. Non-Actuator Saturation
U
3
−R V
2
0 −W W
−1
−2
−V R
−3
−U
−4
−4 −3 −2 −1 0 1 2 3 4
ẋ and the slope of the trajectories are constants. We denote the constant
slope η(x) as
−a21 + a22
α := . (14.2.7)
−a11 + a12
In the region
W := x : x1 ≥ 1, |x2 | ≤ 1 ,
we have
If a11 a21 = a12 a22 and βh1 + h2 = 0, then outside certain region, all
the trajectories are closed. If a11 a21 < a12 a22 (or a11 a21 = a12 a22 and
βh1 + h2 > 0), there will be diverging trajectories and if in addition A is
Hurwitz, there exists a closed trajectory.
14.2. State Saturation – Continuous-Time Systems 335
Proof. Since a12 > a11 > 0 and a21 > a22 ≥ 0, we have α < 0, β > 0,
|h1 |, |h2 | < ∞.
Denote % &
α
u∗ = max 0, (βh1 + h2 ) , αh1 .
β
Let
1
p1 = , uk ≥ u∗ ,
uk + 1
be a point on the line x1 = 1 (see the point labeled “1” in Fig. 14.2.2). Let
the trajectory starting from p1 be ψ(t, p1 ). We will show later that ψ(t, p1 )
will go through regions V , W , R, and −U consecutively (not fall into the
central square before leaving −U ). Let the intersections of ψ(t, p1 ) with
the lines x2 = 1, x2 = −1, x1 = 1 and x1 = −1 be, respectively,
1 + vk 1 + wk 1 −1
, , , ,
1 −1 −1 − rk −1 − uk+1
9
8
2 1
7 2
6 3
−2 4
5
−4
−6
−8 −6 −4 −2 0 2 4 6
1
starting from will be unbounded. It can also be shown with
1 + uk
Poincaré-Bendixon Theorem that there exists a closed trajectory within
some region.
In this case, − α
β
= 1 and uk+2 = uk + 2(βh1 + h2 ). If βh1 + h2 < 0, the
sequence uk , uk+2 , · · ·, will decrease steadily before the trajectory touches
the central square. A global attractor can be constructed.
If βh1 + h2 > 0, the sequence uk , uk+2 , · · ·, will increase steadily and
the trajectory will go unbounded. And similar to Case 1, there exists a
closed trajectory.
If βh1 + h2 = 0, then for uk sufficiently large, the trajectory
%
&
1
ψ t,
1 + uk
14.2. State Saturation – Continuous-Time Systems 337
is closed. 2
Part 3. Proof of Theorem 14.2.1
In view of Remark 14.2.1 and Proposition 14.2.1, we only need to consider
the following system
−a11 a12
ẋ = A sat(x) = sat(x), a12 > a11 > 0, a21 > a22 ≥ 0.
−a21 a22
(14.2.13)
This proposition can be established as follows. If a11 a21 < a12 a22 , then
by Proposition 14.2.2, the system is not globally asymptotically stable.
So the necessity of the condition is obvious. What remains is to show the
sufficiency of the condition. If a11 a21 = a12 a22 and A is Hurwitz, then it can
be verified that βh1 + h2 < 0. So, by Proposition 14.2.2, if a11 a21 ≥ a12 a22 ,
then the system has a global attractor. Now that the global boundedness
of the trajectories is guaranteed, the only thing that needs to be shown is
that the system has no closed trajectory.
Since all the trajectories are kept unchanged when the vector field is
multiplied by a positive constant, we assume that a11 = 1 in the sequel for
simplicity. Now we have,
−1 a12
A= .
−a21 a22
The restriction a12 > a11 > 0 and the assumption that A is Hurwitz trans-
late to a12 > 1, a22 < 1 and ka212 > 1. And the condition a11 a21 ≥ a12 a22
is equivalent to k ≥ 1. Therefore, we can establish Proposition 14.2.3 by
showing that the system
−1 a12
ẋ = A sat(x) = sat(x), a12 > 1, k ≥ 1, 0 < a22 < 1,
−ka12 a22 a22
(14.2.15)
is globally asymptotically stable. The proof will be carried out by evolving
A from the simplest form where a22 = 1, k = 1, to the case a22 = 1, k ≥ 1,
and finally to the general case 0 < a22 < 1, k ≥ 1. When a22 = 1, the system
is surely not globally asymptotically stable because A is not Hurwitz, but
the trajectories in this case will be used as a reference for showing the
convergence of the trajectories when a22 decreases.
Let’s start with a simple model (the primary model),
−1 a12
ẋ = sat(x), a12 > 1. (14.2.16)
−a12 1
Lemma 14.2.2. All the trajectories of (14.2.16) are closed. Each trajec-
tory is symmetric with respect to the line x1 = x2 and the line x1 = −x2 .
Then, ψ2 (t, x∗ ) is a closed curve that lies within the central square. Denote
the region enclosed by ψ2 (t, x∗ ) as S0 , then every point inside S0 is on a
closed trajectory. And outside S0 , every trajectory will converge to ψ2 (t, x∗ )
(see Fig. 14.2.3).
14.2. State Saturation – Continuous-Time Systems 339
−1
−2
−3
−4
−2 −1.5 −1 −0.5 0 0.5 1 1.5 2 2.5
where a12 > 1, k ≥ 1 and 0 < a22 < 1. We will consider the perturba-
tion of the trajectories as a22 varies. Denote the trajectory of (14.2.18) as
ψ(t, x0 , a22 ). As compared with (14.2.17), ẋ1 is the same but ẋ2 is multi-
plied with a scalar a22 . Because of this, the trajectories of (14.2.18) exhibit
some interesting properties.
Fact 14.2.1.
a) Let
x01
x0 = , x02 ≥ 1,
x02
be a point above the line x2 = 1, then for all a22 > 0,
0 1 ψ(t, x0 , a22 ) − x02 = a22 0 1 ψ2 (t, x0 ) − x02 ,
and
1 0 ψ(t, x0 , a22 ) = 1 0 ψ2 (t, x0 ),
b) Let
x01
x0 = , x01 ≥ 1,
x02
be a point to the right of the line x1 = 1, then for all a22 > 0,
1
1 0 ψ(t, x0 , a22 ) − x01 = 1 0 ψ2 (a22 t, x0 ) − x01 ,
a22
and
0 1 ψ(t, x0 , a22 ) = 0 1 ψ2 (a22 t, x0 ),
An illustration for Fact 14.2.1 is given in Fig. 14.2.4, where the ‘∗’s are
x0 , the solid curves are ψ2 (t, x0 ), and the dashed curves are ψ(t, x0 , a22 ),
a22 < 1.
With Fact 14.2.1, we are ready to present a final lemma that leads to
the proof of Proposition 14.2.3.
14.2. State Saturation – Continuous-Time Systems 341
2.5
1.5
0.5
−0.5
−1
−1.5
−2
−2 −1.5 −1 −0.5 0 0.5 1 1.5 2
and xc1 < x01 , then if δ ∈ (0, a22 ) is sufficiently small, ψ(t, x0 , a22 − δ) will
intersect with ψ(t, x0 , a22 ) at a point to the right of x0 . If it also intersects
the line x2 = −x02 , the intersection will be to the left of xc .
since for the points to its right, they can be traced back to the left as the
trajectories go rightward above the line x2 = 1. Let
x01 1
x0 = , x01 < ,
1 ka12
then ψ2 (t, x0 ) of the system (14.2.17) (see the solid curve in Fig. 14.2.5)
will return to the line x2 = 1 at a point x0 . From Fact 14.2.1, ψ(t, x0 , a22 )
1.5
XO ’
XO
1
0.5
−0.5
−1
XC
−1.5
−1.5 −1 −0.5 0 0.5 1 1.5
will also return to x0 for all a22 > 0 (see the dashed curve in Fig. 14.2.5).
1
ka12
We have shown in Lemma 14.2.4 that for any x0 to the left of ,
1
if ψ2 (t, x0 ) reaches the line x2 = −1 at some point
xc1
xc = ,
−1
then xc1 < −x01 , i.e., xc is to the left of −x0 . It can be shown that xc is
also to the left of x0 . From Lemma 14.2.5, we know that as a22 decreases
from 1, the intersection of ψ(t, x0 , a22 ) and x2 = −1 will move leftward,
hence remain to the left of −x0 and x0 . Note that x0 is on ψ(t, x0 , a22 ),
14.3. State Saturation – Discrete-Time Systems 343
so ψ(t, x0 , a22 ) overlaps with ψ(t, x0 , a22 ). Therefore, x0 is not on a closed
trajectory, since a closed trajectory must be symmetric.
The possibility of the existence of a closed trajectory that does not
intersect with x2 = 1 can be excluded similarly.
If k > 1, then a11 a21 > a12 a22 ; if k = 1, then a11 a21 = a12 a22 and it
can be verified that β = a22 and
a12 a21
> .
a11 a22
Since A is Hurwitz,
a12 a22
> .
a11 a21
Therefore,
% &
a12 a21 a22
> max , ≥ 1,
a11 a22 a21
i.e., a12 > a11 . Hence by Propositions 14.2.1 and 14.2.2, the system is not
globally asymptotically stable whether a21 > a22 or not. And in both cases,
the system has unbounded trajectories and there is also a closed trajectory.
2
344 Chapter 14. Non-Actuator Saturation
Given an initial state x(0) = x0 , denote the trajectory of the system (14.3.1)
that passes through x0 at k = 0 as ψ(k, x0 ). In this section, we only consider
the positive trajectories. Hence, throughout the section, k ≥ 0.
Definition 14.3.1. The equilibrium x = 0 of the system (14.3.1) is said
to be stable if, for any ε > 0, there exists a δ > 0 such that,
|ψ(k, x0 )| ≤ ε, ∀ k ≥ 0, ∀ |x0 | ≤ δ.
It is said to be globally asymptotically stable if it is a stable equilibrium
and
lim ψ(k, x0 ) = 0, ∀ x0 ∈ R2 .
k→∞
14.3. State Saturation – Discrete-Time Systems 345
lim ψ(k, x0 ) = 0, ∀ x0 ∈ U0 ,
k→∞
(A3 )12
x1 = = 0.7308,
1 − (A3 )11
14.3. State Saturation – Discrete-Time Systems 347
The proof of Theorem 14.3.1 consists of three parts. In the first part,
some general properties of the limit trajectories of the system (14.3.1) are
characterized. An outstanding feature is that a nontrivial limit trajectory
can only intersect with two pairs of opposite sides of the unit square and
it cannot have intersections with both of the neighboring sides. This result
turns our attention to a simpler system which has only one saturated state,
a11 x1 (k) + a12 x2 (k)
x(k + 1) = . (14.3.3)
sat(a21 x1 (k) + a22 x2 (k))
This system will be studied in detail in Part 2. In particular, we will
establish a relation between the present intersection of a trajectory with
the lines x2 = ±1 and the next intersection. This relation helps us to
identify the condition for the system (14.3.3) to be globally asymptotically
stable, which in turn leads to our final proof of the main result in Part 3.
Part 1. Limit Trajectories
To prove that the system (14.3.1) is globally asymptotically stable, we
need to show that the only limit point of any trajectory is the origin. It
348 Chapter 14. Non-Actuator Saturation
is well-known [74] that the system may have stationary points other than
the origin, and it may have periodic trajectories and even trajectories that
neither are periodic, nor go to a stationary point. In this part, we are going
to characterize some general properties of the non-convergent trajectories.
These properties will facilitate us to exclude the existence of such non-
convergent trajectories under the condition of Theorem 14.3.1.
Since every trajectory is bounded by the unit square, there exists a set
of points such that the trajectory will go arbitrarily close to them infinitely
many times.
lim ψ(ki , x0 ) = x∗ .
i→∞
The set of all such limit points is called the limit set of the trajectory. We
denote this limit set as Γ(x0 ).
We see that Lh and −Lh are the two horizontal sides of S, and Lv and −Lv
are the two vertical sides of S. Notice that they do not include the four
vertices of the unit square. Also, denote
1 −1
v1 = , v2 =
1 1
as the two upper vertices of the square.
Let y0 be a limit point of some non-convergent trajectory and for sim-
plicity, let yk = ψ(k, y0 ). Denote
Y = ± yk : k ≥ 0
and
AY = ± Ayk : k ≥ 0 .
Clearly, Y must have an intersection with the boundary of the unit square.
If Y ∩ Lh is not empty, define
x1
γ1 = inf x1 : ∈ Y ∩ (Lh ∪ {v1 , v2 }) ,
1
and
x1
γ2 = sup x1 : ∈ Y ∩ (Lh ∪ {v1 , v2 }) .
1
If Y ∩ Lv is not empty, define
1
γ3 = sup x2 : ∈ Y ∩ (Lv ∪ {v1 , −v2 }) ,
x2
and
1
γ4 = inf x2 : ∈ Y ∩ (Lv ∪ {v1 , −v2 }) .
x2
Proposition 14.3.1. Let y0 be a limit point of some trajectory.
b) If y0 ∈ Lv , then ψ(k, y0 ) will not touch Lh or −Lh and will stay inside
the strip
x1
: |x2 | ≤ max |γ3 |, |γ4 | ;
x2
350 Chapter 14. Non-Actuator Saturation
as shown in Fig. 14.3.1. In the figure, the points marked with “∗” belong
to AY , the polygon with dash-dotted boundary is the convex hull of AY
and the polygon with vertices ±ui , i = 1, 2, 3, 4, and some points in the
14.3. State Saturation – Discrete-Time Systems 351
1.5
1
u1 u2
−u
4
0.5
−u3
0
u3
−0.5
u4
−u2 −u1
−1
−1.5
−1.5 −1 −0.5 0 0.5 1 1.5
|ψ(K0 , x0 ) − y0 |∞ < ε.
|ψ(K1 + K0 , x0 ) − y0 |∞ < ε.
Define
Z(ε) = ψ(k + K0 , x0 ) : 0 ≤ k ≤ K1
and
AZ(ε) = Aψ(k + K0 , x0 ) : 0 ≤ k ≤ K1 .
Using similar arguments as in the finite element case, we can show that
A(AZ(ε))
|det (A)| = ≥ 1 − O(ε).
A(Z(ε))
b) Similar to a).
c) If, on the contrary, Y contains both v1 and v2 , then the convex hull
of Y is S. Also, AY contains a point
x1
Ayj = , x1 ≤ −1, x2 ≥ 1,
x2
and a point
x1
Ayk = , x1 ≥ 1, x2 ≥ 1,
x2
hence the convex hull of AY contains S. This also leads to A(AY ) ≥ A(Y ),
a contradiction. 2
Now we are clear from Proposition 14.3.1 that if there is any limit trajec-
tory, it can intersect only one pair of the sides of the unit square, either
(Lh , −Lh ), or (Lv , −Lv ), not both of them. Hence, we only need to investi-
gate the possibility that a limit trajectory only intersects ±Lh . The other
14.3. State Saturation – Discrete-Time Systems 353
possibility that it only intersects ±Lv is similar. For this reason, we restrict
our attention to the following system,
a11 x1 (k) + a12 x2 (k)
x(k + 1) = := sat2 (Ax(k)). (14.3.4)
sat(a21 x1 (k) + a22 x2 (k))
Let
αs = min |x1 | : x1 satisfies (14.3.5) ,
then no limit trajectory can exist completely within the strip
x1
: |x1 | < αs .
x2
We call x0 , x10 and x20 the first, the second and the third intersections,
respectively. We also call x0 and x10 the present and the next intersections.
Clearly, x10 is uniquely determined by x0 . We also see that the relation
x0 → x10 is a map from Leh to itself. To study the global asymptotic stability
of the system (14.3.4), it suffices to characterize the relation between x0 and
x10 . Through this relation, we can show that if (14.3.5) is not true for any
x1 , then for every x0 ∈ Leh , the intersections x10 , x20 , · · · will move closer and
closer toward an interval, and all the trajectories starting from this interval
will not touch the lines ±Leh and will converge to the origin.
Let x0 ∈ Leh . The next intersection of ψ2 (k, x0 ) with Leh occurs at step
k1 if
0 1 Ak1 x0 ≥ 1, 0 1 Ak x0 < 1, ∀ k < k1 .
The next intersection is x10 = ψ2 (k1 , x0 ) = sat2 (Ak1 x0 ) (or −sat2 (Ak1 x0 )).
Since for different x0 ∈ Leh , the number of steps for the trajectories to
return to ±Leh , i.e., the number k1 as defined above, is different, we see
that the relation between x0 and x10 must be discontinuous.
We will first determine an interval on Leh from which a trajectory will
not intersect ±Leh again (no x10 ) and will converge to the origin.
Since A is asymptotically stable, there exists a positive definite matrix
P such that
AT P A − P < 0.
Define the Lyapunov function as
V (x) := xT P x,
Let ρc be such that E(P, ρc ) ⊂ Se and E(P, ρc ) just touches ±Leh . In this
case, E(P, ρc ) has only one intersection with Leh . Let this intersection be
αc
pc = .
1
14.3. State Saturation – Discrete-Time Systems 355
there exists an interval around pc in Leh , of nonzero length, such that for
every x0 in this interval, ψ2 (k, x0 ) = Ak x0 , k ≥ 1, will never touch ±Leh
and will converge to the origin.
Here we will use a simple way to denote a line segment. Given two
points p1 , p2 ∈ R2 , denote
[p1 , p2 ] := λp1 + (1 − λ)p2 : 0 ≤ λ ≤ 1 ,
and similarly,
(p1 , p2 ] = [p1 , p2 ] \ {p1 },
Define
α
α0 := min α < αc : Ak ∈ Se , ∀ k ≥ 0 ,
1
and
β
β0 := max β > αc : Ak ∈ Se , ∀ k ≥ 0 .
1
Since a21
= 0, the line ALeh := {Ax : x ∈ Leh }, has intersections with
both Leh and −Leh . So there exist points on both sides of pc which will be
mapped out of Se under A. Hence, α0 and β0 are finite numbers. Now let
α0 β0
p0 = , q0 = .
1 1
It follows that for every x0 ∈ (p0 , q0 ), the linear trajectory Ak x0 will never
touch ±Leh for all k ≥ 1. This implies that p0 and q0 are the inner most
356 Chapter 14. Non-Actuator Saturation
pair of points on Leh where the discontinuity occurs on the relation between
x0 and x10 . Based on p0 and q0 , all the other discontinuous points on Leh
can be characterized with an inductive procedure (see [36]). Suppose that
the number of discontinuous points to the left of p0 is I and the number to
the right of q0 is J . Let the points pi , qj , i = 1, 2, · · · , I, j = 1, 2, · · · , J ,
be labeled such that pi is to the left of pi−1 and that qj is to the right of
qj−1 . For simplicity, denote
−∞ ∞
p∞ = , q∞ = .
1 1
Denote the second intersection of the trajectory ψ2 (k, pi ) with Leh as p1i
and that of ψ2 (k, qj ) with Leh as qj1 . The following lemma collects some
simple facts about the relations between pi , p1i , qj and qj1 .
Lemma 14.3.2.
d) For i, j ≥ 1, p1i and qj1 cannot be both in [pi , qj ], nor both outside
of [pi , qj ], i.e., there must be one of them inside [pi , qj ] and the other
one outside of the interval.
and
1
[qj−1 , qj−1 ] ⊂ [qj1 , qj ].
Item d) shows that we have either
or
[qj1 , qj ] ⊂ [pi , p1i ].
Item b) shows that all these intervals must include [p0 , q0 ]. In summary, the
facts in Lemma 14.3.2 jointly show that the intervals [pi , p1i ] and [qj1 , qj ],
14.3. State Saturation – Discrete-Time Systems 357
Lemma 14.3.3. Assume that the condition (14.3.5) is not true for any
x1 ∈ R. Consider x0 ∈ Leh . If x0 ∈ (p∞ , p0 ], then x10 ∈ (x0 , q∞ ) and one of
the following must be true:
b) x10 ∈ (x0 , p0 ];
Similarly, if x0 ∈ [q0 , q∞ ), then x10 ∈ (p∞ , x0 ) and one of the following must
be true.
e) x10 ∈ [q0 , x0 );
Also, if x0 ∈ (pi , p1i ) (or x0 ∈ (qj1 , qj )), then x10 , x20 and the subsequent
intersections will all be on the interval (pi , p1i ) (or (qj1 , qj )). Furthermore,
for any x0 ∈ Leh , there is a finite k1 such that ψ2 (k1 , x0 ) ∈ (p0 , q0 ). After
that, ψ2 (k, x0 ) will have no more intersection with Leh and will converge to
the origin.
Lemma 14.3.4.
Case 1. αc ≤ 0. Let
−αs
ps = ,
1
then p1s = ps ∈ (p∞ , p0 ). Suppose that ps ∈ [pi+1 , pi ). Then, for every
x0 ∈ (ps , p1i ], the trajectory ψ2 (k, x0 ) will converge to the origin;
In both cases, no limit trajectory can be formed completely inside the strip
x1
: |x1 | < αs .
x2
and the following argument also goes through). Note that, if there is some
x1 ∈ R, |x1 | ≤ 1, that satisfies (14.3.5), i.e.,
x1 x1 k x1
AN
=±
, 0 1 A ≤ 1, ∀ k < N,
1 1+d 1
would be less than the area of the convex hull of the set
± Ax0 , ±A2 x0 , · · · , ±AN x0 .
trajectory that include points in ±(Lh ∪ {v1 , v2 }), it must include at least
one point on ±Lh and one on ±v1 (or ±v2 ). Here we assume that it includes
v2 .
Let’s consider the trajectories ψ(k, v2 ) and ψ2 (k, v2 ). Suppose that
ψ(k, v2 ) has an intersection with ±Lh but does not include v1 and any
point in ±Lv , we conclude that ψ(k, v2 ) = ψ2 (k, v2 ) will converge to the
origin. The argument goes as follows.
Let k0 be the smallest k such that ψ(k, v2 ) intersects ±Lh . Denote
1
v2 = ψ(k0 , v2 ). Since b) is not true, k0 must also be the smallest k such
that
0 1 Ak v2 ≥ 1.
So, we have ψ2 (k, v2 ) = ψ(k, v2 ) for all k ≤ k0 . Here we have two cases.
Case 1. αc ≤ 0
Case 2. αc > 0
In this case ps is to the right of v1 . By the assumption that ψ(k, v2 ) does not
include v1 , the intersections of ψ2 (k, v2 ) with ±Lh will stay to the left of v1
(or to the right of −v1 ). Since αc > 0, by Lemma 14.3.4, the intersections
will move rightward until falling on [qj1 , ps ), where [qj1 , ps ) is the interval in
Lemma 14.3.5 c). Similar to Case 1, we have that ψ2 (k, v2 ) converges to
the origin and ψ(k, v2 ) = ψ2 (k, v2 ).
So far, we have excluded the possibility that a limit trajectory includes
any point in the set ±(Lh ∪{v1 , v2 }). The possibility that a limit trajectory
includes any point in the set ±(Lv ∪ {v1 , v2 }) can be excluded in a similar
way. Thus, there exists no limit trajectory of any kind and the system
(14.3.6) must be globally asymptotically stable. 2
362 Chapter 14. Non-Actuator Saturation
14.4.1. Introduction
While actuator saturation has been addressed in much detail, only a few
results are available that deal with sensor saturation. In particular, issues
related to the observability of a linear system subject to sensor saturation
were discussed in detail in [57]. A discontinuous dead beat controller was
recently constructed for single input single output linear systems in the
presence of sensor saturation [59] that drives every initial state to the origin
in a finite time.
In this section, we consider the problem of semi-globally stabilizing lin-
ear systems using linear feedback of the saturated output measurement.
Here, by semi-global stabilization we mean the construction of a stabiliz-
ing feedback law that yields a domain of attraction that contains any a
priori given (arbitrarily large) bounded set. This problem was motivated
by its counterpart for linear systems subject to actuator saturation [65,67].
More specifically, it was established in [65,67] (see also Remark 4.4.1 of
Chapter 4) that a linear system subject to actuator saturation can be semi-
globally stabilized using linear feedback if the system is stabilizable and
detectable in the usual linear sense and all its open loop poles are in the
closed left-half plane, no matter where the invariant zeros are. What we
will show in this section is that a single input single output linear system
subject to sensor saturation can be semi-globally stabilized by linear sat-
urated output feedback if the system is stabilizable and detectable in the
usual linear sense and all its invariant zeros are in the closed left-half plane,
no matter where the open loop poles are. This result thus complements the
results of [59] in the sense that, it requires an extra condition that the
invariant zeros of the system be on the closed left-half plane to conclude
semi-global stabilizability by linear feedback. It can also be viewed as dual
to its actuator saturation counterpart in [65,67]. We, however, note that in
the dual situation [67], the condition of all poles being in the closed left-half
plane is necessary even with nonlinear feedback [90], while in the current
situation, the condition of all invariant zero being in the closed left-half
plane is not necessary with nonlinear feedback (by the result of [59]). It is
not clear at this time if it would become necessary for linear feedback.
14.4. Sensor Saturation 363
Consider the following single input single output linear system subject to
sensor saturation,
ẋ = Ax + Bu, x ∈ Rn , u ∈ R,
(14.4.1)
y = sat(Cx), y ∈ R,
where sat : R → R is the standard saturation function. Our main results
on semi-global stabilizability of the system (14.4.1) is given in the following
theorem.
Theorem 14.4.1. The system (14.4.1) is semi-globally asymptotically sta-
bilizable by linear feedback of the saturated output if
• The pair (A, B) is stabilizable;
• All invariant zeros of the triple (A, B, C) are in the closed left-half
plane.
More specifically, for any a priori given bounded set X 0 ⊂ R2n , there exists
a linear dynamic output feedback law of the form
ż = F z + Gy, z ∈ Rn ,
(14.4.2)
u = Hz + H0 y,
364 Chapter 14. Non-Actuator Saturation
such that the equilibrium (x, z) = (0, 0) of the closed-loop system is asymp-
totically stable with X 0 contained in its domain of attraction.
Proof. We will establish this result in two steps. In the first step, we will
construct a family of feedback laws of the form (14.4.2), parameterized in
ε ∈ (0, 1]. In the second step, we will show that, for any a priori given
bounded set X 0 ⊂ R2n , there exists an ε∗ ∈ (0, 1] such that, for each
ε ∈ (0, ε∗ ], the equilibrium (x, z) = (0, 0) of the closed-loop system is
asymptotically stable with X 0 contained in its domain of attraction.
The construction of the feedback laws follows the following algorithm.
Lemma 14.4.1. For the given triple (A0 , B0 , F0 (ε)), there exists a
nonsingular matrix T0 (ε) ∈ Rn0 ×n0 such that
ż = Az + Bu + L(Cz − y),
r
α1 α2 αr
u = −C0 z0 − ai z1i − r (y − F0 (ε)z0 ) − r−1 z12 − · · · − z1r ,
i=1
ε ε ε
i.e.,
r!
αi = Cri−1 = , i = 1, 2, · · · , r.
(i − 1)!(r − i + 1)!
366 Chapter 14. Non-Actuator Saturation
We now proceed with the second step of the proof: to show that, for any
a priori given bounded set X 0 ⊂ R2n , there exists an ε∗ ∈ (0, 1] such that,
for each ε ∈ (0, ε∗ ], the equilibrium (x, z) = (0, 0) of the closed-loop system
is asymptotically stable with X 0 contained in its domain of attraction.
Without loss of generality, let us assume that the system is already in the
form of (14.4.3), i.e., T = I. Letting e = x − z, we can write the closed-loop
system as follows,
ẋ0 = A0 x0 + B0 x11 ,
ẋ11 = x12 ,
ẋ12 = x13 ,
..
.
ẋ1r = C0 (x0 − z0 ) + a1 (x11 − z11 ) + a2 (x12 − z12 ) + · · ·
α1 α2 αr
+ar (x1r − z1r ) − r (y − F0 (ε)z0 ) − r−1 z12 − · · · − z1r ,
ε ε ε
ż = Az + L(Cz − y) + B −C0 z0 − a1 z11 − · · · − ar z1r
α1 α2 αr
− r (y − F0 (ε)z0 ) − r−1 z12 − · · · − z1r ,
ε ε ε
y = sat(x11 ).
x̃0 = T0 (ε)x0 ,
x̃11 = x11 − F0 (ε)x0 ,
1 2
x̃1i = εi−1 x1i + Ci−1 εi−2 x1i−1 + Ci−1 εi−3 x1i−2 + · · · + Ci−1
i−2
εx12
i−1
+Ci−1 (x11 − F0 (ε)x0 ), i = 2, 3, · · · , r,
e0 = x0 − z0 ,
e1i = x1i − z1i , i = 1, 2, · · · , r,
(14.4.10)
and denote
T
e = eT0 e11 e12 · · · e1r .
With these new state variables, the closed-loop system can be written
as follows,
1 1
x̃˙ 11 = − x̃11 + x̃12
ε ε
− F0 (ε)A0 T0−1 (ε) + F0 (ε)B0 F0 (ε)T0−1 (ε) x̃0 − F0 (ε)B0 x̃11 ,
1 1
x̃˙ 12 = − x̃12 + x̃13
ε ε
− F0 (ε)A0 T0−1 (ε) + F0 (ε)B0 F0 (ε)T0−1 (ε) x̃0 − F0 (ε)B0 x̃11 ,
..
.
1 1
x̃˙ 1r−1 = − x̃1r−1 + x̃1r
ε ε
−[F0 (ε)A0 T0−1 (ε) + F0 (ε)B0 F0 (ε)T0−1 (ε)]x̃0 − F0 (ε)B0 x̃11 ,
1 1 1
x̃˙ 1r = − x̃1r + [x11 − sat(x11 )] − F0 (ε)e0
ε ε ε
+εr−1 [C0 e0 + a1 e11 + a2 e12 + · · · + ar e1r ]
+α2 e12 + α3 εe13 + · · · + αr εr−2 e1r
−[F0 (ε)A0 T0−1 (ε) + F0 (ε)B0 F0 (ε)T0−1 (ε)]x̃0 − F0 (ε)B0 x̃11 ,
ė = (A + LC)e − L[x11 − sat(x11 )].
(14.4.11)
r
√ T
V (x̃0 , x̃11 , · · · , x̃1r , e) = ν x̃T0 P0 x̃0 + x̃21i + εe P e, (14.4.12)
i=1
Such a c exists due to the boundedness of X 0 and the definition of the state
variables as given by (14.4.10). With this choice of c, it is obvious that
(x, z) ∈ X 0 implies that
(x̃0 , x̃11 , · · · , x̃1r ) ∈ LV (c) := (x̃0 , x̃11 , · · · , x̃1r , e) ∈ R2n : V ≤ c .
368 Chapter 14. Non-Actuator Saturation
ν
ν− δi0 ε2 − δ01 ν 2 ≥ ,
i=1
4
2
δ1 δ1
− δ01 − δ10 − 2δ11 ε − δi1 ε ≥
ε i=2
2ε
δ1 δ1
− δi0 − δi1 ε ≥ , i = 2, 3, · · · , r − 1,
ε 2ε
δ1 2η 2 δ1
− δr1 ε − √ 1 ≥ .
ε ε 2ε
With these choices of ν and ε∗1 , we conclude that, for any |x11 | ≤ 1,
√
δ1
r
ν 2 2 ε 2
V̇ ≤ − |x̃0 | − |x̃1i | − |e| , ε ∈ (0, ε∗1 ].
4 2ε i=1 2
Now let ε∗2 ∈ (0, 1] be such that, for all ε ∈ (0, ε∗2 ],
implies that,
1
|F0 (ε)x0 | ≤ ,
2
and
r
2δ01 ν|x̃0 ||x̃11 | + (δi0 + δi1 )εx̃21i + δi0 ε|x̃0 |2 + δi1 εx̃211
i=1
√ √ 1
+η1 x̃21r + η1 + η2 ε |e|2 + η2 ε(|x11 | − 1)2 ≤ .
8ε
The first inequality is due to (14.4.4) and implies that,
1
x̃211 − (|x11 | − 1)2 ≥ .
4
With this choice of ε∗2 , we have that, for any |x11 | > 1,
√ T 1
V̇ ≤ −ν x̃T0 x̃0 − εe e − , ε ∈ (0, ε∗2 ].
8ε
which, in turn, shows that the equilibrium (x, z) = (0, 0) of the closed-
loop system is asymptotically stable with X 0 contained in its domain of
attraction. 2
14.4.3. An Example
1 0 0 0 1 0
14.5. Conclusions 371
It can be easily verified that this system is controllable and observable with
an invariant zero at s = 0. The open loop poles are located at {−1, ±j, 1}.
Following the design algorithm proposed above, we construct a family of
parameterized output feedback laws as follows,
In Figs. 14.4.1 and 14.4.2, ε is chosen to be ε = 0.1. It is clear that with this
choice of ε, the initial conditions are not inside the domain of attraction.
In Figs. 14.4.3 and 14.4.4, ε is chosen to be ε = 0.001. We see that, the
output is out of saturation after some time and the closed-loop system
become linear and all its states converge to zero. This demonstrates that
as ε decreases, the domain of attraction is enlarged.
14.5. Conclusions
In this chapter, we have presented a few results on linear systems subject
to state or sensor saturation. These results only serve as an indication
that saturation occurs in system components other than actuators. In fact,
saturation nonlinearity is ubiquitous in engineering systems and remains as
an exciting topic for research.
372 Chapter 14. Non-Actuator Saturation
200
100
−100
states x and z
−200
−300
−400
−500
−600
−700
0 0.5 1 1.5 2 2.5 3 3.5 4
time t
0.8
0.6
0.4
0.2
output y
−0.2
−0.4
−0.6
−0.8
−1
300
250
200
100
50
−50
−100
0 0.5 1 1.5 2 2.5 3 3.5 4
time t
0.5
0
output y
−0.5
−1
[2] J. Alvarez, R. Suarez and J. Alvarez, “Planar linear systems with sin-
gle saturated feedback,” Systems & Control Letters, Vol. 20, pp. 319-
326, 1993.
[6] F. Blanchini, “Feedback control for LTI systems with state and con-
trol bounds in the presence of disturbances,” IEEE Transactions on
Automatic Control, Vol. 35, pp. 1231-1243, 1990.
375
376 Bibliography
[25] E. G. Gilbert and K. T. Tan, “Linear systems with state and con-
trol constraints: the theory and application of maximal output ad-
missible sets,” IEEE Transactions on Automatic Control, Vol. 36,
pp. 1008-1020, 1991.
[26] P.-O. Gutman and M. Cwikel, “Admissible sets and feedback con-
trol for discrete-time linear dynamical systems with bounded control
and dynamics,” IEEE Transactions on Automatic Control, Vol. 31,
pp. 373-376, 1986.
[32] H. Hindi and S. Boyd, “Analysis of linear systems with saturating us-
ing convex optimization,” Proceedings of the 37th IEEE Conference
on Decision & Control, pp. 903-908, 1998.
[37] T. Hu and Z. Lin, “Output regulation for general linear systems with
saturating actuators,” submitted to Automatica, 1999.
[38] T. Hu and Z. Lin, “On enlarging the basin of attraction for linear sys-
tems under saturated linear feedback,” Systems & Control Letters,
Vol. 40, pp. 59-69, 2000.
[40] T. Hu, Z. Lin and B. M. Chen, “An analysis and design method
for linear systems subject to actuator saturation and disturbance,”
submitted to Automatica, 1999. Also in Proceedings of American
Control Conference, pp. 725-729, 2000.
Bibliography 379
[41] T. Hu, Z. Lin and L. Qiu, “An explicit description of the null control-
lable regions of linear systems with saturating actuators,” submitted
for publication, 2000.
[65] Z. Lin, Low Gain Feedback, Lecture Notes in Control and Information
Sciences, Vol. 240, Springer-Verlag, London, 1998.
[74] D. Liu and A.N. Michel, Dynamical Systems with Saturation Nonlin-
earities, Lecture Notes in Control and Information Sciences, Vol. 195,
Springer-Verlag, London, 1994.
[102] A. R. Teel, “A nonlinear small gain theorem for the analysis of con-
trol systems,” IEEE Transactions on Automatic Control, Vol. 42,
pp. 1256-1270, 1996
Bibliography 385
[108] J. C. Willems, “Least squares stationary optimal control and the al-
gebraic Riccati equations,” IEEE Transactions on Automatic Con-
trol, Vol. 16, pp. 621-634, 1971.
387
388 Index