Professional Documents
Culture Documents
9, SEPTEMBER 1997
Systems with Finite Communication Bandwidth The concepts of a coder-estimator sequence and a finitely recursive
Constraints—Part I: State Estimation Problems coder-estimator sequence are introduced in this paper. The latter
concept is clearly motivated by the idea of finite-dimensional filters.
Wing Shing Wong and Roger W. Brockett Convergence issues related to these sequences are investigated in
this paper. In particular, various necessary and sufficient conditions
connecting the communication data rate with the rate of change of
Abstract— In this paper, we investigate a state estimation problem the underlying process are established for the existence of stable and
involving finite communication capacity constraints. Unlike classical esti- asymptotically convergent coder-estimator schemes.
mation problems where the observation is a continuous process corrupted
by additive noises, there is a constraint that the observations must Our work here is connected with several well-developed theories.
be coded and transmitted over a digital communication channel with In the special case where the stochastic process is reduced to a
finite capacity. This problem is formulated mathematically, and some static random variable, the problem is closely related to the vector
convergence properties are defined. Moreover, the concept of a finitely quantization problem [9], and to a lesser degree, it is also related
recursive coder-estimator sequence is introduced. A new upper bound for
the average estimation error is derived for a large class of random
to the rate distortion theory [3] and statistical inference theory
variables. Convergence properties of some coder-estimator algorithms via compressed data [1], [18], [2]. There are also several previous
are analyzed. Various conditions connecting the communication data rate research works which investigate various information-related aspects
with the rate of change of the underlying dynamics are established for of decision and control systems. For example, the issue of finite
the existence of stable and asymptotically convergent coder-estimator precision of quantized systems is studied in [7] and [16], and the
schemes.
sampled-data control systems are well investigated (see the references
Index Terms—Finitely recursive coder-estimator sequence, hybrid sys- in [10]). Recent papers [15], [4], and [11] are also related to our
tems, prefix code, state estimation. work here.
III. CODER-ESTIMATOR SEQUENCE In some special cases, a stable coder-estimator sequence may even
Let the stochastic process fX (t)g1
t=0 denoting the state of a system
converge in the following sense.
be defined on the probability space (
; F ; P ). The state is observed Definition 4: A coder-estimator sequence converges in quadratic
via an observation process, Y (t), where mean for X if
^ 2
Y t( ) = X (t) + V (t) (2) i!1 EkXi 0 X (ti )k = 0:
lim (6)
()
where V t is a zero-mean observation noise process independent of
()
X t , also defined on the probability space (
F P )
; ; . Let t denoteF IV. FINITELY RECURSIVE CODER-ESTIMATOR SEQUENCES
f ()0 g
the -field generated by Y s ; s t . In the definition of a coder-estimator sequence, the coding func-
If the observed information is transmitted to an estimator with no tions and the estimates are based on the whole past history of the
delay and no loss in precision, that is, via a channel with infinite process X . The computation and storage demand for such a general
bandwidth, one can formulate a classical estimation problem to find coder-estimator sequence can be excessively high. From an efficiency
the optimal estimate of X given Y . However, if the communication viewpoint, it makes good sense to consider the following special class
channel between the observer and the estimator has a finite com- of coder-estimator sequences.
Definition 5: Let D be a direct sum of finitely many copies of the
real numbers, < and B3 . A coder-estimator sequence is said to be
munication bandwidth, then the resulting problem does not fit any
of the classical models. Notice that even if there is no propagation
finitely recursive if there exists a vector of finitely many auxiliary
variables, A, taking values in D, and functions F from D 8 B 3 to
delay, the finite communication bandwidth assumption automatically
efficient. where
The performance of a coder-estimator sequence is captured by a
related sequence of coder-estimator errors = ean (10)
A natural question is whether a coder-estimator sequence is stable VI. IMPORTANT CLASSES OF CODER-ESTIMATOR SEQUENCES
in the following sense.
Definition 3: A coder-estimator sequence is stable for X if there For simplicity, we assume from now on that the state to be
exists a finite constant C , so that for all i estimated is a scalar process. There are two natural examples of
these fixed codeword length sequences: mean sequences and equal-
EkX^i 0 X (ti )k2 C: (5) partition sequences.
1296 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 42, NO. 9, SEPTEMBER 1997
Mean Coder-Estimator Sequence and its Generalization: A mean VII. BASIC PROPERTIES AND VECTOR QUANTIZATION
coder-estimator sequence uses a 1-bit codeword per sample data. Before one can analyze the performance of a general coder-
Define a 1-bit coder-estimator sequence in the following recursive estimator sequence, it is crucial to analyze the performance of a
manner: at sampling time t, define a 1-bit coding scheme on the ob- single coder-estimator step. Let X be a real-valued random variable
served signal, Y (t), by means of the characteristic sets [01; EY (t)) with known measurable p.d.f. q , mean m, and variance 2 . The
and [EY (t); 1]. After receiving the transmitted codeword, a min- observation Y is defined by
imum variance estimator is used to estimate X (t). Moreover, by
accounting for the effects of the system dynamics, one can obtain a Y =X +V (22)
new distribution of Y at the next observation instant conditioned on where V is a zero mean observation noise with a known, measurable
the received codeword. The same procedure can then be repeated. To p.d.f. r. The observed information is coded by a coding scheme h;
be more rigorous, construct a coder-estimator sequence by the received codeword is denoted by c.
A natural question concerns finding the optimal coding scheme and
E
A0 = Y (0) (11) estimator g to minimize the error
h0 (Y (0)) =
1 01 < Y (0) < A0 EjX 0 g(c)j2:
A0 Y (0) < 1
(12) (23)
0
^ 0 (c0 ) =E[X (0)jc0 ]:
X (13) For the special case where there is no observation noise, that
is V = 0, this corresponds to the well-known vector quantization
For i 1, define
problem. The error function is typically referred to as the distortion
measure. Define the coding scheme by partitioning the state-space
j 1 1 1 ; c 01 ]
E
A
into 2n cells, f i g, so that if the observed value falls in i it is A
Ai = [Y (i ) c0 ; i (14) represented by the value X ^ i . The A
i ’s are called the characteristic
1 01 < Y (i ) < A i sets of the coding function. The following properties are fundamental
0 A Y (i ) < 1
h (Y (i )) =
i (15)
i results from vector quantization theory [9].
X (c0 ; 1 1 1 ; c ) =E[X (i )jc0 ; 1 1 1 ; c ]:
^ i i i (16) • If the estimates, X ^ i , are given, then the distortion is minimized
by the Voronoi partition. The partition cells are polytopal.
The mean coder-estimator sequence is characterized by the fact • Given a state-space partition, the optimal estimate for each cell
that the coding functions are defined by using the conditional mean is chosen by minimizing the conditional expected distortion.
of the observed process, Y , as a partition point. Define an n-bit There is no known general algorithm for deriving the optimal
generalized mean coder-estimator sequence to be any sequence with coder-estimator pair, although there are locally optimal algorithms
coding functions that partition the real line into 2n intervals and use such as the Lloyd algorithm or simulated annealing based algorithms
the conditional mean of the observed process as one of the 2n 0 1 [9].
partition points. Even though the setting of our investigation is more general than
Equal-Partition Coder-Estimator Sequences: This is a finitely re- vector quantization, in the rest of this paper, it is assumed that:
cursive coder-estimator sequence. Define a support interval to be the • all coding functions are defined by polytopal partition cells.
smallest closed interval which contains the support of a function. Lemma 1: Suppose the coding function is defined by the charac-
Assume that the p.d.f. for X0 ; Ui ; and Vi for all i all have finite
support intervals. Particularly, let the support interval for the p.d.f.
A A
teristic sets ( 1 ; 1 1 1 ; M ), and conditional mean estimator is used.
E = E
pi [(X 0 E[X jY 2 A ])2 jY 2 A ]
0x0
i i
A0 =1
i
= : (17)
B0 x0 M
= EX 2 0 E j 2 A ])2
pi ( [X Y i
hi (Y (in )) = j if Li + Mj 01 Y (in) < L i + Mj (19) For our study of coder-estimator sequences, it is important to
^ i (ci = j ) = Li + M obtain lower bounds for the error reduction capability of a coder-
j0
X (20)
estimator step. A result is proven here for the mean coder-estimator
Ai+1 (ci = j ) max(Li + Mj 01 0
v; Li + v ) step based on an interesting inverse Cauchy–Schwartz inequality and
Bi+1 (ci = j )
=
min(Li + Mj + v; Ui v) 0 : (21)
the following lemma.
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 42, NO. 9, SEPTEMBER 1997 1297
The last step follows from an inequality due to Čebyšev [13, p. 39]. Since
Hence 0 )y)g(x + (1 0 )y)
f (x + (1
c 2
0 2a c q(x)dx c (x0a)2 q(x)dx: (f (x) + (1 0 )f (y))(g(x) + (1 0 )g(y))
0
(x a)q (x)dx b+c
0 a) b
b 2(c b f (x)g(x) + (1 0 )f (y)g(y)
p
hence f g is concave. In particular, (x 0 m) q is concave on Ai .
Theorem 1: Assume that q is piecewise concave, monotone non-
increasing or nondecreasing on N intervals, and n = dlog2 (N + 1)e.
The Blaschke and Pick theorem [8] states that for nonnegative
Then there exists an n-bit generalized mean coder-estimator step that
concave functions f and g defined on an interval with Lebesque I
measure
satisfies
2
E < 34 2 : 4 f gd :
f d
2
g d
2
(27)
I I I
Proof: From the theorem assumption, it follows that there exists
p p
Let f = q and g = (x 0 m) q , then it follows that
a generalized n-bit coding scheme so that each of the characteristic 2
set is an interval and q is concave and monotonic on each interval. qdx (x 0 m)2 qdx 4 (x 0 m)qdx :
Moreover, the coding function also contains characteristic sets of the A A A
form (a; m] and (m; b] for some a and b. Label the characteristic
A
sets by i . Define Hence, the claimed inequality holds.
where
Zi+1 = Vi+1 + Ui
pi 111
; ;i ;i = pi 111
; ;i :
for all i; x; and y .
i 2Z =f1 111 g ; ;M
Assume that the p.d.f. of X0 ; Ui ;, and Vi have finite support inter-
From (9), it follows that the conditional variance of X (kn ) vals, [0x0 ; x0 ]; [0u; u]; and [0v; v ]; respectively. The performance
conditioned on the codewords i0 ; 1 1 1 ; ik is equal to 2 32i ;111;i + of an n-bit equal-partition coder-estimator sequence is analyzed here
o2 . By construction, the conditional distribution of X (kn ) satisfies for such a system.
the conditions of Theorem 1. Hence Theorem 3: The n-bit equal-partition coder-estimator sequence is
p(ik i0 ; j 1 1 1 ; i 01 )32 111 34
k i ; ;i
2 2
3i ;111 ;i + o :
2 stable if
i 2Z
a log2 e < 1: (32)
Hence
EjX^ 0 X (kn)j2
k
Proof: Let Si ;111;i represent the length of the support interval
= 111 pi 111
; ;i ;i
2
3i 111
; ;i ;i
of the p.d.f. of Xj conditioned on receiving the codewords i0 ; 1 1 1 ; ij .
i 2Z i 2Z It follows from the Lipschitz assumption on f that the p.d.f. of
Xj +1 conditioned on the codewords c0 ; 1 1 1 ; cj has a support interval
34 2 111 111 3i
2 2 with length not greater than Cf S + 2u, and the support interval
pi ; ;i 111
; ;i + o
i 2Z i 2Z of Yj +1 conditioned on the same set of codewords has length not
k +1 greater than Cf S + 2u + 2v . Hence, each of the characteristics set
<
3
2 2
3 + 3o :
2 is an interval with length not greater than (Cf S + 2u + 2v )=2n . If
4 the i + 1th codeword is received, Xi+1 must lie within +v or 0v of
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 42, NO. 9, SEPTEMBER 1997 1299
the associated characteristic set. Hence [16] D. Williamson, “Finite wordlength design of digital Kalman filters for
n
state estimation,” IEEE Trans. Automat. Contr., vol. 30, pp. 930–939,
Si ;111;i ( Si ;111;i + 2u + 2v )=2 + 2v: 1990.
[17] E. Wong and B. Hajek, Stochastic Processes in Engineering Systems.
Let New York: Springer-Verlag, 1985.
[18] Z. Zhang and T. Berger, “Estimation via compressed information,” IEEE
Tj = max Si ;111;i : Trans. Inform. Theory, vol. 34, pp. 198–211, 1988.
i ;111;i
Then
Tj +1
n
( Tj + 2u + 2v )=2 + 2v:
REFERENCES
I. INTRODUCTION
[1] R. Ahlswede and I. Csiszár, “Hypothesis testing with communication The controller design for a third-order linear system has been
constraints,” IEEE Trans. Inform. Theory, vol. 32, pp. 533–542, 1986. drawing the attention of many researchers for several decades [1]–[6]
[2] S. I. Amari and T. S. Han, “Statistical inference under multiterminal rate
restrictions: A differential geometric approach,” IEEE Trans. Inform. because a conventional dynamic plant controlled by a proportional-
Theory, vol. 35, pp. 217–227, 1989. integral-derivative (PID) controller turns out to be a third-order
[3] T. Berger, Rate Distortion Theory. Englewood Cliffs, NJ: Prentice- system. It was pointed out [7] that not only poles but also zeros
Hall, 1971. significantly characterize the step response of a transfer function.
[4] V. S. Borkar and S. K. Mitter, “LQG control with communication
constraints,” to be published.
Recently, the focus is on the pole-zero relations for the step response
[5] R. W. Brockett, “Hybrid models for motion control systems,” in without overshoot and undershoot. Note that a step response has no
Perspectives in Control, H. Trentelman and J. C. Willems, Eds. Boston: undershoot in the whole history if and only if it is a monotone
Birkhäuser, 1993, pp. 29–54. nondecreasing step response. The number of undershoot times (or
[6] T. M. Cover and J. A. Thomas, Elements of Information Theory. New local extrema) in the step response has been widely discussed for a
York: Wiley-Interscience, 1991.
[7] D. F. Delchamps, “Stabilizing a linear system with quantized state strictly proper transfer function with only real poles and real zeros
feedback,” IEEE Trans. Automat. Contr., vol. 35, pp. 916–924, 1990. [8]–[11]. A special case of this theme is the initial undershoot [12],
[8] J. B. Diaz and F. T. Metcalf, “Complementary inequalities I: Inequalities which is actually an old result [11], [13]–[15]. Incidentally, another
complementary to Cauchy’s inequality for sums of real numbers,” J. kind of old result for the monotone nondecreasing step response was
Math. Anal. Appl., vol. 9, pp. 59–74, 1964.
[9] A. Gersho and R. M. Gray, Vector Quantization and Signal Compression.
also repeatedly reported in the recent works, which will be explained
Boston: Kluwer, 1992. in the following paragraph. For a single-input/single-output (SISO)
[10] P. T. Kabamba and S. Hara, “Worst-case analysis and design of discrete-time system, the linear programming approach formulated by
sampled-data control systems,” IEEE Trans. Automat. Contr., vol. 38, the l1 theory [16] or simple coefficient relations [17] can be used to
pp. 1337–1357, 1993. design a minimum overshoot controller.
[11] S. R. Kulkarni and P. J. Ramadge, “Bit rate bounds for state tracking
with finite rate communication,” in Proc. IEEE Conf. Decision Contr., On the other hand, many works [18]–[24] were devoted to finding
1996. explicit conditions for a nonovershooting and a monotone nondecreas-
[12] M. Loeve, Probability Theory, vol. 1. New York: Springer-Verlag, ing step response. The condition proposed in [18] is in terms of the
1977.
[13] D. S. Mitrinović, Analytic Inequalities. New York: Springer-Verlag, Manuscript received February 27, 1996; revised October 10, 1996. This
1970. work was supported in part by the National Science Council, Taiwan, under
[14] P. Z. Peebles, Jr., Digital Communication Systems. Englewood Cliffs, Grant NSC83-0422-E009-066.
NJ: Prentice-Hall, 1987. The authors are with the Department of Control Engineering, Na-
[15] S. Vembu, S. Verdú and Y. Steinberg, “The source-channel separation tional Chiao Tung University, Hsinchu 30050, Taiwan ROC (e-mail:
theorem revisited,” IEEE Trans. Inform. Theory, vol. 41, pp. 44–54, sklin@cc.nctu.edu.tw).
1995. Publisher Item Identifier S 0018-9286(97)05954-0.