Professional Documents
Culture Documents
The Information Structure of Feedforward Preview Contr - 2014 - IFAC Proceedings
The Information Structure of Feedforward Preview Contr - 2014 - IFAC Proceedings
Abstract: Preview control using a fedforward imperfect forecast measurement of a disturbance signal
is investigated in the context of discrete-time linear quadratic Gaussian (LQG) control. A new approach
for incorporating such forecast measurements is built directly on established preview control models and
results. The calculation of the optimal control gain, for which an efficient computation has already been
derived, is found to be independent of the stochastic forecast measurements, implying that the optimal
state estimator is where performance improvements in this problem set-up occur. Most significantly, the
forecast data model is shown to equip the problem with a nested information structure whereby any
forecast feedforward control problem of a fixed horizon length is always equivalent to a problem with
a longer horizon and infinitely unreliable forecast measurements beyond the smaller horizon length. A
numerical example illustrates the effect of forecast horizon length and data quality on the closed-loop
system performance.
177
19th IFAC World Congress
Cape Town, South Africa. August 24-29, 2014
is new and extends the work of Hazell and Limebeer to provide property that, by construction, the eigenvalues of A∆ are all
a distinct analysis of the information aspects of performance. zero, ensures that [A, B] is stabilizable.
(A2) The LQ state penalty matrix Q ≥ 0 and the control
2.1 Development of the Augmented System penalty matrix R > 0.
(A3) The pair [A, Q1/2 C] is detectable, which together with
We develop the augmented model for this system by defining (A1) implies that [A, Q1/2 C] is also detectable. It also im-
the augmented state Xt , exogenous process noise Wt , output plies that [A, C] and therefore [A, C] are detectable.
Yt , and measurement noise Vt as follows: (A4) The covariances satisfy V > 0, Vf > 0, W ≥ 0, Wd ≥ 0.
xt wt (A5) The pair [A, W 1/2 ] is stabilizable, which implies that
" # " #
Xt = x∆,t , Wt = 0 , [A, W ] is stabilizable.
xd,t Bd wd,t (3)
Under these assumptions, the LQG optimal control 1 for system
vt yt (4) with criterion (5) is given by
Vt = v , Yt = y .
f,t f,t
X̂t+1|t = (A − BK − LC)X̂t|t−1 + LYt , (6)
Using these signals, the augmented system has a state variable ut = −KX̂t|t−1 . (7)
realization given by T T T
Xt+1 = AXt + But + Wt , Since Yt = yt yf,t , this implements both the feedback and
(4) feedforward control, where the feedforward control potentially
Yt = CXt + Vt ,
where uses the disturbance predictions from every horizon up to N −1.
The state-feedback matrix, K, and the output-injection matrix,
L, are computed by solving the two algebraic Riccati equations
A 0 0 B
" # " #
P = AT PA − AT PB(B T PB + R)−1 B T PA + Q,
C C∆ 0 (8)
A = 0 A∆ B∆ Cd , B= 0 , C= 0 I 0 .
0 0 Ad 0 Σ = AΣAT − AΣC T (CΣC T + V )−1 CΣAT + W , (9)
for P and Σ and then taking
We will also define the set of measurements from time zero to
t as Y t := {Y0 , . . . , Yt }. We note that although the augmented K = (B T PB + R)−1 B T PA, (10)
system is uncontrollable from ut , it is stabilizable since the L = AΣC T (CΣC T + V )−1 .
disturbance model Gd is assumed stable and is non-interacting
by construction. The above augmented system is now in the ap- Closed-Loop LQG Performance
propriate form to synthesize the LQG controller and estimator
pair directly.
Denote the augmented state-estimate error as
3. THE LQG PROBLEM, SOLUTION, AND Et = Xt − X̂t|t−1 .
PERFORMANCE
Substituting the control input signal ut from (7) into the aug-
mented system dynamics in (4) and into the augmented state
The augmented system (4) describes the feedforward prob- estimate dynamics from (6), the closed-loop system can be
lem with N -step-ahead forecast disturbance measurements cor- written as
rupted by noise capturing their reliability via (1-2). The com-
plete state of the delay line, x∆,t , is available to the controller Xt+1 (A − BK) BK Xt Wt
E = 0 (A − LC) E + W − LV .
but corrupted by zero-mean measurement noise vf,t . By ma- t+1 t t t
nipulating the covariance structure of this noise, we are able to The closed-loop covariance matrix can be found by solving the
accommodate the forecast data properties. We do this by pos- following Lyapunov equation:
ing an LQG optimal feedback control design problem, whose
solution comprises linear state estimate feedback. E{Xt XtT } E{Xt EtT }
E{Et XtT } E{Et EtT }
The criterion to be optimized in this disturbance rejection
problem is (A − BK) BK E{Xt XtT } E{Xt EtT }
(T −1 ) = 0 (A − LC) E{Et XtT } E{Et EtT }
1 X
J = lim E yiT Qyi + uTi Rui , (5)
(A − BK) BK
T
W
T →∞ T
i=0 × + cov W − tLV .
0 (A − LC) t t
where Q and R are user-specified performance penalty matrices
which determine the disturbance rejection problem. Using the properties of the trace of a matrix and noting that the
covariance of the estimation error is simply the Kalman filter
To complete the infinite-horizon stochastic LQG design, we covariance, E{Et EtT } = Σ, it can be verified that the closed-
also specify loop LQG performance criterion can be computed as
" T#
C J = trace (Q + KT RK)E{Xt XtT }
T Q [C C
Q = C∆ ∆ 0] ,
+KT RK(Σ − E{Xt EtT } − E{Et XtT }) + trace (QV ) .
0
"
W 0 0
# (11)
V 0
W = cov(Wt ) = 0 0 0 , V = cov(Vt ) = 0 V . For the remainder of this paper, we shall refer to the above
f
0 0 Wd
problem as the forecast feedforward control problem. In the
We make the following assumptions to ensure the existence of subsequent sections, we refine the structures of K and L,
a stabilizing optimal control.
1 We have chosen, for notational clarity only, to use the LQG optimal control
(A1) The matrix Ad has all its eigenvalues strictly inside the with the prediction estimator even though the plant itself is strictly proper and
unit circle and [A, B] is stabilizable. This, together with the the filtered estimator is truly optimal. See Ishihara and Takeda [1986]
178
19th IFAC World Congress
Cape Town, South Africa. August 24-29, 2014
extending the observations of Hazell and Limebeer [2010] • The first block element of K∆ is zero because, in our
concerned with “efficient computation,” which in turn extended predictor feedback case, it represents a past disturbance,
those of Bitmead et al. [1990] for the control only. Then the which is subsumed by current and future information.
detailed structure of Vf , the feedforward measurement noise • As noted by Hazell and Limebeer, as the feedforward
covariance, will be introduced, since this is the embodiment of horizon N → ∞, the tail elements ofK∆ and KdN tend
the problem’s Information Structure. to zero exponentially fast. Likewise, the block elements
of K apart from the tail elements KdN are fixed. Thus, as
4. STRUCTURE OF THE STATE-ESTIMATE FEEDBACK N → ∞ the feedback control gain tends to a semi-infinite
GAIN AND FILTER GAIN block vector, which is simply precomputed.
Before we can analyze the information structure of the forecast In Hazell and Limebeer [2010], the feedforward signal is pro-
feedforward control problem, we must first apply an important vided exactly, i.e. is noise free, and hence Vf = 0 above. In
result from Hazell and Limebeer [2010] which gives explicit this case the combined state estimate covariance Σ in (9) is
formulæ for the augmented control gain K and its constituent zero except for the upper left corner corresponding to the plant
components. Prior to this, we develop an appropriate partition. state estimate. Accordingly, Hazell et al. are able to derive a
After establishing the efficient computation of K in the context structure of L which mirrors with that of K, the feedback gain
of this paper, we summarize why the analogous results, from matrix. This yields their “computationally efficient” approach
Hazell and Limebeer [2010], for the efficient computation of L to calculating these feedforward estimator gain values.
and Σ do not apply to the case of an uncertain forecast as the When the feedforward noise, Vf , is not zero, this efficiency
feedforward measurement. disappears and the elements of the state estimate, the plant
Consider the conformable partitioning of the augmented state- state estimate and the disturbance/delay state estimate, are
estimate feedback gain K for multiplication by the augmented coupled through the output measurement yt . That being said,
the case of Vf > 0 is precisely that dealing with forecast
state estimate X̂t|t−1 (which inherits its partition from (3)) as data and at this point our treatment deviates significantly from
K = [K K(0) K(1) . . . K(N −1) KdN ] , (12) that of Hazell and Limebeer [2010]. Indeed, the effect of
where, to form controller ut , −K multiplies the estimate of feedforward reliability is our prime focus, since it encapsulates
plant state xt , −Kn , with 0 ≤ n ≤ N − 1, multiplies the the information structure of the problem. We next develop
estimate of the delay line state x∆,t , and −KdN multiplies the analytical tools for this.
estimated disturbance model state xd,t .
We partition the associated algebraic Riccati equation solution 5. NESTING OF FEEDFORWARD CONTROLLERS
P from (8) in similar fashion as
P P ∆ Pd
Consider two forecast feedforward problems with respective
T forecast horizon lengths N and M , where we take M > N . Our
P = P∆ P∆∆ P∆d . (13) aim in this section is to demonstrate that the N -horizon feed-
T T
Pd P∆d Pdd forward controller is identical to the M -horizon feedforward
controller with the information structure with
4.1 Efficient Computation for P and K of Hazell and Limebeer Vf,N = Vf,N +1 = · · · = Vf,M −1 = ξI,
The following theorem is the direct application of the results for ξ > 0, so that as ξ → ∞, these noise variances become
of Hazell and Limebeer [2010] in the context of the forecast infinite.
feedforward control problem, and is therefore stated without First, we need a lemma concerned with jointly Gaussian pro-
proof. cesses with possibly infinite covariances.
Theorem 1. (Efficient Formulae for K and P). Define the ma- Lemma 1. Consider the jointly Gaussian distributed random
trix Ac as vectors
Ac = A − BK. X
" # "
mX
# "
ΣX ΣXY T ΣXZ T
#!
The components of the augmented, unique, stabilizing control Y ∼N m Y , Σ Y X T ΣY ΣY Z T ,
gain K in (10), with P partitioned as in (13), are given by Z mZ ΣZX T ΣZY T ΣZ
K = (B T P B + R)−1 B T P A, (14) and let ΣZ = ξI, with ξ > 0. Then
T −1 T
K∆ = (B P B + R) B lim E{X|Y = y, Z = z} = E{X|Y = y},
h i ξ→∞
× 0 (C T Q) (ATc C T Q) . . . (ATc N −2 C T Q) , (15) lim cov(X|Y = y, Z = z) = cov(X|Y = y).
−1 T
N −1 T ξ→∞
KdN = B T P B + R B ATc
C QCd
Proof: For jointly Gaussian random variables
∞ n o
N X j
+ATc ATc C T QCd Ajd Ad .
(16)
A mA ΣA ΣAB T
j=0 B ∼N m , Σ T Σ
B BA B
,
where P , the solution to the discrete-time algebraic Riccati E{A|B = b} = mA + ΣAB T Σ−1
B (b − mB ) ,
equation (DARE) associated with pure feedback to the plant −1
state, is found by solving cov(A|B = b) = ΣA − ΣAB T ΣB ΣBAT .
P = AT P A − AT P B(B T P B + R)−1 B T P A + C T QC.
T T T
We apply this result with A = X and B = Y Z .
We offer the following comments. Using ∆ = A − BD−1 C,
• The plant state-estimate feedback control gain, K, and
A B
−1
∆−1 −∆−1 BD−1
its associated algebraic Riccati equation solution, P , are = .
unaffected by the presence of feedforward.
C D −D−1 C∆−1 D−1 (D + C∆−1 B)D−1
179
19th IFAC World Congress
Cape Town, South Africa. August 24-29, 2014
Whence, as ξ → ∞, Σ−1
Z → 0 yielding
M
Now note that the latter M − N elements in X̂t|t−1 are pre-
−1 −1 dictions of the disturbance process without the aid of further
ΣY ΣY Z T ΣY 0
ΣZY T ΣZ = . feedforward measurements. Thus, as ξ → ∞,
0 0
Applying these results to the conditional expectation E{X|Y = t−1
E{dt+N +j |YN } = Cd Ajd E{xN t−1
d,t |YN },
y, Z = z} as ξ → ∞ produces the result. 2
We now proceed to the main result of this paper, which demon- for j ≥ 0. Further,
strates the information structure of the forecast feedforward t−1 M −N t−1
control problem. E{xM d,t |YN } = Ad E{xNd,t |YN }.
Theorem 2. (Data Nesting Property). Consider two forecast feed-
forward control problems with the same plant model, distur- With these observations, plus the formulæ (15-16), it is appar-
bance model and driving noise covariance, LQ criterion, and ent that
plant process and measurement noise covariances,
uN,t = −KN X̂t|t−1
N M
= −KM lim X̂t|t−1 = lim uM,t . 2
[A, B, C, Ad , Bd , Cd , Wd , Q, R, W, V ], ξ→∞ ξ→∞
but with differing forecast horizon lengths N and M with
N ≤ M . Let the feedforward information structure of the N -
horizon problem be described by the forecast noise covariance
N
cov(vf,t ) = blockdiag[Vf,0 , . . . , Vf,N −1 ] (17)
and result in control signal {uN,t : t ≥ 0}. Further, let the 6. NUMERICAL EXAMPLE
feedforward information structure of the M -horizon problem
be described by forecast noise covariance
We consider three variants of the feedforward control prob-
M
cov(vf,t ) = blockdiag[Vf,0 , . . . , Vf,N −1 , ξI, . . . , ξI ], (18) lem which illustrate the effect of information structure on the
| {z }
M −N terms
closed-loop performance and permit comparison to Hazell and
Limebeer [2010]. The plant and disturbance models are charac-
with ξ > 0 and corresponding control signal {uM,t : t ≥ 0}. terized by the following.
Then,
uN,t = lim uM,t , for all t ≥ 0. 1 0.0.7869 0.4261
ξ→∞ A = 0 0.6065 , B = 0.7869 , C = [1 0] ,
Ad = 0.999, Bd = 0.5, Cd = 0.4,
Proof: The case where N = M is trivially satisfied: assume W = 0.001 I2 , V = 0.001, Wd = 1,
M > N . From (12) the N -horizon control gain is given by Q = 10, R = 0.001.
KN = [K K(0) . . . K(N −1) KdN ] ,
We note that this plant-disturbance pair exhibits low observabil-
and the M -horizon control gain is given by ity in that an eigenvalue of the plant dynamics A is relatively
KM = [K K(0) . . . K(N −1) K(N ) . . . K(M −1) KdM ] , close to the eigenvalue of the disturbance dynamics Ad , reflect-
From (15-16), the first N + 1 block elements of these control ing the amplified benefits of disturbance feedforward. We take
gains are identical (i.e. up to K(N −1) ). feedforward horizon N = 10 and consider the following three
cases, plotted in Figure 2.
These control gains multiply the following augmented state 1. (Almost) perfect feedforward: Vf,i = 10−6 , i = 1, . . . , 10.
estimates. For the N -horizon problem, This corresponds to a disturbance feedforward noise level
E{xt |YN t−1
}
which is negligible in terms of its effect on closed-loop perfor-
E{d |Y t−1 } mance. This is the case studied in Hazell and Limebeer [2010].
t N 2. Moderate reliability: Vf,i = 0.01 × 1.85i .
N
..
3. Diminished reliability: Vf,i = 0.01 × 2.3i .
X̂t|t−1 =
.
,
t−1 For e c ast r e l i ab i l i ty
E{dt+N −1 |YN }
45
V f , n, C ovar i an c e of f or e c ast n oi se
t−1
E{xNd,t |YN } 40
Pe r f e c t f or e c ast
35
and for the M -horizon problem, as ξ → ∞, M o d e r at e r e l i ab i l i ty
30 D i m i n i sh e d r e l i ab i l i ty
t−1 t−1
E{xt |YM } E{xt |YN }
25
E{dt |Y t−1 } E{dt |Y t−1 }
M N
20
.
..
..
.
15
t−1 t−1
t+N −1 |YM } E{dt+N −1 |YN }
E{d
M 10
X̂t|t−1 = → ,
t−1 t−1
E{dt+N |YM } E{dt+N |YN } 5
..
..
. . 0
t−1 t−1
E{dt+M −1 |YM } E{dt+M −1 |YN } −5
t−1 t−1
E{xMd,t |YM } E{xMd,t |YN }
−10
1 2 3 4 5 6 7 8 9 10
where we have invoked Lemma 1 and (17) to alter the condi- For e c ast h or i z on
tioning and a notational convenience has been used to indicate
that the disturbance state xd,t is further in the future for the M Fig. 2. Forecast noise covariances for the three cases: no noise,
problem versus the N problem. moderate reliability and diminished reliability.
180
19th IFAC World Congress
Cape Town, South Africa. August 24-29, 2014
181