You are on page 1of 20

Recent Advances and Challenges in Process Identification

Sten Bay Jørgensen∗ Jay H. Lee†


CAPEC Department of Chemical Engineering
Department of Chemical Engineering Georgia Institute of Technology
Technical University of Denmark Atlanta, Georgia, USA
Lyngby, Denmark

Abstract
Process identification is undergoing tremendous developments as computational capabilities improve. Theories are rapidly
catching up with the needs of practical applications but practical process identification experiences still reveal significant
gaps between theory and practice. This review attempts to highlight the present gaps and challenges. With this objective,
the review treats recent progress in process identification with data gathered in closed loop, and in the tailoring of an
entire identification process to a given control objective.

Keywords
Process identification, Model uncertainty, Robust control, Parameter estimation, Closed-loop identification, Identification
for control, Multivariable systems, Nonlinear systems, Time-varying systems

Introduction whole identification procedure to a given control objec-


tive. The past decade has also seen an unprecedented
Process identification is concerned with using observed
range and volume of applications of process identifica-
process data to form a model of the process that can be
tion in industries, mainly to provide models for predic-
used in various constructive ways for process improve-
tive control. After a decade of such explosive develop-
ment. It is unquestionably one of the most important
ments, it is indeed apt to reflect upon the progress and
steps of control system design—accounting for as much
the state of the field at this CPC meeting.
as 80-90% of the cost and time involved. It encompasses
The gap between research and practice, though nar-
a diverse set of tasks that include plant testing, selection
rower than in most fields, is nevertheless significant and
of a model structure, parameter estimation, and model
therefore is worth elaborating a bit:
validation. Prior to actual controller implementation, it
• Plant Test: Industrial plant tests use simple signals
is the only step that requires direct interaction with the
like steps or PRBSs. In addition, it is almost always
process. Consequently, any erroneous decision here can
limited to perturbing one input at a time, mostly out
jeopardize success of an entire control project. This fact
of the concern for unpredictable effects on process
demands that all decisions involved in identifying a pro-
behavior (Qin and Badgewell, 1997). Literature is
cess model be made carefully and systematically based
replete with optimal test signal design methods in-
on sound scientific principles and methods—a fact that
cluding those that attempt to incorporate specific
perhaps explains the good synergy between researchers
control requirements and process characteristics into
and practitioners of the field.
design in an iterative manner (Rivera et al., 1993;
The paper of Åström and Eykhoff (1971) was one of
Asprey and Macchietto, 2000; Pearson, 1998; Cooley
the first to review the research in system identification.
and Lee, 2001). However, such tailored and itera-
During the 70s and 80s, the theoretical foundations for
tive designs have rarely been attempted in practice,
system identification were laid by the pioneering work
if ever. The single-input testing will inevitably em-
of Ljung (1987), Söderström and Stoica (1989) and their
phasize accuracy of individual SISO dynamics but
coworkers. The work of Ljung centered around a par-
several studies have shown that accurate identifica-
ticular paradigm called Prediction Error Minimization
tion of SISO dynamics may be inadequate for mul-
(PEM), which today is the norm of industrial practice.
tivariable control of certain types of plants (e.g., ill-
Process identification at the beginning of the 90s was
conditioned plants) (Andersen et al., 1989; Koung
reviewed by Andersen et al. (1991) in CPC-IV.
and MacGregor, 1994; Li and Lee, 1996).
The decade has seen several major developments. One
• Model Structure: Popular structures are Finite Im-
of them is the subspace approach, which was motivated to
pulse Response (FIR) models and ARX models,
overcome some drawbacks of PEM for multivariable sys-
both of which lead to linear regression problems.
tem identification (Van Overschee and De Moor, 1996;
Other structures, like ARMAX models, OE mod-
Verhaegen and Dewilde, 1992; Larimore, 1990). Signifi-
els, and Box-Jenkins models, which require non-
cant advances have also come along in closed-loop iden-
convex optimization, are less common but are used
tification and “control-oriented” (or “control-relevant”)
in some occasions. In almost all cases, Multiple-
identification. In these areas the goal is to tailor the
Input-Single-Output (MISO) structures are used, in
∗ sbj@kt.dtu.dk
which a separately parameterized model is fitted for
† jay.lee@che.gatech.edu each output (Andersen et al., 1991). This practice

55
56 Sten Bay Jørgensen and Jay H. Lee

is clearly inefficient, both with respect to model or- dustrial practice. However, model uncertainties are
der and accuracy, in view of the fact that most in- seldom captured in a form that can be used for ro-
dustrial process outputs exhibit significant levels of bust controller design. There is an extensive liter-
cross-correlation. The preference for (or insistence ature on how information on process noises (either
on) MISO structure is clearly linked to ease of pa- statistics or bounds) can be translated into model
rameter estimation as explained below. bounds (Rivera et al., 1993; Ninness and Goodwin,
• Model Estimation: PEM is by far the most domi- 1995; Böling and Mäkilä., 1995). However, these
nant method for estimating model parameters, per- methods are seldom used as process noise informa-
haps owing to its flexibility and sound theoreti- tion itself is unavailable or highly inaccurate. In
cal basis as well as the ready availability of soft- addition, there are many other contributing uncer-
ware tools. However, with multivariable structures, tainties, e.g., process nonlinearities, actuator errors,
PEM requires special parameterizations and non- etc. Good theories with wrong assumptions are just
convex optimization (Ljung, 1987; Van Overschee as ineffective as bad theories.
and De Moor, 1994), a fact that perhaps explains • Closed-Loop Testing and Iterative Improvement:
the industry’s proclivity toward use of MISO struc- Many industrial processes already have several
tures. The subspace approach is designed to obvi- working loops that may not be removed. Hence,
ate these problems but requires relatively large data we suspect that closed-loop testing has already been
sets. In fact, the two approaches are best combined practiced to some extent. However, it is not clear
into one: The subspace approach can be used to pro- whether practitioners are always aware of poten-
vide a good initialization for PEM, which should al- tial problems that can arise with usage of feed-
leviate the aforementioned problems. However, ex- back correlated data. It is also less common to use
tensive use of such MIMO identification methods in closed-loop testing as a way to generate data that
the industry is not at all evident. are highly informative for feedback controller design
Statistical methods like the Maximum Likelihood (Gevers and Ljung, 1986; Gevers, 2000; Forssell and
Estimation or Bayesian estimation have not found Ljung, 1999a). In addition, many published papers
much use. This is probably because of the lack of deal with iterative improvement of model and con-
probability information or the fact that these com- troller through closed-loop testing (Van den Hof and
plex methods often reduce to the same least squares Scharma, 1995; Hjalmarsson and Birkel, 1998). The
calculation as PEM under commonly made prob- idea of continually improving the closed-loop per-
ability assumptions. Likewise, use of nonparamet- formance by using data collected from a working
ric methods such as the frequency-domain Empirical loop is very attractive from a practical viewpoint.
Transfer Function Estimation seems rare, probably It connects well with the industry’s growing con-
due to the lack of sufficiently large data sets. cern over maintaining performance of advanced con-
• Use of Estimated Model: The PEM approach trollers. However, it is not clear that the industry
with many structures (e.g., ARX or ARMAX) as at large has seriously considered this possibility.
well as the subspace approach yield a combined • Nonlinear Process Identification: While systematic
deterministic-stochastic system model. However, tools for formulating first engineering principle mod-
the noise part of the model is seldom used in control els have begun to appear, few systematic methods
system design. This may be due to the belief that for identifying first engineering principle models are
disturbances experienced during an identification available (Asprey and Macchietto, 2000; Lee, 2000).
experiment are not representative of those encoun- On the other hand, some implementations of non-
tered during real operation. However, this practice linear model predictive control have been reported
bears some danger as the two model parts are iden- (Qin and Badgewell, 1998; Young et al., 2001). The
tified to work together as a single predictor. Also, most common industrial approach to deal with pro-
many process monitoring and soft-sensing schemes cess nonlinearities is by use of multiple models.
require precisely information on how variables are Switching rules among different models are ad hoc
correlated to one another in time due to unknown and seldom systematically designed. Some applica-
inputs and noises. Hence, the noise part of a multi- tions of artificial neural networks are reported but
variable model, when fitted with appropriate data, their effectiveness as causal models, i.e., as opti-
can serve a very useful function. In addition, some mization and control would use them, is question-
control applications, such as those involving infer- able at best. Despite the vigorous research in this
ential estimation, should clearly benefit from avail- area during the past decade, the field still lacks a ba-
ability of an accurate noise model (Amirthalingam sic framework and a unifying theoretical foundation
and Lee, 1998; Amirthalingam et al., 2000). (Johansen and Foss, 1995).
• Model Validation: Standard tests like residual anal- These gaps in Table 1 serve to motivate our selec-
ysis and cross validation are ubiquitous in the in- tion of topics covered in this review paper. We have
Recent Advances and Challenges in Process Identification 57

Issue Practice Theory


Model Structure SISO/MISO ARX or FIR MIMO State Space
Parameter estimation PEM (Least Squares) Subspace and PEM
Noise info usage Rarely used Disturbance estimation, inferential control
Model validation Residual analysis Model error bounds
Plant Test One input at a time Simultaneous testing of multiple inputs
Closed Loop test Little understood Vigorously researched
Nonlinear Process Identification Seldom used Systematic tools lacking

Table 1: Summary of current gaps between practice and theory.

chosen topics, for which theories, in our view, have ad- Closed-Loop Identification
vanced to a point that some of the aforementioned gaps
Introduction
can be closed to bring significant benefits to the prac-
tice. For that, we concentrate mainly on two topics, During the past decade, interest in closed-loop identifi-
control-oriented identification and closed-loop identifica- cation by the community has risen noticeably. Closed-
tion, which in our view have seen the most significant loop identification is motivated by the fact that many
research and progress related to process control during industrial processes have already in place one or more
the past decade. We attempt to highlight the progress as loops that cannot be removed for safety and/or economic
well as the current limitations, and point out obstacles reasons. Beyond this lies the attractive idea of being
an engineer may face in adopting the new approaches in able to improve the closed-loop performance continually
practice. by making use of data being collected from a working
We have chosen to leave out the subspace approach in loop. It has also been claimed that data collected from a
our review, other than developments relevant to closed- closed-loop operation better reflect the actual situation
loop identification, as the topic has already been well in which the developed model will be used, and there-
publicized. Current literature on this topic includes a fore could yield better overall results (Gevers and Ljung,
book (Van Overschee and De Moor, 1996) and several 1986; Hjalmarsson et al., 1996; Gevers, 2000).
review papers, e.g., Viberg (1995) and Shi and MacGre- On the other hand, a closed-loop condition presents
gor (2000). We have also chosen to leave out a review on some additional complications for system identification.
nonlinear process identification, for which only limited The fundamental problem is the correlation between the
progress has occurred but further developments are very output error and the input through the feedback con-
much needed. Marquardt (2001)in this conference gives troller. Because of the correlation, many identification
a comprehensive coverage of this topic. methods that are proven to work with open-loop data
The rest of the paper is organized as follows. In Sec- can fail. This is true for the prediction error approach
tion 2, we review the current state of knowledge in closed- as well as the subspace approach and nonparametric ap-
loop identification. The asymptotic behavior of various proaches like empirical transfer function estimation.
identification approaches are discussed but our focus re- Awareness of the potential failings has engendered sig-
mains on practical implications of the theories. In Sec- nificant research efforts, which in turn have led to better
tion 3, we discuss the problem of tailoring the entire understanding of the properties of the existing methods
identification process to a given specific control objec- when used with closed-loop data as well as some remedies
tive. We discuss why this consideration naturally leads and special measures needed to circumvent the potential
to iterative identification and review both open-loop and problems. We will try to give a concise and pragmatic
closed-loop strategies. In Section 4, we conclude. summary of the recent developments, concentrating on
Our main objective for writing this paper is to fuel an the practical implications of the theoretical results. See
honest and substantive debate among researchers and Gustavsson et al. (1987); Van den Hof and Scharma
practitioners on the current state of identification theo- (1995); Forssell and Ljung (1999a) for more comphen-
ries as related to potential closure of the existing gaps. sive surveys and formal disquisitions on the topic. We
We hope that such a debate will clarify the strengths and will focus on the prediction error approach (Ljung, 1987),
limitations of the existing theories and methodologies for which is the standard at the moment, but we will also
practitioners. We also hope that the factors previously point to some potential problems and remedies for the
passed over by the researchers but must be accounted for subspace approach at the end.
theories to be practicable will also be brought out in the The closed-loop identification methods can be classi-
open. fied into three broad categories. In the direct approach,
the feedback is largely ignored and the open-loop system
is identified directly using measurements of the input and
58 Sten Bay Jørgensen and Jay H. Lee

e
Dither Signal H0 be untenable in many industrial situations where the
r d controllers are equipped with various anti-windup and
Location 2
Location 1
y override features.
C u
G0
-
Direct Approach
In the direct approach, measurements of the plant input
and output are used to build a model for the open-loop
Figure 1: Typical setup for closed-loop identification. system directly, as in open-loop identification. The main
advantage of the direct method is that the controller is
not restricted to linear ones and its identity needs not
be known. However, the correlation introduced by the
output. In the indirect approach, the closed-loop transfer
feedback can cause problems and successful application
function between an external perturbation signal and the
demands some additional knowledge as we shall see.
output (or input) is first identified and an open-loop sys-
tem model is determined from it with knowledge of the The Method. The generic model structure used in
controller. Finally, in the joint input-output approach, linear identification is as follows:
measurements of both the input and output are used to
identify a joint system, which has the plant input and y(t) = G(q, θ)u(t) + H(q, θ)e(t) (2)
output as its outputs and the external perturbation sig-
nal as its input. From the joint system, an open-loop sys- θ is the vector of unknown parameters of the model and
tem model is extracted. We will examine each approach e is a white noise sequence. We may restrict the search

one at a time, concentrating on asymptotic properties to θ ∈ ΘM . In this case, GM = {G(q, θ) | θ ∈ ΘM } and
and their implications. ∆
HM = {H(q, θ) | θ ∈ ΘM } represent the allowable set
The block diagram in Figure 1 displays a typical setup
for plant model G and noise model H, respectively. Let
of closed-loop identification experiment. External per-
the available data be denoted by
turbations may be added to the setpoint of the controller
(at Location 1 in Figure 1) or directly to the controller DN = [y(1), u(1), · · · , · · · , y(N ), u(N )] (3)
output (at Location 2 in Figure 1). For the simplicity
of exposition, we will assume from now on that exter- The prediction error minimization (PEM) chooses θ̂N ,
nal perturbations enter through Location 2. Note that, the estimate of θ based on DN , according to
in the case that the controller is linear, this assumption
can be made without loss of generality since perturba- θ̂N = arg min VN (θ, DN )
 
(4)
θ∈ΘM
tions introduced at Location 1 can always be rewritten
as equivalent perturbations at Location 2. where
Disturbances in the plant output are described col-
lectively as a white noise signal (denoted by e) passed 1 X
N
through some linear filter H0 . Without loss of general- VN (θ, DN ) = kε(θ, t)kW (5)
N t=1
ity, we will assume that H0 is stably invertible and monic
(H0 (∞) = I) and the white noise signal e has the mean ε(θ, t) = H −1 (q, θ) [y(t) − G(q, θ)u(t)] (6)
of zero and the covariance of Pe .
Assuming the plant G0 and the controller C are both ∆
and kxkW = xT W x denotes the weighted quadratic
linear, we can write down the following closed-loop rela- norm.
tionships:
Notation: We will often use symbols ĜN and ĤN
(I + G0 C)−1 G0 (I + G0 C)−1 H0
 
" # " # to denote G(q, θ̂N ) and H(q, θ̂N ). The same symbols
y(t) | {z }  r(t)
= 
 S0  may also be used to compactly represent G(ejω , θ̂N ) and
−1
 (1 + CG0 ) −C(I + G0 C)−1 H0  e(t)

u(t) | {z } H(ejω , θ̂N ) when the context makes their meanings clear.
S0r Similarly, we will use Gθ and Hθ to denote G(q, θ) and
(1) H(q, θ), or sometimes G(ejω , θ) and H(ejω , θ).
In the above, S0 and S0r represent the sensitivity function
and reverse sensitivity function respectively. Convergence Behavior. If all the signals are quasi-
Of course, both the assumption of linear plant and the stationary, VN (θ, DN ) → V̄ and θ̂N → θ̄ w.p. 1 as N →
particular way of describing the disturbance are great ∞, where
simplifications but are typical of developing and analyz- N
ing linear identification methods. In addition, the as- ∆ 1 X
V̄ (θ) = lim E{kε(θ, t)kW } (7)
sumption of linearity of the controller, when made, may N →∞ N
t=1
Recent Advances and Challenges in Process Identification 59

and • If the controller is nonlinear or time-varying, Φx



θ̄ = arg min V̄ (θ) (8) may be positive definite even without Φr > 0.
θ∈ΘM
• Suppose G0 ∈ GM but H0 ∈ / HM (implying the
From Parseval’s relation,
noise part is undermodelled). Then, from (13), we
Z π see that ĜN does not converge to G0 as Gθ = G0
1
θ̄ = arg min tr [W Φε ] dω (9) does not achieve the minimum of (13) in this case.
θ∈ΘM 2π −π
• When the noise model is completely fixed a priori (as
where Φε is the spectrum of the prediction error ε.
in an Output Error structure), ĜN does not converge
Since from (6),
to G0 in general. From expression (13), we see that
ε(t) = Hθ−1 [G0 u(t) + H0 e(t) − Gθ u(t) − Hθ e(t)] + e(t) ĜN → G0 only when the assumed noise model is
(10) perfect or Φue = 0, which corresponds to the open-
and e(t) is independent of (G0 − Gθ )u(t) and (H0 − loop case.
Hθ )e(t) because (G0 − Gθ ) and (H0 − Hθ ) both contain
Error Analysis. The error behavior in the limit can
at least one delay,
be formalized as follows.
Φε = Hθ−1 ∆GH Hθ−1∗ + Pe (11)
Bias
where Bias refers to the expected error E{θ0 − θ̂N }. In terms
(G∗0 − G∗θ )
  

∆GH = (G0 − Gθ ) (H0 − Hθ )
 Φu Φue of our notations, it is θ0 − θ̄ (or G0 − Gθ̄ in terms of
Φeu Pe (H0∗ − Hθ∗ ) frequency-domain transfer function). An expression for
(12) bias in the limit can be obtained by further manipulating
Hence, equation (13) into the following expression (Forssell and
Z π Ljung, 1999b).
tr ∆GH Hθ−1∗ W Hθ−1 dω

θ̄ = arg min (13)
θ∈ΘM −π θ̄ =
Z π
The expression in (13) provides some good insights into
arg min tr {[(G0 + Eθ − Gθ )Φu (G0 + Eθ − Gθ )∗
the convergence behavior under the direct approach. θ∈ΘM −π
• Suppose G0 ∈ GM and H0 ∈ HM (i.e., the true plant + (H0 − Hθ )(Pe − Φeu Φ−1
u Φue )
and noise process both lie inside the parameterized
(H0 − Hθ )∗ ] Hθ−1∗ W Hθ−1 dω

(15)
model sets). Then, it is clear from the objective
function that the minimum corresponds to Gθ̄ = G0 where
and Hθ̄ = H0 (implying consistent estimation) if Eθ = (H0 − Hθ )Φeu Φ−1
u (16)
 
Φu Φue From (15), we can conclude the followings:
Φx = >0 ∀ω.
Φeu Pe 1. If the parameterization of the plant/noise model is
flexible enough that Gθ ∈ GM and Hθ ∈ HM ,
• In the case of a linear controller,

Φu Φue
 Gθ̄ = G0 and Hθ̄ = H0
Φeu Pe provided that the minimum is unique, which is guar-
S0 Φr S0r∗ + CS0 H0 Pe H0∗ S0∗ C ∗ −CS0 H0 Pe
 r
anteed by Φru > 0. In this case, ĜN is an unbiased

estimate of G0 .
| {zr } | {z } | {z }
 Φu Φeu Φue
=

−Pe H0∗ S0∗ C ∗ Pe 2. Suppose noise model is fixed a priori as HM , which

 
| {z
Φeu
} does not have any dependence on θ. (Alternatively,
assume that separate sets of parameters are used for
Φu + Φue Pe−1 Φeu Φue
 r 
= G and H as in Box-Jenkins model). Also suppose
Φeu Pe that Gθ ∈ GM . Then, we can conclude from the
−1
  r  
I Φue Pe Φu 0 I 0 above that
=
0 I 0 Pe Pe−1 Φeu I
(14) (G0 − Gθ̄ ) = (H − HM )Φeu Φ−1
u (17)

Note that Φu is expressed as sum of two compo- Hence, (H − HM )Φeu Φ−1 u is the bias. The bias will
nents, Φru representing the contribution from exter- be zero if one or both of the following conditions is
nal dithering and Φeu the contribution from noise satisfied.
feedback. From (14), it is clear that Φx > 0 iff • The assumed noise model is perfect, i.e., HM =
Φru > 0. H0 .
60 Sten Bay Jørgensen and Jay H. Lee

• Φeu = 0, which implies open-loop testing. in the frequency domain is shaped by the signal-to-noise
ratio.
We also note that the size of bias depends on the
For a closed-loop system, it follows directly (Forssell
following two things:
and Ljung, 1999b) that
• (H − HM ), which is the error in the assumed n −T
noise model. Cov(vecĜN ) ∼ (Φru ) ⊗ Φd (23)
N
• Φeu Φ−1
u , which is affected by, among many Recall that Φu = Φru + Φeu . Thus the above says that the
things, the power of the external perturbation excitation contributed by noise feedback does not con-
signal relative to the noise signal. Note that tribute to variance reduction, at least in the asymptotic
case of n → ∞. One can make some sense of this result
Φeu Φ−1 −1 e −1
u = (Pe Φu ) × (Φu Φu ) (18) by considering the extreme case that input excitation is
entirely due to noise feedback. In this case, measure-
Pe Φ−1
u is the noise-to-signal ratio in an open- ments of output and input contain just the information
loop sense. What multiplies this is Φeu Φ−1 u , about closed-loop transfer functions (I + G0 C)−1 H0 and
which can be interpreted as the relative contri-
C(I + G0 C)−1 H0 . If the model order is allowed to ap-
bution of the noise feedback to the total input
proach infinity, even perfect knowledge of the closed-loop
power. The larger the noise feedback’s contri-
functions yields no information about G0 and H0 inde-
bution, the bigger the bias. Note that increas-
pendently. This is because, for an arbitrary choice of H0 ,
ing the controller gain will decrease Pe Φ−1
u but one can always choose G0 to match any given closed-loop
will also increase Φeu Φ−1
u , thus making its effect functions and vice versa (barring an invertibility prob-
on the bias inconclusive.
lem). In such a case, variance would be infinite at all
3. In the case of undermodeling such that Gθ ∈ / GM frequencies, which is consistent with expression given in
and/or Hθ ∈ / HM , θ will be chosen to make both (23).
G0 − Gθ and H0 − Hθ small. Bias in the frequency An exceptional case is when noise model is perfectly
domain will be distributed according to the weight- known a priori. In this case, one can show that asymp-
ings given in (15). totic variance distribution follows the open-loop case.
Hence, noise feedback contributes just as much as ex-
ternal dithering to variance reduction. This is consistent
Variance with the foregoing argument as the perfect knowledge of
The other part of error is variance, which refers to the H0 would enable direct translation of information about
error due to an insufficient number of data points relative the closed-loop functions into that about the open-loop
to the number of parameters. This error is mathemati- function.
cally formalized as In situations where model order can be constrained
  T  to a finite number, the noise feedback would make some

Cov(θ̂N ) = E θ̂N − θ̄ θ̂N − θ̄ (19) contribution to the information content for the estima-
tion of open-loop functions G and H.
in the parameter domain and
Notation: The Kronecker product between A ∈ <n×m
and B ∈ <p×r is defined as
 

Cov vecĜN =
a1,1 B ··· ··· a1,m B
  T   
E vecĜN − vecGθ̄ vecĜN − vecGθ̄ (20)  .. .. 
 a1,2 B . ··· . 
A⊗B =
.. ..
 (24)
 .. .. 
in the frequency domain, where the notation vec(·) refers  . . . . 
to a vectorized form of a matrix obtained by stacking the an,1 B ··· ··· an,m B
columns of the matrix into a single column.
Practical Implications. The following are the im-
In Zhu. (1989), it is shown for open-loop identification
portant points to take away from the foregoing analysis.
that, as n → ∞ and N → ∞,
• The main advantage of the direct approach is that
n −T the controller is completely taken out of the picture
Cov(vecĜN ) ∼ (Φu ) ⊗ Φd (21)
N in the estimation step. Not only is it unnecessary
to know the controller, but it is also not required
where
for the controller to be linear and time-invariant.
Φd = H0 Pe H0∗ (22)
It is an important advantage considering that most
and ⊗ denotes the Kronecker product. The above ex- industrial controllers are not adequately represented
pression shows that the asymptotic variance distribution by a linear, time-invariant operator.
Recent Advances and Challenges in Process Identification 61

• The most serious problem for the direct approach much input excitation is there through noise feed-
is that noise model (structure) needs to be perfect back. On the other hand, if the noise structure is
in order for it to yield consistent estimates. Fixing very restricted (for example, fixed completely), ex-
noise model completely a priori, as is often done in ternal dithering would be needed to reduce bias.
practice, results in a biased-esimate of G0 in general. • Choice of location for dithering (between Location 1
For example, the popular least squares identification and Location 2) is not important as a perturbation
of finite impulse response parameters would give a made at one location can always be translated into
biased model. This is the most important difference an equivalent perturbation at the other location. On
from the open-loop identification case. the other hand, the perturbation signal (e.g., its
• The theoretical result indicates that consistent esti- spectrum) should be designed accordingly. To see
mates can be obtained by leaving noise model suf- this, let us compare the expressions for the resulting
ficiently flexible (so that Hθ ∈ HM ). Though this input spectrum (Φru ), which appears in the asymp-
would indeed help reduce bias, it may not get rid of totic variance expressions of (23), under the two
the bias problem completely in practice as most dis- dithering strategies. For dithering at Location 1,
turbances in industrial processes are non-stationary we obtain
and are not accurately captured by a white noise Φru = CS0 Φr S0∗ C ∗ (25)
through a linear filter. In addition, increasing the Dithering at Location 2 gives
order of model would mean increased variance, thus
demanding heavier external dithering. Φru = S0r Φr S0r∗ (26)

• In case that noise part is undermodelled, the size Loops with integral controllers yield S0 (or S0r ) that
of the resulting bias in G depends on the signal-to- starts from 0 at ω = 0 and increases to 1 at high fre-
noise ratio and the relative contribution of exter- quencies. Hence, a white noise perturbation signal
nal dithering to the total input power (compared at Location 2 would give Φru that is zero at ω = 0
to noise feedback). Hence, in principle, bias can and very low in the frequency region well below the
be made negligible by overwhelming noise feedback controller’s bandwidth. In view of (23), this would
with heavy external dithering. On the other hand, make the low frequency part of the model very poor.
boosting the signal-to-noise ratio by increasing the Estimation of the low frequency dynamics is exac-
controller gain has no ameliorating effect on bias as erbated by the typical shape of noise spectrum Φv ,
it increases the relative contribution of noise feed- which is high in the low frequency region for most
back to the input power as well the signal-to-noise process control problems. On the other hand, white
ratio . noise dithering at Location 1 does not suffer from
this problem to a same degree since CS0 ≈ G−1 0 in
• External dithering with a persistently exciting sig- the low frequency region and ≈ I in the high fre-
nal is necessary for consistent estimation. It is gen- quency region. However, white noise dithering sig-
erally not sufficient to have an input that is persis- nal is generally not optimal, regardless of the loca-
tently exciting. The requirement for external dither- tion, and its spectrum could be designed systemati-
ing may be removed by using a nonlinear controller cally based on estimated noise spectrum and desired
or a time-varying controller. variance distribution.
• Asymptotic variance of a frequency transfer func- • In many practical situations, disturbances process
tion estimate is shaped by input excitation achieved sees during an identification experiment are non-
through external dithering. The excitation through stationary and better represented by a model that
noise feedback contributes little to variance reduc- contains integrators such as the one shown below:
tion when model order is high. This means relying
solely on noise feedback for input excitation can lead 1
y = G(q)u + H(q) e
a very bad result (an estimate of infinite variance). 1 − q −1
Exception is the case where the noise model is accu- ⇒ ∆y = G(q)∆u + H(q)e (27)
rately known a priori. However, such a case would
be rare in practice, and any error in the a priori From the right-hand-side of the arrow in the above,
fixed noise model could translate into a bias in the we see that differencing the input and output data
estimate for G0 . prior to applying the PEM makes this case equiv-
alent to the standard case. All the foregoing dis-
• External dithering is motivated from both the view- cussions regarding the signal spectra hold with re-
point of bias and of variance. If the noise structure spect to the differenced signals. For example, to
is left very flexible, variance would be a significant distribute the variance fairly evenly across the fre-
problem without external dithering, no matter how quency, one should use a dithering strategy where
62 Sten Bay Jørgensen and Jay H. Lee

an integrated white noise signal (or an integrated Another concern may be that resulting ĜN may not
PRBS) is added to the controller’s setpoint. This even give a stable closed loop under the controller C. To
would make ∆r a white noise signal in our analysis. ensure that the model is stabilized by the controller, one
On the other hand, differencing of noisy data can can first parameterize the set of all linear plants that are
amplify the noise effect and make the identification stabilized by C using the dual Youla parameterization
more difficult. and then perform the search over the set (de Callafon
and Van den Hof, 1996).
Indirect Approach By applying the expression of (13) to the indirect iden-
In the indirect approach, measurements of plant output tification, convergence behavior can be easily analyzed.
(or input) is used along with record of an external dither It is not discussed here since the problem is essentially
signal to build a model for the closed-loop system first. the same as the open-loop case. Error analysis can also
Then, based on knowledge of the controller, an estimate be carried out in a similar manner as before. Asymptotic
for open-loop function is extracted from the estimate of variance of ĜN under the indirect method is the same as
the closed-loop function. Note that in the direct method and follows (23).

y(t) = (I + G0 C)−1 G0 r(t) + (I + G0 C)−1 H0 e(t) (28) Key Points.


| {z } | {z }
T0yr T0ye • To obtain the closed-loop function, one can use
any proven open-loop identification method without
Hence, in the first step, T0yr is estimated using data modification.
record for r and y, and in the second step, G0 is ex-
tracted from the estimate. • Consistent estimation is possible even without a per-
The main advantage of the indirect approach is that fect noise model (structure), which is the main at-
the first step is in essence the same problem as open- tractive point for the indrect approach.
loop identification because one does not have to be con- • The most serious problem for the indirect approach
cerned with any feedback-induced correlation between is the assumption of linear controller, which may be
the system input (r) and the noise (e). This removes untenable in many industrial situations. Most in-
the requirement of perfect noise model (structure) for dustrial controllers are equipped with special anti-
consistent estimation. On the other hand, the second windup/override features and their behavior may
step requires a mathematical representation of the con- not be represented accurately by a single linear func-
troller, which generally has to be assumed linear in order tion. One needs to be careful that these special
to extract the open-loop functions from the closed-loop features do not become active during data collec-
function. In addition, the resulting model can be of very tion. An error in the controller representation will
high order, depending on the parameterization used. translate into an error in extracting the open-loop
function from the closed-loop function.
The Method. The generic model structure used
here is • Another potential disadvantage is high model or-
der that often results from the two step calcula-
y(t) = T yr (q, θ)r(t) + T ye (q, θ)e(t) (29) tion. This problem may be obviated by employing
a particular parameterization involving a parame-
The same PEM can be applied to the above model struc- terized form of the open-loop function (such as the
ture with data record of one in (31)). On the other hand, with such a spe-
cialized structure, the prediction error minimization
DN = [y(1), r(1), · · · , · · · , y(N ), r(N )]
becomes more demanding computationally.
to obtain estimates T̂Nyr and T̂Nye . Since T0yr = (I + • The asymptotic variance distribution in the fre-
G0 C)−1 G0 , we can obtain an estimate for open-loop quency domain follows the same expression as in the
function G0 as follows: direct identification case. Hence, all the previous
remarks regarding shaping of variance distribution
ĜN = T̂Nyr (I − T̂Nyr C)−1 (30) apply here as well.
The above calculation can result in ĜN of very high or- Joint Input-Output Approach
der. One may obviate this problem by using the following
The idea behind the joint input-output approach is to use
parameterization of the closed-loop function:
measurements of both the output and input to remove
T yr (q, θ) = (I + G(q, θ)C)−1 G(q, θ) (31) the requirement of a known controller. Since

However, adoption of such a parameterization would y(t) = G0 (I + CG0 )−1 r(t) + (I + G0 C)−1 H0 e(t) (32)
| {z } | {z }
make parameter estimation (via PEM) more complex. T0yr T0ye
Recent Advances and Challenges in Process Identification 63

and Hence, ur represents the portion of the input generated


by signal r and ue is the portion due to noise feedback.
u(t) = (I + CG0 )−1 r(t) + −C(I + G0 C)−1 H0 e(t) (33) Now,
| {z } | {z }
T0ur T0ue y(t) = G0 ur (t) + G0 ue (t) + H0 e(t) (39)

we have Since ur is uncorrelated with the rest of the right-hand-


−1
G0 = T0yr (T0ur ) (34) side, a consistent estimate of G0 can be obtained with
data for y and ur . This consideration leads to the fol-
The above relation can be used to calculate the open- lowing method:
loop function from the two closed-loop functions without
knowledge of the controller. 1. Start with the parameterized structure
In effect, in the joint input-output approach, the con-
troller is “identified.” Even though the controller is not u(t) = T ur (q, θ)r(t) + TM
ue
(q)e(t) (40)
required to be known, it is implicitly assumed to be linear
and time-invariant, an assumption that may not always Apply PEM to the above model with data record
be justified in practical situations. DN and obtain T̂Nur .
2. Obtain data for ur through
The Method. In the first step, the following model
form is used: ûrN (t) = T̂Nur r(t), t = 1, · · · , N (41)
   yr   ye 
y(t) T (q, θ) TM (q)
= r(t) + e(t) (35) 3. Apply PEM to the model structure
u(t) T ur (q, θ) ue
TM (q)
ye
In the above, we assumed that the noise part of the model y(t) = G(q, θ)ur (t) + TM (q)e(t) (42)
is fixed a priori but it too can be parameterized for esti-
using the data record of
mation. Note that consistent estimates can be obtained
with a fixed noise model in this case since r and e are DN = [ûrN (1), y(1), · · · , · · · , ûrN (N ), y(N )]
uncorrelated.
To the above, PEM can be applied with the data to obtain ĜN .
record of
In the projection method, T ur is allowed to be an
N
D = [y(1), u(1), r(1), · · · · · · · · · , y(N ), u(N ), r(N )] acausal operator by parameterizing it as a two-sided FIR
filter:
to obtain estimates T̂Nyr and T̂Nur . Then open-loop plant Xn2

estimate ĜN is obtained by


ur
T (q, θ) = θi q −i (43)
i=−n1
 −1
ĜN = T̂Nyr T̂Nur (36) This is to ensure that the resulting ûrN is uncorrelated
(“orthogonal” in the least squares language) asymptoti-
One drawback is that ĜN so obtained may be of very cally with u − ûrN , which becomes a part of the residual
high order. This can be circumvented by using a model in the second PEM. Recall that, in the least squares esti-
form that recognizes the underlying structure, such as mation, the residual has to be orthogonal to the regressor
the one below: in order to obtain a consistent estimate (Ljung, 1987).
If the controller is linear, all the preceding argument
T yr (q, θ) = G(q, θ1 )T ur (q, θ2 ) (37) holds and the orthogonality of the residual can be as-
  sured (asymptotically) with a sufficiently rich yet causal
θ1 Tθur . However, with a nonlinear or time-varying con-
θ=
θ2 troller, ûrN obtained with any casual T̂Nur and u− ûrN may
By adopting the above structure, G(q, θ1 ) is identified be correlated, thereby destroying the consistency of esti-
directly, rather than through the inversion in (36). mation in the final step. By employing an acausual FIR
filter with sufficiently large n1 and n2 for T ur (q, θ), one
Two-Stage Method and Projection Method. gets ûrN that is uncorrelated (orthogonal) with u − ûrN ,
Among the variations of the above method are the so even when the controller is nonlinear or time-varying.
called two-stage method by Van den Hof and Schrama. The convergence behavior is essentially the same as for
(1993) and projection method by Forssell and Ljung the indirect approach and does not require an elaborate
(2000a). Note that discussion here. Error behavior too is similar to that
for the indirect approach. The variance also follows the
u(t) = (I + CG0 )−1 r(t) + −C(I + G0 C)−1 H0 e(t) (38) same expression of (23). The same comments apply to
| {z } | {z }
ur (t) ue (t) the two-step method and the projection method.
64 Sten Bay Jørgensen and Jay H. Lee

Closed-Loop Perfect External Linear Known


method Noise Model Dithering Controller Controller
Direct Yes Yes∗ No No
Indirect No Yes Yes Yes
Joint I/O No Yes Yes No
Two-Step No Yes Yes No
Projection No Yes No No
∗ Unless the noise model is perfectly known a priori

Table 2: Summary of the requirements of different closed-loop identification methods.

Key Points. Some key points for the joint I/O unbiased estimate, a nice proven property for most sub-
method and its off-springs are: space methods, is lost. A practical implication is that,
• As with the indirect approach, the main advantage with closed-loop data, the method may yield a poor
of the joint input-output approach over the direct model regardless of number of data points used. Here, we
approach is that consistent estimates can be ob- will briefly examine what fundamental problem closed-
tained even with an imperfect noise model. loop data pose for the subspace method and examine
some available modifications to the standard approach
• The main advantage of the joint input-output ap- in order to circumvent the problem.
proach over the indirect approach is that explicit
knowledge of the controller is not required. How- Main Idea of Subspace Identification. We first
ever, it does implicitly assume that the controller is review the subspace method. Though many versions ex-
linear. ist in the literature, the essential ideas are same and the
• The two step method is essentially same as the stan- practical outcomes from applying different algorithms
dard joint input-output method and does not appear should not differ much (Van Overschee and De Moor,
to offer any new advantage. 1995). We will discuss one of the most popular methods,
called N4SID, which was introduced by Van Overschee
• The projection method, on the other hand, further
and De Moor (1995).
improves it by removing the requirement of linear
The underlying plant is assumed to be
controller for consistent estimation. It also retains
the aforementioned advantage over the direct ap- x(t + 1) = Ax(t) + Bu(t) + w(t)
proach. The relaxation of the requirement for lin- (44)
ear controller behavior is practically significant in y(t) = Cx(t) + Du(t) + ν(t)
view of the fact that most industrial controllers show
where w(t) and ν(t) are white noise processes. The state-
some degree of nonlinear behavior due to various
space description is very general and subsumes most of
fixes and add-ons.
the input-output structures studied in the system iden-
• Measurement errors for the input do not destroy the tification literature.
consistency if they are uncorrelated with the dither The following is an alternative representation of the
signal. above when (44) is viewed as an input-output system
The practical requirements of the different closed-loop description:
identification approaches are summarized in table 2.
x∞ (t + 1) = Ax∞ (t) + Bu(t) + K∞ ε∞ (t)
Subspace Identification with Closed-Loop Data (45)
y(t) = Cx∞ (t) + Du(t) + ε∞ (t)
So far, our discussion has centered around PEM. An al-
ternative to PEM is subspace identification, which has (45) can be interpreted as the steady-state Kalman filter
drawn much attention in recent years. The main attrac- for (44) and hence ε∞ is the innovation sequence, which
tion for the subspace approach is that it yields a mul- is white. The two are equivalent in an input-output sense
tivariable system model without the need for a special and (45) is referred to as the innovation form for (44).
parameterization, which requires significant prior knowl- The N4SID method attempts to identify the follow-
edge and nonconvex optimization. Both represent sig- ing non-steady-state Kalman filter equation (within some
nificant hurdles for using PEM for multivariable system state coordinate transformation) to obtain the parame-
identification, which explains why it is hardly used for ters for the system equation in (45):
this purpose. Most subspace methods in the literature
can fail, however, when used with closed-loop data. By xn+1 (t + 1) = Axn (t) + Bu(t) + Kn εn (t)
(46)
“fail”, we mean that the guarantee of an asymptotically y(t) = Cxn (t) + Du(t) + εn (t)
Recent Advances and Challenges in Process Identification 65

From the notation, you may deduce that the above rep- and Hnf through linear least squares. The least squares
resents a non-steady-state Kalman filter started at t − n estimation yields consistent estimates of Hnp and Hnf as
with zero initial estimate and initial covariance set equal residual E0+ n (t) is orthogonal to regressors Yn (t), Un (t)
− −
0+
to the system’s open-loop covariance. Successful iden- and Un (t).
tification of the above equation yields system matrices Using estimate Ĥnp , one can determine the system or-
(A, B, C, D) for (45). In addition, with a large n, it gives der (by examining its rank) and obtain estimates for
a good approximation for stochastic part of the system Γon ,M1 and M2 (within a coordinate transformation). Fi-
in (45) since Kn → K∞ and Cov{εn } → Cov{ε∞ } as nally, data for the Kalman state xn can be constructed
n → ∞. Here n is assumed to be higher than the intrin- from the estimates as
sic system order.  Y−
 
n (t)

For the simplicity of discussion, we will assume from x̂n (t) = M̂1 M̂2 , t = n, · · · , N −n
U−n (t)
hereafter that u(t) is an independent (temporally uncor-
(52)
related) sequence.
The whole calculation can be done by performing some
The key to subspace identification is the following
matrix projections with appropriately arranged data ma-
multi-step prediction equation:
trices, which can be implemented in a computationally
efficient and robust way (Van Overschee and De Moor,
1996).
   
y(t) C
 y(t + 1)   CA  By following a similar procedure, one can also obtain
=  xn (t)
   
 .. .. data for xn+1 (t + 1), t = 1, · · · , N − n.
 .   . 
y(t + n − 1) CAn−1
After the data for xn (t) and xn+1 (t + 1) are obtained,
the system matrices are estimated based on the following
D 0 ··· 0
 
equation:

u(t)
 .. ..  u(t + 1)
 CB D . .          
+  ..
 xn+1 (t + 1) A B Kn
 .. .. .. .. 
.

= xn (t) + u(t) + εn (t)
 . . . .  
y(t) C D I
CAn−2 B ··· CB D u(t + n − 1) (53)

εn (t|t − 1)
 Again, since εn (t) is orthgonal to xn (t) and u(t), lin-
 εn (t + 1|t − 1)  ear least squares gives consistent estimates of A, B, C,
+

..

 (47) and D. One can also obtain consistent estimates of Kn
 .  ∆
and Pn (= Cov{εn (t)}) using the residuals from the least
εn (t + n − 1|t − 1)
squares.
εn (t + j|t − 1) represents the j + 1-step-ahead prediction In overall, one obtains consistent estimates of the
error (based on the Kalman estimate xn (t)). The above Kalman filter matrices in (46) since the first least
will be denoted by squares yields a consistent estimate of Hnp and there-
fore of xn (t) (within a coordinate transformation), and
Y0+
n (t) = Γn xn (t) + Hn Un (t) + En (t)
o f 0+ 0+
(48) the second least squares yields a consistent estimate
A key point is that E0+ of (A, B, C, D, Kn , Pn ) provided consistent estimates of
n (t) is orthogonal to both xn (t)
and Un0+ (t). Now, since the Kalman filter is linear with xn (t) and xn+1 (t + 1) are used.
respect to the measurement and the input, we can ex-  With the obtained  estimates denoted hereafter by
press Â, B̂, Ĉ, D̂, K̂, P̂ , one can form the state-space model
xn (t) = M1 Y−
n (t) + M2 Un (t)

(49)
x(t + 1) = Âx(t) + B̂u(t) + K̂e(t)
where (54)
y(t) = Ĉx(t) + D̂u(t) + e(t)
y(t − n) u(t − n)
   
 y(t − n + 1)   u(t − n + 1)  where e is assumed to be a white noise process of covari-
Y−
n (t) =  ; U−
n (t) = 
   
.. ..  ance P̂ . The deterministic part of the model is unbiased
 .   . 
and the stochastic part is slightly biased due to mismatch
y(t − 1) u(t − 1)
between Kn and K∞ as well as that between Pn and P∞ .
(50)
Note: If the input u is not white, the assumed initial-
Substituting (49) into (48) gives
ization of the non-steady-state Kalman filter should be
 Y− altered slightly in order to make E0+
 
n (t) n uncorrelated with
Yn (t) = Γn M1 M2
0+ o
+Hnf Un0+ (t)+E0+

n (t) xn (t) and U 0+ (t). In this case, the proper initial esti-
} Un (t)

| {z
p
Hn mate to assume for the Kalman Filter is not zero, but a
(51) function of U−n (t) and Yn (t). This creates a slight incon-

By arranging the data for y and u appropriately based sistency between the assumed initialization of the under-
on the above equation, one can obtain estimates for Hnp lying Kalman filter for xn (t) and that for xn+1 (t + 1),
66 Sten Bay Jørgensen and Jay H. Lee

and complicates the procedure somewhat. The details the residual. The identified ARX model is used succes-
of the modifications needed to save the nice asymptotic sively to construct the n-step predictions of y, which are
property are worked out in (Van Overschee and De Moor, obtained with all the future inputs set equal to zero (i.e.,
1994, 1996). This is mostly a theoretical concern, how- u(t) = · · · , u(t + n − 1) = 0). Denote the resulting n-step
ever. Since the effect of initialization on the Kalman predictions by
estimate becomes negligible as n → ∞, the basic algo-
Ŷ0+
 T T
rithm works well with a large n, even when the input is n (t) = ŷ (t|t − 1) ŷ (t + 1|t − 1)
not white. T
··· ŷ T (t + n − 1|t − 1) . (55)
Problem with Closed-Loop Data. With closed-
ion, Ŷn (t) is treated an object equivalent to
0+
loop data, consistency for the first least squares estima- Fromh here

tion breaks down. This is because E0+ Y (t)
n (k) is no longer Ĥnp Un− (t) in the standard method. Hence, the state is
uncorrelated with U0+ . For example, u(t) is a function n

of y(t) and therefore of εn (t|t − 1), which is correlated created as x̂n (t) = LŶ0+
n (t), where L contains the basis

with εn (t + 1|t − 1). Note that u(t) appears in the regres- picked by examining the data matrix for Ŷ0+
n (t).
sor for the two-step-ahead prediction model of y(t + 1).
Hence, the regressor becomes correlated with its residual Control-Oriented Process Identification
εn (t + 1|t − 1). Since Ĥnp is biased, the whole argument The term control-oriented process identification refers to
for convergence breaks down. a tailoring of an entire identification process to require-
The Modifications. Since in the indirect approach, ments of intended control. It points directly to the much
system input is an external perturbation signal which needed integration between model development and con-
has no correlation with system noise, the standard sub- troller design. The term perhaps was born in the midst
space method can be used to obtain a closed-loop sys- of a debate on the apparent disparity between system
tem model. Given a state-space representation of the identification methodologies and robust controller design
controller, a state-space model for the open-loop system methodologies. It was pointed out that, since most ro-
can easily be extracted from it. bust controller designs require both a nominal model and
Van Overschee and De Moor (1997) discuss a way error bounds, more than just a nominal model should be
to modify the projection algorithm they use in their passed from the identification step to the controller de-
N4SID algorithm to account for closed-loop nature of sign step.
data. Though this method does not belong to the in- Over the course of time, the term has come to mean
direct approach, the method requires knowledge of the much more than simply providing a model description
controller as is the case for the indirect approach. suited to robust controller design. Since achievable per-
The joint input-output approach can also be inte- formance of a model-based controller, regardless of de-
grated seamlessly with the subspace method. The sub- sign strategy one employs, depends on the model quality,
space method can be used to identify a joint system and it is beneficial to shape the whole identification process
then an open-loop system model can be conveniently ex- based on the ultimate goal of achieving the best possible
tracted from it, as shown in (Verhagen, 1993). closed-loop performance. For example, the data genera-
In addition, the two-step method and the projection tion step has a direct bearing on the size and distribution
method discussed earlier in the context of PEM should of model error, which in turn influence the closed-loop
be applicable here. Since the steps for applying these performance achievable with a particular controller de-
methods would be essentially the same as before, just sign strategy. Hence, rather than using some univer-
with PEM replaced by a subspace method, they are not sal testing procedure that ignores the characteristics of
discussed here. underlying plant dynamics and control objective, one
Finally, Ljung and McKelvey (1996) proposed an al- should tailor-design the procedure to befit the plant char-
ternative way to construct state data. Instead of picking acteristics.
the state basis from the projected data matrix, which Integration of identification and control design has
poses a consistency problem in the case of closed-loop been presented by many authors going back to Ziegler
data, they suggested to identify a high-order ARX model and Nichols (1942) and Åström and Hägglund (1984)
and use it to calculate the n-step predictions. Note that who aimed at model free tuning of simple controllers.
identifying a high-order ARX model amounts to identi- The desire for development of identification for robust
fying just the first of the n-step predictor in (51) with control was indicated by Andersen et al. (1991)at CPC-
a very large n. The reason for identifying only the first IV. Theoretical interest in identification for control de-
of the n-step predictor is that consistent estimates of sign has grown significantly during the past decade and
the one-step-ahead predictor can be obtained even with the intensive research has yielded some useful insights
closed-loop data, since the regressor is uncorrelated with and methodologies.
One fundamental understanding that emanated from
Recent Advances and Challenges in Process Identification 67

the research is the necessity of iteration. For exam- such low order models cannot represent the true plant
ple, control-oriented design of a data-gathering experi- over the entire relevant frequency range they will have
ment demands that one relates the design parameters to a systematic error, a bias error, in addition to the in-
model error and then ultimately to the closed-loop per- evitable noise-induced variance error. The relative con-
formance. Under well-defined assumptions on the prior tributions of the two types of error may be expressed
model set and data set (or prior probability distribu- in the frequency domain, by defining a model error T̃ ,
tions on the model parameters and measurement errors), where the model T (q) = [P (q) H(q)] contains a pro-
it is possible to relate design parameters (e.g., test sig- cess and a noise transfer function y(t + 1) = P (q)u(t) +
nals) to model error. The more complicated half of the H(q)e(t). Thus a performance objective J¯ may be for-
puzzle is how model error degrades closed-loop perfor- mulated as
mance. This degradation depends on the nominal model
as well as the controller design method employed (i.e., T̃ (eiω ) = T̂ (eiω ) − T0 (eiω ) (56)
Z π n
how the nominal model and possibly model error infor- ¯
o
J(T̃ ) = E T̃ (eiω )C(ω)T̃ T (e−iω ) dω (57)
mation gets translated into a controller). Even though −π
the controller design method may be known a priori, the
nominal model is not. This leads to the necessity of ap- where the Hermitian 2x2 block matrix weighting function
proximating it through iteration (Cooley and Lee, 2001). C(ω) describes the relative importance of a good fit over
Even though the complex relationship between model the frequency range of interest as well as the relative
error and closed-loop performance is the challenging as- importance of the fit of P and H respectively. Note that
pect of the problem from a theoretical viewpoint, the the expectation is due to the randomness of T̃ . Using a
issue of how design parameters influence model quality first order approximation of the model error gives:
is by no means simple and in fact may be a more seri-
T̃N (eiω , Θ) =
ous barrier to practical use of most available approaches.
This is because the question inherently requires prior = T̂N (eiω , Θ) − T0 (eiω )
information on plant and measurement error (as deter- ' T (eiω , Θ∗ ) − T0 (eiω ) + (Θ̂N − Θ∗ )T T 0 (eiω , Θ∗ )
ministic bounds or probability distributions), which are
not readily available or easily expressed in the required = B(eiω , Θ) + (Θ̂N − Θ∗ )T T 0 (eiω , Θ∗ )
mathematical terms. (58)
The problem of control-oriented identification can also
be posed in the context of closed-loop identification, and where
in fact most research in this area has followed this route. dT (iω, Θ)
The main result is that, to minimize the closed-loop error T 0 (eiω , Θ) = (59)

that will ultimately result from a model-based controller, B(e ) = T (e ) − T0 (eiω )
iω iω
(60)
the identification of the model should use closed-loop
data produced with the very controller (Gevers, 2000). and Θ∗ is the parameter estimate as N → ∞.
However, the controller is not known prior to an actual Substituting this into (57) gives the following approx-
experiment and therefore iteration (between controller imate expression Ljung (1999):
design and closed-loop identification) is performed with
the hope that the controller converges through the iter- J¯ ' JB + JP (61)
ation.
The following section presents key issues and results with
regarding identification of models with the aim of achiev- Z π
ing the best possible closed-loop performance. First, we JB = B(eiω )C(ω)B T (e−iω )dω (62)
present some preliminaries related to tradeoffs between −π
Z π
two types of identification errors. Understanding of how 1
JP = tr[P (ω)C(ω)]dω (63)
model error arises from different sources plays an im- N −π
portant role during the development of identification for
control. Thereafter control designs and model approxi- where
mations are explained before development of identifica- P (ω) = T 0T (e−iω )[N Cov(ΘˆN )]T 0 (eiω ) (64)
tion for control is described and a methodology for joint
optimization of modelling and control is presented. The bias contribution JB is mainly affected by the model
set as well as by signal power spectrum. The variance
Preliminaries contribution JP on the other hand decreases with in-
Identification. In identification for control, it is pre- creasing amount of data and signal power, whereas it in-
ferred to identify low order models, which subsequently creases with the number of parameters estimated. Min-
may be used for robust controller design. Given that imizing the objective function in (61) clearly involves
68 Sten Bay Jørgensen and Jay H. Lee

entries depend on the nominal model P̄ and the way in


∆ which the allowable perturbation ∆ affects the nominal
Q11 Q12 model. The set of models P may be characterized as
Q=
Q Q21 Q22 P = {P |P = Fu (Q, ∆)} , with ||∆||∞ < 1 (65)

where the upper LFT Fu (Q, ∆) ≡ Q22 + Q21 ∆(I −


Figure 2: Representation of model perturbation
Q11 ∆)−1 Q12 assuming the inverse exists. With this rep-
by upper Linear Fractional Transformation (LFT) resentation the nominal model is P̄ = Q22 .
Fu (Q, ∆).
Control Design. The performance of a feedback
connection T(P0 , C), where

v  
y
 
r2
r1 y = T(P0 , C) (66)
P u
 
r1
 
P0 C
T(P0 , C) = (I + CP0 )−1 (67)
-1 C I

which is constructed from a plant P0 and a controller C


C
r2 as shown in Figure 3, can be characterized by a norm
value of J(P0 , C), which is a closed-loop-relevant oper-
ator. Minimization of this norm may be done through
controller design directly, provided that measurements
Figure 3: Feedback connection T(P0 , C) around a
from the plant can provide information on the control ob-
plant P0 .
jective function J(P0 , C). If this norm can be assessed di-
rectly on the plant, then the controller may be tuned op-
timally iteratively. This approach may be termed model-
making a tradeoff between bias and variance, i.e. be- free controller tuning. This idea has been used by several
tween the model set and the number of parameters used. researchers. Hjalmarsson. et al. (1994) developed an it-
A proper decision on the tradeoff is key to success of erative procedure where two to three batch experiments
any process identification scheme. The tradeoff is a fac- are performed at each iteration. This procedure has been
tor through the design of an entire process identification further developed to the multivariable case by Hjalmars-
process, including selection of model set and its parame- son and Birkel (1998), and reviewed by Gevers (1998).
terization, experimental design (e.g., test signals’ power The direct iterative tuning is restricted to those control
spectra), etc. The development of identification meth- objective functions that can be assessed directly from ob-
ods for control design has also exploited this tradeoff in servations. However, this methodology originated from
various ways as described in the sequel. studying the interplay between identification and con-
trol. This interplay is further discussed in the following
Representation of Model Approximations. sections, where the key aspect is to relate the purpose of
Knowledge of the nominal plant to be controlled is identification, i.e., achieving control performance, to the
generally incomplete for robust controller design. Model identification procedure.
error can be represented by a set of models. Such a set
can be built up from a nominal model P̄ along with a Identification of Model for Control
model perturbation ∆ (Doyle et al., 1992) as illustrated The nominal model P̂ within a model set plays an impor-
in Figure 2. This Linear Fractional Transformation tant role. Since the nominal model is the sole basis for
(LFT) model framework can represent additive as well many controller designs, its quality is very important. In
as multiplicative uncertainty, each of which requires a this section, we review the work on identifying a nominal
different Q. In addition Q will contain the necessary in- model that leads to a good control performance when a
formation to characterize a set of models, P. In Hansen controller design based on nominal performance is used.
and Fanklin (1988), a fractional model representation is The importance of the nominal model has been recog-
described by a quotient of two stable factors parameter- nized by many researchers, (e.g., Rivera and Gaikwad,
ized via the dual Youla-Kucera parameterization. This 1992; Scharma and Bosgra, 1993; Zang et al., 1995; Lee
approach is able to deal with estimation of a model set et al., 1995).
that includes both stable and unstable plants, even when
operating under feedback controlled conditions. This Problem Formulation. A key idea in identifica-
approach is further explored by introducing a separate tion for control is to tune the bias and variance er-
coefficient matrix Q (Zhou et al., 1996), where the ror for control design. Such a tuning may be achieved
Recent Advances and Challenges in Process Identification 69

by bounding the actual control performance ||J(P0 , C)|| design here takes the form of a weighted trace optimal
through utilization of the nominal control performance (L-optimal) design, where the weighting matrices depend
||J(P̂ , C)|| and the performance degradation due to ap- on desired loop shapes as well as the estimate from the
proximate modelling ||J(P0 , C) − J(P̂ , C)||. The bound- last iteration. The optimal design was originally formu-
ing is achieved through exploitation of the triangular in- lated as a nonconvex optimization for the sampled data
equality: values of test inputs directly, but was later reformulated
by Samyudia and Lee (2000) as a Linear Matrix Inequal-
||J(P̂ , C)|| − ||J(P0 , C) − J(P̂ , C)|| ≤ ||J(P0 , C)|| ity problem cast in terms of the covariance of the inputs.
The procedure was shown to perform much superior to
≤ ||J(P̂ , C)|| + ||J(P0 , C) − J(P̂ , C)|| (68)
the conventional PRBS tests on several ill-conditioned
Thus a tight upper bound may be achieved by minimiz- multivariable processes, which served as the main moti-
ing the performance degradation ||J(P0 , C) − J(P̂ , C)||. vation for their development.
For a given controller C, this minimization constitutes a Although most of the above results are useful in point-
“control relevant identification problem” (Gevers, 1993; ing out the desire for iterative experiments, they suffer
Van den Hof and Scharma, 1995), where a nominal model from the requirement of exact modelling of the plant P0 .
P̂ is found by minimizing the difference between the This requirement inevitably leads to the requirement of
performance of the feedback connections T(P0 , C) and estimating a high order nominal model (e.g., FIR model
T(P̂ , C). From the triangular inequality it is clear that used in Cooley and Lee (2001)).
both the nominal model P̂ and the controller C may be Approximate Modeling. Analysis of the more re-
used to minimize the performance cost ||J(P0 , C)||. alistic situation, where P̂ is considered to be an approx-
Alternating between minimizing the performance imation of P0 was presented by Wahlberg and Ljung
degradation ||J(P0 , C) − J(P̂ , C)|| as an identification (1986). These authors showed that a norm based ex-
problem and minimizing the nominal performance pression can be used to characterize the bias of a model
||J(P̂ , C)|| as a control design problem provides an iter- P̂ , and that this bias could be tuned provided a suit-
ative scheme for subsequent identification and control able model structure was used. Liu and Skelton (1990)
design. By such a scheme it is hoped that ||J(P0 , C)|| proposed closed loop experiments to provide the proper
decreases. The development of these schemes is briefly weighting filters in an explicitly tunable bias expression
reviewed below as they have led to several interesting from Wahlberg and Ljung (1986). However, the con-
identification schemes. troller to be used for closed loop experiments is still un-
Exact Modeling. Some of the first attempts to- known. The possibility of using closed loop experiments
wards investigating the interaction between identifica- was proposed by Liu and Skelton (1990) and developed
tion and control in case of exact modelling, i.e. without by Zang et al. (1995), Hakvoort et al. (1994) and Tay
bias error, were presented by Åström and Wittenmark et al. (1997).
(1971) and Gevers and Ljung (1986). The latter au- Gevers (2000) in his recent review discusses an ap-
thors mentioned that, in the case of exact modelling, to proach based on the idea that minimization of the per-
minimize norm-based performance degradation, a pre- formance degradation caused by model error can be for-
diction error based estimation method can be devised mulated as a prediction error minimization (weighted by
that uses closed-loop experiments and appropriate data the sensitivity function) with closed-loop data. How-
filters. However the data filters contain knowledge about ever, for the equivalence to hold, the closed-loop data
the controller to be designed, and therefore an itera- must be generated by the very controller, which is to be
tive procedure of identification and control design can determined later and therefore unknown before the ex-
be used to gain knowledge of the data filters. This was periment. This naturally brings up an iterative scheme
confirmed by Hjalmarsson et al. (1996) and Forssell and where the controller design with nominal model and the
Ljung (2000b). The latter authors investigated partic- prediction error minimization with closed-loop data are
ular experimental design issues involved in minimizing alternated. Even though this approach has been applied
the performance degradation due to variance errors in with promising results on several examples, the conver-
the identified model with constraints on a linear combi- gence has been shown to fail.
nation of input and output variance. For the case where The idea of providing proper weighting filters in a tun-
penalty is only upon the misfit in P , they show that the able bias expression during identification was also used
optimal controller is given by the solution to a standard by Rivera et al. (1993) and Rivera and Gaikwad (1992),
LQ problem. Also the optimal reference signal is deter- where it is assumed that prefiltering of open-loop data
mined. from the plant can replace the benefits from closed-loop
The idea of minimizing the parameter covariance ma- experiments. The variance error was not considered.
trix is also pursued by Cooley and Lee (2001) in an Although the model bias during the approximate iden-
open-loop identification’s context. The control oriented tification is tuned towards the intended model applica-
70 Sten Bay Jørgensen and Jay H. Lee

tion in the above works, the model plant mismatch is


Closed loop experiment
not taken into account during the control design. In
order to be able to account for both bias and variance
errors in control design and also to judge the quality of Estimate Pi
J (P, Ci ) ∞ ≤ γ i ∀P ∈ Pi
the model against the performance requirement, estima-
tion of a model set rather than just a nominal model is
needed. Determine γ i

Identification and Robust Control Design


Estimating Model Sets. A set of models can be yes: i = i+1
used to represent the incomplete knowledge of the plant
no
J (P, C i +1 ) ∞ ≤ γ i +1∀P ∈ Pi +1
P0 . The incompleteness is due to the limited availabil-
ity of possible disturbed observations of the plant be-
Design C i +1
haviour. Such a set of models can be considered to con- Estimate Pi +1
sist of all models that are validated by the data (Ljung,
1999). However, Smith et al. (1997) pointed out that it
is never possible to validate models solely on the basis
no
J (P, Ci +1 ) ∞ ≤ γ i +1 < γ i∀P ∈ Pi
of finite number of experiments. Thus a set of mod-
Closed loop experiment
els will consist of models that cannot be invalidated by yes
the data from the plant P0 . The available data along
with the prior assumptions give rise to a set of feasi-
ble models F (Hakvoort et al., 1994). If the prior as- no
Satisfactory performance

sumptions are correct, then P0 ∈ F. However, the set F yes

can be unstructured, and therefore estimation of a struc-


tured set P should be done such that P outer-bounds F.
Performance and robustness are conflicting requirements Figure 4: Iterative identification cycle for robust con-
(Doyle et al., 1992), and in case of conservative control trol.
design, this conflict causes the performance of a designed
controller to deteriorate. Thus P should be estimated in
such a way that the performance degradation of a con- and construction of the Q coefficient matrix in Figure 2,
troller designed based upon P is as small as possible. is crucial for the design of a well-performing robust con-
Hence, to enable incorporation of robustness into the troller (de Callafon, 1998). Furthermore the set must
design of a model based controller, a set of models must be suitable for robust control design, in order to enable
be estimated. Mainly two different approaches for esti- performance improvement of the controlled plant. Thus
mating error bounds have appeared, which differ by the the structure and the estimation of the set P should take
nature of underlying assumptions on the error bounds. the performance cost ||J(P0 , C)|| into account.
When the prior assumptions are stochastic, so called
‘soft’ error bounds result. Examples here of are Goodwin Iterative Identification and Robust Control De-
et al. (1992), Bayard (1992), Rivera et al. (1993), Nin- sign Procedure. The ultimate quest for joint opti-
ness and Goodwin (1995), and Cooley and Lee (1998). mization of modelling and control is a very compli-
When deterministic assumptions on the data or the plant cated task, which still needs considerable research ef-
are used, non-probabilistic error bounds result. Conse- fort. A suboptimal procedure, depicted in Figure 4,
quently, these are called ‘hard’ bounds. Examples here of has been proposed by de Callafon and Van den Hof
are Wahlberg and Ljung (1992) and Böling and Mäkilä. (1997) and de Callafon (1998). One step in the pro-
(1995). Combinations of both types of model error cedure is that given a controller Ci at the ith iteration,
bounding may give the combined advantages (de Vries which together with the plant P0 forms a stable feed-
and Van den Hof, 1995; Hakvoort and Van den Hof, back connection that satisfies the performance specifica-
1994). Some of the error bounding approaches take the tion ||J(P0 , Ci )||∞ ≤ γi , then design a controller Ci+1
intended control application into account by estimating that satisfies:
control relevant, possibly low order, nominal models with ||J(P0 , Ci+1 )||∞ ≤ γi+1 ≤ γi (69)
an additive or multiplicative bound on the modelling er-
ror, e.g. Bayard (1992). Another key step of the model-based procedure where
Although the approaches referenced in the two pre- the knowledge of the plant P0 is represented by a set
vious paragraphs treat bias and variance aspects sepa- of models Pi is that evaluation of ||J(P0 , Ci )||∞ can be
rately, the estimation of a set of models P should include achieved by evaluating ||J(P, Ci )||∞ for all P ∈ Pi . This
both bias and variance aspects. Thus the identification enables specification of a control objective function that
Recent Advances and Challenges in Process Identification 71

Symbols steps involve, (1)estimation of a nominal factorization


γi − such that the closed loop criterion is being minimized,
J (P0 , Ci ) o subjected to internal stability of T(P̂ , C), and (2)esti-
mation of model uncertainty, which consists of charac-

− terization of a frequency dependent upper bound on the
o − − − − − uncertainty such that the closed-loop criterion is mini-
o o
o o o o mized using (V, W ) subject to P0 ∈ P (Hakvoort and
Van den Hof, 1997).
i i +1 i +1 i+2 i+2 i+3 i+3
Discussion. A number of important identification-
related issues emerge from the above procedure:
Figure 5: Progress during iterative identification cy- • Identification can be performed by open-loop exper-
cle for robust control. iments but more naturally by closed-loop experi-
ments: In order to account for the link between
identification and control, identification should be
cannot be measured directly from data. Thereby three performed with data that are representative of the
procedural steps can be outlined: eventual closed-loop condition. Although such data
1. Initial identification: Use experimental data from can be generated by carefully designing open-loop
T(P0 , Ci ) and prior information on data or plant, to test signal, they are more naturally obtained by clos-
estimate a set of models Pi such that γi is minimized ing a loop with a sequence of controllers that gets
while P0 ∈ Pi , where progressively closer to the eventual controller.
• Need for an iterative scheme: Accounting for the
||J(P, Ci )||∞ ≤ γi ∀P ∈ Pi (70) intended application requires an iterative scheme of
2. Control Design: Design Ci+1 such that identification and feedback controller design. As the
iteration continues, the feedback controller’s robust
||J(P, Ci+1 )||∞ ≤ γi+1 ≤ γi ∀P ∈ Pi (71) performance continues to improve and this in turn
generates information on the dynamics of plant that
3. Re-Identification: Use new experimental data is more and more relevant for the ultimately in-
from T(P0 , Ci+1 ) and prior information to estimate tended robust control.
a set of models Pi+1 such that P0 ∈ Pi+1 , subject
• Model error information: While performing itera-
to the condition
tions, information is developed on the complexity
||J(P, Ci+1 )||∞ ≤ γi+1 ∀P ∈ Pi+1 (72) of nominal model, shape of allowable model un-
certainty, and attainable robust performance. The
Note that in the first step, a performance assessment characterization of modelling errors enables us to
is carried out to evaluate ||J(P0 , Ci )||∞ for initialization avoid performance degradation during the itera-
purposes. Subsequently, the second step contributes a tions. This also opens the possibility to formulate
controller and the third step is a modelling validation invalidation criteria for refusing models and con-
step to enforce Equation 69. Subsequent to the initial- trollers during iterations.
ization in step 1, repeated execution of steps 2 and 3 will
• Unstable plants: Both stable and unstable plants
provide a design procedure where the upper bound γi on
can be handled using the algebraic framework of sta-
a predetermined performance cost ||J(P0 , Ci )||∞ can be
ble factorizations, where a possible unstable dynam-
progressively reduced as illustrated in Figure 5.
ics is split into two stable factors. This split opens
Implementation of this model based iterative proce-
the possibility of an open-loop equivalent identifica-
dure requires a procedure for estimating a set of models
tion of the factors. The algebraic framework allows
P tuned towards robust control design application. Such
possible model errors to be described in a dual-Youla
estimation of a modelling set may be carried out in a
parameterization. Thus the effect of model pertur-
two step procedure described in de Callafon (1998). The
bations can be studied under feedback controlled
set of models depend upon the nominal coprime factor-
conditions. By considering a nominal stable fac-
ization (N̂ , D̂) and the weighting functions (V̂ , Ŵ ) that
torization perturbed by an unknown but bounded
bound the model uncertainty. For the estimation, the
stable operator, the set of models describe all mod-
closed loop performance based optimization
els that are stabilized by a given feedback controller.
min sup ||J(P, C)||∞ (73) The above iterative procedure builds upon a large body
N,D,V,W P ∈P
of literature. The proposed procedure may be too com-
is considered to ensure a model uncertainty set which is plex for most applications, where the desire simply is
suitable for robust control design. The two estimation to retune the feedback controller to improve the plant
72 Sten Bay Jørgensen and Jay H. Lee

performance. Simplifications of the procedure should be References


considered. Such simplifications could in fact become Amirthalingam, R. and J. H. Lee, “Inferential Control of Contin-
the standard practice of tomorrow for improving closed uous Pulp Digester,” J. Proc. Cont., 9, 397–406 (1998).
loop performance when requested by the operator on a Amirthalingam, R., S. Sung, and J. H. Lee, “A Two Step Proce-
routine basis as the need arises. dure for Data-Based Modeling for Inferential Predictive Control
System Design,” AIChE J., 46, 1974–1988 (2000).
Andersen, H. W., M. Kümmel, and S. B. Jørgensen, “Dynamics
Summary and Identification of a Binary Distillation Column,” Chem. Eng.
Sci., 44, 2571–2581 (1989).
Closed-loop identification has progressed significantly Andersen, H. W., K. Rasmussen, and S. B. Jørgensen, Advances in
over the past decade. Sound albeit limited theories have Process Identification, In Arkun, Y. and W. Ray, editors, Proc.
emerged to point out the limitations and tradeoffs for 4’th Int’l Conf. On Chemical Process Control—CPC IV, South
various closed-loop identification approaches. Final anal- Padre Isl and, TX (1991).
ysis reveals that there is no single methodology to be Asprey, S. P. and S. Macchietto, “Statistical Tools for Optimal
Dynamic Model Building,” Comput. Chem. Eng., 24, 1261–
preferred on a universal basis and understanding of the 1267 (2000).
advantages and disadvantages for the various available Åström, K. J. and P. Eykhoff, “System Identification—A Survey,”
options is indispensable for making a right choice for a Automatica, 7, 123–162 (1971).
given situation. Potential benefits of closed-loop iden- Åström, K. J. and T. Hägglund, “Automatic Tuning of Simple Reg-
tification are huge and multi-faceted, ranging from the ulators with Specifications on Phase and Amplitude Margins,”
Automatica, 34, 90–113 (1984).
increased safety of the process and reduced harmful dis-
Åström, K. J. and B. Wittenmark, “Problems of Identification and
ruption to ongoing operation to the engineer’s ability to Control,” J. Math. Anal. Appl., 34, 90–113 (1971).
collect more informative data in shorter time. Bayard, D. S., Statistical Plant Set Estimation Using Schoeder-
The area of identification for control holds significant Phased Multisinosoidal Input Design, In Proc. American Con-
promise to enable efficient, “plant-friendly” identifica- trol Conference, pages 2988–2995. Chicago,USA (1992).
tion of multi-variable plants, including ill-conditioned Böling, J. M. and P. M. Mäkilä., On Control Relevant Criteria
in H∞ Identification, In Proc. American Control Conference,
plants, for which the traditional open-loop SISO iden- pages 3076–3080, Seatle, USA. Proc. American Control Confer-
tification is known to fail. Shaping of an identification ence (1995).
process according to a specific control objective is natu- Cooley, B. L. and J. H. Lee, “Integrated Identification and Robust
rally iterative in that various choices involved, e.g., shape Control,” J. Proc. Cont., 8, 431–440 (1998).
of test signals, complexity of nominal model, shape of al- Cooley, B. L. and J. H. Lee, “Control-Relevant Experiment De-
sign for Multivariable Systems Described by Expansions in Or-
lowable model uncertainty and attainable performance, thonormal Bases,” Automatica, 37, 273–281 (2001).
are highly dependent on the underlying plant. The shap- de Callafon, R. A. and P. M. J. Van den Hof, Multivariable Closed-
ing can be done in an open-loop experimental context Loop Identification: From Indirect Identification to Dual-Youla
where the data collected from previous experiments are Parameterization., In Proc. Of the 32nd Conference on Decision
and Control, pages 3384–3389 (1996).
used to improve the design of a new experiment. How-
de Callafon, R. A. and P. M. J. Van den Hof, “Suboptimal Fee-
ever, the shaping is more naturally done in a closed-loop back Control by a Scheme of Iterative Identification and Cotrol
experimental context, where data are made more and Design,” Mathematical Modelling of Systems, 3, 77–101 (1997).
more representative of the eventual closed loop applica- de Callafon, R. A., Feedback Oriented Identification for Enhanced
tion by iteratively improving the controller. Despite the and Robust Control, PhD thesis, Technical University of Delft
(1998).
promise, more methodological developments are needed
de Vries, D. K. and P. M. J. Van den Hof, “Quantification
to address the practical issues, e.g., the requirement for of Uncertainty in Transfer Function Estimation: A Mixed
ensuring the integrity of on-going operation, before rou- Probabilistic—Worst-Case Approach,” Automatica, 31, 543–
tine application can be foreseen. Potential benefits of 558 (1995).
such methodological developments are significant in that Doyle, J. C., B. A. Francis, and A. R. Tannenbaum, Feedback
process identification may be substantially facilitated Control Theory. MacMillan, New York (1992).
Forssell, U. and L. Ljung, “Closed-Loop Identification Revisited,”
when plants can be operated in their standard closed- Automatica, 35, 1215–1241 (1999a).
loop configuration. Forssell, U. and L. Ljung, “A Projection Method for Closed-Loop
Identification,” IEEE Trans. Auto. Cont. (1999b). In press.
Acknowledgments Forssell, U. and L. Ljung, “A Projection Method for Closed-
Loop Identification,” IEEE Trans. Auto. Cont., 45, 2101–2106
JHL gratefully acknowledges the financial support from (2000a).
the National Science Foundation (CTS #0096326), As- Forssell, U. and L. Ljung, “Some Results on Optimal Experiment
pen Technology, Weyerhaeuser, and Owens Corning. Design,” Automatica, 36, 749–756 (2000b).
Gevers, M. and L. Ljung, “Optimal Experiment Design with Re-
SBJ gratefully acknowledges support from the Euro-
spect to the Intended Model,” Automatica, 22, 543–554 (1986).
pean Union (Batch-Pro), The Department of Energy, Gevers, M., Towards a Joint Design of Identification and Con-
Denmark, and The CAPEC industrial consortium (see trol?, In Trentelman, H. and J. C. Willems, editors, Essays on
www.capec.kt.dtu.dk). Control: Perspectives in the Theory and its Applications, pages
111–151. Birkhäuser, Boston (1993).
Recent Advances and Challenges in Process Identification 73

Gevers, M., “Iterative Feedback Tuning: Theory and Applica- Ljung, L., System Identification—Theory for the User. 2nd ed.
tions,” IEEE control Sections, pages 26–41 (1998). Prentice Hall, New Jersey, second edition edition (1999).
Gevers, M., A decade of progress in iterative process control design: Marquardt, W., Fundamental Modeling and Model Reduction for
from theory to practice, In ADCHEM 2000, pages 7–12, Pisa, Optimization Based Control of Transient Processes, In Chemical
Italy. IFAC (2000). Process Control VI. CACHE and AIChE (2001).
Goodwin, G. C., M. Gevers, and B. Ninness, “Quantifying the Er- Ninness, B. and G. C. Goodwin, “Estimation of Model Quality,”
ror in Estimated Transfer Functions with Application to Model Automatica, 31, 1771–1795 (1995).
Order Selection,” IEEE Trans. Auto. Cont., AC-37, 913–928 Pearson, R. K., “Input sequences for nonlinear modeling,” Non-
(1992). linear Model Based Process Control. Proceedings of the NATO
Gustavsson, I., L. Ljung, and T. Söderström, “Identification of Advanced Study Institute, pages 599–621 (1998).
Processes in Closed Loop—Identifiability and Accuracy As- Qin, S. J. and T. A. Badgewell, An Overview of Industrial Model
pects,” Automatica, 13, 59–77 (1987). Predictive Control Technology, In Kantor, J. C. and C. E. Gar-
cia, editors, Fifth International Conference on Chemical Pro-
Hakvoort, R. G. and P. M. J. Van den Hof, An Instrumental Vari-
cess Control, pages 232–256. Fifth International Conference on
able Procedure for the Identification of Probabilistic Frequency
Chemical Process Control, AIChE and CACHE (1997).
Response Uncertainty Regions, In Proc. 33rd IEEE Conference
on Decision and Control, pages 3596–3601, Lake Buena Vista, Qin, S. J. and T. A. Badgewell, An Overview of Nonlinear Model
USA (1994). Predictive Control Applications, In International Symposium
on Nonlinear Model Predictive Control: Assessment and Future
Hakvoort, R. G. and P. M. J. Van den Hof, “Identification of Prob- Directions, Ascona (1998).
abilistic System Uncertainty Regions by Explicit Evaluation of
Rivera, D. E. and S. V. Gaikwad, Modeling for Control Design in
Bias and Variance Errors,” IEEE Trans. Auto. Cont., AC-42,
Combined Feedback/Feedforward Control, In Proc. American
15166–1528 (1997).
Control Conference, pages pp. 1445–1446 (1992).
Hakvoort, R. G., R. J. P. Schrama, and P. M. J. Van den Hof, “Ap- Rivera, D. E., X. Chen, and D. S. Bayard, Experimental Design for
proximate Identification with Closed-Loop Performance Crite- Robust Process Control Using Schroeder-Phased Input Signals,
rion and Application to LQG Feedback Design,” Automatica, In Proc. American Control Conference,, pages 895–899 (1993).
30, 679–690 (1994).
Samyudia, Y. and J. H. Lee, A Two-Step Approach to Control
Hansen, F. R. and G. F. Fanklin, On a Fractional Representation Relevant Design of Test Input Signals for Iterative System Iden-
Approach to Closed-Loop Experiment Design, In Proc. Ameri- tification, In Proceedings of SYSID 2000 (2000).
can Control Conference, Atlanta, USA, pages 1319–1320 (1988). Scharma, R. J. P. and O. H. Bosgra, “Adaptive Performance En-
Hjalmarsson, H. and T. Birkel, Iterative Feedback Tuning of Lin- hancement by Iterative Identification and Control Design,” Int.
ear Time-Invariant MIMO Systems, In 37th IEEE Conf. On Journal of Adaptive Control and Signal Processsing, 7, pp. 475–
Decision and Control. (1998). 487 (1993).
Hjalmarsson., H., S. Gunnarsson, and M. Gevers, A Conver- Shi, R. and J. F. MacGregor, “Modeling of Dynamic Systems Using
gent Iterative Restricted Complexity Control Design Scheme, Latent Variable and Subspace Methods,” J. Chemometrics, 14,
In A Convergent Iterative Restricted Complexity Control Design 423–439 (2000).
Scheme, pages p. 1735–1740, Lake Buena Vista, USA. pp. 1735- Smith, R. G., G. Dullerud, S. Rangan, and K. Poola, “Model Val-
1740. 33rd IEEE Conference on Decision and Control (1994). idation for Dynamical Uncertain Systems,” Mathematical Mod-
Hjalmarsson, H., M. Gevers, and F. Debruyne, “For Model-Based elling of Systems, 3, 43–58 (1997).
Control Design, Closed Loop Identification Gives Better Perfor- Söderström, T. and P. Stoica, System Identification. Prentice Hall
mance,” Automatica, 32, 1659–1673 (1996). (1989).
Johansen, T. A. and B. A. Foss, “Identifcation of Non-Linear Sys- Tay, T.-T., I. Mareels, and J. B. Moore, High Performance Con-
tem Structure and Parameters Using Regime Decopmosition,” trol. Birkhauser (1997).
Automatica, 31, 321–326 (1995). Van den Hof, P. M. J. and R. J. P. Scharma, “Identification
Koung, C. W. and J. F. MacGregor, “Identification for Robust and Control—Closed Loop Issues,” Automatica, 31, 1751–1770
Multivariable Control: The Design of Experiments,” Automat- (1995).
ica, 30, 1541–1554 (1994). Van den Hof, P. M. J. and R. J. P. Schrama., “An Indirect Method
Larimore, W. E., Canonical Variate Analysis in Identification, for Transfer Function Estimation from Closed Loop Data,” Au-
Filtering and Adaptive Control, In Proceedings. 29’th IEEE tomatica, 29, 1523–1527 (1993).
Conference on Decision and Control, pages 596–604, Honolulu, Van Overschee, P. and B. De Moor, “N4SID: Subspace Algorithms
Hawaii (1990). for the Identification of Combined Deterministic-Stochastic Sys-
Lee, W. S., B. D. O. Anderson, I. M. Y. Mareels, and R. L. Kosut, tems,” Automatica, 30, 75–93 (1994).
“On some Key Issues in the Windsurfer Approach to Adaptive Van Overschee, P. and B. De Moor, “A Unifying Theorem for
Robust Control,” Automatica, 31, pp. 1619–1636 (1995). Three Subspace System Identification Algorithms. ,” Automat-
ica, 31, 1877–1883 (1995).
Lee, J. H., Modeling and Identification for Nonlinear Model
Predictive Control: Requirements, Current Status and Fu- Van Overschee, P. and B. De Moor, Subspace Identification of
ture Research Needs, pages 269–293. Birkhauser Verlag, Linear Systems. Theory, Implementation, Applications. Kluwer
Basel/Switzerland (2000). Academic Publishers (1996).
Van Overschee, P. and B. De Moor, Closed-Loop Subspace Iden-
Li, W. and J. H. Lee, “Control Relevant Identification of Ill-
tification Algorithm, In Proc. Of the 36th IEEE CDC, pages
Conditioned Processes,” Comput. Chem. Eng., 20, 1023–1042
1848–1853 (1997).
(1996).
Verhaegen, M. and P. Dewilde, “Subspace Model Identification.
Liu, K. and R. E. Skelton, Closed Loop Identification and Iterative Part I: The Output-Error State Space Model Identification Class
Control Design, In Proc. 29th IEEE Conference on Decision of Algorithms. Part II: Analysis of the Elementary Output-Error
and Control, pages 482–487 (1990). State Space Model Identification Class of Algorithms.,” Int. J.
Ljung, L. and T. McKelvey, “Subspace Identification from Closed- Control, 56, I: 1187–1210; II: 1211–1241 (1992).
Loop Data,” Signal Processing, 52, 209–215 (1996). Verhagen, M., “Application of a Subspace Identifcation Algorithm
Ljung, L., System Identification—Theory for the User. Prentice to Identify LTI Systems Operating in Closed-Loop,” Automat-
Hall (1987). ica, 30, 61–74 (1993).
74 Sten Bay Jørgensen and Jay H. Lee

Viberg, M., “On Subspace-Based Methods for the Identification


of Linear Time-Invariant Systems,” Automatica, 31, 1835–1852
(1995).
Wahlberg, B. and L. Ljung, “Design Variables for Bias Distribution
in Transfer Function Estimation,” IEEE Trans. Auto. Cont.,
AC-37, 900–912 (1986).
Wahlberg, B. and L. Ljung, “Hard Frequency-Domain Error
Bounds from Least Squares Like Identification Techniques,”
IEEE Trans. Auto. Cont., AC-37, 900–912 (1992).
Young, R. E., R. D. Bartusiak, and R. W. Fontaine, Evolution of an
Industrial Nonlinear Model Predictive Controller, In Preprints
Chemical Process Control -, pages 399–410 (2001).
Zang, Z., R. R. Bitmead, and M. Gevers, “Iterative Weighted
Least-Squares Identification and Weighted LQG Control De-
sign,” Automatica, 31, pp. 1577–1594 (1995).
Zhou, K., J. C. Doyle, and K. Glover, Robust and Optimal Control.
Prentice Hall, New Jersey, USA (1996).
Zhu., Y. C., “Black-Box Identification of MIMO Transfer Func-
tions: Asymptotic Properties of Prediction Error Models.,” J.
Adapt. Cont. and Sig. Process., 3, 357–373 (1989).
Ziegler, J. G. and N. B. Nichols, “Optimum Settings for Automatic
Controllers,” Trans. ASME, 42, 759–768 (1942).

You might also like