Professional Documents
Culture Documents
Abstract - This paper investigates the influence of popular mathematical model that can represent a process
statistical information criteria to one-step-ahead dynamics.
prediction (1-SAP) error of an ARX black-box model. The Understanding of a chemical process behavior is
criteria investigated are the Akaike Information Criteria important especially when optimum condition of the
(AIC), Akaike Final Prediction Error (FPE) and
Rissanen’s Minimum Description Length (MDL). The
process is required. This understanding is often related
investigation will be based on Pseudo-Random Binary to the mathematical model that represents the process
Sequences (PRBS) data collected from an electrically dynamic itself. Until today, numerous techniques have
heated steam distillation essential oil extraction system. been devised to develop the mathematical model of any
The data is the steam temperature measured within the given processes. They are basically can be divided into
distillation column beneath the material bed. By using two main categories; the first principle model and the
MATLAB System Identification Toolbox, an ARX model empirical model. The main difference between these
will be estimated and validated. Prior to model validation, models is the approach of modeling.
all the information criteria will be examined and the
criteria that suggested the most flexible model shall be
selected for future works. The linear regression will be
In most cases, the processes are too complicated to be
minimized by using Levenberg-Marquardt algorithm. represented by mathematical model. A huge amount of
Evaluation of model performance will be based on both time is required to develop the model and this is not
graphical and statistical approaches such as R2, adjusted- desirable especially to the practitioners. Here where the
R2, residual distribution, mean and variance. The results empirical model comes in handy. The development of
have shown that the selected model based on MDL the model is mainly based on the data collected from the
criterion is more parsimonious and flexible as compared to process. Without knowing the underlying process, the
the others. model can be developed to a certain level of accuracy
depending on the requirement of the user. Thus, this
I. INTRODUCTION modeling approach is also known as the black-box
approach.
System identification is a specialized area under control
system engineering which focuses on developing the System identification is procedural and the proper steps
mathematical model of a system. The mathematical should be followed in order to obtain the satisfactory
model is capable of relating the system output for any model. Several established text references have
given input in such a way that it can even predict the proposed the steps which can be described by Fig. 1 [3-
future output of the system. The model is very 5].
important in control system engineering nowadays since
the recent advancements is on developing intelligent Fig. 1 shows the flow of system identification which
control system. This intelligent system is capable of consists of four main steps started with the experiment
controlling and optimizing the system which is very and followed by model structure selection, model
difficult to be controlled by any traditional control estimation and model validation. If the validation result
techniques especially the chemical processes e.g. is accepted, then the identification finished. Otherwise,
distillation process. the loop will be repeated.
It has been a trend in the modern process control In the second step which is model structure selection,
technique to integrate the process model with the there are two scopes to be focused, the type of the
control system in order to anticipate the process output model and the size of the model. In black-box model
without the process actually happened [1]. The merging family, there are several assumed model types which
of process model and the control system was realized by can be categorized into linear and nonlinear. The linear
the adaptive control system, which integrate the system structure is the most popular since it is simple and can
identification technique into the control algorithm [2]. be estimated quickly. The focus of this paper is on the
The system identification is a technique of developing a
linear Autoregressive with Exogenous Input (ARX) estimated model i.e. the graphical and the statistical
model type. approaches [4]. The graphical approach is the simplest
and usually need commonsense in interpreting the
goodness of the model. The statistical approaches are
necessary in order to select the satisfactory model and
produce results that are statistically acceptable.
Therefore, a penalty for model complexity was Measured output signal (°C)
95
introduced. A criterion for determination of model
structure and parameter values can be written as 90
W N0 (θ , M , Z N ) = V N (θ , Z N ) + U N ( M ) (10) 85
80
UN(M) is a function that measures the complexity of the PRBS input signal (On/Off)
1
model structure, which has been related to the 0.8
dimensionality of θ, 0.6
0.4
dim θ 0.2
U N (M ) = (11) 0
N 0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000
Time (sec.)
To explain the AIC, FPE and MDL, let the loss function, Fig. 3: The measured output signal and the PRBS input
VN defined as the normalized sum of squared error or signal.
NSSE. The NSSE can be computed from (8) and
-11
denoted by NSSE
-11.2 AIC
N FPE
∑ε
-11.4
1
V NSSE (θ , Z N ) = (t , θ )
2 MDL
(12)
2N t =1
-11.6
loss function (log)
-11.8
-12
Based on (12), the AIC, FPE and MDL are obtained as
follows: -12.2
-12.4
⎛ d ⎞
V AIC = ⎜1 + 2 ⎟V N (θ , Z N ) (13)
-12.6
⎝ N ⎠ -12.8
⎛ d ⎞
V MDL = ⎜1 + log( N ) ⎟V N (θ , Z N )
-13
(14) 0 5 10 15 20 25 30 35 40
⎝ N⎠ no. of parameters
90
ARX-MDL 6.5415 6.4552
80 Legend: Shaded value indicates the highest result (ARX-NSSE is not
0 500 1000 1500
time (samples)
2000 2500
included)
100
1-SAP (°C)
0 V. CONCLUSION
-0.2
0 500 1000 1500 2000 2500
time (samples) In this paper, the influence of statistical information
criterion to the 1-SAP has been investigated. Four
Fig. 5: Measured, 1-SAP and residual of ARX-MDL criterions i.e. NSSE, AIC, FPE and MDL are tested.
model However, the NSSE is a non-complexity-penalizing
criterion but evaluated for benchmarking purposes. The
Fig. 5 shows the measured output, ARX-MDL model’s NSSE has selected model with 40 parameters, which is
1-SAP and their residual. From residual plot, 1-SAP is significantly high as compared to 20 parameters
in good agreement with the measured output. R2 test and selected by MDL. The AIC and FPE are also selected
adjusted-R2 test show 98.9875% and 98.9798% high model order, which is very close to NSSE. Based
respectively. The overall results for all models are on 1-SAP results on steam distillation data, MDL
tabulated in the following table. selected model (ARX-MDL) is parsimonious and
comparable to the other selected models, ARX-NSSE,
Table 2: The overall 1-SAP result of ARX models for ARX-AIC and ARX-FPE. Although statistical results
validation data such as R2, adjusted-R2 and residual variance did not
Models % R2 % adj-R2 shown MDL as the best criterion but it is comparable
ARX-NSSE 99.0003 98.9845 and even produced the lowest residual mean.
ARX-AIC 98.9990 98.9840 Furthermore, from graphical point of view, 1-SAP of
ARX-MDL 98.9875 98.9798 the ARX-MDL is in good agreement with the measured
Legend: Shaded value indicates the highest result (ARX-NSSE is not signal and no outliers detected from the residual plot.
included) Therefore it can be concluded that the MDL criterion
has selected a sufficient model order and the other
Referring to Table 2, ARX-AIC produced the highest R2 criterions, which selected higher orders will not
and adjusted-R2 results, which is close to the non- improve the 1-SAP to any significant results.
penalized selected model, ARX-NSSE. The following
ACKNOWLEDGMENT [7] L. Ljung, System Identification: Theory for the
User. New Jersey: Prentice Hall, 1987.
This work was conducted on the data gathered at the [8] L. Ljung, "Estimation focus in system
Faculty of Electrical Engineering, UiTM Shah Alam identification: prefiltering, noise models, and
with the support of JPbSM UiTM and IRDC UiTM. The prediction," in Proc. IEEE Conference on
authors would like to thank all staffs involved. Decision and Control, Pheonix, Arizona, 1999,
pp. 2810-2815.
REFERENCES [9] G. Etcheverry, W. Suleiman and A. Monin,
"Quadratic System Identification By
[1] D. E. Seborg, T. F. Edgar and D. A. Hereditary Approach," in Proc. 2006 IEEE
Mellichamp, Process Dynamics and Control, International Conference on Acoustics, Speech
2nd ed. New Jersey: John Wiley & Sons, 2004. and Signal Processing (ICASSP), 2006, pp. III-
[2] K. J. Astrom and B. Wittenmark, Adaptive 129 - III-132.
Control, 2nd ed. Reading, Massachusetts: [10] L. Ljung, System Identification Toolbox User's
Addison-Wesley Publishing Company, Inc., Guide, 7th ed. Natick, USA: The MathWorks,
1995. Inc., 2007.
[3] L. Ljung, System Identification: Theory for the [11] D. C. Montgomery, G. C. Runger and N. F.
User, 2nd ed. New Jersey: Prentice Hall PTR, Hubele, Engineering Statistics, 3rd ed: John
1999. Wiley & Sons, Inc., 2004.
[4] T. Soderstrom and P. Stoica, System [12] D. C. Montgomery and G. C. Runger, Applied
identification. Hertfordshire: Prentice Hall Statistics and Probability for Engineers, 3rd ed.
International (UK) Ltd., 1989. New York: John Wiley & Sons, Inc., 2003.
[5] M. Norgaard, O. Ravn, N. K. Poulsen and L. K. [13] P. I. Good, Introduction to statistics through
Hansen, Neural Networks for modelling and resampling methods and R/S-Plus. New Jersey:
control of dynamic systems. London: Springer- John Wiley & Sons, Inc., 2005.
Verlag London Ltd, 2000.
[6] J. Eborn, "Modelling and Simulation of
Thermal Power Plants," Lund, Sweden: Lund
Institute of Technology, 1998.