Professional Documents
Culture Documents
Abstract: - The contribution focuses on artificial neural networks application for prediction of chemical semi-
batch reactor output. There are studied two approaches to prediction. The first methodology uses classical time
series prediction apparatus, when the prediction is based on the one signal only. The second approach uses
two signals and corresponds to model predictive control methodology. Both predictors are based on artificial
multilayered feed-forward neural networks.
neural network that would not require adaptation 3 Multilayered feed-forward neural
during control of the given reactor.
networks
Multilayer feed-forward neural networks (MFFNNs)
are very often called backpropagation networks
2 Chemical Semi-Batch Reactor because of the typical training algorithm. Some
In this paper, a semi-batch reactor model is used to authors prefer name multilayer perceptrons (MLP)
study the prediction abilities of multilayered feed- [25, 26], because MFFNNs have been developed by
forward neural networks and their applicability in generalization from Rosenblatt perceptron with
the model predictive control. The model input data binary transfer function.
comes from a real process - the chromium waste The signals flow only in the one direction (from
recycling process [15]. The task of the predictor is the input to the output) in these neural networks. All
to predict the in-reactor temperature. neurons are structured into layers and, typically, all
Let us consider a single input – single output neurons in the specific layer use same transfer
(SISO) system of the chemical exothermic semi- function. In contrast to Rosenblatt perceptron
batch reactor, which is described by the following multilayer feed-forward neural networks use various
mathematical model: transfer functions, usually continuous (e.g. linear,
hyperbolic tangent, sigmoidal functions, etc.).
d m(t ) By applying of Kolmogorov theorem it has been
= FI (1)
dt proved [27] that two-layered MFFNN (with one
hidden layer) can approximate any function with
E certain accuracy while non-polynomial transfer
d a (t ) FI −
= − A ⋅ e R⋅T (t ) ⋅ a(t ) (2) function in the hidden layer is used. This
dt m(t ) methodology is adopted in the paper.
E
−
R ⋅T (t )
d T (t ) FI ⋅ c I ⋅ TI A ⋅ e ⋅ ∆H r ⋅ a(t )
= + − 4 Simulations and results
dt m(t ) ⋅ c c MATLAB with Neural Network Toolbox and
K ⋅ S ⋅ T (t ) K ⋅ S ⋅ TC (t ) Simulink were used for all simulations. There were
− + (3)
m(t ) ⋅ c m(t ) ⋅ c studied two approaches: signal prediction and
system prediction.
d TC (t ) FC ⋅ TCI K ⋅ S ⋅ T (t ) In the first case the artificial neural network used
= + − five past values of the predicted signal as the input
dt mC mC ⋅ cC
vector and predicted only one step ahead. In other
K ⋅ S ⋅ TC (t ) FC ⋅ TC (t ) words, when it was needed the MFFNN repeatedly
− − (4)
mC ⋅ cC mC used its own predictions as inputs. Thus, in the input
(zero) layer of the artificial network were five
where m is the total weight of the reaction neurons and the output layer consisted one neuron.
components in the reactor, a is the mass
concentration of the reaction component in the
reactor, c = 4500 J.kg.K-1 is the specific heat
capacity of the reactor content, T is the temperature
of the reactor content. FI, TI = 293.15 K and cI =
4400 J.kg.K-1 is the reaction component input mass
flow rate, the temperature and the specific heat Fig. 1 – Scheme of MFFNN with 5 input neurons
capacity. FC = 1 kg.s-1, TCI = 288.15 K, TC, cC = (net1)
4118 J.kg.K-1 and mC = 220 kg is the cooling water
mass flow rate, the input temperature, the output As is depicted in the Fig. 1, the network had five
temperature, the specific heat capacity and the neurons in the hidden layer. This number was
weight of the cooling water in the cooling system of obtained by many experiments as sufficient for this
the reactor, respectively. case. The hidden layer used hyperbolic tangent
Other constants: A = 219.588 s-1, E = sigmoid as the transfer function, while the output
29967.5087 J.mol-1, R = 8.314 J.mol-1.K-1, ∆Hr = layer employed linear transfer function.
1392350 J.kg-1, K = 200 kg.s-3.K-1 and S = 7.36 m2 is Three simulation results from our previous work
the effective heat-transfer area [15]. [15] were chosen as the training and testing data.
predictor was also tested. This predictor used not Fig. 4 – Prediction error of net1 for the training data
only the predicted signal old values but also old (data1)
values of the system input. In this case it was the
chromium sludge dosing speed. Thus the
multilayered feed-forward neural network involved 380
ten neurons in the input layer (see Fig. 2). As well predicted signal
reference signal
as in the previous approach, the hidden layer used 370
0.4
0.1
prediction error [K]
380
predicted signal 0
370 reference signal
-0.1
360
-0.2
350
-0.3
T [K]
340 -0.4
330 -0.5
0 500 1000 1500 2000 2500 3000
time [s]
320
Fig. 6 – Prediction error of net1 for the testing data
310
(data2)
300
0 0.5 1 1.5 2 2.5 3
time [s] 4
x 10
390 0.3
predicted signal
380 reference signal
0.25
370
0.2
360
340 0.1
330
0.05
320
0
310
300 -0.05
0 0.5 1 1.5 2 2.5 3 0 0.5 1 1.5 2 2.5 3
time [s] 4 time [s] 4
x 10 x 10
Fig. 7 – Net1 test on the testing data (data3) Fig. 10 – Prediction error of net2 for the training
data (data1)
0.5
380
predicted signal
0
reference signal
370
-0.5
prediction error [K]
360
-1
T [K]
350
-1.5
340
-2
330
-2.5
0 0.5 1 1.5 2 2.5 3
time [s] 4 320
x 10 0 500 1000 1500 2000 2500 3000
Fig. 8 – Prediction error of net1 for the testing data time [s]
380 2
predicted signal
370 reference signal
1.5
360
1
prediction error [K]
350
T [K]
340 0.5
330
0
320
-0.5
310
300 -1
0 0.5 1 1.5 2 2.5 3 0 500 1000 1500 2000 2500 3000
time [s] 4 time [s]
x 10
Fig. 9 – Net2 test on the training data (data1) Fig. 12 – Prediction error of net2 for the testing data
(data2)
390
predicted signal
380 reference signal Table 1 – Prediction quality criterions
370
net1 net2
360
C1 C2 C1 C2
350
data1 0.0024 4.529e-05 9.828e-05 2.218e-06
T [K]
340
data2 0.0133 0.0013 0.0115 0.0107
330
data3 0.0565 0.0806 0.0155 0.0080
320
310
As can be seen from results, the prediction is in both
300
0 0.5 1 1.5 2 2.5 3 cases very good. However, if the best method
time [s] 4
0.5
5 Conclusion
prediction error [K]
0
Motivation of this paper was to develop effective
-0.5
predictor for the chemical semi-batch reactor that is
-1 used in chromium recycling process. The accuracy
-1.5
and settings of the predictor should fulfill the
requirements for the applicability in the model
-2
predictive control of the reactor.
-2.5
0 0.5 1 1.5 2 2.5 3 The proposed predictor (net2) proved the
time [s] 4
x 10 abilities in the all selected tests and validation. Thus
Fig. 14 – Prediction error of net2 for the testing data
it can be concluded that it can be supposed for the
(data3)
model predictive controllers of the given chemical
reactor.