Professional Documents
Culture Documents
Argatov 2019
Argatov 2019
a r t i c l e i n f o a b s t r a c t
Article history: Application of artificial neural networks (ANNs) for modeling of vapor-liquid equilibrium in multicom-
Received 21 May 2019 ponent mixtures is considered. Two novel ANN-based models are introduced, which can be seen as
Received in revised form generalizations of the Wilson model and the NRTL model. A unique feature of the proposed approach is
8 August 2019
that an ANN approximation for the molar excess Gibbs energy generates approximations for the activity
Accepted 15 August 2019
Available online 20 August 2019
coefficients. A special case of the ternary acetic aciden-propyl alcoholewater system (at 313.15 K) is used
to illustrate the efficiency of the different models, including Wilson's model, Focke's model, and the
introduced generalized degree-1 homogeneous neural network model. Also, the latter one-level NN
Keywords:
Vaporeliquid equilibrium
model is compared to the Wilson model on 10 binary systems. The efficiency of the two-level NN model
Ternary system is assessed by a comparison with the NRTL model.
Excess gibbs energy © 2019 Elsevier B.V. All rights reserved.
Activity coefficients
Artificial neural network
1. Introduction In Refs. [11,12], ANNs were utilized to estimate the activity co-
efficients of ternary liquideliquid equilibrium data. In the majority
The problem of vapor-liquid equilibrium (VLE) in multicompo- of the mentioned studies, a multilayer perceptron (MLP) was
nent mixtures represents a practical interest and was considered in employed with different number of nodes in one hidden layer.
a number of studies [1]. For instance, the prediction of VLE in However, two-hidden-layer feedforward networks have been also
mixtures involved in vaporeliquid separation are of special used [10,13]. It is known [14] that the approximation capability of
importance for modeling distillation in alcoholic beverage pro- MPL networks severely depends on the number of hidden neurons,
duction [2]. Nh , which, in turn, determines the number of learnable parameters
In the last decade, artificial neural networks (ANNs) have been (weights and biases). On the other hand, with increasing Nh , more
intensively applied for modeling different physical and chemical experimental data will be needed for training the MPL network.
phenomena [3,4], including VLE (see, e.g., Ref. [5]). There are In the present paper, we pursue the approach proposed by Focke
number of successful examples of applications of feed-forward [15] to limit the number of neurons Nh to the number m of com-
ANNs, and, in particular, for direct prediction of the Fick diffu- ponents in the mixture. This crucial constraint inevitably signifi-
sivity in binary liquid systems [6] and vapor-liquid phase equili- cantly lowers the approximation capability of ANN models,
briumeflashecalculations [7]. Recently, an ANN model was requiring, however, much less data to determine the model pa-
proposed [8] for the prediction of the activity coefficient at infinite rameters. At the same time, the flexibility of ANN models is sup-
dilution of binary systems with the properties of the individual ported by a certain arbitrariness in the choice of the neuron's
components used as inputs to the neural network. Besides the activation function. Another idea that enabled Focke to advance the
mixture composition, temperature and pressure can be employed ANN modelling of mixtures was the use of inputs as substitutes for
as the ANN inputs [9,10]. weights in the output neuron. In what follows, this suggestion has
been utilized as well.
Another important feature of our approach is the incorporation
€ University, SE-
* Corresponding author. Faculty of Technology and Society, Malmo of prior knowledge about VLE into the ANN architecture (an idea
205 06, Malmo€, Sweden. mentioned previously by Petersen et al. [16]). In particular, the ANN
E-mail address: ivan.argatov@gmail.com (I. Argatov).
https://doi.org/10.1016/j.fluid.2019.112282
0378-3812/© 2019 Elsevier B.V. All rights reserved.
2 I. Argatov, V. Kocherbitov / Fluid Phase Equilibria 502 (2019) 112282
2.2. Focke's NN model Therefore, any variation of xi leads to variation of the base of the
i-th factor on the right-hand side of Eq. (8), even though it does not
Recently, Focke [15] developed a special neural network model explicitly depend on the variable xi .
for describing mixture properties. The architecture of the Focke NN It should be emphasized that, though the Wilson model (1) and
I. Argatov, V. Kocherbitov / Fluid Phase Equilibria 502 (2019) 112282 3
the Focke model (6) are mathematically interrelated (see Appen- We emphasize that the total number of fitting parameters in
dix), they will be treated as different approximations. (11) is mðm 1Þ, which is the same as that of Wilson's model (1). In
view of (11) and (12), the ANN-type approximation (13) satisfies the
2.3. Degree-1 homogeneous NN model zero limit condition (7).
Strictly speaking, in Eq. (13), the activation function f1 is applied
In our modeling approach, we consider an ANN as a special kind not directly to the weighted sum of inputs but rather to its inverse
of nonlinear regression model, whose building blocks can be value, since the sum of all inputs is equal to unit (see Eq. (9)). By
regarded as generalized artificial neurons, which receive weighted setting F1 ðuÞ ¼ f1 ð1=uÞ, the ANN model (13) can be recast in the
inputs and sequentially perform the operation of summation and form of multilayer perceptron with an activation function pos-
the activation operation of nonlinear transformation. sessing a step-like discontinuity.
We take advantage of the flexibility of ANNs and construct a Observe that because tanhðuÞ2ð1; 1Þ for any u2ð ∞; ∞Þ, the
special architecture that represents the output variable y as a ho- following inequality holds:
mogeneous function of degree 1 in the input variables x1 ; x2 ; …; xm 0 , 1
(see Fig. 2). The units in the input layer receive m input signals xi , X m Xm
y x f @1 W x A 1:
which are transferred to each of m hidden neurons with individual i 1 j¼1 ij j
i¼1
weights Wij . The neuron outputs with weights equal to the corre-
sponding inputs are fed into the output neuron to produce the ANN From here it follows the ANN model (13) with the activation
model output y. The presented neural network model differs from a function (10) is a priori applicable for approximating functions yðx1 ;
MLP by the work of its hidden neurons, which is explained below. x2 ; …; xm Þ, which take the values between 1 and 1. Therefore, in a
To be more specific, we make use of the hyperbolic tangent general case, an appropriate normalization of the data is required.
tanhu ¼ ð1 e2u Þ=ð1 þ e2u Þ as an activation function of the hid- Further, we apply the ANN model (13) for approximating the
den layer neurons, which is one of the most used activation func- molar excess Gibbs energy, i.e., we put
tions. This choice allows us to avoid the presence of biases, thereby
significantly reducing the number of fitting parameters. Moreover, Pm !
gE Xm
j¼1 xj
since tanhð0Þ ¼ 0, we put ¼ xi f1 Pm ; (14)
RT i¼1 j¼1 Wij xj
f1 ðuÞ ¼ tanhðu 1Þ; (10)
so that the function GE ¼ ngE will be given by
so that
Pm !
GE X m
j¼1 nj
f1 ð1Þ ¼ 0: (11) ¼ ni f1 Pm :
RT i¼1 j¼1 Wij nj
Additionally, it is required that
By the definition (see Eq. (3)), the activity coefficients can be
Wii ¼ 1; (12) evaluated as
where Wij are the weights of the hidden layer connections, and no Xm !
summation over repeated indices is implied. n
j¼1 j
lngk ¼ f1 Xm
The number of the hidden neurons equals the number of
j¼1
Wkj nj
mixture components, and the inputs are used as the weights of the
output layer. Finally, the identity activation function is employed Xm ! Xm Xm
for a single output unit. X m n
j¼1 j
W n Wik
j¼1 ij j
n
j¼1 j
0
þ ni f 1 Xm X m 2
Thus, the introduced ANN model works as follows:
i¼1 W n
j¼1 ij j Wij nj
Pm ! j¼1
X
m
j¼1 xj
y¼ xi f1 Pm : (13) or, which is the same,
i¼1 j¼1 Wij xj
Xm !
x
j¼1 j
lngk ¼ f1 Xm
j¼1
Wkj xj
Xm (15)
! Xm Xm
X m x
j¼1 j
W x Wik
j¼1 ij j
x
j¼1 j
0
þ xi f 1 Xm Xm 2 :
i¼1
j¼1
W ij xj W x
ij j j¼1
Lij . Thus, the ANN model (13) generalizes Wilson's model for an
arbitrary monotonic activation function satisfying the centering
condition (11).
Observe that at the beginning we employ an tanh-based acti-
vation function, and then we generalize our equations for an arbi-
trary activation function f1 ðuÞ, which satisfies Eq. (11). Finally, we
consider a special case of logarithmic function to show a link to the
Wilson model. The logic of this approach is that both the form of
activation function and its centering play a role in designing an
ANN model.
Fig. 4. Schematic of generalized neuron with two kinds of input connections with
weights Vij and Wij . The neuron output is obtained by application of the activation
2.4. Generalized two-level degree-1 homogeneous NN model function f0 to the ratio of the corresponding weighted sums of inputs (see formula
(18)).
f0 ð0Þ ¼ 0: (16) Xm !
V x
j¼1 ij j
The centering condition (16) establishes the connection of the lngk ¼ f0 Xm
two-level NN model with the NRTL model. The choice of the acti- j¼1
Wkj xj
vation function adds an extra degree of freedom to the model. The Xm !(
general requirement is that the activation function f0 ðuÞ should be X
m V x Vik
j¼1 ij j
nonlinear, easily computable and with a good fitting performance. þ xi f 00 Xm Xm (19)
Along with the weights Wij , let us introduce the second set of i¼1 W x
j¼1 ij j
W x
j¼1 ij j
weights Vij , such that Xm 9
Wik V x >
j¼1 ij j
=
Vii ¼ 0: (17)
Xm 2 >
;
We put W x
j¼1 ij j
n b
100% X Yi Y i ;
MAPE ¼
n i¼1 Yi
Table 1
Comparison of the one-level NN regression model (13) with the Wilson model for several binary systems.
Acetone-Chloroform [20] 55 1.0969 1.8052 1.8049 7.0319 1.3210 1.6745 1.473 6.1125
Acetone-Hexane [21] 55 0.3980 0.3351 2.2554 15.0724 0.4774 0.3629 1.9657 11.18
Toluene-Dioxane [22] 100 1.1651 0.6517 6.0137 6.8682 1.0401 0.7657 6.1928 7.0385
Toluene-Isobutanol [22] 100 0.5985 0.4694 1.3988 7.283 0.6221 0.5198 1.5409 8.0809
Nitromethane [23] 45 0.1477 0.2493 0.2455 1.5114 0.2675 0.2452 9.7108 57.4062
Tetrachloromethane
Tetrahydrofuran [24] 25 1.2652 0.4031 11.5507 9.4097 1.2505 0.4724 10.5295 9.5073
Propanol
Benzene-Isooctane [25] 45 1.1967 0.3780 0.77721 1.3395 1.0780 0.5032 2.8067 3.7381
Propanol-Propylacetate [26] 40 0.5782 0.5251 2.2208 7.1915 0.6048 0.5685 2.7768 8.2608
Chloroform-Hexane [20] 55 1.1642 0.4867 2.5815 4.895 1.0647 0.5926 1.7216 3.6074
Methyl Acetate-Benzene [27] 50 0.9654 0.6899 8.7575 9.6196 0.9099 0.7547 8.727 9.6215
g E values are very close to zero. alcoholewater system (at 313.15 K) and make use of a set of 21
The results (see Table 1) show that in most cases the ANN and examples from Ref. [29] that are uniformly distributed over the
Wilson models show similar performances. In several cases, the entire interior of the input domain (see red points in Fig. 6).
ANN model provides better fitting of the experimental data Because, by their construction, the ANN-type models satisfy the
compared to the Wilson model. An interesting exception is Nitro- zero limiting condition (7), the vertices of the input polyhedron
methane e Tetrachloromethane system, where the ANN model (see blue points in Fig. 6) are automatically included into the train
performs substantially worse. The latter system shows very high data set with nil contribution to the training error.
positive deviations from ideal behavior (with high values of activity The following Focke coefficients have been obtained for the
coefficients) and the shape of the selected activation function acetic aciden-propyl alcoholewater system at 313.15 K:
seems to be not optimal for describing these values. A detailed
discussion of the optimal choice of activation function for fitting of a11 ¼ 0; a12 ¼ 0:04764; a13 ¼ 0:21113;
different types on experimental data is however out of the scope of a21 ¼ 0:18613; a22 ¼ 0; a23 ¼ 0:45959; (21)
this article. a31 ¼ 0:6983; a32 ¼ 1:48588; a33 ¼ 0:
Observe that this effect can be explained by the asymmetry in
the discrepancy between the activation function f1 ðuÞ and the The six non-zero coefficients aij written out above were evalu-
logarithmic function used in the Wilson model (see Fig. 5a). ated by fitting the Focke model (6) to the experimental data [29].
Fig. 5. Activation functions f1 ðuÞ ¼ tanhðu 1Þ and f0 ðuÞ ¼ tanhu for the one-level (a) and two-level (b) NN models, respectively. The two-dot-dash line indicates the common
tangent line. The superscripts 0 or 1 of the activation function refer to the centering condition where the function has null value.
6 I. Argatov, V. Kocherbitov / Fluid Phase Equilibria 502 (2019) 112282
Further, the following ANN weights have been obtained for the
acetic aciden-propyl alcoholewater system at 313.15 K:
Fig. 8. Absolute relative percentage errors of the regression models for the activity coefficients.
activity coefficients. [3] M. Dehghani, H. Modarress, A. Bakhshi, Modeling and prediction of activity
coefficient ratio of electrolytes in aqueous electrolyte solution containing
amino acids using artificial neural network, Fluid Phase Equilib. 244 (2) (2006)
Acknowledgment 153e159.
[4] R. Beigzadeh, M. Rahimi, S. Shabanian, Developing a feed forward neural
IA is grateful to the Biofilms center for the hospitality during his network multilayer model for prediction of binary diffusion coefficient in
liquids, Fluid Phase Equilib. 331 (2012) 48e57.
stay at the Malmo € University, where this research was carried out. [5] M. Moghadam, S. Asgharzadeh, On the application of artificial neural network
for modeling liquid-liquid equilibrium, J. Mol. Liq. 220 (2016) 339e345.
Nomenclature [6] R. Beigzadeh, M. Rahimi, S.R. Shabanian, Developing a feed forward neural
network multilayer model for prediction of binary diffusion coefficient in
liquids, Fluid Phase Equilib. 331 (2012) 48e57.
aij adjustable parameters in the Focke model [7] J.P. Poort, M. Ramdin, J. van Kranendonk, T.J. Vlugt, Solving vapor-liquid flash
aij non-randomness parameters in the NRTL model problems using artificial neural networks, Fluid Phase Equilib. 490 (2019)
39e47.
f, f0 , f1 transfer (activation) functions [8] H.A. Behrooz, R.B. Boozarjomehry, Prediction of limiting activity coefficients
GE excess Gibbs energy for binary vapor-liquid equilibrium using neural networks, Fluid Phase
gE molar excess Gibbs energy Equilib. 433 (2017) 174e183.
[9] S. Mohanty, Estimation of vapour liquid equilibria of binary systems, carbon
gk activity coefficient of the k-th component dioxideeethyl caproate, ethyl caprylate and ethyl caprate using artificial
Lij adjustable parameters in the Wilson model neural networks, Fluid Phase Equilib. 235 (1) (2005) 92e98.
m number of components [10] V. Nguyen, R. Tan, Y. Brondial, T. Fuchino, Prediction of vaporeliquid equi-
librium data for ternary systems using artificial neural networks, Fluid Phase
n total number of moles of all components
Equilib. 254 (1e2) (2007) 188e197.
ni number of moles of the i-th component [11] H. Ghanadzadeh, M. Ganji, S. Fallahi, Mathematical model of liquideliquid
p pressure equilibrium for a ternary system using the GMDH-type neural network and
R ideal gas constant genetic algorithm, Appl. Math. Model. 36 (9) (2012) 4096e4105.
€
[12] A. Ozmen, Correlation of ternary liquideliquid equilibrium data using neural
T temperature network-based activity coefficient model, Neural Comput. Appl. 24 (2) (2014)
tij dimensionless interaction parameters in the NRTL 339e346.
model [13] M. Hakim, G. Behmardikalantari, H. Najafabadi, G. Pazuki, A. Vosoughi,
M. Vossoughi, Prediction of liquideliquid equilibrium behavior for aliphaticþ
Vij weights in neural network model aromaticþ ionic liquid using two different neural network-based models,
Wij weights in neural network model Fluid Phase Equilib. 394 (2015) 140e147.
xi inputs for neural network model [14] S. Haykin, Neural Networks: A Comprehensive Foundation, Prentice-Hall, New
Jersey, 1999.
y output of neural network model [15] W. Focke, Mixture models based on neural network averaging, Neural Com-
put. 18 (1) (2006) 1e9.
Appendix.A. Relation between the models of Focke and [16] R. Petersen, A. Fredenslund, P. Rasmussen, Artificial neural networks as a
predictive tool for vapor-liquid equilibrium, Comput. Chem. Eng. 18 (1994)
Wilson S63eS67.
[17] G. Wilson, Vapor-liquid equilibrium. xi. a new expression for the excess free
Taking exponents of both sides of Eq. (1), we get energy of mixing, J. Am. Chem. Soc. 86 (2) (1964) 127e130.
[18] H. Renon, J. Prausnitz, Local compositions in thermodynamic excess functions
Xm Xm x i for liquid mixtures, AIChE J. 14 (1) (1968) 135e144.
gE
[19] H. Reynel-Avila, A. Bonilla-Petriciolet, J. Tapia-Picazo, An artificial neural
exp ¼ exp ln xj Lij ;
RT i¼1 j¼1
network-based nrtl model for simulating liquid-liquid equilibria of systems
present in biofuels production, Fluid Phase Equilib. 483 (2019) 153e164.
[20] L. Kudryavtseva, M. Susarev, Liquid-vapor equilibrium in chloroform-hexane
from where it follows that and acetone-chloroform systems, Russ. J. Appl. Chem. 36 (1963) 1231e1237.
[21] L. Kudryavtseva, M. Susarev, Liquid-vapor equilibriums in the systems
Y X m x i
gE m acetone-hexane and hexane-ethyl alcohol at 35, 45, and 55 and 760mmHg,
exp ¼ i¼1
exp ln xj Lij ; Russ. J. Appl. Chem. 36 (1963) 1471e1477.
RT j¼1 [22] M. Susarev, A. Toikka, Liquid-vapor-equilibrium in toluene-dioxane-isobutyl
alcohol system at 80 degrees and 100 degrees, Russ. J. Appl. Chem. 46 (11)
or, which is the same, as (1973) 2461e2464.
[23] I. Brown, F. Smith, Liquid-vapour equilibria. VII. The systems nitromethaneþ
Y X m x i benzene and nitromethaneþ carbon tetrachloride at 45+C, Aust. J. Chem. 8 (4)
gE m (1955) 501e505.
exp ¼ i¼1
xj Lij : [24] J.A. Salas, E.L. Arancibia, M. Katz, Excess molar volumes and isothermal vapor-
RT j¼1 liquid equilibria in the tetrahydrofuran with propan-1-ol and propan-2-ol
systems at 298.15 K, Can. J. Chem. 75 (2) (1997) 207e211.
Now, it is readily seen that the latter equation coincides with Eq. [25] S. Weissman, S.E. Wood, Vapor-liquid equilibrium of benzene-2, 2, 4-
(6), provided that aij ¼ Lij and y ¼ expð g E =RTÞ. trimethylpentane mixtures, J. Chem. Phys. 32 (4) (1960) 1153e1160.
[26] V. Sokolov, N. Markuzin, Experimental data on vapoureliquid equilibrium and
However, it should be emphasized that we directly apply the chemical reaction in the system acetic aciden-propanolewaterepropyl ace-
Focke model (6) for modeling the excess Gibbs energy g E by tate, 35e82, Soviet Institute of Science Information, 1982, pp. 1e12.
imposing the zero constraint (7). That is why, Eqs. (6) and (6) [27] I. Nagata, H. Hayashida, Vapor-liquid equilibrium data for the ternary sys-
tems: methyl acetate-2-propanol-benzene and methyl acetate-chloroform-
provide a model which is different from that of Wilson [17].
benzene, J. Chem. Eng. Jpn. 3 (2) (1970) 161e166.
[28] V. Kocherbitov, A. Toikka, Liquid-vapor and liquid-liquid phase equilibria in
References the system acetic acid-n-propanol-water-n-propyl acetate at 313.15 K, Russ. J.
Appl. Chem. 72 (10) (1999) 1706e1708.
[1] R. Orye, J. Prausnitz, Multicomponent equilibriadthe wilson equation, Ind. [29] V. Kocherbitov, A. Toikka, Liquid-vapor equilibrium in the acetic acidn-propyl
Eng. Chem. 57 (5) (1965) 18e26. alcohol-water system at 313.15 K, Russ. J. Appl. Chem. 70 (11) (1997)
[2] C.A. Faúndez, F.A. Quiero, J.O. Valderrama, Phase equilibrium modeling in 1777e1781.
ethanolþ congener mixtures using an artificial neural network, Fluid Phase [30] J. Tayman, D.A. Swanson, On the validity of mape as a measure of population
Equilib. 292 (2010) 29e35. forecast accuracy, Popul. Res. Policy Rev. 18 (4) (1999) 299e322.