Professional Documents
Culture Documents
Paper Transferencia.
Paper Transferencia.
a r t i c l e i n f o a b s t r a c t
Article history: In the present study, a feed-forward artificial neural network (ANN) was developed to estimate the Fick
Received 3 March 2012 diffusion coefficient in binary liquid systems. It was found as a function of the mole fraction of one com-
Received in revised form 27 June 2012 ponent, diffusion coefficient at infinite dilution, viscosity, and molar volume of each component. These
Accepted 28 June 2012
values are easily accessible from literatures and handbooks. Data from 54 systems consisting of 851 data
Available online 6 July 2012
points were collected and the ANN was trained with one and two hidden layers using various numbers
of neurons. After selection of the best ANN, the results were compared with other models. The results
Keywords:
show that this model has a superior performance on estimating the Fick diffusion coefficient. In addition,
Diffusion coefficient
Binary system
a new ANN was developed to predict the Maxwell–Stefan (MS) diffusivity, based on which the Fick dif-
Artificial neural network fusion coefficient was calculated. The results show that direct prediction of the Fick diffusivity leads to a
Modeling higher accuracy compared with that of the indirect prediction. Moreover, in the direct calculation, there
is no need for the thermodynamic correction factor, which can be treated as a second advantage of this
method. The accuracy of the method was evaluated through data points of other systems, which were
not previously introduced to the developed ANN.
© 2012 Elsevier B.V. All rights reserved.
0378-3812/$ – see front matter © 2012 Elsevier B.V. All rights reserved.
http://dx.doi.org/10.1016/j.fluid.2012.06.025
R. Beigzadeh et al. / Fluid Phase Equilibria 331 (2012) 48–57 49
Table 1 the Fick diffusivity for each liquid system over a whole range of
Theoretical and empirical correlative models for prediction of mutual diffusion coef-
mole fraction of components via pure component properties.
ficients in binary liquid systems (all used thermodynamic correction factor).
ANN is an appropriate method for estimating complex functions
Model names Equations in order to evaluate equilibrium and transport properties such as
1 VLE, [24,25] solubility, [26,27] and thermal conductivity [28–30] for
Carman–Stein [16] D = m
· (x1 1 D2∞ ) · (x2 2 D1∞ )
Rathbun–Babb [13] D = (x1 D2∞ ) · (x2 D1∞ ) ˛ ˛ is a adjustable mixtures. The present study introduces a feed forward ANN model,
parameter that was optimized. trained by the Levenberg–Marquardt algorithm [31–33], to predict
According to the system type the Fick diffusion coefficient in several liquid systems. The exper-
Gainer [17] D = D1∞ x22 x11 kx1
2 1/2
M2 1/4 Evap·A 1/2 imental data over a range of concentrations for fifty-four systems
k= 1 M1 Evap·B
M is (including 851 data points) were collected from the literature to
molecular weight and Evap is energy develop the ANN model. Results were compared with other for-
of vaporization mulas and a satisfactory advantage was found. In addition, another
Bosse-LVS [18] =1
D
x x
gE ANN model was developed to predict the MS diffusivity, and the
m
(1 D2∞ ) 1 (2 D1∞ ) 2 exp − RT
Fick diffusion coefficients were consequently found. The prediction
gE is excess Gibbs energy
m −1⁄2
results from both methods were compared.
1−
Guerkan [19] D = D0 1 + K x1 x2
D0 is
Table 2
Viscositya and molar volumeb of all compound types in temperature of systems used in the ANN input.
No. Component typec T (K) (MPa s) 106 v (m3 mol−1 ) No. Component type T (K) (MPa s) 106 v (m3 mol−1 )
In the present study, the extrapolation infinite dilution diffu- where Xmax and Xmin are the extreme values of variable Xi .
sion coefficients were used in order to achieve greater accuracy. It
should be noted that, after preparing the network, the correspond- 2.3. ANN modeling and training
ing relations [3,36,37] can be used to find these values.
In this study, the ANN input data were the mole fraction of the In this work, a feed forward neural network multilayer model
first component in binary liquids, diffusion coefficients at infinite was applied to predict the liquid diffusivity. The MATLAB software
dilution, viscosity, and molar volume of each component. In addi- [38] was employed for the ANN modeling. Fig. 1 shows the layout of
tion, the input and output data at various temperatures were used. the ANN architecture used in this work. In the ANN architecture, the
The employed viscosity and molar volume are listed in Table 2. numbers of input and output neurons were determined according
In the present work, all input data points were divided randomly to the problem definition. In this work, the input and output lay-
into three sections including: training (60%), validation (20%) and ers have seven and one neurons, respectively. Theoretical methods
test (20%) data sets. Table 3 gives the overall range of data points
employed in this work for developing the ANN model.
Table 3
List of range of employed data.
2.2. Preprocessing
Variable Range
As far as in training of the ANN, the input and output data have x1 0–1
1 (mpa s) 0.224–3.085
different physical units and range, it is often useful to normalize
2 (mpa s) 0.271–3.085
all the input and output data before training. Therefore, all data 103 v1 (m3 mol−1 ) 0.018–0.319
(Xi ) were normalized (Yi ) in the range of 0–1 using the following 103 v2 (m3 mol−1 ) 0.018–0.261
relation: 109 D1 ∞ (m2 s−1 ) 0.48–5
109 D2 ∞ (m2 s−1 ) 0.4–6.05
(Xi − Xmin ) 109 Ð (m2 s−1 ) 0.4–10.89
Yi = (7) 109 D (m2 s−1 )
(Xmax − Xmin ) 0.15–6.05
R. Beigzadeh et al. / Fluid Phase Equilibria 331 (2012) 48–57 51
for determining the appropriate number of hidden layers are not process of the ANN was carried out using the Levenberg–Marquardt
available. Moreover, the number of neurons in each hidden layer back propagation algorithm. In this training method, connection
prior to training cannot be obtained theoretically. Therefore, the weights wij and biases bj are iteratively adjusted to minimize the
trial-and-error method is commonly used to design the ANN. output deviation (predicted by ANN) from the target values accord-
In the present study, the trial-and-error method was employed ing to Levenberg–Marquardt [30–32] optimization method. The
to attain the number of layers and neurons. For this purpose, a early stopping method was used to avoid memorizing the train-
MATLAB script [38] was written that creates and screens differ- ing data set. In this method, in case of a decrease in deviation of the
ent structure several times, such as: one and two hidden layers as predicted values without decrease in predicted values of the val-
well as various numbers of neurons in each hidden layer. Because idation data set, the training will be stopped. Moreover, the error
the initial weights may have a large effect on the convergence, runs values should be calculated for different network structures with
were repeated 100 times with different randomly generated initial different numbers of neurons in hidden layers with one and two
values. The results presented in this work are the best ones obtained hidden layers.
from this procedure.
An increase in the number of hidden neurons may cause over-
fitting that usually occurs when a model is complicated. The 2.4. Deviations
complicated neural network means that there is a large number
of neurons (with a large number of weights and biases) relative to In this work in order to select the best ANN architecture, the
the number of training data. In these networks, the optimal number average squared deviation (ASD) and the correlation coefficient (R)
of neurons should be determined. between the experimental and estimated values were used. The
Each neuron is connected to the neurons of the adjacent layer, average relative deviation (ARD) was applied to compare the results
whose information is transferred via the connection. Each of these with other methods using the following relations:
connections has a weight (wij ) and these weights indicate the
1 Exp
N
strength of the connection. The output of each neuron can be 2
ASD = (D − DPred ) (9)
obtained from the following equation: N
n i=1
yj = F wji xi + bj
(8)
100
N DExp − DPred
i=1 ARD (%) = (10)
N DExp
Here yj is the output from jth neuron, xi is the output vector from i=1
ith to jth neuron, bj is the jth neuron bias, n is number of input vec-
tors to jth neuron, and F is the transfer function that transformed
N
Exp Exp
the sum of the weights plus the bias to get the normalized output (DExp − DAve ) · (DPred − DAve )
values. There are many types of transfer functions. In this work the i=1
R= (11)
“tansig” transfer function was used for the hidden layers and “pure- N Exp 2 N Pred ) 2
i=1
(DExp − DAve ) · i=1
(DPred − DAve
lin” function were considered for the output layer. The training
52 R. Beigzadeh et al. / Fluid Phase Equilibria 331 (2012) 48–57
Fig. 3. ASD values for different number of neurons in the first and second hidden
layers.
Table 4
Best ARD values of different training algorithm for ANN with 7-25-1 configuration.
Trainlm 4.75
Traincgb 5.17
Trainb 7.30
Trainbr 7.28
Traingdm 9.79
Traingda 9.51
Table 6
Results of the ANN (7-25-1) compared with other methods.
No. Binary systems T (K) Number of data pointsa ARD (%)b D Ref.
Table 6 (Continued)
No. Binary systems T (K) Number of data pointsa ARD (%)b D Ref.
c d
ANN Darken Leffler–Cullinan Li et al.
illustrate that using the Levenberg–Marquardt algorithm leads to The results reveal that the ANN led to satisfactory results in
the lowest error. comparison with other methods.
Deviations of the training, validation and testing stages of the The Wilson parameters were used to calculate the , extracted
selected topologies are reported in Table 5. Acceptable differences from vapor–liquid equilibrium data [74,75]. Dilution diffusion coef-
between testing and training data sets of the ASD values can be ficients were extracted from experimental data or in the absence of
found in this table. This means that the network was suitable for experimental results they were extrapolated as shown in Table 7.
predicting data points that are not used in the training data set. The temperature was not considered as input variable but its effect
On the other hand, the results of the developed ANN for on them is quite obvious.
predicting the MS diffusivity leads to higher errors compared with In a typical research, usually the mutual diffusion coefficient
the direct method of predicting the Fick diffusivity. This drawback prediction models were provided and then a diagram of diffu-
can be related to failure in obtaining accurate thermodynamic sion coefficients versus concentration for better illustrating the
correction factor values. Here, the Wilson excess Gibbs energy prediction results was presented. Fig. 4 illustrates the experi-
model was used to calculate the thermodynamic correction mental diffusion coefficients and results of the ANN prediction
factor. The applied Wilson parameters in this model were fitted for Acetone–Chloroform system at 298 K. The mole fractions of
against vapor–liquid equilibrium experimental data to obtain this system were selected between 0 and 1 with 0.01 intervals
activity coefficients. Nevertheless, a low error level in the activity (0:0.01:1). This demonstrates an acceptable ability of the network
coefficients leads to a large error level in the first derivatives of prediction in the whole range of concentrations. In this figure, the
the activity coefficients [37,39]. Therefore, an error occurs for ANN outcomes were compared with those obtained from other
calculating , especially when is close to zero. The reason is that methods (modified Darken [3], Leffler–Cullinan [8] and Li et al. [9]).
with low changes in its value, the diffusion coefficient will change The presented results indicate satisfactory predictions by ANN.
significantly. Therefore, this error is inevitable in models that use Fig. 5 indicates a comparison between the predicted results from
the activity coefficients. In this work, it was tried to predict the Fick the ANN and the experimental values. This figure shows that the
diffusion coefficients with low prediction errors and without any
need for the calculation of the thermodynamic correction factor.
Table 6 shows the average relative deviations of the predicted
results from the ANN (with one hidden layer) compared with
the experimental data for predicting the Fick diffusivity of each
solution. The extrapolation dilution diffusion coefficients are not
employed in the data presented in this table. Moreover, the Net-
work capability in estimation of the Fick diffusion coefficients in
other systems (unknown for the ANN) was investigated. Refer-
ring to Table 6, it can be found that the prediction results for
these systems are quite acceptable. The total average relative devi-
ation for all experimental data points for the ANN results is 4.75%,
while this value is above 11% for the modified Darken [3] and
Leffler–Cullinan [8] models. Deviations of the modified Darken [3]
and Leffler–Cullinan [8] models were calculated, but deviation val-
ues of the Li et al. [9] model are reported directly from their work.
The mixture viscosity used for the Leffler–Cullinan [8] model was
calculated by an empirical formula as follows [9]:
1
= · [(1 v1 )x1 + (2 v2 )x2 ] (12)
v
Fig. 4. A comparison among predicted and experimental binary Fick diffusion coef-
v = x1 v1 + x2 v2 (13) ficients acetone–chloroform at 298 K.
R. Beigzadeh et al. / Fluid Phase Equilibria 331 (2012) 48–57 55
Table 7
Infinite dilution diffusion coefficients used in the ANN input.
Table 8
Connection weights and biases.
Neuron W1 b1 b2 = 5.007
x1 D1 ∞ D2 ∞ v1 v2 1 2 W2
ANN works similar to a box that contains many polynomial equa- List of symbols
tions. One can enter the system type and mole fraction of each
component and find the diffusion coefficient value with a high
precision.
bj bias
C molar concentration (mol m−3 )
D Fick diffusion coefficient (m−2 s−1 )
4. Conclusions Ð Maxwell–Stefan diffusion coefficient (m−2 s−1 )
Di ∞ diffusion coefficient at infinite dilution of species i
This study presents a novel method for mutual diffusion coef- (m−2 s−1 )
ficients prediction in binary liquids. An artificial neural network F transfer function
scheme was developed to estimate the mutual diffusion coeffi- gE excess Gibbs energy (J mol−1 )
cients in binary liquid systems. The variables such as diffusion J molar flux (mol m−2 s−1 )
coefficients at infinite dilution, viscosity, molar volume, and mole M molecular weight
fraction of each component were used as the network inputs. Data n number of input vectors
of 54 binary liquid systems consisting of 851 data points were col- N number of data points
lected and divided randomly into three sections including; training R universal gas constant (8.314 J mol−1 K−1 )
(60%), validation (20%), and test (20%). The ANNs with different T temperature (K)
architectures were used to predict the Fick and Maxwell–Stefan v molar volume (m3 mol−1 )
diffusion coefficients. The optimum number of neurons for one wij weight
and two hidden layers was selected by the trial-and-error method. x independent variable
From this study, it can be concluded that the direct prediction of the xi mole fraction of component i/output vector from ith neu-
Fick diffusivity is more appropriate than the prediction using MS ron
diffusivity. The high precision in estimating liquid diffusion coef- X network input
ficients and no need to calculate the thermodynamic correction yj neuron output
factor are the reasons, which support this conclusion. The pre-
diction results of the ANN with one hidden layer (7-25-1) was
presented and compared with the prediction results from other Greek letters
models. For this ANN architecture, the total average squared devi- activity coefficient
ation of 0.01 and the total average relative deviation of 4.75% were thermodynamic factor
obtained. dynamic viscosities (pa s)
From this study, it can be concluded that the artificial neural net- Wilson’s parameter
works can be a solution to problems with complex relationships Wilson’s parameter
between diffusion coefficients and concentration in binary liquid
systems, even for strongly non-ideal systems. The presented ANN in Subscripts
this work introduces a technique to find diffusion coefficient values 1 2 component 1 and 2 in solution
with a high precision by introducing the system type and mole frac- i component i/input layer
tion of each component instead of sophisticated polynomial fitting j hidden layer
for each individual system. k output layer
R. Beigzadeh et al. / Fluid Phase Equilibria 331 (2012) 48–57 57