Professional Documents
Culture Documents
Fitting The Sovov A's Supercritical Fluid Extraction Model
Fitting The Sovov A's Supercritical Fluid Extraction Model
com
Received 18 January 2007; received in revised form 25 June 2007; accepted 28 August 2007
Available online 1 September 2007
Abstract
The estimation of parameters of the Sovová’s model for supercritical fluid extraction is addressed in the present work. A global optimization
approach is employed. Reasons for this decision are given as a consequence of more general modeling objectives. The way of dealing with different
objective functions and constraints is described. Real-life experiments are given.
© 2007 Elsevier Ltd. All rights reserved.
0098-1354/$ – see front matter © 2007 Elsevier Ltd. All rights reserved.
doi:10.1016/j.compchemeng.2007.08.016
1736 J. Martı́nez, J.M. Martı́nez / Computers and Chemical Engineering 32 (2008) 1735–1745
constrained model with the constraint structure of Sovová’s y(t) = QCO2 Y ∗ [t − tCER exp(Zw (t) − Z)],
model. In practice, we run the method with a rather modest
if tCER < t ≤ tFER (3)
precision parameter and we complement the execution with two
local optimization procedures. The first one is a local varia- and
tion method, which is very efficient to detect decrease in the
Y∗ WX0
coordinate directions, and the second is the algorithm of Nelder y(t) = mSI X0 − ln 1 + exp −1
and Mead (1965) which works efficiently to detect oblique final W Y∗
descent directions. This combination of global and local strate- WQCO2 Xk
gies is recommendable, in practice, for many global optimization × exp (tCER − t) (4)
mSI X0
problems.
This paper is organized as follows. In Section 2 we describe if t > tFER .
Sovová’s model and we show the way in which it must be The function Zw used in (3) is defined by
formulated for suitable estimation using a global optimization ZY ∗ X0 exp[WQCO2 (t − tCER )/mSI ] − Xk
algorithm. In Section 3 merit functions are discussed. In Sec- Zw (t) = ln .
WX0 X0 − X k
tion 4, global optimization reasons are discussed. In Section 5 (5)
we describe the optimization algorithm. Practicalities are dis-
cussed in Section 6. Section 7 is devoted to fitting in real-life We define
experiments. Conclusions are stated in Section 8. tCER Y ∗ ZQCO2
Xp = and Xk = X0 − Xp . (6)
mSI
2. Sovová’s model
The quantity Xp represents the amount of easily extractable
Many mathematical models to reproduce SFE curves can be material. Therefore, in a realistic simulation, one must have
found in the literature. Part of them are simply empirical equa- Xp ≤ X0 . In other words, a simulation using (2)–(5) may rep-
tions that fit to experimental data (Esquı́vel, Bernardo-Gil, & resent a real process only if
King, 1999), while others are based on the mass balance inside mSI X0
Z≤ . (7)
the extraction bed. Some models adopt simplifications to the tCER Y ∗ QCO2
mass balance that allow one to have an analytical solution, and
then equation for the extraction curve (Goto, Sato, & Hirose, The value of W > 0 is implicitly defined by the equation:
1993; Martı́nez, Monteiro, Rosa, Marques, & Meireles, 2003; mSI Xk + Xp exp(WX0 /Y ∗ )
Sovová, 1994). Other authors present models where the extrac- tFER = tCER + ln . (8)
WQCO2 X0
tion curve is obtained by numerical solution of the mass balance
(Reverchon & Marrone, 1997, 2001; Zizovic, Stamenic, Orlovic, The model of Sovová has been used to fit experimental
& Skala, 2005). In this work we deal with the model of Sovová. SFE data from many vegetable raw materials, and the results
This model assumes that part of the extractable material (Xp ) are mostly good (Del-Valle, Jiménez, & De-la-Fuente, 2003;
is easily accessible to the solvent, due to the breaking of the Kiriamiti, Rascol, Marty, & Condoret, 2002; Perakis, Louli, &
cell structures which contain the solute, during the milling of Magoulas, 2005; Povh et al., 2001; Sousa et al., 2005).
the raw material. The other fraction of the solute (Xk ) remains Given QCO2 , Y ∗ , mSI , X0 > 0, tCER > 0, W > 0 and Z >
inside the cell structures that were not broken by milling, so its 0 satisfying (7), the formulae (2)–(5) may be used to
contact with the solvent is more difficult. The SFE process, in simulate supercritical fluid extraction processes. The param-
this case, can be divided in three periods, the first one being eters QCO2 , Y ∗ , mSI , X0 are contextual parameters whereas
controlled by convection in the fluid phase, the last one con- tCER , Z, W are model parameters. W may be interchanged with
trolled by diffusion in the solid, and an intermediary period when tCER as model parameter since there exists a biunivoque corre-
both convection and diffusion are important. We have, then, spondence between them.
one mass transfer coefficient for each phase, and these are two Assume now that a set of experimental data (t1 , yobs
adjustable parameters of the model. The third adjustable param- (1)), . . . , (tm , yobs (m)) is given. Let us consider the (inverse)
eter is the extraction rate at the first period, which is thought problem of estimating the extraction curve (2)–(4) that best fits
to be constant according to the model assumptions. The idea of these observations. The parameters QCO2 , Y ∗ , mSI and X0 are
broken and intact cells is used by other authors who show its given but tCER , Z, tFER and W need to be estimated. Due to the
validity using electronic microscopy (Reverchon & Marrone, arguments displayed in the Introduction, we recommend a global
2001; Zizovic et al., 2005). The detailed physical description, optimization approach for estimating the model parameters.
as shown above, and its validity shown by microscopy lead us Let us consider the possibility tCER = 0. In this case, by (6),
to select the model of Sovová to test our global optimization we have that Xp = 0. Then, by (8), tFER (W) = 0 for all W > 0.
method. So, tFER = 0. This means that the whole process is governed by
The analytic solution of the mass balance equations (Sovová, the third stage, when mass transfer inside the solid particles is
1994) is given by the following formulae: more important (4).
Now consider the possibility tCER > 0, Z = 0. Again, by (6),
y(t) = QCO2 Y ∗ [1 − exp(−Z)]t, if 0 ≤ t ≤ tCER , (2) we have that Xp = 0 and, by (8), we have that tFER = tCER . But,
1738 J. Martı́nez, J.M. Martı́nez / Computers and Chemical Engineering 32 (2008) 1735–1745
since Z = 0, by (2) we have that y(t) = 0 for all t ≤ tCER . Now, subject to
deriving (4) we obtain:
min
tCER ≤ tCER ≤ tm , (12)
−WX0
y (tCER ) = QCO2 Y ∗ 1 − exp . mSI X0
Y∗ Zmin ≤ Z ≤ , (13)
tCER Y ∗ QCO2
Since y (t) = 0 for t < tCER , to preserve the continuity of and
the derivative, we must have W = 0. By (4) this implies that
y(t) = 0 for all t. mSI X0
tCER (1 + Z) ≤ tFER ≤ tCER + . (14)
The last observation shows that the possibility Z = 0 should QCO2 Y ∗
be considered only associated to tCER = 0. The alternative Z = In this formulation, we assume that (y1 (tCER , Z, tFER ),
0, tCER > 0 is physically impossible. Concerning the possibility . . . , ym (tCER , Z, tFER )) is defined by (2)–(5) and that W is
that tCER = 0 (which certainly has physical meaning) we impose defined by (8).
the artificial bound tCER ≥ tCER
min , where t min > 0 is very small
CER
(say, 0.01t1 ). In this way, we will always have Xp > 0 although
3. Choice of the merit function
possibly extremely small. The impossibility of Z = 0, tCER > 0
leads us to include, also, a constraint of the form Z ≥ Zmin > 0.
In the previous section we stated minimal assumptions on the
Assuming Xp > 0, by Eq. (8), there is a biunivoque corre-
merit function Φ. This function must be continuous and nonneg-
spondence between the values of W belonging to (0, ∞) and
ative, but we do not impose differentiability. Differentiability of
the values of tFER belonging to the bounded interval (tCER +
merit functions is a desirable property because it allows one to
mSI Xp /QCO2 Y ∗ , tCER + mSI X0 /QCO2 Y ∗ ). In fact, it is easy to
use smooth optimization methods for solving the fitting problem
see that, if Xp = 0, Eq. (8) defines a strictly increasing function
but, many times, it is better to evaluate the error using a nons-
tFER = tFER (W) such that
mooth Φ. The most popular smooth merit function is the sum of
mSI Xp squares:
lim tFER (W) = tCER + = tCER (1 + Z) (9)
W→0 QCO2 Y ∗
m
Φ2 (y, yobs ) = (y(ti ) − yobs (ti ))2 .
and i=1
mSI X0 However, the sum of squared errors is highly affected by out-
lim tFER (W) = tCER + . (10)
W→∞ QCO2 Y ∗ liers (Huber, 1981). This fact motivates the use of nonsmooth
alternatives in which the weight of wrong observations is mod-
Therefore, one does not need to consider that both W and tFER erate. The best known alternative is the sum of absolute errors:
are unknowns. Given an estimation of W, tFER is automatically
determined and vice versa. We prefer to estimate tFER instead
m
of W because tFER varies in a bounded interval with a precise Φ1 (y, yobs ) = |y(ti ) − yobs (ti )|.
geometrical and physical meaning. Thus, our inverse estimation i=1
problem consists in finding Z > 0, tCER > 0, tFER > 0 such that
(7) holds The weighted counterparts of Φ1 and Φ2 should also be
mentioned. These are:
mSI Xp mSI X0 m
tCER + ≤ tFER ≤ tCER + , [y(ti ) − yobs (ti )] 2
QCO2 Y ∗ QCO2 Y ∗ Φ2w (y, yobs ) = and
yobs (ti )
i=1
and the values y(t1 ), . . . , y(tm ) computed by (2)–(5) fit closely
yobs (t1 ), . . . , yobs (tm ).
m
|y(ti ) − yobs (ti )|
Φ1w (y, yobs ) = .
We need a criterion for defining the proximity between yobs (ti )
i=1
the theoretical observations y = (y(t1 ), . . . , y(tm )) and the
experimental observations yobs = (yobs (t1 ), . . . , yobs (tm )). This In this paper, we will use Φ(y, yobs ) = Φ1w (y, yobs ).
criterion should be represented by a merit function Φ(y, yobs )
that must satisfy the following axioms:
4. Motivation for global optimization
1. Φ(y, yobs ) must be continuous. When fitting Sovová’s parameters to SFE it is common
2. Φ(y, yobs ) ≥ 0 for all y, yobs . to observe that quite different estimates produce very similar
3. If y = yobs then Φ(y, yobs ) = 0. extraction curves. In some cases, the graphic of the merit func-
tion Φ exhibits local minimizers or is extremely flat. These
Summing up, the optimization problem that we need to solve facts make difficult the determination of the best model parame-
is ters from problem data. Local minimization algorithms may be
trapped by local minimum points or may “decide” that a set of
Minimize Φ(y(tCER , Z, tFER ), yobs ) (11) parameters with small gradient of Φ is a solution.
J. Martı́nez, J.M. Martı́nez / Computers and Chemical Engineering 32 (2008) 1735–1745 1739
How serious is this inconvenient? The answer depends on our The purpose of solving (16)–(19) is the obtention of the
more general objectives. If we aimed merely smoothing a partic- “true parameters” tCER , Z, tFER using small-scale laboratory
ular empirical curve (in particular, interpolating at non-observed experiments. The problem (16)–(19) may have local-nonglobal
periods of time and discarding small experimental errors) the minimizers, therefore, local nonlinear programming algorithms
obtention of a small residual should be satisfactory enough since might not be efficient for finding the best parameters. More-
the best parameters would have no meaning at all. However, over, usual nonlinear programming algorithms usually exploit
our more general goal is more ambitious. We wish to develop a the smoothness of the objective function, which might not exist
Sovová-based model that, given the raw material, would predict in this case.
the extraction curve as a function of the contextual parameters. However, due to the small number of parameters to be esti-
The usefulness of such a model would be that, given very dif- mated, a suitable global optimization method may be employed.
ferent contextual parameters, with different scales, we would be We will use a lexicographical grid method (LGM). The idea is
able to predict reliable extraction performances. The problem to define a grid in the feasible region and to use exhaustive eval-
is that, as mentioned in the Introduction, a model that gives the uation of the objective function at the grid points. If the grid is
extraction curve as a function of contextual parameters alone fine enough, we obtain a global solution with arbitrary precision.
does not exist. The discovery of such a model must be preceded In order to give a precise geometrical meaning to the employed
by the discovery of the empirical relation: grid, we will use, instead of Z, the new variable S that represents
the slope of the line that defines y(t) for t ≤ tCER . By (2), we
Contextual parameters ⇒ model parameters. (15) have
The nature of the “equation” (15) is, up to the present, S = QCO2 Y ∗ [1 − exp(−Z)].
unknown. Very likely one must rely on empirical models, sur-
face response analysis or even neural networks. Independently Therefore, S < QCO2 Y ∗ and
of the technique adopted, the set of data used to fit (15) must be
S
as precise as possible. For fitting (15) it is not enough that a set Z = − ln 1 − .
QCO2 Y ∗
of model parameters fit well an empirical extraction curve. It is
necessary that this set of model parameters should be the true The minimum values for S and Z are connected by
model parameters for the specific extraction process. It follows
S min = QCO2 Y ∗ [1 − exp(−Zmin )] and
that, under our more general project, the obtention of global
For the execution of LGM we define n1 , n2 , n3 , the number F (x1∗ , x2∗ + jε2 , x3∗ ) ≥ F (x1∗ , x2∗ + (j − 1)ε2 , x3∗ ).
of grid points corresponding to the variables x1 , x2 , x3 , respec-
tively.
Update:
Algorithm LGM.
x2∗ ← x2∗ + (m2 − 1)ε2 .
Step 0. Set M ← ∞.
Step 1. Define δ1 = (b − a)/(n1 + 1). For i = 1, . . . , n1 , Step 6. If F (x1∗ , x2∗ , x3∗ + ε3 ) ≥ F (x∗ ) set ε3 = −ε3 .
define x1 ← a + iδ1 and execute Step 2. Step 7. Let m3 be the first index j = 1, 2, . . . such that
Step 2. Define δ2 = (g2 (x1 ) − g1 (x1 ))/(n2 + 1). For
j = 1, . . . , n2 , define x2 ← g1 (x1 ) + jδ2 and execute F (x1∗ , x2∗ , x3∗ + jε3 ) ≥ F (x1∗ , x2∗ , x3∗ + (j − 1)ε3 ).
Step 3.
Step 3. Define δ3 = (g4 (x1 , x2 ) − g3 (x1 , x2 ))/(n3 + 1). For Update:
k = 1, . . . , n3 , define x3 ← g3 (x1 , x2 ) + kδ3 and exe-
cute Step 4. x3∗ ← x3∗ + (m3 − 1)ε3 .
Step 4. If F (x1 , x2 , x3 ) ≤ M, set M ← F (x1 , x2 , x3 ) and
(x1∗ , x2∗ , x3∗ ) ← (x1 , x2 , x3 ). Step 8. If m1 = m2 = m3 = 1, stop. Else, go to Step 1.
7. Experiments
x1∗ ← x1∗ + (m1 − 1)ε1 .
The following problems refer to supercritical CO2 extrac-
Step 4. If F (x1∗ , x2∗ + ε2 , x3∗ ) ≥ F (x∗ ) set ε2 = −ε2 . tion from vetiver roots at 200 bar and 40 ◦ C (Martı́nez, 2005).
Step 5. Let m2 be the first index j = 1, 2, . . . such that The problems 1, 2 and 3 correspond to small scale experi-
J. Martı́nez, J.M. Martı́nez / Computers and Chemical Engineering 32 (2008) 1735–1745 1741
Table 1 Table 3
Experiments with Problem 1: global minimizer Experiments with Problem 2: global minimizer
Time Observed y Computed y Error Percent error Time Observed y Computed y Error Percent error
0.00 0.0000 0.00000000 0.00000000 0.00000000 0.00 0.0000 0.00000000 0.00000000 0.00000000
10.00 0.0408 0.04080002 0.00000002 0.00004040 10.00 0.0508 0.05080000 0.00000000 −0.00000071
20.00 0.0871 0.08160003 −0.00549997 −6.31454310 20.00 0.0914 0.09979088 0.00839088 9.18039145
30.00 0.0961 0.10512415 0.00902415 9.39037530 30.00 0.1030 0.10300002 0.00000002 0.00001780
45.00 0.1098 0.10980000 0.00000000 0.00000066 45.00 0.1118 0.10666094 −0.00513906 −4.59665042
60.00 0.1181 0.11425753 −0.00384247 −3.25357618 60.00 0.1131 0.11019149 −0.00290851 −2.57162503
80.00 0.1212 0.11987873 −0.00132127 −1.09015508 80.00 0.1140 0.11470404 0.00070404 0.61758099
100.00 0.1258 0.12515252 −0.00064748 −0.51468673 100.00 0.1206 0.11900352 −0.00159648 −1.32378371
120.00 0.1301 0.13010027 0.00000027 0.00020713 120.00 0.1231 0.12309994 −0.00000006 −0.00005187
140.00 0.1343 0.13474203 0.00044203 0.32913883 140.00 0.1255 0.12700285 0.00150285 1.19749263
ments, and the problems 4, 5 and 6 are large scale runs at Running the lexicographical method with ngrid = 100, we
the same pressure and temperature, with the same material. obtained tCER = 16.987, tFER = 18.747, Z = 0.102214, W =
Some data needed for the model are common to all problems: 0.044910 with f = 1.67919. After the local variations step, we
X0 = 0.0619, Y ∗ = 0.06, ρ = 0.829, ρS = 1.3. obtained tCER = 19.291, tFER = 21.267, Z = 0.102214, W =
Other data are specific for each problem. In all the prob- 0.009489 with f = 0.20564.
lems we used the weighted sum of absolute errors Φ1w as merit Finally, using this initial point to initialize the Nelder–Mead
function. method, we obtained: tCER = 18.766, tFER = 20.740, Z =
0.104925, W = 0.009490 with f = 0.194876. Results are
Problem 1. The data for this problem are: QCO2 = 0.85, shown in Table 3.
mtotal = 3.53, H = 2.9, db = 2 = 0.557. The Nelder–Mead method was applied in this case, starting
Running the lexicographical method with ngrid = 100, we from tCER = 20, W = 1, Z = 1. We got as result f = 1.01599
obtained tCER = 19.815, tFER = 21.508, Z = 0.084229, W = with tCER = 1.572, W = 0.0144, Z = 1.0450. The results were
0.046563 with f = 1.2859. After the local variations step, we far from the ones observed above, although the starting value of
obtained tCER = 21.482, tFER = 23.300, Z = 0.084229, W = tCER was close to the optimum. As we can see in Table 4, this
0.017796 with f = 0.27092. fitting may lead us to a wrong result, as well as in Problem 1.
Finally, using this initial point to initialize the Nelder–Mead
method, we obtained: tCER = 24.372, tFER = 26.411, Z = Problem 3. The data for this problem are: QCO2 = 6.3,
0.083382, W = 0.012494 with f = 0.20893. Table 1 shows mtotal = 26.2, H = 2.9, db = 5.45 = 0.702.
these results. Running the lexicographical method with ngrid = 100, we
It is interesting to observe that the Nelder–Mead method, obtained tCER = 12.745, tFER = 15.919, Z = 0.247610, W =
starting from different initial points, fails to arrive to the 0.049935 with f = 0.20350. After the local variations step, we
nice minimum exhibited above. For example, initializing obtained tCER = 12.845, tFER = 16.042, Z = 0.247610, W =
Nelder–Mead with tCER = 21, W = 1, Z = 1, we got f = 0.050090 with f = 0.20124.
1.28 with tCER = 1.57, W = 0.0195, Z = 0.952. The fitting Finally, using this initial point to initialize the Nelder–Mead
obtained with these values is in Table 2. Observe that this fit- method, we obtained: tCER = 13.627, tFER = 16.762, Z =
ting is not terribly bad and could be easily taken (wrongly) as a 0.228515, W = 0.056783 with f = 0.18717. Results are shown
satisfactory one. in Table 5.
In this case the Nelder–Mead method was applied start-
Problem 2. The specific data for this run are exactly the same ing from tCER = 10, W = 1, Z = 1. We got f = 0.18999 with
as in Problem 1. tCER = 13.684, W = 0.0532, Z = 0.2285. It is interesting to
Table 2 Table 4
Experiments with Problem 1: simplex fitting Experiments with Problem 2: simplex fitting
Time Observed y Computed y Error Percent error Time Observed y Computed y Error Percent error
0.00 0.0000 0.00000000 0.00000000 0.00000000 0.00 0.0000 0.00000000 0.00000000 0.00000000
10.00 0.0408 0.08109449 0.04029449 98.76100615 10.00 0.0508 0.08712682 0.03632682 71.50949280
20.00 0.0871 0.08709951 −0.00000049 −0.00056187 20.00 0.0914 0.09140023 0.00000023 0.00025105
30.00 0.0961 0.09281505 −0.00328495 −3.41825819 30.00 0.1030 0.09552006 −0.00747994 −7.26208181
45.00 0.1098 0.10087562 −0.00892438 −8.12785185 45.00 0.1118 0.10142376 −0.01037624 −9.28107566
60.00 0.1181 0.10835967 −0.00974033 −8.24752810 60.00 0.1131 0.10701186 −0.00608814 −5.38297388
80.00 0.1212 0.11751180 −0.00368820 −3.04306558 80.00 0.1140 0.11400055 0.00000055 0.00048110
100.00 0.1258 0.12580040 0.00000040 0.00031634 100.00 0.1206 0.12049516 −0.00010484 −0.08693098
120.00 0.1301 0.13330632 0.00320632 2.46450644 120.00 0.1231 0.12653042 0.00343042 2.78669095
140.00 0.1343 0.14010299 0.00580299 4.32091404 140.00 0.1255 0.13213862 0.00663862 5.28974060
1742 J. Martı́nez, J.M. Martı́nez / Computers and Chemical Engineering 32 (2008) 1735–1745
Table 5 Table 7
Experiments with Problem 3: global minimizer Experiments with Problem 4: global minimizer
Time Observed y Computed y Error Percent error Time Observed y Computed y Error Percent error
0.00 0.0000 0.00000000 0.00000000 0.00000000 0.00 0.0000 0.00000000 0.00000000 0.00000000
5.00 0.4166 0.38610013 −0.03049987 −7.32114033 5.00 1.0345 1.03450132 0.00000132 0.00012757
10.00 0.7722 0.77220026 0.00000026 0.00003351 10.00 2.2807 2.06900264 −0.21169736 −9.28212218
20.00 1.1284 1.20022982 0.07182982 6.36563476 20.00 3.2398 3.38177516 0.14197516 4.38221976
30.00 1.2435 1.24350016 0.00000016 0.00001292 30.00 3.5512 3.55119998 −0.00000002 −0.00000066
40.00 1.2912 1.28096762 −0.01023238 −0.79247059 40.00 3.7113 3.68406316 −0.02723684 −0.73388944
50.00 1.3219 1.31340364 −0.00849636 −0.64273825 50.00 3.7992 3.78817237 −0.01102763 −0.29026196
60.00 1.3471 1.34147882 −0.00562118 −0.41728042 60.00 3.8697 3.86969983 −0.00000017 −0.00000441
80.00 1.3868 1.38679970 −0.00000030 −0.00002139 80.00 3.9631 3.98344051 0.02034051 0.51324736
100.00 1.4146 1.42072618 0.00612618 0.43306815 100.00 4.0189 4.05303232 0.03413232 0.84929498
120.00 1.4269 1.44611325 0.01921325 1.34650263 120.00 4.0605 4.09557515 0.03507515 0.86381353
140.00 1.4449 1.46510483 0.02020483 1.39835521 140.00 4.0801 4.12156870 0.04146870 1.01636480
Table 8
notice that in this case the method arrived to a minimum near to Experiments with Problem 4: simplex fitting
the nice value shown above. This indicates that sometimes the
Time Observed y Computed y Error Percent error
Nelder–Mead method may lead to excellent fittings, and give us
the impression of being an appropriate method to this model. 0.00 0.0000 0.00000000 0.00000000 0.00000000
Table 6 shows the calculated points and errors for this run. 5.00 1.0345 1.03450080 0.00000080 0.00007715
10.00 2.2807 2.28069943 −0.00000057 −0.00002491
Problem 4. The data for this problem are: QCO2 = 17, mtotal = 20.00 3.2398 4.12154082 0.88174082 27.21590276
30.00 3.5512 4.16179161 0.61059161 17.19395169
71.68, H = 8, db = 5.45 = 0.557. 40.00 3.7113 4.16233483 0.45103483 12.15301452
Running the lexicographical method with ngrid = 100, we 50.00 3.7992 4.16234210 0.36314210 9.55838326
obtained tCER = 14.159, tFER = 17.404, Z = 0.226667, W = 60.00 3.8697 4.16234219 0.29264219 7.56240003
0.100588 with f = 0.18176. After the local variations step, we 80.00 3.9631 4.16234220 0.19924220 5.02743295
obtained tCER = 14.132, tFER = 17.371, Z = 0.226667, W = 100.00 4.0189 4.16234220 0.14344220 3.56919046
120.00 4.0605 4.16234220 0.10184220 2.50811957
0.100588 with f = 0.18168. 140.00 4.0801 4.16234220 0.08224220 2.01569067
Using this initial point to initialize the Nelder–Mead
method, we obtained: tCER = 14.269, tFER = 17.537, Z =
0.226704, W = 0.097600 with f = 0.17931. Results are shown Running the lexicographical method with ngrid = 100, we
in Table 7. obtained tCER = 11.332, tFER = 14.402, Z = 0.267119, W =
In this case the Nelder–Mead method was applied starting 0.107272 with f = 0.25958. After the local variations step, we
from tCER = 13, W = 1, Z = 1. We got as result f = 0.86804 obtained tCER = 11.783, tFER = 14.963, Z = 0.267119, W =
with tCER = 8.431, W = 1.7064, Z = 0.2267. Once again we 0.087663 with f = 0.22563.
observe that the method converged to a bad solution, although Using this initial point to initialize the Nelder–Mead
in this case we notice significant errors in more than one point method, we obtained: tCER = 12.182, tFER = 15.371, Z =
as can be seen in Table 8. 0.259076, W = 0.087644 with f = 0.22080. Results are shown
in Table 9.
Problem 5. The data for this problem are the same as in
The Nelder–Mead method was applied starting from tCER =
Problem 4, except for: mtotal = 72.26.
13, W = 1, Z = 1, exactly the same initial values of Problem
Table 6 Table 9
Experiments with Problem 3: simplex fitting Experiments with Problem 5: global minimizer
Time Observed y Computed y Error Percent error Time Observed y Computed y Error Percent error
0.00 0.0000 0.00000000 0.00000000 0.00000000 0.00 0.0000 0.00000000 0.00000000 0.00000000
5.00 0.4166 0.38610018 −0.03049982 −7.32112825 5.00 1.4160 1.16400009 −0.25199991 −17.79660354
10.00 0.7722 0.77220036 0.00000036 0.00004654 10.00 2.3280 2.32800019 0.00000019 0.00000807
20.00 1.1284 1.20314104 0.07474104 6.62363035 20.00 3.2725 3.34297320 0.07047320 2.15349747
30.00 1.2435 1.24350009 0.00000009 0.00000687 30.00 3.5227 3.51005933 −0.01264067 −0.35883472
40.00 1.2912 1.27876336 −0.01243664 −0.96318476 40.00 3.6634 3.64461268 −0.01878732 −0.51283838
50.00 1.3219 1.30956895 −0.01233105 −0.93282760 50.00 3.7529 3.75289897 −0.00000103 −0.00002751
60.00 1.3471 1.33647631 −0.01062369 −0.78863393 60.00 3.8174 3.84000142 0.02260142 0.59206312
80.00 1.3868 1.38049622 −0.00630378 −0.45455592 80.00 3.9516 3.96632670 0.01472670 0.37267691
100.00 1.4146 1.41405728 −0.00054272 −0.03836531 100.00 4.0479 4.04790006 0.00000006 0.00000150
120.00 1.4269 1.43963595 0.01273595 0.89256102 120.00 4.0926 4.10053348 0.00793348 0.19384928
140.00 1.4449 1.45912587 0.01422587 0.98455721 140.00 4.1386 4.13447660 −0.00412340 −0.09963265
J. Martı́nez, J.M. Martı́nez / Computers and Chemical Engineering 32 (2008) 1735–1745 1743
Table 10 Table 12
Experiments with Problem 5: simplex fitting Experiments with Problem 6: simplex fitting
Time Observed y Computed y Error Percent error Time Observed y Computed y Error Percent error
0.00 0.0000 0.00000000 0.00000000 0.00000000 0.00 0.0000 0.00000000 0.00000000 0.00000000
5.00 1.4160 1.15607342 −0.25992658 −18.35639714 5.00 0.4259 0.42590001 0.00000001 0.00000249
10.00 2.3280 2.31214683 −0.01585317 −0.68097796 10.00 1.0096 0.85180002 −0.15779998 −15.62995036
20.00 3.2725 3.34297644 0.07047644 2.15359627 20.00 1.8654 1.70360004 −0.16179996 −8.67374063
30.00 3.5227 3.51006156 −0.01263844 −0.35877146 30.00 2.4846 2.55540006 0.07080006 2.84955580
40.00 3.6634 3.64461417 −0.01878583 −0.51279770 40.00 2.9375 3.06271512 0.12521512 4.26264251
50.00 3.7529 3.75289992 −0.00000008 −0.00000208 50.00 3.2299 3.22990002 0.00000002 0.00000051
60.00 3.8174 3.84000199 0.02260199 0.59207807 60.00 3.4321 3.37386283 −0.05823717 −1.69683772
80.00 3.9516 3.96632682 0.01472682 0.37267980 80.00 3.6526 3.60419257 −0.04840743 −1.32528688
100.00 4.0479 4.04789997 −0.00000003 −0.00000074 100.00 3.7742 3.77420014 0.00000014 0.00000373
120.00 4.0926 4.10053331 0.00793331 0.19384526 120.00 3.8573 3.89941168 0.04211168 1.09173976
140.00 4.1386 4.13447643 −0.00412357 −0.09963688 140.00 3.9239 3.99148249 0.06758249 1.72232952
4. We got as result f = 0.23321 with tCER = 12.286, W = flat characteristic of the objective function, which can lead to
0.087644, Z = 0.25706. As can be observed in this values and discrepant solutions from small fluctuations of the initial point.
in Table 10, in this case the Nelder–Mead method converged to Table 12 shows the calculated values for the fitting above.
a good solution, quite near to that obtained by local variations
of the global minimizer. It is interesting to notice that Problems 7.1. Statistical significance
4 and 5 are replicates of experiments at equal operational con-
ditions. This means that the Nelder–Mead method can lead to The analysis in this section has been suggested by an
quite different solutions, even for very similar situations. anonymous referee. Let us denote the parameter estimate by
B = (b1 , b2 , b3 )T , where b1 = tCER , b2 = Z, b3 = W (remem-
Problem 6. The needed data for this problem are: QCO2 = ber that there exists a biunivoque correspondence between W
6.3, mtotal = 73.1, H = 8, db = 5.45 = 0.699. and tFER ).
Running the lexicographical method with ngrid = 100, we For each experiment, we denote m the number of obser-
obtained tCER = 24.057, tFER = 31.165, Z = 0.282593, W = vations, k the number of parameters (k = 3 in our case). We
0.228374 with f = 0.37369. After the local variations step, we computed the variance–covariance matrix:
obtained tCER = 24.177, tFER = 31.309, Z = 0.282593, W = −1 2
0.220769 with f = 0.36504. Var(B) = (DT D) s ,
Finally, using this initial point to initialize the Nelder–Mead where s2 is the variance estimate and the elements of the matrix
method, we obtained: tCER = 24.083, tFER = 31.207, Z = D are:
0.283353, W = 0.220752 with f = 0.36355. Results are shown
∂y(ti )
in Table 11. dij = .
The Nelder–Mead method was applied starting from tCER = ∂bj
25, W = 1, Z = 1. We got as result f = 0.37252 with tCER = In order to test the null hypotheses E(bj ) = 0, we computed:
30.704, W = 0.169769, Z = 0.255336. As we can see, the val-
ues of the adjusted parameters tCER and W are far from the values
obtained with global minimization. Nevertheless, the value of bj2
cj = , j = 1, 2, 3,
the objective function was very near. This probably indicates a Var(bj )
where Var(bj ) is the diagonal element jj of the matrix Var(B).
Table 11 Finally, for testing the statistical significance of the model
Experiments with Problem 6: global minimizer parameters (Bates & Watts, 1988; Gallant, 1975), we com-
Time Observed y Computed y Error Percent error pared each cj with F1−ᾱ(1,m−k) (the Fisher distribution) for
ᾱ = 0.05. The results are displayed in Table 13. It can be
0.00 0.0000 0.00000000 0.00000000 0.00000000
5.00 0.4259 0.46634990 0.04044990 9.49751179
10.00 1.0096 0.93269981 −0.07690019 −7.61689724 Table 13
20.00 1.8654 1.86539961 −0.00000039 −0.00002086 Statistical significance
30.00 2.4846 2.64923503 0.16463503 6.62621870 Problem m−k F1−ᾱ(1,m−k) c1 c2 c3
40.00 2.9375 2.93750011 0.00000011 0.00000367
50.00 3.2299 3.17070186 −0.05919814 −1.83281645 1 7 5.59 375.1 680.9 75.02
60.00 3.4321 3.36328406 −0.06881594 −2.00506806 2 7 5.59 913.5 1504. 51.51
80.00 3.6526 3.65259930 −0.00000070 −0.00001929 3 9 5.12 735.6 1474. 97.86
100.00 3.7742 3.84797306 0.07377306 1.95466739 4 9 5.12 391.4 759.0 49.59
120.00 3.8573 3.97935021 0.12205021 3.16413575 5 9 5.12 2963 × 102 1169.3 86.49
140.00 3.9239 4.06743946 0.14353946 3.65808162 6 9 5.12 576.1 886.3 82.80
1744 J. Martı́nez, J.M. Martı́nez / Computers and Chemical Engineering 32 (2008) 1735–1745
deduced from this table that, in all cases, the value of α for chemical engineering processes where mass, heat or momen-
which F1−α(1,m−k) = bj2 /Var(bj ) is much smaller than 0.05, so tum transfer, thermodynamic equilibrium, chemical reactions
bj is statistically significant. This is not in contradiction with the and other phenomena are present. Thus, we propose the appli-
error flatness mentioned in Section 4 since this flatness has been cation of this global optimization tool not only in SFE, but in
detected rather far from the global solution. Of course, such a other fields of Chemical Engineering.
phenomenon may not occur in linear models.
Acknowledgements
8. Conclusions
We are indebted to an anonymous referee for his/her sharp
Fitting models to empirical data is a common task in comments. Following the referee’s suggestion we included the
Engineering and Applied Sciences. The objectives of fitting statistical analysis in Section 6. Thanks are also given to Prof.
parameters are quite diverse. Sometimes one is interested in Ronaldo Dias, who helped us with the statistical interpretation
discovering the true values of the parameters due to its intrinsic and indicated the references (Bates & Watts, 1988; Gallant,
physical meaning. Sometimes fitting a model is merely a way of 1975).
completing data at points were one does not have measurements.
Many intermediate situations exist. In our case, the objective of
fitting is prediction. Ultimately, we are interested in developing References
a model that should predict extraction curves as a function of the
so called contextual parameters but, for the development of such Adjiman, C. S., Androulakis, I. P., & Floudas, C. A. (1998). A global optimiza-
tion method, αBB, for general twice-differentiable constrained NLPs. II.
a model, we need to establish a correspondence between con- Implementation and computational results. Computers & Chemical Engi-
textual parameters and model parameters of an extraction model neering, 22, 1159–1179.
like Sovová’s. We invoke global optimization because different Adjiman, C. S., Dallwig, S., Floudas, C. A., & Neumaier, A. (1998). A global
model parameters may produce similar extraction curves usually optimization method, αBB, for general twice-differentiable constrained
holds for a particular process but it is very unlikely that it would NLPs. I. Theoretical advances. Computers and Chemical Engineering, 22,
1137–1158.
happen in all cases. Therefore, it is crucial to use, as “obser- Bates, D. M., & Watts, D. G. (1988). Nonlinear regression analysis and its
vation set” for (15), the true contextual parameters and “true applications. New York: Wiley.
model parameters” for each observed extraction experiment. Berna, A., Tárrega, A., Blasco, M., & Subirats, S. (2000). Supercritical CO2
Obviously, the only reasonable definition of “true model param- extraction of essential oil from orange peel; effect of the height of the bed.
eters” vinculated to the Sovová’s model are the ones that come Journal of Supercritical Fluids, 18, 227–237.
Birgin, E. J., Floudas, C. A., & Martı́nez, J. M. (2006). Global minimiza-
from global minimization of some sum of observation errors. tion using an augmented Lagrangian method with variable lower-level
Concerning the error minimization criterion adopted, we constraints. Available in Optimization Online, E-Print ID:2006–12-1544,
decided that the merit function given by the sum of weighted http://www.optimization-online.org/DB HTML/2006/12/1544.html.
absolute errors is the most adequate one, for two reasons: the Box, M. J. (1965). A new method for constrained optimization and comparison
sum of absolute errors is less prone to the presence of outliers with other methods. Computer Journal, 8, 42–52.
Brunner, G. (1994). Gas extraction: An introduction to fundamentals of super-
than the sum of squares and weights are necessary in order to pro- critical fluids and applications to separation processes. New York: Springer.
duce good predictions when t is small. Consequently, our merit Campos, L. M. A. S., Michielin, E. M. Z., Danielsky, L., & Ferreira, S. R. S.
function is nonsmooth but, fortunately, our global optimization (2005). Experimental data and modeling the supercritical fluid extraction of
procedure is insensitive to this fact. marigold (Calendula officinalis) oleoresin. Journal of Supercritical Fluids,
The optimization problem that leads to the global estima- 34, 163–170.
Del-Valle, J. M., Jiménez, M., & De-la-Fuente, J. C. (2003). Extraction kinet-
tion of Sovová’s parameters is restricted by the well structured ics of pre-pelletized jalapeño peppers with supercritical CO2 . Journal of
constraints defined in (17)–(19). Strictly speaking, we are in Supercritical Fluids, 25 (1), 33–44.
presence of a nonlinear programming problem with continuous Esquı́vel, M. M., Bernardo-Gil, M. G., & King, M. B. (1999). Mathematical
nonsmooth objective function and constraints. From the point model for supercritical extraction of olive husk oil. Journal of Supercritical
of view of computer time, more efficient methods than the one Fluids, 16(1), 43–58.
Fletcher, R. (1987). Practical methods of optimization. London, UK: Academic
exhibited here may be probably developed using augmented Press.
Lagrangian techniques (see Birgin, Floudas, & Martı́nez, 2006). Floudas, C. A. (1999). Deterministic global optimization: Theory, methods and
The underestimators used in Birgin et al. (2006) (see Adjiman, application. Kluwer Academic Publishers.
Androulakis, & Floudas, 1998; Adjiman, Dallwig, Floudas, & Gallant, A. R. (1975). Nonlinear regression. The American Statistician, 29,
Neumaier, 1998; Floudas, 1999) depend heavily on second- 73–81.
Gordillo, M. D., Blanco, M. A., Molero, A., & Martı́nez-de-la-Ossa, E. (1999).
derivative information that is not present in our problem. Solubility of the antibiotic Penicillin G in supercritical carbon dioxide.
However, more research is expected in order to extend the Journal of Supercritical Fluids, 15(3), 183–190.
techniques of Birgin et al. (2006) to estimation problems with Goto, M., Sato, M., & Hirose, T. (1993). Extraction of peppermint oil by super-
nonsmooth merit function as the ones considered here. critical carbon dioxide. Journal of Chemical Engineering of Japan, 26(4),
Finally, we must emphasize that the global optimization tool, 401–407.
Huber, P. (1981). Robust statistics. New York: Wiley.
which was applied in this work to a supercritical fluid extraction Kiriamiti, H. K., Rascol, E., Marty, A., & Condoret, J. S. (2002). Extraction rates
model, is not restricted to this particular process. Mathemati- of oil from high oleic sunflower seeds with supercritical carbon dioxide.
cal models and simulation are used to explain and predict many Chemical Engineering and Processing, 41(8), 711–718.
J. Martı́nez, J.M. Martı́nez / Computers and Chemical Engineering 32 (2008) 1735–1745 1745
Martı́nez, J. (2005). Extração de Óleos Voláteis e Outros Compostos com CO2 Reverchon, E., & Marrone, C. (1997). Supercritical extraction of clove bud
Supercrı́tico: Desenvolvimento de uma Metodologia de Aumento de Escala essential oil: Isolation and mathematical modelling. Chemical Engineering
a partir da Modelagem Matemática do Processo e Avaliação dos Extratos Science, 52(20), 3421–3428.
Obtidos. Doctoral Thesis. Food Engineering Department, State University Reverchon, E., & Marrone, C. (2001). Modeling and simulation of the super-
of Campinas, Brazil. critical CO2 extraction of vegetable oils. Journal of Supercritical Fluids, 19,
Martı́nez, J., Monteiro, A. R., Rosa, P. T. V., Marques, M. O. M., & Meireles, 161–175.
M. A. A. (2003). Multicomponent model to describe extraction of gin- Sousa, E. M. B. D., Martı́nez, J., Chiavone-Filho, O., Rosa, P. T. V., Domingos,
ger oleoresin with supercritical carbon dioxide. Industrial and Engineering T., & Meireles, M. A. A. (2005). Extraction of volatile oil from Croton
Chemistry Research, 42(5), 1057–1063. zehntneri Pax et Hoff with pressurized CO2 : Solubility, composition and
Mathews, M., & Seymour, S. (1997). Excel for Windows: The complete reference kinetics. Journal of Food Engineering, 69, 325–333.
(2nd ed.). Mc Graw Hill Inc. Sovová, H. (1994). Rate of the vegetable oil extraction with supercritical CO2 .
Méndez-Santiago, J., & Teja, A. S. (2000). Solubility of solids in supercritical I. Modeling of extraction curves. Chemical Engineering Science, 49(3),
fluids: Consistency of data and a new model for cosolvent systems. Industrial 409–414.
and Engineering Chemistry Research, 39, 4767–4771. Su, C., & Chen, Y. (2007). Correlation for the solubilities of pharmaceutical
Moura, L. S., Carvalho, R. N., Stefanini, M. B., Ming, L. C., & Meireles, M. A. compounds in supercritical carbon dioxide. Fluid Phase Equilibria, 254,
A. (2005). Supercritical fluid extraction from fennel (Foeniculum vulgare): 167–173.
Global yield, composition and kinetic data. Journal of Supercritical Fluids, Zhou, W., Anitescu, G., & Tavlarides, L. L. (2004). Desorption of polychlo-
35(3), 212–219. rinated biphenyls from contaminated St. Lawrence river sediments with
Nelder, J. A., & Mead, R. (1965). A simplex-method for function minimization. supercritical fluids. Industrial and Engineering Chemistry Research, 43,
Computer Journal, 7(4), 308–313. 397–404.
Perakis, C., Louli, V., & Magoulas, K. (2005). Supercritical fluid extraction of Zizovic, I., Stamenic, M., Orlovic, A., & Skala, D. (2005). Supercritical carbon
black pepper oil. Journal of Food Engineering, 71, 386–393. dioxide essential oil extraction of Lamiaceae family species: Mathematical
Povh, N. P., Marques, M. O. M., & Meireles, M. A. A. (2001). Supercritical modeling on the micro-scale and process optimisation. Chemical Engineer-
CO2 extraction of essential oil and oleoresin from chamomile (Chamomilla ing Science, 60, 6747–6756.
recutita [L.] Rauschert). Journal of Supercritical Fluids, 21, 245–256.