Professional Documents
Culture Documents
LNS0280012
obernardsophia.inria.fr
Abstra
t
These le
ture notes detail how to design a model for a biologi
al
pro
ess. The diÆ
ulty is due to the fa
t that, on the
ontrary to other
elds (me
hani
s, ele
troni
s, et
.) there does not exist any validated
law to des
ribe the behaviour of biologi
al systems. Nevertheless, these
systems satisfy the mass
onservation prin
iple. On this basis, the
le
ture explains how to derive a model whi
h will represent the main
mass transfer within the system. The method
onsists of three steps.
First determine the rea
tion s
heme and dene a model in whi
h the
mi
robial kineti
s are not spe
ied. Then nd an analyti
al expression
for the biologi
al kineti
s. Finally validate the model trying to test
separately the dierent hypotheses assumed during model design.
Contents
1 Introdu tion
System modelling in general is diÆ
ult and requires time to properly under-
stand the system and identify a model. This exer
ise is
ompli
ated when
the system integrates living organisms. On the
ontrary to domains like
physi
s where laws that are known sin
e
enturies (Ohm law, ideal gas re-
lationship, fundamental prin
iple in me
hani
s, thermodynami
prin
iple,
...)
an apply, most of the biologi
al models rely on empiri
al mathemati
al
expressions. These laws result from a priori ideas on the working of the
system (metabolism, trophi
relationships, et
.) or, in some rare
ases, have
been estimated from some experiments. Sin
e it is not possible to use laws
that are admitted by everybody and that have been extensively validated
and used, it is primordial to
hara
terise the reliability of the mathemati-
al expressions used during the model development. This implies that the
reliability of the used relationships must be sorted hierar
hi
ally during the
model development. In this
hapter, we will see how to organise the knowl-
edge in the model in order to distinguish a reliable part issued from the
mass balan
e and a more spe
ulative part whi
h will represent the ba
terial
kineti
s.
The model quality and the model stru
ture must above all be determined
with respe
t to the model obje
tives. Indeed, a model
an be developed for
very dierent purposes that must be
learly identied. Will the model be
used in order to:
Reprodu
e an observed behaviour
Explain an observed behaviour
Predi
t the system evolution
Understand some of the system me
hanisms
Estimate non measured variables
Estimate pro
ess parameters
A
t on a system to regulate and impose the values for its variables
Dete
t anomalies in the pro
ess working
...
774 O. Bernard
Depending on the modelling obje
tives and resour
es, a formalism must
be
hosen. If the spatial heterogeneity is important and must be taken into
a
ount in the model, a parameter distributed model must be written (using
e.g. partial dierential equations). If the modelling aims at the improvement
of a metabolite produ
tion during transient phases, the system dynami
s
must be represented in the model.
Moreover, besides its obje
tives, the model must also be in adequation
with the available data. Indeed a
omplex model involving a large number of
parameters will also require a large amount of data to identify its parameters
and to validate the model.
Finally, if we remember that most of the laws used in biology are spe
u-
lative, the key step in the modelling of biopro
esses is the model validation.
This step is often negle
ted, despite its determinant role to guaranty the
model quality. In parti
ular it is
ru
ial to demonstrate that the model
rea
hes properly the goals for whi
h it was developed.
Qin Qout
Sin
111
000
000
111
000
111
Substrate
Biomass
Product
11111
00000
00000
11111
00000
11111
00000
11111
0
1
00000
11111
00000
11111
0
1
00000
11111
00000
11111
0
1
0
1
0
1
0
1
0
1
1111111
0000000
1111111
0000000 0000000
1111111
0000000
1111111
0000000
1111111
0000000
1111111 0000000
1111111
0000000
1111111 0000000
1111111
0000000
1111111 0000000
1111111
0000000
1111111
111111111
000000000
000000000
111111111 111111111
000000000
000000000
111111111
000000000
111111111 000000000
111111111
000000000
111111111
000000000
111111111 000000000
111111111
000000000
111111111
000000000
111111111
000000000
111111111 000000000
111111111
000000000
111111111
000000000
111111111 000000000
111111111
000000000
111111111 000000000
111111111
000000000
111111111
000000000
111111111 000000000
111111111
000000000
111111111
000000000
111111111
000000000
111111111 000000000
111111111
000000000
111111111
000000000
111111111 000000000
111111111
000000000
111111111
000000000
111111111 000000000
111111111
000000000
111111111
000000000
111111111
000000000
111111111 000000000
111111111
000000000
111111111
000000000
111111111 111111111
000000000 000000000
111111111
000000000
111111111
000000000
111111111 000000000
111111111 000000000
111111111
000000000
111111111
000000000
111111111 000000000
111111111
000000000
111111111 000000000
111111111
00000
11111
000000000
111111111 000000000
111111111 000000000
111111111
00000
11111
000000000
111111111 000000000
111111111 000000000
111111111
00000
11111
000000000
111111111 000000000
111111111
00000
11111
00000
11111
00000
11111
biotransformation pro
ess along the fermentation. The fedbat
h pro
esses
are often in
losed loop. This operating mode is parti
ularly used when the
produ
t to be re
overed ne
essitates to empty the biorea
tor like e.g. for
intra
ellular
omponents.
111111
000000
000000
111111
000000
111111
000000
111111
000000
111111
000000
111111
000000
111111
0000000000
1111111111
1111111111
0000000000
0000000000
1111111111
0000000000
1111111111
0000000000
1111111111
0000000000
1111111111
0000000000
1111111111
0000000000
1111111111
0000000000
1111111111
0000000000
1111111111
0000000000
1111111111
0000000000
1111111111
0000000000
1111111111
0000000000
1111111111
0000000000
1111111111
0000000000
1111111111
0000000000
1111111111
0000000000
1111111111
0000000000
1111111111
0000000000
1111111111
0000000000
1111111111
1111111111
0000000000 0000000000
1111111111
0000000000
1111111111 0000000000
1111111111 1111111111
0000000000
0000000000
1111111111
0000000000
1111111111 0000000000
1111111111 0000000000
1111111111
0000000000
1111111111 0000000000
1111111111
0000000000
1111111111 0000000000
1111111111
0000000000
1111111111
111111
000000
000000
111111
000000
111111
000000
111111
000000
111111
000000
111111
1111111111
0000000000
0000000000
1111111111 0000000000
1111111111
0000000000
1111111111
1111111111
0000000000
0000000000
1111111111 111111111
000000000 0000000000
1111111111
000
111 000000000
111111111 0000000000
1111111111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
4−Sludge removal
000
111 5−Emptying 6−Back to step 1 ...
Figure 3: SBR (sequen
ing bat
h rea
tors): representation of the dierent steps
780 O. Bernard
3.4 Example 1
We will
onsider here the example of anaerobi
digestion. This pro
ess is
used to remove a polluting substrate (S1 ) from wastewater thanks to anaer-
obi
ba
teria. In fa
t, this is a very
omplex pro
ess whi
h involves several
dierent ba
terial populations [3℄. If the modelling obje
tive is to
ontrol
this intri
ate e
osystem in order to improve the pollution removal, then we
need a rather simple model. This is why, to limit the model
omplexity, we
onsider only two main ba
terial populations. We assume therefore that the
dynami
s
an be des
ribed by two main steps:
An a
idogenesis step (with a rate r (:)) in whi
h the substrate S is
1 1
degraded by a
idogeni
ba
teria (X1 ) and is transformed into volatile
fatty a
ids (VFA) (S2 ) and CO2 :
k1 S1 1! X1 + k2 S2 + k4 CO2
r (:)
(1)
782 O. Bernard
A methanogenesis step (with a rate r (:)), where the volatile fatty a
ids
2
are degraded into CH4 and CO2 by methanogeni
ba
teria (X2 ).
k3 S2 2! X2 ; + k5 CO2 + k6 CH4
r (:)
(2)
The
onstants k1 ; k2 ; k4 , respe
tively represent the stoi
hiometri
oef-
ients asso
iated with substrate S1
onsumption, produ
tion of VFA and
CO2 during a
idogenesis. k3 ; k5 and k6 respe
tively represent stoi
hiomet-
ri
oeÆ
ients asso
iated with VFA
onsumption and with CO2 and CH4
produ
tion during methanogenesis.
It is worth noting that in some sense this rea
tion s
heme has no biolog-
i
al reality sin
e biomasses X1 and X2 represent a set of dierent spe
ies. In
the same way for substrates S1 and S2 whi
h gathers a set of heterogeneous
ompounds. A lot of models
an be found in the literature for this pro
ess
[4, 3, 5℄. Generally, the des
ription of the pro
esses within the biorea
tor are
mu
h more detailed [6, 7℄ but it leads to models diÆ
ult to use for
ontrol
purpose.
4.2 Example 2
We will
onsider here the very simple example of the growth of a mi
ro-
organism X on a substrate S with rate r(:):
r(:)
kS !X
The yield
oeÆ
ient asso
iated with substrate
onsumption is denoted
k.
Mass Balan
e Modelling of Biopro
esses 783
We assume that the in
uent
ow rate is Qin and that the euent
ow
rate is Qout . We denote by x and s the total amount of biomass and substrate
in the volume V of the biorea
tor.
Let us
onsider the evolution of V (t), x(t) and s(t) between two very
lose time instants t and t + dt.
The evolution of the total liquid volume V is rather simple:
V (t + dt) = V (t) + Qin dt Qout dt
For the biomass, we have to take into a
ount the new biomass produ
ed
between t and t + dt. The produ
tion term in the whole volume V is r(:)V dt,
and thus: x
x(t + dt) = x(t) + r(:)V dt Qout dt
V
Note that, in order to
ompute the biomass lost in the euent (in the volume
Qout dt) we assume that the
on
entration in the small volume is the same as
in the whole biorea
tor (i.e. Vx ). At this point the hypothesis of homogeneity
in the rea
tor is
ru
ial.
In the same way, for the substrate, we must also
onsider the quantity of
substrate (with
on
entration Sin ) arriving between the two time instant:
s
s(t + dt) = s(t) + QinSin kr(:)V dt Qout dt
V
For a very small dt, we
an then derive the following equations:
8 dx x
>
>
> = r(:)V Qout (3)
>
> dt
< ds V
s
> dt
= kr ( : ) V + Qin S in Qout
V
(4)
>
>
>
>
: dV
dt
= Qin Qout (5)
Now, let us rewrite this model in term of
on
entration i.e. using the
variables X = Vx and S = Vs ). It is straightforward to see that we get the
following model:
dX = r (:) DX
dt
dS = kr (:) + D (S
dt in S) (6)
dV = Q
dt in Qout
where D = QVin
orresponds to the dilution rate.
Model (6) simplies for the various working modes:
784 O. Bernard
2 3 2 3
66
0 7 66
0 77
66 0 77 66 0 77
6 77 6 77
Zin 0
in = 666 77 ;
77 Q = 666 77 ; (22)
66 S1in 66 0 77
64 77 64 75
S2in 5 0
Cin qC ( )
An elimination of variables HCO3 , CO2 , and PC using equations (17),(15)
(18) and (20), leads to the following expression for PC ( ) (
f [9℄):
p 2
4KH PT (C + S2 Z )
PC ( ) = (23)
2KH
setting: = C + S2 Z + KH PT + kk6a r2 (:), we nally get
L
qC ( ) = kL a(C + S2 Z KH PC ( )) (24)
Note that the similitude between enzymati
rea
tion and growth of a
mi
roorganism are often used to justify the analyti
al expression of a rea
tion
rate [14, 15℄.
ν1,1 ω1
S1
ω2
S2 ω3
µ
ωk
νm,1
ωn h
Sm
νm,n h
X
nh X
m
(S1 ; : : : ; Sm ) = !k ( kiSi ) (34)
k=1 i=1
6 Model validation
6.2.2 Example 4
Let us
onsider here the simple example of the growth of the lamentous
fungi Py
noporus
innabarinus (X ) on two substrates, glu
ose (
arbon (C )
sour
e) and ammonium (nitrogen (N ) sour
e). We assume therefore that
the rea
tion s
heme is
omposed by a single rea
tion:
796 O. Bernard
N +C !X
The stoi
hiometri
matrix K asso
iated to this rea
tion is the following
( = (X N C )t ):
K = (1 k1 k2 )t ; and in = (0 Nin Cin )t (37)
Let us
onsider the two following ve
tors orthogonal to the
olumns of K :
1
v = (1 0)t and v = (1 0 )t
1
1
k1 2
k2
We
an then dene the following quantities:
Z t2
X (t1 ; t2 ) = X (t2 ) X (t1 ) + D( )X ( )
t1
Z t2
N (t1 ; t2 ) = N (t2 ) N (t1 ) D( )(Nin ( ) N ( ))d
t1
Z t2
C (t1 ; t2 ) = C (t2 ) C (t1 ) D( )(Cin ( ) C ( ))d
t1
whi
h will allow us to rewrite the following regressions asso
iated with v1
and v2 :
1
(t ; t ) = (t ; t ) (38)
X N
1 2
k1 1 2
X (t1 ; t2 ) =
1 (t ; t ) (39)
C 1 2
k2
It is now easy to verify if the relationships (38) and (39) are signi
ative
from a statisti
al point of view.
Figure (5) presents a validation example on the basis of a series of exper-
iment. The obtained regression is highly signi
ative. This means that rela-
tions (38) and (39) are valid. As a
onsequen
e, the rows of matrix K , whi
h
are orthogonal to v1 and v2 are ne
essarily of the type K = (1 1 2 )t .
Therefore the rea
tion s
heme is valid, and subsequently the mass balan
e
model as well.
Note that these te
hniques lead also to the estimate of the yield
oeÆ-
ients k1 and k2 .
Mass Balan
e Modelling of Biopro
esses 797
2.5 2.5
A B
X estimated from N (g/l)
1.5 1.5
1 1
0.5 0.5
0.5 1 1.5 2 2.5 0.5 1 1.5 2 2.5
X measured (g/l) X measured (g/l)
Figure 5: Validation of the linear relationship relating X and N (A); X and C (B)
6.3.1 Example
For example, Hansen and Hubbell (1980) study the
ompetition between two
ba
terial spe
ies in a
hemostat. The rea
tion s
heme is
omposed of two
growth rea
tions:
k1 S !X 1
k2 S !X 2
The growth rate asso
iated to these rea
tions is assumed to be of Monod
type, i.e.:
S
i (S ) = max i
S + Ks i
where max i and Ks i are the maximum growth rate and the half saturation
onstant asso
iated with substrate S for spe
ies i.
Hansen and Hubbell showed that the winner of the
ompetition predi
ted
by the model depends on the dilution rate. More pre
isely, the winner is
the spe
ies with the smaller ratio Ji = max Ks i . The
omparison of the 2
i D
ratios J1 and J2 leads to the study of the quantity r = max 1 max 2 with
max2 D
respe
t to the threshold value KKss 12 1. If we assume that we are in the
ase where D < max 1 < max 2 , then spe
ies 2 wins for a dilution rate
lower than D0 = max 2 KKss 11 Kmax
s2
1 Ks 2 , whereas for higher values, it is spe
ies
1 (see gure 6). These qualitative properties are veried experimentally (see
Figure 7).
µ max 1 − µ max 2
µ max 2 − D
K s1
−1
Ks2
µ max 1
−1
µ max 2
Species 2 survives Species 1 survives Both species disappear
Figure 6: Competition in a
hemostat with respe
t to the dilution rate (dis
ussion of
the quantity max 1 max 2 with respe
t to ks 1
max 2 D ks 2 1). We
onsider here the
ase where
D < max 2 < max 1
visual adequation between simulations and data. This subje
tive
riterion
an be reinfor
ed by an analysis of the
orrelation between predi
tions and
measurements.
6.4.1 Example
The following validation example presents the results obtained with the
anaerobi
digestion model exposed throughout the paper. Figures 8 and
9 present model simulations
ompared to dire
t measurements [9℄. The pe-
riods of time
onsidered for the
alibration step are shown on the gures.
The model
orre
tly reprodu
es the behaviour of the system for the
on-
sidered period in spite of the fa
t that it has been
alibrated only using
Mass Balan
e Modelling of Biopro
esses 801
150 250
200
Qco2 (l/h)
Qch4 (l/h)
100
150
100
50
50
0 0
10 20 30 40 50 60 70 10 20 30 40 50 60 70
Time (days) Time (days)
350 7.5
300 7
250
Qtot (l/h)
6.5
200
pH
6
150
5.5
100
50 5
0 4.5
10 20 30 40 50 60 70 10 20 30 40 50 60 70
Time (days) Time (days)
Figure 8: Comparison between simulation results and measurements for the gaseous
ow
rates and the pH. The periods
onsidered for the
alibration step are represented on the
time axis
positive, su
h that:
v+ K = 01k
Consequen
e: Let us
onsider the s
alar quantity w+ = v+ . It veries
the following equation: (35):
dw+
dt
= D(win+ w+ ) v+ Q( ) (40)
We have to assume an hypothesis for Q( ), whi
h is veried in most of
the
ases:
Hypothesis 2 (H3) There exists a positive real a and a real b, su
h that
Q( )
an be
ompared to a linear expression as follows:
v+ Q( ) av+ + b
Mass Balan
e Modelling of Biopro
esses 803
10 40
35
8
VFA (mmol/l)
30
COD g/l 25
6
20
4 15
10
2
5
0 0
10 20 30 40 50 60 70 10 20 30 40 50 60 70
Time (days) Time (days)
80 70
70 60
60
C (mmol/l)
Z (mmol/l)
50
50
40
40
30
30
20
20
10 10
0 0
10 20 30 40 50 60 70 10 20 30 40 50 60 70
Time (days) Time (days)
Figure 9: Comparison between simulation results and measurements for COD, VFA,
alkalinity and total inorgani
arbon. The periods
onsidered for the
alibration step are
underlined
dt D+a
+
(41)
+
if we apply propertyP1, we
an dedu
e: w+ max(w+ (0); DwDin+a b ).
In other words, wi+ i is bounded. Sin
e wi+ > 0, the state variables i
are bounded.
804 O. Bernard
One will verify that this system has three real equilibrium points (one of
them being the trivial equilibrium X = 0). These equilibria, in in
reasing
order, are respe
tively lo
ally stable, unstable and lo
ally unstable. With
respe
t to the parameters values, the equilibria will be positive (and therefore
admissible) or not. For the parametri
domains where there exists a single
positive equilibrium, this equilibrium is globally stable.
8 Con lusion
the data. The important point is that the data whi
h must be used for
model validation must not have been already used in the model
onstru
tion
phase. During the validation step, not only the quality of the model will
be assessed, but also its validity domains: the working domains (in terms of
state variable and inputs) where the model is satisfa
tory.
To
on
lude, we insist on the fa
t that the modelling step
an be very
long and expensive, but the quality of a model is a ne
essary
onditions to
ensure that a
ontroller or an observer based on it will properly work.
the dimension of the spa
e where u(t) lives. Note that we are looking for
a full rank matrix K . Otherwise, it would mean that the same dynami
al
behaviour
ould be obtained with a matrix K of lower dimension.
Determining the dimension of the v(t) spa
e is a
lassi
al problem in
statisti
al analysis. It
orresponds to the prin
ipal
omponent analysis that
determines the dimension of the ve
torial spa
e spanned by the ve
tors ki ,
rows of K . To rea
h this obje
tive, we
onsider matrix U obtained from a
set of N re
ording of v(t):
V = (~v (t1 ); : : : ; v~(tN ))
We will also
onsider the asso
iated matrix of rea
tion rates, whi
h is
unknown:
W = (w(t1 ); : : : ; w(tN ))
We assume that matrix W is of full rank. This means rst that there are
more measurements than rea
tions. It means also that the rea
tions are in-
dependent (none of the rea
tion rates
an be written as a linear
ombination
of the other ones).
Property 5 For a matrix K of rank k, if W has full rank, then the n
n matrix M = V V T = KW W T K T has rank k. Sin
e it it is a positive
symmetri
matrix, it
an be written, by:
M = P t P
where P is an orthogonal matrix (P T P = I ) and
0 1
1 0 ::: 0
B
B C
C
B
B
0 2 0 0C
C
B
B ..
.
... C
C
B C
=B
B
B k C
C
C
B
B 0 C
C
B
B C
B . . . .. C
. C
A
0 ::: 0
with i 1 i > 0 for i 2 f2; :::; kg.
Mass Balan
e Modelling of Biopro
esses 809
Referen es
[11℄ M. Hou and P. Mller, \Design of observers for linear systems with un-
known inputs," IEEE Trans. Autom. Contr., vol. AC-37, no. 6, pp. 871{
875, 1991.
[12℄ M. Daroua
h, \On the novel approa
h to the design of the unknown
input observers," IEEE Trans. Autom. Cont., vol. 39, no. 3, pp. 698{
699, 1994.
[13℄ J. Monod, Re
her
hes sur la
roissan
e des
ultures ba
triennes. Paris,
Fran
e: Hermes, 1942.
[14℄ L. A. Segel, Modeling Dynami
Phenomena in Mole
ular and Cellular
Biology. Cambridge: Cambridge University Press, 1984.
[22℄ O. Bernard and J.-L. Gouze, \Transient behavior of biologi
al loop mod-
els, with appli
ation to the Droop model," Mathemati
al Bios
ien
es,
vol. 127, no. 1, pp. 19{43, 1995.
812 O. Bernard