Professional Documents
Culture Documents
Download
Download
net/publication/2480602
CITATIONS READS
12 756
2 authors, including:
SEE PROFILE
All content following this page was uploaded by Kim Emil Andersen on 17 August 2015.
Plasma Glucose
Liver Periphery
Figure 2: ‘Bergman’s Minimal Model’ describing the glucose and insulin kinetics in an IVGTT study.
%
400
150
: the basal preinjection level of glucose [mg/dl].
< %
: the basal preinjection level of insulin [H U/ml].
Glucose (mg/dl)
300
Insulin ( U/ml)
[min I ].
50
;
: the insulin-dependent increase in glucose uptake abil- < %
50 100
4
ity in tissue per unit of insulin concentration above
[min I H U/ml I ].
0
. ( 87 /
STATISTICAL MODEL product between its width and its left end point, that is
5/
"8 A 6/ %
" +Q % 8 -,
7 9/ :* -, # +/
I
The glucose and insulin concentrations are positive, and ex- 4
"98
perience shows that the variability in the samples increases
where
# the random process /
I
with
<5" the mean. Therefore we assume
are log-normally distributed and introduce
that both and is well-known to depend on
a normal distribution with mean zero and variance I
* /
only and to follow
.
"
" " " ,
"
"
If we introduce"aAmore
subscript, e.g.
convenient notation using as a
, then the stochastic minimal model
"9 (7"
"!
(3
, can be rewritten as
=< >/ +7 /
(7
" <5" ,
F ;2 , , 8 -,
<=
<5" 32
F =< 4/ 7 /
, , B8 -,
" " ;2
F =< >/ +7 /
, , 8 -,
where and are normally distributed. This logarith-
mic transformation implies that the minimal model can be
?/ % +QN$ % I@-A 98 A ( ,
where
rewritten as
< , ,4/
" +Q@$&%
I
,
< , ,4/ ?/ 54B%Q@ J K C A >< % D I A ( ,
" =42+Q@ $< % 6,
< , ,4/ 8E/ F% )?AQ@ I A < %'
J K I (1)
" @?A+Q. < % B8 C# ED5 F -,
8 CC (
I A A ED5 F ,
I I
subject to the initial conditions
A 5 1
0 -,
and we for notational convenience have sup-
pressed the , functional
,54S, % , < % ,?,C
dependencies
D
of the parame-
, ters J K and .
A "!
5 <-0 F;2 ,
32 ;2
The conditional distributions for the processes
F and F are given as
Hereby the minimal model has no unit of measurement, and
32 @G
, , D*
, , -,9< -, 4/ -*
-* IH+J 9<
F I
;2 G 4/ D*
the three processes can be re-parameterized, such that they
, , , , 6, -,
-* H+J 9<
are on the same scale. F I
4/ -* K!
(2)
32 G
, , , , -,
Glucose disposal described by the deterministic minimal
model in (1) may, however, not comply with the actual
F
H+J I
glucose and insulin time courses from an IVGTT study. The statistical dependencies in this model specifies a
#
We therefore introduce a stochastic version of the mini-
# directed graphical model (Lauritzen, 1996) that can
#
mal model, where Brownian motion fluctuations , be illustrated by the directed acyclic graph in Fig-
and are used to model possible model deviations, i.e. ure 3, in which
, 4 , D* D* *
we have omitted the parameter ver-
, &% , < % , ?,+C#, DB, ,
' )( $ +* -, $ #
the stochastic minimal model takes the differential form J!K
$ &%
2 +Q=& % # B8
4
"6,
tices and . In addition we
( $ .* -, $ #
have added the random variables
BL L L M L
I I
$ %
" =4 +Q= &< % 8
4
"-, 5 = < -,
$
%
2 >?A+Q=
J K
< % B8
I
C# ($ I
ED= F L L <
and
-,
+* $ #
8
I4
-,
I where and are the random
served for specific values of .
variables actually ob-
I
L L
* * *
where , and denote the reciprocal variances (the so-
We model the measurement error on
7 "N
and by the ran-
7 N *
called precisions) of the Brownian motions accounting for *
dom white noise processes; and
L ;2 L ;2 N
with precisions
N
and , i.e. the model assumptions for F and F are
L ;2 G ;2 D* H J 32 D*
model deviations. The analysis of the differential form of
, , -,
F F
L 32 G ;2 D* N H J ;2 -* N
the stochastic minimal model can be transferred by simple F I
integration"
to that of an equivalent set of integral equations, , , (3)
F F
F
N I .
N
10 32
e.g. for we obtain
&/
"B8 "A
F
Q@>&% 4 $
Notice that the mean structures of the logarithmically
-, I transformed observations are modelled by the underlying
;2 ;2
8+* I #
4
&/ # "!
"B8 # " non-observable and hereby latent processes F
F ;2
,
F
and .
L implies that the posterior distribution factorizes as
, G 3 1 G 1" G , -,
1
2 F32 where represents our beliefs about
the parameters
1
be-
, G G
I fore having observed any data and and
form the likelihood determined by (2) and (3). Performing
2 F;2
inference about the parameters are reduced to the compu-
tational task of evaluating integrals over the state space of
I the latent variables and parameters, e.g.
L
the huge state space, however, MCMC methods provide
an approximative integration technique whereby marginal
posterior means, for example, are estimated by using
the sample mean from a representative series of random
Figure 3: Directed acyclic graph illustrating the statistical draws from the posterior distribution. These random
dependencies for the , and processes. draws are
chain
obtained
, --, by !!!
constructing an irreducible Markov
4 , 4 6, with state space P
and with stationary distribution . MCMC sampling was
4 SIMULATION BASED INFERENCE
first introduced by Metropolis et al. (1953) and was sub-
/ / ! ! ! /
If we assume
-, -,
that
6,
, ,
, where
sequently adapted by Hastings (1970). Over the past ten
years such methods have enjoyed widespread popularity
, denotes the three latent processes
within the statistical literature and there exist various stan-
in (2) and that the observed data are described by the model
dard techniques for constructing the necessary chains (see
given in (3), then the statistical problem is to estimate the
e.g. Brooks, 1998; Robert and Casella, 1999).
vector of unobserved parameters
/ !, 4 , ,C , ?, DB, &% , < % , 0 , 02, -* ,D* ,D* ,D* ,D*
J!K
N N 4.1 Metropolis–Hastings updates
L L
given the vector Metropolis–Hastings updates are used to move around
of , the logarithmically transformed obser-
, where denotes the set of ob-
vations the parameter space by proposing moves which are sub-
G
servation times. sequently either accepted or rejected. Suppose that
, For this we need to establish the posterior ,
distribution , which represents our beliefs about we are currently in configuration
! , , then we draw
,
the feasible structures of after having observed the a new,configuration
%,
from some proposal den-
data . Dividing all the quantities into subsets of data, , sity " $# . This proposal is then accepted with
latent processes, and parameters, , the statistical de- probability
pendencies in the model defined in (2) and (3) can be sum-
& , , 2('*),+.-=Q ,
% , G % , ,
!
G
marized by the simple directed acyclic graph in Figure 4, " #
$# , , , 0/
where it should be noticed that the time aspect depicted in " $#
Figure 3 has eventually vanished.
However, if the proposal is rejected, the chain remains in
the current state. Many proposal distributions lead to ir-
reducible Markov chains which ensure the convergence of
the posterior mean estimate, though several forms possess
useful analytic properties. For example, ,when!,the pro-
G
posal distribution " is symmetric, i.e. " $#
% , ,
" # , the acceptance function reduces to
2 D*
depend locally on the updated quantity itself, its parents,
its children and its childrens other parents, the so-called P I , I ,
G 2
* * * F , 4
Markov blanket. This approach eventually appears to be I
I
very inefficient due to bad mixing properties of the algo- , 1 ,
rithm caused by the highly correlated quantities.
with
Another approach would be to block the updates into
, 12
Q
* ?< 4 8.* ?< 4 8.* ?< 4 ,
and , by first proposing a new state of the parameters
drawn from a symmetric proposal distribution " # .
Conditioned on and the acceptance probability sim-
and
< M< 2 , 2 ,4/-,
ply becomes
&
'*)2+.- Q2,
% % % , G G ! < M< I 2 , I 2 ,4/6,
# 1" 1 , / G G
< M< I 2 , I 2 ,4/K!
Then afterwards updating by proposing a new state I I
drawn from a symmetric proposal distribution " $# , Furthermore
where the acceptance probability conditioned on and , # G
L G L
2 ,-* N " L G L I 2 ,D* N
* , 4 , , ,
is I
&
'*),+ - Q ,
1 , G G ! *
$# 1 , / G G N N
with
However, again the highly correlated quantities are ex- , ,
Q
* L 4 8 * L 4 ,
N N
pected to lead to a very inefficient MCMC simulation al-
gorithm.
Alternatively we choose to update by proposing a can- where G G denotes the number of observations.
!
didate from a symmetric proposal
and then simulate from
! G
distribution " #
. This proposal is sub-
In practice the Metropolis–Hastings updating scheme de-
scribed above can be used either to update the entire state
sequently accepted with probability
vector or individual elements therein. Since proposals con-
& , , - Q2,
% % " %, G G ! sisting of perturbations of the entire state vector tend to
$# 1 1" , / G G have correspondingly small acceptance probabilities, the
proposed perturbations need to be kept at a suitable small
By updating and simultaneously we may suppress the level in order to achieve a satisfactory acceptance proba-
strong inter-relation between them, and thereby improve bility. However, this full component Metropolis–Hastings
the simulation algorithm’s mixing properties and overall ef- updating technique is known to lead to a Markov chain with
ficiency. slow convergence properties and therefore typical MCMC
algorithms consist of a sequence of updates focusing upon
In order to guarantee that the posterior distribution is dom- each element of the state vector in turn. In Section 5, we
inated by
1
the likelihood, we adopt a vague prior distri- shall combine this approach, known as single component
bution on . Thus we assume that the elements Metropolis–Hastings, with updating of the entire vector to
of are,5independent
4S,
and that each of 0 the system param-
, ?,+C#, DB, % , < % , 0 obtain a MCMC simulation algorithm with good conver-
eters J K and * -* -* -*
are log-normally
, , , * gence and mixing properties.
distributed and that the precisions
Gamma-distributed, all with large variances.
and
N
are
N
5 RESULTS
Consequently the prior density takes the simple form
1A3 4 "?B C9 D5 &%' < %' In this section we consider the performance of our approach
J!K
C* C* C* C* C*
P 0 0 " 6, on data simulated from the stochastic version of the mini-
N N mal model derived in Section 3. The simulated data comes
Table 1: The choice of parameter values used for simulating the data depicted in Figure 5, prior assumptions, proposal
distributions (numbers indicate the standard deviances of the proposals), initial values and the resulting posterior means,
standard deviances and credible intervals for the unknown parameters.
One component
Full component
C.I.
0 4 0 4 4 0 0 0 0 0 4 0 0 0 0 0 4 0 4 '
0 4 R 4 ; R 0
NN
& ( +0 0 0 0 0- 4 0
&)
'
–
–
R ; 4
300
mg/dl, i.e. we have chosen parameters according to the nor-
Glucose (mg/dl)
mal glucose tolerant population, see Table 1 for specific
200
details.
The logarithmic transformation of the three latent
processes brings them approximately on the same
100
scale. We will therefore assume that the Brownian
motions inherent within each process are indepen-
*
dent and *
identically *
distributed. *
Subsequently we
0
0 50 100 150
let , whereby we have reduced the Time (minutes)
/
number of parameters
"-,
to be"
estimated. The three latent
87
processes / Q and were simulated according )
0.01
Remote insulin (time
to (2) with . The data available for statistical
inference was then generated /. according to (3) with obser-
, , ,0 ,1 ,, ,
vations recorded at , where
* ,2 , Q , Q , Q0=, Q , , Q 3 , , ,40 ,51 ,* ,
, 1 , 0 ,1 ,, ,
0.005
ter of interest when using the single component Metropolis- 0 50 100 150
Hastings algorithm whereas less radical perturbations are Time (minutes)
proposed when updating the entire vector. We base our a Figure 5: Simulated data. From top to bottom is shown:
priori knowledge upon reported normal ranges for the pa- The glucose concentration; the remote insulin action; and
rameters of interest, however, we may pre-record approxi- the insulin concentration. Dots represent observed data and
mate basal lines in glucose and insulin prior to the exper- the lines represent the underlying processes.
0.035
SG
87
)
0.01
0.025
0.005
SI
0.0004
0
5 6 7 8 9 10 0 50 100 150
Sweeps (in millions) Time (minutes)
Figure 6: Trace plots from the Markov chain: From top to Figure 7: Posterior mean (white line) and 95% credible in-
bottom: The trace plot of the glucose effectiveness J L and terval superimposed in gray for the remote insulin action
the insulin sensitivity J K . (black line).
iment and will therefore model these two quantities with Furthermore, our approach also allows us to assess the un-
highly informative priors. Table 1 provides details on the certainty on the latent processes, e.g. the remote insulin
(7ac-
"
prior information used within the MCMC simulation. tion may be of special interest. The posterior mean of
is shown in Figure 7 together with its 95 per cent credible
The MCMC simulation algorithm is based upon the ran- interval.
dom walk Metropolis–Hastings
!
updating mechanism. Thus
a candidate is generated by a symmetric perturbation
of the current state , i.e. for the full-component ,.
pro- 6 DISCUSSION
posal
diag 6
4
distribution
,
we !!!
,
&
4 will choose
H J , where
is the diagonal covariance matrix
In this work we have adopted a graphical model as a pow-
specified by the standard deviances provided in Table 1.
*
erful and flexible modeling framework for parameter esti-
Likewise we update the single elements in by random
*
mation in general systems of coupled differential systems.
walk Metropolis–Hastings. Note that we update
N
and
In particular we have addressed the problem of regulariz-
N
separately.
ing the highly ill-posed inverse problem possessed by cou-
The0
MCMC algorithm was initiated in an arbitrary vector pling the three differential equations in Bergman’s minimal
far from the ‘true’ , see Table 1 and ran for 10 000 000 model for glucose and insulin kinetics. We have illustrated
iterations. The chain proposes single component updates how the reconstruction may efficiently be implemented by
30 per cent of the time leading to an overall acceptance a Bayesian graphical model where posterior sampling is
probability of 54.7 per cent. In Figure 6 we give the trace performed through the use of Markov chain Monte Carlo
plots for the latter 5 000 000 samples obtained for the glu- techniques. Hereby we have made estimation of the highly
cose effectiveness, J9L , and the insulin sensitivity, J K . correlated parameters in the minimal model possible even
though we consider all three differential equations simulta-
The trace plots for the remaining parameters in ex-
neously, and we have provided a quantitative assessment of
hibit similar behavior and it is therefore apparent that the
the metabolic portrait of a single individual, which is very
Markov chain exhibits rather excellent mixing properties.
useful in the prognosis and prevention of diabetes.
Inference about is based upon the latter 5 000 000 sam-
ples. The posterior means and 95 per cent credible inter- The method has been performed on simulated data and
vals for the parameters are given in Table 1, from which we seems rather promising and very robust, but must also be
may conclude that the Markov chain is sampling from the proven useful on real field data. The Bayesian approach is
desired distribution. Note that all the ‘true’ values of the based on specification of prior distributions on the param-
parameters are within the corresponding credible intervals, eters, which is a strength when prior information is avail-
implying that the Bayesian approach to ill-posed inverse able. We have used priors based on reported normal ranges,
problems is a useful tool for regularization of the minimal but with relatively large variances, requiring a prior sensi-
model. tivity analysis to be considered.
The minimal model is developed for a single individual,
and the extension to population modeling for the purpose
of validating the normal metabolic portrait, is a potential
of our method that is worth pursuing. However, this would
probably require an even faster and more efficient simu-
lations algorithm based upon e.g. the Metropolis adjusted
Langevin algorithm.
Acknowledgements
References
Bergman, R. N., Ider, Y. Z., Bowden, C. R. and Cobelli,
C. (1979). Quantitative estimation of insulin sensitivity,
American Journal of Physiology 236(6): E667 – E677.
Brooks, S. P. (1998). Markov chain Monte Carlo method
and its application, The Statistician 47: 69 – 100.
Gaetano, A. D. and Arino, O. (2000). Mathematical mod-
elling of the intravenous glucose tolerance test, Journal
of Mathematical Biology 40: 136 – 168.
Hastings, W. K. (1970). Monte Carlo sampling methods
using Markov chains and their applications, Biometrika
57: 97 – 109.
Lauritzen, S. L. (1996). Graphical Models, Clarendon
Press, Oxford, UK.
Martin, B. C., Warram, J. H., Krolewski, A. S., Bergman,
R. N., Soeldner, J. S. and Kahn, C. R. (1992). Role of
glucose and insulin resistance in development of type 2
diabetes mellitus: results of a 25-year follow-up study,
The Lancet 340: 925 –929.
Metropolis, N., Rosenbluth, A. W., Rosenbluth, M. N.,
Teller, A. H. and Teller, E. (1953). Equations of state cal-
culations by fast computing machines, Journal of Chem-
ical Physics 21: 1087 – 1092.
Pacini, G. and Bergman, R. N. (1986). MINMOD: a com-
puter program to calculate insulin sensitivity and pan-
creatic responsivity from the frequently sampled intra-
venous glucose tolerance test, Computer Methods and
Programs in Biomedicine 23: 113 – 122.
Robert, C. P. and Casella, G. (1999). Monte Carlo Statisti-
cal Methods, Springer-Verlag, New York.