2000-The Mathematics of Markov Models What Markov Chains Can Really Predict in Forest Successions

Ecological Modelling 126 (2000) 285 – 298
www.elsevier.com/locate/ecolmodel
The mathematics of Markov models: what Markov chains

can really predict in forest successions
Dmitrii O. Logofet a,*, Ekaterina V. Lesnaya b
a
Laboratory of Mathematical Ecology, Institute of Atmospheric Physics RAS, 3 Pyzhe6sky Lane, Moscow 109017, Russia
b
Mechanical-Mathematical Department, Moscow State Uni6ersity, Vorob’yo6 Hills, Moscow 117234, Russia
Accepted 23 February 2000
Abstract
The formalism of Markov chains is presented as a mathematical description of a succession process with a known
scheme of successional transitions and their time characteristics. The basic hypotheses and assumptions behind the
formalism are formulated and discussed from the viewpoint of the prognostic potential in the resulting models.
Fundamental mathematical results from the theory of stochastic processes guarantee the existence of a stationary
probability distribution of states in any finite, regular, time-homogeneous Markov chain, and any initial distribution
of chain states does converge to some stationary distribution, matching the paradigm of classical succession theory.
We propose a general method for estimation of time-homogeneous transition probabilities applicable for any kind of
successional scheme, yet with strong requirements to the expert data: average duration times should be known for
each specified stage of succession as well as the likelihood proportions among the transitions from the ramifying states
of the scheme. Constructed by this method on a single set of data, the discrete- and continuous-time models are
proved to converge to the identical distributions, while the average sojourn times in each state are shown to be also
the same in the two models. Succession through forest types in a mixed (coniferous – deciduous) forest in Central
Russia is considered as an example. © 2000 Elsevier Science B.V. All rights reserved.
Keywords: Succession; Markov-chain models; Discrete; Continuous; Transition probabilities; Estimation; Prediction
1. Introduction are nowadays called Marko6-chain models of suc-

cession. These models have long been used to
When the course of succession in a plant com- formalise the process of successional changes in
munity is described as a chain of discrete events terrestrial vegetation and to produce certain pre-
each designating the onset of the subsequent stage dictions in quantitative form. Publications on us-
in succession, the first step is made to build what ing this kind of models in ecological studies were
booming in the 1970s and early 1980s (see reviews
* Corresponding author. Tel.: +7-95-2846047; fax: +7-95-
in Horn, 1975; Usher, 1979; Horn, 1981; Usher,
9531652. 1981; Mirkin and Naumova, 1984), after which
E-mail address: danilal@postman.ru (D.O. Logofet) the boom was markedly eroded. Recent applica-
0304-3800/00/$ - see front matter © 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 3 0 4 - 3 8 0 0 ( 0 0 ) 0 0 2 6 9 - 6
286 D.O. Logofet, E.V. Lesnaya / Ecological Modelling 126 (2000) 285–298
tions however revived both practical interest in the transition matrix P from empirical evidence
Markov models and theoretical debate on what which does not require the ergodic hypothesis and
these models can really predict in vegetation dy- which is quite different from what is proposed in
namics (see e.g. Usher, 1992; Acevedo et al., 1995; the majority of landscape models relying on regis-
Li, 1995; Logofet, 1999; Balzter, 2000). tration of the proper transitions in a set of obser-
What a model can predict depends on what vations (see e.g. Turner and Gardner, 1991;
assumptions (or postulates) the model is based Vinogradov and Schitov, 1994).
upon and in which way it is constructed. In the We illustrate the analysis with an example of
context of Markov-chain models this can be seen succession through forest types in a mixed (conif-
from the matrix of transition probabilities (or erous-deciduous) forest in Central Russia. Fig. 1
transition matrix), P(s,t) =[pij (s, t)], the core of presents a scheme that describes, in the long term,
any chain model, whose entries show the proba- the course of natural overgrowing after a tillage
bility that the chain, being in the state i at time s, area has been abandoned — a scenario of forest
will be in the state j at time t. When the state of secondary succession typical for the land-use his-
the chain is registered at discrete instants of time, tory in the region (Korotkov, personal communi-
we deal with a discrete chain, and when the time cation). Since no disturbances are considered in
variable is considered continuous, the chain is this scheme, the general direction of changes is
called continuous. The former may seem more unique: from the pioneer stage of succession to
natural to a field ecologist who observes the state the climax, which is considered to be a polydomi-
of vegetation year by year, or in a longer time nant spruce-broad-leafed forest (Popadyuk et al.,
step. The latter however captures the natural 1994). Expert knowledge of how long a stage of
course of time and has certain advantages for a succession can last (numbers of years in boxes of
modeller, which will be clear from the comparison the scheme) is gained from both direct observa-
of both approaches to be presented in this paper. tions and available historical data.
Relations between the discrete-time and contin-
uous-time Markov processes is a theme that
mathematical books could hardly leave un- 2. General formalism of Markov-chain models for
touched. However, traditional textbooks, like successions
Doob (1953), Chung (1967), or Ventzel and
Ovcharov (1991), treat the theme in a special To construct a Markov chain, one identifies the
context of discrete approximations for a given states of the chain with the specified stages of
continuous process. This is different from the succession and enumerates them by natural num-
problem considered here: two kinds of processes bers from 1 to K, as shown in Fig. 1. The set
constructed by the same method on a single data S= {1, …, K} is thereafter the set of all the chain
set. states, or stages of succession. Although the order
We also analyse briefly the assumptions of in- of stages in this list may be arbitrary, marking the
6ariance and ergodicity, whose acceptance or re- pioneer stages with the lowest and the final stages
jection underlies, either explicitly or implicitly, with the largest numbers has certain technical
any case study applying a chain-modelling ap- convenience. Let X(t)S indicate the state at time
proach and determines the general properties of instant t and thus serve, with the course of time t,
the resulting model including its prognostic abil- a model representation for the course of succes-
ity. Furthermore, we present a rigorous analysis sion. Process X(t) is called a Marko6 chain when
of the mathematical properties inherent in what the following property holds true: if the chain is in
are called time-homogeneous Markov chains, i.e. a state i at a given time moment s, then the
when the matrix P = P(s, t), which is generally a probability pij (s,t) that it will be in a state j at a
function of both time instants t and s, actually subsequent time instant t\ s does not depend on
depends only on the difference t −s. The outcome the chain behaviour before the moment s (the
of the analysis suggests a mode of constructing Marko6 property, see e.g. Karlin, 1968).
D.O. Logofet, E.V. Lesnaya / Ecological Modelling 126 (2000) 285–298 287
In formal terms, for any sequence of time mo- even in the case where the scheme of transitions is
ments t1 Bt2 B …B tn − 1 Btn, we should have non-ramified, unlike in Fig. 1 (see, e.g. Denisenko
et al., 1996, for succession in the forest-steppe
Pr{X(tn )=in X(t1) = i1, …, X(tn −1) =in − 1}
zone).
=Pr{X(tn ) =in X(tn − 1) = in − 1}. (2.1) If the hypothesis of invariant environment con-
ditions is adopted, then the Markov chain can be
Thus, the conditional probability Pr{X(t)=
considered time-homogeneous, i.e.
jX(s)=i}=pij (t, s) by definition. Arranged in a
2D-array, the transition probabilities pij (s, t)=pij (s−t), Ö i, jS.
{pij (s, t)i, j= 1, …, K} compose the transition
probability matrix, of size K ×K, denoted nor- This means that each transition probability can
mally by P(s, t). only depend upon the length of the time interval
The question may arise why we are speaking of in which it should act, rather than the actual time
probabilities while the essence of classical succes- instant at which it is considered. Speaking in
sion theory is a regular (deterministic) nature of ecological terms, the in6ariance hypothesis should
any successional dynamics. The logical answer mean that no important changes occur in the key
would be because we normally see, when observ- factors of the environment affecting the course of
ing particular realisations of the ‘regular’ dynam- succession during the period of prediction as com-
ics, quite a variety of succession trajectories from pared with the factors during the period of obser-
the pioneer stage to the climax one(s), and which vations in which estimation of the transition
particular trajectory is realised in a particular case probabilities can be relied upon.
may depend on a number of uncertain terms (like Whether or not the invariance hypothesis can
availability of proper seeds, variations in whether be adopted is a matter of formulation for each
conditions, etc.). Even this alone makes room for particular modelling case study. Here we concen-
stochastic description of the course of succession. trate on the mathematical benefits to be gained
Besides, whether or not a transition occurs in a from the invariance hypothesis. It will be seen
fixed time interval is also a matter of probability, that all the mathematical properties of the chain
Fig. 1. Ideal course of succession through forest types (after-tillage series) in Prioksko-Terrasnyi Biosphere Reserve, Moscow
Region, Russia (Korotkov, V.N., personal communication).
model, including its predictions, are entirely deter- the knowledge of which transition is more likely
mined by the transition probability matrix. In might be formalised in the above ‘weight propor-
Section 7 we discuss what might be done when the tions’. Assumptions (a1) and (a2) are made to
invariance hypothesis is inappropriate. substantiate the mode of estimating the transition
Another major assumption to be or not to be probabilities that will ensue from our analysis. In
behind any quantified Markov-chain model is the what follows, we express mi in time steps.
so-called ergodic hypothesis, and we shall also
discuss it in relation to the mode of estimating the
transition probabilities from empirical evidence. 3. Discrete model formalism
From the modelling viewpoint, the statistical
properties of process X(t) are of practical impor- The transition probabilities require the follow-
tance. In particular, the dynamics of vector ing conditions to hold for any time instant n=
x(t)=[x1(t), x2(t), …, xK (t)], which is the finite 0, 1, 2, …:
distribution of K probabilities xi (t), each meaning
pij (n)] 0, i, j= 1, …, K (3.1)
the probability, P{X(t) =i x(0)}, that the chain
K
is in state i at time t, given an initial distribution % pij (n)= 1, i= 1, …, K (3.2)
x(0). The basic (matrix-vector) equation for x(t), j=1
x(t +Dt)=x(t)P(Dt) (2.2) which are called the (row) stochastic matrix prop-
erty. The Markov property (2.1) implies that, for
ensues from the probability meanings and the any two time instants n, m, we have
formal definitions adopted, while more intricate
properties of x(t), in particular, the existence and P(n+m)= P(n)P(m) (3.3)
stability of a limiting distribution x() or how The Kolmogorov–Chapman equation, with the
long, on the average, it takes to reach certain initial condition:
states, were established in the theory of finite
Markov chains, at least in the homogeneous case P(0)=I (3.4)
(see, e.g. Kemeny and Snell, 1960; Prokhorov and where I= diag{1, …, 1} designates the diagonal
Rozanov, 1987). The limiting distribution(s) and (identity) matrix of the proper size.
the average times are usually considered to be the In particular, P(n+1)= P(n)P(1) and Eq. (2.2)
major outcome of the mathematical analysis fol- hence transforms into
lowing the estimation of transition probabilities.
Theoretically, further formalisation and analy- x(n +1)= x(n)P(1) (3.5)
sis are possible along the following two ap- a system of first-order, linear difference equations
proaches: (a) time is discrete and process X(t) is with a constant matrix P(1)=P of one-step tran-
observed only in certain time steps, i.e. t= sition probabilities.
0, 1, 2, …, or (b) time is continuous (0 5t B) The initial-value problem for Eqs. (3.3) and
and process X(t) is observed at all t. (3.4) can be easily shown to have the solution
We proceed along the both approaches, assum-
ing that (a1) the average waiting time, mi, till the P(n)= P n n= 0, 1, 2, … (3.6)
state i changes into another one is known for each and that for Eq. (3.5) to have respectively
state of the chain, and (a2) if a state can change
x(n)= x(0)P n =0, 1, 2, … (3.7)
directly into more than one others (ramification of
the scheme), then all these transitions have known Given an initial distribution of state probabilities
weight proportions among them (e.g. are x(0) — identified usually with the relative area
equiprobable in a particular case). distribution of the corresponding types in a mo-
The values of mi could be gained from the saic of vegetation — Eq. (3.7) turns calculation of
expert knowledge of the time period that the the model distribution at any given time instant
forest remains in the corresponding state i, while into a matter of simple matrix multiplication.
Now we consider how the assumption (a1) en- To calculate the remaining pij, j" i, one can use
ables one to use the available data to calculate the assumption (a2). Let there be ki states {r, s, …, z}
probabilities pij of transitions from state i into j in in the scheme attainable from state i in one step
one time step. Let random variable Wi be the and let expert knowledge believe their likelihood
waiting time of X(t) in a state i (i= 1, …, K), i.e. to be in proportions bir :bis :…:biz. We normalise
the time the chain has been waiting until the state these proportions in a way that bir + bis + …+
changes into another one. Let the law of probabil- biz = 1 and call them weights. If we put now bij = 0
ity distribution for Wi be given by function for any state j non-attainable from i in one step,
Fi (n)= Pr{Wi 5 n}, n = 1, 2, … According to the then the transition probabilities
definition, we have
pij = bij (1−pii ), j" i= 1, 2, …, K (3.9)
1−Fi (n)=Pr{Wi \nX(0) =i } obey both the stochastic conditions (Eqs. (3.1)
= Pr{X(1) =X(2) = …=X(n) =iX(0) and (3.2)) and the expert-given weight
proportions.
=i } If, in a particular case, all the transitions from
state i to other ki attainable states are equiproba-
which, by the Bayes formular of conditional prob- ble, then, for each i, all non-zero bij are equal, so
abilities and Markov property (Eq. (2.1)), obeys that
the following chain of equalities.
1
pij = , j" i (3.10)
Pr{X(1)=X(2)= …=X(n) =iX(0) = i } mi ki
1 Note that, given a scheme of transitions, any
= Pr{X(0) = X(1) =…
Pr{X(0) =i } reasonable choice of the time step insures mi ] 1
for any i (mi = 1 implies Wi 1). If we presume
=X(n)= i }
mi \ 1, it follows that pii \ 0 for any i. Mathemat-
1 ically, this means regularity of the Markov
= Pr{X(n) = iX(0) =…
Pr{X(0) =i } chain.The assumption that mi \ 1 is not crucial
and can be relaxed (for transient states) without
= X(n− 1) = i } · Pr{X(0) = …
affecting the major results (see Section 5). Techni-
cally, one could increase mi for any state i by
=X(n−1) =i }
reducing the time step of the chain, so that the
pii assumption is not too restrictive.
= Pr{X(0) =…
Pr{X(0) =i } The basic result for regular chains is the follow-
ing (Karlin, 1968): there exists the limiting matrix
=X(n−1) =i } =… =p nii P() = nlim P n such that for any initial distribu-

tion x(0) we have
For the expectation of Wi we have by definition
(we put formally Fi (0) =0): x(n)= x(0)P(n) x(0)P() = x* as n
(3.11)

mi = E(Wi )= % n[(1 − Fi (n − 1)) −(1 − Fi (n))] Vector x* is called the stationary distribution
n=1 due to the property x*P() = x*. It can be found

1 by a standard routine for finding the eigenvector
= 1+ % (1− Fi (n)) = % p nii =
n=1 n=0 1 −pii of the eigenvalue l=1 for matrix P(1) (see, e.g.
Roberts, 1976). If P(1) has only one unitary ei-
So, genvalue, then x* (Eq. (3.11)) can be easily shown
to be actually independent of x(0) as all the rows
1 of matrix P() are identical in this case and
pii =1− , i=1, 2, …, K (3.8)
mi equal to x*. Yet there may well be cases (rarely
addressed in applied manuals though) where P(1) Æ l11 l12 ... l1K Ç
has more than one eigenvalues lj =1, the limiting Ã Ã
L= Ã Ã, lii = − li
matrix P() still having some regular pattern Ã Ã
interpretable in terms of the succession scheme l
È K1 l K2 ... lKK É
behind it (see, e.g. Denisenko et al., (1996) as an
example of two climax states in the succession with the following property guaranteed by Eq.
scheme for the forest-steppe zone and respectively (4.2) for any finite Markov chain:
two unitary eigenvalues in the spectrum of P(1)). K
A well-known practical approach to calculation of li = % lij (4.6)

j = 1, j " i
matrix P() consists in multiplying matrix P(1)
by itself unless a regular pattern becomes visible Due to this construction, the Kolmogorov–Chap-
in the product. man differential equations for transition probabil-
ities take now the form
ÁP%(t)= P(t)L= LP(t),
4. Continuous model formalism Í
ÄP(0)= I,
When time is considered to be continuous, the
with the solution P(t)=exp{Lt}. Therefore, to
stochastic and Markov properties are similar to
calculate the transition probability matrix P(t) for
those in the discrete formalism:
any t\ 0, it is sufficient to know the matrix L. As
pij (t)] 0, i, j =1, …, K (4.1) well as in the discrete case, we use the expecta-
K
tions mi to calculate L, relying upon the following
% pij (t)=1, i = 1, …, K (4.2) mathematical ground.
j=1 Let Wi be the random variable defined in Sec-
P(s +t)= P(s)P(t), t, s]0 (4.3) tion 3 and Fi (t) be its probability distribution law.
Similar to the ideas behind derivation of Eq. (3.8),
Now t, s denote the continuous-time variables; the we come from the Markov property to the follow-
initial condition for Eq. (4.3) takes the form ing functional equation:
lim P(t)= I (4.4) 1−Fi (t+s)= [1− Fi (s)][1− Fi (t)] Ö s, t] 0,

t0+
As well as in the discrete case, the state distribu- w.r.t. unknown function [1−Fi (t)]. The solution
tion x(t) can be expressed in terms of the initial is well known to be exponential, whereby it fol-
distribution x(0) and matrix P(t): lows that
x(t)=x(0)P(t) (4.5) Pr{Wi 5 tX(0)= i }= Fi (t)= 1− exp {−ci t}
The matrix function P(t) is continuous in t (Kar- where ci is a positive constant. What is also
lin, 1968). Furthermore, the following limits al- known for (and being advantageous of) continu-
ways exist: ous Markov chains is that ci = lii (Chung, 1967),
hence we have, as the expectation of the exponen-
1− pii (t) tial distribution, E[Wi ]= 1/li, and
lim = li ; 0 5li B;
t0+ t
li = 1/mi (4.8)
p (t)
lim ij =lij ; i "j; 0 5lij B, Altogether, we have the following construction.
t0+ t
Given mi, the average waiting times in the process
where the l values are interpreted as the transi- states i= 1,…,K, the densities li are immediately
tion probability densities. Theoretically, given a calculated by Eq. (4.8). From (Chung, 1967) it is
matrix P(t), one can always construct the transi- known that the probability of transition to state j
tion density matrix, upon exit from state i (i" j ) is equal to lij /li.
Since these probabilities have the same meaning both models will produce the same stationary
as the weights bij for the discrete chain, it is distribution x*.
natural to set lij /li =bij. Hence, There is one more property that the discrete
and continuous descriptions have in common.
lij = bij ·li (4.9) To see it, we need some further notions. Let Ti
Using Eq. (4.6), for equiprobable transitions, i.e. denote the total time (number of steps) the dis-
when bij = 1/ki, we obtain thereafter crete chain sojourns at state i. The state i is
called transient if E[Ti ]B and ergodic if
lij = li /ki (4.10) E[Ti ]= . It is known from general theory
(Kemeny and Snell, 1960) that:
for each j linked from the i and, setting the (1) Any finite-state Markov chain has at least
remaining entries to zero, the whole matrix L, one ergodic state;
which completely determines matrix P(t) for any (2) Renumbering, if needed, the chain states,
t ] 0. it is always possible to represent the transition
It is known (Ventzel and Ovcharov, 1991) probability matrix in the following form:

that for any continuous-time chain the limiting
matrix P() = lim P(t) exists, such that for any
t
initial distribution x(0) we have Pd(1)
Q R n (5.3)
0 A
x(t)=x(0)P(t) x(0) P() =x* as t .
where submatrix Q (of size q× q) contains only
transitions between all pairs of transient states,
5. Comparison between the two formalisms while submatrix A (of size (K−q)× (K− q))
corresponds to the ergodic states.
In view of the apparent similarity between There always exists the fundamental matrix
two previous Sections, it is natural to consider
now whether the discrete and continuous models N= [nij ]= % Q k = (I− Q) − 1 (5.4)
yield identical results. In this Section we show k=0
the asymptotic equivalence between these two

with the entries, nij = Ei [Tj ], representing the
formalisms.
conditional expectation of Tj on condition that
LetPd(n) and Pc(t) be the transition probabil-
the chain comes out from state i.
ity matrices for the discrete and continuous
In simpler words, the chain eventually leaves
cases respectively, constructed on a single set of
all the transient states and comes to the ergodic
mi values. If mi are expressed in time steps, then
ones. The set of ergodic states is often called the
Eqs. (3.8) and (3.9) and Eqs. (4.8) and (4.9)
absorbing set, because after reaching this set the
result apparently in the following relationship
chain can never leave it. For this reason, the
among the parameters of the discrete chain and
ergodic states are normally interpreted as the
the continuous one:
climax ones. It is interesting to know how long
Pd(1)=L+ I. (5.1) it takes, on the average, to reach the climax
states from any initial transient state i. Mathe-
Suppose that mi \1, i.e. the chain is regular, matically, we should find the expectation of the
hence the major theorems on the limit distribu- time Ttransient the chain sojourns totally at the
tion are applicable in the discrete and continu- relevant transient states starting at state i. With
ous cases. The following result holds true: matrix N, the answer can be written immedi-
lim [Pd(n)− Pc(n)] =0. (5.2) ately:
n
q
A simple proof is given in the Appendix. As a Ei (Ttransient)= % nij (5.5)
consequence, given an initial distribution x(0), j=1
time among all stages), then the scheme of Fig. 1

where q denotes the number of transient states.
suggests the following vectors of mi -values:
In the Appendix we prove that Eq. (5.5) can be
expanded to the continuous case too, in which Dmin
matrix P is calculated formally by Eq. (5.1) and
= [1 4 12 12 28 40 24 40 40 40 24 24 24 24],
represented in the block form (Eq. (5.3)).
Dmax
= [1 6 12 12 28 40 24 50 50 50 24 24 24 24].
6. Example In the discrete case, the matrix of one-step
transition probabilities constructed by Eqs. (3.8)
Fig. 1 represents a scheme of succession and (3.10) for vector Dmin takes on the form:
through forest types in a mixed (spruce-broad-
leafed) forest in Central Russia as a result of The continuous chain is defined by its matrix
natural overgrowing on an abandoned tillage area Lmin that refers to the matrix Pd,min(1) as shown in
(Korotkov, V.N., personal communication). As Eq. (5.1). Solving the initial-value problem for Eq.
the first effort in the Markov-chain modelling (4.7) yields thereafter the transition probability
technique, we consider an ideal course of succes- matrix
sion with no disturbances; the general direction of
Pc,min(t)= exp{Lmint},
changes is therefore unique: from the pioneer
stage of succession to the climax one, which is that can be quantified further by means of the
considered to be a polydominant spruce-broad- well-known formula
leafed forest (Popadyuk et al., 1994). The chain
P(t)= exp{Lt}=I+ Lt + L 2t 2/2+…
has apparently 14 transient states and one ab-
sorbing state that corresponds to climax. Expert To illustrate the convergence ensuing from Eq.
knowledge of how long a stage of succession can (5.2) for the two types of models, we calculate
last (numbers of years in boxes of the scheme) x(n) and x(t) by Eq. (3.7) and Eq. (4.5) respec-
until it turns into another one has rather the form tively, at some discrete time instants t= n=
of ranges than exact estimates of the mathemati- 0, 1, 2, …, for the initial distribution
cal expectations mi. It makes sense in this case to x(0)= [1, 0, …, 0], that corresponds to the whole
model the fastest and the slowest courses of suc- area being originally at the pioneer stage of suc-
cession. The corresponding chains can be con- cession. In practice, one could monitor all kinds
structed by calculations as above for all the of distinguishable forest types, yet we confine, in
minimal, Dmin, and all the maximal, Dmax, values the illustration, to the subclimax and climax
of mi. If the time step is chosen to be Dt = 5 year stages (components 11–15). Fig. 2 shows dynam-
(it cannot be greater than the shortest waiting ics of these components in both models plotted
Fig. 2. Continuous and discrete model dynamics of the climax and subclimax components for the initial distribution x(0) =
[1,0,…,0].
versus a single time axis in the log-linear scale, thus define the potential range of variations in
chosen to emphasise deviations which are hardly the rate of succession process.
visible in the linear scale. Each subclimax curve For both discrete and continuous cases, Eqs.
is clearly seen to decline after a temporal in- (5.3) and (5.4) produce the fundamental matrix N,
crease, while the spruce trajectories walk away whose elements nij estimate the number of time
from the others, which looks natural due to the steps the succession stays at stage j if started from
shorter waiting time of the spruce stage. Both stage i. If, in particular, we use Pd,min(1) as P, then
common sense and mathematical results per-
suade that everything gets eventually into the
climax stage in the ideal course of succession, i.e.
x() =x* = [0, …, 0, 1], and both the discrete
and continuous descriptions of that course do
converge.
Calculated in the same way for the Dmax set of
mi -values, the matrix Pd,max(1) has logically the
same pattern as Pd,min(1), yet different numerical
entries (not given) in those rows which corre-
spond to differing components in vectors Dmax Its row sums (multiplied by the time step length)
and Dmin. The transition probability matrices now give the estimation (in years) of how long the
Pd,min(1) and Pd,max(1) (or the transition density succession stays in the transient stages if started
matrices Lmin and Lmax for the continuous chain) from any earlier stage i:
Ttransient tory. In general, if an expert suggests a particular

scheme of successional transitions in the form of a
=[440 435 300 380 460 520 240 320 320 320 1
directed graph (like that in Fig. 1), then he/she
20 120 120 120]T.
already acts as if the Markov postulate has been
When calculated for matrix Nmax (not cited), the
accepted: for the stages where the graph does not
estimation yields:
ramify, the transition must be unique for any kind
Ttransient of models; at a ramifying stage, the total set of
possible transitions is already defined by the very
=[487.5 482.5 300 430 510 570 240 370 370 37
graph and it only remains to apply actual data/
0 120 120 120 120]T.
knowledge to estimate the probability distribution
So, it would take from 440 to 487 years, on the over that set. Therefore, the validity of the
average, for the ideal course of succession to Markov postulate is rather a matter of the succes-
reach the climax stage from the pioneer one and sion knowledge than of the way it is formalised in
respectively lesser times from the later stages. chain modelling.
Although some of these predictions might be Current theories of forest succession might
regarded trivial by common sense relying on Fig. rather suggest another motive to criticise the
1, like e.g. those for the pathways 3 7 11 15, Markov-chain approach: while an absorbing
300 years, or 6 … 15, 520 – 570 years, the chain predicts inevitable absorption in the ab-
predictions are not so obvious for the ramifying sorbing state(s) to be associated with the climax
pathways to climax, especially if the alternative one(s), the mosaic concept of age-space structure
transitions were not be equiprobable. and dynamics of forest ecosystems (Borman and
Likens, 1979; McIntosh, 1985; Smirnova et al.,
1990; Korotkov, 1991; Popadyuk et al., 1994)
7. Discussion treats the equilibrium state as a permanent cycling
of spatial forest elements at different stages of
As can be seen in Section 3, the convergence succession (e.g. stage 15 in Fig. 1) sustained due
behaviour of homogeneous chains, to say nothing to the balance between disturbance and regenera-
of the ‘equilibrium’ terminology, fits surprisingly tion processes (see also Logofet, 1999). This mo-
well to the Clementsian paradigm of succession tive however could hardly discredit Markov
theory suggesting a persistent motion to the cli- chains since they are also able to model the above
max state of vegetation which is in equilibrium kind of asymptotic behaviour. For this purpose,
with the environment. This conformity was no- the scheme of transitions should now include the
ticed since the pioneers of Markov-chain mod- disturbance links to the earlier stages that would
elling of successions (Horn, 1975; Usher, 1979) provide for the directed graph (or its substantial
and has certainly been appealing to later follow- part) to be strongly connected (Logofet, 1993).
ers. The opponents however could not accept a The task to reproduce an actual ratio among
methodology ‘‘in which the future development of various stages in the limiting vector would there-
a system is determined by the present state and is after pose a problem to estimate both the space
independent of the way in which that state has ratio in the field and the temporal rates of distur-
developed’’ (Jeffers, 1978; page 89) — a wide- bance — this problem might be solved, in the
spread and vulgar (oversimplified) interpretation future projects, by means of the modern GIS-
of the Markov property. Actually, it follows from technologies developed in the time dimension
Eq. (3.5) and the meaning of pij values that ‘‘the (Acevedo et al., 1995). Markov models have long
present state’’ does not determine completely ‘‘the been known to be able to produce the non-ab-
future development of a system’’: it rather does sorbing asymptotic dynamics too (see, e.g. Jeffers,
only the next step of the model and only in the 1978).
sense that the probability distribution of the next Another, more profound, motive to criticise
state is independent of the current state prehis- Markov models is the postulate of homogeneity,
i.e. the assumption that pij values are time-inde- to have more than one solutions or/and diverging
pendent (Section 2). Prediction by a homogeneous solutions for close original data. As shown in
model can be correct only to the extent that the Sections 3 and 4, this is fortunately not the case
invariance hypothesis holds true. The hypothesis for the Markov-chain formalisms: given a unique
is fairly dubious when the time horizon is long set of the average waiting times mi and likelihood
enough, yet a strict threshold can hardly be fixed proportions bij for the ramifying transitions, the
here. It rather depends on the objectives of the transition probability matrix is restored in a
modelling study. For example, if the study be unique way, the backward problem thus being
aimed at effects of climate change, the invariance rather well-posed here.
hypothesis would apparently not be true and the This gives a mathematical ground to the pro-
course of succession should depend on the posed method of pij estimation, which differs dras-
changes in the key factors of the environment tically from what is used traditionally and based
which the succession is known to depend upon. on substituting the frequency of an event within a
Those factors, like e.g. ground-water level and set of observations for its probability within a
salinity for the flood-plain vegetation (Knyazkov model. Statistically, the latter approach implies
et al., 1989, 1992) or the thermal and moistening registering all possible transitions in a great num-
factors in the forest-steppe zone (Logofet et al. ber of trials, which means quite a long history of
1997), while affecting the rate of successional observations, much longer than the life span of
processes, may well undergo some long-term (even several generations of) the observers. His-
trends induced by climate change. The resulting torical data can hardly meet the formal require-
models (ibid.) represent non-homogeneous Markov ments which a confident statistical inference
chains: the transition matrix is constructed as a imposes on the number of ‘repetitions of the trial’:
function, P[E(t)], of the key factors E(t), which for instance, in a case study of forest successions
summarises the knowledge and data about the in the State Nature Reserve ‘Bryanskii Les’ (Cen-
succession under study. Loosing the algebraic ele- tral Russia), with a seeming abundance of data in
gance of their homogeneous prototypes, the non- hand, only one half of the edaphotopes under
homogeneous models now gain sensitivity to study turned out to be supplied with a proper
climate-sensitive parameters of the environment, number of ‘repetitions’ (Pomaz et al., 1998).
hence acquire a specific prognostic ability. A common trick to ride out the obstacle is to
Although the history of non-homogeneous apply air/space imaging and to accumulate data
Markov-chain models is not so long as homoge- for the statistics through the space dimension
neous ones, it includes some case studies (ibid.; instead of the temporal one. In geobotanic terms,
Logofet, 1999) and attempts to develop the it means to substitute the ecological series (i.e. a
methodological principles (Logofet and sequence of ecosystems changing along an envi-
Denisenko, 1999). A crucial point in the method- ronmental gradient, see, e.g. Walter, 1979) for the
ology is finding the range of potential variations successional (chronological) series. Extensively
in matrix P[E(t)], like those given by matrices used also in demography and life sciences, this
Pmin, Pmax and Lmin, Lmax in Section 6 (see also substitution is generalised in the mathematical
Denisenko et al., 1996, and Logofet et al., 1997, notion of ergodicity. In random process theory,
for an example of how the range can be found an ergodic process is such that a statistical charac-
from observations by a non-traditional usage of teristic of any realisation of the given process in
traditional models). The task to construct a model time (of any trajectory) coincides with the charac-
for a given data set, seeming to be quite a’for- teristic over the whole ensemble of realisations
ward’ task for the modeller, in mathematical (the state space) for any fixed moment of time.
terms, means solving a backward problem (finding So, using observations in the space dimension to
a system of equations whose solution(s) reproduce estimate the transition probabilities in time means
the data). Backward problems are known to be accepting the ergodic hypothesis. This hypothesis
typically ill-posed in the mathematical sense, i.e. may be true, like in the famous ‘Indiana dunes’
where Cowles made his pioneer studies of changes tool for realistic ecological prediction but can
in vegetation (Cowles, 1899; Odum, 1983) and certainly be further developed to non-homoge-
where was a gradient apparently coincident with neous, or even non-Markov, versions formalising
the historical process: while retreating to its the further knowledge of the succession under
present boundaries, Lake Michigan exposed suc- study. These kinds of models could be carefully
cessively newer and newer areas for sand dunes, calibrated in case studies using modern GIS-tech-
so that increasingly older stages of successions nologies to represent the mosaic of forest vegeta-
could be found as one proceeded away from the tion types and may be challenged to provide for
shore. In other cases, matching the temporal gra- the time dimension in those representations; they
dient with an environmental one may not be so have certain potential to respond to this
obvious, thus leaving the issue of ergodicity as a challenge.
matter of modelling hypotheses, either explicit or
implicit ones. Note that existence of ergodic sets
of states in a stochastic process does not yet Acknowledgements
guarantee the process to be ergodic in the above
sense. (More details on the ergodic hypothesis and The research is supported by INCO-Copernicus
‘non-ergodic’ models can be found in Logofet, Programme of the EC, project No. IC15-CT98-
1999). 0104. Calculations and graphics are implemented
The method we propose for pij estimation is by means of the MatLab® software, Version 5.3.
apparently free of both the ergodic hypothesis in, We are grateful to the anonymous reviewer whose
and statistical requirements to, the data behind critique has prompted us to raise mathematical
the model. The empirical evidence is rather sup- generality of the results to its present explicit
posed to be generalised in the expert views of the level, his editing corrections have also improved
succession scheme and the rates at which the the manuscript.
transitions occur. Mathematical analysis is thus
separated from statistical problems inherent in
estimation of mi and bij values. Appendix A
Statement 1.
8. Conclusions
lim [Pd (n)− Pc (n)]= 0. (A.1)
n
Traditional, time-homogeneous Markov chains Proof. It is known (Karlin, 1968) that the stochas-
serve an adequate didactic means to illustrate, tic matrix Pd for a regular Markov chain has one
either by their discrete formalism or continuous or more eigenvalues equal to one, and the corre-
one, the classic paradigm of succession theory sponding eigenvectors are linearly independent.
when expressed as a quantified scheme of succes- All the remaining eigenvalues lie inside the region
sional transitions. As a straightforward formalisa- z B1. It follows that Pd can be represented, in
the Jordan basis, as a matrix J= CPdC − 1 =
n
tion, free of many technical assumptions that
other dynamic models normally do, the homoge-
Ir 0
neous model gives numerical certainty to what the , where Ir is the identity matrix r× r
scheme alone could predict in rather qualitative 0 L
terms. For example, the chain can predict the (r] 1) and matrix L contains only eigenvalues l
such that lB 1. Therefore, CPd (n)C − 1 = J n
n
theoretical course of succession, the ratio between
its final stages, and the mean time it would take to
Ir 0
reach the final stage(s) under the hypothesis of J() = as n . For the continuous
invariance. It thus gives geobotanic knowledge the 0 0
quantitative form capable of prediction. The ho- chain with matrix L calculated by Eq. (5.1) we
mogeneous Markov chain can hardly become a have, in the same basis, CLC − 1 = J− I, hence
CPc(t)C − 1 =exp{(J −I)t}

=%
q
q Ã
− % nikpkj (t) Ã

n j=1 k=1 Ã0

0 0
= exp J() q q q
0 (L −IK − r )t t
=% % nikpkj (0) = % nij
j=1 k=1 j=1
which completes the proof. which completes the proof.

Note that the proof gives also a method for
calculating Pd() =Pc() = C − 1J()C.
Statement 2. In the continuous-time Markov
References
chain
q
Acevedo, M.F., Urban, D.L., Ablan, M., 1995. Transition and
gap models in forest dynamics. Ecol. Appl. 5, 1040 – 1055.
Ei (Ttransient)= % nij (A.2)
j=1 Balzter, H., 2000. Markov chain models for vegetation dynam-
ics. Ecol. Model. 126 (2-3), 139 – 154.
where [nij ]=N= (I − Q) − 1, and q × q-submatrix Borman, F.H., Likens, G.E., 1979. Pattern and Process in a
Forested Ecosystem. Disturbance, Development and the
Q is defined by Eq. (5.1) and the block form Eq. Steady-State Based on the Hubbard Brook Ecosystem
(5.3). Study. Springer-Verlag, New York 253 pp.
Proof. It follows from Eq. (5.1) and Eq. (5.3) that Chung, K.L., 1967. Markov Chains With Stationary Transi-
matrix L takes on the following block form: tion Probabilities. Springer-Verlag, Berlin 301 pp.
Cowles, H.C., 1899. The ecological relations of the vegetation
L=
Q −I R
.
n of the sand dunes of Lake Michigan. Bot. Gaz. 27, 95 –
117, 167 – 202, 281 – 308, 361 – 391.
Denisenko, E.A., Kargopolova, U.D., Logofet, D.O., 1996.
0 A− I
Vegetation primary succession in a technogenic landscape
of the forest-steppe zone (a Markov model). Biol. Bull. 23
Confined to the transient states, the Kolmogorov– (5), 451 – 459.
Doob, J.L., 1953. Stochastic Processes. Wiley, New York
Chapman equations P%(t) = LP(t) can now be
(reprinted in 1964 and 1990).
rewritten as follows: Horn, H.S., 1975. Markovian properties of forest successions.
In: Cody, M.L., Diamond, J.M. (Eds.), Ecology and Evo-
P0 (t)= (Q − I) − 1P0 %(t) = − NP0 %(t). (A.3)
lution of Communities. Belknap Press, Cambridge, MA,
pp. 196 – 211.
Consider a random variable u(t) that is equal to 1 Horn, H.S., 1981. Succession. In: May, R.M. (Ed.), Theoreti-
if X(t)=j5q (any transient state) and to 0 other- cal Ecology: Principles and Applications. Blackwell, Ox-
wise. Obviously, ford, pp. 253 – 271.
&
Jeffers, J.N.R., 1978. An Introduction to Systems Analysis:
with Ecological Applications. Edward Arnold, London 198
Ttransient = u(t)dt. pp.
0 Karlin, S., 1968. A First Course In Stochastic Processes.
Academic Press, New York 502 pp.
The expectation of Ttransient is thereafter, in view of Kemeny, J.G., Snell, J.L., 1960. Finite Markov Chains. Van
Eq. (A.3), Nostrand, Princeton, NJ 210 pp.
& n &
Knyazkov, V.V., Logofet, D.O., Tursunov, R.D., 1989. An
attempt in Markov description of the plant succession.
Ei [Ttransient]=Ei u(t)dt = Ei [u(t)]dt Izvestiya AN SSSR, Seriya Biologicheskaya 2, 297 – 301 (in
Russian).
& &
0 0
Knyazkov, V.V., Logofet, D.O., Tursunov, R.D., 1992. Non-
q q homogeneous Markov model for plant successions in
= % pij (t)dt = % pij (t)dt Tigrovaya Balka Reserve. In: Logofet, D.O. (Ed.), Mathe-
&
0 j=1 j=1 0
matical Modelling of Plant Populations and Phytocenoses.
=%
q q
( − % nikp%kj (t))dt
n Nauka, Moscow, pp. 37 – 48 (in Russian).
Korotkov, V.N., 1991. A new paradigm in forest ecology.
j=1 0 k=1 Biologicheskie nauki 8, 7 – 19 (in Russian).
Li, B.-L., 1995. Stability analysis of a nonhomogeneous Roberts, F.S., 1976. Discrete Mathematical Models with Ap-
Markovian landscape model. Ecol. Model. 82, 247–256. plications to Social, Biological and Environmental Prob-
Logofet, D.O., 1993. Matrices and Graphs: Stability Problems lems. Prentice-Hall, Englewood Cliffs, NJ 559 pp.
in Mathematical Ecology. CRC Press, Boca Raton, FL 308 Smirnova, O.V., Chistyakova, A.A., Popadyuk, R.V., et al.,
pp. 1990. Population Organisation of the Vegetation Cover in
Logofet, D.O., 1999. Successional dynamics of vegetation: Forest Areas (exemplified by broad-leafed forests of the
classical concepts and contemporary models. In: I.A. European part of the USSR). Research Center of Biologi-
Shilov (Ed.), Ecology in Russia at the Turn of the 21st cal Studies, USSR Acad. Sci., Pushchino, 92 pp. (in
Century, 1. Nauchnyi Mir. Moscow 297-317, 70–98. Russian).
Logofet, D.O., Denisenko, E.A., Golubyatnikov, L.L., 1997. Turner, M.G., Gardner, R.H. (Eds.), 1991. Quantitative Meth-
Inhomogeneous Markov models for succession of plant ods in Landscape Ecology. Springer-Verlag, New York,
communities: new perspectives of an old paradigm. Biol. 536 pp.
Bull. 24, 506 – 514.
Usher, M.B., 1979. Markovian approaches to ecological suc-
Logofet, D.O., Denisenko, E.A., 1999. Principles in modelling
cession. J. Anim. Ecol. 48, 413 – 426.
the vegetation dynamics under climate changes. Zhurnal
Usher, M.B., 1981. Modelling ecological succession, with par-
Obschei Biologii 60, 527–542.
ticular reference to Markovian models. Vegetatio 46, 11 –
McIntosh, R.P., 1985. The Background of Ecology: Concept
18.
and Theory. Cambridge Univ. Press, New York 383 pp.
Mirkin, B.M., Naumova, L.G., 1984. On the successions in Usher, M.B., 1992. Statistical models of succession. In: Glenn-
plant communities. Ekologiya 6, 3–11. Levin, D.C., Peet, R.K., Veblen, T.T. (Eds.), Plant Succes-
Odum, E.P., 1983. Basic Ecology. Saunders College, Philadel- sion: Theory and Prediction. Chapman-Hall, London, pp.
phia 613 pp. 215 – 248.
Pomaz, V.L., Evstigneev, O.I., Logofet, D.O., 1998. Applica- Ventzel, E.S., Ovcharov, L.A., 1991. Theory of Stochastic
tion of Markov models for studying successional changes Processes and Its Applications in Engineering, Nauka,
in forest communities. Lesovedenie 5, 59–66. Moscow, 384 pp. (in Russian).
Popadyuk, R.V., Chistyakova, A.A., Chumachenko, S.I., et Vinogradov, B.V., Schitov, A.G., 1994. Modelling ecosystem
al., 1994. East-European Broad-leafed Forests. Nauka, dynamics in the southern-taiga subzone by means of
Moscow, 364 pp. (in Russian). Markov chains. Lesovedenie 1, 13 – 21 (in Russian).
Prokhorov, Yu.V., Rozanov, Yu.A., 1987. Theory of Proba- Walter, H., 1979. Allgemeine Geobotanik: Eine kurze Ein-
bilities. Nauka, Moscow, 400 pp. (in Russian). führung. Stuttgart: Eugen Ulmer Verlag, 260 pp.

2000-The Mathematics of Markov Models What Markov Chains Can Really Predict in Forest Successions

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

2000-The Mathematics of Markov Models What Markov Chains Can Really Predict in Forest Successions

Uploaded by

Copyright:

Available Formats

Ecological Modelling 126 (2000) 285 – 298

The mathematics of Markov models: what Markov chains

Accepted 23 February 2000

1. Introduction are nowadays called Marko6-chain models of suc-

A well-known practical approach to calculation of li = % lij (4.6)

lim P(t)= I (4.4) 1−Fi (t+s)= [1− Fi (s)][1− Fi (t)] Ö s, t] 0,

x(t)=x(0)P(t) (4.5) Pr{Wi 5 tX(0)= i }= Fi (t)= 1− exp {−ci t}

the asymptotic equivalence between these two

time among all stages), then the scheme of Fig. 1

Ttransient tory. In general, if an expert suggests a particular

CPc(t)C − 1 =exp{(J −I)t}

which completes the proof. which completes the proof.

You might also like

2000-The Mathematics of Markov Models What Markov Chains Can Really Predict in Forest Successions

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

2000-The Mathematics of Markov Models What Markov Chains Can Really Predict in Forest Successions

Uploaded by

Copyright:

Available Formats

Ecological Modelling 126 (2000) 285 – 298

The mathematics of Markov models: what Markov chains

Accepted 23 February 2000

1. Introduction are nowadays called Marko6-chain models of suc-

A well-known practical approach to calculation of li = % lij (4.6)

lim P(t)= I (4.4) 1−Fi (t+s)= [1− Fi (s)][1− Fi (t)] Ö s, t] 0,

x(t)=x(0)P(t) (4.5) Pr{Wi 5 t X(0)= i }= Fi (t)= 1− exp {−ci t}

the asymptotic equivalence between these two

time among all stages), then the scheme of Fig. 1

Ttransient tory. In general, if an expert suggests a particular

CPc(t)C − 1 =exp{(J −I)t}

which completes the proof. which completes the proof.

You might also like

x(t)=x(0)P(t) (4.5) Pr{Wi 5 tX(0)= i }= Fi (t)= 1− exp {−ci t}