Professional Documents
Culture Documents
Editors
S. S. Chern B. Eckmann P. de la Harpe
H. Hironaka F. Hirzebruch N. Hitchin
L. Hormander M.-A. Knus A. Kupiainen
J. Lannes G. Lebeau M. Ratner D. Serre
Ya.G. Sinai N. J. A. Sloane J.Tits
M. Waldschmidt S. Watanabe
Managing Editors
M. Berger J. Coates S.R.S. Varadhan
Springer- Verlag Berlin Heidelberg GmbH
Thomas M. Liggett
Stochastic
Interacting Systellls:
Contact, Voter and
Exclusion Processes
With 6 Figures
Springer
Thomas M. Liggett
Mathematics Department
University of California
Los Angeles, CA 90095-1555
USA
email: tml@math.ucla.edu
ISSN 0072-7830
ISBN 978-3-642-08529-1 ISBN 978-3-662-03990-8 (eBook)
001 10.1007/978-3-662-03990-8
This work is subject to copyright. All rights are reserved, whether the whole or
part of the material is concerned, specifically the rights of translation, reprinting,
reuse of illustrations, recitation, broadcasting, reproduction on microfilm or in
any other way, and storage in data banks. Duplication of this publication or parts
thereof is permitted only under the provisions of the German Copyright Law of
September 9,1965, in its current version, and permission for use must always be
obtained from Springer-Verlag. Violations are liable for prosecution under the
German Copyright Law.
© Springer-Verlag Berlin Heidelberg 1999
Originally published by Springer-Verlag Berlin Heidelberg New York in 1999.
Softcover reprint of the hardcover 1st edition 1999
Cover design: MetaDesign plus GmbH, Berlin
Typesetting: Photocomposed from the author's AMSTEX files after editing and
reformatting by Kurt Mattes, Heidelberg, using a Springer TEX macro-package
Cover design: de'blik, Berlin
SPIN: 10728278 41/3143-543210 Printed on acid-free paper
Preface
Interacting particle systems is a branch of probability theory that has rich con-
nections with a number of areas of science - primarily physics in the early days,
but increasingly biology and the social sciences today. Stochastic processes of the
sort that are studied in this field are used to model magnetism, spatial competition,
tumor growth, spread of infection, and certain economic systems, to mention but
a few of the many areas of application.
The subject is by now about thirty years old. At the midpoint of that thirty year
period, I wrote the book Interacting Particle Systems (IPS) as an attempt to give
some order to the work that had been done by then, and to make the field more
accessible to new researchers, and more useful to workers in areas of application.
Judging from the rapid development of the field since then, this attempt appears
to have been successful.
My earlier book covered more or less the entire field, as it was at that time.
Even so, some topics, such as zero range processes and the then emerging area
of hydrodynamics, were mentioned only briefly. By now, the field has grown
to the point where it would be impossible to cover it entirely in one book. In
fact, a number of books that treat special topics within the field have appeared in
the interim - see for example Chen (1992), DeMasi and Presutti (1991), Durrett
(1988), Kipnis and Landim (1999), Konno (1994), and Spohn (1991).
IPS was organized horizontally, in that a separate chapter was devoted to each
type of model: stochastic Ising models, voter models, contact processes, nearest
particle systems, exclusion processes, and linear systems. The present book has a
more vertical appearance. It takes but three of these models - the ones given in
the title - and traces their development since 1985. Nearest particle systems are
omitted because, even though substantial progress has been made on them since
1985 (especially by T. Mountford), they are by their nature somewhat special.
Linear systems are omitted because they have been less active recently, while
stochastic Ising models are omitted because developments in that area alone would
justify an entire book.
Even my relatively modest objective of covering recent work on three models
cannot be attained in a book of reasonable size, so I have had to make some
choices about what to include. These choices reflect to some extent, of course,
my own interests and perspective on the field. Other work on these models is
described briefly in the Notes and References section for each of the three parts.
VI Preface
occurs in this context that is absent in the case of Zd. Briefly, the contact process
on Zd has one critical value, while the contact process on a homogeneous tree
(other than Z 1) has two distinct critical values. Between these two critical values,
the finite process survives globally, but dies out locally. Unlike Zd, the tree is
large enough that the infected set can wander out to infinity without dying out,
but this can only happen for intermediate values of the infection parameter.
The story is quite different for voter models. The voter models discussed in
Chapter V of IPS are what are now known as linear voter models. Their ergodic
theory was more or less completed in IPS. While significant progress has been
made on linear voter models since then, and is discussed in the Notes and Refer-
ences section, the focus of Part II is on their nonlinear cousins. Nonlinear voter
models require quite a different approach, primarily because their duals (when they
exist) are harder to analyze. While the theory of nonlinear voter models is still
very far from being complete, there are close connections to the contact process,
and this makes it a natural candidate for inclusion in this book. The main theorem
in Part II gives a complete classification of threshold voter models with threshold
level = 1. The proof given there is a substantial improvement over my original
treatment, which was computer aided and contained a serious error.
The situation for exclusion processes is again different. The material in Chapter
VIII of IPS has in general not been superceded by subsequent developments.
However, there is a whole new collection of issues that have been investigated,
and it is to these that we address our attention in the final part of this book.
We again omit a treatment of the by now mature area of hydrodynamics, partly
because it is well covered in the books of De Masi and Presutti (1991), Spohn
(1991), and Kipnis and Landim (1999), and partly because it has quite a different
flavor from the topics we will cover here.
The first main section of Part III gives a probabilistic treatment of shocks in the
asymmetric, nearest neighbor, exclusion process in one dimension, based on work
by Ferrari and his coauthors. The main technique used here is coupling. Then we
move to a more analytic treatment of roughly the same issues that was developed
by Derrida and his coworkers. This is known as the matrix approach. Finally,
we turn to central limit theorems for tagged particles in more general exclusion
processes, based on work of Varadhan and coauthors. IPS has a treatment of
this only in the case of the symmetric, nearest neighbor, one-dimensional system,
which has a different behavior than the general system considered here.
The Background and Tools section at the beginning of the book describes the
basic particle system setup, and some of the key techniques that are useful in the
analysis of many models - coupling, monotonicity, correlation inequalities and
subadditivity, for example. These first few subsections should be read before ven-
turing into the book proper, but the latter subsections can be skipped, and read
when they are used later on. Each of the three parts begins with a brief description
of that particular model, and gives precise statements of results from the corre-
sponding chapters ofIPS (or from other references) that are used later. With this
exception, the numbered sections within each part are largely self-contained. Each
VIII Preface
part ends with a Notes and References section that has two functions. First, it
details the sources of the material in that part. Secondly, it contains brief descrip-
tions of the large amount of related work that I have not been able to include in
the book itself.
While this book is more or less self-contained, the reader may find that reading
parts of IPS first makes the going easier. Here are my suggestions about what parts
of IPS to read in this case:
(a) The first four sections of Chapter I and the first three sections of Chapter II
before starting this book.
(b) The first three sections of Chapter VI before reading Part I.
(c) The first two sections of Chapter V before reading Part II.
(d) The first three sections of Chapter VIII before reading Part III.
A popular (I think) feature of IPS was its sets of open problems. I have not
attempted to do anything formal of this sort here. There are simply too many
open problems, and many of them are not directly about the three types of models
I treat here, but rather about other models that are nevertheless closely related
to contact, voter and exclusion processes. However, I do mention problems that
I think should be looked at when they arise naturally, mainly in the Notes and
References sections.
As I mentioned in the preface to IPS, my wife Chris had a lot to do with my
writing that book. For the last several years, she has been lobbying for a follow-up.
It took a while, but she finally got it. In the earlier preface, I mentioned some of
the people who had had the most impact on my work, as well as on the subject as
a whole. Most have continued to be leaders in the field, but they have now been
joined by a large and impressive group of younger mathematicians. I won't list
them here, but most appear prominently in the bibliography. One of the measures
of a field of research is the caliber of researcher that it attracts. By this measure,
interacting particle systems has been a great success.
Pablo Ferrari, Norio Konno, Tom Mountford, Roberto Schonmann, and espe-
cially my former students, Amber Puha and Li-Chau Wu, have read parts of this
book, and made suggestions for improvement - I very much appreciate their input.
I would like to acknowledge the National Science Foundation for its support of
my work over the past quarter of a century, and the Guggenheim Foundation for
freeing my time in 1997-98, so that I could devote much of it to writing this book.
Without their support, this work would not have been possible.
Bibliography 317
Index 331
Background and Tools
We begin this section by setting up the basic tenninology and notation to be used
in this book. Then we will discuss briefly the main foundational results and other
tools that will be used later. Many of these are taken from IPS, so the proofs will
often not be given here. Insofar as possible, we will use the notation from IPS.
The first part of this section should be read at the outset. The latter material is
more special, and can be read when it comes up later. This material appears in
roughly the order in which it is used in the rest of the book.
The Processes
The models studied in this book are continuous time Markov processes 1)( with
state space X = {O, l}s, where S is a countable set of sites. Usually S will be Zd
or a tree. Note that X is compact in the product topology. A configuration 1) E X
has the following interpretations in the three cases to be considered:
Contact Processes. There is an individual (or plant, or cell, or ... ) at each site
XES that is infected if 1) (x) = 1 and healthy if 1) (x) = O.
Voter Models. There is an individual (i.e., a voter) at each site x who has possible
opinions 0 or 1 at any given time. Alternatively, each site is occupied by an
individual of one of two types, labelled 0 or I.
Exclusion Processes. At each time, a site x is either occupied by a particle (if
1)(x) = 1) or vacant (if 1) (x) = 0).
uniform norm
Ilfll = sup If(I])I·
ryEX
All the processes we will consider will have the Feller property, so that we can
define the semigroup of the process on C(X) by
(The Feller property is just the statement S(t)f E C(X) whenever f E C(X).)
For I] E X and x, yES, define I]x and I]x,y by
{ 1 - I](x) if Z = x,
I]x (z) = I] (z)
if Z =1= x
and
if Z = x,
{ "(y)
I]x,y(z) = I] (x) if Z = y,
I](z) if Z =1= x, y.
Thus I]x is obtained from I] by flipping the xth coordinate, while I]x,y is obtained
from I] by interchanging the xth and yth coordinates. With the occupancy inter-
pretation of exclusion processes, the effect of this is to move a particle from x to
y (if I] (x ) = 1, I](Y) = 0), to move a particle from y to x (if I](Y) = 1, I] (x ) = 0),
or has no effect (if I](x) = I](Y)).
These are the new configurations obtained from I] following a single transition.
The intuitive meaning of the function c in each case is then
and
pry(l]t = I]x,y) = c(x, y, I])t + oCt), (if I] (x) =1= I](Y))
as t -J.- O. Strictly speaking, these statements are only correct if S is finite, since
otherwise the probabilities on the left are typically zero for t > O. When S is
infinite and c(x, 1]) is bounded below by a positive number, there will be infinitely
many transitions in every finite time interval.
More formally, the connection between the rate function c and the process I]t
is made through the generator Q of I]t. For functions f on X that depend on
finitely many coordinates (these are known as cylinder junctions), define
(Bl)
x
or
in the two cases. The restriction to cylinder functions is needed so that these series
will converge. In (Bl), for example, there are only finitely many nonzero terms. In
Background and Tools 3
(B2) the series will converge provided that c(x, y, 1/) satisfies natural summability
conditions.
The fundamental construction of the process 1/1 is given by the following
theorem. It is a special case of Theorem 3.9 on page 27 of IPS. We will state
it only for spin systems, but the corresponding existence theorem for exclusion
processes is entirely analogous. The assumptions in that case are given in (1.1) of
Part III.
Qf = lim S(t)f - f,
ItO t
QS(t)f = S(t)Qf,
and u(t) = S(t)f is the unique solution to the evolution equation
d -
dt u(t) = Qu(t), u(O) = f.
Then Q is the linear operator on C (X) whose graph is the ordinary closure of the
set G. Part of the statement of the theorem is that the closure of G is the graph
of a (single valued) linear operator.
Often we will carry out some computation on a finite system, and then will
argue that the result applies to infinite systems as well. This extension will usually
not be carried out explicitly, but will be left to the reader. The extension from
finite to infinite systems is usually justified by the following result, which is a
special case of Corollary 3.14 on page 29 of IPS.
Theorem B5 (Trotter-Kurtz). Suppose Cn(x, 1/) and c(x, 1/) are transition rates
that satisfY (B4). Define Qnf and Qf for cylinder functions f by (B1). Suppose
that
4 Background and Tools
for all f E C(X) and t ::::: 0. The convergence in (B6) is uniform on bounded t
intervals.
Invariant Measures
Much of the study of interacting particle systems involves their invariant measures
and convergence to them. If JL is a probability measure on X, the distribution of
1]1 when the initial distribution is JL is denoted by JLS(t), and is defined by
The fact that this relation determines JLS(t) uniquely is a consequence of the Riesz
Representation Theorem (Theorem 2.14 of Rudin (1966». The probability measure
JL is said to be an invariant measure if it satisfies JLS(t) = JL for all t > O. The
set of all invariant measures is denoted by .9.
The following theorem summarizes some elementary, but important, proper-
ties of .9. See pages 10-18 of IPS for their proofs. The topology on the set of
probability measures on X is that of weak convergence. The compactness of X
implies the compactness of the set of probability measures on X in this topology,
and this is essential for several parts of the theorem. The fact that the process
satisfies the Feller property is also crucial.
exists for some probability measure /L and some sequence Tn t 00, then v E g.
(g) In the context of Theorem B5, if /Ln is invariant for the process with generator
Q n and /Ln -+ /L weakly, then /L is invariant for the process with generator Q.
One consequence of (c) and (d) is that g has at least one extreme point. The
set of all extreme points will be denoted by .9;.
Reversible Measures
According to part (c) of Theorem B7, the process always has at least one invariant
measure. Sometimes an invariant measure satisfies a symmetry property known as
reversibility, and when it does, additional tools become available, and results are
generally more complete. The probability measure /L on X is said to be reversible
for the process if it satisfies
f fS(t)gd/L = f gS(t)fd/L
Lrr(x)q(x, y) = 0, YES,
x
Comparing these two properties, one can see that the second is quite strong, and
should be expected to hold only in very special cases. For example, if n is strictly
positive, then reversibility of n implies that q (x, y) > 0 if and only if q (y, x) > o.
Even when a measure is not reversible, quantities that one might call the
defects from reversibility,
n(x)q(x, y) - n(y)q(y, x),
can playa useful role. An example of this occurs in the proof of Theorem 3.1 of
Part III.
This leads naturally to the definition of stochastic mono tonicity for probability
measures IL on X:
ILl S ILz provided that
(B8)
Ix fdlLl S Ix fdILz for all increasing f on X.
This stochastic monotonicity for probability measures is best understood in
terms of the idea of coupling. A coupling of random variables or stochastic pro-
cesses is simply a joint construction of them on a common probability space.
Taken by itself, this is not a particularly compelling definition. However, making
a judicious choice of the joint distribution of the random variables or processes
involved turns out to be a very powerful technique. This book provides many
illustrations of this. The following is Theorem 2.4 on page 72 of IPS, and gives
the connection between coupling and stochastic monotonicity.
Theorem B9. Suppose ILl and ILz are probability measures on X. Then ILl S ILz if
and only if there is a coupling (1], I;) so that 1] has distribution ILl, l; has distribution
ILz, and 1] S l; a.s.
Remark. One direction of the proof is easy: If a coupling (1], I;) with these prop-
erties exists and f is increasing, then f(1]) S f(1;) a.s., so that
Theorem BI0 (Holley). Suppose S is finite and I-ll, 1-l2 are probability measures
on X that assign strictly positive probabilities to each point in X. If
(Bl1)
Remark. It is important to keep in mind that (B 11) is much stronger than I-ll ::'S 1-l2.
For example, (B11) implies that the conditional measures obtained by specifying
the configurations on a subset of S are also stochastically ordered, while this is
certainly not the case if only (B8) is assumed.
and
(B 13) I-ll ::'S 1-l2 implies I-ll Set) ::'S 1-l2S(t) for all t :::: o.
The proof is an immediate consequence of the definitions. A process that satisfies
these equivalent conditions is called monotone or attractive.
According to Theorem 2.2 on page 134 of IPS, the following is a necessary
and sufficient condition for a spin system to be attractive:
{
c(x,1))::'S c(x,{) if 1)(x) = sex) = 0,
(B14) 1) ::'S s implies
c(x, 1)) :::: c(x, {) if 1)(x) = sex) = 1.
We can use coupling to see that (B14) implies attractiveness, for example. Take
initial configurations 1), s satisfying 1) ::'S S. Construct a coupled process (1)1, Sl) on
X x X that satisfies 1)1 ::'S Sl a.s. for all t :::: 0 by allowing the following transitions:
8 Background and Tools
{
(1/, n -+ (1/x, n at rate c(x, 1/),
if 1/(x) = 0 and ~(x) = 1, then
(1/, n -+ (1/, ~x) at rate c(x, n
Note that the marginals have the right transition rates. For example, if ~(x) = 0,
then ~ -+ ~x at rate
Correlation Inequalities
Correlation inequalities are also very useful in the study of interacting particle
systems. A probability measure J.L on X is said to have positive correlations if
Theorem B15 (FKG). Suppose S is finite and J.L assigns positive probability to
every point in X. If
(BI6)
For almost any Markov process, it is practically impossible to check that J.LS(t)
has positive correlations using Theorem B 15, partly because (B 16) essentially
requires that J.LS(t) be known explicitly, but also because (BI6) is often false.
For example, Liggett (1994) showed that (at least for some times and parameter
values), the distribution at time t of the one dimensional contact process with
initial condition 1/ == 1 does not satisfy (B 16). In view of these comments, it
Background and Tools 9
should not be surprising that the following result is useful. It is a special case of
Theorem 2.14 on page 80 ofIPS.
Theorem B17 (Harris). !fTJt is an attractive spin system, then for every t > 0,
To deduce Corollary B18 from Theorem B15, simply note that both sides of (B16)
are
n a(x)1)(x)+~(x)[1 - a(x)]2-1)(X)-~(X).
x
lim p(TJt(x)
t ..... oo
= 1) = a(x),
so
v = lim JLS(t).
t ..... oo
if TJ E A,
if TJ 'f- A
10 Background and Tools
In other words, increasing events are positively correlated in the usual sense.
Often it is important to have inequalities in the opposite direction. Clearly the
event appearing on the left side of the inequality must be smaller than A I n A2
in order to have the opposite inequality. Here is the appropriate definition. For
AI, A2 C X, define
Al = {1] : 1](x) + 1](Y) :::: I} and A2 = {1] : 1](Y) + 1](z) :::: I},
where x, y, Z are distinct points in S. Then A 1 0 A2 = {1] : 1] (x ) + 1](Y) + 1](z) :::: 2},
so that
Therefore Theorem B21 implies that v(AI nA2) ::s V(AI)V(A2). But this is equiv-
alent to v(AI n AD :::: v(AI)v(A~).
Background and Tools 11
Theorem B21 for increasing events is due to van den Berg and Kesten, and
has long been known as the BK inequality. See Section 2.3 of Grimmett (1989)
for a proof in this context. The proof of the general form of the theorem is due
to Reimer, and this leads to our calling it the BKR inequality. Reimer's proof is
given in Section 6 of Chayes, Puha and Sweet (1999). The proof given there is
for the case that v {1] : 1] (x) = I} = 1
for all XES. The fact that this special
case of Theorem B21 implies the general case had been proved earlier by van
den Berg and Fiebig (1987) - see their Lemma 3.5. In that paper, they proved the
inequality in several cases, including that in which A and B are intersections of an
increasing and a decreasing event. In his second edition, Grimmett (1999) states
the general BKR inequality (see his Theorem 2.19), but again proves it only for
increasing events.
Results such as Corollary B 18 and Theorem B21 have been stated for inde-
pendent Bernoulli random variables. However, they can be used to obtain similar
results for independent Poisson processes. This is important, since all of the pro-
cesses discussed in this book can be constructed from collections of independent
Poisson processes.
Here is the idea. Suppose N is a rate one Poisson process on [0, 1], i.e., NO
is a random measure on [0, 1] with the following properties:
(i) For each Borel set A C [0, 1], N(A) is Poisson distributed random variable
with mean meA), where m is Lebesgue measure.
(ii) If {Ad are disjoint, then {N(Ai)} are independent.
Define random variables by
if N(A) = 0,
M(A) = {~ if N(A) ~ 1.
If {Ad are disjoint, then {M(Ad} are independent Bernoulli random variables.
Furthermore,
Duality
Two Markov processes 1]( and ~( (with possibly different state spaces) are said to
be dual with respect to the function H if
12 Background and Tools
for all 1] in the state space of the first process and ~ in the state space of the
second. The function H should be jointly measurable, and either nonnegative or
bounded, so that the above expectations are well defined. Duality is often a useful
tool because it permits the computation of certain probabilities for one of the
processes in terms of probabilities for the other. It has other important uses as
well, as we will see later in this book.
A general discussion of duality can be found in Section 3 of Chapter II of IPS.
Rather than repeat this here, we will limit ourselves to the observation that duality
will arise in our discussion of the basic contact process in Part I of this book, in
our discussion of the threshold contact process, and the linear and threshold voter
models in Part II, and in our discussion of the symmetric exclusion process in
Part III.
Subadditivity
Subadditive sequences and functions will come up frequently. The much more
powerful subadditive ergodic theorem (Theorem 2.6 on page 277 of IPS) plays an
important role in some aspects of the study of the contact process, but will not be
used in this book.
Here is the main result we will use.
Proof Let
a = infa(t).
1>0 t
Fix s > 0 and write t = ks + u, 0 :s u :s s, where k is an integer. Then
Oriented Percolation
Oriented site percolation is a very useful comparison process for interacting par-
ticle systems - especially the contact process. Here is a description of the site
percolation model with parameter p: An is a discrete time Markov chain on the
collection of finite subsets of Z with the following evolution: conditional on the
process up to time n, the events {x E An+d are independent and have probability
if An n {x -l,x} =1= 0,
if An n {x - 1, x} = 0.
This is not quite the traditional description of the process, but it has the advantage
of making clear that oriented percolation can be viewed as a discrete time version
of the one dimensional contact process.
The following result summarizes the main facts we will use about An. Note
that if Ao = {O}, then An C [0, n]. Let T = inf{n 2: 1 : An = 0}.
Theorem B24. If P is sufficiently close to 1, then there are constants C and E > 0
such that
A proof of Theorem B24 can be found in Durrett (1984) for oriented bond
I
percolation, in which the transition probabilities given above are replaced by
To deduce parts (a) and (c) of Theorem B24 for oriented site percolation from the
corresponding statements in the bond case, it suffices to note that one can couple a
site percolation process An with parameter p(2 - p) to a bond percolation process
Bn with parameter p so that Bn C An, provided that the initial states satisfy
Bo C Ao. Part (b) is not quite so easy, since {n < T < oo} is not a monotone
event. However, it can be deduced from the bond case by using the restart argument
described on pages 1031-1032 of Durrett (1994). (We will encounter a version of
this argument in the proof of Theorem 2.30 of Part I.)
For the one dimensional contact process, the analogues of the three parts of
Theorem B24 can be found in IPS as Theorem 2.28 on page 284, Theorem 3.23
on page 302 and Theorem 3.29 on page 303 respectively.
More quantitative statements related to Theorem B24 have been proved by
Liggett (1995b): If p 2: ~, then
14 Background and Tools
An = {k: there is an oriented path from (i, 0) to (k, n) for some i E A}.
Theorem B26. Fix k, d :::: 1, and let ~ = #{y E Zd : Iyl ::::: k}. Then whenever
{Xx, x E Zd} are Bernoulli random variables that satisfY
Remark. The most important situation in which this theorem is used is that in
which the Xx's are k-dependent. Recall that a collection of random variables {Xx}
indexed by Zd is said to be k-dependent provided that whenever A and Bare
subsets of Zd that satisfy
Proof of Theorem B26. Let {xn, n 2: O} be any enumeration of the points in Zd,
and write Xn = XX n ' We assume without loss of generality that all probabilities of
the form P(Xo = EO, ... , Xn = En) are strictly positive. If it were the case that
for all n and all choices of Ej E {O, I}, then it would be easy to construct recursively
a coupling that would realize the desired inequality jJ., 2: vp. Alternatively, one
could apply Theorem BlO to check this, since (Bll) reduces to (B27) when jJ.,1 =
vp and jJ.,2 = jJ.,. However, (B27) is too strong a condition to expect to check in
any significant generality.
The idea of the proof is to let {Yn , n 2: O} be an i.i.d. sequence of Bernoulli
random variables with P(Yn = 1) = r that is independent of the X's, and try
to check that the sequence Zn = Xn Yn satisfies (B27). Since Zn .::; X n, this will
suffice. Since
1- s
< --------------------------------
- (l-r)INoIP(X i = l,i E NI I Zi =Ei,i EM)'
where the final inequality comes from (B28) and the fact that each Yi is indepen-
dent of all the X's and all the other Y's.
We will now use (B31), which is true for any ordering of Zd and any n, to
prove inductively that if r is chosen appropriately, then
for all orderings of zd, all n and all Ei. By (B28), it is true for n = -1 provided
that r S s. Write the P(Xi = 1, i E NI I Zi = Ei, i EM) that appears in the final
expression of (B31) as a product of INil conditional probabilities of the form
for I E N I • Then we see from (B31) that (B32) holds for a given n, provided it
holds for all smaller values of n and that
l-s
(B33) ---------,----,-- S 1 - r.
(l - r)INolrlNll
!
We conclude that if s ::: r ::: and (B34) all hold, then (B28) implies that (B29)
holds for p = r2 (by (B30) and (B32)), and hence by the remarks at the beginning
of the proof, that f.1- ::: vp. Take r = -/p and s so that equality holds in (B34) to
complete the proof.
=L
n
(B35) u(n) f(k)u(n - k), n 2: 1.
k=l
= n) = fen), n 2: I,
P(Xk
. 1
(B37) hm u(n) = 00 •
n--+oo Lk=l kf(k)
An easy way to see this is to define a Markov chain Yn on to, 1, ... } with transition
probabilities
k 0 _ f(k+l) F(k + 2)
p( , ) - F(k + 1) and p(k,k+ 1) = F(k+ 1)'
=L
00
F(n) f(k).
k=n
The chain Yn can be interpreted as the age process associated with the renewal
process:
Yn = n - max{Sj : Sj ::: n}.
Note that with this definition, Yn increases by one at each unit of time, except that
it is reset to zero when a renewal occurs. Therefore Yn represents the age of the
object currently in service, and Yn = 0 corresponds exactly to a renewal occurring
at time n. To check that it is a Markov chain with the transition probabilities given
above, consider conditioning on the values Yo, Y1 , ••• , Yn - l and Yn = k. In this
18 Background and Tools
p(Xj + 1 = k + 1) p(Xj + 1 ~ k + 2)
and
P(Xj+l ~ k + 1) P(Xj+l ~ k + 1)
respectively.
From these observations, it is easy to see that
r = min{n : Yn = O}
starting from 0 has density f. Therefore, (B37) follows from the convergence
theorem for Markov chains, which says in this case that
. P o( Yn
lIm = 0) = -0-.
1
n->oo E r
See Chapters 3 and 5 of Durrett (1996) for more on this.
A property that will be useful in our applications of renewal theory in Part II
is logconvexity. A positive sequence {c(n), n ~ no} is said to be logconvex if the
successive ratios are monotone:
c(n) c(n + 1)
(B38) --- > n
c(n + 1) - c(n + 2) , ~no.
Note that the logconvexity of the sequence c(n) is equivalent to the nonnegativity
of the 2 x 2 determinants
De Bruijn and Erdos (1953) discovered the following connection between renewal
theory and logconvexity.
Theorem B39. Let f be any strictly positive probability density on {I, 2, ... } and
let u(n) be the corresponding renewal sequence. Iff is logconvex, then so is u.
The proof of this result depends on an identity that relates determinants based
on f to determinants based on u. Note the similarity between (B41) below and
the convolution equation (B35) that defines the renewal sequence.
Lemma B40. Let f be any probability density on {1, 2, ... } and let u (n) be the
corresponding renewal sequence. Then
Background and Tools 19
for n 2: 1.
Proof Expanding the determinants and using (B35) four times gives the following
for the sum on the right side of (B41):
L
n
L f(j)u(n -
n
fen + 2)u(n + 1) f(j)u(n - j) - fen + 2)u(n) j + 1)
}=l }=l
n
- fen + l)u(n + 1) L f(j + l)u(n - j)
}=l
n
+ fen + l)u(n) L f(j + l)u(n - j + 1)
}=l
= fen + 2)u(n + l)u(n) - fen + 2)u(n)[u(n + 1) - fen + 1)]
- fen + l)u(n + 1) [u(n + 1) - f(l)u(n)]
+ fen + l)u(n)[u(n + 2) - f(l)u(n + 1) - fen + 2)]
= fen + 1)[u(n)u(n + 2) - u 2 (n + 1)].
(B42) u(n)u(n + 2) 2: u 2 (n + 1)
Applying (B41) with n = m, we see that all the determinants on the right side are
nonnegative. Therefore (B42) holds for n = m as well.
sequence c is nonnegative, and for strictly positive sequences, the matrix is TP2 if
and only if c is logconvex. The generalization of Theorem B39 is the following:
L [u(k -
n
(B44) 1) - u(k)][ F(n + 2)F(n - k + 1)
k=l
- F(n + I)F(n - k + 2)].
gIves
n-l n
F(n + 2) L u(k)f(n - k) - F(n + 1) L u(k)f(n - k + 1)
k=O k=O
-u(n)F(n + 2) + u(n)F(n + I).
Now apply (B35) to get the result.
Before stating the next result, we observe that the logconvexity of the density
I implies the logconvexity of the tail probabilities F:
L
00
Theorem B45. Suppose {F(n), n :::: I} is logconvex. Then u(n) t, andfor n :::: 2,
Background and Tools 21
F(n+2) ]
u(n) - u(n + 1) >[u(n - 1) - u(n)] [ - F(2)
- F(n + 1)
(B46)
F(n+2) ]
+ [u(n - 2) - u(n - 1)] [ F(2) - F(3) .
F(n + 1)
F(n + 2) F(n - k + 2)
--->-----
F(n + 1) - F(n - k + 1)
for 1 :::: k :::: n. Therefore, u(n) ~ u(n+ 1) follows from Proposition B43 and induc-
tion. Now we know that all the summands on the right of (B44) are nonnegative.
Inequality (B46) comes from dropping all but the two summands corresponding
to k = nand k = n - 1.
Theorem B47. Suppose that fL is a shift invariant probability measure on {O, I}ZI
that puts no mass on the == 0 configuration, and let Y] have distribution fL. Define
Xb -00 < k < 00 by
(B48)
for all k.
00
=L L nP {I1(O) = 1, X k = m, X k+1 = m + n)
n=1 m=k
m~-I
L 11(i) = 0, l1(m + n) = 1
)
i=m+1
t;
00 00 m+n-I ( m-/-I
=~~ P 11(-1)=1, i~/I1(i)=k-l'l1(m-t)=I,
L
m+n-/-I )
l1(i) = 0, l1(m + n -I) = 1 ,
i=m-/+I
where the final step uses shift invariance. Making the change of variables
w = -t, u = m -t, v = m + n -t
00 0 u-k ( u-I
~ u~oow~ooP I1(W) = 1, i];ll1(i) = k - 1, I1(U) = 1,
are independent of t for all choices of n and of tl, ... , tn. It is said to be ergodic if
in addition it satisfies the following property: for every event G in path space that
is invariant under time shifts, P(I1. E G) = 0 or 1. The main result concerning
stationary ergodic processes is the Birkhoff Ergodic Theorem - see Section 6.2 of
Durrett (1996) for example. Here it is:
Background and Tools 23
Theorem B50. If 1]1 is stationary and ergodic, and if 1 is any bounded measurable
function on X, then
lit
-
t 0
1(1]s)ds ---+ EI(1]o) a.s.
as t ---+ 00. If 1]t is only stationary, the above limit exists a.s., but may not be
constant.
Theorem B50 can be applied to obtain an often useful criterion for ergodicity:
for all bounded measurable (or equivalently, all bounded continuous) functions 1
and g of n variables, all choices of SI < S2 < ... < Sn, and all n 2: 1.
For a proof of Theorem B51, see Proposition 4.11 of Chapter I of IPS, for
example. The general continuous functions that appear there can be replaced by
functions of finitely many variables to get the above statement.
One common way to construct stationary processes is to take a Markov process
1]t and use an invariant measure I-t as its initial distribution. The following result
gives an important connection between extremality of f.-t and ergodicity of the
resulting stationary process.
Theorem B52. Suppose that 1]1 is a stationary Markov process whose distribution
at each fixed time is the measure f.-t E g. Then each of the following is equivalent
to ergodicity of the process:
(a) f.-t E.9;.
(b)
lim
1--->00
~t 10t EF(1]o)G(1]s)ds = f f Fdf.-t Gdf.-t
111
- EI(TJo)g(TJs)ds = -
t o t
Ill! I(TJ)E~g(TJs)d/1ds
11 f E~g(TJs)dvds
0
={
= { 11 f S(s)gdvds
= f gd[{ 11 VS(S)dS].
Therefore, by Theorem B51 and the Markov property, TJI is ergodic if and only if
for every such I,
(B53) -
t
111
0
vS(s )ds ::::} /1,
(B54)
for two probability measures VI, V2. Then Vi is absolutely continuous with respect
to /1, so it may be written as Vi = fi /1, with 0 :s fi :s 2 and II + h = 1. Also,
since /1 E .9',
/1 = ~
2t
r
10
vIS(s)ds + ~
2t
r
10
v2S(s)ds.
and
G(TJ) = E~g(TJo, TJs 2 -S 1,··· , TJSn-Sl)·
Then, using the Markov property, for s > Sn - Sl, we can write
Branching Processes
Branching processes are very useful in making comparisons with interacting par-
ticle systems. Suppose {fen), n 2: O} is a probability density on the nonnegative
integers. Construct a discrete time Markov chain Xn on to, I, ... } by letting the
conditional distribution of Xn+! given Xn = k be the distribution of the sum of
k independent random variables with density I. Then Xn can be interpreted as
the number of individuals in the nth generation for a population in which each
member replaces itself at integer times with a random number of offspring, chosen
with density I.
Branching processes have been studied in this and more general forms for many
years. Athreya and Ney (1972) provides a good account of the theory, though all
the facts we will need can be found in any of several standard probability books
- see Chapter 4 of Durrett (1996), for example. Here we will summarize the basic
properties of a branching process. To rule out uninteresting special cases, we will
assume that 1(0) + 1(1) < 1; otherwise, Xn cannot grow.
The first question one asks is whether the survival probability
is strictly positive or not. Note in this connection that state 0 is absorbing for the
process. The answer to this question, and other aspects of the behavior of X n , are
basically determined by the mean of I,
plays a key role in the theory. The martingale convergence theorem implies that
M = lim Mn
n~oo
exists a.e.
L I(k)x
00
k = x.
k=O
(b) p > 0 if and only ifm > 1.
(c) If Xo = 1, m > 1, and Lk k 2 1(k) < 00 then EM = 1. In particular, M is not
identically zero.
26 Background and Tools
Assume for simplicity that there is a unique point x* E S where rr achieves its
maximum value, and normalize rr so that rr(x*) = 1.
Define now a queuing system TJr associated with q (., .) in the following way:
At any given time, TJr(x) E {O, 1, ... ,oo} is regarded as the number of customers
in queue x. For x i= y such that TJr(x) ::: 1, at rate q(x, y), a customer moves from
queue x to queue y. The effect is that TJr (x) decreases by 1 and TJr (y) increases
by 1. The process can be formally defined by monotonicity arguments, since we
have allowed the number of customers in a queue to be infinite. To do so, note
first that the process is well defined whenever the initial configuration TJ satisfies
For two different initial configurations TJ, l; that satisfy TJ(x) ::::: l;(x), XES, the
two resulting processes can be coupled so that TJr (x) ::::: l;r (x), XES at all later
times. Thus the process can be constructed for a general initial TJ by taking a
sequence of finite configurations TJn t TJ, and defining TJr = limn TJ~.
Suppose that p (.) is a function on S that satisfies
By convention, v{TJ : TJ(x*) = oo} = 1. We would like to see under what conditions
we would expect v to be invariant for the system. Queue x* is automatically in
equilibrium, so we compute formally for x i= x*, k ::: 1,
Background and Tools 27
It is not too hard to show that under this condition, v is invariant. See Andjel
(1982) for a proof of this type. Distributions of this sort that are invariant for
queuing systems are known as product form - see Kelly (1979). Note that by
(B56), (B57) and (B58),
L p(x)q(x, x*).
x=l=x'
By the observations made above, the rate of the resulting Poisson process is
no mystery. The proof is based on the reversed process with respect to v, i.e., the
process 11; whose generator is the formal adjoint of the generator of 111 in L2(V).
An example of the use of such reversal in the context of the exclusion process is
given in the proof of Theorem 1.17 in Part III. The reversed process is simply the
queuing system corresponding to the rates
28 Background and Tools
*( ) p(y)q(y, x)
q x, y = p(x) ,
an observation that explains assumption (B60). In the reversal, the roles of arrivals
and departures are interchanged. In particular, {D t , t 2: O} and {A;, t 2: O} have
the same distribution. The latter process is Poisson by construction, and hence so
is the former.
(B62) L iT (x)
---'--'--- < 00,
1 - p(x)
xoj=x'
"q(y, x)
SUPiT(Y) ~ - - < 00,
Y xo/=y iT (x)
and
" q(y, x)
sup [p(y) - iT(Y) ] ~ () _ () < 00.
Y xoj=x',y PX iT x
(B63)
(B64)
Remark. By applying the central limit theorem for the Poisson process, one im-
mediately deduces that the Dt in Theorem B59 and the X t in Theorem B61 satisfy
the central limit theorem.
The idea of the proof of this theorem is similar to that of Theorem B59.
The main difference is that the process is decomposed into a sum 1)t = 1)f + 1)~,
where the summands keep track of customers of two types, called black and red
respectively. The black customers are thought of as having entered the system
from x*, and the red ones are the ones that have entered the system from 00.
All customers at x* are labelled black. When some customer at an x =1= x* is
supposed to move, the customer that does move is chosen uniformly from among
all the customers at x, some of which will generally be black and others red.
The analogue of v for this bicolored process is the measure /L constructed in the
following way: 1) is first chosen according to v. Then each customer at queue
x =1= x* is labelled black with probability iT(x)/ p(x), and red otherwise. It turns
out that /L is invariant for the evolution of (1)f, 1)~). The reversed process (1)f*, 1)~*)
has a similar evolution to (1)f, 1);), except that the transition rates are different, and
Background and Tools 29
different for customers of the two colors. For a detailed description, see Ferrari
and Fontes (1994).
Decomposition (B63) comes about in the following way: R t is the departure
process of red customers, and Bt is the number of black customers in the system
(at queues other than x*) at time t. Since no red customers enter the system,
the decomposition is clear. Since the process is in equilibrium, B t is a stationary
process. To check (B64), note that
Bt = L 1J~(x),
x =/=x *
so that
=n E[
P(X) +n(x)(eE _l)]ryt(X)
n
x=/=x* p(x)
1- p(x)
- x=/=x* 1 - p(x) - (e E - l)n(x)'
n--+oo n
in probability, and
(B66)
.
hm
L~:6 E[(Mk+l - Md, IMk+1- Mkl > EJn]
=0
n--+oo n
for every E > O. Then
Mn
In => N(O, a 2),
30 Background and Tools
Our main applications of this result will be in the case that {Mn+ 1 - Mn} are
identically distributed. Note that in this case, (B66) is equivalent to E(M 1 -Mo)2 <
00.
Part I. Contact Processes
1. Preliminaries
The contact process is often thought of as a model for the spread of infection. The
collection of individuals that may be infected at any given time is taken to be the
set of vertices of a connected, undirected graph S. For such a graph, the degree of
a vertex x is the number of vertices y that are connected to x by an edge. The main
examples to be treated below are the d dimensional integer lattice Zd (in which
the degree of each vertex is 2d), and the homogeneous tree Td in which every
vertex has degree d + 1. In general, we will assume that the degrees of the vertices
are uniformly bounded. A path through S is a sequence of consecutive edges in
the graph, and its length is the number of edges used. The distance between two
vertices x, yES is the minimal length of a path from x to y, and is denoted by
Iy -xl·
While we will use the language of infection in talking about the contact process,
this process has arisen in other contexts, such as Reggeon Field Theory in high
energy physics. The contact process is a fundamental model that is often used
as a test case for new techniques or results that might apply more generally. It
has been the subject of intensive research, both rigorous within the mathematics
community, and numerical in the physics literature. An understanding of it and of
the tools that are used in its study is an important first step toward being able to
work with other models of interacting particle systems.
Here # denotes cardinality. At times, we will also use IAI to denote the cardinality
of a finite set. In words, infected individuals recover from their infection after an
exponential time with mean 1, independently of the status of their neighbors, while
32 Part I. Contact Processes
I if 'fJ(x) = I,
c(x, 'fJ) ={
A Lly-xl=l 'fJ(y) if 'fJ(x) = O.
The fact that these rates uniquely define a well behaved Markov process is a
consequence of Theorem B3. Often we will denote the initial state of the process
by a superscript: A~ is the process with initial state A. At other times, the usual
Markov process notation will be used: pA [At E -]. A key feature of these rates is
that the infection cannot appear spontaneously. In other words, (0 is a trap for the
process.
The contact process we have just defined is often called the basic contact
process. Another version of the process will be considered in Part II, and still
other versions have been studied elsewhere - see Section 5 for details. In Part I,
we will omit the word basic from the name of the process.
An alternative way of thinking about the contact process is as follows: Infected
sites become healthy at rate I as before. In addition, each infected site generates a
new infection at rate A at each neighboring site. If the neighbor is already infected,
this new infection has no effect.
This point of view leads to a useful comparison with a simpler process known
as a branching random walk. This is a process I;t with a state space that is a
reasonable subset of to, 1,2, ... }s. It is not particularly important what the word
reasonable means here. Suffice it to say that it should not allow for explosions
to occur. Regarding I;(X) as the number of particles at x, the process evolves
according to the following rules: Particles die at rate I, and generate offspring
at each neighboring site at rate A. From this perspective, the contact process can
be thought of as a branching random walk in which particles at the same site
coalesce. Alternatively, the branching random walk can be regarded as a contact
process in which we keep track of the multiplicity of infections. Mathematically,
the branching random walk is easier to study because the offspring of different
parents evolve independently.
*
-4 -3 -2 -1 o 2 3 4
Figure 1
an infection arrow ---+ is placed from (x, t) to (y, t). This construction is shown
in Figure 1 in case S = Zl.
An active path in S x [0,00) is a connected oriented path which moves along
the time lines in the increasing t direction without passing through a recovery
symbol, and along infection arrows in the direction of the arrow. For example, in
Figure 1, there is an active path from (2,0) to (1, t), but not to (2, t). The process
A~ with initial state A can be obtained explicitly by setting
In Figure 1, for example, A)O} = 0, while A)l} = {O, I}. Generally speaking, the
symbol P with no superscript will refer to a probability computed with respect to
the probability space on which the Poisson processes are defined.
One advantage of the graphical construction is that it provides a joint cou-
pling of the processes with arbitrary initial states. In fact, it provides a monotone
coupling, in the sense that
(1.1)
Thus the graphical representation allows us to conclude that the contact process is
attractive. Of course, it is easy to see this by checking condition (B 14) directly. It
also follows from the graphical representation that the contact process is additive:
(1.2) A AUB - AA U AB
t - t t·
34 Part I. Contact Processes
f
particular,
fdlLt
1. Preliminaries 35
(1.4) v = t-+oo
lim JLt
exists. This is the biggest (or upper) invariant measure of the process.
The fact that v is invariant comes from Theorem B7(e). To see that it is the
biggest invariant measure, let v be any invariant measure. Then v S JLo, so that
(1.5) v({0}) = 0 or 1.
To see this, suppose p = v({0}) < 1. Then the conditional measure v(·) = v(. I
{0}c) is again invariant, and satisfies v S v, since
whenever v 1= 8o.
Duality
(1.7)
for all A, B c S (Theorem 1.7 on page 266 of IPS). Here we have used At and
B t to denote the contact process with initial states A and B respectively, in order
to avoid confusion. The most general way of proving relations such as (1. 7) is via
a generator computation. Letting H(A, B) = 1{AnBj0}, it is not hard to check that
decreasing t direction, reversing the directions of the arrows, and then using the
basic symmetry of the graphical construction.
Taking B = S in (1.7), letting t --+ 00, and using the fact that the event
{At =1= 0} is monotone in t, we see that the survival probability
(1.9)
whenever v =1= 80 .
The self-duality (1.7) and graphical representation can be used to construct
invariant measures that are potentially different than 80 and V. To do so, take
B C S, use the shorthand {At n B =1= 0 i.o.} (infinitely often) for the event that
At n B =1= 0 for a sequence of times t t 00, and {At n B =1= 0 f.o.} (finitely often)
for the complement of this event. Define the measure VB by prescribing the cylinder
probabilities in the following way. For finite disjoint G, H C S, put
It is easy to check that these cylinder probabilities are consistent. To check that
VB is invariant, we need a little notation. For any probability measure II and any
A C S, define
/L(A) = /l{T) : T) nA =1= !o},
so that
vB(A) = P(A~ n B =1= 0 i.o.).
Then by duality,
(VBS(t))~(A) = EAvB(A t ) = vB(A),
where the second equality comes from the fact that {A;xJ n B =1= 0 i.o.} is an
invariant event. Note that V0 = 80 , and Vs = V. When B =1= 0 is finite, VB is the
invariant measure introduced by Salzano and Schonmann (1997). Whether or not
V B is different from 80 and v depends very much on the nature of the graph S.
Convergence
The general problem of determining when convergence of IIS(t) as t --+ 00 occurs
is difficult, and the answer depends heavily on the structure of the graph Sand
the value of A. However, if S is appropriately homogeneous (e.g., if S = Zd or
Td), and if the initial distribution II is also homogeneous and satisfies 1I(0) = 0,
then it is not too hard to prove that
as t --+ 00, where:::} denotes weak convergence. For example, this is Theorem
4.8 on page 309 of IPS if S = Zd and J-L is translation invariant. One consequence
of (1.1 0) in this case is that there are at most two extremal translation invariant
measures in g.
We tum next to the more important concept of complete convergence. This
term refers to the following property: For every initial configuration A,
(1.11)
where
Ci"A = pA(A I =f 0 V t ::: 0)
is the survival probability. Again an immediate consequence of property (1.11) is
that all invariant measures are mixtures of v and 80 . The main tool we will use in
proving complete convergence is the following:
One inequality in (1.15) is easy to see: Using the graphical representation and the
independence of the Poisson processes used in it for disjoint parts of space-time,
p(A~nB =f 0)
= P(3 an active path from B)
(x, 0) to (y, 2t) for some x E A, y E
.:s P(3 an active path from (x, 0) to (z, t) for some x E A, z E S)
x P(3 an active path from (z, t) to (y, 2t) for some z E S, Y E B)
= P(A~ =f 0)P(A~ =f 0).
liminf pA(Au n D
u----+oo
*- 0)::: lim pA(TB(n) < oo)pD(TB(n) <
n----+oo
00).
But by (1.13),
pA(TB < 00) ::: aA
for all A C S and all finite B C S, and this completes the proof of one direction.
For the other direction, suppose (1.11) holds for all A. Then (1.15) holds, and
(1.14) follows immediately from this, by taking A = B = B(n) and using (1.9).
To check (l.13), use (1.15) to conclude that
n --+ n + 1 at rate n K A
(l.l8)
provided that IAI = 1. To see this, simply ignore recoveries in the contact process,
and note that any set of size n has at most nK neighbors.
Let Tl, T2, ... be independent, exponentially distributed random variables with
means
1
ETn = --.
nKA
These can be thought of as the holding times at the various integers for the process
Yt . Therefore, for e > 0, the exponential form of Chebyshev's inequality gives
= eet N nKA [ N
et- e ]
< ex
DnKA+e - p ~nKA+e '
were we have used the inequality 1 - x :::: e- X in the final inequality. Taking
e= mKA for any integer m leads to
since
L --
N I
::: L I N m+n+ l 1
-dx =
I m+ N + l 1 m+N +1
-dx = log - - - -
m +n
n=l n=l m+n X m+l X m+1
(l.l9)
Now we can use (1.19) to check continuity in A of various quantities. The idea
is to take AA < AB, and let At and B t be the contact processes with parameters
AA and AB respectively with a common initial configuration A, coupled by using
the graphical representation with the same Poisson processes associated with the
recovery symbols, and Poisson processes associated with the infection arrows that
are obtained as follows:
Conditional on the Poisson processes {Nx }, {N(~,y)} for all x, y, the number of
infection arrows that could lead to an extra infection in the process Bs up to time
t has a Poisson distribution with parameter
It follows that
P(B s =1= As for some s :s t) :s 1 - EA exp [ - (AB - AA)K lot lAs Ids ]
(1.20)
:s (AB - AA)K lot EAIAslds,
where we have used the inequality 1 - e- u :s u in the last step. Using (1.19)
and (1.20), continuity in A can be easily shown for any reasonable function of the
process on a finite time interval. Rather than formulate a general theorem here,
which would necessarily have unpleasant assumptions, we will show how to use
(1.19) and (1.20) to prove continuity when the need arises later. However, the
idea should be fairly clear: (1.19) says that the set of sites ever infected by time
t is not too large. But then taking AB - AA small in (1.20) says that with large
probability, the two processes agree up to time t. One place where this argument
is worked out in detail is the proof of Proposition 4.33.
Rate of Growth
Bound (1.19) says nothing about how rapidly the cardinality of At grows as t t 00.
If there were no restriction on the number of infections per site (i.e., if this were
a branching random walk), then the size of the infection would in general grow
exponentially in time. However, this restriction leads to slower growth in general,
and polynomial growth on Zd, for example. To see this, let At be the contact
process on Zd, and let Bt be the process obtained from At by suppressing all
recoveries. Let Pt (x, y) be the transition probabilities for the simple random walk
on Zd that moves to each neighbor at rate A.
Proposition 1.21.
1. Preliminaries 41
Proof Let l;t be the branching random walk with no deaths: l;t (x) increases by 1
at rate
A l;t(Y)· L
ly-xl=!
Then the means of l;t satisfy the system of differential equations
Since BiO} can be coupled to l;t with l;0(0) = 1 and l;o(x) = 0, x =F 0 so that
BiO} C {x : l;t (x) ::: I},
we have
p(x E BiO}) ::: p(l;t(x) ::: 1) ::: El;t(x) = e2dtA pt(0, x),
and the result follows.
In order to control the right side of the inequality in Proposition 1.21, we need
the following weak form of the large deviations bound for random walks.
Proof It is enough to prove this for d = 1, since the one dimensional result can
be applied to the d coordinates of the d-dimensional random walk. By symmetry,
it is then enough to prove (1.23) where the sum is taken over positive x's only.
Let Xt be the one dimensional random walk starting at 0 that moves to each
neighbor at rate A. Then for y ::: 0, the exponential form of Chebyshev's inequality
gives
42 Part I. Contact Processes
EIAjO}l k S c(l + t kd ).
Then breaking up the multiple sum below according to whether any IXi I ~ nand
if so, which IXi I is largest, we see that
Now use the Schwarz inequality, Proposition 1.21, and Lemma 1.22, replacing n
by bt, where b is chosen to satisfy (1.23) for an a > 4dA. The result is that
The second summand on the right tends to zero as t ~ 00, since the expression
inside the square root grows polynomially in t, and this gives the result.
Relation (1.8) leads to a second interpretation of AI: V is the point mass at the
empty set 80 if A < AI, but is nontrivial if A > AI .
A host of questions is implicit in these definitions. For example,
(a) Is Al > O?
(b) Is A2 < oo?
(c) Is Al < A2?
(d) What happens when A = AI or A = A2?
(e) What is the limiting behaviour of the process for initial configurations other
than S itself? In particular, what are the invariant measures for the process.
Here are a number of facts that are either easy to see, or are proved in IPS:
(a) If all vertices of G have degree at most K, then
1
(1.26) AI> - .
-K
This is an easy consequence of comparison (1.3) of the contact process with a
branching random walk ~t. To see this, simply note that
1
(1.27) AI>--
- 2d-l
(page 166 of IPS). If d = 1, it has been further improved to Al 2: 1.539 (page
289 of IPS).
(b,c) If S contains a copy of Zl, then both critical values for S are bounded
above by the corresponding critical value for Zl by an easy coupling based on
the graphical representation. For S = Zl,
(1.28)
and in fact the process with A = 2 survives (Theorem 1.33 on page 274). For
S = Zd,
(1.30) · d'1\.1(d)
11m -- ~
d~oo 2
«4.7) on page 308 of IPS).
Very little was known about the answers to questions (c), (d) or (e) at the
time IPS was written outside of the case S = Z I. In that case, it was known that
44 Part I. Contact Processes
complete convergence (1.11) holds for A > A\ = A2. (Theorem 2.28 on page 284
of IPS). For larger values of d, results such as the complete convergence theorem
were known to hold for very large A, but not for all A.
Preview of Part I
The main objective of the next section is to give complete answers to questions
(c), (d) and (e) when S = Zd. It turns out that the answers are those that were
expected based on what was known about the one dimensional case, but the proofs
are quite different. Following this, we will derive exponential bounds for various
quantities: In the supercritical case, if the process does die out, it does so very
quickly. In the subcritical case, the process does die out very quickly.
Section 3 deals with the question of how the critical behavior of infinite systems
is reflected in the behavior of large finite systems. For the system on {I, ... ,N}d
starting with all sites infected, the process dies out after a time that is logarithmic
in N in the subcritical case (i.e., A is subcritical for the infinite system), and after
a time that is exponential in N in the supercritical case. Some of the results in
this section are based on theorems proved for the infinite system in Section 2.
Section 4 gives answers to questions (c), (d) and (e) for contact processes on
homogeneous trees. We will see that not only the techniques, but also the results,
tum out to be quite different from the Zd case, and it is this fact that makes them
so interesting. In particular, we will see that, unlike the case of Zd, A\ < A2, and
for values of A between the two critical values, there are infinitely many extremal
invariant measures.
Our main objectives in the first part of this section are to prove the following for
the contact process on Zd:
(a) There is no intermediate phase, i.e., A\ = A2.
(b) At dies out at this common critical value.
(c) Complete convergence (1.11) holds for all A.
Following this, we will prove some exponential bounds in the supercritical case,
and then focus on the subcritical case, proving that the process dies out exponen-
tially rapidly.
Statements (a) and (c) were proved in IPS (Theorem 2.28 on page 284) for the
case d = I, using arguments based on edge speeds that work only in one dimen-
sion. Bezuidenhout and Grimmett (1990) developed entirely different techniques
that led to proofs of all three statements for all d ~ I. Note that even though we
have stated (a), (b) and (c) separately, (c) easily implies (a), so the main point is
to prove
(d) At dies out at A\, and
2. The Contact Process on the Integer Lattice Zd 45
lim p(A~-n,nld
n-+oo
-+ 0 V t :::: 0) = 1.
For L :::: 1, let LAt be the truncated contact process defined via the graphical
representation, but using only paths with vertical segments corresponding to sites
in (-L, L)d and infection arrows from (x, ,) to (y, ,) with x E (-L, L)d, The
next two results combine to say that there are many infected sites in an orthant
of the top of the (large) space-time box (-L, L)d x [0, t], In these and the results
that follow them, arguments based on correlation inequalities play a prominent
role,
46 Part I. Contact Processes
Remark. Note that the order of the limits above is important. Since the contact
process on a finite set dies out,
for every L.
it follows that
For an initial configuration of cardinality n, the probability that all n sites recover
before there is any infection is at least the probability that the maximum of n
independent exponential random variables with parameter I is smaller than the
minimum of 2dn independent exponential random variables with parameter A.
Therefore, since this mininmum is exponentially distributed with parameter 2dnA,
1 ]IA'I
P(A t = 0 for some tl,¥') > [ ,
- 1 + 2dAIAsi
(2.5) lim
t-'>oo
IAtl = 00 a.s. on {As =fo 0 V s ::::: OJ.
Proof Let Xl = ILA~-n,nld n [0, L)dl, and X2, ... ,X2d be defined similarly with
respect to the other orthants in R d , so that
2. The Contact Process on the Integer Lattice Zd 47
Next we tum to the sides of the space-time box. For x E Zd, write x
(XI, ... , Xd) and Ix I = maXi Ix;!. The inequality x ::: 0 will mean Xi ::: 0 for all i.
Let
S(L, T) = {(x, s) E Zd X [0, T] : Ixl = L}
be the union of the sides of the box (- L, L)d X [0, T] and put
which is the set of space-time points that are infected by the truncated process.
Let NA(L, T) be the maximal number of points in a subset of S(L, T) n LAA
with the following property: If (x, Sl) and (x, S2) are any two points in this set
with the same spatial coordinate x, then lSI - s21 ::: 1.
Proposition 2.S. Suppose L j t 00 and 'Fj t 00. For any M, N and any finite
A C Zd,
Proof Let .'.¥L,T be the a-algebra generated by the Poisson processes from the
graphical representation in (-L, L)d x [0, T]. The first step is to prove that if
A c (-L, L)d, then
(2.9)
p( A: = 0 for some SI.'.¥L'T) ::: [1 ::~Jk
a.s. on {NA(L, T) + ILA:I ::: k}.
To begin to check (2.9), note that for each point x E LA: there is probability
(1 + 2d'A)-1 that a recovery symbol occurs on the time line above (x, T) before
any infection arrows occur emanating from that time line. To see this, consider this
time line {x} x [T, 00), and the Poisson processes associated with it in the graphical
48 Part I. Contact Processes
representation. The first recovery symbol after time T comes after an exponential
time with parameter 1, while the first infection arrow to a given neighbor of
x comes after an exponential time with parameter A. These exponential times
are independent. An elementary computation shows that if O"i are independent
exponential random variables with parameters Yi respectively, then for any j,
be a maximal set of points on this time line in S(L, T) n LAA with the property
that each pair is separated by at least distance 1. Assume j :::: 1, since otherwise,
nothing on this time line can contribute to survival. Let
These events are independent, since they refer to disjoint parts of the graphical
representation, so the probability that none of the points on this time line in
S(L, T) n LAA contributes to survival of the process is at least
e-4dA ]'
[
1 +2dA
The numerator comes from the points in I, while the denominator comes from
points in the complement of I in {x} x [0, T]. Considering the contributions from
all the various x's gives (2.9).
Write G = {A~ = 0 for some s} and H j = {NA(L j , 'Fj) + I LjA~ I :s k} for a
fixed k. By the martingale convergence theorem, }
2. The Contact Process on the Integer Lattice Zd 49
{Hj i.o.} c G.
It follows that
Next, write
Proof Let XI = N~-n,njd (L, T), and define X 2 , ••. , X d2 d similarly by replacing
the first coordinate in the definition of S+(L, T) by any of the d choices of
coordinates, and the positive signs used in the definition of S+(L, T) by any of
the 2d choices of signs. These random variables are identically distributed, and
are positively correlated by Corollary B18. Furthermore,
Therefore
(2.13)
and
Proof The idea of the proof is to use Propositions 2.2,2.6,2.8 and 2.11 to construct
a big space-time box with many infected points on its boundary, and in fact, on
certain orthants of its boundary. If there are enough infected points, then at least
one of them will generate an infected cube of side length 2n in the extra time
period of length 1 that we are allowing ourselves. We will start with a 0 < 8 < 1,
and show at the end how to choose it in terms of the given E > O.
Given 8 > 0, use Proposition 2.1 to choose an n so that
(2.l5)
Choose N so large that any N points in Zd will contain a subset of at least N'
points, each pair of which is separated by an Loo distance of at least 2n + 1, where
N' is chosen so large that N' independent trials with success probability
(2.16)
for each j ::: 1. Applying Proposition 2.8 with M and N replaced by M d2 d and
N2 d respectively, it follows that for some j,
(2.17)
(2.18)
and
(2.19)
By our choices of Nand M, and the fact that the Poisson processes used in the
graphical representation are independent on disjoint space-time regions, (2.18) and
(2.19) then imply that
p( L+2nA~-:(ld :l x + [-n, n]d for some x E [0, L)d) ::: [1 - 82- d][l - 8],
and
Proof Given £ > 0, choose n, L, T so that (2.13) and (2.14) are satisfied. Using
(2.14) first, we see that with probability :::: I - £, there exist x and t with the prop-
erty appearing in that probability. Now consider the process restarted at time t + I
with initial state x + [-n, n]d. Use the strong Markov property and monotonicity
and apply (2.13) to conclude that (conditionally on the first event considered) with
probability:::: I - £ there is a y so that y - x E [0, L)d and the restarted process
at time T + I covers y + [-n, n]d. Putting these statements together, it follows
that
Next we will carry out the fundamental construction that will shortly lead to
the comparison with oriented percolation. Recall that an active path is a connected
oriented path in the graphical representation that moves along the time lines in the
increasing t direction without passing through a recovery symbol, and along the
infection arrows in the direction of the arrows.
P (3(Y, t) E [a, 3a] x [-a, a]d-l x [Sb, 6b] and there are active paths
(x, s) + ([-n, n]d x {O}) to every point in (y, t) + ([-n, n]d x {on) : : 1- E.
Proof The idea is to apply Proposition 2.20 repeatedly (between four and ten
times) to move the center (x, s) of a cube in four to ten steps to the center (y, t)
of a cube in such a way that if the first cube is fully infected, then so will be the
final one. In doing so, it is important to remember that while Proposition 2.20 was
stated for x in the positive box [L + n, 2L + n] x [0, 2L)d-l, (2.21) is true by
symmetry if this box is replaced by boxes obtained from it by reflections about
the coordinate planes in Zd. Thus we are free at each stage of the construction to
use any sign for each of the d coordinates.
2. The Contact Process on the Integer Lattice Zd 53
Finally we are ready for the comparison with (independent) oriented site perco-
lation that provides the converse to Theorem 2.12. To avoid confusing the percola-
tion process with the contact process, we will denote the oriented site percolation
process defined prior to Theorem B24 by B k .
Theorem 2.23. Suppose the condition appearing in Theorem 2.12 is satisfied. Then
for every p < 1 there are choices ofn, a, b with n < a so that the following holds:
If the initial configurations Bo and A satisfy
j E Bo implies A J x+[ -n, n]d for some x E [a(4j -1), a(4j + 1)] x [-a, a]d-l,
for some
(2.24b) (x, t) E [a(4j -2k-l), a(4j -2k+ 1)] x [-a, a]d-l x [5bk, b(5k+ 1)].
In particular, At survives.
Remark. As will be clear from the proof, this coupling can also be achieved if
At is replaced by the process obtained using only the Poisson processes in the
graphical representation that correspond to x E Zd with Ix;! :::: 5a, 2 :::: i :::: d.
54 Part I. Contact Processes
Proof of Theorem 2.23. There are two stages in the construction. In the first,
we do not try to achieve the conditional independence properties required in the
definition of the oriented percolation process. The Bernoulli random variables
needed in constructing the Bk 's are generated recursively in k, using the graphical
representation on which the construction of the contact process is based. Suppose
{Bi' i :s k} have been constructed. If Bk n {j - 1, j} =1= 0, then (2.24) holds for
j -lor j. The construction provided by Proposition 2.22 succeeds with probability
2: 1 - E, so this can be used to generate the appropriate Bernoulli random variable,
provided that 1 - E > p.
This completes the first stage ofthe construction. We are not done yet, however,
since these Bernoulli random variables are not independent. However, they are
m-dependent for some m - see the definition of m-dependence following the
statement of Theorem B26. It is for this reason that we wanted the active paths
occurring in the statement of Proposition 2.22 to remain in [-5a, 5a]d x [0,6b].
Because of this m-dependence, we can use Theorem B26 to construct independent
Bernoulli random variables that lie below the dependent ones, provided that we
take 1 - E » p. It is important, of course, that the value of m not depend on the
choices of a, b, n, but this is clearly the case.
To check that At survives, it is enough to take p large enough so that the
conclusions of Theorem B24 are satisfied.
Proof For part (a), take A > A]. Then At survives. By Theorems B24, 2.12 and
2.23, there exist n, a, b and a corresponding supercritical oriented site percolation
process Bk with Bo = {OJ which lies below it in the sense of (2.24). Again by
Theorem B24, P(B2k = k) is bounded below in k, so that P(B2k = k i.o.) > O.
Therefore by (2.24), with positive probability, there are infinitely many choices of
k so that
A\-n.n]d :> x + [-n, n]d for some (x, t) E [-a, a]d x [lObk, b(lOk + 1)].
For every x E Zd,
p(x E A\-n.n]d)
is strictly positive for t > 0 and continuous in t (by Theorem B3). Therefore it is
bounded below by a positive number for (x, t) in compact subsets of Zd x (0, 00).
For the process with initial state {OJ, there is positive probability of covering
[-n, n]d by time 1. By the Markov property and monotonicity, we may therefore
2. The Contact Process on the Integer Lattice Zd 55
consider the process starting with [-n, n]d instead of {OJ. Every time a box of side
length 2n that is a bounded distance from the origin is covered by the process, there
is a positive probability that the process will cover 0 one unit of time later. This
is a consequence of the observation at the end of the last paragraph. Therefore,
Since we now know that Al = A2, we will denote their common value by Ae for
the rest of this section and Section 3. The next result is the complete convergence
theorem. Since the process dies out for A ::: Ae, the only case of interest is A > Ae.
as t ---+ 00, where::::} denotes weak convergence, and (XA is the survival probability
Proof We need to check the conditions of Theorem 1.12. The first one is easy:
Let G be the event
(2.28)
Here is the argument that leads to (2.28): First, G is an invariant event (i.e.,
invariant under time shifts), and is therefore a tail event. Therefore the equality in
(2.28) is just the Markov property at time s. To check the inequality in (2.28), let
a = inf{t : 0 E At},
and let g;;: be the a-algebra associated with this stopping time. On the event
{x E As},
56 Part I. Contact Processes
P(O E A)x) for some t) = P(x E A)O) for some t) ::: p(O)(G),
and therefore
(2.29)
By (2.26) p(O)(G) > 0, and by the martingale convergence theorem, P(GI~ -+
1Ga.s. as s -+ 00. So, (2.29) implies that
At :) Uj Aj,r.
if m ::: 1 is odd,
p(A~-6ma,6ma)d n (-6ma, 6ma)d = 0) :s [p(o rf. At6a ,6a)d)r.
(a)
and
(b)
Remark. One of the reasons for our interest in (a) is that it provides exponential
rates of convergence to the upper invariant measure. To see this, let ILl be the
distribution at time t of the contact process with initial configuration Zd. By
duality (1.7), for any finite A,
Proof of Theorem 2.30. The idea of the proof of (a) is to use the percolation
construction of Theorem 2.23 repeatedly. Choose a p < 1 so that the oriented
percolation process Bk with parameter p satisfies the conclusions of Theorem
B24. By Theorems 2.12 and 2.23, there are choices of n, a, b with the coupling
property (2.24). Let
Then
P(A~ ::::> x + [-n, n]d for some x E Zd) 2: 8
for all A =1= 10 by monotonicity. Start the process with any A =1= 10. We will
define a random variable N (so that N + I is a stopping time with respect to the
58 Part I. Contact Processes
either A: = 0 or rA = 00. In other words, a ::: rA on the event {rA < oo}.
Therefore,
p(t < rA < 00) :::: pea > t).
By the construction, Land N; have exponentially decaying tail probabilities. By
Theorem B24, the same is true of M;. It follows that a has exponentially decaying
tail probabilities. To see this, take E\ > 0 so that Ee E1L < 00, and then take E2 > 0
so that
Then
For the proof of part (b), consider the contact process At on the tube in Zd
given by T = {x E Zd : Ix; I :::: Sa, 2 :::: i :::: d}, and note that it is enough to
prove the analogue of (b) for At. To see this, suppose that (b) holds for At. write
Zd as the disjoint union of translates Tn of T, and let An,t be the contact process
restricted to Tn. Then
A -AnT,
At :J UnAn,t '
and the An,r's are independent, so by the analogue of (b) for At.
P(r A < (0) :::: np(A:,~T'
n
= 0 for some t) :::: n
n
e-fIAnT,1 = e- fIAI .
To prove the analogue of (b) for At, we will use Theorem 2.23 again, so fix
a large p and the corresponding n, a, b. Write T as the disjoint union
Take 8 > 0 so that the contact process restricted to (- 2a, 2a] x [- Sa, Sa]d-I
starting at any singleton E (-2a,2a] x [-Sa, 5a]d-1 covers [-n, n]d at time I
with probability ~ 8.
Given any initial configuration A for At. thin it so that the resulting set contains
at most one point in each of the boxes appearing on the right of (2.32). The
cardinality of the resulting set will be at least a constant multiple of IA I. For
each point x in this thinned set, run a contact processes restricted to its box up
to time 1. These contact processes are independent for different x's, so that with
the exception of an event of exponentially small probability (exponentially small
in the number of x's in the thinned set, and hence exponentially small in the
cardinality of A itself), at least a fraction 8/2 of the processes starting at these x' s
will at time 1 cover the cube of side length 2n + 1 centered at the center of the
corresponding boxes. We conclude that for some E > 0,
P(A~ = 0 for some t) :::: e- E1A1 + P(A~ = 0 for some t, A~ contains EIAI
boxes of the form (4ja, 0, ... ,0) + [-n, n]d).
The first term on the right corresponds to the exceptional event with the exponen-
tially small probability mentioned above. On the complementary event, at least
some fraction of the boxes of side length 2n + 1 will be covered at time 1, and
that leads to the second term on the right. Now use the Markov property at time 1,
together with Theorems 2.23 and B24 to conclude that for some E > 0 and C,
(2.33)
[a(A)t :s Ce-EkIAI.
Taking kth roots and letting k --+ 00 leads to (2.33) with C = 1.
(2.35)
and
(2.36) P(x E AiO) for some s 2: °and some Ixl 2: ct) :s ce- t + P(A;O) =f= 0).
Proof For the first statement, use the Schwarz inequality to get
(2.37) P(x E B/O) for some Ixl 2: ct) + P(A;O) =f= 0).
Using Proposition 1.21 and then Lemma l.22, the first summand in (2.37) is at
most
e2dtA L
Pt(O,x):s ce- t ,
Ixl~ct
Thus motivated, we are now ready to launch into the proof of exponential
decay of the survival probability I(A, t) = P(A;O) =f= 0) in the subcritical case.
We have included explicitly in the notation the dependence on the infection rate
A for reasons that will become clear below. Note that I is increasing in A and
decreasing in t. The idea of the proof of exponential decay is the following:
2. The Contact Process on the Integer Lattice Zd 61
(2.38)
a a
Cl-Iog I(A, t) - C2t-Iog I(A, t) 2:
t
2,
1+ fo
I -
aA at I(A, s)ds
where C 1 and C2 depend on A (mildly) but not on t. This will be valid whether or
not the process is subcritical, though it is not very interesting in the supercritical
regime.
(b) Secondly, use (2.38) to show that if lim Hoo I(A, t) = 0 for some A, then
I(A, t) decays exponentially in t for all strictly smaller values of A. To see that
(2.38) might in fact imply this, suppose that fooo I (A', t )dt < 00, so that the right
side of (2.38) grows linearly in t. It is easy to check that if either of the terms
on the left of (2.38) grows linearly in t for an interval of A's, then I(A, t) decays
exponentially in t for A'S in that interval.
We begin by evaluating the partial derivatives that appear on the left side of
(2.38). The first two lemmas are versions of what is known as Russo's formula
in percolation. (See Section 2.4 of Grimmett (1989), for example.) Let XI be the
number of infection arrows in the graphical representation with the property that
if the arrow is removed, then there is no active path from (0,0) to (z, t) for
any Z E Zd. Such arrows are known as pivotal. We will often use PI. or E).. to
indicate that probabilities or expectations are taken with respect to the graphical
representation with infection rate A.
Lemma 2.39.
Proof Take h > 0, and think of constructing the graphical representation with
parameter A from that with parameter A + h by independently deleting infection
"*
arrows with probability h/(A+h). If A)O} 0 for the graphical representation with
parameter A + h and a pivotal arrow is deleted, then A)O} = 0 for the graphical
representation with parameter A. Therefore
I(A + h, t) -
h
I(A, t) =~P
~ J.+h
(A{O}.../.. 0 X
I -r- , I
= k)~
h
[1 _(_A_)kJ
A+ h
+ 0
h
(1).
k=l
The Oh (1) term comes from the possibility that two or more arrows are deleted
that together lead to the elimination of all active paths to time t, even though no
one of the deleted arrows is itself pivotal. The fact that the total rate at which any
of these arrows is deleted has finite expectation comes from (1.19).
Now pass to the limit, using (1.19) and dominated convergence for justification,
to obtain
(2.40)
62 Part I. Contact Processes
Combining (1.19) and (1.20), we see that the + can be removed on the right of
(2.40), and then that the right side of (2.40) is continuous in A. It follows that the
partial derivative of I(A, t) with respect to A exists, and
a I(A, t)
A- = E).. ({O})
X t , At =1= 0 .
aA
Dividing by I(A, t) gives the result.
For the next result, let Yt be the total length of all vertical segments in the
graphical representation with the property that the addition of a recovery symbol
at any point in the segment means that there is no active path from (0,0) to (z, t)
for any Z E Zd in the resulting structure. Maximal segments with this property
are known as pivotal intervals. A convenient way of thinking of pivotal arrows
and pivotal intervals (on the event AjO} =1= 0) is that taken together, they form the
intersection of all active paths from (0,0) to Zd X it}.
Lemma 2.41.
Proof When we set up the contact process in Section 1, we placed the recovery
symbols in the graphical representation with rate 1. In this proof, it is convenient
to place them at a general rate 0 > 0. We will incorporate the 0 into our notation
in the obvious way. The scaling property of the Poisson process (i.e., if N (t) is a
Poisson process with rate A, then N*(t) = N(et) is a Poisson process of rate AC)
implies that
1(0, A, t) = l(l, A/O, Of) = I(A/O, ot),
so that
(2.42) a 1(0, A, t) I
- ao a I(A, t)
= AaA a I(A, t).
- ta-
8=1 t
Therefore, we need to compute the left side of (2.42), which we will do in a
manner analogous to the proof of Lemma 2.39.
Take h > 0, and construct the graphical representation corresponding to recov-
ery rate 0 + h from that with recovery rate 0 by adding recovery symbols at rate h.
Conditional on the graphical structure with recovery parameter 0, the probability
that one (or more) of these additional recovery symbols is placed in some pivotal
interval is 1 - e- hY" so that
a
- all f(8, A, t) = E8,;" [Yt,
{O}
At =1= 0] .
Taking 8 = 1, combining this with (2.42), and dividing by f(A, t) gives the
required result.
Next let Zt be the number of pivotal intervals. We will bound this in terms of
X t and Yt as follows:
Lemma 2.43.
Proof For the first inequality, it suffices to note that every pivotal arrow begins at
the end of a pivotal interval, and ends at the beginning of another pivotal interval.
Therefore, 1 + X t ::: Zt on {AjO} =1= 0}.
For the second inequality, which is the one we will actually use, fix y > 0;
a particular choice will be made at the end of the proof. Here is the idea of the
proof: Pivotal intervals with any of the following properties are easy to handle:
(i) Those of length at least y, since the total length of such intervals is at least
y x their number, and hence their number is at most y -I Yt .
(ii) Those that end at time t, since there is at most one such pivotal interval.
(iii)Those that end at a pivotal arrow, since the number of such pivotal intervals
is at most X t •
This explains the three summands that appear on the right of (2.44). So, it
will be enough to consider pivotal intervals that have none of the above three
properties, and show that the expected number of them is at most a constant
multiple of the expected number of pivotal intervals that do satisfy one of these
three properties.
In order to count pivotal intervals, it is useful to do some discretization. Choose
an E > 0, which will eventually be taken to approach zero. For fixed x E Zd and
integer k ::: 1 let F be the event (defined on the graphical structure of Poisson
processes) that there is a pivotal interval that A;O} =1= 0 and
(a) contains the point (x, kE),
(b) does not contain the point (x, (k - l)E),
(c) is of length less than y,
(d) ends strictly before time t, and
(e) does not end at a pivotal arrow.
For W E F, we will define a new configuration Twas follows. Since all points
in the graphical representation that we will consider here lie on the time line
{x} x [0, t], and all arrows will begin on this interval, we will omit the coordinate
x from the notation. So, for w E F, let [a, b] be the pivotal interval that begins
64 Part I. Contact Processes
between (k -1)E and kE, so that (k -1)E < a < kE, and kE < b < min(kE + y, t).
In what follows, we will assume for simplicity that kE + y < t; otherwise, simply
replace kE + y by t. Let c > b be the last point
such that there is an active path from it to Zd x {t}. Let T w be the configuration
obtained from w by removing all infection arrows in (kE, kE + y) except the one
at c (if c < kE + y.)
With this construction, T w has a pivotal interval containing [a, b) that either
ends in a pivotal arrow, or has length> y. To see this, consider two cases:
1. c > kE + y. Then the pivotal interval for T w contains kE + y, and hence is
of length > y.
2. c < kE + y. Then [a, c) is a pivotal interval for T w, and the infection arrow
at c is pivotal.
This is easiest to see by drawing some pictures, which is left to the reader.
Note that Tw rf. F, so at least one interval was deleted. Elementary properties
of Poisson processes imply that the Radon-Nykodym derivative of PoT-I with
respect to P satisfies
EJ..(ZtIA;O) =1= 0) ::: 1+ e2d J..y EJ..(XtIA!O) =1= 0) + y-I e2dJ..y EJ..(YtIA)O) =1= 0).
Now put y = 1/(2dA) to get (2.44).
We come now to the final ingredient in the proof of (2.38). Since the collection
of pivotal arrows and intervals make up the intersection of all active paths from
(0,0) to Zd X {t} in the graphical representation, they are, in particular, a subset
of any active path. Therefore, the projections of the pivotal intervals onto the time
line [0, t] are disjoint. Label these projections ([Pi, a;], 1 ::: i ::: Zt) in increasing
2. The Contact Process on the Integer Lattice Zd 65
order. For i > Zt. set Pi = ai = t. Let T be the extinction time for the contact
process starting at {O}: P(T > s) = p(A1°} =1= 0).
where TI, T2, ... are independent random variables with the distribution of T.
Proof Let Xi E Zd be the spatial coordinate of the points in the ith pivotal interval.
Note that XI = O. Every pivotal interval must end in an arrow; let Yi be the spatial
coordinate of the endpoint of the arrow that begins at (Xi, ai ).
Fix k ::: 1, and let G be the union of all active paths in the graphical represen-
tation up to time t that start at (0, 0) and do not have (Xb ak) as an interior point,
together with the arrow from (Xb ak) to (Yb ak). Note that PI, ... , Pk. 0'1, ... ,ak,
XI, ... , Xk and YI, ... , Yk are all measurable with respect to G. For s > 0,
P(A;O} =1= 0, PHI - ak > slG) :s P(there are disjoint active paths from (Xb ak)
to Zd x {ak + s} and from (Yb ak) to Zd x {t}, not passing through GIG).
To see this, condition on G, and argue as follows:
1. If AlO} =1= 0, then there must be an active path from (Xb ad to Zd x {t},
since every active path from (0,0) to Zd X {t} must pass through the kth pivotal
interval. There must be one such path that proceeds from (Xb ak) through the
arrow that begins there, rather than through the time line above (Xb ak), because
of the maximality of the kth pivotal interval.
2. If also PHI - ak > s, then there is no pivotal interval with time coordinate
in (ab ak + s), and this forces the existence of a disjoint active path from (Xb ak)
that starts up the time line above that point. If such an active path had to intersect
the path in point 1 above, then this forced intersection would constitute part of a
pivotal interval.
By Theorem B21 (applied to a discretization of the graphical representation
conditional on G), the right side above is at most
Proposition 2.46.
Proof To deduce (2.38) with these choices for C] and C2, write
if for a fixed k, L~=] (PH] - ai) :::: t - k, then either Zt ::: k, or Zt < k and
Yt > k. In other words
Combining the last two inequalities, setting N = min{n : T] + ... + Tn > t}, and
summing on k ::: 1 gives
EN = E "N
L....i=]
T"
I > t
t
.
ET] - 1 + Jo f(A, s)ds
This proves (2.47).
2. The Contact Process on the Integer Lattice Zd 67
Finally, we show that (2.38) implies exponential decay in the subcritical case.
(2.49) a(l
a
+ e + 2dae)-10gg(a, t) :::: t
t
- 2.
aa 0'+ fo g(a, s)ds
g(a2, t) ( t )
a2(l+e+ 2da2e )10g ::::(0'2-0'1) t -2 ,
g(al, t) 0'2 + fo g(a2, s)ds
or equivalently,
1 00
g(0'2, s)ds < 00
implies that g(a\, t) decays exponentially in t for 0'1 < 0'2, since then the integral
in the denominator in (2.50) remains bounded as t -+ 00.
2. It is also clear from (2.50) that if
C
(2.51 ) g(a2, s) :s 8s for s > 0,
1 00
g(a\, s)ds < 00
68 Part I. Contact Processes
for CTI < CT2. To check this, replace g(CT2' t) by Ct- 8 and g(CT2' s) by Cs- 8 (for
s, 1 2': 1, say). The result is that
for two constants C I and C2, and this is integrable for t 2': 1.
Combining these two observations, we see that it suffices to prove that for every
CT2 < Ac , (2.51) holds for some 8 > O.
To do so, take CTO > 0, 10 > 0, and define CTk, tk recursively by
tk
tk+1 = g(CTk, tk) .
Since 0 < g(CT, t) < 1 for CT > 0, t > 0, CTk t and tk t. There is a potential
problem that some CTk may become negative. If that happens, set that CTk and all
successive ones = 0, and set the tk'S that would then be undefined = 00. Suppose
CTk+1 > O. Apply (2.50) with CTI ~ CTk+I, CT2 ~ CTk. t ~ tk+l, and then use the
recursion and monotonicity of g to make the substitutions
(tHI
10 g(CTk, s)ds ~ tk + 1k+lg(CTk, tk),
CTk+1 - CTk ~ g(CTk, tk) 10gg(CTk, tk),
tk
tk+1 ~ ,
g(CTk, td
g(CTk, tk+l) ~ g(CTk, td
(2.52)
Since
g(O+, t) = lim/(CT, CT-I/) = 0
a.j,O
for t > 0, inequality (2.52) holds trivially if CTk > 0, even if CTk+1 = O. (In that
case, 1k+1 < 00.) Define
(2.53)
and iterating,
2. The Contact Process on the Integer Lattice Zd 69
Note that by making g(ao, to) small, we can make SUPk lak - aol small, and in
particular, ak > 0 for all k.
To summarize, if 0 < a < ao < Ac , we can choose a to so large that y(ao, to) >
o and g(ao, to) < e- I , and then take to even larger so that ak > a for all k. Now,
by (2.53) and the tk recursion,
to]Y
g(ak, td:S [ - = Y
it ,
tk [g(ak, tk)tHI]
where the equality comes from the tk recursion again. Simplifying this gives
which is (2.51).
This gives the statement of the the theorem with an extra constant on the right
side of the inequality. To remove it, use the fact that
Recall that a(Ac) = 0 by Theorem 2.25(b). To say that a(A) has a critical exponent
of y is to say that in some sense,
as A t Ac for some constant C. There are various forms that this statement can
take:
a(A)
C[ < < C2 for Ac < A < Ac + 1, and
- (A - Ac)Y -
. 10ga(A)
hm =y,
A,I).< log (A - Ac)
for example. The following result implies that if the critical exponent y for survival
exists in any of these senses, then y S 1. Calculating y rigorously is probably
hopeless, but it would be interesting to improve the next result to show at least
. a(A)
hm--=oo.
A - Ac
A-J,Ac
a
tg(a, t)
rl
+ Jo g(a, s)ds
da - 2(A - Ac)g(A, t).
Since
a(a) = 1--+00
lim g(a, t),
and therefore
·
11m tg(a, t) _ I. g(a, t)
1m = 1,
'f + +fo g(a, s)ds
1 - 1
Hoo a + fo g(a, s)ds HOO
Recalling that a(Ac) = 0 and discarding the term 2Aca(A) leads to the statement
of the theorem.
At first glance, one might object to the developments so far on the following
grounds:
(a) In the real world, all systems are finite.
(b) The contact process on a finite set always dies out, so it has no critical behavior.
(c) The main interest and challenge of the study of the contact process on Zd
comes precisely from the fact that it does exhibit critical behavior.
In view of these facts, how can the contact process on Zd be considered to be a
relevant model for real world phenomena?
To answer this question, it is necessary to remember that extinction is a t = 00
characteristic, and that in the real world, we are interested in large but finite times.
So the question becomes: Is it the case that infinite models observed over the
entire time axis capture important features of large finite systems at large finite
times? This section provides some answers to this question.
Let A L be the contact process on {l, ... , N}d with initial configuration A.
This is simply the contact process on Zd, modified so that no infections are allowed
off {I, ... , N}d. When no initial configuration A is specified, it will be taken to
be A = {l, ... , N}d. Since the contact process on {I, ... ,N}d is a finite state
Markov chain that is irreducible, except for having a single absorbing state 0, it
will eventually be absorbed at 0. Let
in one dimension, the proofs are significantly more difficult, and even in one di-
mension, the results that have been proved are not complete. See the discussion
in Section 5 for more on this.
if c < d
if c > d,
and so
TN
(3.1) ---+d
10gN
in probability, as N -+ 00. The main result in this subsection is that (3.1) holds
(with a limit depending on A) for 0 < A < Ac as well.
The first step is to identify the quantity (defined in terms of the unrestricted
process on Zd) that will turn out to be the limit. Recall that by the Markov property
and monotonicity,
exists, and
(3.2)
By Theorem 2.48, Y_(A) > 0 for A < Ac. Also, Y_(A) is decreasing in A, and
y_(O) = 1, since if A = 0, P(A)O) =1= 0) = e- I • Here is the general version of
(3.1):
TN d
---+ - -
10gN Y_(A)
in probability, as N -+ 00.
Proof The graphical representation provides a coupling between the contact pro-
cesses At,1 on {l, ... , N}d and A~ on Zd with the property that At,! c A~ for
all A c {I, ... , N}d and all t ::: O. Therefore, using additivity (1.2),
3. The Contact Process on {I, ... , N} d 73
+ n
XESN,k
p(Alx} ex + (-k, k)d for all s > 0, A)x) = 0).
The last inequality comes from the fact that the events
depend on disjoint parts of the percolation structure, and hence are independent.
Combining Theorems 2.34 and 2.48, we see that there are constants band E > 0
so that
(3.6)
Now take c < a < dlY_(A.). By the definition of y_(A.) above, b can be taken
larger so that for t 2: b,
(3.7)
Therefore, continuing with (3.5), using (3.6) and (3.7) and setting t = clogN, we
see that for clog N 2: b,
#SN,k ~ (~r
For example, kN = (log N)2 works.
If n, k, N are integers such that nk ::: N, then {l, ... ,N}d is contained in the
union of k d disjoint translates of {I, ... ,n}d. By additivity (1.2) and positive
correlations (see Corollary B 18 and the discussion following the statement of
Theorem B21),
kd
a(N) ::: [a(n)] ,
so
[a(N)t Nd ::: [a(n)fk/N)d.
Therefore,
. 10ga(N)
(3.8) Y+(A) =- hm
N-,>oo N
d
The right side is simply the probability that all N d sites recover before they have
a chance to infect any neighbor.
The next result is somewhat unsatisfactory, in that it does not identify the
limit in probability of (lOgTN)/N d (nor show the limit exists), but it does at least
determine the rate of growth of TN. For more precise results, see the discussion
in Section 5.
(3.10) . P (lOg
hm TN )
--d-::: Y = O.
N-,>oo N
3. The Contact Process on {I, ... , N}d 75
(3.11) lim
N-+oo
p(IOg~N
N
:::: 8) = O.
Proof For the first statement, take 8 so that 8E > Y+(A), where E is the number
appearing in Theorem 2.30(a). Then by that result,
(l •... •N}d - 0)
Iog P (A Nde
. -
(3.12) Y+(A) =- hm
N-+oo N
d .
then As is the set of x E Zd such that there is an active path in the graphical
representation from (y, kN d8) to (x, s) for some y E {I, ... , N}d. Since A N.t C
At for every time that is a multiple of N d 8 by definition (until the extinction time
of At), it follows that A N . t C At for all t. Since disjoint parts of the graphical
representation are independent, we conclude that
To complete the proof of (3.10), use (3.12) and (3.13), choosing k = kN so that
Figure 2
BkI C [k-2-,00.
+2 ]
We intentionally have not specified yet whether the B's are to be site or bond
processes. We are using (3.14) for the site processes, but will prove it for the bond
processes, since this is a bit more convenient. Since the site and bond processes can
be compared in either direction, provided the parameter p is adjusted appropriately,
and since we only need (3.14) for p sufficiently close to 1, (3.14) holds for the
site process if and only if it holds for the bond process (for different values of p).
So, from now on we take the B's to be bond versions of the processes. From
the traditional (i.e., graphical) description of the process, it is easy to see that
(3.15) p{l, ... ,nl(B~ = 0) = P{l,2, ... 1( B~ n {[k: 3]. ... , 1] +n} = 0).
[k:
where [.] is the greatest integer function. In fact, (3.15) is the analogue of contact
process duality. Let
h = min {i :i E B~ }
be the left edge of the semi-infinite processes. With this notation, (3.15) can be
rewritten as
(3.16) p{I, ... ,nJ(B~ = 0) = P{I,2, ... 1(h > [k; 1] +n).
Next, we will use the fact that there is an E > 0 and a C so that
The analogous statement for the process Bk is Theorem B24(c), and the proof for
B~ is similar. Combining (3.16) and (3.17) gives
78 Part I. Contact Processes
(3.18) pl l ,2, ... l(l j > [i ~ 1] +n for some i.:::: k) .:::: Cke-En, n:::: 1.
Now consider the reflected version of B£: Br is the process Bk with the
restriction that at time k,
Bk/I c [
1, N k+l] ,
+ -2-
Bn
and
rk = max {i :i E
(3.19)
PI ... ,N-I,Nl(rj < [i: 1] + N _ n for some i .: : k) .:::: Cke-En,
n::::1.
By checking each possible transition, one sees that if initially, BN.O = {I, ... ,N},
Bo = {l, 2, ... } and B~ = {... ,N - 1, N}, then on the event {lj < rj for all i .: :
k},
and therefore
P(TN .:::: k) .:::: P(lj > rj for some i .: : k) .:::: 2Cke- EN /2 ,
where for the last inequality, we have used (3.18) and (3.19) with n = N 12 (taking
N to be even for simplicity). Therefore, (3.14) holds for any /) < E/2.
Let Td be the homogeneous connected tree in which each vertex has d + 1 neigh-
bors. It is often useful to think of Td as a branching tree in which each vertex
has one parent and d children, and then it is natural to say that y is a descen-
dent of x if y is a child of a child ... of a child of x. More formally, define a
function I (x) from Td to Z I so that for each x, I (y) = I (x) - 1 for exactly one
neighbor y of x, and ley) = lex) + 1 for the other d neighbors y of x. Thus lex)
can be thought of as the generation number of x, and y is a descendent of x if
I(y) -lex) = Iy - xl :::: 1. Take {en, -00 < n < oo} in Td such that l(e n) = n
and len - en+11 = 1 and write e = eo. This provides an embedding of Zl in Td .
In Section 2, we saw that weak survival does not occur for the contact process
on Zd. Our first result in this section shows that the situation really is quite
different on Td - weak survival does occur. It is this fact that has led to much of
the interest in contact processes on trees, and is the primary justification for this
section. The occurrence of weak survival raises an entirely new set of questions
concerning the behavior of the process in the intermediate phase Al < A < A2.
Throughout Section 4, we will assume that d :::: 2, since TI = Zl, and this case is
covered by the results of Section 2.
4. The Process on the Homogeneous Tree Td 79
Theorem 4.1.
1
(a) Al < - - .
- d-l
(b)
Proof The final statement is an immediate consequence of the bounds in (a) and
(b). To prove (a) take 0 < P < 1 and define a function vp on the finite subsets of
Td by
Compute
d A vp(At)
-E I 0
(4.2) dt . t=
For any finite subset A of Td, the number of edges incident to points in A (counted
with multiplicity) is (d + 1)IAI. There are at most IAI - 1 edges that join two
vertices in A. Therefore if A =1= 0,
Using this bound in (4.2), we see that if pA(d - 1) :::: 1, then for nonempty A,
and compute
80 Part I. Contact Processes
(4.4)
:t EAwp{At)lt=o =~ [(A IY~=I /(Yl) - pl(Xl]
yj"A
Choosing
1
and A = 2..(J'
we see that the right side of (4.4) is zero, and hence that M t = wp{At) is a (pos-
itive) supermartingale. Therefore, M t converges a.s. On the event {x E At i.o.},
M t has to change by at least pl(xl i.o. Therefore pA{X E At i.o.) = 0 for each x
and A, and it follows that for this value of A, At does not survive strongly.
Pemantle (1992) proved Theorem 4.1, and then went on to improve the bounds
enough to conclude that Al < A2 for d > 2. Liggett (1996a) further improved the
bounds for d = 2. Here are their results:
(4.5)
The fact that there is an equality here instead of the inequality in (4.4) is a reflection
of the independence of the offspring of different parents. Because of the equality,
(4.5) can be solved explicitly to obtain
(4.6)
x x
where
Note that
and 1Jr achieves its minimum at p = l/,Jd. When we discuss the analogous
function (called 11) for the contact process shortly, we will find that it shares these
properties, though it cannot be computed explicitly.
It should be clear what strong and weak survival mean in this context, and how
the critical values A1 and A2 are defined. In order to relate them to the function 1Jr,
it is useful to consider also a sequence v(n) that is defined as follows: Abbreviate
the configuration that consists of a single particle at x by x itself, and for n ~ 0,
let
and
v(n) = peCan < (0).
Thus v(n) is the probability that an infection starting at e ever reaches e- n . Note
that v(n) is nonincreasing in n, since the infection can reach e-(n+l) only if it
reaches en first. It is left continuous in A, since it is a supremum of increasing
continuous functions of A:
For the continuity in A of the functions that appear in the supremum above, recall
the discussion surrounding (1.20).
The next result relates survival and strong survival to properties of the function
1Jr and the sequence v(n).
. v(n+l) 1-v'1-4dA2 1
lim v(n) = 0,
n~oo
and f3 = n~oo
hm
v(n)
= 2dA
< -
- Jd
Remark. Aside from the fact that properties of 1{! lead to the explicit computation
of Al and A2, the most interesting consequence of this theorem comes from (e)
and (t). Above A2, v(n) does not tend to zero. Below A2 it tends to zero at an
exponential rate :s ..let. So, there is an interval of parameters of exponential decay
that cannot be attained in this problem.
Proof of Theorem 4.8. Parts (c) and (d) follow easily from parts (a) and (b) re-
spectively, since (4.7) is so explicit. For part (a), it suffices to note that
I)t(X)
x
is an ordinary Galton-Watson branching process. Since 1{! (I) is its exponential rate
of growth by (4.6), the branching process is supercritical if 1{! (1) > I, critical if
1{!(l) = I, and subcritical if 1{!(l) < 1. (See Theorem B55.)
Turning to part (b), note first that (4.6) implies that
M _ Lx l;t(x)/(x)
t- [1{!(p»)I
L l;t(x)/(x)
x
~ ° a.s.
as t ~ 00, so l;t does not survive strongly. If 1{! (p) = I, then the limit of
4. The Process on the Homogeneous Tree Td 83
(4.9)
exists a.s. On the event {~t(x) ~ 1 i.o.}, (4.9) changes by at least pl(x) i.o., so it
follows that ~t does not survive strongly in this case either. Since Vr attains its
minimum at 1/-Jd, the result so far can be restated as follows: If Vr(1/-Jd) :::: 1,
then ~t does not survive strongly.
For the converse, use the strong Markov property, monotonicity and spatial
homogeneity to write
(4.10)
The events whose probabilities appear on the right of (4.10) are decreasing in n.
On the intersection of these events, ~t(e) ~ 1 for an unbounded sequence of times
t. Therefore, if ~t does not survive strongly, then the right side of (4.10) tends to
zero as n ---* 00, and hence
Take n ~ 1, and run the process until the first transition occurs. Using the strong
Markov property at that time and the independence of the offspring of different
parents, one gets
. 1
(4.13) hm v(n) = 1 - or O.
n-->OO A(d + 1)
Suppose the limit is zero, which by (4.11) is true if ~t does not survive strongly.
Then given E > 0 there is an N so that the left side of (4.12) is at most (1 +E)v(n)
for n ~ N. Using the arithmetic-geometric mean inequality on the right side of
(4.12) gives
v(n - 1) v(n + 1)
and
v(n) v(n)
are uniformly bounded. Therefore, taking the mth root, then letting m -+ (Xl in
(4.14), and finally letting E {, 0, we see that 2A,Jd s 1. But this means that
1/!(~) = e2Jcv'd-1 S 1,
as required to complete the proof of (b). Note that since we only used (4.11) in
this argument, we have also proved (e) and the first part of (t).
To prove the second part of (t), we need to look at the ratios
v(n +1)
f3n = v(n)
This function is increasing and concave in p, and (4.12) can be written in the form
e ± ";e 2 - 4dA 2
P±(e) = .
2dA
If c = 1, these are exactly the solutions of 1fr (p) = 1. The smaller fixed point is
unstable, while the larger one is stable. In other words, the k-fold iterate I?) of
II satisfies
lim I?\p) = p+(l), p > p-(l).
k->oo
(4.15)
and hence
(4.16)
4. The Process on the Homogeneous Tree Td 85
p + (c)
In... < A2, p-(1) < p+(1), so that (4.15) and (4.16) are incompatible. Therefore
we conclude that
I-JI-4dA2
(4.17) f3n .:::: p-(1) = 2dA
for all n. Since f3n is a left continuous function of A, it follows that (4.17) holds
for A = A2 as well. To get a lower bound for f3n, we argue in a similar way. Fix
c> I, and take n so large that Cn < c. Then
(4.18)
If f3n < p_(c), then the right side of (4.18) becomes negative for some k, which
is impossible. Therefore,
Since p_(c) is continuous at c = I, (4.17) and (4.19) combine to give part (f) of
the theorem.
Here are some properties of 1/1 and f3 that are immediate consequences of
Theorem 4.8, and that should be kept in mind as we develop analogous properties
for the contact process:
(a) f3 is a strictly increasing function of A for A .:::: A2.
(b) 1/1 (f3) = I for A .:::: A2·
(c) If A = A\ then f3 = I/d, while if A = A2 then f3 = I/-Jd.
86 Part I. Contact Processes
(4.20)
where es is the shift by time s of the Poisson processes used in the graphical
representation that was described in Section 1. Therefore for p > 0,
The inequality above comes from the fact that the union in (4.20) is not neces-
sarily disjoint. Taking conditional expectations with respect to 31f, the a-algebra
generated by the process up to time s, we see that
(4.22)
4. The Process on the Homogeneous Tree Td 87
(4.23)
(4.24)
by Theorem B22. Since the contact process At and the branching random walk
can be coupled together so that x E At implies /;t (x) ::: 1, it follows that
Next we will show that </> has many of the qualitative properties of 1jr - the
main difference is that </> cannot be computed explicitly. We need these properties
so that we can use </> in the analysis of the contact process much as we used 1jr in
the analysis of the branching random walk. We start with some easy combinatorial
facts.
Lemma 4.26. (a) Let an,k be the number of x E Td such that Ix - el = nand
lex) = n - 2k. Then
ifk = 0,
if 1 :s k :s n - 1,
ifk = n.
(b) Let
an(p) = L /(x),
Ix-el=n
for n ::: 1.
Proof The cases k = 0 and k = n of part (a) are immediate. For the other cases,
take x E Td such that Ix - el = n, and let k ::: 0 be the largest index so that e-k is
on the shortest path joining e and x. Then Ix - e_kl = n - k, and lex) = n - 2k.
Therefore, an,k is the number of x E Td so that e_k is on the geodesic joining e
and x, but e_k-l is not on it. In traversing such a geodesic from e to x, there is
one choice of edge at each step until reaching e-b d - 1 choices at the next step,
and d choices at the remaining n - k - 1 steps.
88 Part I. Contact Processes
For part (b), take n ::: 1 and dp2 =1= 1, and use part (a) to write
n
an(p) = L /(x) = Lan,kpn-2k
Ix-el=n k=O
n-l
= (dp)n + (d _l)dn-1pn L(dp 2)-k + p-n
k=l
(dp)n[dp2 - 1] + (d - l)d n- 1pn[1 - (dp2)-n+l] + p-n[dp2 - 1]
dp2 - 1
Simplifying gives the required result. The result for dp2 = 1 follows by using
L'Hopital's rule.
Since
00
the first statement in part (a) is immediate, and the second follows from it by the
definition of 4> in (4.23).
For (b), note that the left inequality is just (4.24). By part (a), it is enough to
prove the right inequality when p > 1/ Jd, which we now assume. The inequality
in (4.22) (which led to (4.24)) came from the additivity property. The idea is to
show that except for a constant factor, the opposite inequality holds in (4.22)
because there is a substantial amount of disjointness in the union in (4.20).
For a finite set A C Td, let {Bx, x E A} be the subsets of Td that are defined
as follows: Bx is the set of descendents of x whose closest predecessor in A is x
itself. These sets are disjoint, so by additivity (1.2),
(4.29)
EAwp(At) = EW p( UxEA An::: EW p( UxEA (A; n Bx)) = E L wp(A; n Bx).
XEA
4. The Process on the Homogeneous Tree Td 89
To find a lower bound for the right side of (4.29), we will need the following
inequality:
(4.30) dp L pl(x)::: (dp - l)d n L pl(x).
xEA.YEBx XEA
Ix-YI=n
To write a similar expression for the left sides, note that B~ C Bx for all x E A,
and Y E Bx \B~ if and only if x' E Bx and y is a descendent of x'. In particular,
there is at most one x E A such that B~ =1= Bx. If there is such an x, then
L pl(x) = /(x)dn-l(x'l+l(x).
yEB, \B~
Ix-yl=n
So
LH S' = LH S + d n+1 pl(x'l+l _ dn-l(x'l+l(xl+l pl(x)+l,
where the last term appears only if there is an x E A such that B~ =1= Bx. Therefore,
to complete the induction step in the proof of (4.30), we need
dp - (dp)l(x)-l(x')+l ::: dp - l.
dp-l ~
= --wp(A) ~ peen E At)(dpt.
dp n=O
90 Part I. Contact Processes
where an(p) is defined in the statement of Lemma 4.26. Since dp2 > 1, that lemma
implies that an (p) is asymptotic to a constant multiple of (dp)n as n -+ 00, so
that by (4.29), (4.31) and (4.32), there is a constant C(p) so that
[Ewp(At)f::: C(pt-IEwp(A nt ).
The right hand inequality in part (b) of the proposition now follows by taking nth
roots and passing to the limit, recalling the definition of ¢ in (4.23).
Proof Part (a) comes from (4.23) and (4.28), together with the fact that an(p) is
increasing in p for p ::: 1/..,(J and peen E At) is increasing in A. The monotonicity
of an (p) is easiest to see by pairing up the summands in the expression for it given
in the proof of part (b) of Lemma 4.26, and rewriting the sum of pairs as
x x
4. The Process on the Homogeneous Tree Td 91
where we have used the Schwarz inequality. The first factor on the right is finite
by comparison with the branching random walk process. To bound the second,
use the Schwarz inequality again to get
EIA;'\A;I J
= E(IA;'\A;I, A;' =1= A;) S JEIA;'1 2 P(A;' =1= A;).
Now use (1.19) to bound the first factor above, and both (1.19) and (1.20) to
show that the second factor is small if A" - A' is small. This argument shows
that E W P (At) is continuous in A, uniformly for (A, p) in compact subsets of
[0,00) x (0,00). It is easy to check the analogous statement for continuity in p,
using the inequality
where the + and - denote the right and left hand limits of ¢. But strict inequality
is ruled out by the upper semi continuity that holds for all p, including l/./J.
the graphical representation by using only the recovery symbols in See]) U {e} and
only the infection arrows that join vertices in S (ed U {e}.
Proof Since At C At, one inequality is clear from (4.23). To prove the other
inequality, note that if y E At n See]), then there must be an infection arrow from
e to e] at some time r < t such that there is an active path from (e], r) to (y, t).
So, since infection arrows occur at rate )...,
(4.36)
By the spatial homogeneity of At and the fact that an (p) is asymptotic to a constant
multiple of (dp)n for p > 1/ y'd, there is a constant C so that
00
(4.38)
Combining (4.24), (4.36), (4.37) and (4.38) gives the following inequality, where
C' = Cd)",
r
P(A] = fed)
would imply that for some I < a < ¢ (p) and all s beyond some point,
4. The Process on the Homogeneous Tree Td 93
Remark. It is not known whether (4.40) is true for the critical contact process on
Zd. It is thought to be false.
Proof We need only prove that cp(l) = 1, since then (4.40) follows from Proposi-
tion 4.27(b). (Recall that WI (A) = IAI.) That (4.40) implies extinction is a standard
Markov chain fact, which follows from
inf
(A:IAI=n)
peAt = 0 for some t) > O.
See (2.5), where the corresponding fact was proved on Zd.
If A > AI, then At survives, and hence (by the argument just given)
Therefore cp(l) > 1 by Proposition 4.27(b). So, Proposition 4.33(b) implies that
cp(l) 2: 1 for A = AI.
For the opposite inequality, we will use Lemma 4.34. For any finite A C Td ,
define its frontier F(A) to be the set of points x E A for which at least one of its
children - call it x' - has Sex') n A = 0. Let A' = the set of x' such that x' is the
child of some x E A and Sex') n A = 0. Since every point in A has d children,
and points in A\F(A) have no children in A',
We will check
It follows that if (4.42) holds for B, it holds for A. This is the induction step.
Combining (4.41) with (4.42) gives
94 Part I. Contact Processes
d-I
(4.43) IF(A)I::: -d-1A1.
limsupEIAtl = 00.
t-+oo
By (4.43),
lim sup EIF(At)1 = 00
t-+oo
as well. Choose a t so that EIF(At)1 > 1. Then construct a discrete time process
Bn in the following way: Bo = {e} and Bl = F(At ) for that t. In general, Bn+l
is defined by applying the construction that led from Bo to Bl to each of the
points x E Bn (using the graphical recovery symbols and infection arrows for the
time period [nt, (n + I)t], falling in Sex') U {x}, where x' is a child of x with no
descendents in Bn) and then taking the union of the resulting sets. Then IBn I is a
supercritical branching process that satisfies
Bn CAnt a.s.
Therefore At survives, from which it follows that A ::: A1. So, we have shown that
A < Al implies cp(l) ~ 1. By Proposition 4.33(b), it follows that cp(l) ~ 1 for
A = Al as well.
Proposition 4.44. (a) Suppose that At does not survive strongly. If 1j.j(j ~ PI <
P2 and CP(P2) ::: 1, then CP(Pl) < CP(P2).
(b) If cP (p) < 1 for some P > 0, then At does not survive strongly.
Proof It is enough to prove (a) for PI > Ij.j(j because of Proposition 4.33(a).
By Lemma 4.26(b),
lim a n (Pl) = 0.
an (P2)
n-+oo
an (Pl)
--<E
an (P2) -
for n ::: N. Applying this, together with (4.28) for both PI and P2 gives
(4.45)
EWpJAt) <
E
+ L:=oan(pj)P(en EAt) .
EW p2 (A t ) - EW p2 (A t )
4. The Process on the Homogeneous Tree Td 95
Since At does not survive strongly, the numerator of the second tenn on the right
side of (4.45) tends to zero as t -7 00. By (4.24) and our assumption,
EW p2 (A t ) 2: [<I>(p2)Y 2: 1.
Therefore,
By Proposition 4.27(b),
Mn = wp(Ant)
[Ewp(At)f
lim wp(Ant)
n---+oo
=0 a.s.
Therefore, P(x E Ant i.o.) = 0 for each x. Every time x E Ant, it remains in the
infected set for an exponential time with parameter 1. So, the process does not
survive strongly.
We come now to the main result in the first part of this section - the one that
is the primary justification for the study of the contact process on Td • It is a very
simple consequence of the developments up to this point.
Proof If A = A\, then <1>0) = 1 and At dies out by Proposition 4.39. Therefore
<I>(p) < 1 for I/Jd S P < 1 by Proposition 4.44(a), applied to PI = P and
P2 = 1. Fix such a p. By Proposition 4.33(b), there is a A> AI so that <I>(p) < 1
for this A as well. But Proposition 4.44(b) implies that At does not survive strongly
for this A, so A S A2. Therefore, A2 > A\.
This is probably a good time to see how well we have done so far in proving
analogues of parts (a) and (b) of Theorem 4.8 for the contact process. Recall that
those statements for branching random walk are:
(a) ~t survives if and only if 1/r0) > 1, and
(b) ~t survives strongly if and only if 1/r (1 / Jd) > 1.
So far we have a complete analogue of (a), but only a weaker fonn of one direction
of (b):
96 Part I. Contact Processes
(a') '7t survives if and only if ¢ (1) > 1, by Propositions 4.27(b) and Proposition
4.39, and
(h') ¢ (1 /,Jd) < 1 implies that '7t does not survive strongly, by Proposition
4.44(b).
Our objective is to prove as much of the analogue of Theorem 4.8 in the contact
process context as we can. It turns out that we will be able to prove somewhat
weaker versions of essentially everything, except for statements that involve ex-
plicit formulas. Later we will use these results in a number of ways, including
a proof of the complete convergence theorem above A2, and a construction of
nontrivial invariant measures in the intermediate phase A\ < A < A2. Another
interesting fact that will emerge is that u (n) is discontinuous as a function of A at
A2 for every n ~ 1 - see Theorem 4.65(f). The analogous statement for branching
random walks (at least for large n) follows from Theorem 4.8 (see also (4.16)),
which implies that
2,Jd 1
v(n) > 1 - - - n>1 A > - -
- d+ l' -, 2,Jd
1
v(n) ::s d- nj2 , n ::: 1, A ::s 2,Jd.
We start by obtaining some inequalities that will lead to the existence of an
exponential decay rate for u(n). In order for At to reach en +m it must first reach
en. Letting
r = inf{t > 0 : en E Ad,
one can use monotonicity and the strong Markov property to show that
(4.49)
by the discrete version of Theorem B22. Note that (4.48) can be regarded as a
Cesaro version of the convergence statement in Theorem 4.8(f).
We will begin to make the connection between ¢ and f3 by proving two
inequalities. It turns out that both are in fact equalities (the first one for A < A2 -
see Theorem 4.83 and Corollary 4.78), but the proofs of the reverse inequalities
are harder, and will be deferred until we develop some more machinery. Often we
will show explicitly the dependence of ¢(A, p) = ¢(p) on A as well as on p.
Proof For part (a), suppose that A, p > 0 satisfy ¢(A, p) < 1. Then
Letting p approach f3(A) and using Proposition 4.33(b) gives part (a).
Part (b) also follows by this argument. To see this, recall that Proposition 4.39
implies that ¢(A\, 1) = 1, and Propositions 4.27(a) and 4.44(a) then imply that
¢(A\, p) < 1 for ~ < p < 1. By (4.52), f3(A\) S ~.
Lemma 4.34 said that the process At C At is equivalent to At. at least in the
sense of the exponential rate of growth of EWp(A t ), and therefore the function
¢ could be defined equivalently in terms of either process. The next result is a
similar statement relative to the definition of f3. There are two ways in which
quantities are made smaller below, and both tum out to be inconsequential in
terms of exponential growth rates: The t is brought out of the probability, and At
is replaced by At in the definition of u(n). Let B(x, n) = {y E Td : Iy - xl S n}
be the ball centered at x of radius n, and as usual, B(n) = B(e, n).
98 Part 1. Contact Processes
lim [suPP(e n E
n-:H)O t
At)]~ = f3(A).
For the other inequality, we argue as follows: Let vk(m) be the probability that
there is an active path in the graphical representation from (e,O) to (em, t) for
some t ::: k that remains inside B(k). Then
since the union of the events whose probabilities appear on the left is the event
whose probability is u(m). Now take positive integers j, k, m, n satisfying
(4.55) jm +k::: n.
Let Xo = en-jm,XI = en-(j-I)m,'" ,Xj = en. Suppose that there are times 0 <
TO < TI < ... < Tj so that Ti+1 is a stopping time relative to the post Ti collection
of Poisson processes in the graphical representation and TO ::: 1, Ti +I - Ti ::: k for
o ::: i < j, and there are active paths
Then en E At for some t ::: 1 + jk ::: nk. Applying the strong Markov property
to the Poisson processes in the graphical representation and spatial homogeneity
gives
peen E At for some t ::: nk) ::: P(en-jm E At for some t ::: 1)[vk(m) t
On the other hand
peen E At for some t ::: nk) ::: nk max peen E At for some t E [i, i + I)).
O:::i<nk
-I
(4.56) sup peen EAt)::: =--k P(en-jrn E At for some t :::: l)[vk(m)t
t n
Take nth roots and then for fixed k, m, let n -+ 00, with j = [n;;;k] so that (4.55)
is satisfied throughout. Note that with this choice, n - mj takes the finitely many
values k, ... , k+m, so that the probability on the right of (4.56) takes only finitely
many (positive) values as n, j -+ 00 in this way. Therefore
I
liminf[suPP(en
n..-....+oo t
EAt)]"::: [vk(m)]~.
Now let k -+ 00 and then m -+ 00 and use (4.48) and (4.54) to complete the
proof.
Proposition 4.57. If
I
(4.58) f3(A.) > ..{J'
then
Proof Suppose that (4.58) holds. It will be slightly more convenient here to re-
define At so that it is constructed using all the Poisson processes in See), instead
of only those in S(el) U {e}. This is even larger than the process used in Lemma
4.53, so its statement holds for this redefined process also. It is not hard to check
that the conclusion (4.59) holds for the modified At if and only if it holds for the
original one. By Lemma 4.53, there exist a > ./ct, n :::: 1 and t > 0 (that we now
fix) so that
(4.60)
Bj C A jt .
The basic limit theorem for supercritical branching processes (Theorem B55(c»
says that
.
11m IBjl
.
j-+oo (d n a n )1
exists a.s. and is not identically zero. Therefore, there is an E > 0 so that
(4.61)
ri = pee E A 2ijt ).
Then
(4.63)
where the equality comes from (4.60). To handle the first probability on the right
of (4.62) note that for each x E Bj , there is probability at least ri that x E A(2i+ l)jr.
and the appropriate events with those probabilities are independent for different
x's. Therefore, letting N be the integer part of E(da)n j , we have
(4.64) P(x E A(2i+l)jt for some x, Ix - el = nj) ::: P(IBj I ::: N)[l - (1- ri)N].
where
fer) = E[1 - (1 - r)N]a nj .
Note that f (0) = 0 and f' (0) = EN a nj ::: E2 (da 2)nj - w nj , which can be taken
to be > 1 by taking j large since da 2 > 1 and a < 1. In this case, since
f(1) = w nj < 1, f has a fixed point r* in (0, 1), i.e., f(r*) = r*.
Now we can prove inductively that ri ::: r* for i ::: o. Since ro = 1, the basis
step is automatic. If ri ::: r*, then the monotonicity of f in r implies that
From Proposition 4.57 we get immediately the following result, which collects
a number of properties of {3, particularly in the vicinity of ).,2.
4. The Process on the Homogeneous Tree Td 101
Remark. In part (a), only the left continuity of f3 is asserted. Note that by parts
(b) and (d), f3 is not right continuous at A2.
Proof of Theorem 4.65. The monotonicity of f3 is clear from (4.48) and the fact
that each u(n) is nondecreasing in A. For the left continuity, write
for fixed n. The probability on the right is continuous in A for fixed T, since it
involves the graphical representation for only a finite time period. Since u (n) is
an increasing function of A, it follows that u(n) is left continuous in A. By (4.48)
and (4.49),
1
f3(A) = supu(n);;,
n
so (a) holds. Since At CAt. (4.59) implies strong survival. Therefore by Proposi-
tion 4.57, A < A2 implies f3(A) ::s ~. Combining this with (a) gives one inequality
in (b): f3(A2) ::s ~. We will return to the other inequality shortly. For part (c), it
suffices to use this half of (b), which implies that limn u (n) = 0 by (4.49), and to
note that
Now apply the extended Borel-Cantelli Lemma (page 240 of Durrett (1996)).
Part (d) also follows from (4.66), since it implies that u(n) is bounded below for
A> A2.
102 Part I. Contact Processes
so that by (4.66),
P(GI..¥0 :::: P(G)21(As*0}.
By the martingale convergence theorem,
P(GI..¥0 -+ Ie a.s.
as s -+ 00. Since P(G) > 0 for A above A2, it follows that
{As =1= 0 V s} c G,
and hence that peAs =1= 0 V s) :s P(G). The other inequality is clear.
4. The Process on the Homogeneous Tree Td 103
Continuing with the proof of (f), it now follows from (4.66) that
where
E = peAt =1= 0 'V t)1),=),2 > O.
But by (4.49) and part (b) above,
1
u(n) < - for A <_ A2.
- dn/ 2
It follows that u(n) is discontinuous at A2 once E2d n > 1, which completes the
proof of (f), since we observed earlier that a discontinuity in one u (n) leads to a
discontinuity in all the u(n)'s.
For part (g), apply (4.67) with A = {el}, using monotonicity and homogeneity,
to get
(4.69) (1 + A)U(l) ?: A.
Combining (4.69) with (4.49) gives (g). Part (h) is an immediate consequence of
parts (b) and (g).
Remarks. (a) The above statement is trivially true for A :::: A\, since then v = 80.
It is trivially false for AI < A :::: A2, since the limiting distribution is 80 whenever
A is finite.
(b) An immediate consequence of Theorem 4.70 is that 80 and v are the only
extremal invariant measures if A > A2. Later, we will see that there are infinitely
many extremal invariant measures in the intermediate regime - see Theorems
4.107 and 4.12l.
Proof of Theorem 4.70. Assume A> A2. By Theorem 4.65(d), {3(A) = 1, and then
by Proposition 4.57,
104 Part I. Contact Processes
Therefore
so that
lim liminf p(A~(n) n B(n) =1= 0) = l.
n~oo (---+00
Proof Clearly CXA == I if A is infinite, so we will consider only finite A's. Since
more than one value of J... will be used below, we use a subscript to indicate its
value: PAO. The right continuity of CXA is immediate, since
as t t 00, and PA (A~ =1= 0) is increasing and continuous in A for each t. To prove
left continuity, define the event Dn,t by
Take Al < A" < A' < A, and use the strong Markov property, monotonicity and
spatial homogeneity to write
Since the first factor in the products above depends on the graphical representation
only up to time t, it is continuous in A', so that we can let A' t A and conclude
that
CXA(A-) 2: P). (Dn,t)CXB(n) (J...II).
Now let t t 00 and note that PA (Dn,oc;) = CXA (A) for every n. The reason for this is
that if x E At for some x and t, then there is a positive probability (depending on
4. The Process on the Homogeneous Tree Td 105
n) that B(x, n) C At+l, so on the survival event, this will occur with probability
1. Therefore,
aA(A-) 2: aA(A)aB(n)(A").
But by duality (1.7), and the fact that v(0) = 0 for A> Al by (1.5),
lim aB(n)(A)
n---+oo
= n---+oo
lim v{B: B n B(n) =1= 0} = 1,
so that aA (A-) 2: aA (A). It then follows that aA is left continuous at every A > AI.
But it is == 0 for A S Al (by definition for A < AI, and by Proposition 4.39 for
A = AI), so it is left continuous everywhere.
Remark. Continuity of the survival probability (above the critical value) was
proved for the one dimensional contact process on page 266 of IPS using the fact
that there are only two extremal (translation invariant) invariant measures. For the
contact process on Td , this proof works perfectly well, once that fact is proved,
and it can be proved in a manner similar to that on page 168 of IPS.
and
(4.76) limV(t) = 0.
ttO
1
Vet) = supu(n, nt)., t > 0,
n
so that (4.77) follows from (4.78). This proves the logconcavity statement.
Turning to (4.75), note that u(n, t) s u(n), so (4.49) implies that Vet) s fJ(A)
for all t. On the other hand, since Ant CAnt,
Then
< P(Xt _
u(n ,t)- > n) = peSn_< t) <
-
Eeo(t-Sn) = eOt (_A_)n
A+8 '
4. The Process on the Homogeneous Tree Td 107
where Sn is the partial sum of n i.i.d. exponential random variables with parameter
A, and e > O. Replacing t by nt and optimizing over e gives the bound
Proof Since U is logconcave by Proposition 4.74, the limit in (4.81) exists and
equals the infimum. Since u(O, t) S EWp(At) for any p > 0,
1
(4.82) lim [u(O,
1-+00
t))' S ¢(p)
by (4.23). (The limit on the left exists by Theorem B22, since u(O, s)u(O, t) s
u(O, s + t).) The argument that led to (4.72) also implies
Passing to the limit as n ---+ 00 and using (4.82) and the definition of UO gives
1 1
[U(t)]'[U(1)]' S ¢(p).
The second inequality in (4.81) now follows from Theorem 4.65 (e). The first
inequality is easier: Take n = 1 in (4.73), and use the fact that once el E At, el
stays infected for an exponentially distributed time.
Now we are in a position to relate the functions ¢ and U, and then to show
that f3 can be used to generate solutions to the equation ¢ (p) = 1. Recall that
this was the reason for introducing the growth profile U. Recall in this connection
Proposition 4.50(a), which gave one inequality in (4.85) below.
(4.85)
108 Part I. Contact Processes
Proof For the first statement, fix a p > II Jd, and let a be the supremum
appearing on the right of (4.84). Note that by (4.28),
00
By Proposition 4.74 and Lemma 4.80, the function f(t) = 10g[dpU(t)] is concave
and has limiting slope 00 at t = 0 and a finite negative limiting slope at 00.
Furthermore, log a is the supremum of the slopes of lines joining the origin to
points on the graph of f. If a' < a, then the line with slope log a' must intersect
the graph of f at some point, call it s. Then f(s) = s loga'. Therefore
00
Taking tth roots and passing to the limit using (4.86) gives ¢ (p) ::: a, which
completes the proof of (4.84).
The first equality in (4.85) is just Proposition 4.27(a). Taking p = IldfJ(A) in
(4.84) gives
where the inequality comes from Proposition 4.74. Combining this with Proposi-
tion 4.50(a) concludes the proof of (4.85).
Proof Recall that f3 is left continuous on [0, (0) by Theorem 4.65(a), so we need
only check right continuity on [0, A2). By Proposition 4.50(a),
Take A < A2. We want to show that f3(A+) = f3(A). If not, then f3(A) < f3(A+) :::
~ by Theorem 4.65, so that (4.88) is an equality by Theorem 4.83. But this
contradicts Proposition 4.44(a) with
1
PI = df3(A+) and P2 = df3(A)·
To prove part (b), it suffices to note that A > Al implies that
1
f3(A) ::: d'
since otherwise
d + 1 00
EI Ut Atl ::: Lu(lx - el)::: -d- L [df3(A)f < 00
x n=O
by (4.49). Therefore by part (a), f3(AI) ::: ~. Combining this with Proposition
4.50(b) gives the result.
For (c), suppose A > Al and f3(A) = ~. Then <p(l) = 1 by Theorem 4.83,
so SUPt EIAtl < 00 by Proposition 4.27(b). It follows that At dies out, which
contradicts the definition of AI.
and then
1
(4.91) ¢({3) = ¢(d{3) = 1
by Theorem 4.83. In order to construct nontrivial invariant measures for Ar. start
by taking a nonnegative function a on Td. Let JL = JLa be the product measure on
the subsets of Td with marginals
and let JLt be the distribution of the process at time t when the initial distribution
is JL. By duality (1. 7), for any finite set B,
M tB = L a(x)(d{3)-lx-e l.
xeA~
As we will see shortly, this quantity essentially controls the behavior of JLr. so
we need to develop some of its properties. In fact, in the proof of Theorem 4.107
below, we will prove and use the following bounds
Proof It is enough to consider the case a == I. In this case, since there are
(d + l)d n - 1 sites a distance n away from e for n :::: I,
d + I 00
(4.94) EMt = E L(d{3)-lx-e l = Pee EAt) + -d- L peen E A t ){3-n.
xeA t n=l
On the other hand, taking p = 1/d{3 > l/,jd in (4.28) and using the fact that by
Lemma 4.26(b), an(p) is asymptotic to a constant multiple of (dp)n = (3-n we
see that the right side of (4.94) is bounded above and below by constant multiples
of EWp(At). The lemma now follows from Proposition 4.27(b) and (4.91).
4. The Process on the Homogeneous Tree Td 111
F or the next result, recall that S (x) is x, together with all the descendents of x.
L (df3)-lx-e l = r k f3- k - n ,
Ix-ekl=n
XES(ek)
d-l . 2·
L I I k
(df3)-x-e = - d - r J f3- ]-n+ for 0 < j < k,
Ix-eki=n
XES(ej )\S(ej+l)
and
L (df3) -Ix-el = f3k-n.
Ix-ekl=n
xETd\S(ej)
Using the above lemma, we can investigate the limiting behavior of EM;k
as t ---+ 00. The boundary aTd of Td is defined as the collection of semi-infinite
self-avoiding paths emanating from e. This is a natural extension of the idea of
identifying a vertex x E Td with the (finite) path that leads from e to x. A base for
the natural topology on aTd is given by collections D(xo, ... ,xn ) of such paths
that share a common initial segment {e = Xo, Xl, ... ,xn }. With this topology,
Td U a Td becomes a compact metrizable space. An example of a metric for this
topology is the following. Fix a e E (0,1). For x, y E Td U aTd, let z be the
endpoint (other than e) of the intersection of the self-avoiding paths from e to x
and from e to y respectively. Then
dist(x, y) = e le - ZI [2 - e 1x - zl - e1y-zll
Note that S(ej) can be naturally interpreted as a subset of aTd for j :::: 1:
S(ej) = D(eo, ... ,ej). Let y be the uniform probability measure on aTd, i.e., the
one for which
1
y(D(xo, ... ,xn )) = (d + l)d n - l ' n:::: 1.
Lemma 4.96. Suppose that a is uniformly bounded on Td, and that the limit
where
for z E Seek),
for Z E S(ej)\S(ej+d, 0 < j < k and
for Z E Td\S(ed.
Remark. It is not hard to construct many bounded a's that satisfy (4.97). To see
this, let Xn be the discrete time simple random walk on Td , i.e., the one that moves
to each neighbor with probability I/(d + I). Since Xn is transient,
Xoo = lim Xn
n---..oo
E aTd
exists a.s. If a is a bounded Borel function on aTd , one can define an extension
a on Td by
= L a(x)(d.B)-lx-elp(x E A~k)
(4.98) Ix-e,I::ok
00
with similar statements holding for sums and integrals over S(ej)\S(ej+l), 0 <
j < k and over Td\S(el). Recalling that Nt is Mt when a == 1, comparing (4.94)
to (4.98), and using (4.99), Lemma 4.95,
(4.100) EM
lim _
t-'?oo
_t
E Nt
x
= 1 aTd
a(z)GxCz)dy, x i= e.
Using the explicit expression for G, one can check that
(4.101)
.
hm (dfJ)lx-e l
X-'?W
1 aTd
a(z)Gx(z)dy =
(d
d(l - fJ2)
+ 1)(1 -
2 a(w),
dfJ )
for a.e. w E aTd.
For example, suppose w = {eo, el, ... } E aTd, and assume that a(x) -+ a(w) as
x -+ w. Then
(4.102)
Passing to the limit using the expressions for y(S(ek)), ... given in the proof
of Lemma 4.96 and summing the resulting geometric series gives (4.101) in this
case.
We will also need information about the second moments of The first M:.
step is an application of the BKR inequality (or more specifically, the special case
known as the BK inequality - see the discussion of Theorem B21).
P(y, Z E A:) ::::: A L t P(u E A~)P(3 disjoint active paths from (u, s)
Iu-vl=! 10
to (y, t) and from (v, s) to (z, t), or 3 disjoint active paths
from (u, s) to (Z, t) and from (v, s) to (y, t))ds.
By Theorem B2l,
P(3 disjoint active paths from (u, s) to (y, t) and from (v, s) to (z, t»
::::: P(3 an active path from (u, s) to (y, t»
x P(3 an active path from (v, s) to (Z, t»
= P(y E A~_s)P(z E A~_s)'
Applying this twice and using symmetry in the resulting sum gives the result.
y z
= L(d{3)-2 Iy -e P(y El An
y
+ 2A L 1t P(u E A~)EMtU_sEMtV_sds.
lu-vl=! 0
4. The Process on the Homogeneous Tree Td 115
Take a p satisfying
where the final inequality comes from Proposition 4.27(b). The right side of (4.106)
tends to zero as t -+ 00 by (4.90), (4.91) and Proposition 4.44(a). To handle the
second term on the right of (4.105), use (4.100) to write
lim sup L
t-'>oo lu-vl=!
t P(u
Jo
E A;)EM~_sEMtV_sds
We are now ready to carry out the basic construction of invariant measures.
Theorem 4.107. Suppose that a is uniformly bounded on Td, and that the limit
If a] S a2 are two such a's, then the corresponding invariant measures can be
taken to satisfY val S Va2 ·
Proof Define IJ.t given in (4.92) to be the distribution at time t of the process with
the appropriate initial product measure. Since a is bounded, and no limiting proper-
ties of IJ.t depend on a(x) for any fixed x, we may assume that a(x)(dfJ)-lx-e l S 1
for all x. Using the elementary bounds
1
1- "E·
~.-
< n(1 - E·) < 1- "E· + -2~'J
.-~.
"E·E·
i i i iOPj
(4.111)
(4.112) K = nlim -
1
..... oot(n)
I
0
t (n)
ENtdt
I
and
1 t (n)
(4.113) Va = nlim -
..... oo ten) 0
IJ.tdt
exist. Therefore,
.. IJ.t{A:XEA} . IJ.t{A:XEA}
K hm Illf < Va {A : x E A} < K hm sup .
t ..... oo ENt - - t ..... oo ENt
and
. (df3)lx-ell· IJ.t{A : x E A} d(1 - f32)a(z)
I1m 1m sup =
(d + 1)(1 - df32)
-----~
HZ 1->00 ENt
for a.e. zE aTd. Combining these statements leads to
for a.e. z E aTd. The extra constant in front of a(z) can be removed by replacing
the a used in the initial product measure by an appropriate multiple of it. This
gives (4.109). The fact that Va is invariant is a consequence of Theorem B7(t).
4. The Process on the Homogeneous Tree Td 117
To prove (4.11 0), let B C Td be chosen randomly according to Ma, and in-
dependently of the Poisson processes used in the graphical representation of the
contact process. Use additivity and duality «1.2) and (1.7)) to write
The sum of the terms on the right of (4.114) that have U =1= v is at most EM: M r
By Lemma 4.104 and the Schwarz inequality,
Ix-el+lv-el
lim (dfJ)-2-"-limsupEM:M( = O.
x,y-+aTd 1--+00
(4.115) lim lim sup sup (dfJ)n Lex (u) (dfJ) -tu-et P (u E A: n Ai) = 0,
k-+oo n--+oo tx-et~n,ty-et~n u
tx-yt~k
L(d,6)-lu-e p(u
1 E A; nAn = L(d,6)-lu-e p(x, Y E 1 A~)
u u
::;)" L(dfJ)-I(U) L
u tw-zt=110
[pew r E A~)
(4.116)
+ P(z E A~)]P(x E A~_s)P(y E A;_s)ds
where the equality comes from duality, the first inequality comes from Lemma
4.103 together with the fact that leu) ::; lu - el, and the second inequality comes
from Proposition 4.27 and (4.91), since pew E A~) is symmetric in u and w.
It is enough to consider one of the dfJ terms that appears on the right of
(4.116). Break up the sum below according to whether Iy - z I < j or :::: j:
118 Part I. Contact Processes
L (d{3)-I(w) 1 00
P(x E A~)P(y E A;)ds
1
Iw-zl=1 0
:s (d + 1) L(d{3)-I(W) 00
P(x E A~) sup P(u E A.,)ds
w 0 lu-el:::j
+ IW~=I (d{3)-I(w) 1 00
P(x E A~)P(y E A;)ds
Iy-zl<j
:s (d + I)C(~)(d{3)-I(X)
d{3
1 00
sup P(u
lu-el:::j
E As)ds
1
0
where in the second inequality, we have used Proposition 4.27(b) on the first term,
and the triangle inequality Ix - wi :::: k - Iy - wi :::: k - j on the second term.
Without loss of generality, we may take x, y E See), in which case lex) = Ix - el
and ley) = Iy - el. In passing to the limit on k in (4.115), we can first let k -+ 00
and then j -+ 00. Therefore, we will have proved (4.115) if we prove
(4.117) lim
j->OO
1 0
00
sup P(u
lu-el:::j
E As)ds = O.
To do this, take a p between Ij,J(i and Ij(d{3), and use (4.28), Proposition 4.27(b),
and the fact that an (p) is asymptotic to a constant multiple of (pd)n (which tends
to (0) to conclude that
lim
a->oo
Va = v,
the upper invariant measure.
4. The Process on the Homogeneous Tree Td 119
Proof First note that the above limit exists by monotonicity, and is invariant by
Theorem B7(c). For constant a, let fJ";. be the product measure with marginals
M~{A : x E A} = min[a(d,B)-lx-YI, 1].
Then
Y > z
Ma - Ma(dfJ)-IY-ZI'
An analogous property holds for the invariant measures constructed using the
method of Theorem 4.l07 with these two initial measures. Letting a ---+ 00, we
see that lima - Hlo Va is stochastically larger than any "translate" of itself. But that
implies that it is invariant under the automorphisms of Td . Now use Theorem 5.l8
on page 168 of IPS to complete the proof. (This theorem was proved in IPS for
particle systems on Zd, but the proof is the same for particle systems on Td.)
Remark. It is not hard to extend the statement of Theorem 4.l07 to allow un-
bounded functions a on Td, whose boundary limit a(z) is allowed to be infinite
on a set of positive measure. For a z E aTd for which a(z) = 00, (4.l09) is to be
interpreted as meaning
(4.120) lim va{A : x E A} = v{A : YEA}.
x---+z
Theorem 4.121. Let B = U~=I S'(xn) c h where the S'(x n) are disjoint. Then
there is an invariant measure VB for the contact process that satisfies
120 Part I. Contact Processes
Proof Let !J,t be the distribution of the contact process with initial set B. By
duality (1.7),
!J,t{A : An c =1= 0} = peA; n B =1= 0)
for any finite C C Td . Since the process does not survive strongly, Af eventually
leaves every finite set, and therefore,
Therefore,
VB = lim !J,t
1--+00
exists, and is invariant by Theorem B7(e). To prove (4.123), take x E S'(x n), and
write
(4.125) VB = n--+oo
lim VB n •
It mayor may not be the case that VB is not trivial; i.e., =1= 80 . The next result
gives some indication of how large B needs to be for this to be the case.
4. The Process on the Homogeneous Tree Td 121
Theorem 4.126. Take 0 < p < 1, and consider bond percolation on Td in which
bonds are open with probability p. Let B C aTd be the (random) set of points
that are connected to e by an open path. Then with probability one on the event
{B =1= 0},
if P < 1
I dfJ'
1
Iif p> dfJ'
so that
lim v Dn {A : x E A} = 0
n--+oo
by (4.122), and hence VB = 80 .
For the second case, we will carry out a construction similar to that in the
proof of Proposition 4.57. Let {Bj, j 2: O} be the random sets constructed in that
proof corresponding to a fixed nand t that satisfy (4.60) for an a such that
(4.127) dap> l.
That such a choice can be made follows as before from Lemma 4.53 since d{3p >
l. Recall that Bj C Ajt and the cardinalities IB j I form a Galton-Watson branching
process whose offspring distribution has mean d nan. Therefore IB j n Cjn I is again
a Galton-Watson branching whose offspring distribution has mean d nan pn. This
process is supercritical by (4.127), so its survival probability
vD.{A : e
J
E A} = lim peAs
s~oo
n Djn =1= 0)
2: lim P(B k
k--+oo
n Ckn =1= 0) =q > O.
and therefore
vB{A : x E A} 2: q > O.
122 Part I. Contact Processes
An easy way to carry out the combined construction is the following: Suppose
VI and V2 are invariant measures, and let A I and A2 be independent with distri-
butions VI and V2 respectively. Let v be the distribution of Al U A 2, and Vt be the
distribution at time t when the initial distribution is v. As a consequence of the
monotonicity and additivity properties of the contact process, for specific choices
of AI, A2, and i = lor 2,
so that
lim vdA : x
x--+z
E A} =0
for some z E aTd for example, then
lim J1{A : x E A}
x~z
= lim v2{A
x~z
: x E A}
for that z.
It is also possible to interpolate between the constructions in Theorems 4.107
and 4.l21. We will indicate briefly what we have in mind, but will not strive for
maximum generality, nor give full details, since we would still not have a charac-
terization of all invariant measures. There are two parameters in the construction:
apE (0, 1] and a connected set B C Td that satisfies the property that
. #{YES'(x)nB:ly-xl=n}
(4.128) hm = a(x)
n-+oo an
exists for every x =1= e, where ap{3 = 1. The limit necessarily satisfies
(4.129) a(x) = a-I L a(y).
YES'(x).ly-xl=1
Also, put aCe) = a-I Lly-el=1 a(y). One way of generating such a B is via the
bond percolation process used in the statement of Theorem 4.126 with P = J.
4. The Process on the Homogeneous Tree Td 123
so it is natural to define
M: = L ply-el.
yEA:nB
The analogue of (4.98) is
The analogue of Lemma 4.95 implies that all limits as t --+ 00 of M: k are bounded
above and below by constant multiples of
k-I
la(ek) + LP2j-kaj-k[a(ej) - a-Ia(ej+t>] + p-ka-k[a(e) - a-Ia(el)].
j=!
Remark. One consequence of this result is the analogue of the remaining im-
plication of Theorem 4.8(b): If the process does not survive strongly, then
¢(1/v'd) ::: 1. To see this, take A < A2. By Theorems 4.65(b) and 4.130,
f3 < 1/v'd. By Theorem 4.65(e), ¢(1/v'd) < 1. By Proposition 4.33(b), we
can let At A2, to conclude that at A2, ¢(1/v'd) ::: 1. In fact, passing to the limit
in (4.85) gives ¢(1/v'd) = 1 at A = A2.
Proof of Theorem 4.130. The proof uses heavily the graphical representation of the
contact process that is described in Section 1. Recall that it is based on a collection
of rate 1 Poisson processes {Nx, x E Td } that generate recovery symbols, and
rate A Poisson processes {N(x,y), x, y E Td , Ix - yl = I} that generate infection
arrows. To each infection arrow a, associate a Bernoulli random variable ~a with
parameter p. These are to be conditionally independent given the collection of
Poisson processes.
124 Part I. Contact Processes
A* = ____A_p_ _ __
1 + (d + 1)(1 - p)A
where N n is the number of arrows that are pivotal for G n (in the original graphical
representation). Recalling the definition of f3 in (4.48), and using
we see that in order to prove the theorem, it suffices to show that for some C and
some a < 1,
(4.131)
Let am be the mth infection arrow whose endpoints lie in this cluster, ordered by
the time coordinate Tm of am. Ifthere are fewer than m such arrows, set Tm = 00. If
Tm < 00, let Xm and Ym denote the tail and head of am respectively. By definition,
5. Notes and References 125
there is an active path from (e,O) to (Xm, i m), and therefore also an active path
from (e, 0) to (Ym, i m). Then am is pivotal for G n if and only if
(a) there is an active path from (xm, i m) to {en} X (0,00) or there is an active
path from (Ym, i m) to {en} X (0,00), and
(b) there is no active path from (Arm \{Xm, Ym}) X {im} to {en} X (0,00).
For fixed n, let Fm be the event that am is pivotal for G n, and let Dm,k be the
event that the shortest paths from Xm and Ym to the tail w of the next pivotal arrow
(or to en if there are no further pivotal arrows) intersect the path eo, el, ... , en
in a segment of length at least k. Using some graph theoretic arguments, one
can show that on Fm n Dm.k. there is an active path from {xm, Ym} X lim} to
{w} X (0,00) whose projection onto Td travels a distance:::: k on eo, el, ... , en,
and does not intersect the active path guaranteed by (a) above, except possibly at
the endpoints. By Theorem B21 applied to the events Fm and Dm,k. and the strong
Markov property applied at the stopping time i m , the conditional distributions
of the lengths of the parts of the path eo, el, ... , en covered between successive
pivotal arrows is dominated by a distribution with exponentially decaying tails.
This gives (4.131).
which is part of the statement that v has positive correlations. Belitsky, Ferrari,
Konno and Liggett (1997) proved the following inequality, which generalizes (5.1)
and (5.2):
a(A n B)a(A U B) :::: a(A)a(B).
126 Part I. Contact Processes
Using duality again, this can be viewed as a correlation inequality for the upper
invariant measure v.
Other types of correlation inequalities have been conjectured for special graphs.
For example, if S = Zl, Konno (1994) conjectured that v satisfies
where in the cylinder probabilities above, there are m zeros to the left of the one
and n zeros to the right of the one. Liggett (1994a) proved that if Ac < A < 2,
then the above inequality holds (strictly) for some choices of m, n ::: 1. Some
numerical evidence for the conjecture in case 1 ::: m, n ::: 2 is given in Tretyakov,
Belitsky, Konno and Yamaguchi (1998).
Recurrence vs. Survival. Salzano and Schonmann (1997, 1999) have studied the
contact process on fairly general graphs, and discovered a number of phenomena
that occur in that context but do not occur on Zd or Td . The second lowest extremal
invariant measure Vr that appears in the titles of these papers is defined as follows:
Define the recurrence probability by
for finite B. Note that by (1.8), v can be thought of as having been defined in
the same way, but with f3B replaced by the survival probability aBo The measure
Vr is the second lowest extremal invariant measure in the sense that any invariant
measure v that puts no mass on 0 lies above it in the following weak sense:
for all finite B. As mentioned by Salzano and Schonmann (1997), Andjel (private
communication) has proved that Vr ::: v in the stronger sense of (B8). Of course,
it is often the case that Vr = 00 or Vr = V. For the tree Td , for example,
00 = Vr = V if A::: AI,
00 = Vr =I=- v if Al < A ::: A2, and
00 =I=- Vr = V if A> A2.
(1.11) for all finite initial configurations A is not monotone. On the other hand,
recurrence (in the sense that fJA > 0 for all finite A =1= 0) and what they call
partial convergence ((1.11) for finite A with aA replaced by fJA) is a monotone
property.
Their second paper is primarily devoted to the study of continuity properties
of aA and fJA as functions of A.
Grippenberg (1996) gave another lower bound in this case that improves the .03
to .4.
The complete convergence theorem for the contact process on Zd has a long
history. Griffeath (1978) proved it in one dimension for A above the critical value
for the one-sided contact process. Durrett proved it in one dimension for all A > Al
- see page 284 of IPS. Durrett and Griffeath (1982) proved it for all d and
sufficiently large A. Schonmann (1987b) simplified their proof. Andjel (1988)
proved it for a larger class of A'S. The final result, Theorem 2.27, was proved
by Bezuidenhout and Grimmett (1990), though the proof given there is somewhat
different.
Here are some other results that have been proved for the contact process on
Zd:
Chen, Durrett and Liu (1990) gave a necessary and sufficient condition for the
convergence in Theorem 2.27 to be exponentially rapid in the one dimensional
case. Examples of initial distributions that satisfy this condition are homogeneous
product measures and deterministic finite configurations.
Gray (1991) proved a number of monotonicity properties for the one dimen-
sional contact process, including the following:
p(x E A)O))
is a decreasing function of Ix I. Note that even though this might appear to be
obvious, there does not appear to be any simple way to prove it.
Durrett and Schonmann (1988b) proved that the upper invariant measure v
for the one dimensional supercritical contact process has the usual large deviation
behavior:
1 {IAn[l,n]1
lim -logv A : E [a, b]
}= - .
mf ¢(x),
n~oo n n aSxSb
128 Part I. Contact Processes
Tn
. {
= mf t > 0:
IA~n[1,n]l}
n > x
satisfies
Tn
f3n => T,
where T has the unit exponential distribution and {f3n} is an appropriate normal-
izing sequence. For related results in a more general context, see Lebowitz and
Schonmann (1987).
The Shape Theorem. This theorem states that the supercritical contact process A;O)
has an asymptotic shape in the following sense: Let Ht = us<tA1° C Zd and
d (0) Zd -=
K t = {x E Z : At = At }. Define H t = UxEH,C(X) and K t = UXEK,C(X),
-
Critical Values. Small improvements have been made in critical value upper
bounds: Liggett (1995b) improved (1.28) to Ac :::: 1.942. The point of this was not
so much that the numerical value is a bit smaller than 2, but rather that the new
bound results from a procedure that in principle can be used to generate succes-
sively better bounds. Stacey (1994) improved (1.29) for d = 2 to A~2) :::: .79.
Durrett (1992) developed another technique for getting upper bounds that ap-
plies in significant generality, which is based on certain computations on small
finite sets. For the one dimensional contact process itself, though, the best result
he gets is Ac :::: 3.95.
Various upper bounds on the survival probability and corresponding lower
bounds for the critical value have been obtained by Katori and Konno, in a series
of papers listed in the Bibliography. Much of this material is treated in Konno's
1997 lecture notes.
for any function f that depends on finitely many coordinates, where =} denotes
convergence in distribution and N (0, (J"2) is the normal distribution with mean
zero and variance (J"2. If f is increasing and not constant, then (J"f > O.
Edge Processes in One Dimension. Consider the supercritical one dimensional con-
tact process whose initial configuration has a rightmost infected site, and infinitely
many infected sites on its left. Let rt be the position of the rightmost infected site
at time t:
rt = max{x : x EAt}.
Galves and Presutti (1987a) proved that properly scaled, rt converges to a nonde-
generate Brownian motion. A simpler proof, presented in the context of oriented
percolation, was given by Kuczek (1989). An extension of his argument to non-
nearest neighbor contact processes in one dimension is provided by Mountford
and Sweet (1999).
According to results in Section 2 of Chapter VI of IPS,
. rt
hm -
t--+oo t
= peA)
a.s., where peA) > 0 in the supercritical case. Galves and Presutti (1987b) proved
that the distribution of the process shifted by p(A)t converges to the symmetric
mixture of the two extremal invariant measures:
1 1
-8 0 + -v.
2 2
The process viewed from rt is defined by shifting by rt units:
Galves and Presutti (1987b) proved that this process has a unique invariant measure
for A > Ac. The existence had been proved earlier in the oriented percolation
setting by Durrett (1984). Andjel, Schinazi and Schonmann (1990) proved that
this invariant measure can be coupled with the upper invariant measure v of
the unmodified contact process in such a way that there are only finitely many
discrepancies to the left of the origin. Galves and Schinazi (1989) proved that the
invariant measure is the limit as n -+ 00 of the invariant measures for a truncated
process that is not allowed to die out or to have cardinality greater than n. Cox,
Durrett and Schinazi (1991) proved the existence and uniqueness of the invariant
measure in the critical case.
Long Range Contact Processes. Consider the contact process on the graph Zd in
which vertices x, yare connected by edges if their Euclidean distance is at most
M. Renormalize the infection parameter so that A is the total infection rate from
a single isolated site, and let A) (M) be its critical value for survival. Bramson,
Durrett and Swindle (1989) proved that
lim Al(M) = 1,
M--+oo
130 Part I. Contact Processes
which is the critical value for the corresponding branching random walk. More
interestingly, they found the asymptotics of the error in this limiting statement:
There are positive constants C I , C2 (depending on d) so that
2 2
CIM-} ::: Al (M) - I ::: C2M-} if d = 1,
(5.3) C I (logM)M- 2 :::
AI(M) - I::: C2 (logM)M- 2 if d = 2,
d
CIM- ::: Al (M) - 1 ::: C2M-d if d 2: 3.
Durrett and Perkins (1999) proved that rescaled long range contact processes con-
verge to super Brownian motion in two and higher dimensions, and as a conse-
quence were able to give sharp constants for the asymptotics in (5.3): Let N be
the number of neighbors of a point. Then
61T log N
Al '" 1 + N
in d = 2, and
where W is a space time white noise process on {(t, x) : t > 0, x E R}. They also
show that there is a critical value (}c so that solutions to this SPDE die out (i.e.,
are identically zero in x for some t) with probability one if () < (}c and survive
with positive probability if () > (}c' In this statement, the initial condition u(O, x) is
assumed to be continuous with compact support, nonnegative, but not identically
zero.
Penrose (1996) proved a continuum limit for the threshold contact process on
Zd. (The threshold contact process will be used in Part II as a comparison process
for the threshold voter model.) In this model, recovery occurs at rate 1, and sites
become infected at rate A if there is an infected site within distance M, and zero
otherwise. Let Al (M) be the critical value for survival. The result is that
5. Notes and References 131
as M -+ 00, where f.-Lc is the critical value for a threshold contact process on
Rd. This is analogous to the first Bramson, Durrett and Swindle result described
above. It would be interesting to investigate the analogue of their more refined
result: How does
behave as M -+ oo?
Contact Processes with Stirring. Another way of passing to a limit is to add stirring
(also known as symmetric exclusion - see Part III) at a large rate. Consider the
process whose generator is the sum of the generator of the contact process and D x
the generator of the symmetric nearest neighbor exclusion process on Zd. (D is
the rate at which the values of '1 (x) and '1 Cy) are interchanged if Ix - y I = 1.) This
process is also attractive and self-dual, but as D gets large, it behaves increasingly
like a branching process. The reason is that (for the finite system), the fast stirring
separates particles, so that they are not likely to be close together, and hence are
not likely to affect each other. In fact, Durrett and Neuhauser (1994) proved that
. 1
hm )1.) (D) =-.
D-+oo 2d
The limit is of course the critical value for the associated branching process.
Konno and Sato (1995) obtained explicit lower bounds on the critical value (and
corresponding upper bounds on the survival probability) as a function of D:
ACD» __1_+_C_2d
__-_1_)D
__
I - C2d - 1)(1 + 2dD)
Katori (1994) proved upper bounds on this critical value for d ::: 3. Konno (1995)
proved the following analogue of (5.3) in this context:
I 1 I
CID-'j :::: AICD) - 2d :::: C2 D -'j if d = 1,
1
C I (log D)D- I :::: Al CD) - 2d :::: C2(log D)D- I if d = 2,
1
CI D- i :::: Al CD) - 2d :::: C2D- I if d ::: 3.
In the homogeneous case, survival implies linear growth of At - see the discussion
of shape theorems above. Bramson, Durrett and Schonmann (1991) proved that
this is not necessarily the case for inhomogeneous systems. In their examples,
they take d = 1, A(X, y) == 1, and {8(x), x E Z} to be i.i.d. with a particular
distribution. Madras, Schinazi and Schonmann (1994) give examples in which the
critical process survives, unlike the homogeneous case.
°
Now take 8 == 1 and {A(X, y), Ix - yl = I} i.i.d. In one dimension, Liggett
(1991 a, 1992) showed that At dies out if E log A < and survives if E 2~t 1 < 1.
For d > 1, Klein (1994) proved that there is extinction if
E[ log(l + A) t d)
is sufficiently small, where f3(d) is of order 2d 2 for large d. Andjel (1992) proved
the complementary result that for any f3 < d, survival is possible even if
is arbitrarily small. There is a natural open problem here - what is the correct
power f3, or at least, what is its asymptotic behavior as d ---* oo?
Newman and Volchan (1996) take d = 1, A(X, x-I) == AI, A(X, x + 1) == An
ALAr> 0, and {8(x), x E Z} i.i.d., and prove survival under a condition that is
slightly stronger than
E[ -log8t = 00.
More generally, one can ask what moment assumptions on the transition rates
imply survival or extinction.
in probability, as N ---* 00. For the analogous problem for d > 1, see the discussion
of metastability below.
Durrett, Schonmann and Tanaka (1989) showed in one dimension that TN
grows polynomially in the critical case:
for any a, b > 0. It is not known what the correct power is in one dimension. It
has also not been proved yet that the growth of TN is polynomial in the critical
case in higher dimensions.
5. Notes and References 133
Here are some other results that have been proved for the contact process
restricted to cubes in Zd:
TN -- . f{t >
III _ 0 .. A{l·
N,t.. · ,N) -- 0} .
exists. This limit is positive by Theorem 3.9. Combining these results leads to the
following strengthened form of Theorem 3.9 in all dimensions:
log TN
------;:jd --+ Y+ (Ie )
in probability.
Simonis (1996) proved (b) in this multidimensional setting.
A --+ AU{x}
134 Part I. Contact Processes
10grN
~ 2
10gN
in probability. Sweet (1997) then proved the stronger distributional limit theorem:
was conjectured by Liggett (1996b) and proved by Lalley and Sellke (1998).
The proof of Theorem 4.65(b) is given in Lalley (1999). Theorem 4.65(h) is an
improvement of a result in Pemantle (1992) that gives an upper bound for A2 that
is asymptotic to ejJd as d ~ 00. The proof given here is completely different.
Theorem 4.70 was proved by Zhang (1996); the simplified proof given here is
due to Salzano and Schonmann (1998). Theorem 4.71 was proved by Pemantle
(1992) (under the assumption of the then not fully verified fact that AI < A2). The
proof given here is taken from Salzano and Schonmann (1999). Theorem 4.83 is
due to Lalley (1999). Corollary 4.87(a) and (b) was proved by Schonmann (1998),
5. Notes and References 135
though the proof of part (a) given here is different. Theorem 4.130 is due to Lalley
(1999).
Turning to the construction of invariant measures, Theorem 4.107 is an exten-
sion of the construction given in Liggett (1996b). Theorem 4.121 is due to Durrett
and Schinazi (1995), who also proved that these measures are extremal.
Here are some other results that are related to the contact process on Td:
Critical Values. Let Aid) for i = 1, 2 denote the critical values for the contact pro-
cess on Td • Combining Theorems 4.1(a) and 4.8(c) and using the natural coupling
At C {x : {t(x) ::: l}, we see that
_1_ < A(d) < _1_
d+1- I -d-l'
and hence that
(5.4) lim dA;d) = 1.
d-+oo
(For the analogous statement on Zd, see (1.26).) It follows from Theorems 4.1(b)
and 4.65(h) that
(5.5)
Pemantle (1992) gives improved bounds that lead to the replacement of on the !
left side (5.5) with 2 - J2 : : : .5858. This bound could be improved a bit more
by using the results in Liggett (1996a). Note that unlike (5.4), the limit in (5.5)
cannot be the same as the corresponding limit for the branching random walk
process, which is !
by Theorem 4.8(c). It would be interesting to evaluate the
limit in (5.5).
Some numerical work on critical values and critical exponents for the contact
process on T2 have been carried out. For example, Tretyakov and Konno (1995)
give the estimate Al ~ .542.
and
CI(AI - A)-I::: E 1 00
IAtldt ::: C2(AI - A)-I, A < AI.
rt = min
xEA,
Ix - el, Rt = max
XEA,
Ix - el, Nn(t) = #{x EAt: Ix - el = n}.
136 Part I. Contact Processes
. rt 1 Rt 1.
= -, hm Nn(nsF = dUes)
1
hm - =-, lim -
t-+oo t S2 t-+oo t Sl n-+oo
a.s. on {At =1= '" V t}, provided in the last case that d U (s) > 1.
The Process on a Finite Tree. Stacey (2000) shows that the contact process on
a ball B in Td with A > A2 and Ao = {e} survives for a time that is almost
exponential in the cardinality of B with positive probability.
Liggett (1999) has studied branching random walks on the ball of radius N
on Td . Unlike the contact process, this process survives for large A. Let A~ be the
critical value for this survival. One would expect A~ ~ A2, and in fact it turns
out that
(Recall from Theorem 4.8(d) that A2 = 1/2Jd.) Liggett also gives precise asymp-
totics for the time tN at which the expected number of particles is I when the
initial configuration is ~ == I: For 0 < A < 1/2Jd,
lim
N-->oo
[tN(1-2A-/d)-NIOgd+~IOgN]=C,
2
where C is an explicit function of A and d.
Anisotropic Processes. Heuter (2000) has proved several results analogous to those
discussed in Section 4 for a contact process on T2d+ I, d 2: 1, in which different
infection rates apply in different directions: there are parameters AI, ... , Ad+1 so
that an infected site x with neighbors XI, ... , X2d+2 infects X2i-l, X2i at rate Ai
each.
( 41)
C1 A -
l+ffi/2
:s peAt =F 0 V t):s
(
A-
1)5/2 '
4
1
-<A<1
4 - - ,
thus showing that the critical exponent for the survival probability, if it exists, is
between 2.5 and 2.803. Note the contrast with (5.6), which says that this critical
exponent is 1 for the contact process.
Part II. Voter Models
1. Preliminaries
Interest in voter models began at about the same time that people started working
on the contact process - the mid 1970's. As was the case for the contact process,
these models provided a fertile ground for the use of some of the basic tools in the
area of interacting particle systems. In fact, the main reason for their introduction
was not so much a desire to model political systems, as the name might suggest,
but rather the fact that voter models are exactly the class of spin systems to
which duality can be applied most completely and successfully. After applying
duality, one was often led to problems involving sustems of random walks, and
that provided a close link to one of the most active areas of research in probability
of the previous two decades.
Property (a) implies that the pointmasses 80 and 81 on the constant configurations
YJ == 0 and TJ == 1 are invariant. Property (b) says that the evolution of the system
is not changed by interchanging the roles of 0 and 1, while property (c) makes the
process attractive - i.e., (BI2) and (BI3) hold. Finally, the last property implies
that the process is invariant under spatial shifts.
There are various interpretations that one can give to such a process. Individu-
als placed at the points of Zd might have one of two possible opinions on an issue,
and they change their opinions at random times, based on the opinions of their
neighbors. This interpretation leads to the name voter model. Alternatively, one
could think of Zd as representing territory, each parcel of which is controlled by
one of two competing populations. The transition 0 ~ 1 at x, for example, then
corresponds to control of parcel x changing from population 0 to population 1.
With either interpretation, properties (a)-(d) should appear quite natural.
140 Part II. Voter Models
for all x, Y E Zd and all initial configurations T/. A desirable, but currently unre-
alistic, objective would be to classify all possible rate functions c(x, T/) according
to whether the corresponding processes coexist or cluster.
where p(., .) are the transition probabilities for an irreducible random walk on Zd.
One way of thinking of this process is that the individual at x E Zd waits a unit
exponential time, then chooses ayE Zd with probability p(x, Y), and adopts the
opinion of that y. The main reason why the analysis is easier in this case, is that
linear voter models satisfy a very useful form of duality:
(1.2)
I
2[P(x, y) + p(y, x)].
Theorem 1.3. The linear voter model TJt clusters if X t is recurrent, and coexists
if X t is transient.
In particular,
(a) the process clusters if d = I and
or if d = 2 and
L Ixl2 p(O, x) < 00,
x
and
(b) the process coexists if d ::: 3.
In order to contrast this with the behavior of the threshold voter models that
will be discussed shortly, note that whether the linear voter model clusters or
coexists depends almost exclusively on the dimension of the set of sites, rather
than on the size of the range of interaction.
A lot can be said about the invariant measures in case of coexistence, and
about convergence to invariant measures in both cases. Here are special cases of
some of the results in the first two sections of Chapter V of IPS.
Theorem 1.4. Suppose X t is recurrent, and J-L is any translation invariant proba-
bility measure on {O, l} Z d. Then
Theorem 1.5. Suppose X t is transient. Then the extremal invariant measures for
TJt form a one-parameter family {J-Lp, 0 :s p :s I}, where J-Lp is translation invariant
and ergodic, and J-Lp{TJ : TJ(x) = I} = p. Furthermore, J-Lp has covariances given
by
Cov!-'p (TJ(x), TJ(Y)) = pO - p )pY-x (X t = 0 for some t ::: 0).
If J-L is any translation invariant, ergodic probability measure on {O, I }Zd with
J-L{TJ : TJ(x) = I} = p,
then J-LS(t) ::::} J-Lp as t ---+ 00.
One property of the linear voter model that is quite special is that it has what
is known as a conserved quantity (on average). In this case, this is the statement
that if J-L is translation invariant with density J-L{TJ : TJ(x) = I} = p, then J-LS(t)
satisfies
142 Part II. Voter Models
fJ,S(t){IJ: IJ(x) = I} = P
for all t 2: 0. (Take A to be a singleton in (1.2) and integrate with respect to fJ, to
see this.) It is for this reason that one expects the process to have a one-parameter
family of extremal invariant measures, indexed by the conserved quantity (if there
is coexistence). This will not be true for more general voter models in which there
is no conserved quantity.
If T is large, the process will coexist according to our definition for an un-
interesting reason. For example, if d = 1, AI = {-I, 0, I} and T = 2, then the
following configuration is a trap for the process:
···110011001100
In fact, if IJt (x) = IJt (x + 1) for some x and t, then those two coordinates will
never flip again. It is clear from this observation that starting from any initial
configuration, each site will flip only finitely many times. We will say that the
process fixates if it gets trapped in this way, i.e., if each IJt (x) flips only finitely
often for every initial configuration. This concept is not relevant for linear voter
models. In fact, results in Cox and Griffeath (1983) imply that nearest neighbor
voter models on Zd do not fixate.
There is a small technical issue that must be resolved in carrying out this
construction. It is important to know that only finitely many previous decisions
are relevant in deciding whether to flip the configuration at site x at time t. In
one dimension, this issue is easily resolved by noting that for any k and t, with
probability one, there are infinitely many positive and negative n's so that N x has
l. Preliminaries 143
had no event times by time t for any n :::; x :::; n + k. This breaks up Z into finite
"islands" that have no influence on each other up to time t.
This idea does not work quite so easily in higher dimensions. If d > 1, one
constructs the process for small times, using a percolation argument to control
these finite islands of influence. The small time restriction comes from the fact
that the percolation parameter must be kept small in order to prevent percolation
from occurring. But once the process has been constructed for 0 :::; t :::; E, it can
be recursively constructed for later times by restarting the procedure at integer
multiples of E.
More explicitly, fix t > 0 and construct a random oriented graph with vertex
set Zd by placing an edge from x to y if Nx has an event time by time t and
y E x + .ff. By a potential path of length n, we will mean a sequence Xo, ... ,Xn
of distinct vertices in Zd so that Xi+! E Xi + JV for each i. There are at most
(1J1/ I - 1) n potential paths of length n starting at x, and the probability that any
one of them is a path in the random graph is (1 - e- t r. Therefore, if t is so small
that (IA/·I- 1)(1 - e- t ) < 1, the expected number of vertices connected to x in
the random graph is finite, so only finitely many sites can influence the evolution
of 1]s(x) up to time t.
Duality when T = 1
The graphical representation given above, unlike that described in Part I for the
contact process, does not lend itself to defining a dual process. There is another
graphical representation for some nonlinear voter models, including the threshold
model with T = 1, that does permit the definition of a dual process via arrow
reversal. The representation is of the type known as cancellative. (The duality for
the contact process that was used in Part I is known as additive. For a treatment of
both types of duality, see Section 4 of Chapter III of IPS.) A general description
of this graphical representation and the corresponding duality in the context of
voter models is provided in Section 2 of Cox and Durrett (1991).
Rather than discuss the graphical representation itself here, we will describe
the dual process for the threshold voter model with T = 1, and explain analytically
why the duality equation holds. Let At be an annihilating branching process that
evolves in the following way. At all times, At is a finite subset of Zd. Each
x E At has a rate 2 exponential clock, and when its clock rings, a subset B is
chosen uniformly from all even subsets of x +JV, and At is replaced by AtfJ.B,
where fJ. denotes the symmetric difference of two sets. In other words, any point
falling on an already occupied site leads to the annihilation of both points. The
duality equation is then
(1.7)
Here 1]t is the threshold voter model with T = 1, and is regarded as a subset of
Zd.
144 Part II. Voter Models
(1.8)
The key property that leads to (1.8) is that the derivative of both sides with respect
to t be the same at t = 0 for all choices of A and 1J. The integration of the resulting
identity to get to (1.8) is described in Section 4 of Chapter III ofIPS.
The derivative of the left side of (1.8) at t = 0 is just the generator of the
process 1JI applied to the function H as a function of its second argument. The
generator is given by (B 1):
(1.9)
x
-H(A,1J) if x E A
{
H(A, 1Jx) = +H(A,1J) if x 1: A.
We need to write c(x, 1J) in terms of the H's. To do so, we argue as follows. The
set of all functions that depend on coordinates in a finite set A is a vector space
of dimension 21AI. A basis for this space is the collection {H(B, .), B C A}. The
easiest way to see that they form a basis is to check that they are orthonormal
in L2(V~), and this is an immediate consequence of (1.10). (As usual, v~ is the
product measure with density !.)
Therefore, any function 1 in this vector space
can be written as a linear combination
To evaluate the coefficients, multiply (l.ll) by H (C, 1J) and integrate with respect
to v 1. Using the orthogonality of the H' s, the result is
2
=1 - L H(B, 1]) 1
Bcx+. V (('=0 or ('=1 on x+. V}
H(B, r;)dVl
2
1 )1. VH
=1 - (- L H(B,1]).
2 Bcx+. V.B even
But the right side above is the generator of the dual process At applied to H as a
function of its first argument, and hence is the derivative of the right side of (1.7)
at t = 0.
The dual process is often described in somewhat different terms: Instead of
choosing an even subset B of x + ,/1/' and taking the new state to be At~B,
one chooses an odd subset C of x + ./V uniformly, and takes the new state to
be (At \ {x l) ~ C. This is really the same process, since the mapping B --+ C =
B ~ {x} takes even subsets to odd subsets in a one-to-one fashion, and satisfies
A~B = (At \{xl)~C.
Preview of Part II
The next section deals with general threshold voter models. If a threshold voter
model does not fixate, we should expect that the process will coexist for small
threshold and cluster for large threshold - where large and small are interpreted
as being relative to the size of the neighborhood, I.ffl. The reason for this is that
having a small threshold makes it easy for flips to occur, so it is likely that there
will be a lot of both O's and 1's around at all times. In Section 2, we will verify
this by proving the following results:
(a) The process fixates if T > I.V~I-I.
(b) If d = 1 and T = I· V~H , then the process clusters.
(c) If T = elJVI with e sufficiently small and IJVI sufficiently large, then
the process coexists.
Section 3 is devoted to the case T = 1, which is the only situation in which
anything like complete results are available. In this case, we will show that the
process coexists in all cases except d = 1, ./1/' = {-I, 0, I} (in which case it
clusters by (b) above). Note that this is a very different state of affairs than the
one we saw in Theorem 1.3 for linear models.
Throughout the rest of Part II, we will consider only threshold voter models.
We will use Ix I for x E Zd to denote the restriction to Zd of any norm on Rd.
146 Part II. Voter Models
In this section, we consider the threshold voter model with neighborhood JV and
threshold T, whose transition rates are given by (1.6). We treat fixation, clustering
and coexistence of the model, in that order. Recall that for a fixed choice of
neighborhood, we expect these to correspond to large T, moderate T and small T
respectively.
x,Y:X-yE.V
ry(x)ofory(y)
Note that w(l]) < 00 for all I] E to, l} Z d. We need to see what the effect of a
flip is on the value of w. So, letting I]u be the configuration obtained from I] by
flipping the uth coordinate, write
YEu+,/V,yofou YEU+.,Y
ry(y)=ry(u) ry(y)ofory(u)
and hence
By assumption, IJY'I- T - 1 < T, so that we can choose E small enough that the
last factor in (2.3) is < O. Since every flip at u decreases w by at least a certain
amount, there can only be finitely many flips at u.
d
dtILS(tHI1 : 11(1) = ... = l1(k) = 1}1t=0
k
(2.4) =L IL{11 : 11(j) = 0, l1(i) = 1 for 1 ::: i ::: k, i =1= j}
j=\
If IL is invariant for the process, the left side of (2.4) is zero, so the right side is
zero as well. If IL is also translation invariant, then the two negative terms on the
right of (2.4) are (in magnitude) ::: the first and last terms in the sum respectively.
Therefore the other terms in the sum must be zero:
(2.5) IL{11 : 11(j) = 0, l1(i) = 1 for 1 ::: i ::: k, i =1= j} = 0, 1 < j < k.
Since IL is invariant, it is not hard to show that the fact that these cylinder
probabilities are zero implies that all other cylinder probabilities in which there is
at least one coordinate set to zero and another coordinate set to one are also zero.
This implies that IL is a convex combination of 80 and 8\. We leave the details for
general T to the reader, since we will prove directly the stronger result that this
process clusters.
Here is how it works for T = 1. Take j = 2, k = 3 in (2.5) to conclude that
IL (101) = 0, where we are using a natural shorthand to denote cylinder proba-
bilities. Therefore, IL puts no mass on configurations with a singleton zero. Since
configurations with a doubleton zero can flip to configurations with a singleton
zero with a positive rate, and since IL is invariant, it follows that IL puts no mass
on configurations with a doubleton zero. Arguing inductively, it follows that all
cylinder probabilities of the form IL(10··· 01) are zero. Since IL is translation
invariant, IL(10000· .. ) = O. To see this, let
These are disjoint for different n's, and have the same probability by translation
invariance, so IL(An) = 0 for all n. Therefore,
Theorem 2.6. The threshold voter model in one dimension with ./V = {- T, ... ,
T}, T 2: 1, clusters.
Remarks. (a) One might guess from this result and Theorem 2.1 that threshold
voter models in higher dimensions cluster if T = I· J~I-I . This is not correct. For
example, take d = 2, T = 2 and f f = {(O, 0), (0, 1), (l, 0), (0, -1), (-1,0)}. If
TJ is constant on alternating vertical infinite strips:
···00001111
in which infinitely many zeros are followed by infinitely many ones. Then only the
zero and one at the boundary can flip, so that the configuration will always look the
same, except that the boundary will move like a simple symmetric random walk.
The fact that this random walk is recurrent implies that every site flips infinitely
often.
Proof of Theorem 2.6. The idea of the proof is to construct two sequences of
random times Un, Vn for n 2: 1 with the following properties:
(a) 0 = Vo < UI < VI < U2 < V2 ··· ,
(b) {Uk+ 1 - Vko k 2: O} are i.i.d. with E(Uk+1 - Vk) < 00,
(c) {Vk - Uko k 2: I} are i.i.d. with E(Vk - Uk) = 00,
(d) the random variables in (b) and (c) are independent of each other, and
(e) TJIO is constant on {-T, ... , T} for every t E Ubl[Uko Vk).
Once this construction is made, it will follow from renewal theory that
lim P(TJI(l)
1-+00
=1= TJt(O)) = 0,
so that the process clusters.
We begin with several general comments about the construction, and then
explain concretely how to carry it out. The U's and V's will be defined in terms of
the Poisson processes N x used in the graphical representation described in Section
2. Models with General Threshold and Range 149
1, together with two additional rate one Poisson processes N +, N _. In fact, the
U's and V's will be stopping times with respect to the associated filtration. Also,
Uk+l will depend only on TJVk and the Poisson processes for times t > Vb and
Vk will depend only on TJUk and the Poisson processes for times t > Uk. This
will guarantee the independence required in (b), (c) and (d). The fact that the U's
and V's are separately identically distributed will follow automatically from the
construction. Since the U's and V's will be defined recursively, we might as well
just explain how a U is constructed starting from a general configuration TJ (which
will be the configuration at the previous time V), and how a V is constructed
starting from a configuration TJ that is constant on {- T, ... , T} (which will be
the configuration at the previous time U). We start with the latter.
So, suppose that TJ is constant on {-T, ... , T}. Without loss of generality,
assume that TJ(x) = 1 for Ixi ::: T. We will define two simple, symmetric random
walks L t , R t so that Lo = -T, Ro = T, and TJt(x) = 1 for all L t ::: x ::: R t up
until
Suppose Rs, s ::: t has been constructed, and R t = x. Then R stays at x until the
first time that one of the following happens:
1. There is an event time in N x ; at that time R moves one step to the left.
2. There is an event time in Nx+l and TJ(x + 1) = 0 at that time; at that time R
moves one step to the right.
3. There is an event time in N+ and TJ(x + 1) = 1 at that time; at that time R
moves one step to the right.
L t is constructed in a similar (reflected) way, using N_ instead of N+. Note that
L t , R t defined in this way have the property required above: TJt(x) = 1 for all
L t ::: x ::: Rt up until the time V defined in (2.8). Now, V is the minimum of two
independent random variables, each of which has the distribution of the hitting
time of {O} for a simple symmetric random walk on Z starting at 1. That hitting
time has tail probabilities of order Ct-~ by the reflection principle. (See Section
3.3 of Durrett (1996), for example.) Therefore
{ - T, . .. , T}, so the result is immediate. Suppose now that n > 1, and the
result has been proved for all configurations with fewer intervals of constancy.
Let {j, ... ,k - I} and {k, ... ,I - I} be two consecutive maximal intervals
of constancy of 17 on {- T, ... , T}. Without loss of generality, assume that
17(k - 1) = 1, 17(k) = O. Then
k-I+T k+T
(2.9)
L [1-17(i)]+ L 17(i)=I-17(k-l-T)+17(k+T)+2T~2T.
i=k-I-T i=k-T
Therefore, at least one of the sums on the left of (2.9) is ~ T. If the first sum is
~ T, then c(k - 1,17) = 1, while if the second sum is ~ T, then c(k, 17) = 1. In
the first case, the site U at which we will flip is taken to be k - 1; in the second
case, it is taken to be k. If both sums are ~ T, either choice can be made.
To be specific, suppose that it is the first sum on the left of (2.9) that is ~ T,
so that U = k - 1. Then
k-2+T k-I+T
L [I-17k-l(i)] = I-17(k-2-T)+17(k-I+T)+ L [1-17(i)]~T,
i=k-2-T i=k-I-T
so that site k - 2 can be flipped next. Continuing in this way, we flip sites until site
j is flipped. At that point the number of intervals of constancy has been reduce
to n - 1, so that the induction hypothesis can be applied.
Let UI(17), U2(17), ... ,U m (ry)(17) E {-T, ... ,T} be the sequence constructed
above. It has the property that the successive application of flips at these sites
makes the configuration constant on {- T, ... , T}, and that each of the flips will
occur if there is an event time in the appropriate Poisson process. Note that the se-
quence of sites that are flipped to achieve a constant configuration on {- T, ... , T}
depends on 17 only through {17(X), Ixl :::: 2T}. Therefore, m = maxry m(17) < 00.
Extend the sequence Ui(17) to i :::: m by setting Ui(17) = 0 for m(17) < i :::: m. This
choice has the following property: For the process starting with configuration 17,
if the first m event times among the Poisson processes {Nx, Ix I :::: 2T} occur at
UI (17), U2 (17), ... , Um (17) in that order, then at the last of these event times t, 17t
will be constant on {- T, ... , T}.
Now partition the time axis into intervals of length 1. Let Ak be the event that
the only event times for {Nx, Ixl :::: 2T} in the time interval [k, k + 1) occur at
where 17 is the configuration of the process at time k, and that they occur in that
order. These are independent and have the same positive probability, so that
U = min{k ~ 0 : Ak occurs} + 1
Unlike the standard contact process that is the subject of Part I, the threshold
contact process is not self-dual. Therefore it is no longer immediate that survival is
equivalent to the existence of a nontrivial invariant measure. (See (l.8) of Part I.)
So, we will use the phrase has a nontrivial invariant measure instead of survives
in this context. Here is the basic comparison that will be used for the remainder
of this section, and in Section 3:
Proposition 2.11. For any d, JV' and T, if the threshold contact process with
A = 1 has a nontrivial invariant measure, then the threshold voter model coexists.
Proof The proof relies on a comparison of these two processes with a third one
- a process in which each site flips independently at rate 1. This is the very
simple spin system with c(x, 1) == 1. We will use the superscripts v, c, i to denote
quantities corresponding to the threshold voter model, threshold contact process
with A = 1, and the independent flips process respectively. Let v be the upper
invariant measure for the threshold contact process, and v 1 be the product measure
!.
2
with density
The transition rates for the three processes satisfy the following inequalities:
and
CV(X, 1) :s CC(c, 1) = c i (x, 1) if 1) (x) = 1.
This means that the processes can be coupled so that they satisfy
(2.12)
for all t ::: 0 if these inequalities are satisfied initially. This coupling is analogous
to that used in our discussion of attractiveness - see (B 14).
By the convergence theorem for finite state Markov chains,
(2.13)
as t -+ 00. By (2.12),
(2.14)
152 Part II. Voter Models
Combining (2.13) and the first part of (2.14), we see that v :s v1. Combining this
with the second part of (2.14) and the fact that 117 is attractive (see (B13)) gives
Therefore all Cesaro averages of v 1 sv (t) are stochastically larger than v. Any
weak limit JL of Cesaro averages of v 1 sv (t) is invariant for the threshold voter
2
model by Theorem B7(f), and is stochastically larger than v. But v concentrates on
configurations with infinitely many ones. This statement is a consequence of (1.5)
of Part I for the standard contact process - the proof for the threshold contact
process is identical. Therefore, JL concentrates on configurations with infinitely
many ones. But JL is unchanged by interchanging the roles of zeros and ones,
since both the initial distribution and the transition mechanism for the threshold
voter model have that symmetry. Therefore, JL concentrates on configurations with
infinitely many zeros as well, and so it is nontrivial as required.
The next result explains why it is easier to deal with the threshold contact
process than the threshold voter model. The analogous result for the voter model
(with coexistence replacing the existence of a nontrivial invariant measure) may
well be true, but it certainly does not follow from the simple argument that works
for the contact process.
Proposition 2.15. Suppose 17: is the threshold contact process on Zd 1 with neigh-
borhood Jh: threshold TI and parameter AI, and 11; is the threshold contact pro-
cess on Z d2 with neighborhood .A2: threshold T2 and parameter A2. Assume that
dl ::: d2 ,
(XI, ... ,Xdl) E Jh. implies (XI, ... ,Xdl' 0, ... ,0) E A"2·,
TI 2: T2, and AI:S A2.
Proof All that is required is to couple the two processes so that they are both
== I at time zero, and 11: (XI, ... ,Xdl) :s 11; (XI , ... ,Xdl' 0, .. ,0) for all t and
all (XI, ... ,Xd1 ) E Zd 1 • This is easy to do, using the type of coupling that is
discussed following (B 14). In order to carry out the construction, use the fact that
the transition rates for the two processes satisfy
(2.16)
if 111 (XI, ... ,Xdl) = 112 (XI , ... ,Xdl' 0, ... ,0) = 1 and
CI ((XI, ... , Xd 1 ), 111) :s C2((XI, ... , Xdl' 0, ... ,0),112)
if 111 (XI, ... , Xd 1 ) = 112 (XI , ... , Xdl' 0, ... ,0) = o. The reader should be able to
write down the coupling explicitly.
2. Models with General Threshold and Range 153
. Tn
(2.18) hm sup ;//' < c,
n-+oo 1./ r n I
then 1]7 has a nontrivial invariant measure for all sufficiently large n.
Then while the number of ones in {- L, . " , L}d is at least T, every site in
{-3L, ... ,3L}d evolves like a two state Markov chain YI with rate I for each
of the transitions 0 ~ I and 1 ~ O. Therefore, thinking about the evolution of
the number of ones in a box of side length 2L + 1, it will be useful to make a
comparison with the Markov chain XI on the nonnegative integers with transitions
for k = fJ (2L + I)d and for any t, and in particular for the t that satisfies (2.19),
which we now fix. By the coupling argument used above, (2.20) holds uniformly
for k :::: fJ(2L + I)d.
We are ready to make the comparison with oriented percolation. By (2.19) and
(2.20), by making L sufficiently large, we can guarantee that if
and
L TJt(X) 2: (J(2L + \)d.
XE{L •... ,3Ljx{-L, ... ,Ljd-l
So, we can use Theorems B24(a) and 826 to conclude that if TJo == 1, then
and hence the upper invariant measure for the threshold contact process is non-
trivial.
Proof By Theorem 2.17, the threshold contact process has a nontrivial invariant
measure for large n. Now apply Proposition 2.11.
The first case is used as a comparison process for all one dimensional models
other than the one we know clusters, and the second for all models in two or more
dimensions. As we will see shortly, it is fairly easy to show that the second case
reduces to the first. Most of the work will be required to prove the result in case
(3.1). The proof in that case is a significantly more elaborate version of the proof
of (1.28) of Part I.
Since the proof in case (3.1) is fairly long and difficult, we will wann up by
showing that the threshold contact process with A = 1 has a nontrivial invariant
measure if the range of interaction is somewhat larger:
computation, where Set) is the semigroup for the threshold contact process with
T = 1, and A is a finite subset of Zd:
~J-tS(t){1) : 1) = 0 on A}I
dt t=O
XEA XEA
Proposition 3.6. At survives if and only if 1)t has a nontrivial invariant measure.
Pass to the limit in t to conclude that the upper invariant measure v for the
threshold contact process satisfies
Proposition 3.7. Ifthe threshold contact process has a nontrivial invariant measure
in case (3.1), then it has a nontrivial invariant measure in case (3.2).
Proof Let At be the dual process with transition rates (3.4) in case (3.1), and Bt
be the dual process in case (3.2). By Proposition 3.6, we can equally well prove
that if At survives, then Bt survives. The proof is based on a coupling of the
processes At and Bt . To describe this coupling, define a mapping rr : Z2 --+ Zl
by rr (i, j) = i + 2 j. Here is a picture that shows this mapping, putting the value
rr(i, j) on the point (i, j):
2 3 4 5 6
o 2 3 4
-2 -1 o 2
-4 -3 -2 -1 o
-6 -5 -4 -3 -2
The point is that the four neighbors of an (i, j) E Z2 that has rr(i, j) = k have
corresponding rr values k - 2, k - 1, k + 1, k + 2, which are the four neighbors of
k E Zl. In other words, rr respects the neighborhood structure. Define a relation
A :::: B for A C ZI, B C Z2 by saying A :::: B if and only if
where JK and J1;2' are the neighborhoods in the cases (3.1) and (3.2) respectively.
Since A :s B, A =1= 0 implies B =1= 0, the result follows.
=L
00
F(n) f(k).
k=n
Then
F(n)
fJ..(1000· ··000) = fJ..{rJ: rJ(O) = 1, rJ(1) = ... = rJ(n -1) = O} = L .
j:o:l F(j)
that explicit computations can be carried out. Product measures do not work, and
renewal measures are among the simplest measures on {O, I}ZI that need not be
product measures. Furthermore, the contact process is a nearest particle system
(see Chapter VII of IPS for more on this topic), and reversible nearest particle
systems have renewal measures as invariant measures.
We certainly do not expect such a special /1 to be invariant for the process.
But, we might try to find one that satisfies some of the equations: RHS of (3.3) =
O. For example, since we now have a one (discrete) parameter family of unknowns
U(k), k :::: I}, and therefore expect to be able to satisfy a one parameter family
of equations, we can try to choose the fen), n :::: I, so that the right side of (3.3)
is zero whenever A is an interval {I, ... ,n}, n :::: 1. The hope is that the resulting
measure /1 (if it exists) will tum out to make /1S(t) increase over time in some
sense, and therefore will make this have a nontrivial limit as t ---+ 00. This limit
would be the required nontrivial invariant measure. Proposition 3.44 below states
this more precisely.
Here are the equations one obtains by setting the right side of (3.3) equal to
zero when A is an interval of length n (after dividing by the mean of f):
n = 1: 1 = F(2) + F(3) + F(4) + F(5),
n = 2: F(2) = F(3) + F(4) + F(5),
n = 3: 2F(3) + F2(2) = 3F(4) + 3F(5),
and
L F(k)F(n -
n
(3.9) k + 1) = 4F(n + 1) + 2F(n + 2), n:::: 4.
k=l
To derive the first one, for example, take A = {OJ and use our shorthand for
cylinder probabilities to express the right side of (3.3) as
1 1
(3.10) F(1) = 1, F(2) = ~, F(3) = 4' F(4) + F(5) = -.
4
We embark now on a somewhat lengthy analysis of (3.9) and (3.10), and of
properties of the solution. This effort would certainly not be justified if it led only
to a solution of the easier problem discussed above, in which we replaced the
full collection RHS of (3.3) = 0 for all finite A by a one parameter subfamily of
equations. It will in fact be crucial to the solution of the real problem.
n=l
F(I)F(m)x l +m =4
l.m::o:l:l+m::o:5 n=5 X n=6
1 + 2x - Jp(x)
¢(x) = ,
x
where P is the polynomial
P(x) = 1 + 4x + 2x 2 - 5x 3 - 3
2x 4 - 2f3x 5 - ( 2f3 - 41) x 6 .
The radius of convergence of ¢ (x) is the the magnitude of the (complex) zero of
P of odd multiplicity that is closest to the origin, since J (z - a)n is analytic in
the whole complex plane if n is even, but has a singularity at z = a if n is odd.
Therefore, we need to show that there is a unique choice of f3 so that P has no
zeros of odd multiplicity in the unit disk of the complex plane.
Note that
3
P(l) =- - 4f3 and
4
so that P has a root of odd multiplicity in (-1, 1) if
3 9
f3 > - or f3 < - -
16 8'
Therefore, we may assume
(3.12)
P(u)
u-
= P ( -2- 1) .
Then
where the middle inequality follows from (3.12). Therefore, by Rouche's Theorem
(see Chapter 10 of Rudin (1966), for example),
have the same number of zeros in the unit disk. So, P(u) has exactly two zeros
in the unit disk, and hence P (x) has exactly two zeros in the disk If Ix - ! I ::: !.
these two zeros are simple, then F(n) is not bounded. Therefore, we can restrict
ourselves to the case that these two zeros agree, and hence are real.
So, we need to find -1 < x < 0 and f3 so that P(x) = 0 and P'(x) = O.
Eliminating f3 from these two equations gives
(3.13)
The left side of(3.13) is -20 at x = 0 and is +11 at x = -1, so there is a root
in (-1, 0). It is easy to check that this root is unique; Mathematica gives it as
xo = - .425465 . .. The corresponding value of f3 is .149772 ...
Fix these values of Xo and f3. Since P(x) has a double root at xo, it can be
factored as
where
ao = 5.524 ... , a] = 3.871 ... , a2 = 1.272 ... ,
a3 = .257 ... , a4 = .0495 ...
Since ao > a] + a2 + a3 + a4, the other four roots of P lie outside the unit disk.
Therefore, we see that for this choice of f3, F(n) decays exponentially rapidly. In
particular, it is bounded.
The Density
Proposition 3.11 is not entirely satisfactory, because we will need to know that the
bounded {F (n), n 2: I} whose existence is guaranteed by that result is decreasing
(so that fen) is nonnegative) and satisfies some other inequalities. In principle,
F(n) can be computed by expanding v' P(x) in a power series, but this approach
3. Models with Threshold = I 163
makes it difficult, if not impossible, to check any properties of F(n). So, we must
take a different tack.
The system of equations (3.9) and (3.10) can be rewritten in terms of the
density f:
(3.14) :t
k=i
f(k) = 1, f(1) =~, f(2) = ~,
and
2f(1)f(2) + 2f(4) + 2f(5) - 4f(3) = 0,
2f(1)f(3) + f2(2) + 2f(5) + 2f(6) - 5f(4) = 0,
(3.15) n-i
L f(k)f(n - k) + 2f(n + 1) + 2f(n + 2) - 6f(n) = 0, n 2: 5.
k=i
as we did in (3.3). An important feature of(3.15) that is not shared by the equations
(3.9) for F is that there is only one negative term on the left. This property is used
in a crucial way in the proof of Proposition 3.17 below, and leads to the following
motivating remarks.
A probabilist looking at equations (3.15) should be struck by their similarity to
the equations that define a harmonic function for a continuous time Markov chain
on {l, 2,3, ... }. If the transition rates for such a chain are given by q(n, m), then
f is harmonic if it satisfies
L q(n, m)f(m) - fen) L q(n, m) = 0, m 2: 1.
m:mi=n m:mi=n
Equation (3.15) for n 2: 5, for example, almost has this form for a chain that leaves
n at rate 6, going to n + 1 or n + 2 at rate 2 each, and to sites to the left of n at
a total rate of 2. Of course, (3.15) is different in that the terms that correspond to
moving to the left are quadratic in f rather than linear. Nevertheless, this analogy
is useful in trying so solve (3.15). In particular, the form of (3.15) suggests that
we define a family of evolutions Ut (n), n 2: I} by setting
1
ft(1) == 2' ft(2) == 4'
d
dt ft(3) = 2ft (1)ft (2) + 2ft (4) + 2ft(5) - 4!t(3),
d
(3.16) - ft(4) = 2ft (1)ft (3) + ft2(2) + 2ft(5) + 2ft(6) - 5ft(4),
dt
d
L ft (k)ft (n -
n-i
- ft(n) = k) + 2ft(n + 1)
dt k=i
+ 2ft(n + 2) - 6ft(n), n 2: 5.
164 Part II. Voter Models
Proposition 3.17. Let It(n) be defined by (3.16) with initial condition lo(n) = 0
for n ::: 3. Then
(a) !ten) is nonnegative and non decreasing in t for all n ::: 1,
(b) I(n) = lim t - Hx) !ten) < 00 for each n,
(c) {f (n), n ::: I} is the unique positive solution of the system (3.14), (3.15),
and
(d) L:l nl(n) < 3.
Proof For part (a), note that the only negative sign on the right of (3.16) is on
the term whose derivative appears on the left side. Therefore, if at some time t
and for some n, It(n) = 0, while It(k) ::: 0 for all k -=1= n, then the derivative of
It (n) is nonnegative, so It (n) is forced up by its differential equation. It follows
that It (n) remains nonnegative for all n at later times if this is the case at t = O.
The proof of monotonicity is similar. Differentiate the equations in (3.16).
Again, all the terms on the right of the differentiated equations have positive
signs, except for the last one. Therefore the derivatives
(3.18)
f/k+l)(n) = fo(n)
t e- (t-s) [n-l
+ 10 6 f; f}k) (j)f}k) (n - j)
for n :::: 5, with similar equations for small n. This makes it clear that the successive
approximations are always nonnegative if the zeroth one is. The same argument
proves the monotonicity statement - the only difference is that the integrating
factor technique is applied to the differentiated versions of (3.16), thus yielding
successive approximations for the derivatives (3.18).
The previous argument is rather soft, and applies quite generally. If we were
considering the analogues of equations (3.14), (3.15) for a A below the critical
value of the threshold contact process, instead of A = 1, then nothing would
change in part (a). The problem would be that ft(n) might blow up as t -+ 00, or
if not, the limit would not satisfy (3.14), (3.15). The real work comes in the next
part of the proof.
Turning to part (b), note that the existence of the limit is immediate from the
monotonicity statement in part (a). The real issue is the finiteness of the limit.
Introduce the generating functions
00
1jJ(t,x) = Lfr(n)x n .
n=l
In the following computations, we will leave the initial conditions general at first,
since we will need to take different ones in the next subsection. Multiplying the
nth equation in (3.16) by xn and summing yields
(3.20)
d 2 I
dta(t) = [I - a(t)] + 4' - 2ft(3) - ft(4),
d
(3.21 ) dt bet) = 2[1 - a(t)][3 - b(t)], and
d 2 3
d/(t) = 2[3 - b(t)] - 2[1 - a(t)] [e(t) + 8] - 2 + 8ft(3) + 8ft(4).
These hold as long as aCt) remains finite. The fact that aCt) appears squared on
the right of the first equation means that in principle, aCt) could blow up in finite
time. We will see shortly that it remains finite for all t. The fact that bet) appears
only to the first power in the second equation, and e(t) appears only to the first
power in the third equation means that bet) and e(t) will remain finite as long as
aCt) does.
By part (a), ft(n) is nondecreasing in t for each n. It follows that aCt), bet),
and e(t) are also nondecreasing, and hence each of the derivatives in (3.21) is
nonnegative. We will need one other inequality. By the first and fourth expressions
in (3.16) and the fact that ft (4) is nondecreasing,
so that ft(2) + ft(3) + ft(4) + ft(5) + ft(6) s aCt) -~. Using this in (3.22) gives
they must both change sign at the same time. By (3.24), they cannot be zero
simultaneously. Therefore, they never change sign. Since a(O) = ~ and b(O) = 1,
it follows that
(3.25) aCt) < 1 and bet) < 3 for all t ~ O.
In particular, ft (n) is bounded in t for each n, so that
fen) = lim ft(n) < 00.
t--> 00
n=l
Table I
f(n) F(n) u(n-I)
n fen) F(n) u(n) f(n+l) F(n+l) ---,;(ri)
It would be enough to prove that the evolution in (3.16) with the initial con-
ditions used in Proposition 3.17 has the property that
for all t :::: 0, since then we could simply pass to the limit in these inequalities.
Note that (3.27) is true for t = 0, since the right side is zero for k :::: 3. However,
the asymptotics
as t t 0, which are easy to read off from (3.16), make it clear that (3.27) fails for
small t, at least for k = 4. In fact, it fails for all even k. So, we will have to argue
differently. The first step is to prove the following weaker statement.
Lemma 3.28.
f(5)]n-4
fen) :::: f(4) [ f(4) ,
Proof The idea of the proof is to follow the proof of Proposition 3.17 with a
different initial condition for the evolution (3.16). So, write the initial condition
as
fo(3) = a, fo(4) = b, fo(n) = ca n - 5 for n :::: 5,
1
4 + 2b + 2e = 4a,
1
a + 16 + 2e + 2m = 5b,
(3.29)
1
b + "l a + 2ea + 2ea 2 = 6e
1
e + "lb + a 2 + 2m2 + 2ea 3 = 6ea.
Solve the first three equations in (3.29) for a, b, e in tenns of a, and substitute
into the fourth equation. The result is
This polynomial has a root a = .585 ... , and then the values of a, b, e become
a = .0972 ... , b = .0464 ... , e = .0229 ...
Comparing with the corresponding entries in Table 1, we see that these are slightly
smaller than f(3), f(4), f(5). This is encouraging, since we hope to show that
with this initial condition, ft(n) t fen), n ~ 3. So, we will take these values for
a, b, e, a from now on.
The right sides of the other expressions in (3.16) are respectively
1
(n=7) m + "le + 2ab + 2m 3 + 2ea 4 - 6ea 2 ,
1
(n=8) ea 2 + "lea + 2ae + b 2 + 2ea 4 + 2ea 5 - 6ea 3 ,
1 n 7
ea n - 6 + _ea - + 2aea n - 8 + 2bea n - 9 + (n _ 9)e 2 a n - 10
2
So, we have proved the statement of the lemma for any n for which
ea n- S > f(5)[f(5)]n-S
- f(4)
This is clearly true for large n, since f(5)lf(4) = .52 ... < .58 ... = a, by Table
1. In fact, it is true for n ::: 7, since
2 f3(5)
ea = .00785 ... > .00708 ... = f2(4).
But the statement of the lemma is obvious for n = 4 and n = 5, and follows from
Table I for n = 6, so the proof of the lemma is complete.
Proof of Proposition 3.26. Any proof of this result that uses the evolution (3.16)
runs into difficulties caused by the fact that the logconvexity actually fails for
small values of n. We will get around this problem by modifying the evolution
in such a way that the first few values do not change at all. So, take gt(n) to be
defined by
gt(n) == fen) for n ::: 5, go(n) = f(5)a n- S,
where a = f(5)lf(4) = .52 ... , and
d
L gt(k)gt(n -
n-l
- gt(n) = k) + 2gt (n + 1) + 2gt (n + 2) - 6g t (n), n::: 6.
dt k=l
Note that
(3.31) gt(2) > gt(3) > ... > gt(n) > ...
gt(3) - gt(4) - - gt(n + 1) -
for t = 0 by Table 1. Our plan is to prove that (3.31) holds for t > 0 as well, and
that
3. Models with Threshold = I 171
argue as follows. The inequalities (3.33) are clearly true at t = 0 for all n, and for
all t 2: 0 if n ::S 5 by Proposition 3.17. For n 2: 6, the equations of evolution are
the same for the two systems, so (3.33) holds by the maximum principle. Now,
by Lemma 3.28,
holds at t = O. Since f satisfies (3.15), the maximum principle shows that (3.34)
holds for all t 2: O. Combining (3.33) and (3.34) with Proposition 3.17 gives
(3.32).
Finally, we tum to the proof of (3.31), which can be restated as
(3.35)
This is automatically true for k = 3,4, and for all k if t = O. We will again use
the maximum principle to show that (3.35) is true for t > O. To do so, we need
to check the following statement: If (3.35) is true for a given t and all k 2: 3, and
holds with equality at that time for a fixed k, then
d
(3.36) dt [gt(k - l)gt(k + 1) - g;(k)] 2: 0
(3.37) f(4) [f(5) + ~ f(4) + f2(3) + 2gt (7) + 2gt (8) - 6gt(6) l
Using
to get a lower bound for (3.38). First replace gt(9) in (3.38) by g;(8)/gt(7). This
results in a quadratic function of gt (8). This quadratic is increasing in gt (8) for
I:
j=]
Since
1(4) < 1(1) 1(2) 1(3)
1(5) - 1(2)' 1(3)' 1(4)
by Table 1, the above expression is nonnegative under our assumption
gt(4) gt(5)
- - > - - > ... >
gt(k - 1)
= gt(k)
>
gt(k + 1) > ....
(3.42)
gt(5) - gt(6) - gt(k) gt(k + 1) - gt(k + 2) -
The only term for which this is not completely clear is the last one. But by (3.42)
and the inequality between geometric and arithmetic means,
As pointed out earlier, 1 and u are not logconvex, so we cannot use Theorem
B39 directly to deduce inequalities for u from Proposition 3.26. However, Lemma
B40 can be used to prove the slightly weaker fact that is true in our case.
(ii) u(O) - u(l) :::: u(2) - u(3) :::: u(3) - u(4) :::: '" :::: u(n) - u(n + 1) :::: ... ,
1 1
(iii) u(n) - u(n + 1) :::: 3[u(n - 1) - u(n)] + 6[u(n - 2) - u(n -1)], n:::: 3,
and
1
(iv) u(n) - u(n + 1) :::: 2" [u(n - 1) - u(n)], n:::: 2.
for a fixed n :::: 5. (Note that the values in Table 1 show that this statement is true
for n = 5.) We need to prove that
which is just the statement that the left side of (B41) is nonnegative. Consider the
terms on the right side of (B41). All the determinants involving 1 are nonnegative
by Proposition 3.26, together with the fact that
174 Part II. Voter Models
f(1) f(4)
-->--
f(2) - f(5)'
which can be seen from Table 1. All the determinants involving u on the right
side of (B41) are nonnegative by the induction hypothesis, except for the one
corresponding to j = n - 1. Take 2 .::: j .::: n - I, and write
Therefore, it is enough to prove that the sum above is nonnegative. But this sum
is
u(n + l)[u(l) + u(2) + u(3)] - u(n)[u(2) + u(3) + u(4)].
By the values in Table 1 and the induction hypothesis,
as required.
Part (ii) follows from part (i), the arithmetic-geometric mean inequality and
Table 1. For part (iii), the result holds for n ::: 8 by Theorem B45, since
F(lO) 5
-->-
F(9) - 6
by Table 1. The other cases follow directly from Table 1. Part (iv) is a consequence
of parts (ii) and (iii) and Table 1.
Proposition 3.44. Suppose f-L is a probability measure on {O, l} Z d and Set) is the
semigroup for the threshold contact process on Zd with T = 1 and A ::: O. If
f-LS(t){1J: 1) = 0 on A}
is a nonincreasing function of t for all finite A C Zd. In particular,
d
-f-LS(t){1J:
dt
1) = 0 on A} = Lc pA(A/ = C)-f-LS(s){1)
d
ds
: 1) = 0 on C} I .
s=o
Therefore (3.45) implies (3.48). It is perhaps worth emphasizing that it is not the
case that (3.45) for a particular A implies (3.48) for that A.
Theorem 3.49. All threshold voter models (with T = 1) coexist except for the one
with d = 1, .IV = {-I, 0, I}.
Proof By Propositions 2.11, 2.15, 3.7 and 3.44, it suffices to consider the case
d = 1, JV = {-2, -1,0, 1, 2}, and to find a measure f-L j 80 satisfying (3.45)
for the corresponding threshold contact process with A = 1. In choosing such a
measure, it seems reasonable to look for one that satisfies (3.45) with equality for
176 Part II. Voter Models
as many A's as possible - say for all connected sets. Not coincidentally, we have
already found a candidate: the stationary renewal measure p, corresponding to the
density f that satisfies (3.14) and (3.15). These equations are exactly the ones that
say that (3.45) holds with equality for all intervals A.
By (3.3), we must show that
L p, {r! : IJ == 0 on A, IJ ¢. 0 on k + JV}
kEA
(3.50)
- LP,{IJ: IJ == 0 on A\{k}, IJ(k) = I} :::: 0
kEA
for all finite A C ZI. In order to take advantage of the renewal property, the left
side of (3.50) is best written in terms of the following conditional probabilities:
It is not too hard to write the negative terms on the left of(3.50) in terms of LA and
R A . For the positive terms, the first step is to write the following decomposition
(which holds a.s.) according to the locations of the first 1's to the right and left
of a particular k E A:
To check it, use a decomposition based on the location of the first 1 to the left of
k as follows:
Next we will use these identities to rewrite the second term in (3.51). Using
(3.52) and (3.53),
LLA(k)RA(k)= L LA(J)f(k-j)f(l-k)RA(l)
kEA j<k<l
kEA:j.l~A
Note that the right side of (3.54) is the difference of two divergent series. The
interpretation is that all identical summands should be cancelled before the sum-
mations take place. After this cancellation, the remaining sums are convergent.
Perhaps we should pause a minute before plunging into the next set of compu-
tations to see what the objective is. Clearly for the proof to work, we must at some
point use the fact that f satisfies the convolution equation (3.15). The computation
in (3.54) is designed to introduce a convolution, so that we can use (3.15) at that
point. After doing so, there is simply some bookkeeping to do. Looking ahead to
(3.59), the reader will see an expression for (3.51) that has several virtues: (a) the
convolution equation defining f has been used, as it must be, and (b) the values of
f (n) do not appear explicitly. This latter fact is quite important, since there is no
explicit expression for f to be used. In carrying out the following computations,
it will be useful to have the explicit expressions for the first few f (.), s:
1 1 1
(3.55) f(l) = 2' f(2) = 4' f(4) = 2f3 - -.
4
Here f3 = F(4) as before.
The first term on the right of (3.54) can now be rewritten using (3.15) and
these values in the following way:
Now use (3.52) and (3.53) again to rewrite the first sum in (3.56) as
178 Part II. Voter Models
- LRA(l)[LA(l + 1) - ~LA(l)J
I¢A 2
(3.57)
- LLA(j)[RA(j -2) - ~RA(j -l)IU-l¢A} - ~RA(j)J
j¢A 2 4
Note that in each of the terms rewritten in this step, there is a choice between
using (3.52) and (3.53), depending on the order in which the sums on j and I
are taken in the first expression in (3.56). To maintain symmetry we have used
(3.52) on half of the terms of each type, and (3.53) on the other half. Note that the
i
factors of 4 and in (3.57) are f(l) and f(2) respectively. The terms in braces
in (3.57) are just the left sides of (3.52) and (3.53), after one or two terms from
the right sides have been moved to the left.
The second term on the right of (3.54) is easier. Using (3.52) and (3.53), it
becomes
- L LA(k)RA(k).
k¢A
Using (3.52) and (3.53) in the first four sums in (3.58), they become:
kEA.k-2¢A kEA,k-l¢A
+ L LA(k + I)RA(k + 1) + L LA(k + 2)RA(k + 2)
kEA,k+l¢A kEA,k+2¢A
1 1
2 L LA(k - 2)RA(k - 1) - 2" L LA(k + I)R A(k + 2).
kEA;k-l,k-2¢A kEA;k+l,k+2¢A
- L {LACk+I)RA(k)+LA(k)RACk)+LACk+I)RACk+l)}
k,k+lotA
1
+- L {2L A(k)R A(k+ 1) + 2LA(k+ I)RACk+2) + L A(k)R A(k+2)}
4 k.k+l,k+2otA
+ (2,B - ~) L LA(k)RA(k+4).
4 k,k+2.k+4otA
Note that all constraints on the indexes in (3.59) are of the form j ~ A. When
in the previous expressions one encounters a term with a constraint of the form
j E A, it is changed to the desired form by writing
lUEA) = 1- lUotA)'
The argument leading up to (3.59) is a bit tedious, but it involves nothing other
than careful bookkeeping.
As a check, consider (3.59) where A = {-n, -n + 1, ... , -1, O}, in which
case we know (3.59) should be zero. For simplicity, take n not to be too small, so
that all the summands in (3.59) can be unambiguously attached to the left or right
components of A c. Then the contributions to each of these components should be
zero. Consider those corresponding to the right component, which for each sum
correspond to k 2: 1. Since R( -1) = ~ and R(O) = R(I) = ... = 1, these
contributions become
Note that all the terms cancel in this sum, as they should.
- * - * - * - * -,
where * denotes a point in A and - denotes a point in the complement of A.
The fact that the indexes in some of the sums in (3.59) are not consecutive is
a consequence of the fact that the process is not nearest neighbor. So, this is a
difficulty that does not have to be faced for the nearest neighbor contact process
(threshold or basic).
To carry out the verification of the nonnegativity of (3.59) for sets with no
isolated points, and at the same time do the first part of the proof for general sets,
proceed as follows. Let [m, n) be a maximal interval in the complement of A,
i.e., m :s n, m - 1, n + I E A, m, ... , n f/. A. Consider the terms in (3.59) with
the property that all of the indexes appearing in the constraint in the sum fall in
[m, n). That is, these are the terms in (3.59) with some index in [m, n) with the
property that they would appear no matter what the status (in A or not) of points
outside [m - I, n + I) might be. In a natural way, we will associate half of these
with the left boundary (m - 1, m) of the interval, and the other half with the right
boundary (n, n + 1). This suggests the following definitions:
3. Models with Threshold = I 181
+ m:C:~_2LA(k)[ -RA(k)+~RA(k+l)+(~-f3)RA(k+2)]
+ (~ - f3 ) m:C:~-3 LA (k)RA (k + 3) + (f3 - t) m:C:~-4 LA (k)RA (k + 4)
and
Q(n, n + 1) = L
m:c:k:c:n
RA(k)[LA(k + 2) + LA(k + 1) - ~LA(k)]
L Q(m-l,m)+ L Q(n,n+l).
m:m-IEA,m\!A n:n\!A,n+IEA
If A contains isolated points, then the additional terms that appear in (3.59) are
those for which there are two constrained indexes in the sum that lie on either
side of an isolated point in A.
To check that each of the Q's is nonnegative, it is enough by symmetry to
consider Q(m - 1, m) when m - 1 E A, m rf. A. Rewrite it in the form
Comparing with the definition of Q(m - 1, m) above, we can solve for the coef-
ficients in (3.60) as follows:
5
c = RA(m - 2) + RA(m - 1) - 4RA(m)
(3.6la)
1 5
= RA\{m-l)(m - 2) + -RA(m - 1) - -RA(m)
2 4
if n = m;
182 Part II. Voter Models
(3.62a)
if m S k = n - 1; and
that is used in going from the first to the second expression for c is a consequence
of (3.53).
Clearly, to show that (3.60) is nonnegative, we will need to know that the L's
and R's satisfy some inequalities. What we know from Proposition 3.43 is that
the renewal sequence u satisfies some inequalities. Therefore, we need to relate
the L' sand R' s to u. Here are the relevant relations for an arbitrary finite set B:
To check the first of these, for example, note that both sides are equal to
The right side of the first line of (3.63) is a decomposition of the above event
according to the location of the leftmost 1 in B n (-00, k).
By Proposition 3.43 and Table 1, u(n) ..}, so it follows from (3.63) that
(3.64a)
and
(3.64b)
By (3.64a) and (3.60), Q(m - 1, m) will be nonnegative if c :::: 0 and Ck :::: 0 for
m ::: k ::: n - 1. To check the latter statement, use (3.62), (3.64b) and the fact that
k·
f3 > For example, for (3.62b) when m ::: k ::: n - 4, write
5
Ck = [RA(k -1) - RA(k)] + 2[R A(k) - RA(k+ 1)]
3
+ 4[RA (k + 1) - RA (k + 2)] + f3 [ RA (k + 2) - RA (k + 3) ]
+ (f3 - t) [RA (k + 3) - RA (k + 4)].
The verification that C :::: 0 uses the same argument, together with the obser-
vation that
RA(m - 1) ::: RA\{m-l}(m - 2),
which follows like (3.64b) did from u(n) ..} and (3.63). Thus we see that (3.59) is
nonnegative whenever A contains no isolated points.
Before continuing, we will pause to explain two aspects of this argument, one
of which is explicit above, while the other is implicit. In the harder arguments
to come, both will appear explicitly. First, (3.59) is a quadratic form in the LA's
and RA's. We have seen that monotonicity of these functions is used crucially in
the proof that (3.59) is nonnegative. But these functions are monotone only in
intervals in the complement of A. When their arguments cross a site in A, this
monotonicity is lost. Therefore, LA and RA are written in terms of LB and R B,
where B is obtained from A by deleting a few points in order to regain the lost
monotonicity at those critical locations.
The second aspect to comment on is the following. Suppose one is trying to
show that the quadratic form
i.j
where u; = Ui+1 - Ui and vj = Vj - Vj+l, and <'j is whatever the coefficient turns
out to be after the change of variables, then the quadratic form is nonnegative if
184 Part II. Voter Models
the new coefficients are nonnegative, and this is much easier to see. Rewriting
expressions in terms of differences of LA'S and RA'S will be used repeatedly in
what follows for that very simple reason.
(3.65)
(n-m)/2
- LA(n + I)RA(n + 1) + (1 - 2fJ) L LA(m + 2k - I)R A(m + 2k + 1)
What we have done here is to list all terms from (3.59) for which some of the
indexes in the constraint in the sum are in (m, n), together with as much of the
contribution from the Q's (recall they are positive, so we want to use as much as
possible of them) as we can without double counting. For example, if m - 2 ~ A,
3. Models with Threshold = 1 185
by the argument that led to (3.64). Note that these inequalities would not neces-
sarily be true for LA because of the presence of I and m in A. However, by the
above inequalities for Land R, the following quantities are nonnegative:
for m < j :s I,
LA(m + 3) = L(m + 3) - ~L(m + 2) - (~ - fJ )L(m)
for m < j < l. Using this inequality in evaluating Q(m, m + 1) in (3.60) (note
that the m appearing there is not the same as the m appearing here, but rather is
smaller by one), it follows that (3.66) is ::::
Already we can see in this case that not all the coefficients of ai bj are nonnegative,
which makes the necessary analysis quite delicate. If I = m + 3, then (3.67)
becomes
(3.70)
Recall that (3.67) contains only half of the tenus (3.65). The other half are
obtained by symmetry. By this we mean that we interchange the roles of LA (m - i)
and RA (m + i), or equivalently, of am-i and bm+i . If k = m - 2 and I = m + 2,
then (3.68), together with its counterpart obtained by symmetry, imply that (3.65)
can be written as
M= 0 2 2 0 0
3
2 0 0 0 0
0 0 0 0
Since all entries of M are nonnegative and the a's and b's are nonnegative, (3.65)
is nonnegative in this case. So, we see that even though some of the terms in (3.68)
are negative, they are compensated by positive terms in the symmetric expression.
If k = m - 3 and I = m + 3, then (3.69), together with its counterpart obtained
by symmetry, imply that (3.65) can be written as
i - ~,8 ~ - 2,8 9
"8
1
4
3
2
l4 + .!.,8 9 5
M= 2 "8 2 3
1
-1 +,8 4 3 2 0
3
0 2 0 0
Recalling that ,8 = .1497 ... , we see that the only negative entry in this matrix is
-1 +,8, which appears twice. To handle these, use (3.63) to show that
::: o.
The nonnegativity comes from part (ii) of Proposition 3.43. Similarly, bm +1 ::: bm .
Therefore, the ~ + !,8 entries can be used to compensate for the -! + ,8 entries.
It follows that (3.65) is nonnegative in this case as well.
Now take k = m - 2 and I = m + 3. Using (3.69) and the version of (3.68)
obtained by symmetry, we see that (3.65) can be written as
3. Models with Threshold = I 189
1+113
4 2
5
8" 2 2 0 0
M=
-! + 13 1
4 3 2 0 0
3
0 2" 0 0 0
(3.72)
(3.73)
which is nonnegative, since f(j) t by Proposition 3.26 and Table 1. Using (3.73),
we see that the left side of (3.72) is
8
1
[ -am-z 1
+ -am-I
4
- -am
2
1]
j=m+S
bj L
I
-i+.8 -1+.8 I
4 3
if k = m - 2 and I :::: m + 4,
-i+.8 -1+.8 ~ 3 2
+.038 +.051 +.251 +.825 -.350
+.238 +.251 +.451 1.125 +.250
+.762 +.825 1.125 2.500 3.000
-.475 -.350 +.250 3.000 2.000
if k = m - 3 and I :::: m + 4, and
-i + 13 -4 + 13 4
1
3 2
+.025 +.038 +.238 +.762 -.475
9
+LA(n)RA(n - 1) - -LA(n - I)R A(n - 1)
4
9
+LA(m + l)R A(m) - 4'L A(m + l)R A(m + 1)
to j E S\{m, n}.
It turns out that while (3.76) is nonnegative, (3.74) and (3.75) are not necessar-
ily nonnegative. They contain negative terms that apparently cannot be compen-
sated for by positive terms in the same collection. The solution to this difficulty is
to use positive terms in (3.74) to compensate for negative terms in (3.75) and vice
versa. But, since these terms and their compensators may be located quite far from
each other if n - m is large, it seems not to be possible to carry out the trade-off
directly. We must move the terms through the (3.76)'s, using the positivity of the
intervening (3.76)'s to prevent a loss of positivity that might occur otherwise.
With these comments as motivation, we will now write down the collections
that we will show are nonnegative. The idea is to add to (3.74-3.76) bilinear
expressions in the LA'S and RA 's that satisfy the natural symmetry conditions and
whose total contribution to (3.65) is zero. To simplify the notation, put
Note that the sum over all j E S of all the added terms is zero, by a telescoping
series type of argument, so that
Therefore, we need to show that each of the terms on the right is nonnegative if
the constants C I , C2 and C3 are chosen appropriately.
We will look at (3.74'), (3.75') and (3.76') separately. The first two are related
by a symmetry, so it is enough to consider one of them. We begin with (3.76') for
a fixed j E S\{m, n}. Write
RA(j + 3) =
bj+4, RA(j + 2) = bj +4 + bj+3,
1
RA (j + 1) = 2(bj +4 + bj+3) + bj+2,
1
RA (j) = 2(bj+4 + bj +3) + bj+2 + bj+l ,
RA(j - 1) = (~ + fJ) (bj+4 + bj +3) + ~(bj+2 + bj+l) + bj .
Substituting into (3.76') leads to
194 Part II. Voter Models
(3.77) (aj-4, aj-3, aj-2, aj_l, aj)M(bj +4, bj+3, bj+2, bj+l, bj ),
where M = Mo + elMI + e2M2 + e3M3 and the Mi's are the matrices
2f32 - if3 2f32 - ~f3 + ft -if3 + tz 2f3 - ft 2f3 - ~
3
Mo= i - 2f3 8
2R
}J
-..!.
16 2f3 - ft 3
8
3
2 2
I
2f3 - ~ 2f3 - ~ -4 2 2
I
i - 2f3 i - 2f3 2 0 -1
I
i - 2f3 i - 2f3 2 0 -1
I I
MI= 2 2 2 0
0 0 0 0
-1 -1 0 0 0
I I
i - 2f3 ~-f3 2 -2 -1
I
~-f3 2 0 0
I
M2 = 2 2 0 0
I
-2 0 0 0 0
-1 0 0 0 0
and I
0 2 0 -1 0
I
2 0 0
M3 = 0 0 0 0
-1 0 0 0 0
o 0 0 0 0
To check that (3.77) is nonnegative, we need to know how to choose the
constants el, e2 and e3, and to do that, we need to consider also the case (3.74').
Thus we defer the verification that (3.77) is nonnegative a bit. By analogy with
the earlier cases, in considering (3.74'), it is natural to let
3. Models with Threshold = I 195
and
LA(n - 3) = a n-4,
LA(n - 2) = an-4 + an-3,
1
LA(n - 1) = 2(an-4 + an-3) + an-2
1
LA (n) = 2(an-4 + an-3) + an-2 + an-I,
LA(i) = [1 - u(i - n + 2) - ~U(i - n)}an- 4+ an-3)
if n < i S I, and if 1 = n + 2,
LA(n + 3) = c: 1
f3 - ~~)(an-4 +an-3) + (f3 + ~)(an-2 +an-l)
+ 2(an +an+l) +an+2.
U sing the inequality LA (i + 1) - LA (i) :::: ai for n < i < 1 as before, and dropping
the terms involving ai for i > n (aU of which are nonnegative), the analogues of
(3.68), (3.69), and (3.70) are ::::
if 1= n + 3, and
if I ::: n + 4.
The terms that appear in (3.74) that do not appear in (3.66) (with m replaced
by n) are
~f3 - fz
2f3 - .l
16 2f3 - ~ - C I
I
~ +CI -4
3
~+f3 :2 2
o 2 2
If 1= n + 3, (3.74') is ::::
(3.79) (a n-4, an-3, an-2, an-I, an)N3(b n+3, bn+2, bn+l , bn),
5
1.4 + 1.f3
2 "8 2 2
-! + f3 I
4 3 2
0 0 -!C2 - C3 -C I - C2
!C2 + C3 !C2 + C3 0 -C I
+ CI +C2 C I +C2 CI 0
0 0 0 0
0 0 0 0
198 Part II. Voter Models
(3.80)
16
3
+ 2"1 fJ 14 + IfJ
2
5
"8 2 2
-~ + fJ -! + fJ 4
1
3 2
0 0 0 -!C2 - C3 -C 1 - C2
0 0 0 0 0
0 0 0 0 0
Next we need to decide how to choose values of C 1 , C2, C3 that make it
possible to show that (3.77)-(3.80) are all nonnegative. Looking first at (3.77),
note that aj-4 and bj+4 are values of Land R, while the other a's and b's are
differences of such values. This means that the latter ones may well be much
smaller in size than the former. In particular, it will be hard to make (3.77)
nonnegative unless the upper left entry of M is nonnegative, and the top row (or
equivalently, left column) of M has a nonnegative sum. In other words, we will
need
(3.8Ia)
and
(3.8Ib) 11
32
( 45)
2fJ 2 + 4fJ - - - C 1 2fJ + - - C2 fJ + - ( 13)
8
1 - O.
- 2 -C3 >
(3.81c)
So, a reasonable strategy is to choose C\, C2 , C3 so that the left sides of (3.81)
are all zero. Using the value of f3 from Proposition 3.11 gives (rounded to four
decimals):
C\ = -.1231, C2 = .2729, C3 = .0133.
Using these values leads to
o +,0693 +.0687 +.0873 -.2252
o 2.0000 2.0000
+.0164 +.1312 +.3620 -.2252
Most entries in the above matrices are nonnegative, but there are a few negative
ones that we must deal with. Start with M. The negative entries in the comers are
compensated by the positive entries in the first column and row respectively. To
see this, use (3.63) to write
(3.82)
aj= L [u(j-i)-u(j-i)]LA(i).
i::;j-4,iEA
Therefore, in order for the overall contribution of the first column of M to (3.77)
to be nonnegative, we need
for i ::: 1. Recalling that the sum of the coefficients above was taken to be zero (in
making (3.8Ib) an equality), (3.83) follows immediately from Proposition 3.43(ii)
for i ::: 2. Using the values in Table 1, we see that (3.83) is true for i = 1 as well.
The - .25 entries in M are compensated by the entries just below and to the
right respectively. In order for this to be true, we would need
But this follows from Proposition 3.43(iv). This (together with the analogues with
a's replaced by b's) completes the proof of the nonnegativity of (3.77).
The treatment of the negative entries in Nl, N2, N3 is similar. Since the sum
of the entries in the first row of N2 is negative, it might appear that there would
be a problem in that case. So, we will treat this case only, leaving the other entries
in the N's to the reader. We need to check that
1 1 1
S[u(i + 1) - u(i + 3)] + 4[u(i + 3) - u(i + 4)] - 2:[u(i + 4) - u(i + 5)] 2: 0
for i 2: O. For i = 0, this follows from the values in Table 1, while for i 2: 1, it
follows from Proposition 3.43(ii).
This completes the proof of the nonnegativity of (3.65) for all choices of
k < m < n < l. Together with the already proved nonnegativity of the Q's, this
shows that (3.59) is nonnegative for all finite A C Zl, and hence that (3.50) is
true. Therefore, the proof of Theorem 3.49 is complete.
Clustering in Two Dimensions. Recall from Theorem 1.3 that the two dimensional
linear voter model clusters, and in fact, that the critical dimension for clustering
is 2, in the sense that higher dimensional linear voter models coexist. Cox and
Griffeath (1986b) and Bramson, Cox and Griffeath (1986) quantify the way in
which this clustering occurs. Take the initial distribution to be the product measure
v p with density p: v p {1) : 1) (x) = I} = p for all x E Z2. The limiting behavior of
the voter model can be described in terms of the Fisher-Wright diffusion process
Y (t) on [0, 1], which is the process with generator
1
Qf(x) = /I
2:x(l - x)f (x).
Take the initial condition for this process to be Y(O) = p. Then for a E [0, 1], the
following limiting statements hold as t -+ 00:
(a) For a E [0, 1], {TJt(xt a / 2), x E Z2} converges in distribution to {~(x), x E
Z2}, where the limit is an exchangeable Bernoulli random field with
a random walk that starts at xy't, x =F 0, will not have hit 0 by time t with
large probability. If a = 0, Y is being viewed at time 00. Since Qf = 0 for
f(x) = x, yet) is a (bounded) martingale. Its limit exists a.s., and cannot be in
(0,1). Therefore, yet) -+ 1 with probability p and yet) -+ 0 with probability
I - p. So, the above result reduces in this case to a special case of Theorem 1.4.
(b) The block averages
Occupation Times. Consider the linear voter model IJt on Zd, and let Tt be the
occupation time of the origin up to time t:
Tt = 1t 1]s (O)ds.
Cox (1988) established a central limit for Tt when the initial distribution is one
of the nontrivial invariant measures if d 2: 3, or an appropriate deterministic
configuration if d = 2. This followed earlier work by Cox and Griffeath, in which
the initial distribution is a product measure.
Bramson, Cox and Griffeath (1988) proved the following large deviation results
for Tt when the initial distribution is the product measure vp: For any a E (p, 1)
there are positive constants C 1, C2 (depending on d and a) so that for large t,
Consensus Times for Finite Systems. Cox (1989) treats a topic somewhat analogous
to that discussed for the contact process in Section 3 of Part I. Consider the
nearest neighbor linear voter model on the box {I, ... , N}d, regarded as a torus by
identifying opposite sides. Use the initial distribution vp. This process is eventually
4. Notes and References 203
absorbed into one of the traps '1 == 0, '1 - 1. Let TN be this absorption time.
Then
-N2 '*TN
T if d = 1,
TN
~--
N210gN
'* T if d = 2,
-Nd '*
TN
T if d ::: 3,
Modified Linear Voter Models. Several variants of the linear voter model have been
studied. For example, Granovsky and Madras (1995) have considered the model
obtained by adding constants to the transition rates for 0 ~ 1 and 1 ~ O. Ferreira
(1990) analyzes a one dimensional voter model in a random environment.
Sudbury (1999) studies the process '1t on {O, l}ZI in which
if '1 (x) = 0,
if '1(x) = 1.
The nearest neighbor one dimensional voter model corresponds to do = d\ = 1.
He proves that if do = 1 or 2, d\ > do, and the initial configuration contains
infinitely many blocks of 1's of length at least d\, then '1t converges weakly to
the pointmass on all 1'So
Other Voter Models. Mountford (1992) considers a class of voter models that
includes linear, but not threshold models, and proves a weak form of clustering
for them in one dimension. He assumes that the process is finite range, that c(x, '1)
satisfies a mild positivity condition, and most importantly, that the generator Q
satisfies the following condition: if fn ('1) = Llxl::n '1 (x), then
The role of this last condition is to guarantee that fn ('1t) is almost a martingale.
(If Qf = 0, then f('1t) is a martingale.) The conclusion is that 80 and 8\ are the
only extremal invariant measures that are translation invariant. To check (4.1) for
finite range linear voter models, write
204 Part II. Voter Models
so that
IQln(TJ)1 :::: 2 I: Izlp(O, z) < 00.
To show that (4.1) is not satisfied for the threshold voter model with JV
{- T, ... , T} (in which case we know the process clusters by Theorem 2.6), note
that if TJ is a configuration in which intervals of zeros of length T alternate with
intervals of ones of length T + 1, then
Qln(TJ) = I: [1 - 2TJ(x)),
Ixl:sn
Muititype Voter Model with Mutation. This is a model in which there are infinitely
many potential types, rather than two. The types are indexed by the interval (0, 1),
so that a configuration TJ is a point in (0, l)zd. There are two kinds of transitions:
(a) (Dispersal) For nearest neighbor pairs x, y, site Y adopts the type of site x
1
at rate U'
(b) (Mutation) Each site Y adopts a new type chosen at random from (0, 1) at
rate a> 0.
°
Note that for each a E (0, 1), y (x) = 1(ry:ry(x)~cr) is an ordinary linear voter
model with additional spontaneous flips from to 1 at rate a(1 - a) and from 1
°
to at rate eta. This process has a unique invariant measure with density 1 - a.
Bramson, Cox and Durrett (1996, 1998) have used the multitype voter model
with mutation for small mutation rate et in two dimensions as a model to study
the abundance of species. They begin by observing that the process has a unique
stationary distribution, to which the distribution at time t converges as t --+ 00,
for any initial configuration. That should not be surprising, in view of the above
comment. The proof is a straightforward application of duality. Let ~ E (0, 1)Z2
have that stationary distribution.
Their first paper is devoted to the question of how the number of species in
a region depends on the size of the region. For r > 0, let N r •a be the number of
distinct types in the restriction of ~ to the square centered at the origin of side
length L', where L = 1/,Ja. They prove the following asymptotics for N r •a as
at 0:
(a) If r :::: 1, then
Nra 2
. --+ -
L2r-2(log L)2 7r
°: :
in probability.
(b) If r < 1, then
N r •a => Fr ,
where Fr is a distribution that is of order (1 - r) -1 as r t 1.
4. Notes and References 205
The second Bramson, Cox and Durrett paper obtains results on the relative
abundance of species for this model in a large box as ex t O.
Consider a sequence of threshold voter models TJ~ with initial distribution the
product measure with density 4,
and parameters A{; Tk so that
Tk
IA{] ~8.
Note that by Theorem 2.1, fixation occurs for large k provided that 8 > 4. Then
for each x E Z 1,
and
. 1
(4.3) lim P ( lim TJ~ (x) lim TJ~ (x + 1)) = 1
= t--+oo If 2" < 8 < 8c ·
k--+oo t--+oo
Durrett and Steif conjecture that a similar result holds for d > 1, but with 8c = ~.
(They prove (4.2) in all dimensions - the hard part is (4.3).)
Theorem 2.6 was proved in case T = 1 by Cox and Durrett (1991). The
general case was proved by Andjel, Liggett and Mountford (1992). In the latter
paper, the following results were also proved for the threshold voter model in one
dimension with JV. = {-T, ... , T}:
(a) If the initial distribution f-L is translation invariant, then
Tn 1
liminf-- > -
n--->oo 1A;;l 4'
then the threshold contact process with A = 1 does not have a nontrivial invariant
measure for large n. Conjecture 6.1 in Durrett (1995) is that ~ is sharp for Corollary
2.21 also, in the sense that if
1 Tn 1
(4.4) - < lim - - <-
4 n--->oo Iffn1 2'
then the threshold voter model clusters for large n. (Recall that by Theorem 2.1,
the process fixates for large n if the limit in (4.4) is> !.)
In Section 3, it is proved that with T = 1, the threshold voter model coexists
in all cases except d = 1, JV = {-1, 0, 1}. It would be interesting to know what
happens if
d=l, T=2, JV'={-n, ... ,n},
for example. In this case, we know from Theorem 2.1 that the process fixates if
n = 1, from Theorem 2.6 that the process clusters if n = 2, and from Theorem 2.17
that the process coexists if n is sufficiently large. Cox and Durrett (1991) proved
that it is enough that n ~ 47 for the process to coexist. They quote computer
simulations to guess that the process clusters if n = 3 and coexists if n ~ 4.
exact integer arithmetic with integers of nearly 2000 digits, so that the proof was
very computationally intensive. The proof of part (a) given here is entirely analytic,
and eliminates the need for a computer, except to do small calculations that could
be carried out on a calculator.
The improvement in part (b) is more significant. In working out the proof
for this presentation, the author discovered a serious error in the treatment of
some of the cases appearing on pages 777-787 of Liggett (1994b). (In the rest
of this paragraph, equation numbers refer to that paper.) In case 3, for example,
the bilinear expression (3.24) was supposed to be shown to be nonnegative for all
choices of LA and RA for which the corresponding Land R satisfy the inequalities
(3.25). In the proof, some reductions were made that led to an expression whose
nonnegativity was checked by verifying it at the extreme points of the convex set
detenuined by the inequalities. That is fine for a bilinear expression. However,
in the reductions, certain linear tenus were replaced by nonlinear tenus, thus
invalidating the proof. An example that satisfies (3.25) but for which (3.24) is
negative is
and correspondingly
(4.5)
where v is the limiting distribution of the process starting with the product measure
with density 1/2, which is nontrivial by Theorem 3.49, and
208 Part II. Voter Models
In particular, 1Jt => v for any initial configuration with infinitely many zeros and
infinitely many ones. A consequence of this is that v is the only nontrivial extremal
invariant measure for the process. Her proof of (4.5) uses duality (1.7) in a crucial
way.
Part III. Exclusion Processes
1. Preliminaries
A common feature of the contact processes and voter models that are treated in
Parts I and II is that only one coordinate of the configuration changes at each
time. One consequence of this property is that these processes tend to have only a
few invariant measures - typically there are one or two trivial ones, and then with
a substantial amount of work, one can often prove the existence of a nontrivial
invariant measure. The reason for this scarcity of invariant measures is that the
process has no conserved quantity, i.e., a quantity that does not change with time.
The existence of a conserved quantity tends to break up the state space {O, l}s
into classes determined by the value of this quantity, and then there tends to be
an invariant measure for each of its possible values. This corresponds roughly to
the difference between irreducible and reducible Markov chains.
One of the simplest models with a conserved quantity is the exclusion process
that we will study in Part III. This process is usually thought of as modelling
particle motion, and the conserved quantity is the number, or density, of particles.
There are other situations that can be modelled with the exclusion process, though.
One example is traffic flow, where particles are replaced by cars. In another (per-
haps the first studied), the particles are ribosomes (centers of protein production
in cells) that move along messenger RNA as they read genetic information.
(See (0.2) of Chapter VIII oflPS.) Note that (1.1) is automatically satisfied if p
is symmetric, or if S = Zd and p is translation invariant. This result provides the
formal construction and basic properties of the process.
To get an intuitive feeling about why a condition like (1.1) is needed in or-
der to have a well behaved process, consider what would happen if the initial
configuration TJ were given by
if x = x*
TJ(X)={~ if x -=1= x*,
If this sum were infinite, the only reasonable definition of the process (i.e., one
obtained by constructing the process on a large finite part of S and then pass-
ing to a limit) would have TJo+ == 1, so that the process would not even have
right continuous paths. Assumption (1.1) is just a uniform version of the above
condition.
Invariant Measures
What are the invariant measures for the exclusion process? This question has not
been answered completely, but a lot is known about it. The pointmasses on TJ == 1
and TJ == 0, are certainly invariant, since these two configurations are traps for the
process. It turns out that there are many invariant measures that are easy to write
down but do not concentrate on traps. Recall that .9 is defined to be the set of
l. Preliminaries 211
all invariant measures for the process. For a function a : S --+ [0, 1], let Va be the
product measure on {O, l}s with marginals
then Va E g where
n(x)
a(x) - XES
- 1 + n(x)' .
When (1.3) is satisfied, part (a) of the theorem produces a one parameter family
of invariant measures, indexed by particle density. If n satisfies (1.4), then so does
cn for any positive constant c, so part (b) generates a one parameter family of
invariant measures as well, though it is not so clear in this case what the parameter
represents.
(1.7)
where c is a constant. If p = !,
then this n is constant, so the invariant measures
produced by part (b) of the theorem are the same as those produced by part (a). If
p > !,
though, n grows exponentially rapidly at +00 and decays exponentially
rapidly at -00, so the corresponding Va concentrates on configurations satisfying
There are countably many such configurations, and they form the disjoint union
of
Xn = {'7: I>(x) = L (1- '7(x)] < oo}
x<n x2':n
for integers -00 < n < 00. When restricted to X n , '7t is an irreducible countable
state Markov chain, and is positive recurrent since it has a stationary distribution
given by the conditional measure
if x < n.
Symmetric Systems
Much of Part III deals with asymmetric systems - they tum out to be significantly
more interesting than the symmetric ones, which are defined by
Already we got a hint of this difference in the discussion of Example 1.5, where we
saw that there are more invariant measures for the process with p =I=- ~ than for the
one with p = ~. However, it is helpful for comparison purposes to review some
of the known results for symmetric systems. So, assume in this subsection that
(1.8) holds. In this case, the process satisfies the following self-duality property -
see Theorem 1.1 of Chapter VIII of IPS:
Here At is the same exclusion process, but thought of as a process on the collection
of finite subsets of S, with the identification A = {x : '7(x) = I}. Property (1.9) is
the main reason for the difference between the symmetric and asymmetric theories.
One consequence of (1.9) is that the k site marginal probabilities
1. Preliminaries 213
of the process at time t depend on the initial distribution only through its k site
marginals. This is because the cardinality of At does not change with time. This
property does not hold for asymmetric systems - in general even P (rJt (x) = 1)
depends on the full structure of the initial distribution.
The fact that the dependence on the initial distribution is relatively simple
makes it possible to determine all the extremal invariant measures for symmetric
systems. To describe them, let
be the harmonic functions for p(., .) taking values between zero and one. The
following result is proved in Section 1 of Chapter VIII of IPS. The fact that the
/-La defined below is invariant is a consequence of Theorem B7(e).
Theorem 1.10. Suppose the Markov chain with transition probabilities p(x, y) is
irreducible.
(a) For every ex E .~,
/-La = t--+oo
lim vaS(t)
Corollary 1.11. Assuming irreducibility, if either (a) the Markov chain with tran-
sition probabilities p(x, y) is recurrent, or (b) S = Zd and p(x, y) = p(O, Y - x),
then
g,; = {v p : p = constant E [0, In.
In the following example, .~ is very large, so the exclusion process has many
extremal invariant measures by Theorem 1.10.
Example 1.12. Consider the simple random walk Yn on the tree Td in which each
vertex has d + I neighbors; Yn moves to each of its neighbors with probability
d~ 1 . Take d 2: 2, so that Yn is transient. Fix a pair of neighbors L , x+ and write
Td = S_ U S+, where S_ consists of all vertices that are closer to x_ than to
x+, and S+ consists of all vertices that are closer to x+ than to x_. Since Yn is
214 Part III. Exclusion Processes
transient, it visits x_ only finitely many times, and hence Yn is either eventually
in S_ or eventually in S+. Define
This function is harmonic by the Markov property, and can be computed explicitly
as
(d + l)d1x-x-1
a(x) ={ 1
1------
(d + l)d 1x - x+ 1
By Theorem 1.10, the corresponding exclusion process has an extremal invariant
measure with marginal probabilities given by this function a. The measure de-
pends on the choice of x_, x+, so we see that the exclusion process has many
inhomogeneous extremal invariant measures in this case.
We conclude our discussion of the symmetric case by stating a convergence
theorem that is proved in Section 1 of Chapter VIII of IPS. For its statement, let
Pt(x, y) be the transition probabilities for the continuous time Markov chain with
unit exponential holding times and transition probabilities p(., .).
and
It is important to emphasize that the proofs of all these results depend heavily
on the duality property (1.9). This is not simply a matter of technique. In the
absence of symmetry, the results, and not just the proofs, are different. Example
1.5 showed that Corollary 1.11 is not correct without the symmetry assumption.
So far, we have discussed primarily ergodic properties of symmetric systems.
Many other results are known. Some of them are described in Section 5. At this
point, we mention only one that plays an important role in the proofs of Theorems
1.10 and 1.13. It concerns inequalities that can be viewed (using duality) as com-
parisons between the symmetric exclusion process and systems of independent
Markov chains. Here is one, which is a special case of Proposition 1.7 of Chapter
VIII ofIPS: For any A c S,
(1.14) pry(1Jt == 1 on A) S n
XEA
pry (1Jt (x) = 1).
l. Preliminaries 215
Proposition 1.15. If f.-tl and f.-tz are invariant for the exclusion process, then there
is an invariant measure v for the coupled process (I1t, l;t) that has marginals f.-t 1 and
f.-tz respectively. If f.-tl and f.-tz are both extremal, then v can be taken to be extremal
as well. If f.-t 1 :::: f.-tz, then v can be taken to concentrate on {( 11, l;) : 11 :::: l;}.
proofs are based on coupling. Example 1.5 shows that the mean zero assumption
in part (b) is needed.
To see how coupling is used in this context, we outline the proof of part (a)
of Theorem 1.16. Consider the coupled process (ryt, ~t). Suppose that the initial
distribution v is shift invariant, and let Vt be the distribution at time t. Then shift
invariance implies that
-[p(x, y) + p(y, x)]v{ (ry, l;) : ry(x) = ~(y) = 1, ry(y) = ~(x) = OJ.
Therefore
v{ (ry, l;) : ry(x) = ~(y) = 1, ry(y) = ~(x) = O} = 0
whenever p(x, y) + p(y, x) > O. Using the irreducibility of the symmetrized
random walk, it follows that v puts no mass on pairs of configurations that contain
discrepancies of opposite type. In other words,
Now, by Proposition 1.15, given any pair of invariant measures for ryr. there
is an invariant measure for (ryt, ~t) that has those two measures as marginals. In
the construction used in the proof of that result, shift invariance is preserved, so
that if the original marginal measures are shift invariant, the invariant measure for
the coupled process will also be shift invariant. Applying this to the pair vp , J-L,
where J-L E (9' n y:t, it follows that either J-L ~ vp or J-L :::: vp for each p. Since
this is true for all p, it follows that J-L = vp for some p.
Even though the full class of invariant measures has not been determined
for asymmetric translation invariant systems, it is still possible to show that the
product measures vp are extremal.
Proof Let Q be the generator for the exclusion process with transition probabilities
p(x, y) from x to y, and Q* be the generator for the process with transition
1. Preliminaries 217
for any cylinder functions f, g. To see this, fix x and y. Make the change of
variables TJ --* TJx,y in the integral below and use the fact that vp is exchangeable:
This identity makes the terms involving TJx,y cancel in the next computation. Let-
ting T C Zd be finite so that f and g depend only on the coordinates in T, we
see that
+ L
xET,y1J
p(x,y)[p-TJ(x)]+ L
x11,YET
P(X,Y)[TJ(Y)-P]]dVp.
and
Identity (1.18) extends to any g in the domain of Q and any f in the domain
of Q*, So, we may replace g by Set - s)g and f by S*(s)f, where S(t) and S*(t)
are the semi groups corresponding to Q and Q* respectively. Therefore,
Since these semigroups are contractions and the space of continuous functions is
dense in L 2 (v p ), (1.19) extends to all f, g E L 2 (v p ).
The idea of the proof is now the following: Suppose
1 1
(1.20) vp = 2ILI + 21L2'
where ILl, 1L2 E .9'. We need to show that ILl = 1L2 = vp. To do this, it is enough
to show that IL I and 1L2 are invariant for the process with generator Q + Q*, since
this is a symmetric exclusion process, and we know by Corollary 1.11 that vp is
extremal invariant for symmetric translation invariant exclusion processes.
By (1.20), ILl is absolutely continuous with respect to v p , so there is a measur-
able function h so that ILl = hvp. In fact, it follows from (1.20) that h is bounded:
o ::: h ::: 2. Since ILl is invariant with respect to the process with generator Q,
(h, Set)!} = f hS(t)fdvp = f S(t)fdILI = f fdlLI = f hfdvp = (h,!)
for any f E L 2 (v p ). In particular, setting f = h and using the contraction property
of Set), we get
over a second class particle. This rule has no effect on whether or not a given
site is occupied at a given time. The advantage, though, is that viewed by itself,
the collection of first class particles is Markovian, and has the same law as the
exclusion process. The collection of second class particles is clearly not Markovian.
However, the collection of first and second class particles is Markovian, and again
evolves like an exclusion process.
As mentioned earlier, this is just a slightly different way of thinking about
the coupling described above. To see this, let (17(, ~() be the coupled process, and
assume that 17( :s ~( at t = 0, and hence for all t. Think of the sites at which
17( (x) = ~(x) = 1 as being occupied by first class particles, and the set of sites at
which 17(X) = 0, ~t(x) = 1 as being occupied by second class particles. Then the
joint evolution of first and second class particles is exactly that described above.
To see this, note that the coupled process makes the transition
~:
17: o o
x y x y
at rate p(x, y), and when viewed in terms of first class and second class particles,
this transition becomes the exchange of positions when the first class particle at x
attempts to move to y, which is occupied by a second class particle.
Clearly, we can also consider third class particles, fourth class particles, etc. In
each case, mth class particles have priority over nth class particles if m < n. The
joint evolution of particles of different classes can again be realized by coupling
several copies of the exclusion process using the graphical representation. If 17: :s
17; :s ... :s 17~ coordinatewise, then we regard sites x for which 17: (x) = 1 as
locations of first class particles, sites for which 17; (x) = 1, 17: (x) = 0 as locations
of second class particles, etc. It follows that for any j, the collection of all particles
with class :s j is a version of the exclusion process.
~( =* N(O, (fl- P)
t4 V~ P
as t --+ 00, where N(O, a 2 ) denotes the normal distribution with mean zero and
variance a 2 .
220 Part III. Exclusion Processes
if x < 0,
(2.1)
if x :::: o.
Schematically, we are in the following context:
q p q p
... ~
q<p ... ~
-3 -2 -1 o +1 +2 +3
p
Figure 4
!
The reason we exclude the case p = from consideration is that the situation is
rather simple and uninteresting there. Theorem 1.13 implies in this case that
where as usual, Vy denotes the homogeneous product measure with density y. The
limiting behavior is more complex if p > !.
Here is the analogue of the above
limiting statement if p > !:
if A :::: ! and p :s !,
if p :::: ! and A + p > 1,
if A :s ! and A + p < 1,
if 0 < A < p and A + p = 1.
The first three cases correspond to the regions labelled I, II and III respectively in
the figure below, while the fourth corresponds to the line of slope -1.
II
III
o
o
Figure 5
222 Part III. Exclusion Processes
The most interesting case is the last one, in which the limit is a mixture of
product measures, rather than a single product measure. Note also that, unlike the
symmetric case p = !, the limit is not continuous in (A, p) at the line A + p = 1.
Our major objective in this section is to explain the above result.
The reason for considering this particular initial distribution is twofold. First,
it is about the simplest initial distribution for which one cannot easily guess what
the limiting distribution should be. The second reason is that the answer connects
up with important issues in nonlinear partial differential equations, such as shock
propagation. Here is the basic question we would like to answer: If x(t) is a
reasonable function of t, what is the approximate distribution of v)",pS(t), viewed
from position x(t)?
Heuristics
We begin with an informal calculation. Let ILt be the distribution of the process
at time t, and set u (x, t) = ILt {17 : 17 (x) = I}. We will often use the following
shorthand for cylinder probabilities:
The easiest way to work out equations of this type is to consider separately the
positive and negative contributions to the derivative. For the positive ones, ask
what situations can lead to transitions from 17(X) = 0 to 17(x) = I (i.e., transitions
that increase the probability being differentiated), and write one term for each such
situation. The term to be written is simply the rate of the transition multiplied by
the probability that the situation occurs. The negative terms are similar, but the
transitions to be considered are now from 17 (x) = I to 17 (x) = o.
We see already in (2.2) the main reason that asymmetric systems are harder to
analyze than symmetric systems: The derivative of a cylinder probability involving
one site contains terms that involve two sites. If p = !,
then some cancellation
occurs that permits one to write the right side of (2.2) as
1 1
2'u(x - 1, t) + 2'u(x + 1, t) - u(x, t).
This can also be seen as a special case of duality (1.9). In this symmetric case,
(2.2) becomes the discrete heat equation - a discrete version of
au 1 a2 u
at 2 ax 2 '
If p > !, on the other hand, this cancellation does not occur, and it is not possible
to write the right side of (2.2) in terms of the function u.
2. Asymmetric Processes on the Integers 223
Unlike the heat equation that one gets in the symmetric case, (2.3) is nonlinear.
One big difference between these two partial differential equations is the following:
The heat equation is well known to be smoothing - the solution at time t is much
smoother in x than the initial condition. This is not the case for Burgers' equation.
Discontinuities - also known as shocks - can persist for all time, or develop later
even if they are not present initially.
In our case, by analogy with (2.1), the natural initial condition for (2.3) is
if x < 0,
(2.4) U(X,o)={~ if x ::: 0,
which is discontinuous if J... =1= p. The nature of the solution (here we mean the
so-called entropy weak solution - the entropy condition is supposed to pick out
the physically relevant solution when there is nonuniqueness) depends on whether
)... < P or )... > p. (If)... = p, the solution is clearly constant in space and time.)
To see how this works, let's try to find a solution of (2.3) with initial condition
(2.4) that is of the following form:
if x .:s cit
u(x. I) ~ { :(I)X +b(t) if Cit .:sx .:sC2t
if x ::: C2t,
where Cl < C2 and aCt), bet) are chosen so that u is continuous. By the continuity
requirement,
p-J... J...C2 - PCI
aCt) = , bet) = .
(C2 - Cl)t C2 - Cl
Substituting into (2.3) gives two linear equations in Cl, C2, whose solution is
All is well provided that this solution satisfies Cl < C2. This occurs if J... > p, but
not otherwise. In this case, the shock disappears immediately, and the solution is
224 Part III. Exclusion Processes
continuous (though not smooth) for t > O. If A < p, however, this procedure does
not produce a solution, and the entropy weak solution turns out to be
i.e., the shape of the solution does not change, but it moves at velocity v. Note
that this v is the average of c] and C2 in (2.5). In this case, the shock persists, and
moves linearly with speed v. For more on partial differential equations of type
(2.3), see Section 3.4 of Evans (1998).
Results that have been proved in case A > P are described in Section 5. This
case falls within the rubric of general hydrodynamic results that are not restricted
to nearest neighbor processes in one dimension. These more general results apply
only away from the shock in the corresponding partial differential equation. Our
interest here is precisely to determine what happens at the shock itself.
x -+ x+l at rate
{ : if I1teX
if I1teX
+ 1) =
+ 1) =
0,
1,
:
(2.7)
if I1t(X - 1) = 0,
x -+ x-I at rate
{ if I1t(X - 1) = l.
Note again the simplification that occurs if p = q = ~. In this case, Zt is a simple
random walk.
Let Tit be the process I1t, viewed from position Zt:
C lim P,{I1: 11
n---+-oo
= I on A +n} = AlAI and
(2.8)
C lim P,{I1: 11
n~+oo
= Ion A +n} = piAl
for every finite A C Zl. Here C lim means Cesaro limit. Recall that a sequence
Un converges to U in the Cesaro sense if
I N
lim -
N~oo N n=1
LU =U. n
(2.9)
But this implies that the process viewed from Zt satisfies this property also. To see
this, suppose that 11 is any random configuration, and X and Z are two randomly
chosen sites. Then
INI ~ P (11 = I on A + X + n) -
N I N
N ~ P (11 = I on A + Z + n)
I
I
:::; -E
I X+N
L I{~=I on A+k} - L
Z+N
I{~=I on A+k}
I :::; -EIX
2
- ZI·
N k=X+I k=Z+1 N
The key to defining the position XI, and to proving these facts, is to find a
closely related process for which an invariant measure '" v;.,p can be computed
more or less explicitly. To this end, consider the process (rl1, 11;, I1n in which the
particles in 11; are first class, the particles in 11; are second class, and the particles
in 11; are third class. Each site contains at most one particle overall, of course.
Defining 11;,3 = 11; + 11;, which amounts to not distinguishing between second and
third class particles, we see that (17;, 17;,3) can be viewed as a process of first and
second class particles.
Recall from our discussion in Section 1 that (11; , 11; + 11;.3) can be regarded as
the coupling of two exclusion processes, the second of which lies above the first.
2. Asymmetric Processes on the Integers 227
Therefore, by Theorem 1.2(a) and Proposition 1.15, together with its extension to
shift invariant situations mentioned in the outline of the proof of Theorem 1.16,
there exists an invariant measure v for the process (111, 11;·3) that is shift invariant,
and such that, with this distribution, 111 has distribution VA and 111,2,3 = 111 + 11;,3
has distribution vp. Note that V{(11, n:
~(x) = I} = p - A > O. Even though
the distributions of 11 and 11 + ~ under v are product measures, v itself is not in
general a product measure. In particular, there is no reason to expect 11 (x) and
~ (y) to be independent according to v for x =F y. In fact, the distribution of
{11(X), x < 0, 11(X) + ~(x), x:::: O} is not vA,p.
If the process (111, 11;, 11i) is started from a configuration in which 11~'\0) = 1,
let X t be the position at time t of the particle that began at the origin at time
0, using the (11 tl , 11;,3) interpretation. To be more specific, X t is defined so that it
does not move when there are interchanges of positions of second and third class
particles. Define (111,11;, 11i) as the process (111, 11;, 11i) viewed from X t :
The process (111,11;,3) is defined analogously. Let Set) and Set) be the semigroups
of (111,11;,3) and (111,11;,3) respectively. Write M for the set of shift invariant
probability measures p., on {(11, n:
11 + ~ E {O, l}zJ} such that p.,{(11, ~(O) = n:
I} > O. For p., E M, let
/I(.) = p.,(. I ~(O) = 1)
be the measure on {( 11, n:~ (0) = I} obtained by conditioning on the presence
of a second class particle at the origin.
When applied to shift invariant measures, there is a close relationship between
the action of Set) and the action of Set):
/IS(t) = p.,S(t).
(b) If v E M is invariant for the process (111, 11;,3), then v is invariant for the
process ( -I -23) .
11 t , 11t'
+
nx)=I,ry(y)={(y)=O
and
228 Part III. Exclusion Processes
where the subscript x, Y on 17 and ~ means that the x and y coordinates are
interchanged, and ry shifts a configuration y units, to bring the second class particle
back to the origin: r y17(u) = 17(U + y). The first two sums in the expression for Q
give the contributions from transitions that do not involve the second class particle
at the origin, while the last two sums are contributions from transitions involving
that particle.
We need to relate Q and Q, and this is done via the mapping T that is defined
by
T/(17, n = ~(0)/(17, n
In the following computation, separate the terms corresponding to x, y =1= 0, x =
O,y = 0:
Q[T /(17, n] = ~(O)
x,#O,ry(x)=I,ry(y)=O
+ ~(O) p(x, y)[J(17, ~x,y) - /(17, n]
x,y",O,~(x )=1, ry(y)=\(y)=O
+ 17(O)~(y) L p(O, y)/(170,y, ~o.y) - ~CO) L p(O, Y)/(17, n
y ry(y)=\(y)=O
- ~CO) L p(x, O)/C17, n + [1 - 17(0) - ~(O)] L p(x, O)/C17, ~x,o).
ry(x)=1 \(x)=1
Therefore,
+ q[ G 3 0 LI - G 3 + G 2 0 TI - G 2 ](1], n
Here (G 0 T)(1], n = G(T1], Tn represents the composition of G with the shift.
It follows that for any translation invariant measure p"
Next we need to obtain a similar relation for the two semigroups. To do this,
write
Now apply (2.11) to the function S(s)f and the measure p,S(t - s) (which is also
shift invariant) to conclude that the right side of (2.12) is zero. But by definition,
and
f fd[JIS(t)] = f S(t)fdJI =
p,{(1], n :1
~(O) = l}
f TS(t)fdp,.
230 Part III. Exclusion Processes
The denominators in the two expressions above are equal, because f1 is transla-
tion invariant, and second class particles are neither created nor destroyed by the
evolution, so this completes the first proof of part (a).
For the second (version of the) proof, note from the last display, that what we
f f
need to show is
S(t)Tfdf1 = TS(t)fdf1
for cylinder functions f. For any x so that 1)~.3(x) = 1, let X~ be the position
at time t of the particle that was originally at x. Then, breaking the left side up
according to the initial location of the 1)2.3 particle that is at the origin at time t,
we see that
The third equality above comes from the translation invariance of the process,
while the fifth comes from the translation invariance of the initial measure f1.
Remark. An analogue of Proposition 2.10 for the exclusion process itself (i.e.,
the process consisting only of first class particles) appears later as Proposition 4.3.
the particle at the origin is taken to be a second class particle with probability ,~c'
all particles to the right of the origin are second class particles, and all particles
to the left of the origin are third class particles.
The choice of these particular probabilities is motivated by the invariant mea-
sures constructed in Example 1.5. Note that if A. = 0, p = 1, then according to
the v; we have constructed, there are no first class particles, every site is occu-
pied by either a second or a third class particle, and the second class particles are
distributed according to the invariant measure described in Example 1.5.
Proposition 2.13. The measure v; is invariant for the process (11;,11;, 11i).
Proof Let ~/ be the generator for the process (11;,11;, 11i). By Theorem B7(b), we
need to show that for every cylinder function f of three variables, J n* fdv; = O.
We can write
-*
n -* -*
=n,+n 2,
where n; consists of all the terms in the sum corresponding to transitions that
change the value of (11;,11;,3), and n; is the rest of the summands, i.e., the ones
that involve an exchange between neighboring second and third class particles.
Recall that such exchanges do not affect XI> so that n; contains no translations.
Since v is invariant for (11:,11;·3), and the process with generator n; does not
change the labelling of second vs. third class particles, v; is invariant for this
n;
process, and hence J f dv; = O. It remains to show that v; is invariant for the
process with generator n;.In fact, it turns out to be reversible with respect to
this process. To see this, we may consider the configuration (11;,11;,3) to be fixed,
since the process with generator n; does not change it. Consider two adjacent
sites x, x + 1 such that
Let nand n+ 1 be the numbers associated with these two particles in the assignment
of second/third class labels. Then reversibility is simply the statement that after
conditioning on everything except the second/third class labels at those sites,
(2.14)
c(p/q)n I 1 c(p/q))n+'
p = q,
1+ c(p/q))n 1 + c(p/q))n+! 1 + c(p/q))n 1 + c(p/q))n+!
Remark. More generally, if f.1 is any shift invariant initial distribution for the
process (T}I, T};,3), we will define Ii; via the same procedure that was used to
232 Part III. Exclusion Processes
construct v;
from v. The corresponding result is that the distribution of the sec-
ond/third class labellings for the process (1/1, 1/;, 1/i) (relative to the particle at X t )
is stationary in time.
f1{(1/, l;) : {(k) = 0 for all 1 : : : k :::::: n} :::::: f1{ (1/, l;): I~ ~ 1/(k) - AI> to}
+ f1{ (1/, l;) : I~ ~ [TI(k) + {(k)] - pi> E}
= vA{1/: I~ ~1/(k) -AI> E}
+vp{1/: l~t1/(k)-pl >E}'
which decays exponentially rapidly to zero by the large deviations theorem for
independent Bernoulli random variables. (See Section l.9 of Durrett (1996), for
example.)
Theorem 2.16. Let (1/1,1/;,1/;) be the coupled process of first, second and
third class particles. Assume that (1/6, 1/~,3) has distribution Ii, where f1 has good
marginals. Let X t be the position of the particle in 1/;,3 that began at the origin. If
°
(both sums = if p = 1) then 11:,2 = 11: + 11; is a version of the exclusion process
such that the distribution of rx, 1It '" v)",p uniformly in t.
Proof Note first that changing the labelling of the second and third class particles
does not affect the process Xt. This is important, since we will consider different
labellings in the proof. Fix a c > 0, and let (t;/, t;/, t;?) be a version of the three
class process, but with initial distribution 7I~, and let t;/,2 = t;/ + t;/. Without loss
of generality, we can assume that (t;J, l;~,3) = (116, 11~,3), and then the coupling
maintains this relation at later times. By (2.17),
(2.18)
when t = 0. But the basic properties of the coupling imply that the probabilities
in (2.18) are increasing in t for each c. Therefore, the convergence in (2.18) is
uniform in t.
Now write
(2.19) P(rX,1I:,2 = 1 on A + n) ~P(rx,l;/,2 = 1 on A + n) + p(1I:. 2 i l;/,2).
°
The second term tends to as c -+ 00 uniformly in t by (2.18). So, we need to
show that the first term on the right side of (2.19) tends in the Cesaro sense to
AlAI as n -+ -00 and to piAl as n -+ +00 for each value of c > 0, uniformly in
t. This will give one inequality, and the other comes from using
where';l, ';2, ... are Bernoulli random variables, distributed according to the law of
the second class particles at time t for the two class process with initial distribution
/L. This is a consequence of Proposition 2.10.
We will now use Lemma 2.15 to show that the right side of (2.21) tends to
zero as n ---+ -00. Since Lemma 2.15 is unifonn in the measure /L with good
marginals, this conclusion will be unifonn in t. For n > 0, write
C- lim P(rx,s/
n--+-oo
= I on A + n) = AlAI
unifonnly in t. To do so, use Proposition 2.10 to write
t
I~ n=1 P(rx,s/ = 1 on A - n) - AlAI I ::::
p
~ AE ~ It 1{~=1
n=1
on A-n} - NAIAII,
where'; is distributed like the first class particles in the two class process at time
t with initial distribution /L. But this distribution is just VA, so the right side above
tends to zero by the law of large numbers.
Remark. If /L in Theorem 2.16 is taken to be the product measure with each site
occupied by a first class particle with probability A and a second class particle
with probability p - A, and the two sums in (2.17) are taken to be zero, then 111. 2
has initial distribution vA•P on ZI\{O}.
positions. Zt is the location of an rd,3 particle that moves with a priority inter-
mediate between those of second and third class particles - it has priority over
third class particles, while second class particles have priority over it. The copy of
the exclusion process that we focus on is YJ;,2 = yJ; + YJ;. By the priority rule we
have chosen for Zt, (YJ;,2, Zt) is a copy of the exclusion process, together with the
location of a particle that is second class with respect to it. Thus the evolutions
of the processes (YJ;, YJ;, YJ;, X t ) and (YJl,2, Zt) are consistent with the definitions
given earlier in this section.
Next we must choose an initial distribution for the process (YJ;, YJ;, YJ;, X t , Zt).
We want to do it in such a way that the relative positions of X t and Zt are in
equilibrium. The construction is similar to that used in the context of Proposition
2.13, but now we use a probability measure m(·) on ZI and a collection of numbers
satisfying 0 < mk(l) < 1. Initially, choose (YJ6, YJ~,3) according to Ti, where /-L is
a measure with good marginals, and put Xo = O. Next number the YJ~,3 particles
consecutively as before, with the particle at 0 being numbered O. Independently
'*
of what we have done so far, choose Zo to be the YJ~,3 particle numbered k with
probability m(k). Finally, if Zo = k and I k, the YJ~.3 particle numbered I is
called second class with probability mk(l) and third class otherwise.
Consider for a moment the case p = 1. Suppose that initially Xo = Zo = 0
and the YJ~,3 particles at negative sites are third class, and those at positive sites are
second class. Then at all future times, X t = Zt, and the YJ;,3 particles at sites to
the left of X t are third class, and those at sites to the right of X t are second class.
In particular, the relative positions of X t and Zt are automatically in equilibrium,
and conclusions (2.25a,b) below are automatic. For this reason, we exclude the
case p = 1 from the next result.
Theorem 2.22. Suppose that! < p < 1 and that m(k) and mk(l) are chosen so
that
qm(k)mk(k + 1) = pm(k + l)mk+l (k),
(2.23) mk(k + 1)] = qm(k + 1)[1 - mk+l(k)],
'*
pm(k)[1 -
pmk(l)[1 - mk(l + 1)] = qmk(l + 1)[1 - mk(l)], I, I + 1 k.
Let Xt(k), -00 < k < 00, be the locations of the YJ;,3 particles, when numbered in
order, with Xt(O) = XI' Then
(2.24)
and
1
L
00
L
00
Remarks. (a) It is not hard to check that the following is the most general solution
to (2.23): Take a positive function a(k) on Zl and put
m(k + 1) p + qa(k)
(2.26a)
m(k) q + paCk)
and
(2.26b)
mk(l) = (fl)k-l x { a(k - 1) if I < k,
1 - mk(l) P a(k) if I > k.
Note that the resulting m is summable if a(k) tends to 0 at -00 and to 00 at +00.
In fact, it can be normalized to make a probability measure that has exponential
tails in both directions. In particular, the right sides of (2.25a,b) are finite.
(b) If the initial distribution of (11:, 11;, 11;, X t , Zt) is modified by conditioning
on an event of positive probability, and m has a finite first moment, then it follows
from (2.25a) that
supEIZt - Xtl < 00
t~O
for this modified initial configuration as well. A natural example of such an event
IS
If 11 is the product measure with good marginals, then the distribution of 116,2 =
11& + 116 after this conditioning is vA,p on ZI\{O}.
Proof of Theorem 2.22. Giving a completely formal proof of (2.24) would involve
introducing a lot of notation that would obscure the main point, so we will argue
somewhat informally. First recall that the process {(11;, 11;·3), s 2: O}, is Markov,
and that {X s, s 2: O}, is measurable with respect to it. Furthermore, while the
transitions of {(11;, 11;,3), s 2: O}, can change the locations of the 11;,3 particles,
they do not change the labellings of these particles as second vs. third class, or the
determination of which of these particles is the one with location Zt. To check the
latter statement, note that any transitions of (111, 11;,3) that affect Zt correspond to
a first class particle switching positions with Zt, or Zt moving to an empty site.
Therefore it will be enough to check that the labellings as second vs. third class
particles and the choice of which of these is at Zt are in equilibrium with respect
to the part of the evolution that does not change (11;, 11;,3),
Any such transition involves two adjacent sites, x, x + 1, that are occupied by
11;,3 particles, not both of which have the same class. We will use some shorthand
to describe the situation at these two sites, The possible situations are called
fez, 2, k), (z, 3, k), (2, z, k), (3, z, k), (2,3, k), (3,2, k), k E Zl},
The third coordinate k is determined by Xt(k) = Zr, The first two coordinates are
the classes of the particles at x, x + 1 respectively, with 2=second class, 3=third
2. Asymmetric Processes on the Integers 237
class and z = the special Zt particle. Thus (z, 2, k) denotes the situation in which
Zt = Xt(k) = x, rd(x + 1) = 1, for example. The possible transitions and their
rates are
Note, as observed before the statement of the theorem, that if p = 1, then the
third coordinate k above does not change if there are no (z, 3)'s or (2, z)'s in the
configuration. The above transitions may be easier to visualize in the following
form:
z 2 2 z
--.q
I
Z = X(k) X(k + 1) X(k) Z = X(k+l)
z 3 3 z
I --.
p
I
Z = X(k) X(k + 1) X(k) Z = X(k+l)
2 z z 2
--.p
3 z z 3
I --.q
I
X(k - 1) Z = X(k) Z = X(k-l) X(k)
2 3 3 2
I I --.p
3 2 2 3
I I --.q
I
X(j) X(j + 1) XC)~ X(j + 1)
Figure 6
238 Part III. Exclusion Processes
The rates are shown above the arrows. X (k) is the special Z particle, and in the
last two transitions, k -=1= j, j + 1, i.e., the special Z particle is not at x or x + l. The
three equalities in (2.23) are exactly the detail balance, or reversibility, conditions
for these transitions. This establishes not only (2.24), but the stronger fact that the
full assignment of classes to the particles in rJ~,3 is in equilibrium.
To prove (2.25a), observe first that (2.24) implies that the event {Zt = X t (k)} is
independent of {(rJ.;, rJ;,3), s 2: O}, and hence of the sequence {Xt(k), -00 < k <
oo}, which is measurable with respect to it. By Proposition 2.10, the distribution
of rJ~,3 is a shift invariant measure, conditioned on rJ~,3 (0) = 1, so
The proof of (2.25b) is similar, noting that by Lemma 2.15, Xt(k + 1) - Xt(k)
has moments of all orders that are bounded in t.
Theorem 2.28. In the context ofthe discussion preceding the statement of Theorem
2.22,
EXt = vt,
where v = (p - q)(1 - P - A). If, in addition, m(·) has mean zero, then
EZt = vt.
Remark. One way to give m mean zero is to make it symmetric about the origin.
If m is written as in (2.26), it is symmetric about 0 if and only if
2. Asymmetric Processes on the Integers 239
This symmetry is easy to achieve, while still making m have exponential moments.
It is enough to define {a(k), k ~ O} so that limk-+oo a(k) = 00, and then define
{a(k), k < O} by (2.29).
Proof of Theorem 2.28. The second statement follows from the first by doing the
computation in (2.27) without the absolute values. This leads to
1
E(Zt - Xt) = - Lkm(k) = O.
p - A k
1]~.3 (x) = 1, let X~ be the location at time t of the particle that was at x at time
O. Then the current of 1];.3 particles can be written as
Taking expected values with respect to the process with initial distribution fJ., with
good marginals and using translation invariance, we have
x:<:,O x>O
(2.31 )
Now we need to compute the left side of (2.31) in a different way, to determine
the value of EXt. Write
The two terms on the right side depend only on the marginal processes 1]1,2,3
and 1]1 respectively, and these are in equilibrium under the evolution with initial
distribution fJ.,. At a time when the distribution of the exclusion process is y,
the rate at which particles cross from (-00, x] to (x, (0) is just py(1xOx+l) -
qy(Oxlx+l), where we have used the shorthand for cylinder probabilities from
(2.2). Therefore
d d
(2.33) -Ell Jl = (p - q)A(1 - A) and -Ell J/,2,3 = (p - q)p(1- p).
dt t dt
Proof For any configuration rJ and x E Z 1, let N (x, rJ) be the signed number of
particles in (0, x]:
if x > 0,
(2.35) if x < 0,
if x = O.
Then, since particles of a given class do not change their order,
on the event {rJo' (0) = I}. So, we need to consider laws oflarge numbers for 1:.
23 .
and then
EM; =).,(l - ).,)t.
In particular,
Mt
(2.39) --+0 t-+oo
t '
in probability.
Since VA is extremal invariant for the exclusion process (see the discussion
of Example 1.5 and Theorem 1.17), ry1 is a stationary and ergodic process by
Theorem B52. Therefore, the ergodic theorem (Theorem B50) gives
in probability with respect to pfL. A similar statement holds for J/,2.3 with ).,
replaced by p, so by (2.32),
J 2 ,3
(2.40) _t_ -+ (p _ ).,)v
t
in probability with respect to pli. On the other hand, since the distributions under
pfL of ry1 and ryt1,2,3 are independent of t, the weak law of large numbers for
independent Bernoulli random variables gives
1 I rt
L ry1 (y) -+ ).,r,
rt
t - "" ryl,2,3(y) -+ pr
t ~ t
y=1 y=1
in probability with respect to pll-. The statement of the theorem in the sense of
convergence in probability comes from comparing (2.41) and (2.42). To check LI
convergence, note that IXII is dominated by a Poisson process, so that
{Xdt,t> I}
is uniformly integrable.
Theorem 2.43. Suppose TJI has initial distribution v).,p on ZI\{O), with a second
class particle placed at the origin. Then ZI, the location o/the second class particle
at time t, satisfies
. Var(ZI) p(l - p) + A(l - A)
D = hm = (p - q) .
1-+00 t p - A
Remark. Note that this expression for the variance tends to 00 when p - A --+ O.
This suggests that if p = A, the motion of the second class particle is superdiffu-
sive. See Spohn (1991) for a discussion of this.
The proof of Theorem 2.43 is based on a number of reductions, which are stated
below as propositions. Throughout this discussion, t-t is the product measure with
good marginals, and Ii is t-t conditioned on TJ 2 ,3 (0) = 1. As in the case of first
moments, we will prove a result analogous to that in Theorem 2.43 for XI first,
and deduce the result for ZI easily from it later.
In (2.31), we related the first moments of 112 ,3 and XI' Let's try to do the same
thing for second moments. The first observation is that the ordering of the tagged
TJ;,3 particles is preserved, so using the notation from the proof of Theorem 2.28,
we see that
(2.44) x < y and TJ~,3(x) = TJ~'\Y) = 1 implies X~ < xi.
In particular, the product of any pair of summands in (2.30), one from the first
sum, and one from the second sum, is zero. Writing (2.30) in the form 1/,3 =
1?,3,+ - 112,3,-, where the terms on the right are defined as the two sums in (2.30),
it follows that
(2.45)
To compute the expected value of the first term on the right of (2.45), square out
the sum and use (2.44) again, to obtain
2. Asymmetric Processes on the Integers 243
Arguing as in (2.31),
(2.47)
y<x:,::O
(2.49) y<O:,::x
common graphical representation, and let xi be the position of the second class
particle that started at y for the un starred process, and xi'* be the position of the
second class particle that started at y for the starred process, Then (2.49) is simply
the average value with respect to Ii of
(2.50)
The expected value in (2.50) refers to averaging over the evolution for a fixed
initial condition.
Let··· < Y_I < Yo = 0 < YI < ... be those values of Y so that y 2 ey) = 1.
Then at all times t, ... < Xtl < xio < xii < ... are the locations of the
7]2,3 particles in the unstarred process, and· .. < xi- I ,* < xii" < ... are the
locations of the 7]2,3 particles in the starred process. These two sets of locations
are the same, except for Vt which is in the first set, but not the second. For a
given realization of the processes and time t, define i by setting xi; = Vt . Since
the labels must match up far out to the left and far out to the right, it must be the
I
case that
X tYi + 1 if i :::; j < 0,
xi" ~ XYj-l
t
X Yi
ifO<j:::;i,
otherwise.
t
Therefore
j=i
=I U<oJ[eXr)+ - eX;o)+]
=l rxt >vtJ[V/ - xi].
Averaging over Ii gives the following expression for (2.49):
(2.51)
Similarly, the third sum on the right side of (2.48), except for the factor of A,
can be written as
(2.52)
The difference between this and the computation leading to (2.51) is that y', the
initial configuration of the starred process, is obtained from y by replacing the
7]2,3 particle at the origin by a first class particle. Ut is the site at which there is
an 7]2,3 particle in the unstarred process and an 7]1 particle in the starred process
at time t.
Using translation invariance again, we can compute the sum of the negative
terms on the left side of (2.48):
2. Asymmetric Processes on the Integers 245
(2.53)
y>O
Combining (2.31), (2.45), (2.54) and (2.55) gives the following result:
Before proceeding, a few words are in order on ~w this result will be used.
Recall that we are trying to find the asymptotics of Var fl X t . Proposition 2.56 relates
this to the variance of the current (on the left side of the identity) and first order
properties of Xt, V t and Vt (on the right side of the identity). We will actually end
up using Proposition 2.56 in both directions: to compute the variance of the current
in terms of the variance of the tagged particle, and vice versa. This is a profitable
approach for the following reason. We will be able to compute the asymptotic
variance of X t directly when A = 0 or p = 1. This is because of Proposition 2.13.
As it stands, that result refers to an invariant measure v for the coupled process. In
general, this cannot be written down explicitly. However, if A = 0, for example,
then v is the same as the product measure with good marginals - this is the key
fact. Once we carry out this part of the argument, Proposition 2.56 will give us the
variance of the current in this case. However, we will be able to use the variance
of the current in this case to compute the variance of the current in the general
case, and then use Proposition 2.56 again to get the variance of X t in the general
case.
246 Part III. Exclusion Processes
But first, we need to handle the first order tenns in Proposition 2.56. The law
of large numbers given in Theorem 2.34 will allow us to do this for XI' So, we
need to prove analogous laws of large numbers for VI and VI to be used in the
tenns that involve them. Note that V t is the position of a second class particle
starting at the origin when the rest of the system is made up of first class particles
that have initial distribution VA on Z 1\ {O}, while Vt is the position of a second
class particle starting at the origin when the rest of the system is made up of first
class particles that have initial distribution vp on Z 1\ {O}. Therefore the results we
need for VI and VI are the same, except for the density of the initial distribution.
To avoid confusion, we will call the density of particles away from the origin fJ
in the next result, and the position of the second class particle Wt .
Proposition 2.57. Consider the exclusion process 17t that consists of first class
particles on Zl \ {O} with initial distribution vj3, and a single second class particle
initially at O. Let Wt be the position of the second class particle at time t. Then
in LI.
Proof Choose A = fJ < p. Consider the process (171, 17;, 17;, Xr, Zt) with the initial
distribution used in Theorem 2.22, based on a choice of m(·) that is symmetric
about 0 and has exponentially decaying tails. Let L t be the position of the leftmost
particle in 17; and R t be the rightmost particle in 17;. Note that these are finite,
since by (2.26b),
L mk(l) <
l<k
00, L [I - mk(l)] <
l>k
00.
Recall from the proof of Theorem 2.22 that given {(171 , 17;,3), s ::: O}, the law of
the location of Zt relative to XI> and the labelling of the 17;,3 particles as second
vs. third class are in equilibrium. Again let Xt(k) be the ordered locations of the
17;,3 particles at time t, with Xt(O) = Xt. Then, as in the computation that led to
(2.27),
Then
2. Asymmetric Processes on the Integers 247
E(ILt - Xtl
-
I G(k, E») = E"(IXt(l) - Xt(O)I) = -III- ,
P-A
where I = min {j : Ej = I}. Therefore,
for appropriate choice of a (.) in (2.26). Since the right side is independent of t,
even if the initial distribution is conditioned on the event
. Lt
(2.58) hm -
t-+oo t
= (p - q)(l - A - p)
in probability.
Now consider coupling together the processes (1/t, Wt ) and (1/1, 1/;, 1/:, x t , Zt),
using a common graphical representation. The initial configurations are coupled
by saying that Xo = Wo = 0 and 1/0 = 1/6 on Zi\{O}. Note that with this coupling,
Wt :::: L t for all t, provided that it is true at t = O. To see this, it suffices to check
that 1/; (Wt ) = 1, i.e., the sole second class particle in (1/l> Wt ) is always at a site
occupied by a second class particle in (1/] , 1/;, 1/:, X t , Zt). On the set {1/5 (0) = I},
Lo = Wo = O. Combining these observations with (2.58), we see that
A < P = {3, using R t instead of Lt. To check LI convergence, note that IWtl is
dominated by a Poisson process, and hence {Wt / t , t > I} is uniformly integrable.
in LI.
Recalling the discussion following the statement of Proposition 2.56, we see
that the next step in the proof of Theorem 2.43 is to obtain the asymptotics of
VarJi X t in the special case A = 0. We tum to this next.
t~1
and
. Varl" X t
hm - -
t-->oo t
= (p - q)(1 - p).
Proof Since A = 0, the product measure with good marginals is just the measure
in which there are no first class particles, and the 7]2,3 particles have distribution
vp. In this case, therefore, this measure is invariant for the process (7]}, 7];,3). Using
the assignment of labels described prior to Proposition 2.13 (with c = 1, say), we
have then that (Tj;, Tji) is stationary. Let L t be the position of the leftmost 7];
particle at time t. Then all moments of L t - X t are uniformly bounded in t, so
we may as well prove the result for L t in place of X t .
The proof for L t is based on Theorem B61. The point is that we can map
the evolution of iL t 7]; to a series of queues. To make the connection, think of
the number of sites between successive particles in 7]; as queue lengths. When a
particle moves from x to x + 1, for example, this can be thought of as a customer
moving from one queue to the previous one. Therefore, in the context of Theorem
B61, we should take S = {-I, 0,1, ... } with x* = -1, and
f
if y = x-I, x ::: 0,
q(x,Y) ~ { if y = x + 1, x ::: - 1,
otherwise.
Then
rr(x) = (~r+l,
and we should take p(x) = rr(x) + (1 -rr(x) )(1 - p), so that the A from Theorem
B61 is just (p - q) (1 - p) as required. Since with this mapping, the net output
2. Asymmetric Processes on the Integers 249
process for the queuing system is just L t - L o, the result for L t follows from
Theorem B61.
There is one difficulty, however. We need to know that our mapping takes the
distribution of iL , I1; to the measure v that is relevant to Theorem B6l, for the
present choice of p (.). This should not be surprising, since both v and the image of
the distribution of iL, 11; are invariant for the queuing system, and both distributions
have a queue length that is asymptotically distributed as the same geometric, since
p (x) -+ 1 - p as x -+ 00. What we need then is the following: Let {X (k), k E Zl}
be the ordered locations of particles in vp , so that the increments X (k + 1) - X(k)
are independent and satisfy
qk
r(k) = k k
P +q
There is a leftmost particle among the remaining ones, since Lk<O [1-r(k)] < 00.
Let Y (0) < Y (l) < ... be the positions of the remaining particles. Then it should
be the case that the increments Y(k + 1) - Y(k) are independent and satisfy
Next we need to compute the joint distribution of the Nk'S, so that we can compute
the expected value of the right side of (2.64). Fix an m well beyond the point that
the Uk'S become 1, and write
_(p)no+.+n
- -
m- 1 _
P(No - n m ).
q
=
n[
k
pn(k)uk ]
l - uk[l-p+pn(k)] E
(P)mNO
q
Since the left side is 1 when Uk == 1, the right side must be 1 as well in that case.
This allows us to evaluate the final factor on the right side. Using this, we find
that
as required.
Remark. Just as in Theorem B61, this proof implies that X/ satisfies the central
limit theorem. We will not need this in what follows, however.
We can now use this information on the asymptotic variance of the tagged
particle to get corresponding information for the current.
VarJl J1
lim _ _ I = (p - q»).,,(1 - )"')12)", - 11
/---+00 t
and
Varl1 J 1,2,3
lim / = (p - q)p(1 - p)12p - 11.
/---+00 t
Proof Part (b) follows immediately from part (a), since J/. 2.3 is the current for
a system with distribution vp and J/ is the current for a system with distribution
VA' To prove part (a), we evaluate asymptotically the right side of the identity in
2. Asymmetric Processes on the Integers 251
Proposition 2.56. In doing so, we use Proposition 2.61, together with the following
limiting statements that come from Theorem 2.34 and (2.60):
Vf Xt Vt
--+p-q, - -+ (p - q) (1 - p), - -+ (p-q)(1-2p)
t t t
as t -+ 00 in L 1. Therefore
VarJL J2.3
lim t =p2(p _ q)(1 - p) + p(1 _ p)2(p _ q)
t--+oo t
+ 2p(l - p)(p - q)[(l - 2p)+ - (l - p)]
=(p - q)p(1 - p)12p - 11·
(2.66)
. J/ = (p -
hm - '
r
q)A(1 - A) - rA
t
t--+oo
in probability and L 1. The mean of the left side of (2.66) is the right side for fixed
t, except for small errors coming from the fact that rt may not be an integer. The
analogue of the conclusion of part (a) of Corollary 2.65 is then that if A = 0,
VarJL J2.3
(2.67) lim t,r = p(1 - p)l(p - q)(1 - 2p) - rl.
t--+oo t
Even though J?,3 = J/,2,3 - J/, Corollary 2.65(b) does not give enough
information to compute the asymptotic variance of J?,3, since at this point we do
not know what correlations might exist between J/,2,3 and J/. To get around this
problem, we will relate currents to initial configurations, and it is here that (2.67)
is relevant. For the next statement, recall the definition of N(x, TJ) in (2.35).
Proposition 2.68. Suppose fJ., is the product measure with good marginals and
A = O. Then
252 Part III. Exclusion Processes
Proof The idea is to use (2.67) in the case that the limit is zero, so take r =
(p - q)(1 - 2p). By (2.66), the asymptotic mean of Jt~~3 is (p - q)p2 t . For ease
of exposition, assume r < O. Write I] = 1]~,3, which has distribution vp' As usual,
write X~ for the position at time t of the particle that began at x. Then
(2.69)
= L l](x)l{x;>o) - L l](x)l{x;::oo} + L I](x) - (p - q)p2 t
x~-rt x>-rt
But the right side of (2.69) has the same distribution as Jt~~3 - (p - q) p 2t, so the
result follows from (2.66) and (2.67).
Corollary 2.70. Suppose f.L is the product measure with good marginals. Then
+ N(p
(2.71)
J/
o
- q)(2A - l)t, 1]6) - (p - q»)."h
-+ 0,
(2.72)
and hence
(2.73)
o
N«p - q)(2p - l)t, 1]6,2,3) - N(p - q)(2A - l)t, 1]6)
+ 0 -+0
in L2. In particular,
1I. mVarJt2,3
---
I
t ..... oo t
Proof Statements (2.71) and (2.72) are just Proposition 2.68, applied to the
marginal processes that have distribution VA and Vp respectively. Statement (2.73)
is obtained by taking differences of the first two statements, recalling (2.32). The
final statement comes from (2.73), since the numerator of the second expression,
after some cancellation, is a sum and difference of independent Bernoulli random
variables. One does have to exercise some care in the computation, since different
2. Asymmetric Processes on the Integers 253
We are now finally able to detennine the asymptotics of the variance of the
shock.
. Varl/,3 2. VarX t
hm - - =(p -)..) hm - -
t ..... oo t t ..... oo t
+ (p -)..)(1 - p + )..)(p - q)ll -).. - pi
+ 2(p - )..)(1 - p)(p - q)[(1 - 2p)+ - (1 -).. - p)+]
+ 2(p - )..»)..(p - q)[(1 - 2),,)- - (1 -).. - p)-]
2· VarXt 2
=(p -)..) hm - - - (p - q)(p - )..)(1 -).. - p)
t ..... oo t
+ 2(p - q)(p - )..)[(1 - p)(1 - 2p)+ +)..(1 - 2),,)-].
Proposition 2.74. Suppose 1]1 has initial distribution vA,p on Z'\{O}, with a second
class particle placed at the origin. Then ZI, the location of the second class particle
at time t, satisfies
(p - ),,)ZI - (p - q)(p - )..)t + Ll x l:S(P-q)(P-A)I1]O(X) --+ 0
(2.75)
.fi
254 Part III. Exclusion Processes
Proof The mean of the expression in (2.75) tends to zero, by Theorems 2.22 and
2.28. Therefore, it is enough to show that the variance tends to zero. Its variance
is (up to small errors caused by the fact that (p -q)(p - A)t may not be an integer
- we will ignore such errors in this computation) is
VarZ
(2.76) (p - A)2 _ _t + (p _ A)2 D + 2(p _ A)
COV(Zt'LI x::o
I ( )( A) 1I0(X»)
p-q p- t ,
t t
where D is defined in Theorem 2.43 and Cov denotes the covariance of two
random variables. Theorem 2.43 gives us the asymptotics of the first term, so we
need only consider the covariance term. In particular, the covariance term should
in the limit exactly cancel the first two terms in (2.76).
Take x > 0, and compute
Of course,
(2.80) lim
(-+00 (p - q)t
"
~
[E(Zt 11I0(x) = 0) - E(Zt 11I0(x) = 1)] = 1.
Combining (2.80) with (2.78), it follows that asymptotically, in the Cesaro sense
over the same range,
p(1 - p)
Cov(Zt' 1I0(X») '" - .
P-A
This, together with the corresponding result for negative x's and Theorem 2.43
implies that the limit of (2.76) is zero as t -+ 00.
So, we need to prove (2.80). Choose (116, 1I~,3) according to the product measure
J1 with good marginals, and let Xt(k) be the ordered locations of the 11;·3 particles.
Define processes Yt (k) so that
by setting Yo(k) = Xo(k), and letting these positions evolve according to the
graphical representation with the following priority rule:
Since this construction is shift invariant, the distribution of Xt(k) - Yt(k) is inde-
pendent of k. Therefore,
1
(2.81 ) E[Yt(k + I) - Yt(k)] = E[Xt(k + I) - Xt(k)] = - - ,
P-A
where the second equality comes from Theorem B47.
The priority rules we have chosen are intended to guarantee that when viewed
from Yt(k), the process
'7: + 1{Y,(j).j>k}
has distribution v).,pS(t). Consider then the process ('7t, Yt(O), Yt(-I)), where
Couple the two processes (~tO, U tO, Vto) and (~/ ' U/, Vt1) together with the graph-
ical representation. For a certain amount of time, the configuration of the coupled
processes will be of the form
*' *... 0
*' * ...
where *' and * represent the locations of the particles earlier denoted by uj and
V/' in either order, and the· .. represent the rest of the configuration of O's and
1'so These agree in the two configurations. The location of the ~ can be thought
of as moving as a second class particle with respect to the process without the *S.
At some point, the : and the ~ may be at adjacent sites. (By this time, * may
be either of the star particles, since their order is not preserved by the evolution.)
At this time, the following transitions affecting these two sites are possible:
(2.83) * 0 0 * and * 0 * 0
* * * *
at rates p and q respectively. It is important to note that the transitions in the
opposite direction do not occur, so that once one of the transitions in (2.83) has
occurred, there will no longer be a site with a ~ in the pair of configurations.
At this stage, the three special sites are of the form *:, 0 and *1' These are
* two
all second class with respect to the other particles. The latter * of them interact
256 Part III. Exclusion Processes
with each other, with ~ having greater priority than ~. The interactions among
the other two pairs is more interesting. Here are the possible transitions in these
cases:
* *' at rate p if *' > *,
*'
(2.84) *' * -+
* *' at rate q if * > *',
*' *'
*' *
at rate q if *' > *,
*'
where * > *' means that * has priority over *', and
0 *'
at rate p if * > *',
*' *
*' 0 0 *'
(2.85) -+ at rate p if *' > *,
*' * * *'
*' 0
at rate q if * > *'.
* *'
The thing to notice is that after these transitions, the 1 is paired with the higher
priority * (in the first case) and the 0 is paired with the lower priority * (in the
second case). After each of these types of transitions has occurred, this pairing
will persist forever, and then VI O = V/ at all later times.
Breaking up the following expectation according to whether 1)o(x) = 0 or 1,
we have
(2.86)
By (2.81), the left side of (2.86) is (p - A)-I. Pretend for a minute that VtO = V/
with probability 1. Then we would be able to rewrite (2.86) as
(2.87)
for the appropriate (x, t) range. This would follow from (2.87), provided that
But the left side of (2.89) can be thought of as the expression on the right side,
but computed for a shifted x' = x + Yo (0) - Yo (-1). Since what we are interested
in is a Cesaro average of these expressions as x varies, this shift plays no role in
the limit.
2. Asymmetric Processes on the Integers 257
This is essentially the entire proof of (2.80), except for the proof that
(a) the transitions (2.83), (2.84) and (2.85) will have occurred with large probability
by time t if x is in the range relevant to (2.80): 0 < x « (p - q)(p - ),.)t, and
(b) errors in the above argument caused by the fact that VtO = U/ is only true
with large probability, not with probability 1, disappear in the limit.
At this point we discuss the basic ideas for (a) only, referring to Ferrari (1 992a)
for the rest of the details. The *s are travelling along the shock, so that by Theorem
2.34, they are moving at rate (p - q)(l - ),. - p). Until it nears the *s, the ~
moves like a second class particle in an environment with distribution vp , so that
by Proposition 2.57, it moves at rate (p - q)(l - 2p). Therefore, these will meet
at approximately the time s at which
i.e., at time
x
s=------
(p - q)(p -),.)
So, they will have met by time t with high probability provided that
x «(p-q)(p-),.)t
as claimed.
Here is the central limit theorem for Zt. which follows easily from Proposition
2.74.
Theorem 2.90. Suppose rJt has initial distribution vA,p on Zi\{O}, with a second
class particle placed at the origin. Then Zt, the location of the second class particle
at time t, satisfies the following:
Zt - vt
as t ---+ 00.
Proof Since the evolution of the system is translation invariant, shifting the dis-
tribution at time zero has the same effect as shifting the distribution at time t.
Therefore, it is enough to show that for any cylinder function,
The right side of (2.96) tends to zero as t --+ 00 by Theorem 2.90. Applying this
to g = a translate of f gives (2.95).
(2.98) sup EIL t - Xtl < 00, sup EIR t - Xtl < 00.
t t
f f
limit of
fd(v)",pS(t)Tvt+av'!] IS fd[av)" + (1 - a)vp].
= lim [h(t)
t~oo
+ h(t) + h(t)],
where Mt), h(t), [3(t) are the expressions in the middle of (2.99), but where the
expected value is taken over the events
where what we mean by this in case the limit on t does not exist is that these
statements are true with limt replaced by either lim SUPt or lim inft . The two
statements in (2.100) are proved in the same manner, so we consider only the first
of them.
Recalling Theorem 2.22 and the proof of Proposition 2.57, we have
lim P(G t )
t-+oo
= a.
Ih(t) - ! fdV).P(Gt)1
!
(2.101)
= /2n ~ 1 jt.;n E[Tvt+av't+(2k+l)J(1Jt) - fdv)., GtJ/.
{vt + aJt + (2k + l)j - k, ... ,vt + aJt + (2k + l)j + k}.
Provided that t is large enough that 2(k + l)n +k :::: t l / 4 , on the event G" we may
replace the 1], above by 1]1. After making that replacement, the translates of f that
appear in (2.101) are i.i.d. with the distribution of f, evaluated at a v). distributed
configuration. Let Uj be i.i.d. with this distribution. Using the Schwarz inequality,
we see that the square of (2.101) is bounded above by
In
E [ - - "(U' - EU·)
]2 = --Var(U
1
1).
2n+l.~ j j 2n+l
j=-n
Now we can go back and answer the original question raised in this section
- what is the limit of v).,pS(t) for various values of A and p? We must restrict
ourselves to A < p, since that has been the underlying assumption throughout.
3. Invariant Measures for Processes on {I, ... , N} 261
ifA+p>l,
ifA+P < 1,
ifA+p=1.
(2.103)
° AP,
°
--+ 1 at site 1 at rate
1 --+ at site 1 at rate (1 - A)q,
° --+ 1 at site N at rate pq,
°
1 --+ at site N at rate (1 - p)p.
The resulting process 1/1 is a finite state, continuous time, Markov chain on
{O, I}N, which is irreducible except for a few choices of A, p, P E [0, 1]. The main
objective of this section is to study properties of its stationary distribution, and
to then relate these to some of the phenomena studied in Section 2. It is perhaps
surprising that the stationary distribution can be written down fairly explicitly. In
contrast with Section 2, the approach in this section is almost entirely analytic.
Theorem 3.1. Suppose that the matrices D, E and vectors iV, v satisfy
(3.2a) pDE-qED=D+E,
(3.2b) iV[ApE - (1 - A)qD] = iV,
(3.2c) [0 - p)pD - pqE]v = v
For 1/ E {O, I}N, put
(3.3) IN(1/) = iV n
N
i=1
[1/(i)D + (1 - 1/(i»E]v.
If (3.3) is well defined for each 1/ (i.e., the matrix products converge), IN satisfies
Li" IN(/;) =1= 0, and 1/1 is irreducible, then
Proof Let Q(1/, n be the rate at which the process goes from 1/ to ~:
if 1/(i) = 1, 1/(i + I) = 0,
if 1/(i) = 0, 1/(i + 1) = 1
for 1 :s i < N,
AP if 1/(1) = 0,
Q(1/, 1/1) = { (1 _ A)q
if 1/(1) = 1,
3. Invariant Measures for Processes on {I, ... , N} 263
pq if rJ(N) = 0,
Q(1), rJN) ={ (1 _ p)p
ifrJ(N) = 1,
and Q(1), 0 = 0 otherwise. Here, as usual, rJi is the configuration obtained from rJ
by flipping the ith coordinate, and rJ;,j is obtained from rJ by interchanging the ith
and jth coordinates. A finite state, irreducible Markov chain has a unique nonzero
invariant signed measure of given total mass, and it is strictly of one sign. So, it
will be enough to show that
(3.4)
;=2
[1;(i)D + (1 -1;(i))E]ii
wn[1;(i)D + (1 -1;(i))E]ii
N
=-
i=2
as required. In the middle equality, we have used (3.2b). The verification of (3.5c)
is similar, using (3.2c) in place of (3.2b). For (3.5b), we may assume that I; (i) = 1,
264 Part III. Exclusion Processes
qfN(~i,i+l) - pfN({)
=W n[~(j)D + (1 - ~(j»)E][qED
i-I
j=1
n [~(j)D + (1 - ~(j»)E]v
- pDE]
N
j=i+2
The reader might wonder how one would guess that (3.5) is true and/or relevant
here. As mentioned in the proof, the expressions on the left are natural because
they are defects from reversibility. Computing these for small values of N led to a
guess that these defects might be related in this way to the stationary distribution
of the smaller system, and that suggested that there might be a potentially useful
recursion here.
Now multiply (3.2a) by [)...(l - p)p2 - p(l - )...)q2]2 and use (3.8) to replace E
and D by A and B. This gives
The left side of (3.9) minus the right side of (3.9) factors nicely:
satisfy (3.2). Except for a constant multiple, the resulting IN gives the distribution
VA, so VA is the stationary distribution in this case. The same is true if p = by !
continuity. Part (a) in cases (ii) and (iii) is easy to check directly.
For part (b), assume that p = 1. Then (3.2) becomes
Suppose that D and E are finite dimensional, and that the vector u satisfies Eu =
u. Multiplying the first identity in (3.10) by u on the right gives Du = Du + u,
and hence u = o. It follows that E - I is invertible, so that we may solve the
first identity in (3.10) for D: D = E(E - I)-I. But this implies the E and D
commute.
Assumption: From now on, we will exclude the trivial cases in (a) above. There-
fore, the matrices D and E (if they exist) will necessarily not commute and will
usually be infinite dimensional. This also makes the Markov chain 11t irreducible,
so the stationary distribution is unique. In addition, we will assume that p > !.
There is no real loss in this assumption, since there is a symmetry between the
cases p > !
and p < !,
and as we saw in the last section, the symmetric case
p = !
is much simpler than the asymmetric case.
266 Part III. Exclusion Processes
do d'0 0 0 eo 0 0 0
0 dl d'I 0 e'0 el 0 0
D= 0 0 d2 d'2 E= 0 e'I e2 0
0 0 0 d3 0 0 e'2 e3
where we have set d_ 1 = d~1 = e_1 = e~1 = O. Assuming that d; =1= 0, e; =1= 0 for
i ::: 0, the last two equations in (3.11) become
With our choice of matrices, equations (3.2b,c) for v= (vo, VI, ... ) and IV =
(wo, WI, .•. ) become
(3. 13 a)
and
3. Invariant Measures for Processes on {I, ... ,N} 267
(3.13b)
For fixed positive choices of ei, d i , e;, d;, (3.13a,b) has solutions V, W that are
unique up to constant multiples, and can be computed recursively, provided that
A> 0, P < 1.
To understand the nature of the solutions, note first that ei ~ (p - q)-I and
di ~ (p - q) -I as i -+ 00 by (3 .12a), and it is consistent by (3 .12b) to take
e; ~ (p - q)-I and d; ~ (p - q)-I as well. Therefore, the solutions to (3.13)
should behave asymptotically like the solutions of the second order recursions
with constant coefficients
(3.14)
respectively.
In order for (3.3) to be well defined, we need Li IViWiI < 00. This will be
true for any choice of constants in (3.14) if P < A, q < A, p < p, but in general,
we would need at least one of C I, C2 to be zero.
Looking at (3.12b) and recalling that we need d~1 = e~1 = 0, it makes sense
to choose do, eo so that
q
(3.15) [1 - (p - q)dol[l - (p - q)eol = -,
p
and then
I I 1 - (qjp)i+1
d·I = e· = - -p_q
I
---
To solve (3.13) it is natural, in view of(3.14), to try solutions of the form Wi = Wi
and Vi = Vi, where
1- A q p q
W = - - or and V = -- or
A p 1-p p
p 1- A
A = 1 or p = 0 if v = - - and w = - -
I-p A
q 1- A
A = 1 or p = 1 or q = 0 if v = - - and w = - -
(3.16) P A
A = 0 or p = 0 or q = 0 if v = _P- and w = - f{
1- P P
q = 0 if v = - f{ and w = - f{.
p p
Correlation Functions
Next, we will see how to compute correlation functions for the stationary distri-
v,
bution p, N in terms of the vectors wand the matrices D, E and C = D + E.
L iN
ry
(I]) = wCNv,
and
Proof The proofs of all three are immediate from (3.3). For example, consider the
first statement with N = 2. Then the left side has the following four contributions:
wD 2v if I] = 11
wDEv if I] = 10
wED v ifl]=OI
wE 2v if I] = 00,
various choices of vectors and matrices that satisfy the assumptions of Theorem
3.l - see (3.16). When p = 1, (3.2) becomes
where keN) is a reasonable sequence, and Lk is the shift that moves site k to the
origin.
RN(X) = L 2Nk-
N
k=O k
(2N - k) x k+!
N
for N ::: 1.
Proof Note that eN = (D + E)N is a sum of products of D's and E's. Anytime
v
the product is of the form Ei Dj, WEi Dj can be computed easily, because w
v
is a left eigenvector of E and is a right eigenvector of D. Since E and D do
not commute in general, we cannot simply reorder the factors so that all the E's
precede all the D's. However, we can use the first part of (3.l8) to take any pair
that is in the wrong order, and replace it bye. Of course, that reduces the degree
of the product by one.
Here is the result of using this reduction repeatedly:
(3.20) N _
e-~
~_k_(2N -k) ~ j
~ED.
k-j
k=O 2N - k N j=O
C N +1 = L - - -
N k (2N _ k) LE)Dk-)(D+E).
k
(3.21 )
k=O 2N - k N )=0
The factor of D is on the correct side of E) D k -), but the factor of E is on the
incorrect side. To fix this, write for n 2: 1
D nE = D n + D n- 1 + ... + D + E.
Using this in (3.21) gives
C N +1 = t _k_(2N -k) t
k=O 2N - k N )=0
E) [Dk-j+l + D k-) + ... + D + E]
~
=~--- (2N - k k) "~ ..
E'DJ.
k=O 2N - k N I:Oi+):o:k+l
;,),:0
(3.22) bN k
2N - k
(2N - k)
N
1+1
= 2N - / + 1
(2N -/ +
N+1
1) .
This is clearly true for I = N, since both sides are = 1. Given that, (3.22) is
equivalent to the equality of the successive differences of the two sides of (3.22):
~
wCNv = L
~ N k (2N _k) k.
LA-J(l- p)J-k W
. ~ ~
' v
k=O 2N - k N )=0
=" kN
f=Q2N-k
(2N-k)(l-p)-k-l_A-k-l~ ~
N (l_p)-I-A- 1 W·V '
3. Invariant Measures for Processes on {I, ... , N} 271
where the final step comes from summing the finite geometric series. The statement
of the proposition follows by using the definition of RN .
1 4N
JIT(2x-l)2 N ~
2 4N 1
JIT NI/2
if
lX=2:
N+l
( )
(1-2x) x(l~X) if 0 < x < ~.
Proof One could look at the definition of RN directly to carry out the asymptotics,
but it is bit easier to argue indirectly. Multiply (3.23) by Xl+l and sum for 1 <
I .:'S N. The result, after some cancellation is
RN(X)
x-I
= ~RN+l(X) + 2N
1+ 1 (2NN ++ 11) .
This can be rewritten as
[x(1-x) ]NRN(x-)-
1 [ x(1-x) ]N+l RN+1(X- 1) = [ x(1-x) ]N
2N
1+ 1 (2N ++ 11)
N
.
Replacing N by k, and then summing this telescoping series for 0 .:'S k < N leads
to the following alternative representation for RN:
(3.25)
The advantage of this representation over the definition of RN is that the summands
on the right depend on k but not on N. The final ingredient of the proof is the
following form of the Taylor expansion of the square root function:
(3.26) 1 - }1 - 4y = 2y L --
00 1 (2k + 1) y , k
2k + 1 k + 1
k=O
This may be used with y = x(1 - x) since x(1 - x) .:'S t for 0 .:'S x .:'S 1. Passing
to the limit in (3.25) gives
if ~ .:'S x < 1,
= l-II-2xl = lx-
N 1 1
x- 1 _ lim [x(1-x)] RN(x-)
N-+oo 2x(1 - x) (1 - x)-l if 0 < x < ~.
This already gives the statement of the lemma if 0 < x < ~. If ~ .:'S x < 1, this
argument gives only
272 Part III. Exclusion Processes
N 1 ~
[x(l-x)] RN(X-) = 6[X(l-X)] 2k+ 1
k 1 (2kk++11) '
and the statement of the lemma follows from this and
_1+_
2k 1
(2k +
k+1
1) '" _1_
.fiT k~ ,
4k
Combining Proposition 3.19 and Lemma 3.24, it is not hard to determine the
asymptotics of wCNv. Recall that we are excluding the trivial cases A = 0 and
p=l.
The Current
Corollary 3.27 is already enough to determine the asymptotics of the current
JLN(lO) = JLN{IJ : 1J(i) = 1, IJU + 1) = OJ.
Since this is the rate at which particles move from i to i + 1 in equilibrium, this
quantity is independent of i. To see this, simply compute
1~(l
ifp:S!:SA,
lim JLN(lO)
N-->oo
= - A) if A :S ~ and A + P :S 1,
p(l - p) if p ::: ! and A + P ::: 1.
3. Invariant Measures for Processes on {I, ... , N} 273
WCN V
WCN-IV
WCN V '
where the second equality comes from (3.18). Now apply Corollary 3.27.
Theorem 3.29. Suppose p = 1, and keN) a sequence ofintegers such that keN) --+
I
00 and N - keN) --+ 00. Then
VI if P ::: ! ::: A,
lim
N->oo
Tk(N)fl,N = v~ if A ::: ! and A + P < 1,
vp if P :::: ! and A + P > 1.
If A < !, P > !, and A + P = 1, then any weak limit of Tk(N)fl,N is of the form
(3.30)
(3.31 )
for each N. Since fl,N(A, A) = VA for each A by Proposition 3.6(a), it follows that
(3.32)
for any A, p, where /\ and v denote the minimum and maximum respectively.
Consider a sequence Nt along which the limit
fl, = lim
N'
Tk(N')fl,N'
exists. Then fl, is invariant for the exclusion process on Zl by (the exclusion
version of) Theorem B7(g). Therefore, using the notation of Example 1.5, fl, is a
mixture of Va. a E [0, 1] and of vn , n E Zl. By Theorem 3.28
274 Part III. Exclusion Processes
if p ::: ~ ::: A,
(3.33) J,t(A, p)(10) = I ;(1 - A) if A ::: ~ and A + p ::: I,
pO - p) if p 2: ~ and A + p 2: 1.
Passing to the limit in (3.32) leads to
(3.34)
If A /\ P > ° °
or A v p < 1, (3.34) implies that J,t must be an average of the
Va, a E [0, 1] alone. Since we are assuming that A > and p < 1, the only case
in which we cannot yet reach this conclusion is A = I, p = 0. But in this case,
(3.33) tells us that J,t(1, 0)(10) = ~, so again it follows that none of the Vn are
involved in the representation of J,t as a mixture of extremal invariant measures.
To see this, note that for any mixture J,t of Va, a E [0, I], J,t(10) ::: ~, while for
any mixture J,t of Vn, n E Zl, J,t(10) = 0. It follows that in general,
Al\p
vay(da).
But by (3.33),
1
AVp 4 if p ::: ~ ::: A,
(3.37) ( a(1 - a)y(da) = {
A(1 - A) if A ::: ~ and A + p ::: 1,
JAI\P
p(1 - p) if p 2: ~ and A + p 2: 1.
Note that in each case, the right side of (3.37) is either the maximum value or the
minimum value of the function a(1 - a) for a E [A /\ p, A V p], and therefore,
°
y puts all of its mass on the point or points at which this extremum is attained.
This point is unique in all cases except < A < ~,A + p = 1. In this case, the
minimum is attained at both A and p. This proves the statement of the theorem in
all cases.
Next, we want to determine the value of a in (3.30). For this, we need to find
the asymptotics of J,tN{I] : I](k(N» = I}. Here is the expression that makes this
possible.
3. Invariant Measures for Processes on {I, ... , N} 275
. (2N - 2k - .) .
+ wCk-]Jj ,
N-k
,1 1 (1 )-J-]
WCNJj f;;J2N-2k-j N-k -p
t
(3.39)
+ _1_'-. (2n - j)DH], n::: 1.
j=] 2n - 1 n
t
(3.40)
+ - j-. (2n - j)DH](D + E).
j=] 2n - 1 n
As in the proof of (3.20), we have
DH] E = DH] + Dj + ... + D2 + c.
Using this in (3.40) gives
Applying (3.22) again, now with I = i-I, leads to (3.39) with n replaced by
n + 1. The statement of the proposition now follows by using the middle identity in
Proposition 3.17, (3.39) with n = N - k, and the fact that Dj+1Jj = (1 - p)-j-1Jj,
which comes from (3.18).
We can now use this expression to determine the value of a in (3.30). The
answer given below can be interpreted in the following way: There is shock that
is approximately uniformly placed on [0, N]. To the left of the shock, J-LN is
approximately VA, while to the right of the shock it is approximately vp. This
should be compared with Theorem 2.93.
Theorem 3.41. Suppose that p = 1, 0 < "A < ! and "A + p = 1. If k(N) satisfies
k(N)/N -+ a, 0 < a < 1, then
lim Lk(N)J-LN
N~oo
= (1 - a)vA + avp •
(3.43)
Applying (3.26) to the first expression on the right of (3.43) and Lemma 3.24 to
the second expression leads to
lim J-LN{TJ : TJ(k(N» = 1} ="A + a(1 - 2"A)
N~oo
as required.
there are no transitions at all, so the limiting distribution is the initial distribution.
One would expect for general r that the limiting distribution would be asymptotic
to vy(r) at -00 and to Vl-y(r) at +00. By the above remarks, yeO) = 1, yO) = !.
The question is, is it the case that y (r) > !
for all r < I? If so, then arbitrarily
small local perturbations in the dynamics do have global effects on the limiting
behavior of the system. This problem is open for r close to 1. It turns out that
Theorem 3.29 gives some information away from 1.
One case is easy to handle without using it. Suppose fL is invariant for the
system, and suppose that fL is asymptotic to Vy at -00. Since the current is constant
in equilibrium,
The right side of (3.44) is at most r, while the left side tends to y (1 - y) as
i -+ -00. Therefore
which is independent of i for i < 0, must be < ~. By the argument that led to
(3.45), this must be the case for r < ~. Next we will use Theorem 3.29 to extend
this to a larger range of r's.
Proof Let I]t be the exclusion process on {I, ... , N} with boundary conditions
1](0) = A = r, I](N + 1) = p = o.
The processes St and I]t have the same dynamics on {I, ... , N}, except for the
transition 0 -+ 1 at site 1. For I]t. this occurs at rate r, while for st. it occurs at
rate r if St- (0) = 1 and at rate zero otherwise. Therefore, the two processes can
be coupled so that St ::: I]t at all times, provided this is true at time O. It follows
that
(3.48)
and
whenever keN) --+ 00, N - keN) --+ 00. Passing to the limit in (3.49) gives the
result.
+ L
(y-x)=O
p(x, y)[J(y, Ty-xn - f(x, n],
4. The Tagged Particle Process 279
and
if y =1= 0, -x,
if y = -x,
if y = O.
See (B2) for the expression for 0.; the expressions for Q and 0. are analogous.
Note that if f(x, 0 = g(n, then Qf(x, 0 = 0.g(n, as it should be, indicating
that ~t is a Markov process in its own right.
Below we have the decomposition that expresses X t in tenns of the environ-
ment and a martingale. As we will see, the martingale part is fairly easy to deal
with. That will leave us with the task of detennining the asymptotic behavior of
the part that depends on the environment.
Then
(4.2)
where M t is a martingale.
Proof The process M t is defined by (4.2). Take f (x, 0 = x. Substituting into the
above expression for Q, we see that
Let jifbe the a-algebra generated by the process (X s , ~s) for s :'S t. To check the
martingale property of Mr, it suffices to take s < t and show that E [Mt - Ms I
.~] = O. Using the Markov property and the definition of Mr,
E[Mt - Ms I ~] = E[ Xt - Xs _ [t 1/I(~r)dr I ~]
= E(X,,(,) [ X t - s - Xo _I t
-
s
1/I(~r )dr l
280 Part III. Exclusion Processes
E(X,O[X t - x] = 1t E(x,01jJ({r)dr
for all t > O. In terms of the semigroup Set) for the process (X t , {t), this can be
written as
S(t)/(x, n-
I(x, n
= 1t S(r)Q/(x, ndr,
which is a consequence of Theorem B3, applied to the present semi group and
generator. (Strictly speaking, our I is not in the domain of Q, so one should
apply the above argument to a truncation of I and pass to a limit, but this step is
left to the reader.)
Proposition 4.3. For all t ::: 0, (ryt(X t + x), x =1= O} are i.i.d. random variables
with P (TJt (X t + x) = 1) = p. The process {t is stationary.
Proof For any x that is initially occupied, let X: be the position at time t of the
particle that was initially at x. In particular, X~ = Xt. In order to prove the first
part of the theorem, we need to show that for any finite A C Zd with 0 cJ. A,
(4.4) En{t(x)=pIAI.
XEA
pIAI+! =e p n
XEAU(O}
TJt(x) = f Ery[ n
XEAU(O}
TJt(X)]dV p
= pE {t(X).
XEA
The second part of the proposition is now a special case of the general fact
that a Markov process started off with an invariant measure is a stationary process.
°: :
Proof Since the transitions of X t correspond exactly to the shifts in 1;1> X t is a
function of {I;s, s ::: t}, say
(4.6)
Since appropriate functions of stationary processes are again stationary, the result
follows from Proposition 4.3.
Proof The idea is to deduce the ergodicity of I;t (with initial distribution vp (.) =
vp(' 11](0) = 1»
from that of 1]t (with initial distribution vp). The ergodicity of 1]t
follows from Theorem 1.17 and Theorem B52(a). We will argue by contradiction,
so assume that I;t is not ergodic. Then there is a set A of configurations 1] with
1](0) = 1 so that
(4.9)
and
(4.10)
for a.e. I; E A and all t > 0, i.e., A is invariant for the process I;t.
Here is one way to see this. Take a bounded continuous function G so that the
statement equivalent to ergodicity in Theorem B52(b) fails for some function F.
By Theorem B50,
W = lim -
t ..... oo
11t G(l;s)ds
t 0
exists a.s. By Fubini's theorem, we can consider this limit for the process with
initial configuration I; for a.e. I; (with respect to vp). Because the condition in The-
orem B52(b) fails, w(1;) = E~W is not constant a.s. Let also v(1;) = Var~(W) be
the variance of a random variable whose distribution is the conditional distribution
of W given 1;0 = 1;. Then
(4.11 )
282 Part III. Exclusion Processes
(4.12)
for a.e. S E B. It follows that A and B are closed for the part of the evolution
of 7]t that does not involve transitions to or from the origin. It is not invariant for
those transitions, of course, since 7] (0) = I on AU B. In order to find sets that are
invariant for all transitions of 7](, let
Since every transition for 7]t that involves the origin is a transition of St followed
by a translation, it follows that A and B are invariant for the process 7]t. Since 7]t
(with initial distribution vp) is ergodic, vp(A) and vp(B) are each either 0 or 1. If
vp(A) = 0, then vp(A) = 0, which contradicts (4.9). This, together with the same
argument applied to B implies that
(4.13)
In particular, A and B are not disjoint. This does not yet contradict the fact that
A and B are disjoint, since A and B are potentially much larger than A and B
respectively. So, we must work a bit harder.
We will argue shortly that (as a consequence of (4.13)) for a.e. 7] with respect
to vp , there are sites
satisfy properties (i)-(iv). Let N = {a,b,c,al, ... ,an,b l , ... ,bl,cl, ... ,cd.
Fix a time to and let ri' be the random configuration that agrees with 1'/ on N,
while on the complement of N it has the distribution that the exclusion process
would have at time to if it evolved starting with configuration 1'/, but allowing only
transitions on the complement of N. Recalling the graphical representation of the
exclusion process that is described in Section 1, we see that for any particular
way of transforming 1'/ into I'/a,c using only transitions in N, on a set of positive
probability, 1'/10 has the same distribution as I'/~,c' and the transitions on N have
occurred in the order dictated by that way of going from 1'/ to I'/a,c.
We will focus on two ways in which 1'/ can be transformed into I'/a.c, using
only transitions in N. They also provide two ways of transforming 1'/' into I'/~.c'
(1) Let {ij, 1 .:::: j .:::: m} be the successive values of i so that I'/(Ci) = 1. Move
the particle at Ci m to c, then the particle at cim~l to Ci m , ••• , and finally the particle
at a to Ci 1 • Since the particle at b has not moved in this sequence of transitions,
Tbl'/ E B, and B is closed for the process ~r. this shows that Tbl'/~,c E B a.s.
(2) Move the particle at b to C through the sites bi in a manner similar to (1)
above, and then move the particle from a to b through the sites ai. Recall at this
latter step that the sites ai are vacant by (ii). Since the particle originally at a is
now at b, Tal'/ E A, and A is closed for the process ~I' this shows that Tbl'/~,c E A
a.s.
But since A and B are disjoint, it cannot be the case that both Tbl'/~,c E B a.s.
and Tbl'/~,c E A a.s. This gives the required contradiction.
It remains to prove the existence of sites that satisfy properties (i)-(iv) for a.e.
1'/. For two distinct sites a, b, define C(a, b) to be the set of sites C for which there
is a path from a to C that avoids b and there is a path from b to C that avoids a.
Using the fact that the random walk is irreducible and is not nearest neighbor in
one dimension, we will show below that
Since AU B = {I'/ : 1'/(0) = I} modulo a null set, for a.e. 1'/, Twl'/ E AU B for
all w such that 1'/ (w) = 1. Fix an 1'/ with these properties, and (by irreducibility)
choose a path a = ao, ai, ... ,an, an+1 = b so that p(ai' ai+l) > 0 for each i,
where a, b satisfy (4.15). Then among the i's such that I'/(ai) = 1, there must be
two successive ones so that Taj 1'/ E A for the first of these, and Taj 1'/ E B for the
second. Therefore, by using these as new choices of a and b, we may assume that
l1(ad = 0 for 1 .:::: i .:::: n. This gives properties (i) and most of (ii). To get the rest
of (ii), and (iii) and (iv) as well, choose C E C(a, b) such that C -=f. ai for all i and
284 Part III. Exclusion Processes
TJ(C) = O. This choice is possible since there are infinitely many vacant sites in
C(a, b).
Finally, we need to check (4.14). We will consider only the case d = 1 - the
higher dimensional case is similar. Without loss of generality, we can take a < b,
and assume that p(O, x) > 0 for some x > 1. By irreducibility, there is a y < 0 so
that p(O, y) > O. Again by irreducibility, there is a path from 0 to 1; call it n. If
i > 0 is sufficiently large, then the path that begins with {b +x, b +2x, ... , b +i x},
and then continues with any number of shifts of n will remain to the right of b,
and hence will avoid a. Thus for a sufficiently large Co, for every c ::: Co there is
a path from b to c that avoids a. Similarly, if i is sufficiently large, then the path
that begins with {a + y, a + 2 y, . .. , a + i y}, and then continues with some number
of shifts of n will remain to the left of a, and will end at some Z for which b - z is
not a multiple x. Then, continuing this path by adding {z + x, z + 2x, . .. , z + j x}
for a large j (thereby avoiding b), and following it with any number of shifts of
n, leads to a path from a to any sufficiently large positive c, while still avoiding
b. Thus we conclude that C(a, b) contains a half line of the form [co, (0). This
concludes the proof of Proposition 4.8.
Just as Corollary 4.5 followed from Proposition 4.3, we get the next result as
a consequence of Proposition 4.8:
Theorem 4.17.
EXt = t(1 - p)m,
and
. Xt
11m ~
(4.18) - = (1- p)m
t--+oot
Proof Taking expected values of (4.2) and using the martingale property of Mt
and the stationarity property of ~t gives
Applying the ergodic theorem (Theorem B50) to both terms on the right side of
(4.2) gives (4.18).
4. The Tagged Particle Process 285
°
behavior. We will look at the first term on the right of (4.2) a bit later. Let N (0, I;)
denote the multivariate normal distribution with mean and covariance matrix I;.
Proposition 4.19.
Mt
(4.20) .ji => N(O, I;),
Proof We will write the proof in case d = 1 to simplify the notation. The proof for
general d is the same, except that quantities of interest are multiplied by arbitrary
vectors in Rd, in order to make them one dimensional. Since
sup
n:::t:::n+l
IMt - Mn I :s L Ivlp(O, v) +
v
sup
ns:::n+l
IX t - Xn I
by (4.2), and the last term above is dominated by a constant multiple of a Poisson
distributed random variable, it is enough to prove (4.20) along the integer sequence
t = n.
Define D. n by
n
Mn = LD.k,
k=l
Mn
In => N(O, (J
2
)
k=l
(4.22)
where
286 Part III. Exclusion Processes
By Propositions 4.3 and 4.8, the random variables on the right of (4.22) are
-1 1/I(~s)dsr
stationary and ergodic, so that (4.21) follows from the ergodic theorem, Theorem
B50, where
1
a 2 = E[ XI = EMf.
EM; = tEMf.
This allows us to compute
a 2 = lim EM(
1,),0 t
= lim ~E[XI _
t,j,o t
r 1/I(~s)dS]2 = lim VarX
J0 1,),0 t
1,
where the last equality comes from the fact that (since 1/1 is bounded)
so that
a 2 = (1- p) Lip(O, y).
y
as if they were one dimensional. If d > 1, the arguments are then applied to one
coordinate at a time. For example, decomposition (4.23) is simply the statement
that each component of the left side can be expressed as the sum of a (one
dimensional) martingale and a negligible process.
The decomposition (4.23) is based on the solution u).. to the resolvent equation
(4.24)
for A> 0, where 1/!(S) = 1/!(I;) - (1 - p)m. The solution to (4.24) can be written
down explicitly as
(4.25)
where Set) is the semigroup for the process {to The integral converges since A > O.
To check that the right side of (4.25) solves (4.24), simply put it into (4.24) and
integrate by parts, using the fact that
-- - d- -
Q S(t)1/! = -S(t)1/!.
dt
(See Theorem B3.) Now we can at least write down a first approximation to the
desired decomposition (4.23).
To begin, note that (4.2) can be written as
since I(x, S) = x and QI(x, S) = 1/!(S) - see the beginning of the proof of
Proposition 4.1. Applying the argument in the proof of Proposition 4.1 to the
generator Q and the function u).. gives the analogous expression
(4.26)
where N).. is a martingale with stationary ergodic increments. Using the resolvent
equation (4.24), this can be written as
where
D)..(t) = i t AU).. ({s)ds - u)..({t) + u)..({o).
The idea now is to pass to a limit in (4.27) as A -+ 0 to get (4.23). In order to do
so, it is necessary to have some control of the behavior of u).. as A -} O.
288 Part III. Exclusion Processes
f
and
~h(U) = ~ L p(O, x)[u(rx1) - U(1)]2dvp .
{x:ry(x)=O}
The subscript ex stands for exclusion or exchange, while the sh stands for shift.
Here is our basic assumption: There is a constant C so that
(4.28a)
and
(4.28b)
We state the next result for finite state Markov chains to avoid problems with
convergence of sums, but corresponding statements for more general processes
can usually be obtained easily by passing to limits,
4. The Tagged Particle Process 289
Lemma 4.29. Suppose the chain has a finite state space. Then
(a)
- L U(1])Q'u(1]);rr(1]) = M'(u),
and
(b) if the Markov chain is reversible with respect to;rr, i.e., if the expression
a(1], n = ;rr(1])q(1], n = ;rr(nq(~, 1])
is symmetric, then
Proof For part (a), write out both sides explicitly, and cancel common terms. The
resulting identity that must be checked is
To check it, interchange the roles of 1] and ~ in the sum on the right, and use the
fact that ;rr is invariant:
For part (b), use the symmetry of a(1], n to write the left side of (4.30) as
[LU(1])a(1], n[v(n- V(1])]f = [La(1], nU(1])v(n- La(1], n U(1])V(1])f
ry.t ry,t ry,t
:s ~'(u)~'(v),
where the final step comes from the Schwarz inequality.
Now we return to the exclusion process. One application of the last result is the
following. Multiply (4.24) by u).. and integrate with respect to vP ' using Lemma
4.29(a), to get
Proof The process (Xt. St) is reversible with respect to the product measure that
is counting measure on the first component and vp on the second (since p is
symmetric). Recalling from the proof of Proposition 4.1 that 1/1 = 1/1 = fl.!, where
lex, l;) = x, (4.28a) follows essentially from (4.30), since Ly
p(O, y)lyl 2 < 00.
It doesn't quite follow directly, because both sides of (4.30) are infinite (since
each x gives rise to an identical term). To fix this, simply replace p(x, y) by
1
p(x, y) if x, yET
PT(X, y) = ~ ifx=y~T
otherwise,
(4.33)
To check this, consider (4.31). Neglecting the first term on the left, and using
(4.28a) to bound the term on the right gives
and this implies (4.33), since M(u)..) < 00 for each A by (4.31).
u -+ f gudv p
IIgll-1 = IIG(g)lh·
Let H_I be the completion (after modding out functions of norm 0 again) with
respect to II . II-I. This is again a Hilbert space. Note that in this language, our
basic assumption (4.28) becomes
The only hard part of the equivalence is the uniform boundedness principle (The-
orem 5.8 in Rudin (1966», which is used to deduce (4.37).
Then
Next we will see that the solution to the resolvent equation U,l,. has some useful
properties as A t 0 that will enable us to pass to the limit in (4.27) to construct
the decomposition (4.23).
Furthermore,
292 Part III. Exclusion Processes
Therefore, the strong convergence AU). ---+ 0 in L 2 Cvp ), together with (4.35), im-
plies the weak convergence AU). ---+ 0 in H_ I . Recalling (4.24), it follows that
Qu). ---+ -1/1 weakly in H_ I .
Applying Lemma 4.38, there is a sequence Vn so that Vn is a convex combi-
nation of {UI, ... ,un} for each n so that Vn ---+ W strongly in HI and QVn ---+ -1/1
strongly in H_I. From the proof of Lemma 4.38, it is clear that the same convex
combinations can be used in the two cases. Again, by the definitions of the two
Hilbert spaces,
Therefore,
and hence
Ilwllf = f w1/ldvp.
Passing to the limit in (4.31) using (4.41) and the fact (from 4.34) that 1/1 E H_I,
we get
lim An
n---*oo
f u~dvp = 0 and lim lIunllf
n~oo
= IlwllT-
4. The Tagged Particle Process 293
The latter fact (together with the weak convergence of Un to w) implies that
Un ~ w strongly in HI, since
so that
(4.44)
Now take two subsequences An and A~ as in the first part of the proof, with
UAn ~ w and UA~ ~ w' strongly in HI. Recalling that AUA ~ 0 weakly in H_I,
it follows from (4.44) that IIw - w'III = 0 as required.
Theorem 4.45. Suppose (4.28) holds. Then there is a martingale N(t) with sta-
tionary ergodic increments so that
(4.46) 10 1
1jJ(ss)ds = N(t) + D(t),
(4.47)
and
Proof Begin with (4.26). Since NA is a mean zero martingale, ENi(t) = tENi(1)
is linear in t. As in the proof of Proposition 4.19, we compute the factor by looking
at small t. The first term on the right of (4.26) is of order t as t ~ 0, so
· ENi(t)
11m = l'1m ----"------=--
E[UA(S/) - UA(SO)]2
I,J,O t I,J,O t
. 2 f u~dvp - 2 f uAS(t)uAdvp
= hm --"----'-'---'-----"------'-
t-J,O t
= - 2 lim
t,J,O
f UA
S(t)UA -
t
UA
dv p
= - 2 f UAQuAdvp = 211uAIif.
294 Part III. Exclusion Processes
Therefore
(4.49)
Applying the same argument to the differences in (4.26) for two different values
of).. gives
E[N),,(t) - NA2(t)]2 = 2tllu A1 - UA211~.
By Theorem 4.39, UA is Cauchy in HI. Therefore, for any t, NA (t) has an L2 limit
as ).. t O. The martingale property allows us to conclude that there is a square
integrable martingale N(t) so that NA(t) -+ N(t) in L2 for every t. By (4.27),
DA (t) has an L2 limit as well, which we will call D(t). This gives (4.46). For
(4.47), use (4.31) to write
so that IluAllI .:::: 111/111-1' Now pass to the limit in (4.49) as ).. t O.
It remains to prove (4.48). Recall from the discussion that led to (4.27) that
DA can be expressed as
f
ED~(t).:::: 6 u~dvp +3)..2 E [l t UA(~s)dSr .:::: (6+3)..2 t 2) u~dvp. f
Since D(t) = DA(t) + [NA(t) - N(t)] by (4.27) and (4.46), this implies that
Divide this by t, put).. = lit, and let t -+ 00. Using the second statement in
Theorem 4.39 and the L2 convergence of N A(1) to N(1) leads to (4.48).
In fact, the same proof applies to the martingale Mt+N(t). Combining this with
(4.48) leads to the following central limit theorem for the position of the tagged
particle. Recall that we have proved (4.28) for symmetric systems (Proposition
4.32), and that it has been proved in many other cases.
Lemma 4.51. Suppose m = 0 (and p(., .) is not nearest neighbor in one dimen-
Sion}. Then there is a constant C so that
(4.52)
Remarks. Recall that 1/1 = 1/1 if m = O. Note that the above statement is not true
in the one dimensional nearest neighbor case. In that case, 1/1(17) = ~(-li-~(1). If
u is of the form
L a (k)17(k)
00
U(17) =
k=1
where a(k) = 0 for all but finitely many k's, then (4.52) becomes
L [a(k) -
00
where a(x) =fo 0 for only finitely many x's and LxiOa(x) = O. Therefore there
are numbers a (x, y), with only finitely many of them nonzero, so that
One way to argue this, is inductively on the number of nonzero a(x)'s. For the
induction step, take any two nonzero sites x, y so that a (x) =fo 0 and a (y) =fo O.
The contributions to (4.53) due to them are
we see that in representation (4.54), we may assume that a(x, y) =fo 0 only if
p(x, y) > O.
Since vp is invariant under permutations of nonzero coordinates, if x, yare
nonzero,
= f [1J(y) -1J(x)]u(1Jx,y)dvp •
Since a (x, y) =fo 0 for only finitely many pairs (x, y), the second factor on the
right is finite. Since a(x, y) =fo 0 only if p(x, y) > 0, the third factor on the right
is bounded by a constant multiple of ~xCu), so the result follows.
Now we can complete the statement of the central limit theorem for X t .
4. The Tagged Particle Process 297
Proof Begin again by multiplying (4.24) by UA and integrating with respect to vp.
Drop the first term on the left and use (4.52) to bound the term on the right. The
result is
(4.56)
+
If IluA111 -+ 0 as A. 0, then (4.49) implies that NA(t) -+ 0 in L2 for each t
and hence N(t) = O. In this case, the ~ in Theorem 4.50 is the same as the ~ in
Proposition 4.19, which is not zero. Therefore, we may assume IluA111 fr 0, and
hence by (4.56),
(4.57)
The final step is a modification of the first argument in the proof of Theorem
4.45. Let gA(X, n
= x + uA(l;). Combining (4.2) with (4.26) and recalling that
QUA = Qu since UA is a function of ~ alone, one can write
. E[NA(t) + M t ]2 . E[gA(Xt'~t)-gA(XO'~O)]2
lIm = lIm - - = - - - - - - - - - - - - ' ' - -
q,o t tin t
+ jLP(O,Y)[I-l;(Y)J[gA(y,ryn-gA(O,n]2dVp
y
and hence
E[N(t) + Mt ]2 ::: 2t limsup~xCuA)'
AiO
Combining this with (4.57) gives the result.
298 Part III. Exclusion Processes
for all t ::: O. Note that this is a generalization of (1.14). He then used (5.1) to prove
the following pointwise ergodic theorem: If S = Zd and p(x, y) = p(O, Y - x),
and if the initial configuration satisfies
then
(5.3) lim ~ 10
t-+oo t
r I(TJs)ds = f Idvp a.s.
for every continuous function I on {O, l}s. This had been proved earlier by Andjel
and Kipnis (1987) for d ::: 3, and for all d if I depends on only one coordinate.
Note that (5.2) is simply the hypothesis of Theorem 1.13 in case of deterministic
initial configurations (with a = constant - recall that 9(j consists only of constants
in the translation invariant context).
The negative correlation inequality (5.1) should be compared with Theorem
B17, which provides a positive correlation result for attractive spin systems. The-
orem Bl7 asserts in particular that I(TJt) and g(TJt) are positively correlated for
deterministic initial configurations for all increasing continuous functions I and
g, while (5.1) applies only to very special increasing functions - those of the form
if TJ == 1 on A
otherwise.
Occupation Times for Symmetric Systems. Statement (5.3) when !(11) = 11(0) is a
strong law of large numbers for the occupation time of the origin. Kipnis (1987)
proved the associated central limit theorem in the case of the nearest neighbor ex-
clusion process on Zd with initial distribution vp. The result displays an interesting
dimension dependence. Here it is:
f~ I1s (O)ds - pt 2
~---- => N(O, a ),
I
bet)
where
r
3
if d = 1,
b(t) = ~tlogt if d = 2,
Jt if d :::: 3,
and
4v'2 if d = 1,
a2 =p(1-p)x 1 3.jii
1 if d = 2,
; fooo Ps(O, O)ds if d :::: 3,
where Ps (x, y) is the probability that a simple random walk on Zd that starts at
x will be at y at time s.
There are also large deviations results in this context. Arratia (1985) and
Landim (1992) proved that if ex > p, then
if d = 1,
if d = 2,
if d :::: 3.
Process level large deviations results for the symmetric exclusion process on
Zd, d :::: 3 have been obtained by Quastel, Rezakhanlou and Varadhan (1999).
Occurrence of Rare Events. During the past decade and a half, a number of results
have been proved that say that the time at which a rare event occurs is nearly
300 Part III. Exclusion Processes
exponential. For Markov chains, see Aldous (1982), for example. In interacting
particle systems, there are results of this type for stochastic Ising models (Schon-
mann (1991), Schonmann and Shlosman (1998» and for zero range processes
(Ferrari, Galves and Landim (1994», among others. Here is a result of this type
for the symmetric exclusion process on Zd with nearest neighbor jumps. It was
proved for d = 1, pi = 1 by Ferrari, Galves and Liggett (1995) and for general
d, pi by Asselah and Dai Pra (1997).
Let 17t be the process with initial distribution vP' and set
and
(5.4)
IETI n
XEA
[17t(X) - PTI(17t(X) = I)JI.:::: CIAlt-IAI/8.
In the companion paper (1991b), they prove the following strong form of (5.3) in
this case: If 0< a < 1, n > 1, and 17 E {O, l}Zl, then
1 It+t. j(Tx17s) -
lim sup 1 -;;
t-->oo Ixign t t
f jdva(x,t)
I
= ° a.s.
TJo, which have particles exactly on the even sites and odd sites respectively. In
this case, the covariance is asymptotic to -et-~, where e is an explicit constant.
For symmetric exclusion processes in higher dimensions, one can prove al-
gebraic rates of convergence in L 2 (v p). Deuschel (1994) did so for some related
processes, and indicated that the same technique gives analogous results for sym-
metric exclusion processes.
For the asymmetric nearest neighbor exclusion process on Zd, Cancrini and
Galves (1995) proved that if the initial configuration is periodic, or if the initial
distribution is stationary and exponentially mixing, then there is a constant e
(depending on the initial distribution) so that
d
d nd 2d ( log t )
Ip(TJt == 1 on {I, ... , n} ) - pis en Jt
The Finite Symmetric System. Consider two particles moving according to a sym-
metric, translation invariant exclusion process on Zd, where Lx Ix 12 p(O, x) < 00,
and two particles with the same initial states moving according to the same rules,
but without the exclusion interaction. In an unpublished manuscript, Andjel ob-
tained the following upper bounds for the total variation distance between the
distributions of these two systems:
e logt
if d = 1,
Jt
e logt if d = 2,
t
e if d :::: 3,
where e is a constant. In the one dimensional nearest neighbor case, he was able
to remove the logarithmic term.
The Asymmetric System on a Finite Torus. Fill (1991) studied rates of convergence
to equilibrium for the system of k particles that move clockwise at rate 1 on a
discrete circle of size N with exclusion. The particles are regarded as labelled, so
that for a given initial configuration, there are kG) possible configurations at later
times. The limiting distribution f.1 is uniform on that set of configurations. Let f.1t
be the distribution at time t, where the initial state is deterministic, and let II . II
denote the total variation norm. Here is one of Fill's results. Take N = 2k. Then
have been written on this topic are listed in the Bibliography, and can be identified
by the presence of the word hydrodynamics in the title. We list here only some:
Rost (1981), Kipnis, Olla and Varadhan (1989), De Masi and Presutti (1991),
Landim (1991), Rezakhanlou (1991), and Varadhan (1994a).
Here is an informal description of the type of result that falls under the rubric
of hydrodynamics: Consider a translation invariant exclusion process on Zd, where
the individual particles have a drift
m = LXp(O, x) =1= O.
x
Suppose that u is a reasonable function on R d , and the initial distribution fl~ for
the exclusion process is close to being a product measure with density
VA if A S ~,
{
(5.5) lim vA.pS(t) =
1---7>-00
Vl
2
if p S ~ SA,
Vp
·f 1
1 P 2: 2'
follows from these more general results. To see this, recall our discussion of (2.3)
in case A > p: The solution with initial condition (2.4) is
5. Notes and References 303
if x ::: (p-q)(1-2A)t,
if (p - q)(1 - 2)..)t :::: x ::: (p - q)(1 - 2p)t,
if x ::: (p - q)(1 - 2p)t.
The Weakly Asymmetric Exclusion Process. This refers to a system in which there
are symmetric jumps at a fast rate E- i , and completely asymmetric jumps at rate
1. De Masi, Presutti and Scacciatelli (1989) consider a one dimensional process
of this type, and prove that for times of order 1, the hydro dynamical behaviour is
governed by the linear heat equation (as it would be in the symmetric case), while
for times of order E- i , the relevant equation is a (nonlinear) Burgers equation (as in
the asymmetric case). Other papers on this topic are Gartner (1988), Dittrich (1990,
1992), and Dittrich and Gartner (1991). Ravishankar (1992b) deals with similar
issues for a two-dimensional process in which the asymmetry applies to only one
direction. The weakly asymmetric process in a regime that leads to a nonlinear
stochastic partial differential equation was studied in Bertini and Giacomin (1997).
This corresponds to the case ).. = 1, p = O. It was clear by symmetry that the
answer had to be \J l, but at that time it was not even clear how to prove that.
The material id this section is based on Ferrari, Kipnis and Saada (1991),
Ferrari (1992a) and Ferrari and Fontes (1994a, 1994b). In Ferrari (1992a) and the
papers that followed, the property'" 1L).,p is defined as in (2.8), but without the
Cesaro averaging, and the analogue of Theorem 2.16 and corresponding result for
304 Part III. Exclusion Processes
Zt (see the discussion surrounding (2.9)) are asserted for this stronger version of
the property. Only the weaker fonn follows directly from the arguments given
there, however. It is possible to obtain the stronger (non-Cesaro) statement for Xt,
but that, together with (2.9), does not imply the stronger statement for Zt. Since
Zt = Xt if p = 1, there is no difficulty in this case.
Proposition 2.10 (for a system with only one class of particles) is due to Harris
(1967). The proofs given here are taken from Ferrari (1986) and Ferrari (1992b)
respectively. In the 1986 paper, Ferrari identifies all the invariant measures for the
process viewed from the tagged particle.
Ferrari, Kipnis and Saada proved the strong law version of Theorem 2.34,
rather than the weak law presented here. Theorem 2.43 was conjectured by Spohn
(1991). Proposition 2.74 was proved by Giirtner and Presutti (1990) for A = 0,
p=l.
The crucial variance computation given in Proposition 2.56 is Theorem 3.1 of
Ferrari and Fontes (1994a). The expression given here looks quite different from
that in the paper, but it is not hard to check that it is, in fact, the same. In carrying
out the verification of this, the reader should keep in mind that the roles of A and p
are reversed in the paper, and that our Ut and Vt are called Rt and R t respectively
in the paper.
°
Theorem 2.93 was proved for A = 0, a = by Wick (1985) in case p = 1 and
by De Masi, Kipnis, Presutti and Saada (1989) for p > !. The latter paper also has
°
a central limit theorem for XI> which is, in this case, the position of the leftmost
particle. Theorem 2.93 was proved for A + p = 1, a = by Andjel, Bramson
and Liggett (1988). (A Cesaro version of the statement was proved earlier by
Andjel (1986).) This is the third case in Corollary 2.102. The other two cases
were proved by Liggett (1975). They were extended to more general (not nearest
neighbor) exclusion processes by Liggett (1977).
The connection between the exclusion process and queuing theory that was
used in the proof of Proposition 2.61 was first employed by Kipnis (1986). This
connection was exploited in the opposite direction recently when Mountford and
Prabhakar (1995) used ideas about the exclusion process to settle an old problem
about series of queues. To state their result, recall Theorem B59, a special case
of which says that that if a stationary Poisson process of rate A < 1 is fed into
a single server queue, where the service is exponential of rate 1, then the output
process is again Poisson with rate A. Now suppose the input process is a general
stationary ergodic point process X of rate A < 1. Then the output process is
another stationary ergodic point process of rate A - call it T X. This can be fed
back into another queue of the same type, and the output process is then T2 X.
The Mountford-Prabhakar theorem states that
r X:::} the Poisson process of rate A
as n -+ 00. For other connections between exclusion processes and queuing sys-
tems, see Srinivasan (1993) and Seppiiliiinen (1997).
Here are some other results that have been proved for the nearest neighbor
asymmetric exclusion process on ZI:
5. Notes and References 305
The Rarefaction Fan. Consider the process with P = 1 and initial distribution vA•P
on Zl \ {OJ, A > p, and put a second class particle at the origin. Let Zt be the
position of the second class particle at time t. Ferrari and Kipnis (1995) proved that
Zt! t converges weakly as t ---+ 00 to the uniform distribution on [1 - 2A, 1 - 2p].
To understand the limiting distribution, recall that a second class particle in a sea
of first class particles of density y has drift (1 - 2y) ~ see Proposition 2.57. So,
a second class particle well to the left of the origin would travel at speed 1 - 2A,
while a particle well to the right would travel at speed 1 - 2p. The result asserts
that a second class particle starting at the origin chooses what speed to use at
random from among the allowed possibilities.
Ferrari and Kipnis also consider what happens when initially there are two
second class particles, one at 0 and the other at 1, the negative sites are occupied
by first class particles, and the other positive sites are empty. The second class
particles coalesce at rate 1 when they are at adjacent sites. If Z~ and Z/ are their
respective positions at time t, they prove that
Evolution of the Finite System. Schutz (1997b) computes the exact distribution of
the finite exclusion process At in the totally asymmetric case P = 1. The anSwer
I
is given in terms of the function Fm(n, t) that is defined for n :::: 0 by
Fm(n, t) =
e-
t
b (k + 1)
00 m-
m- 1 (k
t k +n
+ n)! if m :::: 1,
e
-t
b(-) (1m I)
Iml 1k
k
t k +n
(k+n)! if m :s o.
If A = {Xl, ... ,XN} and B = {YI, ... , YN} are configurations of size N, written
so that Xl < ... < XN and YI < ... < YN, then pA(A t = B) is the determinant
of the N x N matrix whose (i, j) entry is Fi-j(Yi -Xj, t).
This is reminiscent of the following old result by Karlin and McGregor (1959).
Suppose XI(t), ... ,XN(t) are independent continuous time birth and death pro-
cesses On Zl with transition probabilities Pt(x, Y), i.e., Markov chains that move
only One step to the left or right at each transition. Let G be the event that
XI(s) < ... < XN(s) for all s:s t. If Xl < ... < XN and YI < ... < YN, then
is the determinant of the N x N matrix whose (i, j) entry is Pt (Xi , Yj). Note that
if these chains move only to the right at rate 1, then Pt (x, y) = Fo (y - x, t).
306 Part III. Exclusion Processes
Shift Equivalent Measures. Consider the context of (2.7), in which the exclusion
process is viewed from the location of a second class particle. If the initial dis-
tribution is vA,p on ZI\{O}, Derrida, Goldstein, Lebowitz and Speer (1998) have
proved that the two measures obtained by treating the second class particle as
either a first class particle or an empty site are random translates of one another.
(5.7b)
(5.7c) p(l - P)ftN(EI ... EN-II) - qPftN(EI .•. EN-IO) = CNftN-I (EI ... EN-I)
for all choices of E/S in {O, I}. Note that (5.7b,c) can be thought of as versions
of (5.7a) for i = 0 and i = N respectively, since the boundary conditions have
the interpretation of making 11(0) = 1 with probability A and I1(N + 1) = 1 with
probability P, independently of the configuration on {I, ... , N}. By summing over
all values of the Ej'S, it is clear that eN is the net rate at which particles move
to the right in equilibrium for the process on {I, ... , N} - i.e., the current. He
then used this recursion to obtain the asymptotics of ftN as N ---+ 00, and then to
compute
lim vA,pS(t)
t~oo
They then observed this implies the following square root law for the spatial decay
of the system that is obtained by formally letting N ---+ 00:
. 1 1
hm ftN{11 : 11(i) = I} ~ - + '-"" i ---+ 00.
N~oo 2 2", rri
5. Notes and References 307
The recursions discussed above were the precursors of the matrix method,
which is the main subject of Section 3. In fact, it is easy to check that (5.7) is
a consequence of Theorem 3.1 (provided, of course, that there exist D, E, w, v
satisfying its assumptions), with (3.2a,b,c) being used to check (5.7a,b,c) respec-
tively. Besides, equation (3.5), which is the key to the proof of Theorem 3.1,
is a restatement of (5.7). The matrix method was introduced by Derrida, Evans,
Hakim and Pasquier (1993a) and used extensively in a series of papers by Derrida
and various coauthors. Much of the material in Section 3 is based on the paper
mentioned above and the review paper by Derrida and Evans (1997). Some of
these results were obtained independently by Schlitz and Domany (1993) using
the recursions directly. The analysis of the partition function corresponding to
Corollary 3.27 was carried out by Sandow (1994) for the general case p > !,
though parts of the argument do not appear to be entirely rigorous. Theorems
3.28 and 3.29 were originally proved in Liggett (1975) for general p > and in !
Liggett (1977) for one dimensional exclusion processes with Lx Ixlp(O, x) < 00
and Lx xp(O, x) > O.
Theorem 3.47 is taken from lanowsky and Lebowitz (1994). In that paper,
the authors observe that the same monotonicity arguments that are used in the
proof of Theorem 3.47 can be used to show that the current for the system on
{- N + 1, ... , N} with blockage between sites 0 and 1,
The Diffusion Constant. Consider the system on {I, ... ,N} with p = 1 in equi-
librium, and let Yt be the number of particles that have entered {l, ... , N} by
308 Part III. Exclusion Processes
EYt
- = A/LN{11 : 11(1) = O} = /LN(10),
t
which is the current whose asymptotics are given in Theorem 3.28. Derrida, Evans
and Mallick (1995) have computed the asymptotic variance
. Var(Yt )
/}.N = hm - - - .
t--+oo t
Their expression is rather complicated - see (58) in the paper - but simplifies
significantly in two cases: (a) if A = p,
I A(1I-A)I2A-l1
4JriN
ifA=F!,
if A = 2'
1 N -+ 00,
/}.N = 4(2N
3(N + 1)
+ 1)(4N + 3) e 2N+I
NN+2)2 ~
3",2:rr
64,.[Fi'
N -+ 00.
Analogous results for the exclusion process on {I, ... , N} with periodic boundary
conditions (i.e., where one identifies sites 0 and N) were obtained by Derrida,
Evans and Mukamel (1993) in case p = 1 and by Derrida and Mallick (1997) for
general p.
Exponential Rate of Growth. The next step after considering the asymptotics of
the first two moments is to study the behavior of exponential moments. Derrida
and Lebowitz (1998) have done so for the process with p = 1 with periodic
boundary conditions. To state their result, consider the process with n particles on
{I, ... , N}, and let Yt be the total distance travelled by all the particles by time
t. Then
log Ee"Y'
y (a) = lim ---=----
t--+oo t
can be computed by solving
yea) = -n 8 1
00
Nk _ 1
(Nk -
nk
1) xk
and
a = - f_l
k=1 Nk
(Nk)xk
nk
simultaneously, eliminating the x.
5. Notes and References 309
Invariant Measures Viewed from the Location of the Shock. In Section 2, we saw
that for the asymmetric system on Z I, the location of the shock can be thought
of as the position Zt of a second class particle moving in a sea of first class
particles. To understand the microscopic structure of the shock, it is natural to
study invariant measures for the process of first class particles, when viewed
from Zt. That such invariant measures exist was proved by Ferrari, Kipnis and
Saada (1991). Using the matrix approach, Derrida, Lebowitz and Speer (1997)
were able to write down these measures explicitly. Corresponding expressions
in case p = 1 were obtained earlier by Derrida, Janowsky, Lebowitz and Speer
(l993).
To describe these results, let TIt be the process of first class particles on Zl,
viewed from the position of a single second class particle - the second class
particle is always placed at O. Take 0 :::: A < p :::: 1, and consider three matrices
v w
D, E and A and two vectors and that satisfy the following analogue of (3.2):
is well defined and invariant for TIt. Furthermore, the measure is asymptotically
VA at -00 and vp at +00, with this convergence being exponentially rapid.
In fact, the single site probabilities are expressed in the following very explicit
form. Let Xn be a random walk on Zl that starts at 0, moves one step to the
right with probability p(l - A), one step to the left with probability A(l - p) and
remains where it is with the remaining probability, (1 - A) (l - p) + Ap. Note that
this random walk has a drift to the right, since A < p. Define
qk
f(k) = k Pk -q k
for k =1= 0, letting f(O) be arbitrary, and in terms of f,
I
F(k) = __ [p2(l - A)2 f(k + 2) - p(l - A)(A +p- 2Ap)f(k + I)
P-A
+ A(l - p)(A +p - 2Ap)f(k - I) - A2(l - p)2 f(k - 2)].
The single site probabilities for the negative sites can be obtained by symmetry:
Related results can be found in Sandow and Schiitz (1994) and Schiitz (1997a).
Speer (1997) showed that if "A > 0, p < 1, then system (5.8) has a finite
dimensional representation if and only if
( _pq)r = "A(l _ p)
p(1 -"A)
t+s-r )
- 2{s - r +y - x) cosh- I (
2,Jt{s - r + y - x)
Exclusion Processes with Different Update Rules. If one wishes to simulate the
exclusion process, one is naturally led to consider various rules that could be used
to determine the order in which the states of the various sites are updated. For
the process on {l, . " , N} that is studied in Section 3, for example, updates must
be performed at each endpoint of the interval, and at each nearest neighbor pair
of sites. One can imagine updating these in a random order, for example, or se-
quentially from left to right. Rajewsky, Santen, Schadschneider and Schreckenberg
(1998) use the matrix approach discussed in Section 3 to compare systems with
different update rules. That paper contains other references on this topic.
Exclusion Processes with Spontaneous Births and Deaths. Ferrari and Golstein
°
(1988) consider the symmetric nearest neighbor exclusion process on Z3 with the
addition of births at 0 at rate f3 and deaths at at rate 8. For each p E [0, 1], there
is an extremal invariant measure JL p for this process that has asymptotic density p
at 00. (Invariant measures for this type of process where spontaneous births and
deaths are allowed at any site are discussed by Schwartz (1976).) This is a product
measure if and only if p = f3! (f3 + 8). In all other cases, the covariance of 11 (x)
and 11 (y) relative to JL p for x, y -=f=. 0 lies between two negative constant multiples
of
Exclusion Processes with Spin System Dynamics. The exclusion process has been
added to other particle systems dynamics in several contexts. Its addition to the
contact process was mentioned in Section 5 of Part I. Here we consider its addition
to one dimensional reversible spin systems. The process has state space {O, 1}ZI
and the following transitions:
b2 + 1)) = 010 or 101,
1
if (7] (x - 1), 7](x), 7](x
7] ~ 7]x at rate ab if (7] (x - 1), 7](x), 7](x + 1» = 001,100,110 or 011,
a2 if (7] (x - 1), 7](x), I1(X + 1» = 000 or Ill,
5. Notes and References 313
if Ix - yl = 1,
17 -+ 17x,y at rate { ~ if Ix - yl =1= 1.
This process was first considered by De Masi, Ferrari and Lebowitz (1986), who
proved hydrodynamic type results for it. We will be concerned with the issue of
ergodicity: Does this process have a unique invariant measure? Especially, for
fixed a, b, what happens if M is very large or very small?
Here are the known results:
(a) If %> 1,
the process is ergodic for all M by Theorem 4.1 of Chapter I of
IPS.
(b) If %> ~, the process is ergodic for sufficiently large M, as was proved by
Brassesco, Presutti, Sidoravicius and Vares (1999).
(c) For any strictly positive a, b, the process is ergodic for sufficiently small
M - see Neuhauser (1990).
The process is clearly not ergodic if a = 0, since then the pointmasses on
17 == 0 and on 17 == 1 are invariant. The most interesting open problem involves
the case % < ~ and M large. In this case, one can make a heuristic argument for
nonergodicity as follows: Let the distribution at time t be ILt, and assume that the
initial distribution is shift invariant. Then
d
-ILt(1) = b2ILt(101) + 2abILt(100) + a 2ILt(000)
dt
- a 2ILl (111) - 2abILI (110) - b2ILl (010).
If ILl is the product measure with density p, then the right side above is
1
p=- and p(1 _ p) = (_a_)2
2 b-a
The system
d
(5.10) -pet) = f(p(t))
dt
has these three roots as fixed points; p = 1
is unstable, and the other two are
stable.
Here is the heuristic part of the argument: If M is large, then Theorems 1.10
and 1.13 suggest that the distribution of the process at time t is close to being
a product measure, since the exclusion part of the evolution should dominate
the spin-flip part. If this were the case, one would expect the process to have two
extremal invariant measures corresponding to the two stable fixed points of (5.10).
It would be quite interesting to determine whether this is in fact the case.
314 Part III. Exclusion Processes
Here is an argument that counters the above heuristic: If a > 0, the spin system
alone is exponentially ergodic (Holley and Stroock (1989», while the exclusion
process converges only algebraically rapidly (Deuschel (1994». Therefore only a
small amount of spin evolution might be enough to render the combined process
ergodic, no matter how large the exclusion component is.
Similar issues arise in another context. Consider a process T/I on {O, 1,2, ... }Zl
with the following transitions:
(a) increase T/(x) by 1 at rate f3(T/(x)),
(b) decrease T/(x) by 1 at rate 8(T/(x)),
(c) increase T/ (x) by 1 and decrease T/ (y) by 1 at rate M if Ix - Y I = 1.
This is sometimes called a reaction-diffusion process; the reaction part is the
increase and decrease in (a) and (b) above, and the diffusion part is given by
(c). Homogeneous product measures with Poisson marginals are invariant for the
diffusion part, so if f3 (-) and 8 (-) are chosen so that a particular Poisson distribution
is invariant for the reaction part, this Poisson will be invariant for the combined
evolution. The condition for this to be the case for the Poisson with parameter A
is
In a somewhat restricted version of this situation, Ding, Durrett and Liggett (1990)
proved that this is the only invariant measure, and there is convergence to it from
any initial configuration. This result was generalized by Chen, Ding and Zhu
(1994). Ergodicity has also been proved in some other (nonreversible) (f3, 8, M)
regions - see Chen (199S) for details.
There is again a heuristic argument for nonergodicity for certain choices of
(f3,8) if M is large: If M is large, the distribution of the process at large times
should be close to a homogeneous product of Poisson distributions. If the distri-
bution at time t really were such a product, with Poisson parameter A(t), then A(t)
would satisfy
=L
d 00 e-)'(I)[A(t)f
(S.12) -ET/I(x) [f3(n) - 8(n)].
dt n=O n!
As an example, suppose
where a, b, e, d > O. This is known as Schl6gl's second model. Then the right
side of (S.12) becomes
(S.13)
evaluated at A(t). If this cubic has three positive roots, the smallest and largest
will be stable, and that argues for the nonergodicity of the system. Note that in
this example, (S.11) holds for some A if and only if
5. Notes and References 315
a b
=
c d'
and in this case (5.13) has only one real root. Thus, the heuristic does not contradict
known results in the reversible case.
Asymmetric Exclusion Processes with Random Rates. Take a nearest neighbor one
dimensional exclusion process with jump probabilities that are different for dif-
ferent particles - the ith particle jumps to the right with probability Pi and to the
left with probability qi, where
1
Pi +qi = 1, ·>
P1 _
c > -.
2
Note that the jump probabilities are associated with particles, not with sites. Let
{Pi, i E Zl} be chosen according to a stationary, ergodic process, and ask for what
densities p, does there exist a product measure that is invariant for the process seen
from a tagged particle, and has asymptotic density p in both directions. Benjamini,
Ferrari and Landim (1996) give the following answer: There is a critical density
p* so that for almost all choices of the {Pi}, if p ~ p*, there is such a product
measure, while if p < p*, there is no such product measure. Now suppose that c
is the essential infimum of the distribution of Pi, and that the Pi's are i.i.d. Then
p* > 0 if and only if P(Po = c) < I~C. More explicit results for one sided models
were obtained by Krug and Ferrari (1996). Hydrodynamics for models of this sort
is studied by Seppiiliiinen and Krug (1999).
Long Range Exclusion Processes. The long range exclusion process differs from
the exclusion process we have considered in Part III in that when a particle attempts
to move to an occupied site, instead of returning at its original site, it continues
searching (instantaneously) for a vacant site until it finds one (which may never
happen). In other words, if it is at XES when its exponential clock rings, and
the configuration of the system at that time is 1'], it constructs a Markov chain Xn
with transition probabilities P (', .) and initial state x, and moves to X T, where
u(x, 0) = u(O, t) = 0.
The solution to this partial differential equation with these initiallboundary con-
ditions is u(x, t) = 2Ft, and it is this factor of 2 that gives the value of c
above.
For another view of this connection between Ulam's problem and particle
systems related to the exclusion process, see Seppiiliiinen (1996). A closely related
problem is solved in Seppiiliiinen (1997). Large deviations in this context are
considered by Seppiiliiinen (1 998b).
Bibliography
Papers on the main topics of this book - contact, voter and exclusion processes - are listed
by topic, following a list of books. In these sections, we have tried to include all papers
written about those models since 1985. Papers that are more general, or do not fit naturally
into one of the three categories, are listed at the end.
In most cases, only references after 1985 are listed. For earlier references, please see
the bibliography oflPS, Liggett (1985). In that book, we tried to list essentially all papers
written about interacting particle systems up to that time. By now, there are well over 1000
papers on this subject, so we have not tried to be as inclusive this time. Therefore, the
listing of "other papers" at the end contains only papers that are referred to explicitly in
the text.
Books
K. B. Athreya and P. E. Ney, Branching Processes, Springer, 1972.
M. F. Chen, From Markov Chains to Non-Equilibrium Particle Systems, World Scientific,
1992.
A. De Masi and E. Presutti, Mathematical Methods for Hydrodynamic Limits, Springer
Lecture Notes in Mathematics 1501, 1991.
R. Durrett, Lecture Notes on Particle Systems and Percolation, Wadsworth, 1988.
R. Durrett, Probability: Theory and Examples, second edition, Duxbury, 1996.
L. C. Evans, Partial Differential Equations, American Mathematical Society, 1998.
G. Grimmett, Percolation, Springer, 1989.
G. Grimmett, Percolation, 2nd edition, Springer, 1999.
F. P. Kelly, Reversibility and Stochastic Networks, Wiley, 1979.
C. Kipnis and C. Landim, Scaling Limits of Interacting Particle Systems, Springer, 1999.
N. Konno, Phase Transitions of Interacting Particle Systems, World Scientific, 1994.
T. M. Liggett, Interacting Particle Systems, Springer, 1985.
W. Rudin, Real and Complex Analysis, McGraw-Hill, 1966.
R. Schinazi, Classical and Spatial Stochastic Processes, Birkhauser, 1999.
H. Spohn, Large Scale Dynamics of Interacting Particles, Springer Texts and Monographs
in Physics, 1991.
Contact Processes
M. Aizenman and G. Grimmett, Strict monotonicity for critical points in percolation and
ferromagnetic models, J. Statist. Phys. 63 (1991), 817-835.
E. D. Andje1, The contact process in high dimensions, Ann. Prob. 16 (1988), 1174-1183.
E. D. Andjel, Survival of multidimensional contact process in random environments, Bol.
Soc. Bras. Mat. 23 (1992), 109-119.
E. D. AndjeJ, R. Schinazi and R. H. Schonmann, Edge processes ofone-dimensional stochas-
tic growth models, Ann. Inst. H. Poincare Probab. Statist. 26 (1990), 489-506.
318 Bibliography
D. J. Barsky and C. C. Wu, Critical exponents for the contact process under the triangle
condition,1. Statist. Phys. 91 (1998), 95-124.
V. Belitsky, P. A. Ferrari, N. Konno and T. M. Liggett, A strong correlation inequality for
contact processes and oriented percolation, Stoch. Proc. Appl. 67 (1997), 213-225.
C. Bezuidenhout and L. Gray, Critical attractive spin systems, Ann. Probab. 22 (1994),
1160--1194.
C. Bezuidenhout and G. Grimmett, The critical contact process dies out, Ann. Probab. 18
(1990), 1462-1482.
C. Bezuidenhout and G. Grimmett, Exponential decay for subcritical contact and percolation
processes, Ann. Probab. 19 (1991), 984--1009.
M. Bramson, R. Durrett and R. H. Schonmann, The contact process in a random environ-
ment, Ann. Probab. 19 (1991), 960--983.
M. Bramson, R. Durrett and G. Swindle, Statistical mechanics of crabgrass, Ann. Probab.
17 (1989), 444--481.
L. Buttel, J. T. Cox and R. Durrett, Estimating the critical values of stochastic growth
models, 1. Appl. Probab. 30 (1993), 455-461.
M. Cassandro, A. Galves, E. Olivieri and M. E. Vares, Metastable behavior of stochastic
dynamics: A pathwise approach, 1. Statist. Phys. 35 (1984), 603-634.
J. W. Chen, Small density fluctuation for one-dimensional contact processes under nonequi-
librium, Acta Math. Sci. 13 (1993), 399-405.
J. W. Chen, The contact process on afinite system in higher dimensions, Chinese J. Contemp.
Math. 15 (1994), 13-20.
1. W. Chen, Smoothness and stability of one-dimensional contact processes, Acta Math.
Sinica 38 (1995), 91-98.
J. W. Chen, R. Durrett and X. F. Liu, Exponential convergence for one dimensional contact
processes, Acta Math. Sinica 6 (1990), 349-353.
J. T. Cox, R. Durrett and R. Schinazi, The critical contact process seen from the right edge,
Probab. Th. ReI. Fields 87 (1991), 325-332.
1. T. Cox and A. Greven, On the long term behavior ofsome finite particle systems, Probab.
Th. ReI. Fields 85 (1990), 195-237.
R. Dickman, Nonequilibrium lattice models: series analysis of steady states, J. Statist. Phys.
55 (1989), 997-1026.
R. Durrett, The contact process, 1974-1989, Proceedings of the 1989 AMS Seminar on
Random Media (W. E. Kohler and B. S. White, ed.), vol. 27, AMS Lectures in Applied
Mathematics, 1991, pp. 1-18.
R. Durrett, Stochastic growth models - bounds on critical values, J. Appl. Prob. 29 (1992),
11-20.
R. Durrett and D. Griffeath, Contact processes in several dimensions, Z. Wahrsch. verw.
Gebiete 59 (1982), 535-552.
R. Durrett and X. Liu, The contact process on a finite set, Ann. Probab. 16 (1988),
1158-1173.
R. Durrett and E. Perkins, Rescaled contact processes converge to super Brownian motion
for d ~ 2, Probab. Theory ReI. Fields (1999).
R. Durrett and R. Schinazi, Intermediate phase for the contact process on a tree, Ann.
Probab. 23 (1995), 668-673.
R. Durrett and R. Schonmann, Stochastic growth models, Percolation Theory and Ergodic
Theory of Infinite Particle Systems (H. Kesten, ed.), Springer, 1987, pp. 85-119.
R. Durrett and R. Schonmann, The contact process on a finite set II, Ann. Probab. 16
(1988a), 1570--1583.
R. Durrett and R. Schonmann, Large deviations for the contact process and two dimensional
percolation, Probab. Th. ReI. Fields 77 (1988b), 583-603.
R. Durrett, R. Schonmann and N. Tanaka, The contact process on a finite set III. The critical
case, Ann. Probab. 17 (1989), 1303-1321.
Bibliography 319
M. D. Penrose, The threshold contact process: a continuum limit, Probab. Th. ReI. Fields
104 (1996), 77-95.
A. Puha, A reversible nearest particle system on the homogeneous tree, 1. Th. Probab. 12
(1999),217-254.
A. Puha, Critical exponents for a reversible nearest particle system on the binary tree, Ann.
Probab. (2000).
M. Salzano and R. H. Schonmann, The second lowest extremal invariant measure of the
contact process, Ann. Probab. 25 (1997), 1846-187l.
M. Salzano and R. H. Schonmann, A new proofthatfor the contact process on homogeneous
trees local survival implies complete convergence, Ann. Probab. 26 (1998), 1251-1258.
M. Salzano and R. H. Schonmann, The second lowest extremal invariant measure of the
contact process II, Ann. Probab. 27 (1999).
R. Schinazi, On multiple phase transitions for branching Markov chains, 1. Statist. Phys.
71 (1993), 521-525.
R. Schinazi, The asymmetric contact process on a finite set, 1. Statist. Phys. 74 (1994),
1005-1016.
R. Schinazi, A contact process with a single inhomogeneous site, J. Statist. Phys. 83 (1996),
767-777.
R. H. Schonmann, Metastability for the contact process, 1. Statist. Phys. 41 (1985),445-464.
R. H. Schonmann, Central limit theorem for the contact process, Ann. Probab. 14 (1986a),
1291-1295.
R. H. Schonmann, The asymmetric contact process, 1. Statist. Phys. 44 (1986b), 505-534.
R. H. Schonmann, A new look at contact processes in several dimensons, Percolation Theory
and Ergodic Theory of Infinite Particle Systems (H. Kesten, ed.), vol. 8, IMA Series
in Mathematics and its Applications, 1987a, pp. 245-250.
R. H. Schonmann, A new proof ofthe complete convergence theorem for contact processes in
several dimensions with large infection parameter, Ann. Probab. 15 (1987b), 382-387.
R. H. Schonmann, The triangle condition for contact processes on homogeneous trees, 1.
Statist. Phys. 90 (1998), 1429-1440.
R. H. Schonmann and M. E. Vares, The survival of the large dimensional basic contact
process, Probab. Th. ReI. Fields 72 (1986), 387-393.
A. Simonis, Metastability for the d-dimensional contact process, 1. Statist. Phys. 83 (1996),
1225-1239.
A. Simonis, Filling in the hypercube in the supercritica/ contact process in equilibrium,
Markov Proc. ReI. Fields 4 (1998), 113-l30.
A. M. Stacey, Bounds on the critical probabilities in oriented percolation models, Cambridge
University thesis (1994).
A. M. Stacey, The existence of an intermediate phase for the contact process on trees, Ann.
Probab. 24 (1996), 1711-1726.
A. M. Stacey, The contact process on afinite tree, (2000).
T. Sweet, The asymmetric contact process at its second critical value, J. Statist. Phys. 86
(1997),749-764.
G. Swindle, A mean field limit of the contact process with large range, Probab. Th. ReI.
Fields 85 (1990), 261-282.
A. Y. Tretyakov, V. Belitsky, N. Konno and T. Yamaguchi, Numerical estimation on cor-
relation inequalities for Holley-Liggett bounds, Mem. Muroran Inst. Tech. 48 (1998),
101-105.
A. Y. Tretyakov, N. Inui and N. Konno, Phase transition for the one-sided contact process,
1. Phys. Soc. Jap. 66 (1997), 3764-3769.
A. Y. Tretyakov and N. Konno, Phase transition of the contact process on the binary tree,
1. Phys. Soc. Japan 64 (1995), 4069-4072.
c. C. Wu, The contact process on a tree: behavior near the first phase transition, Stoch.
Proc. Appl. 57 (1995), 99-112.
322 Bibliography
Voter Models
E. D. Andjel, T. M. Liggett and T. Mountford, Clustering in one dimensional threshold
voter models, Stoch. Proc. Appl. 42 (1992), 73-90.
E. D. Andjel and T. Mountford, A coupling of infinite particle systems II, J. Math. Kyoto
Univ. 38 (1998), 635-642.
M. Bramson, J. T. Cox and R. Durrett, Spatial models for species area curves, Ann. Probab.
24 (1996),1727-1751.
M. Bramson, 1. T. Cox and R. Durrett, A spatial modelfor the abundance of species, Ann.
Probab. 26 (1998), 658-709.
M. Bramson, 1. T. Cox and D. Griffeath, Consolidation rates for two interacting systems in
the plane, Probab. Th. ReI. Fields 73 (1986), 613-625.
M. Bramson, 1. T. Cox and D. Griffeath, Occupation time large deviations of the voter
model, Probab. Th. ReI. Fields 77 (1988), 401-413.
D. Chen, The consensus times of the majority vote process on a torus, J. Statist. Phys. 86
(1997), 779-802.
1. T. Cox, Some limit theorems for voter model occupation times, Ann. Probab. 16 (1988),
1559-1569.
J. T. Cox, Coalescing random walks and voter model consensus times on the torus in Zd,
Ann. Probab. 17 (1989), 1333-1366.
1. T. Cox and R. Durrett, Nonlinear voter models, Random Walks, Brownian Motion and
Interacting Particle Systems, A Festschrift in honor of Frank Spitzer (R. Durrett and
H. Kesten, ed.), Birkhauser, 1991, pp. 189-201.
J. T. Cox and R. Durrett, Hybrid zones and voter model interfaces, Bernoulli 1 (1995),
343-370.
1. T. Cox, R. Durrett and E. A. Perkins, Rescaled voter models converge to super Brownian
motion, 2000.
J. T. Cox and A. Greven, On the long term behavior offinite particle systems: A critical
dimension example, Random Walks, Brownian Motion and Interacting Particle Systems,
A Festschrift in honor of Frank Spitzer, Birkhauser, pp. 203-213.
J. T. Cox and D. Griffeath, Occupation time limit theorems for the voter model, Ann. Probab.
11 (1983), 876--893.
J. T. Cox and D. Griffeath, Critical clustering in the two dimensional voter model, Stochastic
Spatial Processes (P. Tautu, ed.), vol. 1212, Springer Lecture Notes in Mathematics,
1986a, pp. 59-68.
1. T. Cox and D. Griffeath, Diffusive clustering in the two dimensional voter model, Ann.
Probab. 14 (1986b), 347-370.
M. 1. De Oliveira, Isotropic majority vote model on a square lattice, 1. Statist. Phys. 66
(1992), 273-281.
R. Durrett, Multicolor particle systems with large threshold and range, J. Th. Probab. 5
(1992), 127-152.
R. Durrett and J. E. Steif, Fixation results for threshold voter systems, Ann. Probab. 21
(1993),232-247.
1. Ferreira, The probability of survival for the biased voter model in a random environment,
Stoch. Proc. Appl. 34 (1990), 25-38.
B. Granovsky and N. Madras, The noisy voter model, Stoch. Proc. Appl. 55 (1995), 23-43.
S. Handjani, The complete convergence theorem for coexistent threshold voter models, Ann.
Probab. 27 (1999), 226--245.
T. M. Liggett, Coexistence in threshold voter models, Ann. Probab. 22 (1994b), 764-802.
T. S. Mountford, Generalized voter models, 1. Statist. Phys. 67 (1992), 303-311.
Bibliography 323
M. A. Santos and S. Texeira, Anisotropic voter model, 1. Statist. Phys. 78 (1995), 963-970.
A. Sudbury, Hunting submartingales in the jumping voter model and the biased annihilating
branching process, Adv. Appl. Probab. (1999).
Exclusion Processes
H. Guiol, Un rI?sultat pour Ie processus d'exclusion Ii longue portee, Ann. Inst. Henri
Poincare 33 (1997), 387--405.
L.-H. Gwa and H. Spohn, Bethe solution for the dynamic-scaling exponent of the noisy
Burgers equation, Phys. Rev. A 46 (1992), 844-854.
S. A. Janowski, Exact solution of the totally asymmetric exclusion process: shock profiles,
Rebrape 8 (1994), 85-91.
S. A. Janowski and 1. L. Lebowitz, Finite size effects and shock fluctuations in the asym-
metric simple exclusion process, Phys. Rev. A 45 (1992), 618-625.
S. A. Janowski and 1. L. Lebowitz, Exact results for the asymmetric simple exclusion process
with a blockage, 1. Statist. Phys. 77 (1994), 35-51.
1. D. Keisling, Convergence speed for simple symmetric exclusion: An explicit calculation,
1. Statist. Phys 90 (1998), 1003-1013.
J. D. Keisling, An ergodic theoremfor the symmetric generalized exclusion process, Markov
Proc. ReI. Fields 4 (1998), 351-379.
C. Kipnis, Recent results on the movement of a tagged particle in simple exclusion, Par-
ticle Systems, Random Media, and Large Deviations (R. Durrett, ed.), vol. 41, AMS
Contemporary Mathematics, 1985, pp. 259-265.
C. Kipnis, Central limit theorem for infinite series of queues and applications to simple
exclusion, Ann. Probab. 14 (1986), 397--408.
C. Kipnis, Fluctuations des temps d'occupation d'un site dans I 'exclusion simple symetrique,
Ann. Inst. H. Poincare Probab. Statist. 23 (1987), 21-35.
C. Kipnis, C. Landim and S. Olla, Hydrodynamicallimit for a nongradient system: The gen-
eralized symmetric exclusion process, Comm. Pure Appl. Math 47 (1994), 1475-1545.
C. Kipnis, C. Landim and S. Olla, Macroscopic properties of a stationary non-equilibrium
distribution for a non-gradient interacting particle system, Ann. Inst. H. Poincare
Probab. Statist. 31 (1995), 191-221.
C. Kipnis, S. Olla, and S. R. S. Varadhan, Hydrodynamics and large deviations for simple
exclusion processes, Comm. Pure Appl. Math 42 (1989),115-137.
C. Kipnis and S. R. S. Varadhan, Central limit theorem for additive functionals of re-
versible Markov processes and applications to simple exclusions, Comm. Math. Phys.
104 (1986),1-19.
K. Komoriya, Hydrodynamic limit for asymmetric mean zero exclusion processes with speed
change, Ann. Inst. H. Poincare Probab. Statist. 34 (1998), 767-797.
1. Krug and P. A. Ferrari, Phase transitions in driven diffusive systems with random rates,
1. Phys. A 29 (1996), L465-L471.
C. Landim, Hydrodynamical equation for attractive particle systems on Zd, Ann. Probab.
19 (1991),1537-1558.
C. Landim, Occupation time large deviations for the symmetric simple exclusion process,
Ann. Probab. 20 (1992), 206-231.
C. Landim, S. Olla and S. B. Volchan, Driven tracer particle and Einstein relation in
one-dimensional symmetric simple exclusion process, Resenhas 3 (1997), 173-209.
C. Landim, S. Olla and S. B. Volchan, Driven tracer particle in one-dimensional symmetric
simple exclusion process, Comm. Math. Phys. 192 (1998), 287-307.
C. Landim, S. Olla and H.- T. Yau, Some properties ofthe diffusion coefficient for asymmetric
simple exclusion processes, Ann. Probab. 24 (1996), 1779-1808.
C. Landim, S. Olla and H. T. Yau, First order correction for the hydrodynamic limit of
asymmetric simple exclusion processes in dimension d ::: 3, Comm. Pure Appl. Math.
50 (1997), 149-203.
C. Landim and M. E. Vares, Equilibrium fluctuations for exclusion processes with speed
change, Stoch. Proc. Appl. 52 (1994), 107-118.
C. Landim and H.-T. Yau, Fluctuation-dissipation equation of asymmetric simple exclusion
processes, Probab. Th. ReI. Fields 108 (1997), 321-356.
Bibliography 327
T. M. Liggett, Ergodic theorems for the asymmetric simple exclusion process, Trans. Amer.
Math. Soc. 213 (1975), 237-261.
T. M. Liggett, Coupling the simple exclusion process, Ann. Probab. 4 (1976), 339-356.
T. M. Liggett, Ergodic theorems for the asymmetric simple exclusion process II, Ann.
Probab. 4 (1977), 339-356.
T. M. Liggett, Long range exclusion processes, Ann. Probab. 8 (1980), 861-889.
C. T. MacDonald, J. H. Gibbs and A. C. Pipkin, Kinetics of biopolymerization on nucleic
acid templates, Biopolymers 6 (1968), 1-25.
F. P. Machado, Branching exclusion on a strip, J. Statist. Phys. 86 (1997), 765-777.
C. Maes and F. Redig, Anisotropic perturbations of the simple symmetric exclusion process:
long correlations, J. Phys. I 1 (1991), 669-684.
J. P. Marchand and P. A. Martin, Exclusion process and droplet shape, J. Statist. Phys. 44
(1986),491-504.
J. P. Marchand and P. A. Martin, Errata: Exclusion process and droplet shape, J. Statist.
Phys. 50 (1988), 469-471.
Y. Nagahata, The gradient condition for one-dimensional symmetric exclusion processes, J.
Statist. Phys. 91 (1998), 587-602.
C. Neuhauser, One dimensional stochastic Ising model with small migration, Ann. Probab.
18 (1990), 1539-1546.
J. Quastel, Diffusion of color in simple exclusion process, Comm. Pure Appl. Math. 45
(1992), 623-679.
J. Quastel, F. Rezakhanlou and S. R. S. Varadhan, Large deviations for the symmetric simple
exclusion process in dimensions d ~ 3, Probab. Th. ReI. Fields 113 (1999), 1-84.
N. Rajewsky, L. Santen, A. Schadschneider and M. Schreckenberg, The asymmetric exclu-
sion process: comparison of update procedures, 1. Statist. Phys. 92 (1998), 151-194.
K. Ravishankar, Fluctuations from the hydrodynamical limit for the symmetric simple ex-
clusion in Zd, Stoch. Proc. Appl. 42 (l992a), 31-37.
K. Ravishankar, Interface fluctuations in the two-dimensional weakly asymmetirc simple
exclusion process, Stoch. Proc. Appl. 43 (l992b), 223-247.
F. Rezakhanlou, Hydrodynamic limit for attractive particle systems on Zd, Comm. Math.
Phys. 140 (1991), 417-448.
F. Rezakhanlou, Evolution of tagged particles in nonreversible particle systems, Comm.
Math. Phys. 165 (l994a), 1-32.
F. Rezakhanlou, Propagation of chaos for symmetric simple exclusion, Comm. Pure Appl.
Math. 47 (1994b), 943-957.
F. Rezakhanlou, Microscopic structure of shocks in one conservation laws, Ann. Inst. H.
Poincare Anal. Non Lineaire 12 (1995), 119-153.
H. Rost, Non-equilibrium behaviour of a many particle process: density profile and local
equilibria, Z. Wahrsch. verw. Gebiete 58 (1981), 41-53.
E. Saada, A limit theorem for the position of a tagged particle in a simple exclusion process,
Ann. Probab. 15 (1987), 375-381.
S. Sandow, Partially asymmetric exclusion process with open boundaries, Phys. Rev. E 50
(1994),2660--2667.
S. Sandow and G. M. Schutz, On Uq [SU(2)]-symmetric driven diffusion, Europhys. Lett.
26 (1994), 7-12.
T. Sasamoto and M. Wadati, Dynamic matrix product ansatz and Bethe ansatz equation for
asymmetric exclusion process with periodic boundary condition, 1. Phys. Soc. Japan 66
(1997), 279-282.
G. M. Schutz, Generalized Bethe ansatz solution of a one dimensional asymmetric exclusion
process on a ring with blockage, 1. Statist. Phys. 71 (1993),471-505.
G. M. Schutz, Pairwise balance and invariant measures for generalized exclusion processes,
1. Phys. A 29 (1996), 837-843.
328 Bibliography
Other Papers
D. Aldous, Markov chains with almost exponential hitting times, Stoch. Proc. Appl. 13
(1982),305-310.
D. Aldous and P. Diaconis, Hammersley's interacting particle process and longest increas-
ing subsequences, Probab. Th. ReI. Fields 103 (1995), 199-213.
E. Andjel, Invariant measures for the zero range process, Ann. Probab. 10 (1982), 525-547.
1. T. Chayes, A. Puha and T. Sweet, Independent and dependent percolation, Probability
Theory and Applications (E.P. Hsu and S.R.S. Varadhan, eds.), IASlPark City Mathe-
matics Series, Vol. 6, AMS, 1999, pp. 49-166.
Bibliography 329
M.-F. Chen, On the ergodic region of SchlOgl's model, Proc. Intern. Conf. Dirichlet Forms
and Stoch. Proc., Walter de Gruyter, 1995, pp. 87-102.
M. -F. Chen, W. -D. Ding and D. -G. Zhu, Ergodicity ofreversible reaction diffusion processes
with general reaction rates, Acta Math. Sinica 10 (1994), 99-112.
N. G. de Bruijn and P. Erdos, On a recursion formula and some Tauberian theorems, 1.
Res. Nat. Bur. Standards 50 (1953), 161-164.
A. De Masi, P. A. Ferrari, S. Goldstein and W. D. Wick, Invariance principle for reversible
Markov processes with applications to random motions in random environments, 1.
Statist. Phys. 55 (1989), 787-855.
1. van den Berg and U. Fiebig, On a combinatorial conjecture concerning disjoint occur-
rences of events, Ann. Probab. 15 (1987), 354-374.
1. D. Deuschel, Algebraic L 2 decay of attractive critical processes on the lattice, Ann.
Probab. 22 (1994),264-283.
W.-D. Ding, R. Durrett and T. M. Liggett, Ergodicity of reversible reaction diffusion pro-
cesses, Probab. Th. ReI. Fields 85 (1990), 13-26.
R. Durrett, Oriented percolation in two dimensions, Ann. Probab. 12 (1984), 999-1040.
R. Durrett, Ten Lectures on Particle Systems, Proceedings of the 1993 St. Flour Summer
School, Springer Lecture Notes #1608, 1995, pp. 97-201.
R. Durrett, Stochastic spatial models, Probability Theory and Applications (E.P. Hsu
and S.R.S. Varadhan, eds.), lAS/Park City Mathematics Series, Vol. 6, AMS, 1999,
pp. 5-47.
R. Durrett and C. Neuhauser, Particle systems and reaction-diffusion equations, Ann.
Probab. 22 (1994), 289-333.
P. A. Ferrari and L. R. G. Fontes, The net output process of a system with irifinitely many
queues, Ann. Appl. Probab. 4 (1994), 1129-1144.
P. A. Ferrari, A. Galves and C. Landim, Exponential waiting time for a big gap in a one
dimensional zero range process, Ann. Probab. 22 (1994), 284-288.
S. Goldstein, Antisymmetric functionals of reversible Markov processes, Ann. Inst. Henri
Poincare 31 (1995),177-190.
T. E. Harris, Random measures and motions ofpoint processes, Z. Wahrsch. verw. Gebiete
9 (1967), 36-58.
R. A. Holley and D. W. Stroock, Uniform and L 2 convergence in one dimensional stochastic
Ising models, Comm. Math. Phys. 123 (1989), 85-93.
S. Karlin and 1. McGregor, Coincidence probabilities, Pac. 1. Math. 9 (1959), 1141-1164.
T. M. Liggett, Total positivity and renewal theory, Probability, Statistics and Mathematics:
Papers in Honor of Samuel Karlin, Academic Press, 1989, pp. 141-162.
T. M. Liggett, Survival of discrete time growth models, with applications to oriented perco-
lation, Ann. Appl. Probab. 5 (1995b), 613-636.
T. M. Liggett, Stochastic models of interacting systems, Ann. Probab. 25 (1997), 1-29.
T. M. Liggett, R. H. Schonmann and A. M. Stacey, Domination by product measures, Ann.
Probab. 25 (1997), 71-95.
T. S. Mountford and B. Prabhakar, On the weak convergence of departures from an irifinite
series of·/ M /1 queues, Ann. Appl. Probab. 5 (1995), 121-127.
R. H. Schonmann, An approach to characterize metastability and critical droplets in stochas-
tic Ising models, Ann. Inst. H. Poincare A 55 (1991), 591-600.
R. H. Schonmann and S. B. Shlosman, Wulff droplets and the metastable relaxation of
kinetic Ising models, Comm. Math. Phys. 194 (1998), 389-462.
T. Seppiiliiinen, A microscopic model for the Burgers equation and longest increasing sub-
sequences, Elect. 1. Probab. 1 (1996), 1-51.
T. Seppiiliiinen, Increasing sequences ofindependent points on the planar lattice, Ann. Appl.
Probab. 7 (1997b), 886-898.
T. Seppiiliiinen, Large deviations for increasing sequences on the plane, Probab. Th. ReI.
Fields 112 (l998b), 221-244.
F. Spitzer, Interaction of Markov processes, Advances Math. 5,246-290.
Index
A Selection
219. DuvautJLions: Inequalities in Mechanics and Physics
220. Kirillov: Elements of the Theory of Representations
221. Mumford: Algebraic Geometry I: Complex Projective Varieties
222. Lang: Introduction to Modular Forms
223. Bergh/U:ifstrom: Interpolation Spaces. An Introduction
224. Gilbargffrudinger: Elliptic Partial Differential Equations of Second Order
225. Schutte: Proof Theory
226. Karoubi: K-Theory. An Introduction
227. GrauertlRemmert: Theorie der Steinschen Riiume
228. SegaUKunze: Integrals and Operators
229. Hasse: Number Theory
230. Klingenberg: Lectures on Closed Geodesics
231. Lang: Elliptic Curves. Diophantine Analysis
232. GihmanlSkorohod: The Theory of Stochastic Processes III
233. StroocklVaradhan: Multidimensional Diffusion Processes
234. Aigner: Combinatorial Theory
235. DynkinlYushkevich: Controlled Markov Processes
236. GrauertlRemmert: Theory of Stein Spaces
237. Kothe: Topological Vector Spaces II
238. GrahamlMcGehee: Essays in Commutative Harmonic Analysis
239. Elliott: Probabilistic Number Theory I
240. Elliott: Probabilistic Number Theory II
en
241. Rudin: Function Theory in the Unit Ball of
242. HuppertlBlackburn: Finite Groups II
243. HuppertlBlackburn: Finite Groups III
244. Kubert/Lang: Modular Units
245. CornfeldIFominlSinai: Ergodic Theory
246. NaimarklStern: Theory of Group Representations
247. Suzuki: Group Theory I
248. Suzuki: Group Theory II
249. Chung: Lectures from Markov Processes to Brownian Motion
250. Arnold: Geometrical Methods in the Theory of Ordinary Differential Equations
251. ChowlHale: Methods of Bifurcation Theory
252. Aubin: Nonlinear Analysis on Manifolds. Monge-Ampere Equations
253. Dwork: Lectures on p-adic Differential Equations
254. Freitag: Siegelsche Modulfunktionen
255. Lang: Complex Multiplication
256. Hormander: The Analysis of Linear Partial Differential Operators I
257. Hormander: The Analysis of Linear Partial Differential Operators II
258. Smoller: Shock Waves and Reaction-Diffusion Equations
259. Duren: Univalent Functions
260. FreidlinlWentzell: Random Perturbations of Dynamical Systems
261. BoschlGuntzerlRemmert: Non Archimedian Analysis - A System Approach
to Rigid Analytic Geometry
262. Doob: Classical Potential Theory and Its Probabilistic Counterpart
263. Krasnosel'skiIlZabrelko: Geometrical Methods of Nonlinear Analysis
264. AubinlCellina: Differential Inclusions
265. GrauertlRemmert: Coherent Analytic Sheaves
266. de Rham: Differentiable Manifolds
267. ArbarellolCornalbalGriffithslHarris: Geometry of Algebraic Curves, Vol. I
268. ArbarellolCornalbalGriffithslHarris: Geometry of Algebraic Curves, Vol. II
269. Schapira: Microdifferential Systems in the Complex Domain
270. Scharlau: Quadratic and Hermitian Forms
271. Ellis: Entropy, Large Deviations, and Statistical Mechanics
272. Elliott: Arithmetic Functions and Integer Products
273. Nikol'skiI: Treatise on the Shift Operator
274. Hormander: The Analysis of Linear Partial Differential Operators III
275. Hormander: The Analysis of Linear Partial Differential Operators IV
276. Liggett: Interacting Particle Systems
277. Fulton/Lang: Riemann-Roch Algebra
278. BarrIWells: Toposes, Triples and Theories
279. BishoplBridges: Constructive Analysis
280. Neukirch: Class Field Theory
281. Chandrasekharan: Elliptic Functions
282. Lelong/Gruman: Entire Functions of Several Complex Variables
283. Kodaira: Complex Manifolds and Deformation of Complex Structures
284. Finn: Equilibrium Capillary Surfaces
285. Burago/Zalgaller: Geometric Inequalities
286. Andrianaov: Quadratic Forms and Hecke Operators
287. Maskit: Kleinian Groups
288. JacodlShiryaev: Limit Theorems for Stochastic Processes
289. Manin: Gauge Field Theory and Complex Geometry
290. Conway/Sloane: Sphere Packings, Lattices and Groups
291. HahnlO'Meara: The Classical Groups and K-Theory
292. Kashiwara/Schapira: Sheaves on Manifolds
293. RevuzIYor: Continuous Martingales and Brownian Motion
294. Knus: Quadratic and Hermitian Forms over Rings
295. DierkeslHildebrandtIKiisterlWohlrab: Minimal Surfaces I
296. DierkeslHildebrandtIKiisterlWohlrab: Minimal Surfaces II
297. PasturlFigotin: Spectra of Random and Almost-Periodic Operators
298. Berline/GetzlerNergne: Heat Kernels and Dirac Operators
299. Pommerenke: Boundary Behaviour of Conformal Maps
300. Orlikfferao: Arrangements of Hyperplanes
301. Loday: Cyclic Homology
302. LangelBirkenhake: Complex Abelian Varieties
303. DeVorelLorentz: Constructive Approximation
304. Lorentz/v. GolitschekIMakovoz: Construcitve Approximation. Advanced Problems
305. Hiriart-UrrutylLemarechal: Convex Analysis and Minimization Algorithms I.
Fundamentals
306. Hiriart-UrrutylLemarechal: Convex Analysis and Minimization Algorithms II.
Advanced Theory and Bundle Methods
307. Schwarz: Quantum Field Theory and Topology
308. Schwarz: Topology for Physicists
309. Adem/Milgram: Cohomology of Finite Groups
310. GiaquintaIHildebrandt: Calculus of Variations I: The Lagrangian Formalism
311. GiaquintalHildebrandt: Calculus of Variations II: The Hamiltonian Formalism
312. Chung/Zhao: From Brownian Motion to Schrodinger's Equation
313. Malliavin: Stochastic Analysis
314. AdamslHedberg: Function Spaces and Potential Theory
315. Biirgisser/ClausenlShokrollahi: Algebraic Complexity Theory
316. SafflTotik: Logarithmic Potentials with External Fields
317. RockafellarlWets: Variational Analysis
318. Kobayashi: Hyperbolic Complex Spaces
319. BridsonlHaefliger: Metric Spaces of Non-Positive Curvature
320. KipnislLandim: Scaling Limits of Interacting Particle Systems
321. Grimmett: Percolation
322. Neukirch: Algebraic Number Theory
323. NeukirchlSchmidt/Wingberg: Cohomology of Number Fields
324. Liggett: Stochastic Interacting Systems: Contact, Voter and Exclusion Processes
Springer
and the
environment
At Springer we firmly believe that an
international science publisher has a
special obligation to the environment,
and our corporate policies consistently
reflect this conviction.
We also expect our business partners -
paper mills, printers, packaging
manufacturers, etc. - to commit
themselves to using materials and
production processes that do not harm
the environment. The paper in this
book is made from low- or no-chlorine
pulp and is acid free, in conformance
with international standards for paper
permanency.
Springer