Professional Documents
Culture Documents
ISBN 978-3-03719-197-2
www.ems-ph.org
EMS Congress Reports publishes volumes originating from conferences or seminars focusing on any field
of pure or applied mathematics. The individual volumes include an introduction into their subject and
review of the contributions in this context. Articles are required to undergo a refereeing process and are
accepted only if they contain a survey or significant results not published elsewhere in the literature.
Previously published:
Trends in Representation Theory of Algebras and Related Topics, Andrzej Skowroński (ed.)
K-Theory and Noncommutative Geometry, Guillermo Cortiñas et al. (eds.)
Classification of Algebraic Varieties, Carel Faber, Gerard van der Geer and Eduard Looijenga (eds.)
Surveys in Stochastic Processes, Jochen Blath, Peter Imkeller and Sylvie Rœlly (eds.)
Representations of Algebras and Related Topics, Andrzej Skowroński and Kunio Yamagata (eds.)
Contributions to Algebraic Geometry. Impanga Lecture Notes, Piotr Pragacz (ed.)
Geometry and Arithmetic, Carel Faber, Gavril Farkas and Robin de Jong (eds.)
Derived Categories in Algebraic Geometry. Toyko 2011, Yujiro Kawamata (ed.)
Advances in Representation Theory of Algebras, David J. Benson, Henning Krause and
Andrzej Skowroński (eds.)
Valuation Theory in Interaction, Antonio Campillo, Franz-Viktor Kuhlmann and Bernard Teissier (eds.)
Representation Theory – Current Trends and Perspectives, Henning Krause, Peter Littelmann,
Gunter Malle, Karl-Hermann Neeb and Christoph Schweigert (eds.)
Functional Analysis and Operator Theory for Quantum Physics. The Pavel Exner Anniversary Volume,
Jaroslav Dittrich, Hynek Kovařík and Ari Laptev (eds.)
Schubert Varieties, Equivariant Cohomology and Characteristic Classes, Jarosław Buczyński, Mateusz
Michałek and Elisa Postinghel (eds.)
Non-Linear Partial Differential Equations, Mathematical Physics, and Stochastic Analysis, Fritz Gesztesy,
Harald Hanche-Olsen, Espen R. Jakobsen, Yurii Lyubarskii, Nils Henrik Risebro and Kristian Seip, Editors
Spectral Structures
and Topological Methods
in Mathematics
Michael Baake
Friedrich Götze
Werner Hoffmann
Editors
Editors:
2010 Mathematics Subject Classification: Primary 58J65, 52C23, 20F65, 11M41; secondary 46L54, 60J45,
35C07, 35Q55, 43A25, 20F36, 11R42, 14L05.
Key words: Universal distributions, free probability, Markov processes, Schrödinger operators, heat kernel,
spatial ecology, metastability, numerical analysis, critical regularity, aperiodic order, dynamical systems,
special Kähler structure, non-crossing partitions, localising subcategory, braided groups, zeta functions,
subgroup growth, representation growth, Brumer–Stark conjecture, p-divisible groups.
ISBN 978-3-03719-197-2
The Swiss National Library lists this publication in The Swiss Book, the Swiss national bibliography, and
the detailed bibliographic data are available on the Internet at http://www.helveticat.ch.
This work is subject to copyright. All rights are reserved, whether the whole or part of the material
is concerned, specifically the rights of translation, reprinting, re-use of illustrations, recitation,
broadcasting, reproduction on microfilms or in other ways, and storage in data banks. For any kind of use
permission of the copyright owner must be obtained.
Contact address:
European Mathematical Society Publishing House
Seminar for Applied Mathematics
ETH-Zentrum SEW A21
CH-8092 Zürich, Switzerland
Email: info@ems-ph.org
Homepage: www.ems-ph.org
Typeset using the authors’ TeX files: le-tex publishing services GmbH, Leipzig, Germany
Printing and binding: Beltz Bad Langensalza GmbH, Bad Langensalza, Germany
∞ Printed on acid free paper
987654321
Preface
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 411
Chapter 1
Convergence and asymptotic approximations
to universal distributions in probability
F. Götze and H. Kösters
1.1 Introduction
In random matrix theory, the distributions of the eigenvalues of various kinds of ran-
dom matrices are investigated, see e.g. [3]. More precisely, we are interested in the
asymptotic behaviour of the eigenvalues as the matrix size tends to infinity.
Let us begin with one of the most famous results in random matrix theory, the
semi-circle law. For each n 2 N, let X n D .n 1=2 Xj k /16j;k6n be a symmetric
Wigner matrix, i.e. a symmetric n n matrix such that .Xj k /16j 6k6n is a family
of independent and identically distributed (i.i.d.) real random variables satisfying the
moment conditions
The random variables Xj k may depend on n, but this dependence is suppressed in the
notation. Let us mention that all the results described in this section continue to hold
for symmetric random matrices with non-identically distributed entries, and also for
Hermitian Wigner matrices. P
Let 1 ; : : : ; n denote the eigenvalues of X n , and write X n WD n1 njD1 ıj for
the (empirical) spectral distribution of X n . The investigation of the limiting spectral
1 Projects A4, B1
2 F. Götze, H. Kösters
distribution (after appropriate rescaling) has a long history and dates back to work by
Wigner in the 1950s. The famous semi-circle law states that
q
1
lim EX n D sc .dx/ WD 2 .4 x 2 /C dx (1.1.3)
n!1
in the sense of weak convergence. Similar results hold without the expectation, with
weak convergence in probability or almost surely. It is worth emphasising that the
limiting spectral distribution is always given by the semi-circle distribution sc , ir-
respective of the distribution of the random variables Xj k . In this respect, the semi-
circle distribution is universal.
There are two main approaches to prove the semi-circle law, the method of mo-
ments and the method of Stieltjes transforms.
If, for each m 2 N, the moments EjXj k jm are finite and uniformly bounded in n,
one can use the method of moments. Here one shows using combinatorial arguments
that
Z Z
lim x m .EX n /.dx/ D lim E n1 tr X mn D x m sc .dx/ (1.1.4)
n!1 n!1
for any m 2 N0 . For m D 2l even, the limit is the lth Catalan number, or the
number of non-crossing pair partitions of 2l elements. Since convergence in moments
implies convergence in distribution when the limiting distribution is determined by its
moments, this proves (1.1.3).
For any probability measure on the real line, the Stieltjes transform is the ana-
lytic function on the upper half-plane CC WD fu C iv 2 C W u; v 2 R; v > 0g defined
by
Z
1
m .z/ WD .dx/ ; z 2 CC : (1.1.5)
x z
We shall write mX n instead of m and msc instead of msc . It is well known
Xn
that the pointwise convergence of Stieltjes transforms (to a limit which is itself the
Stieltjes transform of a probability measure) is equivalent to the weak convergence of
the underlying probability measures.
In the context of Wigner matrices, the usefulness of the Stieltjes transform comes
from the observation that, for each z 2 CC ,
n
X
1 1 1
mX n .z/ D n
tr R n D n P .j /
n 1=2 X z n 1 Xj k Xj l Rkl
j D1 jj k;l¤j
n
(1.1.6)
X 1 1
1
D n 1
C En D C En :
z n
tr R n z mX .z/
j D1 n
.j /
Here, R n WD .X n zI n / 1 D .Rkl / and R .j n
/
WD .X .j
n
/
zI n 1 / 1 D .Rkl /
.j / .j /
denote the resolvents of the matrices X n and X n , where X n is obtained from X n
Convergence and Asymptotic Approximations 3
by deleting the j th row and the j th column, and En is an error term (see below). If
this error term were zero, (1.1.6) would yield the self-consistency equation
m2 .z/ C zm.z/ C 1 D 0 ; (1.1.7)
whose solution is given by the Stieltjes transform of the semi-circle distribution.
Hence, to prove the semi-circle law, one must show that En ! 0 as n ! 1 (in
the appropriate sense). More precisely, it is not hard to see that the error term in
(1.1.6) is given by
n
X X
1 1=2 1 .j /
.z C mX n .z//En D n
Rjj n Xjj n
Xj k Xj l Rkl
j D1 k;l¤j Wk¤l
X (1.1.8)
.j /
1
n
.Xj2k 1/Rkk C 1
n
tr.R n R .j
n /
/
;
k¤j
so it remains to show that the sums on the right-hand side in (1.1.8) tend to zero
as n ! 1. To this end, results from classical probability theory concerning the
moments and distributions of (possibly random) linear and quadratic forms of inde-
pendent random variables are useful.
The problem of optimal approximations of distributions of linear and nonlinear
functions of independent random variables is a classical problem of probability and
harmonic analysis, which originated from classical analytic number theory. For linear
functionals, the arithmetic structure of the summands strongly influences the distri-
bution of the sums, as in the Littlewood–Offord problem [96] and in the central limit
theorem (CLT). For nonlinear functionals like quadratic forms, this influence is al-
ready reduced and studied intensively in the value distribution on lattices by Landau
(1924) and the multivariate CLT for balls by Esséen (1945). Hence, one expects
that distributions of generic highly non-linear functionals of vectors and matrices of
independent variables (e.g. eigenvalues of matrices) exhibit a smooth distributional
behaviour irrespective of a possible arithmetic/lattice structure in the distribution of
these variables. This expected behaviour applies as well to the distribution of roots
of high-degree polynomials with independent random coefficients, which is related
to the distribution of algebraic numbers of growing height.
Random matrix theory is also related to free probability [3, 102]. To explain this,
let us start from the observation that, by generalisation of Eq. (1.1.4), the calculation
of the expected traces of more general products of random matrices is also of interest.
If .X n / and .Y n / are independent sequences of Wigner matrices as in (1.1.4), it turns
out that
j j
k k
lim E n1 tr X n1 . lim 1 E tr X 1 /I n Y n1 . lim 1 E tr Y 1 /I n
n!1 !1 !1
jm 1 jm km 1 k
Xn . lim E tr X /I n Y n . lim E tr Y m /I n D 0 (1.1.9)
!1 !1
of the powers X jn and Y kn . The relation (1.1.9) is called the asymptotic freeness of
the sequences .X n / and .Y n /.
The relation (1.1.9) is the asymptotic counterpart of the notion of freeness in free
probability. Here, one considers a unital algebra A endowed with a tracial unital
linear functional ' W A ! C and studies the moments '.an / of elements a 2 A.
Two elements a; b 2 A are called free (with respect to each other) if
' .aj1 '.aj1 //.b k1 '.b k1 // .ajm '.ajm //.b km '.b km // D 0
(1.1.10)
For the multiplicative convolution, we shall assume without further notice that the dis-
tributions a and b are supported on the positive half-line. One may then investigate
similar questions (limit theorems, infinite divisibility, asymptotic approximations and
expansions, . . . ) as for the classical convolutions. For instance, the free CLT shows
that if '.a/ D 0 and '.a2 / D 1, then limn!1 n p D weakly, which is the
a= n sc
direct analogue of the classical CLT. Thus, the semi-circle distribution plays a similar
role in free probability as the normal distribution in classical probability.
When the probability measures a and b have compact support, one may also
take an analytic approach based on suitable transforms, namely Voiculescu’s R- and
S -transforms [3, 102], instead of the combinatorial approach outlined above. These
transforms are analytic on certain domains in the complex plane and satisfy
and hence may be viewed as analogues of the logarithmic Fourier transform and the
Mellin transform in classical probability theory. Incidentally, the R- and S -trans-
forms are also closely related to the Stieltjes transform in (1.1.5). For instance, for
the R-transform, we have Ra .z/ D . ma .z//h 1i z 1 , where . /h 1i denotes
the inverse function.
In terms of random matrices, the free convolutions may be interpreted as fol-
lows: Suppose that .X n / and .Y n / are sequences of self-adjoint random matrices of
increasing dimension whose mean spectral distributions converge to X and Y in
moments and which are asymptotically free. (Roughly speaking, this means that their
Convergence and Asymptotic Approximations 5
eigenspaces are in general position to one another.) Then, the limiting mean spectral
distributions of X n C Y n and X n Y n are given by X Y and X Y , respec-
tively. Thus, the limiting spectral distributions of certain composite random matrices
may be investigated using tools from free probability.
As already mentioned, the semi-circle distribution may be regarded as the counter-
part of the normal distribution in free probability. It seems natural to study functionals
under which these distributions have certain extremal properties. For instance, it is
well known that the normal distribution maximises Shannon entropy among all distri-
butions with mean 0 and variance 1. Similarly, the semi-circle distribution maximises
Voiculescu’s free entropy in the same class. Thus, it seems natural to investigate limit
theorems with respect to the divergence measures associated with these entropies.
This brings us to the field of entropic limit theorems; see Section 1.6.2.
In the semi-circle law, in the limit as n ! 1, the eigenvalues are confined to a
bounded interval. This is the global (or macroscopic) level, where one studies the
weak convergence of the empirical eigenvalue distribution to some limiting distribu-
tion (typically with compact support). One may also consider the local (or micro-
scopic) level, where the eigenvalues are rescaled in such a way that the mean spacing
between neighbouring eigenvalues is of the order 1. One then studies the asymptotic
correlations between a small number of eigenvalues which are close to one another.
Interestingly, here the limiting distributions are often universal too, which means that
the same limits arise for different kinds of random matrices (from the same sym-
metry class). See Section 1.4 below for a sample of results. Moreover, these limits
also appear in a variety of other contexts. For instance, the asymptotic local spectral
distributions for Hermitian Wigner matrices also show up in representation theory
(asymptotics of Young diagrams), probability theory (non-colliding stochastic pro-
cesses, repulsive particle systems), number theory (zeros of L-functions) and physics
(quantisations of chaotic dynamical systems).
1.2.1 Rate of convergence in the semi-circle law It is natural to ask for the rate of
convergence in the semi-circle law. Given two probability measures and on the
real line with distribution functions F and G, we write
k k1 WD sup jF .x/ G.x/j
x2R
for the Kolmogorov distance between and . Then, one may consider the distance
either for the mean spectral distribution,
n WD kEX n sc k1 ;
6 F. Götze, H. Kösters
The problem to establish upper bounds on n has a long history. In 1993, Bai [7]
derived the rate O .n 1=4 / under a 4th moment condition. The rate O .n 1=2 / was
obtained independently by Girko (1998, 2002), Bai, Miao and Tsay (2002), and Götze
and Tikhomirov (2003) under various moment conditions; see [69] for references.
Next, the optimal rate O .n 1 / was obtained for the special case of random matrices
with Gaussian entries, first in the Hermitian case [65] and then in the symmetric case
[100]. Using concentration of measure techniques, Bobkov, Götze and Tikhomirov
[22] obtained the rate O .n 2=3 / under the assumption that the matrix entries satisfy
a Poincaré inequality. The optimal rate of convergence O .n 1 / under weak moment
conditions was finally established by Götze and Tikhomirov [68, 69], initially under
an 8th moment condition [68] and later under a .4 C ı/th moment condition [69].
More precisely, it was shown in [69] that if
for some ı > 0, then there exists a constant C D C.ı; M4Cı / such that
1
n 6 C n (1.2.2)
for all n 2 N.
The proof of this result required three major ingredients:
1. A suitable variant of the smoothing inequality to bound the Kolmogorov distance
in terms of the difference of the corresponding Stieltjes transforms, see Proposi-
tion 2.1 in [68]. This inequality uses a special contour which stays away from
the end-points ˙2 of the support of sc , where the Stieltjes transforms are more
difficult to control.
2. A recursive argument to derive good bounds on the diagonal entries of the resol-
vent close to the real axis, see Section 5 in [68]. Roughly speaking, this argument
shows that a bound on EjRjj j2p at distance v from the real axis entails a bound
on EjRjj jp at distance v=s0 from the real axis. It was inspired by similar results,
albeit under stronger moment conditions, by Cacciapuoti, Maltsev and Schlein
[26]. The proof under weak moment conditions is more involved, and uses recur-
sive expansions for the resolvent entries similar to (1.1.6), as well as Burkholder’s
inequality for martingale difference sequences to estimate the resulting quadratic
forms.
3. A recursive inequality for EjmX n .z/ mSC .z/j2 , see Lemma 7.24 in [68]. This
inequality is based on a Stein-type expansion adapted to the self-consistency equa-
tion (1.1.7), which facilitates the recursion argument considerably.
Using moment matching techniques, a simplified proof of (1.2.2) could be given
in [61].
Convergence and Asymptotic Approximations 7
Let us now turn to upper bounds on n . In 1997, Bai, Miao and Tsay [9] ob-
tained the rate OP .n 1=4 / under a 4th moment condition. The rate OP .n 1=2 / was
established in [64] under a 12th moment condition and in [10] under a 6th mo-
ment condition. More recent results by Erdős, Yau and Yin [42] imply that n D
OP .n 1.log n/C log log n /, see e.g. Section 1 in [68]. In [60], complemented by addi-
tional material in [58, 59], Götze, Naumov and Tikhomirov proved that, under the
condition (1.2.1),
n D OP .n 1
log n/ ; (1.2.3)
with some explicit constant D .ı/. In view of a result by Gustavsson [72] for
GUE matrices (i.e. Hermitian Wigner matrices with Gaussian entries), it seems clear
that the optimal rate cannot be better than OP .n 1 log1=2 n/. Thus, a result of the
form (1.2.3) is optimal up to logarithmic factors.
Götze, Naumov, Tikhomirov and Timushev [61] improved the result (1.2.3) by
showing that it is possible to take D 2. More generally, they showed that, un-
der the condition (1.2.1), there exist positive constants C D C.ı; M4Cı / and c D
c.ı; M4Cı / such that for 1 6 p 6 c log n, one has
C p log2p n
P.n > K/ 6 for all K > 0 :
K p np
These results for n were obtained by refining the methods developed in [68]. For
instance, in [60], the previous estimates from [68] were extended to the off-diagonal
entries of the resolvent, and Stein-type expansions were employed systematically in
order to bound the pth moment of the error term En in (1.1.8). Moreover, in [61],
the authors used moment matching techniques (motivated by results in [37, 83]) to
compare a general Wigner matrix to a suitable Wigner matrix with sub-Gaussian
entries. Finally, an essential ingredient in all these works was an appropriate version
of the local semi-circle law, which will be described in the next subsection.
1.2.2 Local semi-circle law As mentioned below (1.1.5), the proof of the semi-
circle law amounts to showing that jmX n .u C iv/ msc .u C iv/j ! 0 as n ! 1,
for any fixed u 2 R, v > 0. In the last few years, similar results have been obtained
for the situation where v may tend to zero as n tends to infinity, but not too fast
[40, 38, 60, 61]. More precisely, under suitable moment conditions,
general Wigner matrices, see also Section 1.4. The first version of the local semi-
circle law was derived under the assumption that the underlying matrix entries have
finite exponential moments (as well as further regularity properties). This assump-
tion was successively relaxed to .4 C ı/th moments in a series of papers by Erdős,
Knowles, Schlein, Yau and Yin; compare [38] and the references therein.
The paper [60] provided an alternative self-contained proof of the local semi-
circle law, also under a .4 C ı/th moment condition, by building upon the techniques
developed for the investigation of the rate of convergence [68]. One of the main
advantages of this approach is that the exponent of log n in (1.2.4) is reduced from
C log log n to a constant (at least in a certain region for the arguments u and v), with
a precise dependence on ı.
These results were further improved by Götze, Naumov, Tikhomirov and Timu-
shev [61], who showed that for any ı > 0, there exist constants C0 ; C1 ; C2 depending
only on ı and M4Cı (see (1.2.1)) such that
C0 p p
EjmX n .u C iv/ msc .u C iv/jp 6
nv
for all 1 6 p 6 C1 log n, 1 > v > C2 log n=n and juj 6 2 C v. By taking
p of the order log n and using Markov’s inequality, one re-obtains (1.2.4) for 1 >
v > C2 log n=n and juj 6 2 C v, but with the constant exponent D 1 instead of
D C log log n for the logarithmic factor.
As already mentioned, this version of the local semi-circle law plays a major role
in recent advances on the rate of convergence for n . Further applications include
the delocalisation of eigenvectors and the rigidity of eigenvalues; see [42, 60, 61] and
the references given there. In view of the results by Gustavsson [72], all these results
seem to be optimal up to logarithmic factors.
1.3.1 Circular law For each n > 1, let X n D .n 1=2 Xj k /16j;k6n be a real Girko–
Ginibre matrix, i.e. an n n matrix such that .Xj k /16j;k6n is a family of i.i.d. real
random variables satisfying the moment conditions
EXj k D 0; EXj2k D 1 .1 6 j; k 6 n/
Again, the random variables Xj k are allowed to depend on n. Also, all the results
described in this section continue to hold for non-symmetric random matrices with
Convergence and Asymptotic Approximations 9
in the sense of weak convergence, or a similar result without the expectation (with
weak convergence in probability or almost surely). Thus, the spectral distribution
converges to the uniform distribution on the unit disk in the complex plane, and the
limiting distribution is again universal.
This result was first stated by Girko in the 1980’s. One of his key ideas was
to study the spectral distributions of the matrices X n by considering the spectral
distributions of the Hermitian random matrices .X n zI n /.X n zI n / , for all
z 2 C. More precisely, for a probability measure with compact support on the
complex plane, let
Z
U .z/ WD log j zj.d/ .z 2 C/ (1.3.1)
C
and the proof of the circular law boils down to proving the convergence of the corre-
sponding logarithmic potentials.
It is worth emphasising that the logarithmic potential UX n .z/ is given by an inte-
gral over an unbounded function. Thus, to prove the convergence of (1.3.2), say, it is
not sufficient to establish the weak convergence of the singular value distributions of
the matrices X zI n , but one must additionally control the large and small singular
values of these matrices. The real problem is that of controlling the small singular
values. In the case where the matrix entries have a smooth density, this problem was
solved by Bai [8]. For general distributions, however, this problem remained open. Its
solution became possible only recently due to work by Rudelson [91] and Rudelson
and Vershynin [92], who derived stochastic lower bounds on the smallest singular
value of a non-symmetric random matrix with independent entries. These bounds
were obtained by a combination of geometric and probabilistic methods, inter alia
concentration inequalities for linear forms of independent random variables. Inde-
pendently of each other, Götze and Tikhomirov [66] and Tao and Vu [97] extended
these bounds to ‘shifted’ matrices X n zI n and applied them to complete the proof
of the circular law.
10 F. Götze, H. Kösters
PnTo continue, optimal bounds for the concentration in small balls of weighted sums
kD1 Xk ak of vectors ak with i.i.d. real weights Xk crucially depend on the arith-
metic properties of the vectors ak if Xk are Rademacher variables taking values ˙1.
Arithmetic consequences of strong concentration have been investigated by Nguyen,
Tao and Vu (2009, 2011) e.g. in [96, 87]. They called this inverse Littlewood–Offord
phenomenon.
Eliseeva, Götze and Zaitsev [35, 36] showed that this phenomenon can be de-
rived from seminal results of Arak (1980, 1981) and Arak and Zaitsev (1988), see [4]
and the references therein, who investigated a similar phenomenon in a more
P general
context. They studied how a large local concentration of the sum Sa WD nkD1 Xk ak
of i.i.d. variables Xk implies a good approximation of the distribution of Sa by an
induced measure of distributions supported on a higher dimensional lattice under a
linear map. As shown in [35, 36], this yields information about the arithmetic struc-
ture of the sequence ak extending the results of [96, 87].
1.3.2 Elliptic law After the proof of the circular law, various extensions were con-
sidered, see also the survey paper by Bordenave and Chafaï [23].
One line of research was the proof of the elliptic law [85, 86, 55]. Here, one con-
siders random matrices X n D .n 1=2 Xj k /16j;k6n with similar moment properties
as in the circular law, but for which the random variables Xj k and Xkj with j ¤ k
are correlated, with a fixed correlation coefficient % 2 . 1; C1/. It turns out that
the asymptotic spectral distribution is given by the uniform distribution on an ellipse,
with a shape determined by %. By and large, the proof of the elliptic law resembles
that of the circular law. However, due to correlations, several parts of the proof have
to be adapted, including the bounds on the small singular values.
Note that one may regard the ‘elliptic’ matrices as interpolations between Wigner
matrices and Girko–Ginibre matrices, and their limiting distributions as interpolations
between the semi-circular distribution and the circular distribution.
.k/ 1
circ .z/ D 1fjzj<1g dz : (1.3.3)
kjzj2.k 1/=k
Remarkably, this is the same limiting distribution as that for the kth power of a single
Girko–Ginibre matrix.
Furthermore, the same result holds for products of independent ‘elliptic’ random
matrices [56, 89]. Somewhat surprisingly, for products with k > 2 factors, one also
obtains the limiting eigenvalue density (1.3.3), irrespective of the correlation % of the
underlying factors.
Fluctuation results for linear statistics of singular values of products of random
matrices have also been studied [57].
1.3.4 Functions of random matrices Götze, Kösters and Tikhomirov [53] pro-
posed a general framework for the investigation of matrix-valued functions
F n .X 1;n ; : : : ; X k;n / of independent Girko-Ginibre matrices X 1;n ; : : : ; X k;n (all of
size n n). One of the main ideas was to exploit the principle of universality, thereby
partly avoiding the need to analyse the self-consistency equations for the Stieltjes
transforms.
In a first step, one identifies a set of conditions under which one may establish
the universality of the limiting singular value and eigenvalue distributions. The in-
vestigation of the singular values relies on the Stieltjes transform and uses classical
truncation and interpolation arguments from probability theory, while the study of the
eigenvalues is again based on Girko’s Hermitisation method.
In a second step, one determines the limiting spectral distributions in a simple
special case, namely for functions of Gaussian random matrices. Here one can take
advantage of the fact that these matrices are bi-orthogonally invariant (i.e. their dis-
tribution is not changed if we pre- or postmultiply them by orthogonal matrices), and
hence asymptotically free. Thus, it becomes possible to determine the limiting spec-
tral distributions by using the calculus of S - and R-transforms from free probability
(see the Introduction) as well as classical results from logarithmic potential theory, at
least in simple situations.
It turns out that the underlying assumptions may be verified in a number of exam-
ples. In particular, many results on products of independent Girko-Ginibre matrices
and their inverses may now be derived within a common framework [53]. Further-
more, the same approach may also be used for sums of independent random matrices
[99, 79], although a more explicit description of the limiting spectral distributions is
available only in a few special cases here. For example, for random matrices of the
form X 1;n .X 2;n X k;n / 1 , or even sums of independent copies thereof, the limit-
ing spectral distributions are closely related to the symmetric stable distributions from
free probability [79].
12 F. Götze, H. Kösters
Y n
Y
2
const jxk xj j exp 1
2
nxj2 dx1 : : : dxn ; (1.4.2)
16j <k6n j D1
That is, for any m 2 N and any smooth function f W Rm ! R with compact support,
we have
0 1
.n m/Š X
lim E @ f .Q j .X n /; : : : ; Q jm .X n //A
n!1 nŠ 1
j1 ;:::;jm (1.4.4)
Z
D f .x1 ; : : : ; xm / det Ksine .xj ; xk / j;kD1;:::;m dx1 dxm ;
Rm
Convergence and Asymptotic Approximations 13
where the sum inside the expectation is over all m-tuples with pairwise distinct el-
ements. Similar results, although with a different rescaling and a different limiting
kernel, the Airy kernel, hold at the edge of the spectrum, i.e. near the points a D ˙2.
Remarkably, the asymptotic local distributions are universal in the sense that the same
limits arise in a variety of other situations, both inside and outside the field of random
matrix theory.
1.4.1 Random matrices The same local correlations have been established for
several classes of random matrices. First of all, it has been known for some time
now that similar results hold if we replace the GUE with a unitarily invariant random
matrix ensemble defined by a potential V with suitable smoothness and growth
properties. Here the matrix density and the eigenvalue density are given by
const exp. n tr V .X // dX and
Y n
Y
const jxk x j j2 exp. nV .xj //dx1 : : : dxn ; (1.4.5)
16j <k6n j D1
respectively. Note that in the special case V .x/ D 21 x 2 , we re-obtain the results
for the GUE. A density of the form (1.4.5) is also called an orthogonal polynomial
ensemble. The reason is that one can use the orthogonal polynomials associated with
the weight e nV .x/ to rewrite the density (1.4.5) in determinantal form, with a kernel
Kn;V , similarly as in (1.4.3). Thus, the asymptotic analysis is again reduced to ortho-
gonal polynomials, which can now be analysed e.g. using Riemann–Hilbert problems,
see e.g. Deift [34] and the references therein. At the global level, one obtains a
deterministic limit density %V with compact support. It is worth emphasising that this
density is not universal, but depends on the choice of the potential V . At the local
level, after replacing %sc with %V in the rescaling of the eigenvalues, one obtains the
same result (1.4.4) as for the GUE in the bulk of the spectrum. Moreover, similar
universality results (with the sine kernel replaced by the Airy kernel) hold at the edge
of the spectrum. Thus, the local correlations are universal within the class of unitarily
invariant ensembles. Strong universality results for the kernel Kn;V including rates
of convergence and transitions between bulk and edge regions have been obtained in
[80].
For more general random matrix ensembles, however, a closed expression for the
joint distribution of the eigenvalues as in (1.4.2) or (1.4.5) is not available anymore,
and the analysis becomes considerably more complicated.
Johansson [74] investigated the local correlations for the so-called deformed
Gaussian unitary ensemble and proved the analogue of (1.4.4). A deformed GUE
matrix has the form X n C aY n , with X n a Hermitian Wigner matrix, Y n an inde-
pendent GUE matrix, and a > 0 fixed. The investigation of this ensemble starts from
two important observations. Firstly, the eigenvalue density may be represented as a
mixture of certain determinantal densities. Secondly, the correlation kernels associ-
ated with these determinantal densities admit a double contour integral representation
which is suitable for asymptotic analysis.
14 F. Götze, H. Kösters
Götze, Gordin and Levina [47, 46] investigated the local correlations for a partic-
ular fixed-trace ensemble, the so-called fixed Hilbert–Schmidt norm ensemble (HSE),
and proved the analogue of (1.4.4). For r > 0, this ensemble is given by the unique
probability measure on the set fX 2 Hn W tr.X 2 / D nr 2 g which is invariant with
respect to conjugation by unitary matrices. The main idea is that the GUE may be
represented as a mixture of the HSE, for different values of r, and that this relation
may be inverted in order to deduce the results for the HSE from the corresponding re-
sults for the GUE. The implementation of this idea is technically involved, however,
and requires extending the parameter r to the complex domain.
More recently, it has been proved by Erdős, Péché, Ramírez, Schlein and Yau [39]
and by Tao and Vu [98] that (1.4.4) continues to hold for general Hermitian Wigner
matrices satisfying certain technical conditions. Consequently, the local correlations
are universal within the class of Hermitian Wigner ensembles as well. The proof of
this result is based on sophisticated comparison arguments. The main idea in [39] is to
compare general Wigner matrices to random matrices from a suitable deformed GUE,
but with a ! 0 as n ! 1, and to show that the local correlations are asymptotically
the same. To this end, one needs the local semi-circle law (and several related results)
as an input. We refer to the survey paper [41] for details.
1.4.2 Products of random matrices It is well known that the singular value and
eigenvalue configurations of a Ginibre matrix (i.e. a non-Hermitian random matrix
with i.i.d. complex Gaussian entries) have spectral densities of determinantal form.
More recently, it has been observed by Akemann, Burda, Ipsen, Kieburg, Wei and
others that the same is true for products of independent Ginibre matrices; compare [1]
and the references therein. Up to rescaling, the joint density of the squared singular
values of a product of m independent Ginibre matrices is given by
const det.xjk 1
/j;kD1;:::;n det..D j 1
w/.xk //j;kD1;:::;n ; (1.4.6)
relation between the squared singular values and the eigenvalues for these matrices
[76]. In particular, it turns out that the class of bi-unitarily invariant random matri-
ces corresponding to a density of the form (1.4.6) is closed under taking indepen-
dent products [77]. This leads to a fairly large class which includes many prominent
ensembles from non-Hermitian random matrix theory and which gives rise to new
universality problems.
1.4.3 Repulsive particle systems Götze and Venker [70] considered more general
probability measures on Rn with a density of the form
Y Y
f .x1 ; : : : ; xn / D const h.xj xk / e nQ.xj / dx1 dxn ; (1.4.7)
16j 6k6n 16j 6n
1.5.4 Lattice point counting problems The existence of m ¤ 0 with jQŒmj < is
a consequence of precise results for counting lattice pointsin elliptic as well as hyper-
bolic shells defined via a quadratic form QŒx in Rd and Ea;r WD fQŒx 2 Œa; rg for
18 F. Götze, H. Kösters
where ˛ 6 1. In case that the optimal exponent ˛ D 1 can be shown, the factor ır
may be bounded below or tend to zero depending on the Diophantine properties of
the coefficients of Q. Such error bounds have been studied for a long time, starting
with Landau (1915, 1924), Jarnik (1928) for special positive definite forms QŒx
being diagonal or with integer coefficients, where ˛ D 1 could be shown. Optimal
exponents have been shown for d > 9 and definite as well as indefinite general forms
QŒx first by Bentkus and Götze (1997). For dimension d > 5, the optimal exponent
was shown in [45] for E0;r and in [54] for Hr .a; b/ such that b a tends to infinity.
As for the dependence of Diophantine properties of Q, Davenport and Lewis (1972)
conjectured that for positive definite irrational forms the distances of the ordered
elements of QŒZ d for d > 5 converge to 0. This is related to the famous conjecture
by Oppenheim (1929) that QŒZ d is dense in R for indefinite irrational forms for
d > 5, which was proved by Margulis (1986) even for d > 3. Quantitative versions
of these conjectures for irrational forms were shown for d > 9 in Bentkus and Götze
(1997) as well as for d > 5 and Q positive-definite in [45]. In [54], the problem for
both cases has been solved for d > 5 by means of a unified approach.
Lower bounds for the error for E0;r in dimensions d D 2; 3; 4 show that the
optimal exponents satisfy ˛ > 1=4; 1=2; 1 respectively, multiplied by logarithmic
factors like .log r/ˇ ; ˇ > 0; d D 2; 3 resp. .log log r/
;
> 0; d D 4, whereas
lower error bounds for dimensions 5 and higher are just given by .r 2 /.
Crucial for the proof of ˛ D 1 and ır ! 0 in (1.5.1) is the investigation of theta
sums on the generalised Siegel upper half-plane, that is on matrices iQ C A, where
A; Q are real symmetric d d matrices with Q positive definite, given by
X
.iQ C A/ D expŒ mT Q m C imT A m: (1.5.2)
m2Z d
The number of lattice points m such that Q.m/ 6 r may be expressed as an integral
along a theta function .sQ/ expŒit r=.2s/ on a line s D t C ir 2 ; t 2 R (that is
a degenerate horocycle), with given by (1.5.2). The following inequality between
theta functions turned out to be an essential step:
X
j.sQ/j2 6 .t/ WD c.Q/r d expŒ Ht .m; n/; (1.5.3)
m;n
Ht .m; n/ WD r 2 km t Q nk2 C r 2
knk2 (1.5.4)
Convergence and Asymptotic Approximations 19
of .m; n/ 2 Z 2d . Using lattice density bounds for .t/, via the first d successive
Minkowski minima Mt;1 6 Mt;2 6 : : : 6 Mt;d of Ht , that is via
rd
.t/ 6 c.Q/ ; (1.5.5)
Mt;1 : : : Mt;d
the problem of estimating the lattice point remainder is transferred to the estimation
of an integral along .sQ/ together with questions in metric number theory and the
geometry of numbers. This approach is closely connected to the study of ergodic
properties of unipotent and quasi-geodesic flows in the papers of Eskin, Margulis and
Mozes (1998). Indeed, we may write the quadratic form Ht .m; n/ as
where
r 2 SL.2d; R/; r > 0 denotes a quasi-geodesic flow and ut 2 SL.2d; R/; t 2
R a unipotent flow in SL.2d; R/.
1.5.5 Central limit theorems Let B denote a domain in Rd with 0 2 B and smooth
boundary, which is symmetric to 0. Let X1 ; : : : ; Xn denote independent and identi-
cally distributed Rd -valued random vectors with EX1 D 0; EkX1 k4 < 1 and iden-
tity covariance. A classical problem in probability theory is the question of optimal er-
ror bounds in the central limit theorem for the distribution of sums Sn D X1 C CXn
of random vectors on the system of sets Br D r B; r > 0, i.e. the determination of
the exponent ˛ in
ˇ ˚ ˇ
ˇ ˇ
sup ˇP Sn 2 n1=2 Br PfS 2 Br gˇ D O .n ˛ /; (1.5.7)
r>0
lattice point problems and the CLT and the extensive literature in this field, see the
reviews in [44] and [71].
The approximation results for Hr .a; b/ described above were essential as well
for proving the above rate of convergence ˛ D 1 for quadrics in the CLT down to
dimension 5 in [71]. Here, Götze and Zaitsev proved, for example, an explicit error
bound for the elliptic or hyperbolic regions of type B WD fQŒx 6 1g in dimensions
d > 5,
This result concludes a long series of investigations starting with the seminal results
by Esséen (1945) mentioned above. It relies on the fact that the characteristic func-
tion of quadratic forms Q.Sn / for sums of independent identically distributed vector
summands may be estimated via the average over random matrices A in the char-
acteristic functions of bilinear forms hAT1 ; T2 i of two independent sums T1 ; T2 of
Rademacher vectors taking values in the lattice Z d . These characteristic functions
in turn may be estimated again using local limit theorems via theta sums as outlined
in (1.5.3). Hence, the decisive step of controlling the local fluctuations of the distri-
bution of such indefinite quadratic forms could be reduced to the estimates outlined
above in order to obtain a full correspondence in terms of dimensions and rates be-
tween both areas. In order to investigate distribution functions of quadratic forms, a
crucial technical obstacle had been to extend the averaging (in t) of the characteristic
described in (1.5.5) from the measure dt (sufficient for narrow shells Hr .a; b/ with
bounded intervals Œa; b) to the harmonic measure dt=t on the real line.
For lower dimensions, for example d D 3, one cannot expect these optimal rates
in view of the correspondence to the lattice point error problem in these dimensions.
which are analytic functions from CC to C and hence closely related to the class of
Nevanlinna functions. A remarkable difference described for free measure decompo-
sitions in [29] as compared to classical ones is that the infinitely divisible measures
without indecomposable factors are trivial Dirac measures only. This observation
holds not only for the free additive convolution, but also for the free multiplicative
convolution on the positive half-line as well as on the unit circle [29].
As for analogs of results of classical parametric statistics, the independence of
sample mean and sample covariance for Gaussian random variables has a counterpart
in free probability for free semi-circular random variables in von Neumann algebras;
see Chistyakov, Götze and Lehner [32].
1.6.2 Entropic limit theorems Let us return to the classical central limit theorem.
Let X1 ; X2 ; X3 ; : : : be i.i.d. real random variables with mean 0 and variance 1, and
let Sp
1 ; S2 ; S3 ; : : : denote the associated partial sums. Let fn denote the density of
Sn = n (if existent), and let ' denote the density of the standard normal distribution.
The classical local limit theorem states that
kfn 'k1 ! 0
if and only if there exists some n0 2 N such that fn0 is bounded.
Since the existence of bounded densities is still a strong condition, it seems natural
to look for similar characterisations under weaker conditions. One such result is the
entropic central limit theorem by Barron (1985), which states that D.fn j '/ ! 0 if
and only if there exists some n0 2 N such that D.fn0 j '/ < 1. Here,
Z
D.f j '/ WD L.f .x/='.x// '.x/dx
denotes the relative entropy of f with respect to ', and L.x/ WD x log x.
A nice interpretation of this result arises from the observation that D.fn j '/ D
H.'/ H.fn /, where H denotes Shannon entropy, and that the standard normal
distribution maximises Shannon entropy among all random variables of mean 0 and
variance 1. Thus, the system converges to the state of maximal entropy. In this
respect, let us also mention the result by Artstein, Ball, Barthe and Naor [5] that
the entropy tends to that of the standard normal distribution monotonously, in line
with the second law of thermodynamics. Moreover, in [6], these authors proved that
the rate of convergence is O .n 1 / if the random variables Xn satisfy the Poincaré
inequality. However, for more general probability densities, the question of the rate
of convergence remained open.
This question was answered completely in a series of papers by Bobkov,
Chistyakov and Götze [17, 20] from the last few years. Their analysis combined tools
from information theory (such as the entropy convolution inequality) with more clas-
sical results on characteristic functions and their asymptotic approximation. In [17],
the authors derived Edgeworth-type expansions for the entropic central limit theorem.
In particular, if the underlying random variables Xj have finite fourth moments, the
rate of convergence is still O .n 1/, and hence much better than in the classical cen-
tral limit theorem. This is related to the fact that the term of order O .n 1=2/ in the
classical asymptotic expansion of the density fn is an odd function, and hence van-
ishes when taking the entropy integral. Moreover, optimal rates for the case where
the random variables Xj have finite fractional moments of order 2 < s < 4 have
also been obtained [17]. These results rest on technically involved approximations
in the local limit theorem for sums of i.i.d. random variables in the case of fractional
moments. Furthermore, Berry–Esséen bounds in the entropic central limit theorem
have also been established [20].
Convergence and Asymptotic Approximations 23
Similar results were also derived for related notions from information theory, e.g.
Fisher information [18], as well as for other limit laws, e.g. stable distributions [19].
However, in the latter situation some complications arise. For instance, since higher
moments do not exist, asymptotic expansions are not available anymore. Moreover,
the full analogue of the entropic limit theorem can only be obtained for the non-
extremal stable laws. For the extremal stable laws, additional technical conditions are
needed [19]. This might be related to the fact that (non-Gaussian) stable distributions
do not maximise the entropy functional anymore.
Chistyakov and Götze [31] studied related questions for the entropic free central
limit theorem, i.e. they studied the convergence of Voiculescu’s free entropy to its
maximum value (under fixed variance) assumed for the semi-circle distribution. As-
suming a finite moment of order four, they showed that the rate of convergence in
the entropic free CLT is also of the order O.n 1 /. Furthermore, they obtained an ex-
pansion up to an error of order o.n 1/. These results were based on previous results
about expansions for densities in the free CLT [30].
Besides sums of i.i.d. random variables, maxima of sums of i.i.d. random variables
have also been considered (Bobkov–Chistyakov–Kösters [21]). Here, the limiting
distribution is given by the one-sided normal distribution, which also maximises a
suitably defined entropy functional. Furthermore, one also obtains a characterisation
result here: The maxima of the sums converge to the one-sided normal distribution
in relative entropy if and only if the original random variables Xj have finite relative
entropy with respect to the one-sided normal distribution. The proof is also based on
a combination of the entropy convolution inequality with more classical results for
characteristic functions, including Spitzer’s formula.
References
[1] G. Akemann and J. Ipsen, Recent exact and asymptotic results for products of independent
random matrices, Acta Phys. Pol. B 46 (2015), 1747.
[2] N. Alexeev, F. Götze and A. Tikhomirov, Asymptotic distribution of singular values of pow-
ers of random matrices. Lith. Math. J. 50 (2010), 121–132.
[3] G.W. Anderson, A. Guionnet and O. Zeitouni, An Introduction to Random Matrices, Cam-
bridge University Press, Cambridge (2010).
[4] T.V. Arak and A.Yu. Zaı̆tsev, Uniform limit theorems for sums of independent random vari-
ables, Proc. Steklov Inst. Math. 174 (1988), 1–222.
[5] S. Artstein, K.M. Ball, F. Barthe and A. Naor, Solution of Shannon’s problem on the mono-
tonicity of entropy, J. Amer. Math. Soc. 17 (2004), 975–982.
[6] S. Artstein, K.M. Ball, F. Barthe and A. Naor, On the rate of convergence in the entropic
central limit theorem, Probab. Th. Rel. Fields 129 (2004), 381–390.
[7] Z.D. Bai, Convergence rate of expected spectral distributions of large random matrices. I.
Wigner matrices, Ann. Probab. 21 (1993), 625–648.
[8] Z.D. Bai, Circular law, Ann. Probab. 25 (1997), 494–529.
24 F. Götze, H. Kösters
[9] Z.D. Bai, B. Miao and J. Tsay, A note on the convergence rate of the spectral distributions
of large random matrices, Statist. Probab. Lett. 34 (1997), 95–101.
[10] Z.D. Bai, J. Hu, G. Pan and W. Zhou, A note on rate of convergence in probability to semi-
circular law, Electron. J. Probab. 16 (2011), 2439–2451.
[11] A.R. Barron, Entropy and the central limit theorem, Ann. Probab. 14 (1986), 336–342.
[12] V. Beresnevich, Rational points near manifolds and metric Diophantine approximation,
Ann. of Math. 175 (2012), 187–235.
[13] V. Beresnevich, V. Bernik and F. Götze, The distribution of close conjugate algebraic num-
bers, Compos. Math. 146 (2010), 1165–1179.
[14] V. Beresnevich, V. Bernik, F. Götze and O. Kukso, Distribution of algebraic numbers and
metric theory of Diophantine approximation. In Limit Theorems in Probability, Statistics and
Number Theory (P. Eichelsbacher, G. Elsner, H. Kösters, M. Löwe, F. Merkl and S. Rolles,
eds.) Springer, Heidelberg, 2013, 23–48.
[15] V. Bernik, F. Götze and A. Gusakova, On points with algebraically conjugate coordinates
close to smooth curves, Mosc. J. Comb. Number Th. 6 (2016), 56–101.
[16] V. Bernik, F. Götze and O. Kukso, Lower bounds for the number of integral polynomials
with given order of discriminants, Acta Arithm. 133 (2008), 375–390.
[17] S.G. Bobkov, G.P. Chistyakov and F. Götze, Rate of convergence and Edgeworth-type ex-
pansion in the entropic central limit theorem, Ann. Probab. 41 (2013), 2479–2512.
[18] S.G. Bobkov, G.P. Chistyakov and F. Götze, Fisher information and the central limit theorem,
Probab. Th. Rel. Fields 159 (2014), 1–59.
[19] S.G. Bobkov, G.P. Chistyakov and F. Götze, Fisher information and convergence to stable
laws, Bernoulli 20 (2014), 1620–1646.
[20] S.G. Bobkov, G.P. Chistyakov and F. Götze, Berry-Esseen bounds in the entropic central
limit theorem, Probab. Th. Rel. Fields 159 (2014), 435–478.
[21] S. Bobkov, G. Chistyakov and H. Kösters, The entropic Erdős–Kac limit theorem,
J. Theor. Probab. 28 (2014), 1520–1555.
[22] S.G. Bobkov, F. Götze and A.N. Tikhomirov, On concentration of empirical measures and
convergence to the semi-circle law, J. Theor. Probab. 23 (2010), 792–823.
[23] Ch. Bordenave and D. Chafaï, Around the circular law, Probab. Surv. 9 (2012), 1–89.
[24] A. Borodin, Biorthogonal ensembles, Nuclear Phys. B 536 (1999), 704–732.
[25] E. Brézin and S. Hikami, Characteristic polynomials of random matrices, Com-
mun. Math. Phys. 214 (2000), 111–135.
[26] C. Cacciapuoti, A. Maltsev and B. Schlein, Bounds for the Stieltjes transform and the density
of states of Wigner matrices, Probab. Th. Rel. Fields 163 (2015), 1–59.
[27] G.P. Chistyakov and F. Götze, Limit theorems in free probability theory. I. Ann. Probab. 36
(2008), 54–90.
[28] G.P. Chistyakov and F. Götze, Limit theorems in free probability theory. II. Cent. Eur.
J. Math. 6 (2008), 87–117.
[29] G.P. Chistyakov and F. Götze, The arithmetic of distributions in free probability theory,
Cent. Eur. J. Math. 9 (2011), 997–1050.
Convergence and Asymptotic Approximations 25
[30] G.P. Chistyakov and F. Götze, Asymptotic expansions in the CLT in free probability,
Probab. Th. Rel. Fields 157 (2013), 107–156.
[31] G.P. Chistyakov and F. Götze, Edgeworth-type expansion in the entropic free CLT, preprint;
arXiv:1701.04352.
[32] G.P. Chistyakov, F. Götze and F. Lehner, Freeness of linear and quadratic forms in von Neu-
mann algebras, J. Funct. Anal. 261 (2011), 2829–2844.
[33] J.B. Conrey, D.W. Farmer, J.P. Keating, M.O. Rubinstein and N.C. Snaith, Autocorrelation
of random matrix polynomials, Commun. Math. Phys. 237 (2003), 365–395.
[34] P.A. Deift, Orthogonal Polynomials and Random Matrices: a Riemann–Hilbert Approach,
American Mathematical Society, Providence, RI (1999).
[35] Yu.S. Eliseeva, F. Götze and A.Yu. Zaitsev, Estimates for the concentration functions in the
Littlewood–Offord problem, J. Math. Sci. (N.Y.) 206 (2013), 146–158.
[36] F. Götze and A.Yu. Zaitsev, New applications of Arak’s inequalities to the Littlewood–Offord
problem, Eur. J. Math. 4 (2018), 639–663.
[37] L. Erdős, A. Knowles, H.-T. Yau and J. Yin, Spectral statistics of Erdős-Rényi graphs II:
eigenvalue spacing and the extreme eigenvalues, Commun. Math. Phys. 314 (2012), 587–
640.
[38] L. Erdős, A. Knowles, H.-T. Yau and J. Yin, The local semicircle law for a general class of
random matrices, Electron. J. Probab. 18 (2013), 1–58.
[39] L. Erdős, S. Péché, J.A. Ramírez, B. Schlein and H.-T. Yau, Bulk universality for Wigner
matrices, Commun. Pure Appl. Math. 63 (2010), 895–925.
[40] L. Erdős, B. Schlein and H.-T. Yau, Local semicircle law and complete delocalization for
Wigner random matrices, Commun. Math. Phys. 287 (2009), 641–655.
[41] L. Erdős and H.-T. Yau, Universality of local spectral statistics of random matrices,
Bull. Amer. Math. Soc. (N.S.) 49 (2012), 377–414.
[42] L. Erdős, H.-T. Yau and J. Yin, Rigidity of eigenvalues of generalized Wigner matrices,
Adv. Math. 229 (2012), 1435–1515.
[43] V.L. Girko, The circular law, Teor. Veroyatnost. Primenen. 29 (1984), 669–679.
[44] F. Götze, Lattice point problems and the central limit theorem in Euclidean spaces.
In Proceedings of the International Congress of Mathematicians, Vol. III (Berlin, 1998),
245–255.
[45] F. Götze, Lattice point problems and distribution of quadratic forms, Invent. Math. 157
(2004), 195–226.
[46] F. Götze and M. Gordin, Limit correlation functions for fixed trace random matrix ensembles,
Commun. Math. Phys. 281 (2008), 203–229.
[47] F. Götze, M. Gordin and A. Levina, Limit correlation functions at zero for fixed trace random
matrix ensembles. Zap. Nauchn. Sem. S.-Peterburg. Otdel. Mat. Inst. Steklov 341 (2007), 68–
80.
[48] F. Götze and A. Gusakova, On algebraic integers in short intervals and near smooth curves,
Acta Arithm. 179 (2017), 251–265.
[49] F. Götze, D. Kaliada and D. Zaporozhets, Distribution of complex algebraic numbers.
Proc. Amer. Math. Soc. 145 (2017), 61–71.
26 F. Götze, H. Kösters
[50] F. Götze, D. Koleda and D. Zaporozhets, Correlation functions of real zeros of random poly-
nomials, J. Math. Sci. (N.Y.) 229 (2018), 664–670.
[51] F. Götze, D. Koleda and D. Zaporozhets, Joint distribution of conjugate algebraic numbers:
a random polynomial approach, Adv. Math. (to appear); arXiv:1703.02289.
[52] F. Götze and H. Kösters, On the second-order correlation function of the characteristic poly-
nomial of a Hermitian Wigner matrix, Commun. Math. Phys. 285 (2009), 1183–1205.
[53] F. Götze, H. Kösters and A. Tikhomirov, Asymptotic spectra of matrix-valued functions of
independent random matrices and free probability, Random Matrices Theory Appl. 4 (2015),
1550005 (85 pp).
[54] F. Götze and G.A. Margulis, Distribution of values of quadratic forms at integral points,
preprint (2013). https://sfb701.math.uni-bielefeld.de/files/preprints/sfb13003.pdf
[55] F. Götze, A. Naumov and A. Tikhomirov, On minimal singular values of random matrices
with correlated entries. Random Matrices Theory Appl. 4 (2015), 1550006 (30 pp).
[56] F. Götze, A. Naumov and A. Tikhomirov, On a generalization of the elliptic law for random
matrices, Acta Phys. Polon. B 46 (2015), 1737–1745.
[57] F. Götze, A. Naumov, and A. Tikhomirov, Distribution of linear statistics of singular values
of the product of random matrices, Bernoulli 23 (2017), 3067–3113.
[58] F. Götze, A. Naumov and A. Tikhomirov, Local semicircle law under moment conditions.
Part I: The Stieltjes transform, preprint; arXiv:1510.07350.
[59] F. Götze, A. Naumov and A. Tikhomirov, Local semicircle law under moment conditions.
Part II: Localization and delocalization, preprint; arXiv:1511.00862.
[60] F. Götze, A. Naumov and A. Tikhomirov, Local semicircle law under moment conditions:
Stieltjes transform, rigidity and delocalization, Theory Probab. Appl. 62 (2018), 58–83.
[61] F. Götze, A. Naumov, A. Tikhomirov and D. Timushev, On the local semicircular law for
Wigner ensembles, Bernoulli 24 (2018), 2358–2400.
[62] F. Götze, A. Naumov and V. Ulyanov, Asymptotic analysis of symmetric functions,
J. Theor. Probab. 30 (2017), 876–897.
[63] F. Götze and A. Reshetenko, Asymptotic expansions in free limit theorems, preprint;
arXiv:1408.1360.
[64] F. Götze and A. Tikhomirov, Rate of convergence to the semi-circular law, Probab. Th.
Rel. Fields 127 (2003), 228–276.
[65] F. Götze and A. Tikhomirov, The rate of convergence for spectra of GUE and LUE matrix
ensembles, Cent. Eur. J. Math. 3 (2005), 666–704.
[66] F. Götze and A. Tikhomirov, The circular law for random matrices, Ann. Probab. 38 (2010),
1444–1491.
[67] F. Götze and A. Tikhomirov, On the asymptotic spectrum of products of independent random
matrices, preprint; arXiv:1012.2710.
[68] F. Götze and A. Tikhomirov, Optimal bounds for convergence of expected spectral distribu-
tions to the semi-circular law, Probab. Th. Rel. Fields 165 (2016), 163–233.
[69] F. Götze and A. Tikhomirov, Optimal bounds for convergence of expected spectral distribu-
tions to the semi-circular law for the 4 C moment ensemble, preprint; arXiv:1511.03692.
[70] F. Götze and M. Venker. Local universality of repulsive particle systems and random matri-
ces, Ann. Probab. 42 (2014), 2207–2242.
Convergence and Asymptotic Approximations 27
[71] F. Götze and A. Zaitsev, Explicit rates of approximation in the CLT for quadratic forms,
Ann. Probab. 42 (2014), 354–397.
[72] J. Gustavsson, Gaussian fluctuations of eigenvalues in the GUE, Ann. Inst. H. Poincaré
Probab. Statist. 41 (2005), 151–178.
[73] R.A. Horn and Ch.R. Johnson, Topics in Matrix Analysis. Cambridge University Press, Cam-
bridge (1991).
[74] K. Johansson, Universality of the local spacing distribution in certain ensembles of Hermitian
Wigner matrices, Commun. Math. Phys. 215 (2001), 683–705.
[75] J.P. Keating and N.C. Snaith, Random matrix theory and .1=2 C it/, Commun. Math. Phys.
214 (2000), 57–89.
[76] M. Kieburg and H. Kösters, Exact relation between singular value and eigenvalue statistics.
Random Matrices Theory Appl. 5 (2016), 1650015 (57 pp).
[77] M. Kieburg and H. Kösters, Products of random matrices from polynomial ensembles,
Ann. Inst. H. Poincaré Probab. Statist. 55 (2019), 98–126.
[78] D.Y. Kleinbock and G.A. Margulis, Flows on homogeneous spaces and Diophantine approx-
imation on manifolds, Ann. of Math. 148 (1998), 339–360.
[79] H. Kösters and A. Tikhomirov, Limiting spectral distributions of sums of products of non-
Hermitian random matrices, Probab. Math. Stat. 38 (2018), 359–384.
[80] T. Kriecherbauer, K. Schubert, K. Schüler and M. Venker, Global asymptotics for the
Christoffel–Darboux kernel of random matrix theory, Markov Proc. Rel. Fields 21 (2015),
639–694.
[81] T. Kriecherbauer and M. Venker, Edge statistics for a class of repulsive particle systems,
Probab. Th. Rel. Fields, 170 (2018), 617–655.
[82] A. Kuijlaars and D. Stivigny, Singular values of products of random matrices and polynomial
ensembles, Random Matrices Theory Appl. 3 (2014), 1450011 (22 pp).
[83] J.-O. Lee and J. Yin, A necessary and sufficient condition for edge universality of Wigner
matrices, Duke Math. J. 163 (2014), 117–173.
[84] H.L. Montgomery, The pair correlation of zeros of the zeta function, Analytic number theory,
Proc. Sympos. Pure Math., XXIV, American Mathematical Society, Providence, R.I., 1973,
181–193.
[85] A. Naumov, The elliptic law for random matrices, Vestnik Moskov. Univ. Ser. XV Vy-
chisl. Mat. Kibernet. (2013), 31–38, 48.
[86] H.H. Nguyen and S. O’Rourke, The elliptic law, Int. Math. Res. Not. 2015 (2015), 7620–
7689.
[87] H. Nguyen and V. Vu, Optimal inverse Littlewood–Offord theorems. Adv. Math. 226 (2011),
5298–5319.
[88] A.M. Odlyzko, On the distribution of spacings between zeros of the zeta function. Mathe-
matics of Computation 48 (177) (1987), 273–308.
[89] S. O’Rourke, D. Renfrew, A. Soshnikov and V. Vu, Products of independent elliptic random
matrices, J. Stat. Phys. 160 (2015), 89–119.
[90] S. O’Rourke and A. Soshnikov, Products of independent non-Hermitian random matrices,
Electron. J. Probab. 16 (2011), 2219–2245.
28 F. Götze, H. Kösters
[91] M. Rudelson, Invertibility of random matrices: norm of the inverse, Ann. of Math. 168
(2008), 575–600.
[92] M. Rudelson and R. Vershynin, The Littlewood–Offord problem and invertibility of random
matrices, J. Adv. Math. 218 (2008), 600–633.
[93] K. Schubert and M. Venker, Empirical spacings of unfolded eigenvalues, Electron. J. Probab.
20 (2015), no. 120, 1–37.
[94] T. Shcherbina, On the correlation function of the characteristic polynomials of the Hermitian
Wigner ensemble, Commun. Math. Phys. 308 (2011), 1–21.
[95] E. Strahov and Y.V. Fyodorov, Universal results for correlations of characteristic polynomi-
als: Riemann–Hilbert approach, Commun. Math. Phys. 241 (2003), 343–382.
[96] T. Tao and V. Vu, Inverse Littlewood–Offord theorems and the condition number of random
discrete matrices. Ann. of Math. 169 (2009), 595–632.
[97] T. Tao and V. Vu, Random matrices: universality of ESDs and the circular law. With an
appendix by M. Krishnapur, Ann. Probab. 38 (2010), 2023–2065.
[98] T. Tao and V. Vu, Random matrices: universality of local eigenvalue statistics, Acta Math.
206 (2011), 127–204.
[99] D.A. Timushev and A.N. Tikhomirov, Limit theorems for spectra of sums of random matri-
ces, Proc. Komi Science Center of Ural Division of RAS 187 (2014), 24–32.
[100] D.A. Timushev, A.N. Tikhomirov and A.A. Kholopov, On the accuracy of the approximation
of the GOE spectrum by the semi-circular law, Theory Probab. Appl. 52 (2008), 171–177.
[101] M. Venker, Particle systems with repulsion exponent ˇ and random matrices, Electron. Com-
mun. Probab. 18 (2013), no. 83 (12 pp).
[102] D. Voiculescu, Lectures on free probability theory. In Lectures on Probability Theory and
Statistics (P. Bernard, ed.), LNM 1738, Springer, Berlin, 2000, 279–349.
Chapter 2
Kolmogorov operators and SPDEs
M. Röckner
This part reports on some recent progress and remaining challenging open prob-
lems in the study of linear elliptic and parabolic Fokker–Planck–Kolmogorov equa-
tions obtained in Project B4.
where aij and b i are Borel functions on Rd such that A D .aij /16i;j 6d is a non-
negative symmetric matrix.
1 Projects A9, B4
30 M. Röckner
L D 0 (2.1.1)
if aij ; b i 2 L1loc .jj/, where jj denotes the variation of , and for every function
u 2 C01 .Rd / Z
Lu d D 0:
Rd
If is given by a density % with respect to the Lebesgue measure, Eq. (2.1.1) can be
written as an equation for the density,
d
X d
X
@x @x .aij %/ @x .b i %/ D 0: (2.1.2)
i j i
i;j D1 i D1
where aij and b i are Borel functions on Rd .0; T / such that A D .aij /16i;j 6d is a
non-negative symmetric matrix.
We say that a bounded (i.e. of bounded variation) Borel measure on Rd .0; T /
(possibly signed) is defined by a family of Borel locally bounded measures .t /0<t <T
on Rd if for every bounded Borel set B the mapping t 7! t .B/ is measurable and
.dx dt/ D t .dx/ dt. Below, we will deal with measures of bounded variation.
A Borel measure on Rd .0; T / defined by a family of measures .t /0<t <T
on Rd satisfies the Fokker–Planck–Kolmogorov equation
@t D L (2.1.3)
if aij ; b i 2 L1loc .jj; Rd .0; T // and if, for every function u 2 C01 .Rd .0; T //,
Z T Z
@t u.x; t/ C Lu.x; t/ dt dt D 0:
0 Rd
for the modulus of continuity ! of A. The result is this. Suppose that, on every ball, A
has a modulus of continuity satisfying the Dini condition. Let det A > 0 everywhere
q
and jbj 2 Lloc .Rd /, where q > d . Then, the density % of every solution (possibly,
signed) to the elliptic equation L D 0 has a continuous version.
If, in addition, the matrix A is locally Hölder continuous of order ı 2 .0; 1/,
then % has a density that is locally Hölder continuous of order ı. Actually, the afore-
mentioned results are proved for equations on domains, but we formulate them for
simplicity in the case of the whole space.
Increasing the regularity of A, one gets more properties for solution densities. If
p;1
the matrix elements aij belong to the local Sobolev class Wloc with p > d (which
by the Sobolev embedding theorem yields the existence of a continuous version) and
A is nondegenerate at every point, while the coefficients b i are either in Lploc .Rd / or
in Lploc .jj/, then the solution density belongs to the same local Sobolev class Wlocp;1
.
Therefore, Eq. (2.1.2) can be written in the classical divergence form
@x .aij @x %/ C @x aij % b i % D 0;
i j j
(with b i changed appropriately). Similar results hold in the parabolic case. The
reader is warned that the list of references is far from complete. A vast literature on
the subject is given in the recent monograph [11] presenting results of many authors.
2.1.2 Uniqueness of solutions In the elliptic case, there are examples of infinitely
differentiable b on Rd with d > 1 such that, for A D Id (the unit matrix), there
are several different (hence infinitely many) probability solutions to the equation
L D 0.
Similarly, in the parabolic case, we have constructed examples of infinitely differ-
entiable b.x/ on Rd with d > 3 such that for A D Id the Cauchy problem with some
initial probability measure has infinitely many probability solutions. Therefore,
certain additional conditions rather than smoothness are needed. We have found con-
ditions of this sort expressed in terms of integrability of the coefficients with respect
to solutions and (an alternative set of assumptions) in terms of so-called Lyapunov
functions. We have also found sufficient conditions for the uniqueness of integrable
solutions. In particular, we have shown that the uniqueness conditions for the class
I of integrable solutions essentially differ from those for the class P . For example,
the following results have been obtained (see [11]).
Theorem 2.1.1. The elliptic equation L D 0 has at most one probability solution
in either of the following cases, assuming in these cases (for simplicity) that A is
locally Lipschitz and b is locally bounded.
(i) There is a non-negative function V 2 C 2 .Rd / such that lim V .x/ D C1
jxj!1
and
LV .x/ 6 C V .x/ for some C > 0.
aij bi
(ii) ; 2 L1 ./ for some probability solution .
1 C jxj2 1 C jxj
Kolmogorov operators and SPDEs 33
Note that (i) does not assume (and does not ensure) the existence of a probability
solution, while (ii) being satisfied by some solution, guarantees that this solution is
the only one.
However, we do not know whether it can happen that the elliptic equation L D
0 with A D Id and smooth b has no probability solutions, but has a nonzero signed
solution in the class of bounded measures (this is impossible when d D 1).
It follows from the previous theorem that, in case of the unit diffusion matrix (or
a nondegenerate Lipschitz diffusion matrix) and bounded b, a probability solution is
unique (if it exists). However, the case of irregular A has not been studied. Even the
case of bounded continuous nondegenerate A (and bounded b) has not been investi-
gated. The problem of existence of solutions has been better studied.
Here are parabolic analogues.
Theorem 2.1.2. Let A.x/ be locally Lipschitz and nondegenerate and let b.x; t/ be
locally bounded. Suppose that there is a positive function V 2 C 2 .Rd / such that
lim V .x/ D C1 and
jxj!1
Then, for any probability measure on Rd , there is at most one probability solution
to the Cauchy problem, with initial condition .
Theorem 2.1.3. Let A D Id, and let b be a locally bounded vector field. Then, for
the uniqueness of a probability solution to the Cauchy problem, it suffices to have a
function V 2 C 2 .Rd / with lim V .x/ D C1 and jrV .x/j 6 C1 such that
jxj!1
LV .x; t/ 6 C2 ;
LV .x; t/ > C2
is sufficient.
In the case of a radial function V , such conditions actually mean that, for the
uniqueness of a probability solution, the quantity .b.x; t/; x/ should not tend to C1
too quickly, and for the uniqueness of an integrable solution, .b.x; t/; x/ should not
tend to 1 too quickly. Such a function V is called a Lyapunov function.
A challenging open problem in this area concerns the cases d D 1 and d D 2:
it is still unknown whether in these cases, for A D Id and smooth b, the Cauchy
problem with a probability initial data has at most one probability solution (as noted
above, there are counterexamples for all d > 3).
We now proceed to less regular diffusion matrices. Let U.x; r/ denote the ball of
radius r centred at x.
34 M. Röckner
Cr 2
tZ “
2 2
O.g; R/ D sup sup r jU.x; r/j jg.y; s/ g.z; s/j dy dz ds:
.x;t /2Rd C1 r6R
t y;z2U.x;r/
If lim O.g; R/ D 0, the function g is said to belong to the class VMOx .Rd C1 /. If
R!0
g 2 VMOx .Rd C1 /, then one can always assume that O.g; R/ 6 w.R/ for all R > 0,
where w is a continuous function on Œ0; C1/ and w.0/ D 0.
The following condition on A is used in our uniqueness result in case of A of low
regularity.
(H1) for every ball U Rd , there exist numbers
D
.U / > 0 and M D M.U / >
0 such that
.A.x; t/y; y/ >
jyj2 ; kA.x; t/k 6 M
for all .x; t/ 2 U Œ0; T and y 2 Rd .
Theorem 2.1.4. Suppose that aij 2 VMOx;loc .Rd Œ0; T / and that the matrix A D
.aij / satisfies condition (H1). Then, the set
We emphasise that this theorem gives uniqueness not in the whole class of proba-
bility solutions but only in its subclass specified by the integrability of the drift b (for
uniformly bounded b, this condition holds automatically).
Let us say a few words about the infinite-dimensional case which attracts many
researchers due to applications in stochastic partial differential equations, infinite-
dimensional diffusions, and mathematical physics. For simplicity, we consider equa-
tions on the space of real sequences R1 (the countable power of the real line). This
space is a complete separable metric space with the distance
1
X
n
d.x; y/ D 2 min.1; jxn yn j/:
nD1
Due to the special structure of this space, the main concepts (but not the results)
connected with Fokker–Planck–Kolmogorov equations on it are very similar to the
case of Rd . Namely, given Borel functions aij and b i on R1 such that the finite-
dimensional submatrices .aij /i;j 6n are symmetric non-negative-definite, we say that
a Borel probability measure on R1 satisfies the equation
L D 0
Kolmogorov operators and SPDEs 35
if the functions aij , b i are -integrable and for every function ' in finitely many
variables belonging to the corresponding class Cb1 one has
Z
L' d D 0:
R1
Obviously, the function L' is -integrable since the series becomes a finite sum of
-integrable functions. Set b WD .b i /1 i D1 .
Similarly, one defines a parabolic equation and the corresponding Cauchy prob-
lem.
As in the finite-dimensional case, interesting problems arise already for aij D ıij
(which in Rd would mean the unit diffusion matrix). For example, if b.x/ D x, the
standard Gaussian measure
on R1 (the countable power of the standard Gaussian
measure on the real line) is a solution to the corresponding equation. This measure is
the only probability solution. Indeed, for a general b, the projection of any solution
to Rn , denoted by n , satisfies the equation on Rn whose diffusion coefficients
anij and drift coefficients bni are the conditional expectations of the functions aij and
b i with i; j 6 n with respect to the measure and the -algebra Bn generated
by the first n coordinates. This is obvious from the definitions. Therefore, in case
of constant aij , we have anij D aij and, in case of b i with i 6 n depending on
x1 ; : : : ; xn , we have bni D b i . In particular, for b i .x/ D xi , we have bni .x/ D xi
whenever i 6 n. Therefore, in the situation under consideration, the projection of
any probability solution to Rn satisfies the same equation as the standard Gaussian
measure on Rn , but this equation admits only one probability solution.
However, the situation may change for other b. Actually, examples of linear b
are known such that the corresponding equation (with aij D ıij ) has several different
probability solutions that are Gaussian measures (hence there exist also non-Gaussian
solutions, their convex combinations).
A relatively simple case arises if we take
b.x/ D x C v.x/;
where the perturbation v takes values in the Hilbert space H D l 2 and is bounded in
the usual l 2 -norm. In this case, it is known that every probability solution to the
elliptic equation L D 0 is absolutely continuous with respect to
and is unique
(it is also known that a solution exists under the stated assumptions). Certainly, the
assumption that v is a bounded l 2 -valued perturbation is very restrictive.
As a typical result on uniqueness in infinite dimensions, we mention the following
theorem from [9]. Let us consider the following Cauchy problem.
Let B D .B i / be a sequence of Borel functions on R1 .0; T0 /, where T0 > 0
is fixed, and let aij be Borel functions on R1 .0; T0 /. Let us consider the Cauchy
36 M. Röckner
problem
@t D L ;
(2.1.5)
jt D0 D ;
where L is the formal adjoint operator for the differential operator L defined by
1
X 1
X
L'.x; t/ D aij .x; t/@x @x '.x; t/ C B i .x; t/@x '.x; t/;
i j i
i;j D1 i D1
for every smooth function ' depending on finitely many coordinates of x. We con-
sider the following condition.
(A) aij D aj i , each function aij depends only on the variables t; x1 ; x2 ; : : : ;
xmaxfi;j g and is continuous and, for every natural number N , the matrix AN D
.aij /16i;j 6N satisfies the following condition:
there exist positive numbers
N , N and ˇN 2 .0; 1 such that for all x; y 2
RN and t 2 Œ0; T0 one has
N jyj2 6 hAN .x; t/y; yi 6
N 1 jyj2 ;
kAN .x; t/ AN .y; t/k 6 N jx yjˇN ;
where k : k is the operator norm and j : j is the standard Euclidean norm.
Let be a Borel probability measure on R1 and let P be some convex set of prob-
ability solutions D t .dx/ dt to (2.1.5), i.e., t > 0 and t .R1 / D 1 for every
t 2 .0; T0 /, such that jB k j 2 L2 ./ for each k 2 N and the following condition
holds:
(B) for every " > 0 and every natural number d , there exist a natural number
N D N."; d / > d and a Cb2;1 -mapping .b"k /N kD1
W RN Œ0; T0 ! RN such
that
Z T0 Z
jAN .x; t/ 1=2 .BN .x; t/ b".x1 ; : : : ; xN ; t//j2 t .dx/ dt < ";
0 R1
Theorem 2.1.5. Assume that conditions (A) and (B) hold. Then, the set P contains
at most one element.
Let us illustrate condition (B) by several examples. We shall use the following
notation: given a sequence D .n /n>1 of positive numbers, the weighted Hilbert
space
n X1 o
2 2
l D x D .xn /W kxkl 2 D n xn2 < 1
nD1
P1
will be equipped with the inner product hx; yi D nD1 n x n y n .
Kolmogorov operators and SPDEs 37
Example 2.1.6.
(i) Let B k depend only on the variables t; x1 ; x2 ; : : : ; xk . Then, in order to ensure
our condition (B), we need only the inclusion jB k j 2 L2 ./ for all k > 1.
Indeed, we set N D d and approximate each function B k separately.
(ii) Let ˛ D .˛k /k>1 , ˛k > 0 for each k 2 N and 1=˛ WD .˛k 1 /k>1 . Suppose that
aij satisfy condition (A) and that there exists a positive number C independent
of N such that
jAN .x; t/ 1=2yj 6 C kykl 2
1=˛
Then, for every B k , we find a smooth function b"k depending on the first nk
variables such that
Z T Z
0
˛k 1 jB k b"k j2 dt dt < ".2M / 1 ; k D 1; : : : ; M:
0 R1
(iii) Finally, for aij as in (ii), we can combine both examples. Let B D G C F ,
where
G k ; F k 2 L2 ./; G k .x; t/ D G k .x1 ; x2 ; : : : ; xk ; t/; F .x; t/ 2 l1=˛
2
and
kF k1=˛ 2 L2 ./:
Obviously, for given B k of this type, the set of all probability solutions D
t .dx/ dt to (2.1.3) satisfying the previous integrability conditions is convex.
38 M. Röckner
An important and interesting problem concerns the study of conditions for unique-
ness of solutions to nonlinear FPK equations. The accomplished results and our anal-
ysis of the linear case can give some insight and serve as auxiliary tools. Another
important direction pursued by many researchers is related to the study of unique-
ness for martingale problems associated with second order operators for which we
consider FPK equations. Certainly, when dealing with uniqueness for martingale
problems and connections between various kinds of uniqueness (the Cauchy problem
for FPK equations, martingale problems, uniqueness of semigroups, etc.), it is quite
natural to consider also pseudo-differential operators.
2.1.3 Bounds for solution densities Here, we mention some typical results on up-
per bounds for solution densities.
In the next theorem, the conditions ensure bounds of the form %.x/ 6 C=ˆ.x/.
For simplicity, we formulate this result for the unit diffusion matrix, but a Sobolev
differentiable matrix leads only to slightly longer assumptions.
and
jbj 2 Lˇ .jj/; where ˇ > 1:
Let ˆ 2 C 1 .Rd / be a positive function such that, for some > d , one has
Then,
C
%.x/ 6
ˆ.x/
with some constant C .
For example, if jbj is locally bounded and belongs to L2 ./, the only real restric-
tion in this theorem is the integrability of jrˆj % over the whole space. In turn, this
Kolmogorov operators and SPDEs 39
There are also lower bounds on solution densities in the elliptic and parabolic
cases, which is connected with Harnack’s inequality. A detailed discussion of such
results is given in [11]. An interesting direction of research in this area concerns the
infinite-dimensional case. There, many natural questions of a very basic character
remain open. For example, there are no broad sufficient conditions ensuring that
probability solutions to stationary and parabolic FPK equations are strictly positive
when applied to balls in Hilbert spaces (a property which in the finite-dimensional
case is much weaker than the positivity of densities). Returning to the framework
described at the end of the previous section, one can consider the elliptic equation
on R1 with aij D ıij and continuous bounded b i . Suppose that is a probability
solution on R1 . Let E be a Hilbert space continuously embedded into R1 (say, a
weighted Hilbert space of sequences) with .E/ D 1. Is it true that is positive on
balls in E? Of course, this is stronger than being positive on non-empty open sets
in R1 , since such open sets contain open cylinders with finite-dimensional bases, so
that the positivity on them follows from the finite-dimensional case. However, the
finite-dimensional case does not help when we consider balls in E.
40 M. Röckner
Remark 2.1.10. There is an obvious connection between the results in this subsec-
tion and the general theory of heat kernel estimates, which are also intensively studied
in Chapter 3. For a particularly singular case see [37].
2.2.1 The stochastic total variation flow The main reference for this section is [6].
Consider the nonlinear diffusion equation
h i
dX.t/ D div sign .rX.t// dt C X.t/ dW .t/ on .0; T / O ;
X D 0 on .0; T / @O ; (2.2.1)
2
X.0/ D x 2 L .O /;
Remark 2.2.1.
(i) In nonlinear diffusion theory, (2.2.1) is derived from the continuity equation
perturbed by a Gaussian process proportional to the density X.t/ of the mate-
rial, that is,
dX.t/ D div J.rX.t// dt C X.t/ dW .t/;
where J D sign is the flux of the diffusing material. (See [15, 16, 17].)
(ii) (2.2.1) is also relevant as a mathematical model for faceted crystal growth un-
der a stochastic perturbation as well as in materials science (see [18] for the
deterministic model and complete references on the subject). The resulting
equations are differential gradient systems corresponding to a convex and non-
differentiable potential (energy).
(iii) Other recent applications refer to the PDE approach to image recovery (see,
e.g., [12] and also [4, 13]). In fact, if x 2 L2 .O / is the blurred image, one
might find the restored image via the total variation flow X D X.t/ generated
by (2.2.1). In its deterministic form, this is the so-called total variation based
image restoration model and its stochastic version (2.2.1) arises naturally in
this context as a perturbation of the total variation flow by a Gaussian (Wiener)
noise.
H01 .O / WD W01;2 .O /:
BV .O / WD space of functions u W O ! R with bounded variation
Z
1 N
kDuk WD sup u div ' d W ' 2 C0 .O I R /; j'j1 6 1
O
Z
D jruj d; if u 2 W 1;1 .O / :
O
0
BV .O / WD all u 2 BV .O / vanishing on @O :
42 M. Röckner
Define
( R
kDuk C @O j
0 .u/j dHN 1
; if u 2 BV .O / \ L2 .O /;
.u/ WD
C1; if u 2 L2 .O / n BV .O /;
where 2 .0; 1, e .u/ D .u/ C u; 8u 2 RN . Here, is the Yosida approxi-
mation of the function .u/ D sign u, that is,
8 1
< u; if juj 6 ;
.u/ D : u
; if juj > :
juj
2.2.2 A new approach to SPDEs The main reference for this section is [7]. Con-
sider the following stochastic differential equation with linear multiplicative noise on
a separable Hilbert space .H; h ; i/
where:
(a) ˇj ; j 2 N, are independent (real) Brownian motions on a stochastic basis
.; F ; .Ft /; P/,
(b) ej 2 C 2 .ON / \ H; j 2 N, is an ONB of H , and there exist j 2 .0; 1/ such
that 8j 2 N
jy ej jH 6 j jej j1 jyjH ; y 2 H I
(c) j 2 R; j 2 N, such that
1
X
WD 2j .1 C 2j / jej j21 < 1:
j D1
Our assumptions on the drift A (where we write A.t; u/ instead of A.t; u; !/) are:
0
AW Œ0; T V ! V is progressively measurable and there exist p 2 .1; 1/,
ı 2 Œ0; 1/, c1 2 .0; 1/, ci 2 R, i 2 f2; : : : ; 6g such that on , 8t 2 Œ0; T ,
0
(“demicontinuous”) A.t; / W V ! V is strongly-weakly continuous,
(“quasi-monotone”)
V
0 hA.t; u/ A.t; v/; u viV > ı ju vj2H 8u; v 2 V;
(“coercive”)
V
0 hA.t; u/; uiV > c1 jujpV C c2 juj2H C c3 8u 2 V;
(“boundedness”)
jA.t; u/jV 0 6 c4 jujVp 1
C c5 jujH C c6 8u 2 V:
p
We fix p 2 .1; 1/ as above from now on and set p0 WD p 1
.
Theorem 2.2.7. Under the above hypotheses on the noise and drift, (2.2.4) has a
unique solution for every x 2 H . Moreover, t 7! e W .t / X.t/ is V 0 -absolutely
continuous on Œ0; T P-a.s and
Z T ˇ ˇp0
ˇ W .t / d ˇ
E ˇe .e W .t / X.t//ˇ 0 dt < 1:
0 dt V
46 M. Röckner
Theorem 2.2.9. Under the above hypotheses on the noise and drift, (2.2.5) has a
unique solution for every x 2 H .
V
0 hA.t; u/ A.t; v/; u viV > ju vj2H 8u; v 2 V:
One main problem is the following:
y 7! e W .t /
A.t; eW .t / y/ is no longer monotone from V to V 0
for fixed t 2 Œ0; T ; ! 2 :
So, we change to another Gelfand triple
V H V 0;
with spaces V , H, V 0 consisting of stochastic processes with norms scaled by eW
and study the resulting equations there. This is the main idea of the new approach to
SPDEs, announced in the title of this section.
Kolmogorov operators and SPDEs 47
a) For p > 2:
V H V 0;
B y C Ay D 0; (2.2.6)
2.2.3 Pathwise uniqueness for SDEs on Hilbert spaces with a merely bounded
measurable drift part The main reference for this section is [14]. Consider on a
separable (real) Hilbert space .H; h ; iH / (norm j:jH ) for a cylindrical Brownian
motion Wt , t > 0, on .; F ; .Ft /; P/, and BW H ! H bounded, Borel,
Assumptions:
1
.H1/ AW D.A/ H ! H self-adjoint and A 6 !I for some ! > 0, with A
of trace class.
.H 2/ V W H ! . 1; C1 convex, lower-semicontinuous, lower bounded func-
tion and let DV be the set of all x 2 fV < 1g such that its Gâteaux deriva-
tive rV exists at x.
.H 3c/0 There exists a separable Banach space E H; continuously and densely
embedded, such that E DV ,
.E/ D 1 and on E the function V is twice
Gâteaux-differentiable such that for all x 2 E its second Gâteaux derivative
VE00 .x/ 2 L.E; E 0 / (with E 0 being the dual of E) extends by continuity to
an element in L.H; E 0 / such that
Here,
and
WD N.0; Q/, i.e. the centred Gaussian measure in H with covariance Q D
1
2
A 1 and
Z
1 V .x/
.dx/ WD e
.dx/; Z WD e V .x/
.dx/:
Z
H
Remark 2.2.15. Theorem 2.2.14 generalises the seminal work [20] from Rd to infinite-
dimensional (Hilbert) spaces.
Kolmogorov operators and SPDEs 51
Idea of proof of Theorem 2.2.14. The idea of proof is to rewrite (2.2.7) in “elliptic
coordinates” as follows: Let i 2 .0; 1/; i 2 N, be the eigenvalues of A. Let
> 4kjBjH k21 and consider the following PDE’s for i 2 N:
where B i .x/ WD hB.x/; ei iH ; x 2 H . Then, one can prove that each ui is Lipschitz,
and
X1
U.x/ WD ui .x/ei ; x 2 H;
i D1
'.x/ WD x C U.x/; x 2 H:
1 3
jx yjH 6 j'.x/ '.y/jH 6 jx yjH :
2 2
Rewritten in these new coordinates, Eq. (2.2.7) becomes
d' i Xt D i Xti Di V Xt dt C C i ui Xt dt
˝ ˛ (2.2.9)
C Dui Xt ; dWt H C dWti ;
where i 2 N, Xti WD hXt ; ei i and Wti WD hWt ; ei i (so B does not appear anymore).
Fix z 2 SV and let X; Y be two solutions of (2.2.7) on .; F ; .Ft /; P/. Then, by
(2.2.9), we have for all i 2 N
dŒ' i .Xt / ' i .Yt / D Œi .Xti Yti / C Di V .Xt / Di V .Yt / dt C . C i /Œui .Xt /
ui .Yt / dt C hDui .Xt / Dui .Yt /; dWt iH :
that
h i
At
E e j'.Xt / '.Yt /j2H
1
62 jXs Ys jH 6j'.Xs / '.Ys /jH
Z t ‚ …„ ƒ
As
6 2E e j'.Xs / '.Ys /jH jU.Xs / U.Ys /jH ds
0
Z t
2 jrV .Xs / rV .Ys /jH
CE e As
j'.Xs / '.Ys /j2H ds
0 j'.Xs / '.Ys /jH
Z t X 1
.ui .Xs / ui .Ys //2
CE e As
j'.Xs / '.Ys /j2H 2 i ds
0 i D1
j'.Xs / '.Ys /j2H
Z 1 ˇ
ˇ i ˇ2
t
As 2
X Du .Xs / Dui .Ys /ˇH
CE e j'.Xs / '.Ys /jH ds
0 i D1
j'.Xs / '.Ys /j2H
Z t
E e As
j'.Xs / '.Ys /j2H dAs :
0
At < 1 P-a.s.
For the proof of this, which is quite hard and technical, we refer to [14].
The case where in (2.2.7) the nonlinear part of the drift is not gradient type is
much more difficult and is at present under further study.
References
[1] S. Albeverio, Y.G. Kondratiev, Y. Kozitsky and M. Röckner, The Statistical Mechanics of
Quantum Lattice Systems: A Path Integral Approach, EMS Tracts in Mathematics 8, European
Mathematical Society, Zürich, 2009.
[2] F. Andreu, V. Caselles, J. Díaz and J. Mazón, Some qualitative properties for the total variation
flow, J. Funct. Anal. 188(2) (2002), 516–547.
[3] F. Andreu-Vaillo, V. Caselles and J.M. Mazón, Parabolic Quasilinear Equations Minimizing
Linear Growth Functionals, Progress in Mathematics 223, Birkhäuser, Basel, 2004.
[4] T. Barbu, V. Barbu, V. Biga and D. Coca, A PDE variational approach to image denoising and
restoration, Nonlinear Anal. Real World Appl., 10(3) (2009), 1351–1361.
Kolmogorov operators and SPDEs 53
[5] V. Barbu, G. Da Prato and M. Röckner, Stochastic Porous Media Equations, LNM 2163,
Springer, Cham, 2016.
[6] V. Barbu and M. Röckner, Stochastic variational inequalities and applications to the total vari-
ation flow perturbed by linear multiplicative noise, Arch. Ration. Mech. Anal. 209(3) (2013),
797–834.
[7] V. Barbu and M. Röckner, An operatorial approach to stochastic partial differential equations
driven by linear multiplicative noise, J. Eur. Math. Soc. (JEMS) 17(7) (2015), 1789–1815.
[8] W.-J. Beyn, B. Gess, P. Lescot and M. Röckner, The global random attractor for a class of
stochastic porous media equations, Comm. Partial Differential Equations 36(3) (2011), 446–
469.
[9] V.I. Bogachev, G. Da Prato, M. Röckner and S.V. Shaposhnikov, An analytic ap-
proach to infinite-dimensional continuity and Fokker–Planck–Kolmogorov equations,
Ann. Sc. Norm. Super. Pisa Cl. Sci. 14(3) (2015), 983–1023.
[10] V.I. Bogachev, N.V. Krylov and M. Röckner, On regularity of transition probabilities
and invariant measures of singular diffusions under minimal conditions, Commun. Partial
Diff. Equs. 26(11–12) (2001), 2037–2080.
[11] V.I. Bogachev, N.V. Krylov, M. Röckner and S.V. Shaposhnikov, Fokker–Planck–Kolmogorov
Equations, Mathematical Surveys and Monographs 207, American Mathematical Society,
Providence, RI, 2015.
[12] A. Chamballe and P.L. Lions, Image recovery via total variation minimization, Numer. Math.
76(2) (1997), 17–31.
[13] T. Chan, S. Esedogly, F. Park and A. Yip, Total variation image restoration. Overview and
recent developments, In Handbook of Mathematical Models in Computer Vision, Springer,
New York (2006), 17–31.
[14] G. Da Prato, F. Flandoli, M. Röckner and A. Veretennikov, Strong uniqueness for SDEs in
Hilbert spaces with nonregular drift, Ann. Probab. 44(3) (2016), 1985–2023.
[15] Y. Giga and R. Kobayashi, On constrained equations with singular diffusivity, Methods
Appl. Anal. 10(2) (2003), 253–278.
[16] Y. Giga and R.V. Kohn, Scale invariant extinction time estimates for some singular diffusion
equations, Discr. Cont. Dynam. Syst. A 30(2) (2011), 509–535.
[17] M.H. Giga and Y. Giga, Generalized motion by nonlocal curvature in the plane, Arch. Ra-
tion. Mech. Anal. 159(4) (2001), 295–333.
[18] R. Kobayashi and Y. Giga, Equations with singular diffusivity, J. Stat. Phys. 95(5–6) (1999),
1187–1220.
[19] W. Liu and M. Röckner, Stochastic Partial Differential Equations: An Introduction, Springer,
Cham, 2015.
[20] A.Y. Veretennikov, On strong solutions and explicit formulas for solutions of stochastic inte-
gral equations. Math. USSR Sbornik 39 (1981), 387–403.
Chapter 3
Analysis and stochastic processes on metric
measure spaces
A. Grigor’yan
This contribution deals with the properties of certain differential and nonlocal oper-
ators on various spaces, with the emphasis on the relationship between the analytic
properties of the operators in question and the geometric properties of the underly-
ing space. In most situations, these operators are Markov generators. In such cases,
we are also concerned with probabilistic aspects, such as the path properties of the
corresponding Markov process.1
u C u 6 0 (3.1.1)
where > 1 is a constant, and ask if it has a positive solution u on M . This question
was initially motivated by certain problems in differential geometry, but after many
years of research of many authors it has become a popular question in PDEs.
1 Project A6
56 A. Grigor’yan
A classical result of Gidas and Spruck [15] says that the equation
u C u D 0 in Rn
where
2 1
p D and q D ; (3.1.4)
1 1
then the inequality (3.1.1) has no positive solution.
Note that p > 2 so that the assumption (3.1.3) is weaker than (3.1.2).
The conditions (3.1.3)–(3.1.4) are sharp in the following sense: if
2 1
p D and q > ;
1 1
then there is an example of M satisfying (3.1.3) and having a positive solution of
(3.1.1).
Similarly, An and Bn are defined for n < 0. The main result of [35] is the following
theorem.
1
g0 .x; y/ ' loghxi ^ loghyi C logC ;
jx yj
which follows from the estimate (3.1.11) of the heat kernel of C ˆ0 that is dis-
cussed below. The sign ' means that the ratio of both sides is between two positive
constants.
Although (3.1.6) covers most previously known upper bounds of Neg.ˆ; R2 /,
it still does not apply to some interesting potentials in R2 where the finiteness of
Neg.ˆ; R2 / can be seen in an ad hoc way.
58 A. Grigor’yan
3.1.1.3 Estimates of the Green function Another question about Schrödinger op-
erators is how to obtain estimates of the Green function gˆ .x; y/ of C ˆ on an
arbitrary manifold M in terms of the Green function g.x; y/ of . The following
universal lower estimate was proved in [18].
Theorem 3.1.3. On any nonparabolic Riemannian manifold M and for any function
ˆ > 0, we have
R
M g.x; z/ g.z; y/ ˆ.z/dz
gˆ .x; y/ > g.x; y/ exp : (3.1.7)
g.x; y/
A striking feature of this result is that it does not require any restriction on M .
Moreover, the same result holds in a higher generality of abstract harmonic spaces.
3.1.2 The heat equation A central object in the analysis on manifolds is the heat
kernel pt .x; y/, that is, the fundamental solution of the heat equation
@t u D u;
where t > 0 denotes time and x; y are points of M . For example, if M D Rn , then
the heat kernel is given by the classical Gauss–Weierstrass formula
!
1 jx yj2
pt .x; y/ D exp :
.4 t/n=2 4t
The problem of obtaining heat kernel estimates under certain geometric assump-
tions on the underlying manifold M has been extensively studied for several decades
(see [8], [12, 51]). For example, if the manifold M is geodesically complete and has
non-negative Ricci curvature, then, by a theorem of Li and Yau [45],
C d 2 .x; y/
pt .x; y/ p exp ; (3.1.8)
V .x; t/ ct
where d is the geodesic distance on M , and C; c are positive constant. The sign
means that both 6 and > are true, but with different values of C; c.
3.1.2.1 Heat kernels on connected sums Here, we consider heat kernel estimates
on the connected sum M1 #M2 of two manifolds M1 ; M2 of equal dimensions. By
definition, M1 #M2 denotes any manifold that is obtained by connecting exterior do-
mains in M1 and M2 via a compact connected manifold. For example, even estimat-
ing the heat kernel on Rn #Rn is a highly non-trivial task. Although the first approach
to the latter problem was initiated in [6] in 1996, the full answer was only obtained in
[41] in 2009.
Analysis and stochastic processes on metric measure spaces 59
Theorem 3.1.4. If x; y are two points lying on different sheets of M D Rn #Rn with
n > 3, then, for large enough t; jxj ; jyj,
C 1 1 d 2 .x;y/
pt .x; y/ n=2 C e ct :
t jxjn 2 jyjn 2
V .oi ; r/ ' r ˛i ;
and
˛ D min f˛i W 1 6 i 6 kg :
Then, for all t 1, x 2 Mi and y 2 Mj with i ¤ j and large enough jxj ; jyj,
!
1 1 1
pt .x; y/ C C ˛ =2 C
t ˛=2 jxj˛i 2 jyj˛j 2 t j jxj˛i 2 t ˛i =2 jyj˛j 2
.2 ˛i /C . 2 ˛j / C d 2 .x; y/
jxj jyj exp :
ct
M1 D R1C S2 and M2 D R3 :
In this case, the manifold M1 is parabolic with the volume growth exponent ˛1 D 1,
and M2 is non-parabolic with ˛2 D 3. It follows that
Theorem 3.1.6. Let x and y p be two points that lie on different sheets of M D
R2 #R2 . Then, for jxj ; jyj > t 1,
C 1 1 d 2 .x;y/
pt .x; y/ C e ct ;
t log jxj log jyj
p
while for jxj ; jyj 6 t we have
C p 2
p
pt .x; y/ p log t C log t log jxj log jyj :
t log2 t
Theorem 3.1.7. Let ˆ be a non-zero function with compact support in R2 . Then, the
heat kernel of H satisfies
C loghxi loghyi jx yj2
ptˆ .x; y/ p p e ct : (3.1.11)
t log.hxi C t / log.hyi C t/
Analysis and stochastic processes on metric measure spaces 61
loghxi loghyi
ptˆ .x; y/ ' :
t log2 t
which is on the borderline between the short and long range potentials.
Theorem 3.1.8. Let ˆ be a potential (3.1.12) in Rn with n > 2. Then, the heat kernel
of H , for all t > 0 and x; y 2 Rn , satisfies the estimate:
ˇ ˇ
C 1 1 1 1 jx yj2
ptˆ .x; y/ p C p C e ct ; (3.1.13)
t n=2Cˇ t hxi t hyi
where r 2
n n
ˇ D C1C 1 C b:
2 2
In particular, in the most interesting range t > hxi2 C hyi2 ; the estimate (3.1.13)
becomes
hxiˇ hyiˇ
ptˆ .x; y/ ' n=2Cˇ :
t
Note that the value of the coefficient b in (3.1.12) determines the exponent n2 C ˇ of
the power decay of the heat kernel as t ! 1. Since b takes values in .0; 1/, the
exponent of t ranges in . n2 ; 1/.
For comparison, let us mention that, for a long range potential
.2 ˛/
ˆ.x/ D b jxj ; for all jxj > 1;
with ˛ 2 .0; 2/, the long time decay of the heat kernel is already superpolynomial as
follows: 2 ˛
ptˆ .0; 0/ C exp ct 2C˛ :
3.1.2.3 Heat kernels of operators with singular drift Consider in Rn n f0g the
operator
Lu D u r ru
with a singular potential
˛
.x/ D jxj ;
where ˛ > 0. We have proved in [37] the following estimates of the heat kernel of L.
62 A. Grigor’yan
and
˛
sup pt .x; x/ > exp ct ˛C2 ;
x
for some positive constants C; c.
The singularity of the drift term at the origin causes a higher rate of blow up of
the heat
kernel
˛
at t ! 0, and the fact that the latter should be given by the term
exp t ˛C2 is not obvious at all and was not predicted by any “physical” argument.
By a suitable transformation, we reduce the problem to estimating the heat kernel
of a weighted Laplace operator, and the latter amounts to proving a certain isoperi-
metric inequality on a weighted manifold .Rn ; /, where the measure is given by
1
d D exp dx:
jxj˛
Due to specific properties of this measure , the previously known methods for ob-
taining isoperimetric inequalities on warped products did not work, and we had to
develop [37] a new machinery for that.
Fix two positive parameters ˛; ˇ. We look for conditions on the measure and
energy that would ensure the following heat kernel bounds:
sub-Gaussian bound in the local case:
0 ! ˇ1 1 1
ˇ
C d .x; y/
pt .x; y/ ˛=ˇ exp @ c AI (3.2.2)
t t
It was proved in [23] that, in both cases, ˛ is the Hausdorff dimension of .M; d /
and, moreover,
V .x; r/ ' r ˛ : (3.2.4)
In the case of (3.2.2), the parameter ˇ is called the walk dimension, which is an
invariant of .M; d / as well. By [23], ˇ > 2 in this case (in fact, ˇ > 2 for the most
interesting fractals). In the case of (3.2.3), the parameter ˇ is called the index of the
associated jump process.
There are many reasons for considering these two types of estimates. Firstly,
both are known to hold on various families of fractals, in particular, on the Sierpinski
gasket and carpet (cf. [2]).
Secondly, the following dichotomy was proved in [30]: if pt .x; y/ satisfies the
estimate
˛=ˇ d.x; y/
pt .x; y/ C t ˆ
t 1=ˇ
with some function ˆ, then this has to be either (3.2.2) or (3.2.3).
It was proved in [3] that the sub-Gaussian estimate (3.2.2) is equivalent to the
parabolic Harnack inequality .
An important problem is to find some practical conditions on .M; d; / and .E ; F /
that should be equivalent to (3.2.2) resp. (3.2.3).
Some results about existence of the heat kernel and its upper bounds were obtained
in [20, 22, 27].
In order to state the results about equivalent conditions for the estimates (3.2.2),
let us first define the following notions.
E .u; '/ D 0
Definition 3.2.2. We say that the uniform elliptic Harnack inequality is satisfied if
there is a constant C such that, for any function u 2 F that is harmonic and non-
negative in a ball B.x; r/ M ,
Definition 3.2.3. For any compact set K M and open set K, define the
capacity of the capacitor .K; / by
˚
cap.K; / D inf E .'; '/ W ' 2 F \ C0 ./; 'jK 1 :
The series of works [22, 44, 21] leads to the following result.
In the most interesting case ˇ > 2 that typically occurs in fractals, one replaces
the Poincaré inequality (3.2.6) by the ˇ-Poincaré inequality
Z Z
c
dhf; f i > ˇ .f f /2 d; (3.2.7)
B.x;r/ r B.x;r/
where hf; f i is the energy measure of f . Then, both (3.2.4) and (3.2.7) are also
necessary for (3.2.2), but not sufficient.
In order to state the next result, we need the notion of a generalised capacity.
66 A. Grigor’yan
Definition 3.2.5. Let u 2 F \ L1 .M /. For any compact set K M and open set
K, define the generalised capacity of the capacitor .K; / by
˚
capu .K; / D inf E .u2 '; '/ W ' 2 F \ C0 ./; 'jK 1 :
Theorem 3.2.6. The estimate (3.2.2) is equivalent to the conjunction of three prop-
erties:
the volume regularity (3.2.4);
the ˇ-Poincaré inequality (3.2.7);
the generalised capacity estimate: for any function u 2 F \ L1 and for any
two concentric balls B1 WD B.x; R/ and B2 WD B.x; R C r/;
Z
C
capu .B1 ; B2 / 6 ˇ u2 d: (3.2.8)
r B
However, the latter condition is still difficult to check. Our conjecture is that it
can be replaced by a simpler capacity condition (3.2.5). Note that (3.2.8) with u D 1
and R D r is equivalent to (3.2.5).
A similar question is in place for the stable-like estimate (3.2.3). Some approach
to upper bounds was developed in [24]. The equivalent conditions for the two-sided
estimates (3.2.3) in the case ˇ < 2 were obtained by Z.-Q. Chen and T. Kumagai
[9], who proved that (3.2.3) is equivalent to the volume regularity (3.2.4) and the
following estimate of the jump kernel J :
1
J.x; y/ ' (3.2.9)
d.x; y/˛Cˇ
(jump kernels of this type in Rn are considered also in Chapter 8). The condition
(3.2.9) replaces the Poincaré inequality in this case. The case ˇ > 2 is still open.
There is one specific setting though where obtaining heat kernel bounds for the
jump kernel J.x; y/ D d.x; y/ .˛Cˇ / is relatively easy for any ˇ > 0: this is the
case when .M; d / an ultra-metric space. The theory of Markov processes on ultra-
metric spaces was developed in [5], using specific properties of an ultra-metric. In
particular, this theory applies when M D Q p is the space of p-adic numbers with the
p-adic distance, and yields the estimate (3.2.3) with ˛ D 1 (see the estimate (3.2.12)
in Section 3.2.3 below).
Since W .dy/ corresponds in our notation to J.x; y/d.y/, we see that the Euclidean
distance in Rn is adapted to any Lévy process.
The main result of [26] is the following theorem.
Theorem 3.2.7. If J and d are adapted and if, for some x 2 M and c > 0,
then the jump process with the jump kernel J is stochastically complete.
that is obviously stronger than the usual triangle inequality. The ultra-metric inequal-
ity implies that any two metric balls B.x; r/, B.y; r/ of the same radius are either
disjoint or identical. This in turn implies that, for any non-negative real r, the family
of all distinct balls of radius r form a partition of M .
Let .M; d / be a locally compact ultra-metric space. A model example is the field
Q p of p-adic numbers with the p-adic distance or its straightforward generalisation
Q np . Fix a Radon measure on M with full support, a probability distribution func-
tion .r/ on Œ0; C1/ and define the following operator P on functions on M :
Z 1 Z
1
Pf .x/ D f d d .r/ (3.2.11)
0 .B.x; r// B.x;r/
(cf. [4, 5]). This operator is clearly a Markov operator. As it follows from the
aforementioned property of ultra-metric balls, P is a bounded non-negative definite,
self-adjoint operator in the Hilbert space L2 .M; /. The latter allows us to define
the heat semigroup fPt gt >0 simply by Pt D P t and, hence, the associated continu-
ous time random walk fXt gt >0 on M (note that typically Markov operators are not
positive definite, so that the operator P t cannot be defined in general).
The spectral decomposition for Pt follows easily from the representation given
in (3.2.11), which leads to explicit expression for the heat kernel pt .x; y/ of Pt and
then also to simple estimates of pt .x; y/ (see [5]).
68 A. Grigor’yan
where ˛; c > 0:
Theorem 3.2.8. In Q p , the heat kernel of the heat semigroup fPt g with the above
probability distribution function .r/ satisfies the estimate
t
pt .x; y/ ' ; (3.2.12)
.t 1=˛ C kx ykp /1C˛
for all t > 0 and x; y 2 Q p . Consequently, the Green function g.x; y/ of fPt g is
finite if and only of ˛ < 1, and in this case
As a locally compact abelian group, Q p has the dual group that is again Q p .
Hence, the Fourier transform is defined as a unitary operator in L2 .Q p ; /. Using
the Fourier transform, Vladimirov and Volovich [52, 53] introduced a class D˛ of
fractional derivatives on functions on Q p . This operator acts as follows:
Z
˛ p˛ 1 f .x/ f .y/
D f .x/ D d.y/:
1 p ˛ 1
kx yk1C˛p
Qp
Theorem 3.2.9. The operator D˛ coincides with the generator of the semigroup fPt g
with the probability distribution function
It does not seem possible to obtain this estimate of the heat kernel of D˛ by using
Fourier analysis approach.
where p > 0 and 1 D f0g. The homology groups Hp D Hp .G/ of this chain
complex are called the path homologies of G.
There are also dual notions of d -invariant p-forms, cochain complex .G/ and
path cohomologies H .G/ of G that we do not address here. If G is an (undirected)
graph, G can always be considered as a digraph, by turning each edge of G into a
double arrow.
There has been a number of attempts to define the notion of .co/homology for
graphs. For example, one can consider a graph as an one-dimensional simplicial
complex, or take into account all its cliques (= complete subgraphs) as simplexes of
the corresponding dimensions. However, such homologies do not ususally have the
necessary functorial properties.
Another approach to homologies of digraphs can be realised via Hochschild ho-
mologies, using a natural path algebra of a graph. However, it is known that, in this
case, the Hochschild homologies of order > 2 are trivial, which makes this approach
useless. In singular homology theories of graphs, certain "small" graphs are pre-
defined as basic cells. However, simple examples show that the singular homology
groups do depend essentially on the choice of the basic cells.
Our notion of path homologies of digraphs has the following advantages.
The path homologies of all dimensions can be non-trivial; even for planar
graphs, they can be non-trivial in dimension 2.
The path homologies can be easily computed using any software package con-
taining operations with matrices.
The path homology theory is compatible with the homotopy theories of graphs
[1] and digraph [31].
The path homologies have good functorial properties with respect to graph-
theoretical operations; for example, the homologies of the Cartesian product
of digraphs (as well as of the join) satisfy the Künneth formula.
The path homology theory is dual to the cohomology theory of digraphs. The
latter was introduced independently by A. Dimakis and F. Müller-Hoissen [13],
using the classification of Bourbaki [7] of exterior derivations on algebras.
One of the most essential and technically difficult results of our work is the Künneth
formula for products. For two digraphs X and Y , denote by X Y their Cartesian
product, that is, the product based on the pattern .
.X Y / Š .X / ˝ .Y /;
Theorem 3.3.3. For any two finite digraphs X , Y and for any integer r > 1, we
have M
e r .X Y / Š
e p .X / ˝
e q .Y / :
fp;q> 1WpCqDr 1g
References
[1] E. Babson, H. Barcelo, M. de Longueville and R. Laubenbacher, Homotopy theory of graphs,
J. Algebr. Comb. 24 (2006), 31–44.
[2] M.T. Barlow, Diffusions on Fractals, Springer, Berlin, 1998.
[3] M.T. Barlow, A. Grigor’yan and T. Kumagai, On the equivalence of parabolic Harnack in-
equalities and heat kernel estimates, J. Math. Soc. Japan 64 (2012), 1091–1146.
[4] A. Bendikov, A. Grigor’yan and Ch. Pittet, On a class of Markov semigroups on discrete
ultra-metric spaces, Potential Analysis 37 (2012), 125–169.
[5] A. Bendikov, A. Grigor’yan, Ch. Pittet and W. Woess, Isotropic Markov semigroups on ultra-
metric spaces, Russian Math. Surveys 69 (2014), 589–680.
[6] I. Benjamini, I. Chavel and E.A. Feldman, Heat kernel lower bounds on Riemannian mani-
folds using the old ideas of Nash, Proc. London Math. Soc. 72 (1996), 215–240.
[7] N. Bourbaki, Elements of Mathematics. Algebra I. Chapters 1-3, Herman, Paris, 1989.
[8] I. Chavel, Eigenvalues in Riemannian Geometry, Academic Press, New York, 1984.
[9] Z.-Q. Chen and T. Kumagai, Heat kernel estimates for stable-like processes on d -sets,
Stochastic Process. Appl. 108 (2003), 27–62.
[10] S.Y. Cheng and S.-T. Yau, Differential equations on riemannian manifolds and their geometric
applications, Commun. Pure Appl. Math. 28 (1975), 333–354.
[11] W. Cwikel, Weak type estimates for singuar values and the number of bound states of
Schrödinger operators, Ann. of Math. 106 (1977), 93–100.
72 A. Grigor’yan
[12] E.B. Davies, Heat Kernels and Spectral Theory, Cambridge University Press, Cambridge,
1989.
[13] A. Dimakis and F. Müller-Hoissen, Discrete differential calculus: graphs, topologies, and
gauge theory, J. Math. Phys. 35 (1994), 6703–6735.
[14] M. Fukushima, Y. Oshima, and M. Takeda, Dirichlet forms and symmetric Markov processes,
de Gruyter, Berlin, 1994.
[15] B. Gidas and J. Spruck, Global and local behavior of positive solutions of nonlinear elliptic
equations, Commun. Pure Appl. Math. 34 (1981), 525–598.
[16] A. Grigor’yan, Analytic and geometric background of recurrence and non-explosion of the
Brownian motion on Riemannian manifolds, Bull. Amer. Math. Soc. 36 (1999), 135–249.
[17] A. Grigor’yan, Heat kernels on weighted manifolds and applications, Contemp. Math. 398
(2006), 93–191.
[18] A. Grigor’yan and W. Hansen, Lower estimates for a perturbed Green function, J. d’Analyse
Math. 104 (2008), 25–58.
[19] A. Grigor’yan and E.P. Hsu, Volume growth and escape rate of Brownian motion on a Cartan-
Hadamard manifold, Int. Math. Ser. (N. Y.) 9 (2009), 209–225.
[20] A. Grigor’yan and J. Hu, Off-diagonal upper estimates for the heat kernel of the Dirichlet
forms on metric spaces, Invent. Math. 174 (2008), 81–126.
[21] A. Grigor’yan and J. Hu, Heat kernels and Green functions on metric measure spaces, Canad.
J. Math 66 (2014), 641–699.
[22] A. Grigor’yan and J. Hu, Upper bounds of heat kernels on doubling spaces, Moscow Math. J.
14 (2014), 505–563.
[23] A. Grigor’yan, J. Hu and K.-S. Lau, Heat kernels on metric-measure spaces and an application
to semi-linear elliptic equations, Trans. Amer. Math. Soc. 355 (2003), 2065–2095.
[24] A. Grigor’yan, J. Hu and K.-S. Lau, Estimates of heat kernels for non-local regular Dirichlet
forms, Trans. Amer. Math. Soc. 366 (2014), 6397–6441.
[25] A. Grigor’yan, J. Hu and K.-S. Lau, Generalized capacity, Harnack inequality and heat kernels
on metric spaces, J. Math. Soc. Japan 67 (2015), 1485–1549.
[26] A. Grigor’yan, X.-P. Huang and J. Masamune, On stochastic completeness of jump processes,
Math. Z. 271 (2012), 1211–1239.
[27] A. Grigor’yan and N. Kajino, Localized heat kernel upper bounds for diffusions via a multiple
Dynkin–Hunt formula, Trans. Amer. Math. Soc. 369 (2017), 1025–1060.
[28] A. Grigor’yan and M. Kelbert, On Hardy-Littlewood inequality for Brownian motion on Rie-
mannian manifolds J. London Math. Soc. 62(2) (2000), 625–639.
[29] A. Grigor’yan and V.A. Kondratiev, On the existence of positive solutions of semi-linear
elliptic inequalities on Riemannian manifolds, Around the research of Vladimir Maz’ya II,
A. Laptev (ed.), Int. Math. Series 12 (2010), 203–218.
[30] A. Grigor’yan and T. Kumagai, On the dichotomy in the heat kernel two sided estimates, Proc.
Symposia in Pure Mathematics 77 (2008), 199–210.
[31] A. Grigor’yan, Y. Lin, Yu. Muranov and S.-T. Yau, Homotopy theory for digraphs, Pure Appl.
Math. Quaterly 10 (2014), 619–674.
[32] A. Grigor’yan, Yu. Muranov and S.-T. Yau, Graphs associated with simplicial complexes,
Homology, Homotopy Appl. 16(1) (2014), 295–311.
Analysis and stochastic processes on metric measure spaces 73
[33] A. Grigor’yan, Yu. Muranov and S.-T. Yau, Cohomology of digraphs and (undirected) graphs,
Asian J. Math. 19 (2015), 887–932.
[34] A. Grigor’yan, Yu. Muranov and S.-T. Yau, On a cohomology of digraphs and Hochschild
cohomology, Homotopy Rel. Structures 11 (2016), 209–230.
[35] A. Grigor’yan and N. Nadirashvili, Negative eigenvalues of two-dimensional Schrödinger
equations, Archive Rat. Mech. Anal. 217 (2015), 975–1028.
[36] A. Grigor’yan, Yu. Netrusov and S.-T. Yau, Eigenvalues of elliptic operators and geometric
applications, Surveys Diff. Geom. IX (2004), 147–218.
[37] A. Grigor’yan, S.-X. Ouyang and M. Röckner, Heat kernel estimates for an operator with a
singular drift and isoperimetric inequalities, J. Reine Angew. Math. (Crelle) 736 (2018), 1–31.
[38] A. Grigor’yan and L. Saloff-Coste, Dirichlet heat kernel in the exterior of a compact set,
Commun. Pure Appl. Math. 55 (2002), 93–133.
[39] A. Grigor’yan and L. Saloff-Coste, Hitting probabilities for brownian motion on riemannian
manifolds, J. Math. Pures et Appl. 81 (2002), 115–142.
[40] A. Grigor’yan and L. Saloff-Coste, Stability results for Harnack inequalities, Ann. Inst.
Fourier, Grenoble 55 (2005), 825–890.
[41] A. Grigor’yan and L. Saloff-Coste, Heat kernel on manifolds with ends, Ann. Inst. Fourier,
Grenoble 59 (2009), 1917–1997.
[42] A. Grigor’yan and L. Saloff-Coste, Surgery of the Faber-Krahn inequality and applications to
heat kernel bounds, Nonlinear Analysis 131 (2016), 243–272.
[43] A. Grigor’yan and Y. Sun, On non-negative solutions of the inequality u C u 6 0 on
Riemannian manifolds, Commun. Pure Appl. Math. 67 (2014), 1336–1352.
[44] A. Grigor’yan and A. Telcs, Two-sided estimates of heat kernels on metric measure spaces,
Ann. Prob. 40 (2012), 1212–1284.
[45] P. Li and S.-T. Yau, On the parabolic kernel of the Schrödinger operator, Acta Math. 156
(1986), 153–201.
[46] E.H. Lieb, Bounds on the eigenvalues of the Laplace and Schrödinger operators, Bull. Amer.
Math. Soc. 82 (1976), 751–753.
[47] E. Mitidieri and S.I. Pokhozhaev, Apriori estimates and the absence of solutions of nonlinear
partial differential equations and inequalities, Proc. Steklov Inst. Math. 234 (2001), 1–362.
[48] G.V. Rozenblum, The distribution of the discrete spectrum for singular differential operators,
Dokl. Akad. Nauk SSSR 202 (1972), 1012–1015.
[49] E. Shargorodsky, On negative eigenvalues of two-dimensional Schrödinger operators, Proc.
London Math. Soc. 108 (2014), 441–483.
[50] M. Solomyak, Piecewise-polynomial approximation of functions from H ` ..0; 1/d /; 2` D d ,
and applications to the spectral theory of the Schrödinger operator, Israel J. Math. 86 (1994),
253–275.
[51] N.Th. Varopoulos, L. Saloff-Coste and T. Coulhon, Analysis and Geometry on Groups, Cam-
bridge University Press, Cambridge, 1992.
[52] V.S. Vladimirov, Generalized functions over the field of p-adic numbers, Uspekhi Mat. Nauk
43 (1988), 17–53, 239.
[53] V.S. Vladimirov and I.V. Volovich, p-adic Schrödinger-type equation, Lett. Math. Phys. 18
(1989), 43–53.
Chapter 4
Markov evolutions in spatial ecology:
From microscopic dynamics to kinetics
Yu. Kondratiev, O. Kutovyi and P. Tkachov
4.1 Introduction
Dynamics of interacting particle systems appear in several areas of complex systems
theory. In particular, we observe a growing activity in the study of Markov dynamics
for continuous systems. The latter fact is motivated, in particular, by modern prob-
lems of mathematical physics, ecology, mathematical biology, and genetics; compare
[19, 15, 16, 36] and the literature cited therein. Moreover, Markov dynamics are
used for the construction of social, economic and demographic models. Notice that
Markov processes for continuous systems are considered in the stochastic analysis as
dynamical point processes [31, 28, 27], and they even appear in the representation
theory of big groups [6, 7].
A mathematical formalisation of the problem may be described as follows. As a
phase space of the system, we use the space .Rd / of locally finite configurations
in the Euclidean space Rd . A heuristic Markov generator which describes the con-
sidered model is given by its expression on a proper set of functions (observables)
over .Rd /. With this operator, we can relate two evolution equations. Namely, the
backward Kolmogorov equation for observables and the Kolmogorov forward equa-
tion on probability measures on the phase space .Rd / (macroscopic states of the
system). The latter equation is known as the Fokker–Planck equation in the termi-
nology of mathematical physics. Comparing with the usual situation in stochastic
analysis, there is an essential technical difficulty: the corresponding Markov process
in the configuration space may be constructed only in special cases. As a result, a
description of Markov dynamics in terms of random trajectories is absent for most of
the models under consideration.
1 Projects A5, A10
76 Yu. Kondratiev, O. Kutovyi, P. Tkachov
of the solution depends on the asymptotic of the birth kernel and the initial condition,
where either a constant speed of the propagation or an acceleration may be observed.
Under additional assumptions, we also prove existence and uniqueness of travelling
waves.
The structure of this chapter is as follows. Section 4.2 contains a brief summary
of the mathematical description of complex systems. In Section 4.3, we discuss the
general concept of statistical dynamics for Markov evolutions in the continuum and
introduce the necessary mathematical structures. Then, in Section 4.4, this concept
is applied to an important class of Markov dynamics of continuous systems, namely,
to birth-and-death models. Here, general conditions for the existence of a semigroup
evolution in a space of quasi-observables are obtained. Then, we construct evolutions
of correlation functions as dual objects. It is shown how to apply general results to the
study of particular models of statistical dynamics coming from mathematical physics
and ecology. In Section 4.5, we discuss the Vlasov-type scaling for birth-and-death
stochastic dynamics. Finally, in Section 4.6, we study the kinetic (Vlasov) equation
for the birth-and-death BDLP model.
are continuous for any f 2 C0 .Rd /, the set of all continuous functions on Rd with
compact supports. It is worth noting that the vague topology is metrisable in such a
way that becomes a Polish space; see [34] and references therein.
78 Yu. Kondratiev, O. Kutovyi, P. Tkachov
Corresponding to the vague topology, the Borel -algebra B ./ appears as the
smallest -algebra for which all mappings
3 7! Nƒ . / WD j ƒ j 2 N0 WD N [ f0g (4.2.1)
holds
Z X
f .n/ .x1 ; : : : ; xn / d.
/
fx1 ;:::;xn g
Z
1
D f .n/ .x1 ; : : : ; xn /k.n/ .x1 ; : : : ; xn / dx1 : : : dxn (4.2.2)
nŠ .Rd /n
We put .0/ .Y / WD f¿g. As a set, .n/ .Y / may be identified with the symmetrisation
of ˚ ˇ
en D .x1 ; : : : ; xn / 2 Y n ˇ x ¤ x if k ¤ ` :
Y k `
Hence, one can introduce the corresponding Borel -algebra, which we denote by
B .n/ .Y / . The space of finite configurations in Y 2 B .Rd / is defined as
G
0 .Y / WD .n/ .Y /:
n2N0
This space is equipped with the topology of the disjoint union. Let B 0 .Y / denote
the corresponding Borel -algebra. In the case of Y D Rd , we will omit the index Y
in the previously defined notations. Namely,
from Example 4.2.1 may be defined as the projective limit of the family of measures
f ƒ gƒ2Bb .Rd / , where ƒ WD e m.ƒ/ is a probability measure on .ƒ/; B ..ƒ//
and m.ƒ/ is the Lebesgue measure of ƒ 2 Bb .Rd /; compare [2] for details.
compare [33, 37, 38]. The summation in (4.2.4) is taken over all finite subconfigu-
rations 2 0 of the (infinite) configuration
2 ; we denote this by the symbol
b
. The mapping K is linear, positivity preserving, and invertible, with
X
.K 1 F /./ WD . 1/jnj F ./; 2 0 : (4.2.5)
By [33], for any G 2 Bbs .0 /, we have KG 2 Fcyl ./, moreover, there exists
C D C.G/ > 0, ƒ D ƒ.G/ 2 Bb .Rd /, and N D N.G/ 2 N such that
N
jKG.
/j 6 C 1 C j
ƒ j ;
2 :
The expression (4.2.4) can be extended to the class of all non-negative measurable G W
0 ! RC , in this case, evidently, KG 2 F0 ./. Note that the left-hand side (l.h.s.)
of (4.2.5) has a meaning for any F 2 F0 ./. Moreover, in this case .KK 1 F /.
/ D
F .
/ for any
2 0 .
For G as above, we may sum up (4.2.2) over n and rewrite the result in a compact
form: Z Z
.KG/.
/d.
/ D G./k ./d./: (4.2.6)
0
As was shown in [33], the equality (4.2.4) may be extended on all functions G such
that the l.h.s. of (4.2.6) is finite. In this case, (4.2.4) holds for -a.a.
2 , and
(4.2.6) holds, too.
Markov evolutions in spatial ecology 81
will not give us much information about properties of the solution to (4.3.3), in par-
ticular, about its moments or correlation functions. To get it, we suppose now that
a solution t 2 M1fm ./ to (4.3.2) exists and remains locally absolutely continuous
with respect to the Poisson measure for all t > 0 provided 0 has such a property.
Then, one can consider the correlation function kt WD kt , t > 0. If we suppose that
X1 Z
1
hhG; kii D G .n/ .x1 ; : : : ; xn /k .n/ .x1 ; : : : ; xn / dx1 : : : dxn :
nD0
nŠ .Rd /n
d ˇ
hhG; kt ii D hhb
LG; kt ii; t > 0; kt ˇt D0 D k0 ; (4.3.7)
dt
for all G 2 Bbs .0 /. Here, the operator
.b
LG/./ WD .K 1
LKG/./; 2 0 ;
is defined point-wise for all G 2 Bbs .0 / under conditions (4.3.4). Consequently, we
are interested in a weak solution to the equation
d ˇ
k D b
L kt ; t > 0; kt ˇt D0 D k0 ; (4.3.8)
dt t
where b
L is the dual operator to b
L with respect to the duality (4.3.6), namely,
Z Z
.b
LG/./k./ d./ D G./.b L k/./ d./: (4.3.9)
0 0
Of course, in general, for a fixed n, any equation from (4.3.10) itself is not closed and
includes functions kt.m/ of other orders m ¤ n. Nevertheless, the system (4.3.10) is a
.n/
closed linear system. The chain evolution equations for kt constitutes the so-called
hierarchy which is an analogue of the BBGKY hierarchy for Hamiltonian systems;
compare [12].
Here, we restrict our attention to the so-called sub-Poissonian correlation func-
tions. Namely, for a given C > 0, we consider the Banach space
˚ ˇ
KC WD k W 0 ! R ˇ k C jj 2 L1 .0 ; d / (4.3.11)
In what follows, we study the initial value problem (4.3.8) using the following
scheme. We solve this equation in the space KC . The well-posedness of the initial
value problem in this case is equivalent with an existence of the strongly continuous
semigroup (C0 -semigroup in the sequel) in the space KC with a generator b L . How-
1 jj
ever, the space KC is isometrically isomorphic to the space L .0 ; C d/ whereas,
by the Lotz theorem [40, 3], in an L1 space any C0 -semigroup is uniformly contin-
uous, that is, it has a bounded generator. Typically, for the operator L, any operator
b
Ln;m , cf. (4.3.10), might be bounded as an operator between two spaces of bounded
symmetric functions of n and m variables, whereas the whole operator b L is un-
bounded in KC .
To avoid these difficulties, we use a trick which goes back to Phillips [45]. The
main idea is to consider the semigroup in the L1 -space not directly but as a dual
semigroup T .t/ to a C0 -semigroup T .t/ with a generator A in the pre-dual L1 -
space. In this case, T .t/ appears as a strongly continuous semigroup not on the
whole L1 , but on the closure of the domain of A only.
84 Yu. Kondratiev, O. Kutovyi, P. Tkachov
In our case, this leads to the following scheme. We consider the pre-dual Banach
space to KC , namely, for C > 0,
LC WD L1 0 ; C jj d : (4.3.13)
Proposition 4.4.1 ([19, Prop. 5]). For any G 2 Bbs .0 /, the following formula holds:
X X
.b
LG/./ D G./ K 1 d.x; [ n x/ . n /
x2
XZ (4.4.2)
1
C G. [ x/ K b.x; [ / . n / dx; 2 0 :
Rd
Proposition 4.4.2 ([19, Cor. 9]). For any k 2 Bbs .0 /, the following formula holds:
XZ
b
.L k/./ D k. [ / K 1 d.x; [ n x/ ./ d./
x2 0
XZ
1
C k. [ . n x// K b.x; [ n x/ ./ d./;
x2 0
where b
L k is defined by (4.3.9).
86 Yu. Kondratiev, O. Kutovyi, P. Tkachov
Theorem 4.4.3 ([17, Thm. 3.2]). Suppose that there exists a1 > 1, a2 > 0 such that,
for all 2 0 and a.a. x 2 Rd ,
XZ ˇ ˇ
ˇK 1 d .x; [ n x/ˇ ./ C jj d ./ 6 a1 D./; (4.4.5)
0
x2
XZ ˇ ˇ
ˇK 1
b .x; [ n x/ˇ ./ C jj d ./ 6 a2 D./ (4.4.6)
0
x2
and, moreover,
a2 3
a1 C < : (4.4.7)
C 2
Then, .b b .t/ on LC .
L; D/ is the generator of a holomorphic semigroup T
4.4.4 Evolution in the space of correlation functions In this section, we will use
the semigroup T b.t/ acting on the space of quasi-observables for the construction of a
solution to the evolution equation (4.3.8) on the space of correlation functions.
0
We denote dC WD C jj d; and the dual space .LC /0 D L1 .0 ; d C / D
1 0
L .0 ; dC /. As was mentioned before, the space .LC / is isometrically isomorphic
to the Banach space KC considered in (4.3.11)–(4.3.12). The isomorphism is given
by the isometry RC
The space LC is not reflexive, hence T b .t/ is not a C0 -semigroup in the whole
KC . The last semigroup will be weak*-continuous, weak*-differentiable at 0, and
b
L will be a weak*-generator of b T .t/. Therefore, one has an evolution in the space
of correlation functions. In fact, we have a solution to the evolution equation (4.3.8)
in a weak*-sense. This subsection is devoted to the study of a classical solution to
bˇ .t/ of the semigroup T
this equation. The restriction T b .t/ onto its invariant Banach
subspace Dom.b L / (here and below all closures are in the norm of the space KC ) is
ˇ
a strongly continuous semigroup. Moreover, its generator b L will be a part of b L ,
namely, ˇ
ˇ
n ˇ
o
Dom.b L / D k 2 Dom.b L /ˇb L k 2 Dom.b L / (4.4.9)
ˇ
and b
L kDL b k for any k 2 Dom.b ˇ
L /.
bˇ˛ .t/ of the semigroup T
One can consider the restriction T bˇ .t/ onto K˛C . It will
be a strongly continuous semigroup with the generator b ˇ˛
L , which is a restriction of
ˇ
b
L onto K˛C . Namely, cf. 4.4.9,
n ˇ o
ˇ˛ ˇ
Dom.b L / D k 2 K˛C ˇ b L k 2 K˛C ;
ˇ˛
and b
L k DL bˇ k D b
L k for any k 2 K˛C . In the other words, b
ˇ˛
L is a part of
b
L .
Now, we may proceed to the main statement of this section.
Theorem 4.4.4 ([18, Thm. 3.16]). Let (4.4.5), (4.4.6) hold together with the follow-
ing assumptions,
Then, for any k0 2 K˛C , there exists a unique classical solution to (4.3.8) in the
bˇ˛ .t/k0 . Moreover, k0 2 K˛C
space K˛C , and this solution is given by kt D T
implies kt 2 K˛C for t > 0.
88 Yu. Kondratiev, O. Kutovyi, P. Tkachov
Example 4.4.5. (BDLP model) This example describes a generalisation of the model
of plant ecology (see [13] and references therein). Let L be given by (4.4.1) with
X
d.x;
n x/ D m.x/ C ~ .x/ a .x y/; x 2
;
2 ;
y2
nx
X
C C
b.x;
/ D ~ .x/ a .x y/; x 2 Rd n
;
2 ;
y2
Then,
m.x/ 1
d.x; / C C ~ .x/ 6 d.x; / C 6 1C d.x; /;
4Cı 4Cı
C X C m.x/ C
b.x; / C C ~ C .x/ 6 ~ .x/ a .x y/ C < d.x; /:
4 4Cı 4
y2
1 C
Hence, (4.4.5), (4.4.6) hold with a1 D 1 C 4Cı ; a2 D 4
, which fulfills (4.4.7). Next,
under conditions (4.4.12), (4.4.14), we have
Remark 4.4.6. It was shown in [13] that, for the case of constant m; ~ ˙ , a condition
like (4.4.12) is essential. Namely, if m > 0 is arbitrarily small, the operator b
L will
not even be accretive in LC .
We would like to construct some scaling L" , " > 0, of the generator L such
that the following scheme holds. Suppose that we have a semigroup UO " .t/ with the
generator LO " in some LC" , " > 0. Consider the dual semigroup UO " .t/. Let us
choose an initial function of the corresponding Cauchy problem with a singularity
in ". Namely, "jj k0."/ ./ r0 ./, " ! 0, 2 0 for some function r0 which is
independent of ". The scaling L 7! L" should be chosen in such a way that, first of
all, the corresponding semigroup UO " .t/ preserves the order of the singularity,
This mapping is “self-dual” with respect to the duality (4.3.6). Moreover, R" 1 D
R" 1 . Having R" k0 r0 , " ! 0, we need rt R" UO " .t/k0 R" UO " .t/R" 1 r0 ,
."/ ."/
" ! 0. Therefore, we have to show that, for any t > 0, the operator family
R" UO " .t/R" 1 , " > 0, has limiting (in a proper sense) operator U.t/ and
But, heuristically, UO " .t/ D exp ft LO " g and R" UO " .t/R" 1 D exp ftR" LO " R" 1 g. Let
us consider the “renormalised” operator
In fact, we need that there exists an operator LO V such that exp ftR" LO " R" 1 g !
exp ft LO V g DW U.t/ with U.t/ satisfying (4.5.2). Therefore, a heuristic way to pro-
duce a scaling L 7! L" is to demand that
d
lim e .t ; / LO "; ren e .t ; / D 0; 2 0 ;
"!0 dt
provided t satisfies (4.5.1). The point-wise limit of LO "; ren will be a natural candidate
for LO V .
Note that (4.5.3) implies informally that LO "; ren D R" 1 LO " R" . Below we propose
a scheme to give a rigorous meaning to the idea introduced above. We consider, for
90 Yu. Kondratiev, O. Kutovyi, P. Tkachov
a proper scaling L" , the “renormalised” operator L O "; ren and prove that it is a generator
O
of a strongly continuous contraction semigroup U"; ren .t/ in LC . Next, we show that
the formal limit LO V of LO "; ren is a generator of a strongly continuous contraction
semigroup UO V .t/ in LC . Finally, we prove that UO "; ren .t/ ! UO V .t/ strongly in LC .
This implies weak*-convergence of the dual semigroups UO "; ren .t/ to UO V .t/. We also
explain in which sense UO V .t/ satisfies the properties above.
Let us consider, for any " 2 .0I 1, the following scaling of the operator L defined
in Equation (4.4.1):
X
.L" F /.
/ WD d" .x;
n x/ ŒF .
n x/ F .
/ C
x2
Z
1
" b" .x;
/ ŒF .
[ x/ F .
/ dx;
Rd
where d" and b" are defined for each model explicitly (see e.g. Example 4.5.2). We
define the renormalised operator LO ";ren WD R" 1 K 1 L" KR" . Using the same argu-
ments as in the proof of Proposition 4.4.1, we get
X X
.LO ";ren G/./ D G./" jnj K0 1 d" .x; [ n x/ . n /
x2
XZ
jnj
C G. [ x/" K0 1 b" .x; [ / . n / dx:
Rd
Suppose that there exist a1 > 1, a2 > 0 such that, for all 2 0 , for a.a. x 2 Rd ,
and for any " 2 .0I 1,
XZ ˇ ˇ
ˇK 1 d .x; [ n x/ˇ ./ " jj C jj d ./ 6 a D ./ ; (4.5.4)
0 " 1 "
0
x2
XZ ˇ ˇ
ˇK 1
b" .x; [ n x/ˇ ./ " jj
C jj d ./ 6 a2 D" ./ ; (4.5.5)
0
0
x2
a2 3
a1 C < : (4.5.6)
C 2
For all , 2 0 and a.a. x 2 Rd , the following limits exist and coincide:
lim " jj K0 1 d" .x; [ / ./ D lim " jj K0 1 d" .x; / ./ DW DxV ./I
"!0 "!0
(4.5.7)
jj
jj
lim " K0 1 b" .x; [ / ./ D lim " K0 1 b" .x; / ./ DW BxV ./:
"!0 "!0
(4.5.8)
We would like to emphasise that above limits should not depend on . A collection
of examples for such d" , b" can be found in [14].
Now we are able to state a result about convergence in LC .
Markov evolutions in spatial ecology 91
Theorem 4.5.1 ([17, Thm. 4.4]). Let conditions (4.5.4), (4.5.5), and (4.5.6) be satis-
fied. Suppose that convergence in (4.5.7), (4.5.8) holds for all 2 0 as well as in
the sense of LC . Assume also that there exists > 0 such that either
d" .x; / 6 DxV .¿/ or d" .x; / > DxV .¿/
s
is satisfied for all 2 0 and for a.a. x 2 Rd . Then, UO " .t/ ! UO V .t/ in LC
uniformly on finite time intervals.
In comparison with the previous notation, we have changed ~ ˙ into "~ ˙ . Clearly,
conditions (4.4.12), (4.4.14) imply the same inequalities for "~ ˙ . Note also that d"
is decreasing in " ! 0. Therefore, to apply all results of this section to the BDLP-
model, we should prove the convergence (4.5.7), (4.5.8) in LC . Note that
X
" jj K0 1 d" .x; [ / ./ D d" .x; /0jj C 1 .1/ ./ a .x y/
y2
X
jj
! m0 C 1 .1/ ./ a .x y/ DW DxV ./
y2
and, analogously,
X
jj
" K0 1 b" .x; [ / ./ D b" .x; /0jj C 1 .1/ ./ aC .x y/
y2
X
! 1 .1/ ./ aC .x y/ DW BxV ./:
y2
The convergence in LC is now obvious. The kinetic (Vlasov) equation has the fol-
lowing form
d
.x/ D ~ C .aC t /.x/ ~ t .x/.a t /.x/ mt .x/: (4.5.9)
dt t
We study the equation so obtained in the following section.
Here and below, for a function u D u.y; t/, which is (essentially) bounded in y 2 Rd ,
and a function (a kernel) a 2 L1 .Rd /, we denote
Z
.a u/.x; t/ WD a.x y/u.y; t/dy:
Rd
4.6.2 Basic properties The following theorem yields existence and uniqueness of a
solution.
Theorem 4.6.1 ([20, Thm. 2.2]). Let u0 2 E and u0 .x/ > 0, x 2 Rd . Then, for
any T > 0, there exists a unique non-negative solution u to the equation (4.6.1) in E
such that u 2 XT .
Below, j:j D j:jRd denotes the Euclidean norm in Rd , Br .x/ is a closed ball in
R with the centre at x 2 Rd and radius r > 0; and br is the volume of this ball.
d
Theorem 4.6.2 ([20, Thm. 2.8]). Suppose that there exists r0 > 0 such that
A
and, for some "; A > 0, one has aC .x/ 6 1Cjxjd C"
, for all x 2 Rd . Then, the
solution u > 0 to (4.6.1), with 0 6 u0 2 Cub .R /, belongs to Cub .Rd /.
d
94 Yu. Kondratiev, O. Kutovyi, P. Tkachov
For E D Cub .Rd /, one can prove a refined version of Theorem 4.6.3 for non-
differentiable (in time) functions. For any T 2 .0; 1, define the set DT of all
functions u W Rd RC ! R, such that, for all t 2 Œ0; T /, u.; t/ 2 Cub .Rd /, and,
for all x 2 Rd , the function f .x; t/ is absolutely continuous in t on Œ0; T /. Then, for
any u 2 DT , one can define the function (4.6.2), for all x 2 Rd and a.a. t 2 Œ0; T /.
Proposition 4.6.4 ([20, Prop. 3.3]). The statement of Theorem 4.6.3 remains true if
we assume that u1 ; u2 2 DT and, for any x 2 Rd , the inequality (4.6.3) holds for
a.a. t 2 .0; T / only.
Definition 4.6.5. For > 0, given by (4.6.4), consider the following sets
Hence E is either U or L .
Proposition 4.6.6 ([20, Prop. 3.4]). Suppose that (A1) and (A2) hold. Let 0 6 u0 2
E be an initial condition to (4.6.1), let u 2 XT be the corresponding solution on
any Œ0; T , T > 0. Then u 2 X1 , with kuk1 6 .
Let v0 2 E be another initial condition to (4.6.1) such that u0 .x/ 6 v0 .x/,
x 2 Rd ; and let v 2 X1 be the corresponding solution. Then,
Let us show that, if u0 6 0, the solutions to (4.6.1) are strictly positive; this is
quite a common feature of linear parabolic equations. However, in general, it may
fail for nonlinear ones. It is required that
there exists ; ı > 0 such that aC .x/ > ; for a.a. x 2 Bı .0/: (A3)
Proposition 4.6.7 ([20, Prop. 3.8]). Let (A1), (A2), (A3) hold. Let u0 2 U , u0 6 0,
u0 6 , be the initial condition to (4.6.1), and let u 2 X1 be the corresponding
solution. Then,
Theorem 4.6.8 ([20, Thm. 3.9]). Let (A1), (A2), (A4) hold. Let u1 ; u2 2 X1 be two
solutions to (4.6.1) such that u1 .x; 0/ 6 u2 .x; 0/, x 2 Rd , are in U . Then, either
u1 .x; t/ D u2 .x; t/, x 2 Rd , t > 0 or u1 .x; t/ < u2 .x; t/, x 2 Rd , t > 0.
96 Yu. Kondratiev, O. Kutovyi, P. Tkachov
Corollary 4.6.9 ([20, Cor. 3.10]). Let (A1), (A2), (A4) hold. Let u0 2 U , u0 6 ,
be the initial condition to (4.6.1), and let u 2 X1 be the corresponding solution.
Then, u.x; t/ < , x 2 Rd , t > 0.
Definition 4.6.10. Let M .R/ denote the set of all decreasing and right-continuous
functions f W R ! Œ0; .
Here and below, the function is said to be the profile for the travelling wave,
whereas c is its speed.
Theorem 4.6.12 ([20, Thm. 4.9]). Let (A1) and (A2) hold and 2 f 1; 1g be fixed.
Suppose also that (A5) holds. Then, there exists c ./ 2 R such that
1. for any c > c ./, there exists a travelling wave solution, in the sense of
Definition 4.6.11, with a profile 2 M .R/ and speed c,
2. for any c < c ./, such a travelling wave does not exist.
Proposition 4.6.13 ([20, Prop. 4.11]). Let 2 M .R/ and c 2 R be such that
Q 1
there exists a solution u 2 X1 to Equation (4.6.1) such that (4.6.5) holds, for some
2 f 1; 1g. Then, 2 C 1 .R ! Œ0; /, for c ¤ 0, and 2 C.R ! Œ0; /,
otherwise.
Corollary 4.6.14 ([20, Cor. 4.12, Prop. 4.13]). In the conditions and notations of
Proposition 4.6.13, is a strictly decaying function, for any speed c, and for any
speed c ¤ 0, the profile is in Cb1 .R/.
There exist r D r./ > 0, D ./ > 0, ı D ı./ > 0, such that
(A7)
aC .x/ > ; for a.a. x 2 Bı .r/.
There exists a critical situation, when the abscissa of the travelling wave coincides
with the abscissa of the kernel aC . In this case, properties of the travelling waves
may be different from the ‘generic’ case. Therefore, we introduce the following two
classes of functions to simplify the further statements.
Definition 4.6.15. Let m > 0, ~ ˙ > 0, 0 6 a 2 L1 .R/ be fixed, and assume that
(A1) holds. For an arbitrary 2 f 1; 1g, we denote by V the the class of all kernels
0 6 aC 2 L1 .R/ such that (A2), (A5)–(A7) and one of the following assumptions
holds:
1. 0 WD supf 2 R W a ./ < 1g D 1;
2. 0 < 1 and a .0 / D 1;
3. 0 < 1, a .0 / < 1 and t .0 / 2 Œ 1; m/, where t ./ is given by
Z
t ./ WD ~ C .1 s/aC .s/es ds 2 Œ 1; ~ C /; 2 Œ0; 0 /:
R
98 Yu. Kondratiev, O. Kutovyi, P. Tkachov
Similarly, we denote by W the class of all kernels such that 0 < 1, a .0 / < 1,
and t .0 / 2 Œm; ~ C/ instead of .1/ .3/.
The following result provides expressions of for the minimal speed of travelling
waves.
Now, we will formulate the uniqueness (up to shifts) of a profile for a travelling
wave with given speed c > c ./, c ¤ 0.
4.6.4 Propagation with a constant speed We will study here the behaviour of
u.tx; t/, where u solves (4.6.1), for big t > 0. The results of Section 4.6.3 together
with the comparison principle imply that if an initial condition u0 .x/ to (4.6.1) has a
minorant/majorant which has the form .x /, 2 f 1; 1g, where 2 M .R/ is a
travelling wave profile in the direction with a speed c > c ./, then, for the corre-
sponding solution u to (4.6.1), the function u.tx; t/ will have the minorant/majorant
.t.x c//, respectively. In particular, if the initial condition is “below” any
travelling wave in a given direction, then one can estimate the corresponding value
of u.tx; t/ (Theorem 4.6.19). Considering such a behaviour in different directions,
one can obtain a (bounded) set outside of which the solution exponentially decays to
0 (Theorem 4.6.20). Inside this set, the solution will uniformly converge to (Theo-
rem 4.6.21).
Here and below, for any measurable A R, we define tA WD ftx j x 2 Ag R,
and
( )
d 1 d ˇ
ˇ x
E .R / WD f 2 L .R / kf k
; WD sup jf .x/je
; <1 :
x2Rd
We now explain how a solution u.x; t/ to (4.6.1) behaves outside the sets ‡t;1 D
. 1; c .1/t/, ‡t; 1 D . c . 1/t; 1/, t > 0.
We are now ready to state a result about the long-time behaviour at infinity in
space. We consider the assumptions
aC 2 L1 .R/: (A8)
Z
There exists d > 0 such that aC .x/ed jxj dx < 1: (A9)
R
Clearly, (A9) implies Z
jxjaC .x/ dx < 1: (4.6.7)
R
100 Yu. Kondratiev, O. Kutovyi, P. Tkachov
Theorem 4.6.20 ([20, Thm. 5.9]). Let the conditions (A1), (A2), (A3), (A8) and
(A9) hold. Let u0 2 E be such that
and let u 2 X1 be the corresponding solution to (4.6.1). Then, for any open set
O ‡1 , there exists D .O/ > 0 such that
t
sup u.x; t/ 6 jjju0 jjje ; t > 0:
x…t O
Our second main result about the long-time behaviour states that the solution u 2
X1 uniformly converges to inside the set t‡1 D ‡t .
For a closed set A R, we denote by int.A/ the interior of A.
Theorem 4.6.21 ([20, Thm. 5.10]). Let the conditions (A1), (A2), (A4), (A8) and
(A9) hold. Let u0 2 U , u0 6 0, and u 2 X1 be the corresponding solution to
(4.6.1). Then, for any compact set C int.‡1/,
All results above about travelling waves and long-time behaviour of the solutions
were obtained under exponential integrability assumptions, cf. (A5) or (A9). In [29],
it was proved, in the case of the local competition (e.g. a D ı0 ), on R with local
nonlinear term, that the case with aC which does not satisfy such conditions leads to
‘accelerating’ solutions, i.e. in this case an equality like (4.6.8) holds for arbitrarily
big compact C R. The detailed analysis of the propagation for the slowly decaying
aC is done in the following section.
We will formulate an analogue of the first statement in [29, Thm. 1].
Theorem 4.6.22 ([20, Thm. 5.21]). Let the conditions (A1), (A2), (A4), (A8) and
(4.6.7) hold. Suppose also (cf. (A9)) that a ./ D 1 for any > 0 and for any
2 f 1; 1g. Let u0 2 E be such that there exist x0 2 R, > 0, r > 0, with the
property u0 > , for a.a. x 2 Br .x0 /. Let u 2 X1 be the corresponding solution to
(4.6.1). Then, for any compact set K R,
4.6.5 Accelerating propagation The main result of this section is Theorem 4.6.25,
where we demonstrate the accelerated propagation of solutions to (4.6.1) in the case
when either of the dispersion kernel or the initial condition has regularly heavy tails
at both ˙1, perhaps different. We show that, in such a case, the propagation to the
right direction is fully determined by the right tails of either the kernel or the initial
condition. Our approach in this section is based, in particular, on the extension of the
theory of sub-exponential distributions, which we introduced earlier in [23].
Markov evolutions in spatial ecology 101
In other words, r.t/ and l.t/ are given through the inverse functions to log b,
namely, for t > ,
1 1
r.t; b/ D log b R .ˇt/; l.t; b/ D log b R .ˇt/: (4.6.10)
C
We are going to find sufficient conditions on a and u0 such that the correspond-
ing solution u to (4.6.1) becomes arbitrarily close to (as t goes to 1) inside the
(expanded) interval . l.t/; r.t// and becomes arbitrarily close to 0 outside of this
interval.
Here, l.t/ D l.t; b/ and r.t/ D r.t; b/, where, cf. (4.6.10),
˚
log b.s/ log max a.x/; u0 .x/ ; x ! 1;
and we suppose that the function b has regularly heavy tails at 1, see Definition 4.6.24
below. In particular, for any small "; ı > 0, we will have that
˚ ˇ
x > 0 ˇ u.x; t/ 2 .ı; ı/ r.t t "/; r.t C t "/
for all t big enough; the corresponding result also holds for negative values of x and
the function l.t/ instead.
Theorem 4.6.25 ([24, Thm. 1]). Let either (A1)–(A4) and (4.6.7) hold or (A10)
hold. Let 0 6 u0 6 , u0 6 0 and u be the corresponding solution to (4.6.1).
Let u0 2 L1 .R/ and functions b; b1 ; b2 W R ! RC have regularly heavy tails at
both ˙1 in the sense of densities, and let the following assumptions hold,
r.t; b/ kt ! 1; l.t; b/ kt ! 1; as t ! 1:
References
[1] F. Achleitner and C. Kuehn, On bounded positive stationary solutions for a nonlocal Fisher–
KPP equation, Nonlinear Analysis: Theory, Methods & Applications 112 (2015), 15–29.
[2] S. Albeverio, Y. Kondratiev and M. Röckner, Analysis and geometry on configuration spaces,
J. Funct. Anal. 154 (1998), 444–500.
[3] W. Arendt, A. Grabosch, G. Greiner, U. Groh, H.P. Lotz, U. Moustakas, R. Nagel, F. Neubran-
der and U. Schlotterbeck, One-Parameter Semigroups of Positive Operators, Springer, Berlin,
1986.
[4] H. Berestycki, G. Nadin, B. Perthame and L. Ryzhik, The non-local Fisher-KPP equation:
travelling waves and steady states, Nonlinearity 22 (2009), 2813–2844.
[5] B. Bolker and S.W. Pacala, Using moment equations to understand stochastically driven spa-
tial pattern formation in ecological systems, Theor. Popul. Biol. 52 (1997), 179–197.
[6] A. Borodin and G. Olshanski, Harmonic analysis on the infinite-dimensional unitary group
and determinantal point processes, Ann. of Math. 161 (2005), 1319–1422.
[7] A. Borodin, G. Olshanski and E. Strahov, Giambelli compatible point processes, Adv. Appl.
Math. 37 (2006), 209–248.
[8] M. Bramson, Convergence of solutions of the Kolmogorov equation to travelling waves, Mem.
Amer. Math. Soc. 44 (1983).
[9] W. Braun and K. Hepp, The Vlasov dynamics and its fluctuations in the 1=N limit of inter-
acting classical particles, Commun. Math. Phys. 56 (1977), 101–113.
104 Yu. Kondratiev, O. Kutovyi, P. Tkachov
[10] J. Coville, J. Dávila and S. Martínez, Nonlocal anisotropic dispersal with monostable nonlin-
earity, J. Diff. Equ. 244 (2008), 3080–3118.
[11] R.L. Dobrushin, Vlasov equations, Functional Anal. Appl. 13 (1979), 115–123.
[12] R.L. Dobrushin, Y.G. Sinai and Y.M. Sukhov, Dynamical systems of statistical mechanics,
in: Dynamical Systems, volume II of Encyclopaedia Mathematical Sciences, (Y.G. Sinai, ed.),
Springer, Berlin (1989), pp. 208–254.
[13] D. Finkelshtein, Y. Kondratiev and O. Kutoviy, Individual based model with competition in
spatial ecology, SIAM J. Math. Anal. 41 (2009), 297–317.
[14] D. Finkelshtein, Y. Kondratiev and O. Kutoviy, Vlasov scaling for stochastic dynamics of
continuous systems, J. Stat. Phys. 141 (2010), 158–178.
[15] D. Finkelshtein, Y. Kondratiev and O. Kutoviy, Vlasov scaling for the Glauber dynamics
in continuum, Infin. Dim. Anal. Quantum Probab. Relat. Top. 14 (2011), 537–569.
[16] D. Finkelshtein, Y. Kondratiev and O. Kutoviy, An operator approach to Vlasov scaling for
some models of spatial ecology, Methods Funct. Anal. Top. 19 (2013), 108–126.
[17] D. Finkelshtein, Y. Kondratiev and O. Kutoviy, Semigroup approach to birth-and-death
stochastic dynamics in continuum, J. Funct. Anal. 262 (2012), 1274–1308.
[18] D. Finkelshtein, Y. Kondratiev and O. Kutoviy, Statistical dynamics of continuous systems:
perturbative and approximative approaches, Arab. J. Math. 4 (2015), 255–300.
[19] D. Finkelshtein, Y. Kondratiev and M. J. Oliveira, Markov evolutions and hierarchical equa-
tions in the continuum. I. One-component systems, J. Evol. Equ. 9 (2009), 197–233.
[20] D. Finkelshtein, Y. Kondratiev and P. Tkachov, Traveling waves and long-time behavior in a
doubly nonlocal Fisher–KPP equation, preprint; arXiv:1508.02215.
[21] D. Finkelshtein, Y. Kondratiev and P. Tkachov, Accelerated front propagation for monostable
equations with nonlocal diffusion, preprint; arXiv:1611.09329.
[22] D. Finkelshtein and P. Tkachov, The hair-trigger effect for a class of nonlocal nonlinear equa-
tions, preprint; arXiv:1702.08076.
[23] D. Finkelshtein and P. Tkachov, Kesten’s bound for sub-exponential densities on the real line
and its multi-dimensional analogues, preprint; arXiv:1704.05829.
[24] D. Finkelshtein and P. Tkachov, Accelerated nonlocal nonsymmetric dispersion for mono-
stable equations on the real line, preprint; arXiv:1706.09647.
[25] R. Fisher, The advance of advantageous genes, Ann. Eugenics 7 (1937), 335–369.
[26] N. Fournier and S. Méléard, A microscopic probabilistic description of a locally regulated
population and macroscopic approximations, Annals Appl. Probab. 14 (2004), 1880–1919.
[27] N.L. Garcia and T.G. Kurtz, Spatial birth and death processes as solutions of stochastic equa-
tions, ALEA Lat. Am. J. Probab. Math. Stat. 1 (2006), 281–303.
[28] N.L. Garcia and T.G. Kurtz, Spatial point processes and the projection method, Progress
Probab. 60 (2008), 271–298.
[29] J. Garnier, Accelerating solutions in integro-differential equations, SIAM J. Math. Anal. 43
(2011), 1955–1974.
[30] F. Hamel and L. Ryzhik, On the nonlocal Fisher–KPP equation: steady states, spreading speed
and global bounds, Nonlinearity 27 (2014), 2735–2753.
Markov evolutions in spatial ecology 105
[31] R.A. Holley and D.W. Stroock, Nearest neighbor birth and death processes on the real line,
Acta Math. 140 (1978), 103–154.
[32] A.N. Kolmogorov, I.G. Petrovsky and N.S. Piskunov, Étude de l’équation de la diffusion avec
croissance de la quantité de matiere et son application a un probleme biologique, Bull. Univ.
État Moscou Sér. Inter. A 1 (1937), 1–26.
[33] Y. Kondratiev and T. Kuna, Harmonic analysis on configuration space. I. General theory, Infin.
Dim. Anal. Quantum Probab. Relat. Top. 5 (2002), 201–233.
[34] Y. Kondratiev and O. Kutoviy, On the metrical properties of the configuration space, Math.
Nachr. 279 (2006), 774–783.
[35] Y. Kondratiev, O. Kutoviy and R. Minlos, On non-equilibrium stochastic dynamics for inter-
acting particle systems in continuum, J. Funct. Anal. 255 (2008), 200–227.
[36] Y. Kondratiev, O. Kutoviy and S. Pirogov, Correlation functions and invariant measures in
continuous contact model, Infin. Dim. Anal. Quantum Probab. Relat. Top. 11 (2008), 231–
258.
[37] A. Lenard, States of classical statistical mechanical systems of infinitely many particles. I,
Arch. Rat. Mech. Anal. 59 (1975), 219–239.
[38] A. Lenard, States of classical statistical mechanical systems of infinitely many particles. II.
Characterization of correlation measures, Arch. Rat. Mech. Anal. 59 (1975), 241–256.
[39] T.M. Liggett, Interacting Particle Systems, Springer, New York, 1985.
[40] H.P. Lotz, Uniform convergence of operators on L1 and similar spaces, Math. Z. 190 (1985),
207–220.
[41] D. Mollison, Possible velocities for a simple epidemic, Adv. Appl. Probab. 4 (1972), 233–257.
[42] D. Mollison, The rate of spatial propagation of simple epidemics, in: Proceedings of the Sixth
Berkeley Symposium on Mathematical Statistics and Probability, Vol. III: Probability theory,
(L.M. Le Cam, J. Neyman, and E.L. Scott, eds.), Univ. California Press, Berkeley, CA (1972),
pp. 579–614.
[43] G. Nadin, B. Perthame and M. Tang, Can a traveling wave connect two unstable states? The
case of the nonlocal Fisher equation, C. R. Math. Acad. Sci. Paris 349 (2011), 553–557.
[44] M.D. Penrose, Existence and spatial limit theorems for lattice and continuum particle systems,
Probab. Surveys 5 (2008), 1–36.
[45] R.S. Phillips, The adjoint semi-group, Pacific J. Math. 5 (1955), 269–283.
[46] H. Spohn, Kinetic equations from Hamiltonian dynamics: Markovian limits, Rev. Mod. Phys.
52 (1980), 569–615.
[47] Y.-J. Sun, W.-T. Li and Z.-C. Wang, Traveling waves for a nonlocal anisotropic dispersal
equation with monostable nonlinearity, Nonlinear Anal. 74 (2011), 814–826.
[48] J. Terra and N. Wolanski, Asymptotic behavior for a nonlocal diffusion equation with absorp-
tion and nonintegrable initial data. The supercritical case, Proc. Amer. Math. Soc. 139 (2011),
1421–1432.
[49] J. Terra and N. Wolanski, Large time behavior for a nonlocal diffusion equation with absorp-
tion and bounded initial data, Discr. Contin. Dyn. Syst. 31 (2011), 581–605.
[50] H. Yagisita, Existence and nonexistence of traveling waves for a nonlocal monostable equa-
tion, Publ. Res. Inst. Math. Sci. 45 (2009), 925–953.
Chapter 5
Metastability in randomly perturbed dynamical
systems: Beyond large-deviation theory
B. Gentz
x x
- - - - - -
x x
- - - - - -
Figure 5.1.1. (a) Asymmetric double-well potential U as an example and the corresponding
invariant density q" for (b) " D 1=2, (c) " D 1=4, (d) " D 1=10.
transition probability densities .x; t/ 7! p" .x; yI t/ satisfy Kolmogorov’s forward (or
Fokker–Planck) equation
@
p" .x; yI t/ D L" p" .x; yI t/ D rx rx U.x/p" .x; yI t/ C "x p" .x; yI t/ :
@t
If xt" .!/ t admits an invariant density q" , then L" q" D 0. For gradient systems with
confining potential, q" exists and the invariant measure (or equilibrium distribution)
is given by
1
" .dx/ D q" .x/ dx D e U.x/=" dx ; (5.1.3)
Z"
with normalisation Z
Z" D e U.x/=" dx :
Rd
As a consequence, for small noise intensities ", the invariant measure " concentrates
in the minima of the potential U , cf. Figure 5.1.1.
At this point, we need to ask how long it actually takes until xt" t is well de-
scribed by its invariant distribution (5.1.3). To illustrate the importance of the differ-
ent timescales present in the system (5.1.2), consider
a one-dimensional set-up with
a double-well potential U , and assume that xt" t starts at the bottom of the left-hand
well, i.e., x0" D x ? , using the notation from Figure 5.1.2. First, in a time of order
Trelax D 1=U 00 .x ? /, where U 00 .x ? / is the curvature of the potential U at the bottom
Metastability in randomly perturbed dynamical systems: Beyond large-deviation theory 109
x0?
H
x?
?
xC
Figure 5.1.2. Example of a double-well potential with local minima at x ? and xC ? and a saddle
at x0? . The dashed line indicates the barrier height H which needs to be surmounted in a transition
from the left-hand well to the right-hand well.
of its left-hand well, xt" t approaches a Gaussian distribution, centred in x ? . From
large-deviation theory we know that, with overwhelming probability, sample paths
will remain inside the left-hand well for all times significantly shorter than Kramers’
time TKramers D eH=" , where H D U.x0? / U.x ? / is the barrier height the stochastic
process xt" t has to surmount to reach the other well. Thus, for small noise intensities
", the paths will stay in the left-hand well for a long time, and only for t TKramers ,
the distribution of xt" approaches the bimodal stationary density q" . The transition
from one well to the other is crucial here—until such a transition occurs, the system
behaves as if the underlying deterministic dynamics, given by (5.1.1), had a unique
stable equilibrium at x ? . The necessary transition to the other well can be viewed as
the exit from a suitably chosen domain, which leads us to the more general question
where and when the solution of an SDE exits from a given domain.
Consider a general SDE
p
dxt" D b.xt" / dt C 2"g.xt" / dWt ; x0" D x0 2 Rd ; (5.1.4)
where we will assume that b und g satisfy the usual local Lipschitz and bounded-
growth conditions. For the sake of brevity of the presentation, let us also assume that
1
a.x/ D g.x/g.x/T > M Id for a constant M > 0, i.e., uniform ellipticity for the
infinitesimal generator A" ,
d
X @2
A" v.x/ D " aij .x/ v.x/ C h b.x/; rv.x/i ;
@xi @xj
i;j D1
"
of the diffusion xt t . Note that we do not assume that b derives from a potential.
For a bounded domain D with smooth boundary and initial condition x0 2 D , define
the first-exit time
D D" D infft > 0W xt" 62 D g :
Thefirst-exit location is then given by x" 2 @D . The following questions arise: Does
xt" t leave D ? If so: What do we know about the expected first-exit time or, more
precisely, the distribution of the first-exit time , and the first-exit location x" ?
110 B. Gentz
If the solution xtdet t of the corresponding deterministic ODE xP tdet D b.xtdet /
leaves D in finite time, xt" t will do the same, because, in finite time and for small
noise intensity, the deviation xt" xtdet will remain small. If xtdet does not leave D ,
i.e., if D is positively invariant under the deterministic flow, we want to study the
problem of diffusion exit, i.e., an exit which is noise induced and would not occur in
the absence of noise. Expected first-exit times and the distribution of first-exit loca-
tions can be characterised by means of partial differential equations (PDEs). Below,
we indicate the initial condition x0 for the SDE
(5.1.4) by writing P x0 and Ex0 for
probabilities and expectations relating to xt" t , respectively.
In the small-noise limit " ! 0, the exit location concentrates in ˛1 if ˛1 is the lower
of the two boundary points of D , i.e., if U.˛1 / < U.˛2 /, and in ˛2 if U.˛2 / < U.˛1 /.
If U.˛1 / D U.˛2 /, the geometry of the well starts playing a role as we can see from
1 1 1
lim P x fx"" D ˛1 g D C : (5.1.7)
"!0 D jU 0 .˛1 /j jU 0 .˛1 /j jU 0 .˛2 /j
Metastability in randomly perturbed dynamical systems: Beyond large-deviation theory 111
The noise-induced exit is more easily achieved on the side on which the wall of the
well is less steep. Later on, we will encounter other phenomena in which the geome-
try of wells also becomes important.
In general, the PDEs (5.1.5) and (5.1.6) cannot be solved explicitly, and a large-
deviations approach, developed by Wentzell and Freidlin, can be used to obtain the
exponential asymptotics of first-exit times and concentration results for first-exit lo-
cations. More subtle details as in (5.1.7) can typically not be obtained by Wentzell–
Freidlin theory, and we will content ourselves with a short review of the basic results
before we return to questions which go beyond this theory.
The key ingredient to all large-deviation results is the rate function or action func-
tional
8̂ Z T
2
ˆ 1
1=2
<
a.'s / Œ P
' s b.' s
ds ; for '
/ x0 2 H1 ,
I.'/ D IŒ0;T ;x0 .'/ D 2 0
ˆ
:̂ C 1 ; otherwise ,
where H1 denotes the space of functions ' W Œ0; T ! Rd , '.0/ D 0, which possess
an L2 -derivative. Recall that we introduced the notation a.x/ D g.x/g.x/T . The
large-deviation principle (LDP) shows that, in the small-noise limit, the probability
of a solution of the SDE (5.1.4) being close to a given path ' W Œ0; T ! Rd behaves
like expf I.'/=2"g. More precisely, the LDP states that, for any set of paths on
Œ0; T ,
infı I 6 lim inf 2" log Pf.xt" /t 2 g 6 lim sup 2" log Pf.xt" /t 2 g 6 inf I ;
"!0 "!0
It can be interpreted as the cost to reach z against the deterministic flow, and
V D inffV .x ? ; z/W z 2 @D g
reflects the cost of leaving D . With the help of the quasipotential, the key result of
the Wentzell–Freidlin theory can be expressed as follows.
112 B. Gentz
At first glance, it might be surprising that these results do not depend on the initial
condition x0 . This is a question of timescales. The deterministic dynamics brings
the process to a small neighbourhood of the unique stable equilibrium point in a time
of order one, while the random perturbation enables an exit only on an exponentially
long timescale. Thus, the first-exit time is long compared to the relaxation time, so
that the initial condition is “forgotten” long before an exit becomes likely.
In the gradient case with isotropic noise, i.e., for b D rU and g D IdRd ,
cf. (5.1.2), the variational principle defining the quasipotential with respect to a
unique, asymptotically stable equilibrium point x ? can be solved. We find that
V .x ? ; z/ D 2ŒU.z/ U.x ? / for all z 2 D such that U.z/ 6 inffU.z/W Q zQ 2 @D g. This
implies that the cost for leaving D is given by the minimal height to be surmounted,
i.e., V D infz2@D V .x ? ; z/ D 2ŒU.z ? / U.x ? /. Here z ? denotes the “lowest point
on @D ” in the sense that U.z/ is minimal for z D z ? . We also know that the infi-
mum is attained for paths going against the deterministic flow, i.e., for .'t /t satisfying
'Pt D CrU.'t /, cf. [23, Chapter 4].
In the gradient case with isotropic noise, transitions between wells are described
by the Arrhenius Law [1]. Using again the notations from Figure 5.1.2, we define the
first-hitting time
" ?
? D infft > 0W x 2 Bı .x /g
xC t C
? ?
of a small ball Bı .xC / around the local minimum xC . The Arrhenius Law states
that, in the small-noise limit, the expected first-hitting time Ex ? xC? behaves like
˚ ?
Ex ? xC? ' const exp ŒU.x /
0 U.x /=" , provided the process starts in x ? . This
?
where we once more recall the notations from Figure 5.1.2. Here 1 .x0? / denotes the
unique negative eigenvalue of r 2 U at the saddle x0? . Note that the geometry of the
potential well and saddle only affects the subexponential prefactor. The prefactor is
larger if (i) the saddle is flatter in the direction associated with the unique negative
eigenvalue, if (ii) the confining directions of the saddle are steeper, or (iii) if the well
in which the dynamics starts is shallower.
The first mathematically rigorous proof of (5.2.2) in finite dimension, includ-
ing a generalisation to more than two wells, is due to Bovier, Eckhoff, Gayrard and
Klein [15, 16]. It is based on potential theory. Finally, a full asymptotic expansion of
the prefactor in powers of " was proved in [26, 27], using analytical methods.
Obviously, Kramers’ Law in the form (5.2.1) or (5.2.2), respectively, can only
hold if none of the eigenvalues of r 2 U.x0? / and r 2 U.x ? / vanishes since it would
predict a vanishing or an infinite transition time. Vanishing eigenvalues correspond to
non-quadratic saddles or wells. One might wonder why this non-generic situation is
of interest. The immediate answer lies in the study of parameter-dependent systems.
As a parameter varies, a parameter-dependent system may undergo bifurcations, and
a quadratic saddle or well may not be quadratic any longer and even cease to exist as
the following simple example illustrates.
Example 5.2.1. Consider two harmonically coupled particles, each with (individ-
ual) dynamics given by the gradient dynamics in a symmetric double-well potential
4 x2
U.x/ D x4 2
, so that the global dynamics is determined by the potential
U
.x1 ; x2 / D U.x1 / C U.x2 / C .x1 x2 / 2 ; (5.2.3)
2
where
> 0 denotes the coupling strength.
A change of variable, corresponding to a rotation by =4, allows to rewrite
the potential as U b
.y1 ; y2 / D 1 y 2 1 2
y 2 C 1 y 4 C 6y 2 y 2 C y 4 , yielding
2 1 2 2 8 1 1 2 2
b
.0; 0/ D 1 2
. This shows that the 1-saddle at .0; 0/, present at large
det r 2 U
coupling strength
, ceases to be a 1-saddle at
D 1=2 and turns into a global max-
imum, while two new saddles are created in a transversal pitchfork bifurcation. If
the coupling strength is further decreased, another bifurcation occurs at
D 1=3. In
114 B. Gentz
3
D 4
2 . 12 ; 1/
D 1
2
D 3
8
2 . 31 ; 21 /
1 1
D 3
D 4
2 .0; 31 /
D0
Figure 5.2.1. Level lines of the global potential landscape (5.2.3). Coupling strength
D 34
corresponds to the strong-coupling regime, exhibiting two global minima and one saddle. At
critical coupling strength
D 21 the saddle at the origin is no longer quadratic. In the regime
2 . 31 ; 21 /, represented by
D 83 , the origin is no longer a 1-saddle, and two new saddles have
formed in a transversal pitchfork bifurcation. Coupling strength
D 31 is critical again, and the
two saddles which formed at
D 12 undergo a longitudinal pitchfork bifurcation, splitting into two
saddles and a well each. The resulting potential landscape for
< 31 is represented by
D 14 .
Finally, in the uncoupled case
D 0, there are four global minima and four 1-saddles.
this longitudinal pitchfork bifurcation, the saddles which were created at
D 1=2
split into two new saddles and a well. As a consequence, the optimal transition paths
for the stochastic dynamics from the global minimum at . 1; 1/, expressed in the
original variables, to the other global minimum at .C1; C1/ are not the same in the
different regimes. For strong coupling, the optimal transition path takes both par-
ticles simultaneously over the unique 1-saddle at the origin to the other well. For
intermediate coupling strength, there are two optimal transition paths of equal cost
(in terms of the large-deviation rate function), crossing one of the two saddles created
at
D 1=2. For weak coupling, there are still two optimal transition paths, but now
they involve crossing a saddle, followed by a well and another saddle. This can be
understood as an Ising-model-like behaviour, with the first particle entering the other
well and, thus, reaching a local minimum, before “pulling” the second particle to the
Metastability in randomly perturbed dynamical systems: Beyond large-deviation theory 115
other well. Figure 5.2.1 shows the level lines of the global potential landscape in
original variables.
In the general case of an arbitrary number N of particles, a sequence of symmetry-
breaking bifurcations is observed. For a detailed analysis of the potential landscape
and the resulting expected transition times between global minima, see [4].
Let us now turn to the general case of non-quadratic saddles. The following result
generalises (5.2.2), cf. [15], to non-quadratic saddles.
Theorem 5.2.2 ([9, Chapter 3]). Let x ? denote a quadratic local minimum of U , and
?
let xC be another local minimum of U . Assume that x0? D 0 is the relevant, i.e.
?
lowest, saddle for passage from x ? to xC . The normal form near the saddle x0? D 0
is then of the form
d
1X
U.y/ D u1 .y1 / C u2 .y2 / C j yj2 C : : :
2
j D3
In the case that the well containing the initial condition x ? is not quadratic, the
first factor, which stems from a Gaussian integral, is replaced by the evaluation of
a non-Gaussian integral. The theorem easily generalises to the case of more than
the two directions y1 ; y2 being non-quadratic, by including those directions in the
integrals on the right-hand side of (5.2.4) and having fewer factors in the product
accordingly, cf. [9, Chapter 3.3]. As a corollary, let us illustrate Theorem 5.2.2 in a
few standard cases.
(c) If the unique unstable direction at the saddle x0? is quartic, while all stable
P
directions are quadratic, i.e., if U.y/ D C4 y14 C 21 djD2 j yj2 C : : : , then
s
.1=4/ .2/1 2 : : : d ŒU.x ? / U.x ? /="
? D
Ex ? xC e 0 Œ1 C O ."1=4 jlog "j5=4 / :
2C41=4 "1=4 det r 2 U.x ? /
(d) If one of the stable directions at the saddle x0? undergoes a pitchfork bifurca-
P
tion, i.e., if U.y/ D 21 j1 jy12 C 12 2 y22 C C4 y24 C 12 djD3 j yj2 C : : : , and
2 changes sign from negative to positive, cf. Example 5.2.1, then
(i) for 2 D 2 ."/ > 0,
s p ? ?
.2 C 2"C4 /3 : : : d eŒU.x0 / U.x /="
? D 2
Ex ? xC p Œ1CR."/ ;
j1 j det r 2 U.x ? / ‰C .2 = 2"C4 /
(5.2.5)
where the function
p 2
‰C .˛/ D ˛.1 C ˛/=8 e˛ =16 K1=4 ˛ 2 =16 ; (5.2.6)
Note that, as opposed to the case of quadratic well and saddle, in all the other
cases the prefactor becomes "-dependent to the leading order. The result (5.2.5) on
the pitchfork bifurcation illuminates how the crossover between the regimes of a zero
eigenvalue and an eigenvalue of order one is realised: When 2 is of smaller order
than "1=2 , a saturation effect is observed, and the dynamics behaves as if the poten-
tial’s curvature were bounded below by .2"C4 /1=2 .
Figure 5.2.2 shows the subexponential prefactor in Corollary
p 5.2.3 (d) as a func-
tion of 2 . More precisely, we show .0; 1/ 3 2 7! 2 C "1=2 =‰C .2 ="1=2 /,
cf. (5.2.5), and the continuation to the interval . 1; 0.
The proof of Theorem 5.2.2 uses the potential-theoretic approach from [15]. It
is based on expressing expected transition times in terms of Newtonian capacities
between sets, which can be estimated by a variational principle involving Dirichlet
forms.
Metastability in randomly perturbed dynamical systems: Beyond large-deviation theory 117
2.0
1.5
1.0
0.5
0.0
-4 -2 0 2 4
Figure 5.2.2. The behaviour of the subexponential prefactor in the expected transition
time Ex ? x ? as a function of the eigenvalue 2 2 R in the case that the stable direction
C
corresponding to y2 undergoes a pitchfork bifurcation at the origin, cf. Corollary 5.2.3 (d). For the
sake of definiteness, we fix the parameter values in such a way that 2C4 D 1 and
3 : : : d D j1 j det r 2 U.x ? /. For 2 < 0, the eigenvalues ˙ ?
j at the two saddles near x0 satisfy
˙ ˙
2 22 and j j for j 6D 2, so that our choice indeed fixes all relevant parameters apart
from the noise intensity ". The curves show the values " D 0:5, " D 0:2, " D 0:1, " D 0:02 and
" D 0:002. The darker shades correspond to successively smaller values of ".
where .Wt /t denotes an N -dimensional standard Brownian motion. This model has
been analysed in great detail in [4, 5]. The deterministic dynamics (corresponding
to " D 0) exhibits a series of symmetry-breaking bifurcations, ranging from the
strong-coupling case in which noise-induced transitions of (5.3.1) between the global
118 B. Gentz
minima x ? D . 1; : : : ; 1/ and xC ?
D .C1; : : : ; C1/ are realised by all particles
switching wells simultaneously to the weak-coupling regime in which an Ising-like
behaviour is observed, with a first particle switching wells, then pulling one of its
neighbours to the other well, and so on, until the transition is complete. The optimal
transition paths and expected transition times have been derived.
Rescaling (5.3.1) appropriately and taking the infinite-particle-number limit N !
1 yields the Allen–Cahn SPDE on a compact interval Œ0; L, i.e.,
p
@t u.x; t/ D x u.x; t/ rx U.u.x; t// dt C 2" dW .t; x/ ; x 2 Œ0; L ; (5.3.2)
with real-valued pu.x; t/, periodic boundary conditions and weak space–time Gaus-
sian white noise 2" dW .t; x/, where W .t; x/ is a cylindrical Wiener process com-
patible with the boundary conditions. The deterministic PDE associated with (5.3.2)
admits several stationary solutions, namely the constant functions u.x; t/ C1,
u.x; t/ 1 and u.x; t/ 0. Out of these, u.x; t/ C1 and u.x; t/ 1 are
stable, and it seems natural to expect an analogue to Kramers’ Law to hold, govern-
ing the expected transition time, say from u.x; t/ 1 to u.x; t/ C1. To keep
the presentation simple, we will focus on the symmetric double-well potential U and
Neumann boundary conditions with zero flux, i.e.,
p
@t u.x; t/ D x u.x; t/ rx U.u.x; t// dt C 2" dW .t; x/ ;
u.; 0/ D './ ; (5.3.3)
@x u.0; t/ D @x u.L; t/ D 0 (Neumann b.c.) :
3u2 which governs the linearisation @t v D AŒuv of the Allen–Cahn equation at the
stationary solutions u. We find:
The uniform stationary states u˙ .x/ ˙1. In this case, AŒu˙ has eigenval-
ues k D .2 C .k=L/2 /, k 2 N0 , and there are no positive eigenvalues.
This implies that u˙ are stable states. Actually, u˙ are global minima of VL .
The uniform state u0 .x/ 0 which has eigenvalues k D .k=L/2 1,
k 2 N0 . Thus, u0 is unstable, and we will need to investigate whether it is a
transition state.
Metastability in randomly perturbed dynamical systems: Beyond large-deviation theory 119
For sufficiently large L, there are two additional stationary states for each k 2
N which satisfies L > k. These additional states are of the form
r kx
2m
uk;˙ .x/ D ˙ sn p C K.m/; m :
mC1 mC1
p
where K.m/ is defined by 2k m C 1K.m/ D L and sn denotes Jacobi’s el-
liptic sine.
Thus, the number of solutions to (5.3.4) depends on the interval length L, and bifur-
cations are observed whenever L is a multiple of . Regarding the transition state,
we observe the following.
For L < , the uniform state u0 .x/ 0 is the unique transition state with
activation energy WL D VL .u0 / VL .u˙ / D L=4. The large-deviation
result [22] shows that the expected transition time from u to a neighbourhood
of uC satisfies
1
Eu uC D eL=4"
0 .L/
1
for some prefactor 0 .L/ . Maier and Stein [32, 33, 37] derive the transition
rate v s
u1 p
u
j0 j t Y k 1 sinh. 2L/
0 .L/ ' D 3=4
2 jk j 2 sin L
kD0
by a formal computation.
For L > , u0 .x/ 0 remains unstable, but no longer forms the transition
state, and u1;˙ .x/ become the new transition states (of instanton shape). Thus
a pitchfork bifurcation occurs as L increases through , and the uniform transi-
tion state u0 bifurcates into pair of instanton states u1;˙ . The activation energy
can be calculated from WL D VL .u1;˙/ VL .u /, and a formal computa-
tion, using Gelfand’s method, allows to compute the spectral determinant in
the prefactor.
The subsequent bifurcations at L D k with k > 2 do not change the transi-
tion state as they have larger activation energy.
At this point, two questions come to mind. Firstly, what happens when L %
and u0 ceases to be a transition state? Secondly, is the formal computation correct in
infinite dimension? In [10], Kramers’ Law has been proved for a class of parabolic
SPDEs with a general double-well potential and periodic or Neumann boundary con-
ditions.
Indeed, a spectral Galerkin approximation enables us to draw on results for finite-
system size. Let us truncate the Fourier series for u to obtain
d
1 2 X 1 X
ud .x; t/ D p y0 .t/ C p yk .t/ cos.kx=L/ D p yQk .t/eikx=L ;
L L kD1 L jkj6d
120 B. Gentz
and also rewrite the potential in Fourier variables, retaining only modes with k 6 d ,
d
1X
VL.d / .y/ D .d /
k yk2 C VL;4 .y/
2
kD0
where
.d / 1 X
VL;4 .y/ D yQk1 yQk2 yQk3 yQk4 :
4L
k1 Ck2 Ck3 Ck4 D0
ki 2f d;:::;0;:::;Cd g
This fact was first observed in [3] for the so-called synchronised regime of the chain
of coupled bistable particles studied in [4, 5], a regime which excludes bifurcation
points. From (5.3.6), we therefore obtain
Theorem 5.3.1 ([10, Theorem 2.5]). For the Allen–Cahn SPDE (5.3.3) with Neu-
mann boundary conditions, the mean transition time between the uniform stable
states u˙ satisfies the following.
Metastability in randomly perturbed dynamical systems: Beyond large-deviation theory 121
ε =0.0003
10
ε =0.001
ε =0.003
5
ε =0.01
0
0.5 1.0 1.5
Figure 5.3.1. The map L 7! 0 .L/, cf. Theorem 5.3.1, for the SPDE (5.3.3) with Neumann
boundary conditions. We show the subexponential prefactor in the expected transition
time Eu uC as a function of the interval length L for different values of the noise intensity ".
Note that we rescaled the interval length L in such a way that the critical interval length L D
coincides with the value 1 on the horizontal axis.
1
For L < , the mean transition time satisfies Eu uC D 0 .L/
eL=4" , and
s p s
1 sinh. 2L/ 1
1
0 .L/ ' 3=4 p ‰C p
2 sin L 1 C 3"=4L 3"=4L
q p
.1=4/
! sinh. 2/ " 1=4 as L % .
2.3 7 /1=4
The function ‰C has been defined in (5.2.6).
For L > , the mean transition time satisfies Eu uC D 01.L/ eWL =" , with
explicitly known activation energy WL and a similar expression for 0 .L/,
involving the eigenvalues at the saddles which are created at the critical inter-
val length L D , see Figure 5.3.1 for the behaviour of 0 .L/ as L increases
through .
This result covers all finite positive values of the interval length L, and thus in-
cludes bifurcation values. For non-bifurcating one-dimensional SPDEs, see also [2].
Figure 5.4.1. Deterministic dynamics in R2 , exhibiting an unstable periodic orbit (broken black
curve) with a stable periodic orbit (solid black curve) in its interior. We show the orbits (grey
curves) of the dynamics for three different initial conditions: (i) inside the stable periodic orbit, (ii)
between the stable and the unstable periodic orbit, but close to the unstable periodic orbit, and (iii)
outside the unstable periodic orbit.
be of gradient type. Assume from now on that d D 2 and that the domain D is such
that @D is this unstable periodic orbit for the deterministic dynamics. Theorem 5.1.3
still applies and implies in particular that, for initial conditions x0 2 D , the expected
first-exit time satisfies Ex0 D" eV =2" in the small noise limit. See Figure 5.4.1
for an illustration of orbits for the deterministic dynamics in the case D contains a
stable periodic orbit … instead of a stable equilibrium point x ? . Theorem 5.1.3 also
generalises to this case. In either case, the quasipotential V .x0 ; / V or V .…; /
V , respectively, is constant on @D . This means that at the level of large deviations we
do not see any preferred exit points
on @D , cf. Theorem 5.1.3(c).
Actually, the solution xt" t of the SDE (5.1.4) in this case behaves quite differ-
ently from the reversible case discussed so far. Recall that in the reversible case, exit
locations concentrate near minima of the quasipotential as " ! 0. In contrast, if @D
is an unstable periodic orbit, the distribution of the exit location x"" on @D does not
D
converge as " ! 0. Instead, the density of x"" is translated along @D proportionally
D
to jlog "j. This surprising phenomenon was first discovered by Day [17, 18], who
named it “cycling”. In the same spirit, Maier and Stein [30] found that in a station-
ary regime,
˚ " obtained
by reinjecting the particle upon exit from D , the rate of escape
d
dt P x t 2
6 D has a jlog "j-periodic prefactor, see also [25].
Passage through an unstable periodic orbit plays an important role in many appli-
cations. For instance, it determines the distribution of noise-induced phase slips in
synchronised oscillators [34]. The first-exit distribution also determines the residence-
time distribution in stochastic resonance [24, 31, 7], and in neuroscience, the spiking
mechanism for the Morris–Lecar model may involve passage through an unstable
periodic orbit [35, 38, 39, 20].
A detailed analysis provides insight into the properties of the density of the first-
passage time. For the sake of brevity, we state an informal version here, valid under
non-degeneracy assumptions excluding symmetries of the system.
Metastability in randomly perturbed dynamical systems: Beyond large-deviation theory 123
Theorem 5.4.1 ([6, 7, 11, informal version]). There exists an explicit parametrisation
of @D s.t. the first-passage time distribution is given by
n . " / o
jlog "j
P D
2 Œt; t C D ftrans .t/C."/QT 2T
t t0 ; (5.4.1)
T
where
T is the period and the Lyapunov exponent of the unstable periodic orbit;
is a “natural” parametrisation of the boundary in the sense that
(i) 0 .t/ is an explicitly known, model-dependent, strictly positive, T -periodic
function, and
(ii) .t/ satisfies .t C T / D .t/ C T ;
QT is a universal T -periodic function;
0 D 1 e H=2" is the principal eigenvalue of the transition kernel of the
Markov chain .Xn /n which is given by a random Poincaré map, recording the
successive positions of the diffusion .xt" /t whenever .xt" /t has completed the
next rotation around the periodic orbit. The value of H is close to I.
1 /, i.e.,
the large-deviation rate function I , evaluated at an optimal transition path
1
which connects the stable to the unstable periodic orbit.
ftrans describes the influence of a deterministic start at a fixed time on the inte-
rior stable periodic orbit and grows from 0 to 1 in time t of order jlog "j;
H=2"
C."/ is the normalising constant which is of order e .
The universal profile y 7! QT .y/ is periodic with period T and given by the
periodicised Gumbel distribution
1
X
QT .y/ D A.T .n y// ; (5.4.2)
nD 1
where n 1 2x o
A.x/ D exp 2x e
2
is the density of a type-1 Gumbel distribution. Figure 5.4.2 shows this universal pro-
file for different choices of T . Observe that this profile determines the concentration
of first-passage times within each period. The larger T , the more pronounced are
the peaks, while for smaller values of T , the peaks overlap more. Thus for larger
values of T , the first-passage time is more concentrated within each period, showing
that there is one preferred time window for exit per period.
Figure 5.4.3 shows the resulting first-passage density over 16 periods, where we
have fixed H while varying T and ". Note the effect of the transitional phase which
suppresses the first peak(s) due to the time needed until exit becomes likely. For
124 B. Gentz
1.0
0.8
0.6
0.4
0.2
1 2 3 4
Figure 5.4.2. The cycling profile y 7! QT .y/, cf. (5.4.2), plotted over four periods. The
parameter values are T D 1:0, T D 1:5, T D 2:0, T D 4:0 and T D 20:0. The darker
shades correspond to successively larger values of T .
Figure 5.4.3. The first-passage density t 7! ftrans .t/C."/QT jlog2T
"j
t t0 , cf. (5.4.1), shown
for 16 periods. Note that the graphs have been rescaled in such a way that the maximum is at the
same height for all panels. We fixed H D 1. Panels (a),(c) and (d) show the effect of varying T
while keeping " D 0:5 fixed, while Panels (a) and (c) illustrate the “cycling” effect by varying " for
fixed T .
Metastability in randomly perturbed dynamical systems: Beyond large-deviation theory 125
larger times, first-passage becomes more unlikely again because the probability den-
sity has already lost mass due to exit having occurred earlier. Comparing the pan-
els (a) and (c), we can see the peaks moving according to the cycling phenomenon.
The proof of Theorem 5.4.1 is based on combining large-deviation results with
properties of random Poincaré maps described by continuous-space discrete-time
Markov chains. Spectral-gap estimates for the kernels of these Markov chains al-
low to estimate the first-passage times relevant for the proof, see [11].
Orbits with a small number of SAOs are less affected by noise than those with
a large number of SAOs;
There is an unexpected saturation effect: The typical value of the stochastic
return map and its spreading become independent of the number of SAO for
large enough SAO numbers.
This saturation effect sets in earlier for larger noise intensities.
The techniques developed in this work will certainly prove helpful in the analysis
of the effect of noise on a variety of related systems in which a local analysis is not
sufficient.
References
[1] S. Arrhenius, Über die Dissociationswärme und den Einfluss der Temperatur auf den Disso-
ciationsgrad der Elektrolyte, Z. Phys. Chem. 4 (1889), 96–116.
[2] F. Barret, Sharp asymptotics of metastable transition times for one dimensional SPDEs, Ann.
Inst. H. Poincaré Probab. Statist. 51 (2015), 129–166.
[3] F. Barret, A. Bovier, and S. Méléard, Uniform estimates for metastable transition times in a
coupled bistable system, Electron. J. Probab. 15 (2010), 323–345.
[4] N. Berglund, B. Fernandez, and B. Gentz, Metastability in interacting nonlinear stochastic
differential equations. I. From weak coupling to synchronization, Nonlinearity 20 (2007),
2551–2581.
[5] N. Berglund, B. Fernandez, and B. Gentz, Metastability in interacting nonlinear stochastic
differential equations. II. Large-N behaviour, Nonlinearity 20 (2007), 2583–2614.
[6] N. Berglund and B. Gentz, On the noise-induced passage through an unstable periodic orbit I:
Two-level model, J. Statist. Phys. 114 (2004), 1577–1618.
[7] N. Berglund and B. Gentz, Universality of first-passage and residence-time distributions in
non-adiabatic stochastic resonance, Europhys. Letters 70 (2005), 1–7.
[8] N. Berglund and B. Gentz, Noise-induced phenomena in slow–fast dynamical systems. A
sample-paths approach, Springer, London, 2006.
[9] N. Berglund and B. Gentz, The Eyring–Kramers law for potentials with nonquadratic saddles,
Markov Process. Related Fields 16 (2010), 549–598.
[10] N. Berglund and B. Gentz, Sharp estimates for metastable lifetimes in parabolic SPDEs:
Kramers’ law and beyond, Electron. J. Probab. 18 (2013), 1–58.
[11] N. Berglund and B. Gentz, On the noise-induced passage through an unstable periodic orbit II:
General case, SIAM J. Math. Anal. 46 (2014), 310–352.
[12] N. Berglund, B. Gentz, and C. Kuehn, Hunting French ducks in a noisy environment, J. Diff.
Eq. 252 (2012), 4786–4841.
[13] N. Berglund, B. Gentz, and C. Kuehn, From random Poincaré maps to stochastic mixed-mode-
oscillation patterns, J. Dynam. Differential Equations 27 (2015), 83–136.
[14] D. Blömker and A. Jentzen, Galerkin approximations for the stochastic Burgers equation,
SIAM J. Numer. Anal. 51 (2013), 694–715.
Metastability in randomly perturbed dynamical systems: Beyond large-deviation theory 127
[15] A. Bovier, M. Eckhoff, V. Gayrard, and M. Klein, Metastability in reversible diffusion pro-
cesses. I.Sharp asymptotics for capacities and exit times, J. Eur. Math. Soc. (JEMS) 6 (2004),
399–424.
[16] A. Bovier, V. Gayrard, and M. Klein, Metastability in reversible diffusion processes. II. Pre-
cise asymptotics for small eigenvalues, J. Eur. Math. Soc. (JEMS) 7 (2005), 69–99.
[17] M. Day, Conditional exits for small noise diffusions with characteristic boundary, Ann.
Probab. 20 (1992), 1385–1419.
[18] M. Day, Cycling and skewing of exit measures for planar systems, Stoch. Stoch. Rep. 48
(1994), 227–247.
[19] M. Desroches, J. Guckenheimer, B. Krauskopf, C. Kuehn, , H.M. Osinga, and M. Wech-
selberger, Mixed-mode oscillations with multiple time scales, SIAM Review 54 (2012), 211–
288.
[20] S. Ditlevsen and P. Greenwood, The Morris–Lecar neuron model embeds a leaky integrate-
and-fire model, J. Math. Biol. 67 (2013), 239–259.
[21] H. Eyring, The activated complex in chemical reactions, Journal of Chemical Physics 3
(1935), 107–115.
[22] W.G. Faris and G. Jona-Lasinio, Large fluctuations for a nonlinear heat equation with noise,
J. Phys. A 15 (1982), 3025–3055.
[23] M.I. Freidlin and A.D. Wentzell, Random Perturbations of Dynamical Systems, 3rd ed.,
Springer, Berlin (2012).
[24] L. Gammaitoni, P. Hänggi, P. Jung, and F. Marchesoni, Stochastic resonance, Rev. Modern
Phys. 70 (1998), 223–287.
[25] S. Getfert and P. Reimann, Suppression of thermally activated escape by heating, Physical
Review E 80 (2009), 030101 R.
[26] B. Helffer, M. Klein, and F. Nier, Quantitative analysis of metastability in reversible diffusion
processes via a Witten complex approach, Mat. Contemp. 26 (2004), 41–85.
[27] B. Helffer and F. Nier, Hypoelliptic Estimates and Spectral Theory for Fokker–Planck Opera-
tors and Witten Laplacians, Springer, Berlin, 2005.
[28] I. Karatzas and S.E. Shreve, Brownian motion and stochastic calculus, 2. ed., Graduate Texts
in Mathematics, Vol. 113, Springer, New York, 1991.
[29] H.A. Kramers, Brownian motion in a field of force and the diffusion model of chemical reac-
tions, Physica 7 (1940), 284–304.
[30] R.S. Maier and D.L. Stein, Oscillatory behavior of the rate of escape through an unstable limit
cycle, Phys. Rev. Lett. 77 (1996), 4860–4863.
[31] R.S. Maier and D.L. Stein, Noise-activated escape from a sloshing potential well, Phys. Rev.
Lett. 86 (2001), 3942–3945.
[32] R.S. Maier and D.L. Stein, Droplet nucleation and domain wall motion in a bounded interval,
Phys. Rev. Lett. 87 (2001), 270601-1.
[33] R.S. Maier and D.L. Stein, The effects of weak spatiotemporal noise on a bistable one-
dimensional system, in: Noise in complex systems and stochastic dynamics, L. Schimanski-
Geier, D, Abbott, A. Neimann, and C. Van den Broeck (eds.), SPIE Proceedings Series,
vol. 5114, 2003, pp. 67–78.
128 B. Gentz
Travelling and rotating waves are ubiquitous phenomena observed in time dependent
PDEs that model the combined effect of dissipation and nonlinear interaction. From
an abstract viewpoint, they appear as relative equilibria of an equivariant evolution
equation. In numerical computations, the freezing method takes advantage of this
structure by splitting the evolution of the PDE into the dynamics on the underlying
Lie group and on some reduced phase space. The approach raises a series of questions
which were answered to a certain extent: linear stability implies nonlinear (asymp-
totic) stability, persistence of stability under discretisation, analysis and computation
of spectral structures, first versus second order evolution systems, well-posedness of
partial differential algebraic equations, spatial decay of wave profiles and truncation
to bounded domains, analytical and numerical treatment of wave interactions, rela-
tion to connecting orbits in dynamical systems. A further numerical problem related
to this topic will be discussed, namely the solution of nonlinear eigenvalue problems
via a contour method.1
a W G ! GL.X /; 7! a. /: (6.1.2)
For the images we use the synonymous notation a.g/u D a.g; u/, g 2 G, u 2 X .
1 Projects A9, B3
130 W.-J. Beyn, D. Otten
We assume that Equation (6.1.1) is equivariant with respect to this group action,
i.e., the vector field F has the following property
F .a.
/u/ D a.
/F .u/; 8
2 G and 8u 2 Z; (6.1.3)
where we have assumed a.
/Z Z for all
2 G.
In Sections 6.2.2 and 6.3, we will deal with several classes of PDEs which fit into
this general setting. All of them are formulated for functions on a Euclidean space Rd
where the action is caused by the special Euclidean group SE.d / acting via rotations
and translations on their arguments or on their values.
Remark 6.1.1. For some applications, even this framework is not sufficient. For
example, travelling fronts which have finite but non-zero limits at infinity do not lie
in any of the usual Lesbesgue or Sobolev spaces, but in an affine space. To cover
such cases, but also more general PDEs on manifolds, one can generalise the whole
approach to Banach manifolds X , where F is a vector field defined on a submanifold
Z of X mapping into the tangent bundle TX , and a takes values in the space of
diffeomorphisms Diff.X; X /. Equivariance (6.1.3) is then expressed as F .a.
; u// D
du Œa.
; u/F .u/, where du Œa.
; u/ W Tu X ! Ta.
;u/ X denotes the tangent map.
For the sake of simplicity, we will not pursue this generalisation here (see [45]).
For this article, it is sufficient to work with the simple notion of a strong solution of
a Cauchy problem instead of dealing with mild solutions in time and weak solutions
in space. We refer to Chapters 2, 4 and 8 for related concepts of a weak solution as
they appear in the analysis of many PDEs, both local and nonlocal.
In the following, we will always assume that a strong solution of (6.1.4) exists
locally, i.e., on some interval Œ0; T / with T > 0, and that it is unique. For applications
to PDEs, it is typical that the group action is differentiable only for smooth functions.
Therefore, we impose the following condition.
When
D 1, the tangent space T1 G may be identified with the Lie algebra g of
G, and we have d
Œa.1/u W g ! X .
Computation and stability of waves 131
6.1.2 Relative equilibria Relative equilibria are special solutions of (6.1.1) which
lie in the group orbit of a single element.
Sometimes (see e.g. [18]), the whole group orbit OG .v? / D fa.g/v? W g 2 Gg is
called a relative equilibrium. However, we include the path t !
? .t/ on the group
as part of our definition since it will be relevant for both the stability analysis and
the numerical computations. The following theorem shows that the path may always
be written as
? .t/ D exp.t? / for some ? 2 g. Recall that exp W g ! G is the
exponential map and that
? .t/ D exp.t? / with t 2 R is the unique solution of the
Cauchy problem
?;t .t/ D dL
? .t / .1/? ;
? .0/ D 1; (6.1.6)
where L
g D
ı g; g 2 G, denotes the multiplication by
from the left. The
vector field on the right-hand side of (6.1.6) is often simply written as
? .t/? , but
in analogy to (6.1.5) we keep the slightly clumsier notation dL
? .t / .1/? for clarity.
Theorem 6.1.5. Under assumption 6.1.3, for every relative equilibrium v? 2 Z with
? 2 C 1 .Œ0; 1/; G/, there exists ? 2 g such that
Given v? and
? ./, the uniqueness of ? will follow from (6.1.8) in the first part
of the theorem if the stabiliser H.v? / of v? is simple, i.e.
But even then, Equation (6.1.7) does not determine the pair .v? ; ? / uniquely, since
relative equilibria always come in families. More precisely, Definition 6.1.4 and the
equivariance (6.1.3) show that any relative equilibrium v? ;
? of (6.1.4) is accompa-
nied by a family .w.g/;
.g; //; g 2 G of relative equilibria given by
1
w.g/ D a.g/v? ;
.g; t/ D g ı
? .t/ ı g ; g 2 G; (6.1.10)
see [17] for related results. This will be important for the stability analysis in Sec-
tion 6.4.
132 W.-J. Beyn, D. Otten
6.1.3 Wave solutions of PDEs Two important classes of semi-linear evolution equa-
tions, which fit into the above setting and to which our results apply, are the following:
ut D Auxx C f .u; ux /; u.x; t/ 2 Rm ; x 2 R; t > 0; u.; 0/ D u0 ; (6.1.11)
ut D Au C f .u/; u.x; t/ 2 Rm ; x 2 Rd ; t > 0; u.; 0/ D u0 : (6.1.12)
In both cases, A 2 Rm;m is assumed to have spectrum .A/ with Re. .A// > 0.
Note that Re. .A// > 0 leads to parabolic systems while .A/ iR occurs for
Hamiltonian PDEs. Intermediate cases with .A/ .f0g [ fRe z > 0g/ generally
belong to hyperbolic or parabolic-hyperbolic mixed systems. The nonlinearities
f W R2m ! Rm in (6.1.11) resp. f W Rm ! Rm in (6.1.12) are assumed to be
sufficiently smooth and to satisfy f .0; 0/ D 0 resp. f .0/ D 0.
For (6.1.11), the Lie group is .G; ı/ D .R; C/ acting on X D L2 .R; Rm / by the
shift Œa.
/u.x/ D u.x
/, x 2 R, u 2 X . With F .u/ D Auxx C f .u; ux / for
u 2 Z D H 2 .R; Rm /, equivariance is easily verified and F .u/ 2 X follows from the
Sobolev embedding H 1 .R; Rm / L1 .R; Rm / and f .0; 0/ D 0. For the derivative,
we find
d
Œa.1/v D vx ; 2 g D R; v 2 H 1 .R; Rm /:
Relative equilibria then turn out to be travelling waves,
u? .x; t/ D v? .x ? t/; x 2 R; t > 0; (6.1.13)
where the pair .v? ; ? / solves the second order system from (6.1.7)
0 D Av?; C ? v?; C f .v? ; v?; /; v./ 2 Rm ; 2 R:
In fact, our simplified abstract approach only covers pulse solutions (defined by
v? ./; v?; ./ ! 0 as ! ˙1), whereas fronts need the setting of manifolds;
see Remark 6.1.1. It is interesting to note that our numerical approach also applies to
travelling waves in evolution equations with nonlocal diffusion. Their existence and
their qualitative properties are analysed in Chapter 4.6.
In the multi-dimensional case (6.1.12), the phase space is X D L2 .Rd ; Rm /, and
we aim at equivariance w.r.t. the special Euclidean group G D SE.d / D SO.d /ËRd .
It is convenient to represent SE.d / in GL.Rd C1 / as
Q b
SE.d / D W Q 2 Rd;d ; Q> Q D Id ; det.Q/ D 1; b 2 Rd ; (6.1.14)
0 1
where the group operation is matrix multiplication. We represent the Lie algebra
se.d / D so.d / Rd accordingly,
S c
se.d / D W S 2 Rd;d ; S > D S; c 2 Rd : (6.1.15)
0 0
The action on functions u 2 X is defined by
Q b
Œa.
/u.x/ D u.Q> .x b//; x 2 Rd ;
D 2 SE.d /:
0 1
Computation and stability of waves 133
1
The derivative exists for functions u 2 HEucl .Rd ; Rm /, where
n o
k
HEucl .Rd ; Rm / D u 2 H k .Rd ; Rm / W LS u 2 L2 .Rd ; Rm / 8S 2 so.d /
2
Setting Z D HEucl .Rd ; Rm / one finds F .u/ D Au C f .u/ 2 X for u 2 Z in
dimension d D 2, since H 2 .R2 ; Rm / L1 .R2 ; Rm / by Sobolev embedding. But
for d > 3 one has to impose growth conditions on f to ensure this. Equivariance fol-
lows from the equivariance of the Laplacian under Euclidean transformations. Special
types of relative equilibria are waves rotating about a centre x? 2 Rd :
u? .x; t/ D v? .exp. tS? /.x x? //; v? 2 Z; S? 2 so.d /: (6.1.16)
When substituting D exp. tS? /.x x? / the system (6.1.7) reads
0 D Av? C v?; S? C f .v? /; 2 Rd :
Several examples of travelling and rotating waves will be dealt with in Section 6.3.
Conversely, one can show that a strong solution u 2 C 1 .Œ0; T /; X / \ C.Œ0; T /; Z/,
2 C.Œ0; T /; g/,
2 C 1 .Œ0; T /; G/ of (6.2.2),(6.2.3) leads to a strong solution
of (6.1.4) via (6.2.1). According to Theorem 6.1.5, a relative equilibrium v? ; ? of
(6.1.1) is a steady state of the first equation, (6.2.2). Following [51], we call Equa-
tion (6.2.3) the reconstruction equation. Due to the extra variables
2 G resp. 2 g,
the system (6.2.2), (6.2.3) is not yet well posed, but needs n D dim.G/ additional
algebraic constraints (called phase conditions) which we write as
.v; / D 0: (6.2.4)
This condition requires the group orbit of v.t/ to be tangent to its time derivative.
Altogether, Equations (6.2.2) and (6.2.4) constitute a partial differential algebraic
equation (PDAE) for the functions v and . The reconstruction equation (6.2.3)
decouples from the PDAE and may be solved in a post-processing step. Condition
(6.2.6) has a unique solution if d
Œa.1/v W g ! X is one to one and then leads to
a PDAE of (differentiation) index 1. Condition (6.2.5) leads to an index 2 problem,
but can be reduced to index 1 by differentiating with respect to t and then inserting
(6.2.2).
for the unknown quantities .v; ;
/. For initial data u0 close to a wave, we expect
v.; t/ ! v? , .t/ ! ? as t ! 1. Travelling waves in parabolic systems and their
stability are analysed in [34, 53, 60, 57], and numerical applications of the freezing
method for this case appear in [9].
Next, consider the parabolic system (6.1.12) in several space dimensions. With
the special Euclidean group (6.1.14) and its Lie algebra (6.1.15), the freezing system
(6.2.2),(6.2.3) takes the form
under the assumptions of Section 6.1.3. Suppose this system admits several travelling
waves .v?;j ; ?;j /, j D 1; : : : ; N , with different speeds ?;j and limit behaviour
lim!˙1 v?;j ./ D vj˙ for j D 1; : : : ; N . If the limits fit together, i.e. if
vjC D vj C1 ; j D 1; : : : ; N 1;
then one often observes N -waves (or multi-waves) of (6.2.10) which look like con-
catenations of the waves v?;j .x ?;j t/; j D 1; : : : ; N , see for example
Figure 6.3.6(c) for such a concatenation of two fronts from Example 6.3.1 travelling at
136 W.-J. Beyn, D. Otten
different speeds to the left. Strong interaction occurs when two or several fronts move
towards each other, while all other cases are called weak interactions. Many more in-
teraction phenomena of this type may be found in [61] and the references therein.
In [9, 55], we extend the freezing method in order to handle such interactions.
More precisely, we generalise (6.2.7) to
N
X
u.x; t/ D vj .x
j .t/; t/; (6.2.11)
j D1
'./ h X
N N
X i
C PN f vk .kj ; t/ f .vk .kj ; t/ C wk / ;
kD1 '.kj / kD1 kD1
0 D vj .; t/ vO j ; vO j; , vj .; 0/ D vj0 ;
L2
j;t D j ,
j .0/ D
j0 :
(6.2.12)
Computation and stability of waves 137
A particular difficulty of this system is that the right-hand side contains non-local
terms vk .kj ; t/ that need special treatment when discretised on bounded intervals
Œx ; xC ; see Section 6.3.5. We also mention that the stability of this approach for
weak interaction is analysed in [9, 55] and that a generalisation of the decompose and
freeze method to the abstract framework of Section 6.2.1 is proposed and applied in
[9, 12, 41]; compare Section 6.3.5.
Figure 6.3.1. QN-front: space-time of u (a), of v (b), profile v as a function of space (c), velocity
as a function of time (d).
138 W.-J. Beyn, D. Otten
domain as before, initial data v0 D u0 , and the template vO D u0 on the time range
Œ0; 3000. The front quickly stabilises at the shape v? shown in Figure 6.3.1(c), and
the velocity quickly reaches its asymptotic value ? 0:07 as shown in 6.3.1(d).
For the numerical solution of (6.1.11) resp. (6.2.8) we used an FEM space discreti-
sation with Lagrange C 0 -elements and maximal element size 4x D 0:3, the BDF
method for time discretisation with maximum order 2, time step-size 4t D 0:3, rel-
ative tolerances 10 2 and 10 3 , and absolute tolerances 10 3 and 10 4 , combined
with Newton’s method for nonlinear equations.
(a) (b)
Figure 6.3.2. Spinning soliton of QCGL: space-time along x2 D 0 (a), translational velocities
.1/ .t/, .2/ .t/ (two lines at the bottom) and angular velocity .3/ .t/ (topline) (b).
Computation and stability of waves 139
Figure 6.3.3. Rotating waves: spinning soliton for d D 2 (a) and d D 3 (b), spiral wave for d D 2
(c), untwisted scroll wave for d D 3 (d).
The spinning solitons of the QCGL from Example 6.3.2 are a special kind of a
localised rotating wave for d D 2; see Figure 6.3.3(a). Their extension to d D 3
dimensions is displayed in Figure 6.3.3(b), and non-localised rotating waves, such
as spiral waves, are shown in Figure 6.3.3(c). Finally, we show a so-called scroll
wave in Figure 6.3.3(c). These types of waves occur in various applications, for
instance in the QCGL [19, 40], the -!-system [37], the Barkley model [2], and the
FitzHugh–Nagumo system [26]. Their treatment via the freezing method is discussed
in [41, 9, 6].
6.3.2 Hyperbolic systems The following hyperbolic system in one space dimension
may be viewed as a special case of (6.1.11) with A D 0:
(see [51, 45]), the freezing approach has found interesting applications to detecting
similarity solutions in Burgers’ equation; see [51, 9] for the one-dimensional and
[49, 50] for the multi-dimensional case.
Finally, we refer to [47, 48] where the stability of travelling waves and the freezing
approach is analysed for mixed parabolic-hyperbolic systems of the partitioned form
A11 0 g.u/ f .u/
ut D uxx C C 1 ; u.; 0/ D u0 ; (6.3.3)
0 0 B22 u2 x f2 .u/
with u D uu12 , a positive diagonalisable matrix A11 and a real diagonalisable matrix
B22 . This covers the famous Hodgkin–Huxley model for propagation of pulses in
nerve axons; cf. [9, Ch.3.1].
6.3.3 Nonlinear wave equations Another area of application are systems of non-
linear wave equations in one space dimension
for the unknown quantities .v; 1 ; 2 ;
/. Travelling waves .v? ; ? / appear as steady
states of (6.3.5) (with 1 D ? ; 2 D 0) and satisfy the equation
Differentiating the algebraic constraint in (6.3.5) w.r.t. time at t D 0 and inserting the
initial conditions leads to a first consistency condition for 01 ,
Figure 6.3.4. QNWE-front: space-time of u (a), v (b), profile v as a function of space (c), velocity
1 .t/ (top) and acceleration 2 .t/ (bottom) as functions of time (d).
The local stability of the PDAE system (6.3.5) is analysed in [10], while a gener-
alisation to several space dimensions and a numerical example appear in [11]. It is
interesting to note that the system (6.3.4) may be written as a first order system (6.3.2)
of dimension 3m. Taking a positive square root N D .M 1 A/1=2 and introducing
the variables U1 D u, U2 D ut C N ux , U3 D ut N ux C cu (with c 2 R arbitrary)
leads to a system (6.3.2) with
0 1 0 1
N 0 0 cU1 C U3
E D @0 N 0 A ; f .U / D @ g.U / A
0 0 N g.U / C cU2 (6.3.8)
g.U / D M 1
fQ.U1 ; 12 N 1
.U2 U3 C cU1 /; 1
2
.U2 C U3 cU1 //:
Though we prefer to solve numerically the second order system (6.3.5), the first order
system (with a suitable choice of the constant c) is useful for applying the stability
results from [46]; see [10] and Section 6.4.
Example 6.3.3 (Quintic Nagumo wave equation). Taking the quintic nonlinearity
f D fQ from (6.3.1) with the wave equation (6.3.4), we obtain the quintic Nagumo
wave equation (QNWE); see [11].
Figure 6.3.4(a) shows the time evolution for a travelling front of the QNWE for
parameters M D 21 , b2 D 25 , b3 D 12 , b4 D 20 17
, spatial domain Œ 50; 50, initial
1 x
data u0 .x/ D 2 .1 C tanh. 2 //, v0 .x/ D 0 and time range Œ0; 800. At time t 600,
the front leaves our computational domain. Figures 6.3.4(b) and 6.3.4(d) show the
time evolution of the front profile and the velocity obtained by solving (6.3.5) with
homogeneous Neumann boundary conditions, parameters M , bj , spatial domain and
initial data as before, template vO D u0 and time range Œ0; 1000. An approximation
of the front profile v? (with v D 0, vC D 1) and the approach towards the limit
velocity ? 0:07 are shown in Figures 6.3.4(c) and 6.3.4(d). The data for the
numerical solution of (6.3.4) resp. (6.3.5) are the same as in Example 6.3.1, except
for the step sizes x D 0:1 and t D 0:2.
Figure 6.3.5. Solitary wave of the NLS with spike-like initial perturbation:
direct numerical simulation (left) vs. solution of the freezing system (right).
stability. This changes fundamentally for PDEs with Hamiltonian structure which
typically allow several or even infinitely many conserved quantities. They fit into
the general class of evolution problems described in Section 6.1.1 but require quite
different techniques for establishing existence and uniqueness of wave solutions [23]
as well as their stability ([30, 31]).
As a model example, consider the cubic nonlinear Schrödinger equation (NLS,
see [16, 23, 36, 56]),
which may be subsumed under (6.1.1) with X D H 1 .RI C/ , Z D H 3 .RI C/. Equiv-
ariance holds with respect to the action
i
1
a.
/v D e v.
2 /;
D .
1 ;
2 / 2 G
u? .x; t/ D ei1 t v? .x 2 tI 1 ; 2 /;
p
! 2ei2 =2 22 (6.3.12)
v? .I 1 ; 2 / D ; ! 2 D 1 ;
cosh.!/ 4
see for example [22, Ch.II.3]. For the following numerical computations, we choose
parameter values 2 D 0:3, ! D 1. Discretisation in time is done via a split-step
Fourier method with step size t D 10 3 . The spatial grid is formed by 2K D 256
Computation and stability of waves 143
equidistant points on the interval Œx ; xC with xC D x D 0:11 28:56. A
spike-like perturbation at x D 11 is added to the initial data. Figure 6.3.5 shows the
solution for both the original system and the freezing system. Clearly, the freezing
system prevents the wave from rotating and travelling, while the interference pat-
terns caused by the initial perturbation are essentially preserved. A theoretical result
supporting these observations will be described in Section 6.4.4, and a detailed pre-
sentation can be found in the PhD thesis [20].
Example 6.3.4 (QNE). Figure 6.3.6(c) shows the time evolution of the superposi-
P
tion 2j D1 vj .x
j .t/; t/, which can be considered as an approximation of a trav-
elling 2-front u of the original QNE (6.2.10) with f from (6.3.1). The quantities
.vj ; j / are the solutions of (6.2.12) and provide approximations of .v?;j ; ?;j /.
Figure 6.3.6(c) shows that the lower front v1 (travelling at speed 1 ) is faster than the
upper front v2 (travelling at speed 2 ), i.e., we may expect ?;1 < ?;2 < 0. Fig-
ure 6.3.6(a) and 6.3.6(b) (resp. 6.3.6(d)) show the time evolution of the single front
profiles v1 and v2 (resp. the velocities 1 ; 2 ) obtained by solving (6.2.12) with
1
homogeneous Neumann boundary conditions, f from (6.3.1), parameters b2 D 32 ,
2 73
b3 D 5 , b4 D 100 , spatial domain Œ 200; 200, multi-waves
N D 2, initial data
0 v2
0 1 v2
v1 ./ D 2 tanh. 5 / C 1 , v2 ./ D 2 tanh. 5 / C 1 with v2 D b3 ,
10 D
20 D
0, templates vO j D vj0 , bump function './ D sech. 20 / and time range Œ0; 3000. Ap-
proximations of the single front profiles v?;j (with v1 D 0, v1C D a4 D v2 , v2C D 1)
and velocities ?;1 0:159, ?;1 0:021 are shown in Figure 6.3.6(a), 6.3.6(b)
and 6.3.6(d). For the numerical solution of (6.2.12), we used the FEM for space
discretisation with Lagrange C 0 -elements and maximal element size x D 0:4, the
BDF method for time discretisation with maximum order 2, intermediate time steps,
time step-size t D 0:8, and the Newton method for solving nonlinear equations.
Figure 6.3.6. 2-front of QNE: profile v1 (a), profile v2 (b), superposition (c), and velocities 1 .t/
(top) and 2 .t/ (bottom) (d).
144 W.-J. Beyn, D. Otten
Figure 6.3.7. Multiwaves: 3-front of QNE (a), pulse front of QCGL (b), 3-soliton of QCGL (c) and
position of centres (d).
6.4.1 Notions of stability and the co-moving frame equation In order to have
some flexibility for the application to PDEs, the following definition uses two norms
k:k1 and k:k2 which need not agree with the norms in the Banach spaces Z and X .
Moreover, depending on the type of PDE, a solution concept different from the strong
solution in Definition 6.1.2 may be necessary.
It is called stable with asymptotic phase if for any " > 0 there exists ı > 0 such that
for all initial values u0 2 Z with ku0 v? k1 6 ı the Cauchy problem (6.1.4) has a
unique strong solution u, and, for some
1 D
1 .u0 / 2 G, the solution satisfies the
two conditions
(
6 "; 8t > 0;
ku.t/ a.
1 ı
? .t//v? k2
! 0; as t ! 1:
Stability in general requires to investigate the solution of (6.1.4) for initial data
u0 D v? C v0 where v0 is a small perturbation of the wave profile. For this, we
transform into a co-moving frame via
which by contrast to the general ansatz (6.2.1) assumes the group orbit
? to be
known. One obtains a special case of (6.2.2), the co-moving frame equation
. L/ C D f 2 C W Re./ 6 0g:
However, the special properties of the PDEs considered here usually require more:
(P1) determine eigenvalues on the imaginary axis caused by the group action,
(P2) analyse the essential spectrum ess .L/ .L/ which arises from the loss of
compactness for differential operators on unbounded domains,
(P3) compute isolated eigenvalues of the point spectrum pt .L/ .L/ different
from those in (P1), either by a theoretical or by a numerical tool.
Let us finally note that a proof of nonlinear stability becomes particularly delicate if
there is no spectral gap between the eigenvalues from (P1) and the remaining spec-
trum. This occurs if the spectrum touches the imaginary axis (wave trains, spiral
waves, see [21, 54]) or lies on the imaginary axis (Hamiltonian case).
146 W.-J. Beyn, D. Otten
6.4.2 Spectral structures Hardly anything can be said about problems (P2), (P3)
above within the abstract framework of Equations (6.1.4) and (6.4.1). However, the
eigenvalues caused by symmetry have some general structure. For this purpose, recall
the Lie bracket Œ; W g g ! g (see e.g. [27, Ch. 8]) which turns g D T1 G into a Lie
algebra. The abstract definition of the bracket is in terms of the adjoint representation
Ad.g/ W g ! g of g 2 G given by
1
Ad.g/ D dh Œg ı h ı g jhD1 ; 2 g;
Œ; D dg ŒAd.g/jgD1./; ; 2 g:
d Œa.1/v?. Œ; ? / D 0:
In particular, if the stabiliser H.v? / is trivial , see (6.1.9), then independent eigen-
vectors j j D 1; : : : ; k of Œ; ? W g ! g lead to independent eigenfunctions
wj D d
Œa.1/v?j of (6.4.3), j D 1; : : : ; k.
Proof. For the family of relative equilibria (6.1.10), we have by the chain rule
d
F .a.
.g; t//a.g/v?/ D a.
.g; t//.a.g/v?/
dt
D d
a.g ı
? .t/ ı g 1 /.a.g/v?/ dh .g ı h ı g 1
/jhD
? .t /
?0 .t/;
6.4.3 Stability with asymptotic phase We discuss sufficient conditions for the sta-
bility with asymptotic phase in case of our two model equations, (6.1.11) and (6.1.12)
(see [9, 34, 35, 53, 58]).
For a travelling wave .v? ; ? /, the linearised operator L from (6.4.2) reads
Lw D Aw C .? Im C D2 f .v? ; v?; //w C D1 f .v? ; v?; /w
(6.4.11)
D Aw C B./w C C./w:
which is covered by our abstract approach only in case v˙ D 0; see Remark 6.1.1.
However, note that L W H 2 .R; Rm / ! L2 .R; Rm / is well defined in the general
case (6.4.12), and that it has the eigenvalue 0 with eigenfunction w D v?; , compare
Theorem 6.4.2 and (6.4.9) with 0 2 .? /. The essential spectrum of L is determined
by the constant coefficient operators
L˙ D A@2 C B˙ @ C C˙ ; C˙ D D1 f .v˙ ; 0/; B˙ D ? Im C D2 f .v˙ ; 0/:
(6.4.13)
148 W.-J. Beyn, D. Otten
Theorem 6.4.3. Let the spectral Assumptions (6.4.15) and (6.4.16) from above hold,
and let f be of the form
Remark 6.4.4. The semilinear case f2 0 is well studied, compare [34, 35, 53].
The more general form (6.4.17) includes Burgers equation (f2 .u/ D u) and is treated
in [57, 58]. Note that the global Lipschitz conditions imposed there can be localised
via the Sobolev embedding H 1 .R; Rm / L1 .R; Rm /.
In Sections 6.3.2 and 6.3.3, we referred to stability results for travelling waves in
hyperbolic systems of first order (6.3.2), (6.3.3) and of second order (6.3.4). Here, we
consider in more detail the stability of rotating waves for the model system (6.1.12).
Following [6], we restrict to d D 2 and A D Im . Extensions to d > 3 are based on
[7, 8] and will be indicated below. Moreover, we mention an alternative approach [52]
towards asymptotic stability (without asymptotic phase) based on a centre manifold
reduction.
2 2 m
consider
As in (6.1.16), a rotating wave v? 2 HEucl .R ; R / centred at x? D 0
0 ?
and with S? D ? 0 ; ? ¤ 0. We assume decay of derivatives up to order 2,
Theorem 6.4.5. Let f 2 C 4 .Rm ; Rm / and let the rotating wave .v? ; S? / satisfy the
spectral assumptions above. Then, the rotating wave is asymptotically stable with
2
asymptotic phase for Equation (6.1.12) with initial data u0 2 HEucl .R2 ; Rm /, for
strong solutions in the function class
C 1 .Œ0; 1/; L2 .R2 ; Rm // \ C.Œ0; 1/; H 2 .R2 ; Rm //;
and with respect to the norms k:k1 D k:kH 2 , k:k2 D k:kH 2 .
Eucl
Theorem 6.4.6. Let the assumptions of Theorem 6.4.3 hold and let the template func-
tion vO in (6.2.8) satisfy
vO 2 v? C H 2 .R; Rm /; hvO ; v? O L2 D 0;
vi hvO ; v?; iL2 ¤ 0:
Then, the travelling wave .v? ; ? / is asymptotically stable for (6.2.8). More pre-
cisely, there exist constants ı, C , ˛ > 0 such that (6.2.8) has a unique solution
.v; / if ku0 v? kH 1 6 ı and hvO ; u0 vi O L2 D 0. Existence and uniqueness
holds for solutions with regularity 2 C Œ0; 1/, v 2 C.Œ0; 1/; H 1.R; Rm //, vt ,
f .v; v / 2 C.Œ0; 1/, L2 .R; Rm //, and v.t/ 2 H 2 .R; Rm / for t > 0. Furthermore,
the following estimate is valid:
˛t
kv.t/ v? kH 1 C j.t/ ? j 6 Ce ku0 v? kH 1 ; t > 0:
The papers [58, 59] transfer these properties to a spatially discretised sytem (time
is left continuous) on bounded intervals J D Œx ; xC with general linear boundary
conditions
P .v.x / v / C Q v .x / C PC .v.xC / vC / C QC v .xC / D 0; (6.4.20)
A 2? M is invertible:
Computation and stability of waves 151
The spectral assumptions concern the quadratic operator polynomial obtained from
linearising the comoving frame equation in the first line of (6.3.5),
Re disp.P / 6 ˇ < 0;
Re pt .P .; @ // n f0g 6 ˇ < 0; and the eigenvalue 0 is simple:
Theorem 6.4.7. Let the assumptions above be satisfied and let the template function
vO in (6.3.5) fulfil
Then, the pair .v? ; ? / is asymptotically stable for the PDAE (6.3.5). More precisely,
for all 0 < < ˇ, there exist , C > 0 such that, for all u0 2 v? C H 3 .R; Rm /,
v0 2 H 2 .R; Rm / and 01 2 R which satisfy
as well as the consistency condition (6.3.6), the system (6.3.5) has a unique solution
.v; 1 ; 2 / with 1 2 C 1 Œ0; 1/, 2 2 C Œ0; 1/ and regularity
v v? 2 C 2 .Œ0; 1/; L2 .R; Rm //\C 1 .Œ0; 1/; H 1.R; Rm //\C.Œ0; 1/; H 2.R; Rm //:
Note that the second consistency condition (6.3.7) does not appear in the theorem
but is used in the proof to make the acceleration 2 continuous at t D 0. The proof
of the theorem builds on a careful reduction to the first order system (6.3.8) and on
an application of the stability theorem from [46]. The theory for first order systems
is also the reason for measuring the convergence (6.4.24) in a weaker norm than the
initial values.
152 W.-J. Beyn, D. Otten
hiv;
O v? i0 D 0; hvO x ; v? i0 D 0;
hiv;O iv? i0 hiv;O v?;x i0
is invertible.
hvO x ; iv? i0 hvO x ; v?;x i0
Then, the solitary wave .v? ; ?;1 ; ?;2 / from (6.3.12) is Lyapunov-stable for the sys-
tem (6.3.10), (6.3.11). More precisely, for every " > 0, there exists ı > 0 such that
the system (6.3.10), (6.3.11) with ku0 v? kH 1 6 ı has a unique (weak) solution
.v; 1 ; 2 / with 1 2 C 1 Œ0; 1/, 2 2 C Œ0; 1/ and regularity
For the notion of weak solution employed here, we refer to [20, Ch. 1.2]. The
proof of Theorem 6.4.8 is mainly based on Lyapunov function techniques which are
quite different from the semigroup and Laplace transform approaches used in the
proofs of Theorems 6.4.5–6.4.7. We also emphasise that [20] contains applications to
other PDEs with Hamiltonian structure, for example the nonlinear Klein Gordon and
the Korteweg–de Vries equation, and that spatial discretisations are also studied.
solving linear eigenvalue problems for operators such as (6.4.11) on a bounded inter-
val J D Œx ; xC . In the following, we summarise two of the major results from [5]
on this problem.
Contour methods have been developed over the last years ([1, 4, 32]) and have
become rather popular since no a-priori knowledge about the location of eigenvalues
is assumed. The paper [5] generalises the contour method from [4] to holomorphic
eigenvalue problems
L./v D 0; v 2 X; 2 C; (6.5.1)
Theorem 6.5.1. Let the above assumptions hold and assume the following nonde-
generacy condition:
rank hwj ; xk ikD1;:::;~
j D1;:::;p D ~ D rank hy j ; v ikD1;:::;`
k j D1;:::;~ : (6.5.4)
First note that (6.5.4) implies p; ` > ~, i.e., the number of test functions and test
functionals should exceed the number of eigenvalues inside the contour. In fact, in
applications we expect to have p ` ~. The key to the proof is the theorem of
Keldysh (see [38, Thm. 1.6.5]) which describes the coefficients of the meromorphic
expansion of L./ 1 near its singularities in terms of (generalised) eigenvectors. We
mention that Theorem 6.5.1 generalises to eigenvalues of arbitrary geometric and al-
gebraic multiplicity. With the proper definition of generalised eigenvectors of (6.5.1),
it turns out that the Jordan normal form of the matrix EL in (6.5.6) inherits the exact
multiplicity structure of the nonlinear eigenvalue problem; see [5, Thm. 2.8]. For the
overall algorithm, one approximates the integrals in (6.5.3) by a quadrature rule (for
analytical contours , the trapezoidal sum is sufficient since it leads to exponential
convergence [4]) and solves linear systems L./uk D vk with k D 1; : : : ` at the
quadrature nodes 2 . Note that these solutions can be used for both integrals in
(6.5.3). The (shortened) singular value decomposition (6.5.5) involves a rank deci-
sion revealing the number ~ of eigenvalues inside the contour. Finally, solving the
linear (!) eigenvalue problem for the matrix EL 2 C~;~ is usually cheap if ~ is small.
Let us note that the algorithm also provides good approximations of the eigen-
functions associated to j , j D 1; : : : ; ~; see [4] and [5, Sec. 2.2]. There is even an
extension of the contour method to cases where the nondegeneracy condition (6.5.4)
is violated. Then, one computes some higher order moments
Z
1
E D E./d; D 0; 1; 2; : : : ; (6.5.7)
2i
and determines the eigenvalues from a suitable block Hankel matrix (see [4] for the
extended algorithm and for the number of additional integrals needed). Numerical
examples with more details on the algorithm may be found in [4], and applications to
the travelling waves considered here appear in [5, Sec. 6].
Another favourable feature of the method is that the errors occurring in the inter-
mediate steps (6.5.2), (6.5.3), (6.5.5), (6.5.6) are well controllable. We demonstrate
this for the operator L./ D I L with the differential operator L taken from
(6.4.11). The evaluation of the matrix E./ from (6.5.2) requires to solve inhomoge-
neous equations
Such -dependent boundary matrices P˙ ; Q˙ 2 C.; R2m;m / occur with the so-
called projection boundary conditions [3] and lead to fast convergence towards the
solution of (6.5.8) as x˙ ! ˙1. The matrices are determined in such a way (see [5,
Computation and stability of waves 155
Theorem 6.5.2. Let the assumptions of Theorem 6.4.3 hold except for the condi-
tion (6.4.16) on the point spectrum. Let fz 2 C W Rez > ˇg with ˇ from
(6.4.15) be a closed contour which lies in the resolvent set of the operator pencil
Then, for J D Œx ; xC sufficiently large, the linear boundary value problem with
projection boundary conditions
Note that the integrals (6.5.9) are the quantities approximating the integrals (6.5.7)
over the unbounded domain. With the estimates (6.5.10) at hand, it is not difficult to
show that the singular values obtained in (6.5.5) and finally the eigenvalues of EL in
(6.5.6) inherit the exponential error estimate; see [5, Sec. 4].
Let us finally note that the computation of isolated eigenvalues for the linearised
operator becomes rather challenging for waves in two and more space dimensions.
We consider the contour method to be a true competitor to classical methods for
computing eigenvalues of linearisations at such profiles.
156 W.-J. Beyn, D. Otten
References
[1] J. Asakura, T. Sakai, H. Tadano, T. Ikegami and K. Kimura, A numerical method for nonlinear
eigenvalue problems using contour integrals. JSIAM Lett. 1 (2009), 52–55.
[2] D. Barkley, Euclidean symmetry and the dynamics of rotating spiral waves, Phys. Rev. Lett.
72 (1994), 164–167.
[3] W.-J. Beyn, The numerical computation of connecting orbits in dynamical systems. IMA J.
Numer. Anal. 10 (1990), 379–405.
[4] W.-J. Beyn, An integral method for solving nonlinear eigenvalue problems, Linear Algebra
Appl. 436 (2012), 3839–3863.
[5] W.-J. Beyn, Y. Latushkin and J. Rottmann-Matthes, Finding eigenvalues of holomorphic Fred-
holm operator pencils using boundary value problems and contour integrals, Integral Equa-
tions Operator Theory 78 (2014), 155–211; arXiv:1210.3952.
[6] W.-J. Beyn and J. Lorenz, Nonlinear stability of rotating patterns. Dyn. Partial Differ. Equ. 5
(2008), 349–400.
[7] W.-J. Beyn and D. Otten, Spatial decay of rotating waves in reaction diffusion systems, Dyn.
Partial Differ. Equ. 13 (2016), 191–240; arXiv:1602.03393.
[8] W.-J. Beyn and D. Otten, Spectral analysis of localized rotating waves in parabolic systems,
Philos. Trans. A 376 (2018) no. 20170196.
[9] W.-J. Beyn, D. Otten and J. Rottmann-Matthes, Stability and computation of dynamic pat-
terns in PDEs. In Current Challenges in Stability Issues for Numerical Differential Equations
(L. Dieci, and N. Guglielmi, eds.), Springer, Cham, 2014, 89–172.
[10] W.-J. Beyn, D. Otten and J. Rottmann-Matthes, Computation and stability of traveling waves
in second order equations, SIAM J. Numer. Anal. 56(3) (2018), 1786–1817.
[11] W.-J. Beyn, D. Otten and J. Rottmann-Matthes, Freezing traveling and rotating waves in sec-
ond order evolution equations. In Patterns of Dynamics (P. Gurevich, J. Hell, B. Sandstede,
A. Scheel, eds.), Springer Proc. Math. Stat. 205, 2017, 215–241.
[12] W.-J. Beyn, S. Selle and V. Thümmler, Freezing multipulses and multifronts, SIAM J. Appl.
Dynam. Syst. 7 (2008), 577–608.
[13] W.-J. Beyn and V. Thümmler, Freezing solutions of equivariant evolution equations, SIAM J.
Appl. Dynam. Syst. 3 (2004), 85–116.
[14] W.-J. Beyn and V. Thümmler, Phase conditions, symmetries, and PDE continuation. In
Numerical Continuation Methods for Dynamical Systems (B. Krauskopf, H. Osinga, and
J. Galan-Vioque, eds.), Springer, Dordrecht, 2007, 301–330.
[15] A. M. Bloch and A. Iserles, Commutators of skew-symmetric matrices, Internat. J. Bifur.
Chaos Appl. Sci. Engrg. 15 (2005), 793–801.
[16] T. Cazenave, Semilinear Schrödinger Equations, Courant Lecture Notes in Mathematics
vol.10, AMS, Providence, RI, 2003.
[17] A.R. Champneys and B. Sandstede, Numerical computation of coherent structures. In Numer-
ical Continuation Methods for Dynamical Systems (B. Krauskopf, H. Osinga, and J. Galan-
Vioque, eds.), Springer, Dordrecht, 2007, 331–358.
[18] P. Chossat and R. Lauterbach, Methods in Equivariant Bifurcations and Dynamical Systems,
World Scientific, River Edge, 2000.
[19] L.-C. Crasovan, B.A. Malomed and D. Mihalache, Spinning solitons in cubic-quintic nonlin-
ear media, Pramana-J. Phys. 57 (2001), 1041–1059.
Computation and stability of waves 157
[41] D. Otten, Spatial Decay and Spectral Properties of Rotating Waves in Parabolic Systems,
PhD thesis, Univ. Bielefeld, 2014, 1–271.
[42] D. Otten, Exponentially weighted resolvent estimates for complex Ornstein–Uhlenbeck sys-
tems, J. Evol. Equ. 15 (2015), 753–799; arXiv:1510.00823.
[43] D. Otten, The identification problem for complex-valued Ornstein–Uhlenbeck operators in
Lp .Rd ; CN /, Semigroup Forum (2016), 1–38; arXiv:1510.00827.
[44] D. Otten, A new Lp -antieigenvalue condition for Ornstein–Uhlenbeck operators, J. Math.
Anal. Appl. 444 (2016), 753–799; arXiv:1510.00864.
[45] J. Rottmann-Matthes, Computation and Stability of Patterns in Hyperbolic-Parabolic Sys-
tems, PhD thesis, 2010, 1–187.
[46] J. Rottmann-Matthes, Stability and freezing of nonlinear waves in first order hyperbolic PDEs,
J. Dynam. Diff. Equ. 24 (2012), 341–367.
[47] J. Rottmann-Matthes, Stability and freezing of waves in nonlinear hyperbolic-parabolic sys-
tems, IMA J. Appl. Math. 77 (2012), 420–429.
[48] J. Rottmann-Matthes, Stability of parabolic-hyperbolic traveling waves, Dyn. Partial Differ.
Equ. 9 (2012), 29–62.
[49] J. Rottmann-Matthes, Freezing similarity solutions in the multi-dimensional Burgers’ equa-
tion, Nonlinearity 30(12) (2017), 4558–4586.
[50] J. Rottmann-Matthes, An IMEX-RK scheme for capturing similarity solutions in the multidi-
mensional Burgers’ equation, IMA J. of Numer. Anal., in press; arXiv:1612.04127.
[51] C.W. Rowley, I.G. Kevrekidis, J.E. Marsden and K. Lust, Reduction and reconstruction for
self-similar dynamical systems, Nonlinearity 16 (2003), 1257–1275.
[52] B. Sandstede, A. Scheel and C. Wulff, Dynamics of spiral waves on unbounded domains using
center manifold reductions, J. Diff. Eqs. 141 (1997), 122–149.
[53] B. Sandstede, Stability of travelling waves, in: Handbook of dynamical systems, Vol. 2,
(B. Fiedler, ed.), North-Holland, Amsterdam (2002), 983–1055.
[54] B. Sandstede and A. Scheel, Absolute versus convective instability of spiral waves. Phys. Rev.
E 62 (2000), 7708.
[55] S. Selle, Decomposition and Stability of Multifronts and Multipulses, PhD thesis, Univ. Biele-
feld (2009), 1–148.
[56] C. Sulem and P.-L. Sulem, The nonlinear Schrödinger equation: Self-focusing and wave col-
lapse. Appl. Math. Sci. 139, Springer, New York, 1999.
[57] V. Thümmler, Numerical Analysis of the Method of Freezing Traveling Waves, PhD thesis,
Univ. Bielefeld (2005), 1–151.
[58] V. Thümmler, Numerical approximation of relative equilibria for equivariant PDEs, SIAM J.
Numer. Anal. 46 (2008), 2978–3005.
[59] V. Thümmler, The effect of freezing and discretization to the asymptotic stability of relative
equilibria, J. Dynam. Diff. Eqs. 20 (2008), 425–477.
[60] A.I. Volpert, V.A. Volpert and V.A. Volpert, Traveling wave solutions of parabolic systems,
American Mathematical Society, Providence, RI, 1994.
[61] T. Watanabe, M. Iima and Y. Nishiura, A skeleton of collision dynamics: hierarchical network
structure among even-symmetric steady pulses in binary fluid convection, SIAM J. Appl. Dyn.
Syst. 15 (2016), 789–806.
Chapter 7
Initial value problems for nonlinear dispersive
equations at critical regularity
S. Herr
Global regularity results for nonlinear dispersive equations hinge on a thorough un-
derstanding of the Cauchy problem in spaces of functions of low regularity. This is
most challenging in scale invariant regimes as solutions interact strongly on multiple
frequency-scales. Here, some recent progress on the critical well-posedness theory
will be reviewed, with a focus on nonlinear Schrödinger and Dirac equations.1
7.1 Introduction
This is a survey on certain aspects of nonlinear dispersive equations in critical Sobolev
spaces. First, we illustrate key concepts and paradigms and explain the connection
to harmonic analysis. Second, we report on recent progress on the well-posedness
theory at the critical level of regularity and its impact on questions concerning the
longtime behaviour of solutions with a certain bias towards the results of Project
B8. The focus will be on nonlinear Schrödinger equations on compact manifolds in
Section 7.2 and the cubic Dirac equation, the Dirac–Klein–Gordon system, and the
Zakharov system in Section 7.3.
7.1.1 Scaling and conserved quantities Let us start with an example. Let p > 1
and consider the nonlinear Schrödinger equation
i @t u C u D ˙jujp 1
u; (7.1.1)
1 Project B8
160 S. Herr
We conclude that
i @t u C u D ˙ju jp 1
u
iff ˛ D p 1 . Now, suppose that initial data u.0; / D f 2 HP s .Rd / is given, where
2
The scaling argument above suggests that a well-posedness theory for initial data in
the critical space H sc .Rd / is key for the analysis of the longtime behaviour. The
regime s > sc is called subcritical and the regime s < sc is called supercritical. We
do not expect a well-posedness theory in H s .Rd / in the supercritical case s < sc ,
because small times of existence for small solutions can be rescaled to large times of
existence for large solutions and this is known to be false in many problems.
Nonlinear Dispersive equations at critical regularity 161
Suppose that we are given a smooth and decaying solution u of (7.1.1). Then, one
can easily check that
Z
d d
ju.t; x/j2 dx D 0 and e.u.t// D 0
dt Rd dt
for the energy
Z Z
1 1
e.u.t// D jru.t; x/j2 dx ˙ ju.t; x/jpC1 dx;
2 Rd pC1 Rd
so that both ku.t/kL2 .Rd / and e.u.t// are constant with respect to t.
Observe that sc D 1 iff p D 1 C d 4 2 if d > 2. Further, the sharp Sobolev
embedding implies
ku.t/kLpC1 .Rd / . kru.t/kL2 .Rd /
in this case. Here and in the following we will use the notation A . B if there exists
a harmless (e.g. only depending on d and p) constant c > 0, such that A 6 cB. If
we can choose c smaller than 1 (depending on other previously fixed parameters), we
write A B. We also write A B if A . B and B . A.
From the above consideration we conclude that e.u.t// kru.t/k2L2 .Rd / , pro-
vided that kru.t/kL2 .Rd / is sufficiently small. In other words, when p D 1 C d 4 2 ,
the critical space corresponds to the energy space. Hence, this problem is called
energy-critical. Notice that, for small initial data f 2 H 1 .Rd /, in this setting a local
existence result with a time of existence depending only on kf kH 1 .Rd / yields global
existence owing to the a-priori bound on ku.t/kH 1 .Rd / coming from conservation of
the energy and the L2 .Rd /-norm. To sum up, if d > 2, the well-posedness problem
for p D 1 C d 4 2 in H 1 .Rd / is both challenging, because it is scaling-critical, and
rewarding, because it allows to exploit the energy conservation to establish global
existence (at least for small data). We remark that (7.1.1) has a Hamiltonian structure
and there are interesting special solutions, see chapter 6.3.4.
For energy-critical Schrödinger equations on Rd , even for initial data of arbitrary
size, many of the fundamental questions concerning well-posedness, scattering and
blow-up have been solved within the last decade, see e.g. the surveys by Kenig [39,
40] and the references therein.
It is obvious that Lp -estimates for solutions are crucial to derive such results.
We will discuss these estimates, often called Strichartz estimates in this context, in
Subsection 7.1.2. In Section 7.2 we will address corresponding problems for energy-
critical Schrödinger equations in a slightly different context, namely on compact man-
ifolds. Further, we will consider analogous issues for certain systems on Rd in Sec-
tion 7.3.
7.1.2 Dispersive equations and Fourier restriction Consider the linear Schrödin-
ger equation
i @t u C u D 0:
162 S. Herr
For sufficiently smooth and decaying solutions, taking the spatial Fourier transform
Z
d
u.t; / WD .2/ 2
b e ix u.t; x/ dx
Rd
Indeed, if (say) f 2 Cc2 .Rd /, one can easily check that u is a solution in the classical
sense.
More generally, let us assume that W Rd ! R is measurable and that it does not
grow faster than some polynomial. It is the symbol of the Fourier multiplier . i r/,
i.e.
1
. i r/f ./ D ./f b./:
For, say, f 2 S .Rd /, the function
Z
d
u.t; x/ D .2/ 2 b./ d DW ei t .
ei t ./ eix f i r/
f .x/
Rd
solves
i @t u C . i r/u D 0:
Important examples are
holds true? Here, we implicitly restrict to compact subsets by considering the measure
on † defined by
Z Z
h d D h˛ dHn 1; for all h 2 L1 .†/;
† †
i @t u C . i r/u D 0;
with initial data given as the inverse Fourier transform of G ./; . To summarise,
we have proved that estimates in space-time Lebesgue norms for solutions u are
equivalent to estimates for the Fourier restriction operator. This is of particular impor-
tance in the case q 0 D 2, in which case Strichartz made this connection and proved
164 S. Herr
such estimates for important phase functions . Let us sketch a conceptual proof
which has been developed later. We notice that
kR† k2Lp .Rn /!L2 .†/ D kE† k2L2 .†/!Lp0 .Rn / D kE† R† kLp .Rn /!Lp0 .Rn / ;
and we compute
Z
n
E† R† F .y/ D .2/ 2 b ./ d ./
eiy F
†
Z Z
D .2/ n ei.y z/ d ./ F .z/ dz
Rn †
n
D .k† F /.y/; where k† D .2/ 2 . /:
b
solves i @t C u D 0 with initial data u.0/ with Fourier coefficients .cn / . This can
be viewed as a Fourier extension operator for the discrete paraboloid † D f.; n/ 2
Z 1Cd j D jnj2 g: Again, this defines a group of unitary operators in H s .Td /.
However, in this setting, there can be no dispersive estimate implying supx ju.t; x/j !
0 as t ! 1, because this would violate ku.t/kL2 .Td / D ku.0/kL2.Td / as Td has fi-
nite measure.
Still, one can prove Lp -estimates in this setting. To a large extent, this theory has
a number-theoretic flavour. Let us look at an example, which is due to Bourgain [11]
Nonlinear Dispersive equations at critical regularity 165
if d > 2 and to Zygmund [61] if d D 1, and reveals the connection to estimates for
lattice points and exponential sums. Assume that cn D 0 if jnj > N for some N > 0.
We express the exponential sum for u2 as a space-time Fourier series, i.e.
X 2 2
u2 .x; t/ D .2/ d ck cn k ei.t;x/. jkj jn kj ;n/
n;k2Z d
X
d
D .2/ an;m ei.t;x/.m;n/
.n;m/2Z d C1
P
where an;m D k2Sn;m ck cn k and Sn;m D fk 2 Z d j jkj 6 N; jkj2 C jn kj2 D
mg. Plancherel’s Theorem and the Cauchy-Schwarz inequality yield
X
kuk4L4 .Td C1 / D ku2 k2L2 .Td C1/ D jan;m j2
n;m
X X ˇ 2
6 max #Sn;m ˇck j jcn kj
2
jnj62N
jmj62N 2 n2Z d m2Z;k2Sn;m
X 2
6 max #Sn;m jcn j2
jnj62N
jmj62N 2 n
Completing the squares implies that k 2 Sn;m if and only if j2k nj2 D 2m jnj2 .
Let sd .r/ denote the number of integer lattice points on the .d 1/-sphere of radius
r. We obtain 1
kukL4 .Td C1/ 6 max sd .r/ 4 ku.0/kL2 .Td / :
06r62N
This estimate and its ramifications are of fundamental importance in the analysis of
the cubic nonlinear Schrödinger equation on Td . Recall that the asymptotics for sd .r/
are part of classical analytic number theory. More recently, Bourgain and Demeter
proved the decoupling conjecture [12]. This yields another approach to Lp -estimates
in the periodic setting more akin to Fourier restriction theory which does not rely on
analytic number theory, but we will not go into details here.
i @t u C g u D 0 on R M; (7.2.1)
In some cases, such as tori and spheres, the order of this derivative loss can be
extenuated, by exploiting more specific information on the spectrum and the eigen-
functions.
We are interested in nonlinear Schrödinger equations in this context, say
i @t u C g u D ˙jujp 1
u on R M; (7.2.2)
and ask the questions of local and global well-posedness analogous to the discus-
sion in Subsection 7.1.1, again with a special emphasis on the energy-critical case
p D 1 C d 4 2 and d D 3, also called the quintic nonlinear Schrödinger equation.
It is immediate that for results in scale-invariant regimes one needs scale-invariant
estimates, so that a loss of derivatives in Strichartz estimates is a significant problem.
In [15, 16, 17] Burq–Gérard–Tzvetkov proved multilinear versions of Strichartz
estimates on compact manifolds. Using spectral information, such as the precise
Nonlinear Dispersive equations at critical regularity 167
knowledge of the spectrum and sharp (multilinear) estimates for the eigenfunctions,
some scale-invariant estimates could be recovered. In previous joint work with Tataru–
Tzvetkov [35], we found some additional almost orthogonality principles and con-
structed critical function spaces to solve the first energy-critical global well-posedness
problem on a compact manifold [35], see also [36]. In subsequent work of Ionescu–
Pausader [38], this has been extended to large initial data in the defocusing case. This
is done by an indirect argument which, among other ideas, relies on the global well-
posedness result of Colliander–Keel–Staffilani–Takaoka–Tao [23] in the Euclidean
setting.
On flat tori, the spectrum of g , consisting of sums of squares of integers, is
a rather delocalised subset of the real line. On the other hand, relative to the cor-
responding eigenvalue the eigenfunctions are quite small in L1 .M /. Spectrally, an
other extreme case is the round sphere Sd . Here, the spectrum, being a shifted se-
quence of squares of integers, is well localised, but the eigenfunctions, the spherical
harmonics, saturate the worst case L1 .M /-bound given by Weyl’s estimate. It is a
special case of a Zoll manifold, where all geodesics are closed with a common mini-
mal period. Due to results of Duistermaat–Guillemin [25] and Colin de Verdière [22],
Zoll manifolds are characterised by the fact that the spectrum of g is contained
in a union of uniformly bounded intervals centered at a shifted sequence of squares
of integers. For this class of manifolds in d D 3, local and small data global well-
posedness for the quintic nonlinear Schrödinger equation in H 1 .M / was proved in
[29] and extended to large data by Pausader–Tzvetkov–Wang [46].
Given these results, one is inclined to ask: Under which spectral assumptions
can one prove scale-invariant well-posedness results? As a starting point, consider
general flat rectangular tori M D T3 WD R3 =.21 Z 22 Z 23 Z/ for j > 0
and products of spheres M D S S2 for > 0. Spectrally, these can be viewed as
intermediate cases between T3 and S3 . Strunk [54, 55] proved local and small data
global well-posedness for the quintic equation in H 1 .T3 / similar to the rational case
[35], besides further critical results in dimension two. His proofs are based on the
one-dimensional exponential sum estimate
X
2
1 2
cn ei t n
p . .1 C jJ j/ 2 p k.cn /k`2 ;
L .Œ0;2/
n2J \Z
for any bounded interval J R and fixed p > 4, which in case J D Œ1; N is
due to Bourgain [10], see also [29] and [55, Section 1.3.3] for the above version.
Around the same time, Killip–Visan [41] derived scale-invariant versions of Strichartz
estimates on irrational tori from [12], which allows for alternative proofs of critical
well-posedness results. For the defocusing quintic nonlinear Schrödinger equation
in dimension d D 3, Strunk [56, 55] extended his small data result to global well-
posedness for initial data of arbitrary size in H 1 .T3 /, using ideas of [38]. Concerning
M D S S2 small data global well-posedness was proved in joint work with Strunk
[34] in the case D 1 and in [55] in the general case > 0.
168 S. Herr
The aim is to invoke the contraction mapping principle in a suitable complete metric
space X 1 .I /, which we describe next, see [35, 29, 34] and the references therein for
details. For 1 6 r < 1, a step function a W R ! L2 .M / is called a Ur g -atom, if
K
X K
X
i t g
a.t/ D 1Œtk 1 ;tk /
e ak ; kak krL2 D 1;
kD1 kD1
for a partition 1 < t0 < < tK 6 C1. The normed vector space Ur g is
defined as the corresponding atomic space. The normed vector space Vr g consists of
all right-continuous v W R ! L2 .M / such that
K
! 1r
X
kvkV r D sup kei tk g v.tk / e i tk 1 g
v.tk r
1 /kL2 < C1;
1<t0 <<tK 6C1
kD1
Next, we transfer the estimate (7.2.3) from free solutions to functions in Y 0 , i.e. we
prove that there exists ı0 > 0, such that for all uj 2 Y 0 and N1 > N2 > N3 > 1
Y3
N 1 ı 0 Y
3
3
PNj uj
. N2 N3 C kuj kY 0 : (7.2.4)
L2 .Œ0;2M / N1 N2
j D1 j D1
This is a crucial step in scale-invariant problems and we sketch the proof devised in
[29] and [34, Section 3]. In a first step, consider atoms
Kj Kj
X X
i t g
uj D 1Œt ;t /e kj ; kkj k2L2 D 1;
k 1 k
j j
kj D1 kj D1
In addition, for any fixed r > 1, by Hölder’s inequality and the Sobolev embedding
we obtain the trivial estimate
Y3
1
PN uj
. jI j 2 kPN u1 kL1 L2 kPN u2 kL1 kPN u3 kL1
j L2 .Œ0;2M / 1 t x 2 t;x 3 t;x
j D1
(7.2.7)
3
Y
3
. .N2 N3 / 2 kuj kU r ;
g
j D1
. kkH 1 .M / C kuk5X 1 .I / ;
provided that
sup jhjuj4 u; viL2 .I M / j . kuk5X 1 .I / : (7.2.8)
v2Y 1 .I /61
Due to the polynomial structure, one can prove the contraction property along the
same lines and the proof of Theorem 7.2.1 can be completed by applying the contrac-
tion mapping principle and the energy conservation in the standard way. We omit the
details.
Estimate (7.2.8) follows from
X Y5 5
Y
h PN vj ; PN v0 iL2 .I M / . kv0 kY 1 kvj kY 1 (7.2.9)
j 0
j D1 j D1
where indicates summation over all dyadic N0 ; N1 ; : : : ; N5 > 1 under the constraint
N1 > : : : > N5 . This is because Y 1 ,! X 1 , vj either denotes uj or uj , and due to
the symmetry we are free to assume the ordering N1 > : : : > N5 . Now, we outline
how to sum up the dyadic pieces, using (7.2.4). On a general manifold, there are two
contributions,
X Y5
h PN vj ; PN v0 iL2 .I M / D †1 C †2 C †3
j 0
j D1
Nonlinear Dispersive equations at critical regularity 171
X N 1 ı 0 N4 1 ı 0 Y
5
5
. N2 N3 N4 N5 C C kPNj vj kY 0
N1 N3 N0 N2
IN2 6N0 N1 j D1
5
Y
. kv0 kY 1 kvj kY 1 :
j D1
2
From Ref6jH1 j2 jH2 j4 C 4H1 H23 H2 g > 2jH1 j2 jH2 j4 we conclude that
Z T Z
jH1 j2 jH2 j4 dx dt . kh1 kH 1 kh1 kH 1 kh2 k4H 1 :
0 M
By polarisation, we obtain the claimed inequality. We refer to [34] for more details,
also to the earlier work [15, Rem. 2.12] in the context of cubic Schrödinger equations.
Let us reiterate that the validity of (7.2.3) has been verified for M D T3 [35],
M D S3 (and 3d Zoll manifolds) [29], M D T3 [54], and M D S S2 [34, 55].
7.2.3 Related results There are other cases which can be treated similarly. For
instance, the case of radial initial data in the 3d unit ball with Dirichlet boundary
conditions is similar to the case M D S3 , as remarked in [46] and carried out ex-
plicitly in the Master’s thesis of Frieda Wall in 2016. Also, concerning critical local
well-posedness of equation (7.2.1) for sufficiently large odd integers p, the same
strategy based on multilinear estimates applies on, say, spheres and tori, which has
been verified in Master’s theses of Jakob Herrenbrück (M D Sd ) and Magnus Winter
(M D Td ) in 2016.
Concerning the cubic nonlinear Schrödinger equation, which is energy-critical in
dimension d D 4, we remark that bilinear estimates play a similar role. With Tataru
and Tzvetkov, we established small data global well-posedness in H 1 .T4 / and certain
product spaces in [36], see also [41] for the case of the irrational torus. Arguably, the
most interesting setting is the L2 -critical cubic nonlinear Schrödinger equation in
d D 2, where no critical well-posedness result is known.
In [24], Colliander–Keel–Staffilani–Takaoka–Tao proved that there is a low-to-
high frequency cascade leading to growth of Sobolev norms beyond the energy reg-
ularity, which supports a conjecture known as weak turbulence. The construction is
based on a certain discrete model dynamical system with a Hamiltonian structure.
With Marzuola [31], we analyse certain rarefaction wave-like solutions to this system
which transfer energy from low to high frequencies.
We only mention briefly that the methods described above have been applied to
rigorously derive defocusing nonlinear Schrödinger equations from the dynamics of
Nonlinear Dispersive equations at critical regularity 173
In joint work with Bejenaru [4, 5] we proved the critical global well-posedness
and scattering results in d D 2; 3 for small initial data in the massive case M > 0, and
Bournaveas–Candy [13] established this in the massless case M D 0. We summarise
these developments as follows.
Theorem 7.3.1. Let M > 0 and d D 2; 3. The cubic Dirac equation is globally
d 1
well-posed for small initial data in H 2 .Rd / and these solutions scatter to free
solutions as t ! ˙1.
Let us briefly mention selected previous results. Local well-posedness was ob-
tained in H s .R3 / for s > 1 (subcritical range) by Escobedo–Vega in [26], global
well-posedness and scattering was proved by Machihara–Nakanishi–Ozawa [43] for
small initial data in H s .R3 /, s > 1 as well as for small initial data in H 1 .R3 / with
some regularity in the angular variable in [44]. In d D 2, local well-posedness in
H s .R3 /, s > 21 was obtained by Pecher [47]. We refer to [4, 5] for more references.
The idea of proof is the following: First, the system is reduced to a system of
half-Klein–Gordon equations with null structure in the nonlinearity. Second, global
in time nonlinear estimates are derived which allow to invoke the contraction map-
ping principle. The key ingredient here are certain endpoint Strichartz and energy
estimates in adapted systems of coordinates. We now describe the ideas, focusing on
the easier case d D 3. By rescaling, it suffices to consider M D 1.
Multiplying equation (7.3.1) from the left by the matrix
0 DW ˇ leads to
i.@t C ˛ r C iˇ/ D . N /ˇ
where ˛ j D
0
j and ˛ r D ˛ j @j . Here, the operator ˛ r C iˇ is defined via the
symbol ˛ C ˇ. Due to .˛ C ˇ/2 D .jj2 C 1/I , the matrix ˛ C ˇ has the
eigenvalues ˙hi. Let …˙ ./ denote the projections onto the eigenspaces and …˙
the Fourier multiplier operator. This leads to the equivalent system
.i @t C hri/ C D …C .. N /ˇ /;
(7.3.2)
.i @t hri/ D … .. N /ˇ /
for ˙ D …˙ , D …C C … .
Next, we will discuss the issue of Strichartz estimates for half-Klein–Gordon
equations. Let U.t/ D ei t hri . The endpoint estimate
kUkL2 .R;L1 3
x .R //
. kkHP 1 .R3 /
t
fails to hold, as for the wave equation, even if b is localised to some dyadic annulus
Aj D f 2 R3 j jj 2j g. Let .j /j 2N0 be a smooth partition of unity subordinate
to .Aj /j 2N0 , where A0 is the full unit ball, and Pj the associated Fourier localisation
operator. In oder to deal with cubic nonlinearities we construct a useful replacement.
As discussed in Section 7.1, we need to understand the decay properties of the kernel
Z
Kj .t; x/ D 1
e˙i.t;x/.hi;/ 2j ./ d D 2j H .t; x/;
R3
Nonlinear Dispersive equations at critical regularity 175
which implies kKj; kL1 1. 1, from which we obtain the desired Strichartz type
t Lx
In the case d D 2, this is much more difficult, as L2t L1 x is the forbidden endpoint
for the Schrödinger equation. The strategy is to fix a finite time horizon T > 0 and
prove estimates in T -dependent norms, uniformly in T , similar to the Schrödinger
maps problem [9]. We omit the details and refer to [5] instead.
Now, we come back to the nonlinear system (7.3.2). The nonlinearity exhibits a
so-called null structure. It damps down products of waves with parallel frequencies,
similar to other nonlinear wave equations arising e.g. in general relativity and geo-
metric wave equations. Using the anti-commutativity relations of the Dirac matrices,
one can show that
1 1
…˙ ./… ./ D O .†.; // C O .hi C hi /
176 S. Herr
Let us describe some aspects of the proof, in particular a crucial bilinear estimate.
1
Suppose that supp. m .t// fjj 2jm g, 2j1 6 2j2 . Then,
`j
To prove this, we decompose the spinors on the Fourier side into caps of radius 2 ,
for 2`1 2`2 . 2j1 . Due to orthogonality, it suffices to prove
X
kh…˙ P1 1 ; ˇ…˙ P2 2 ikL2 . 2j1 `1 k 1 kS ˙ k 2 kS ˙
j1 j2
.1 ;2 /2D`1 ;`2
Consider 1 ; 2 of low modulation and 2`1 2`2 2j1 only. The null-structure
yields, ignoring terms of lower order,
X
kh…˙ P1 1 ; ˇ…˙ P2 2 ikL2
.1 ;2 /2D`1 ;`2
X X
`1
. 2 kP1 \ 1 kL2 L1 kP2 2 kL1 L2
t x t x
.1 ;2 /2D`1 ;`2 2Kj1
which implies (7.3.3) in this case. Given the bilinear estimate (7.3.3), it requires more
work to establish the contraction property, but we do not go into further detail here.
Nonlinear Dispersive equations at critical regularity 177
Instead, let us briefly discuss some related problems. First of all, the model (7.3.1)
is also interesting in d D 1 and critical global well-posedness has been established in
[18]. Very recently, Candy–Lindblad [20] proved a modified scattering result in this
case.
The Dirac equation can be coupled to other equations, which is relevant in physics
and gives rise to mathematical challenges. For instance, the Dirac–Klein–Gordon
system is a basic model of proton-proton interactions (one proton is scattered in a
meson field produced by a second proton) or neutron-neutron interaction. It is
i
@ CM
D ;
(7.3.4)
C m D N :
2
Theorem 7.3.2. Assume that > 0 and 2M > m > 0. Then, (7.3.4) is globally
well-posed for small initial data
1 1
0 2 H .R3 I C4 /; .0 ; 1 / 2 H 2 C .R3 I R/ H 2 C .R3 I R/
and these solutions scatter to free solutions for t ! ˙1.
The condition 2M > m > 0 guarantees that there are no nontrivial resonances.
The proof is based on Fourier-localised Strichartz estimates. Further, we exploit the
null-structure of this system, which is akin to the one described above for the cubic
Dirac equation, in conjunction with a careful analysis of the set of resonances.
In joint work with Candy [19], we work in scale-invariant subspaces of the critical
space, where we impose additional angular regularity via the vector fields
ij D xi @j xj @i :
In [19] we prove the following.
7
Theorem 7.3.3. Let 2M > m > 0 and > 0, or m > 2M > 0 and > 30 . Then,
the system (7.3.4) is globally well-posed, provided that the initial data satisfy
hi . ; ; /
1 1 1;
0 0 1 2 2L H 2 H
Here, we allow for resonances in the regime m > 2M > 0. These lead to slow
oscillations in the Duhamel integral, hence to weaker decay. However, we observe
that in resonant interactions the characteristic surfaces intersect transversally. The
key idea now is to use a novel version [19] of the bilinear Fourier restriction theory
[57], which allows to prove that for any 32 < p 6 2,
p4 2
kU1 U2 kLp .R1C3 / . k1 kL2 .R3 / k2 kL2 .R3 /
˛
provided that supp c b b
j fjj g, †.supp 1 ; supp 2 / ˛. For certain p, such es-
timates also follow from Hölder’s inequality and standard Strichartz estimates (only
relying on curvature), but the above goes beyond this in the sense that one can choose
smaller p. In fact, in [19] we prove such estimates for more general surfaces un-
der appropriate transversality, curvature and regularity assumptions. Furthermore,
we prove that these estimates extend to V 2 -perturbations of free solutions, which is
important for an application to nonlinear systems.
The problem (7.3.4) is also of interest in other spatial dimension. In particular, in
dimension d D 1, its local well-posedness theory is quite well-understood. Recently,
Selberg–Tesfahun [51] proved that solutions are real analytic. One of the most chal-
lenging open problems seems to be the longtime behaviour of solutions in dimension
d D 2, as the decay of free solutions marks the border of short-range and long-range
scattering.
Finally, let us mention a couple of related results obtained in our group. A more
complex system is the Yang–Mills equation. Here, one needs to fix a gauge. With re-
fined Fourier-analytic methods Tesfahun [59] recently proved a local well-posedness
result in the Lorenz gauge below the energy regularity.
Simpler models are semi-relativistic Hartree equations
p
. i @t C m2 /u D .V juj2 /u
with Coulomb or Yukawa potential V .x/ D e jxj jxj 1 and mass m > 0, where
denotes spatial convolution. Formally, they can be obtained by ignoring the vec-
tor structure in (7.3.4) and the second order time derivatives in the Klein–Gordon
equation.
With Lenzmann [30], we obtain sharp results on local well-posedness for radial
and for non-radial initial data in dimension d D 3. With Tesfahun [37] we extended
this to small data global well-posedness and scattering for the Yukawa potential ( >
0), while it has been shown by Pusateri [48] that there is modified scattering for the
Coulomb potential D 0. We would like to refer to [30, 37] and the references
therein for a more complete account on this equation.
7.3.2 Waves in plasmas In this subsection, we will review recent results on the
Zakharov system
i @t u C u D nu
(7.3.6)
@2t n n D juj2
Nonlinear Dispersive equations at critical regularity 179
References
[1] V.M. Babič, Eigenfunctions which are concentrated in the neighborhood of a closed geodesic.
Zap. Naučn. Sem. Leningrad. Otdel. Mat. Inst. Steklov (LOMI) 9 (1968), 15–63.
[2] I. Bejenaru, Z. Guo, S. Herr and K. Nakanishi, Well-posedness and scattering for the Zakharov
system in four dimensions. Anal. PDE 8 (2015), 2029–2055.
180 S. Herr
[3] I. Bejenaru and S. Herr, Convolutions of singular measures and applications to the Zakharov
system. J. Funct. Anal. 261 (2011), 478–506.
[4] I. Bejenaru and S. Herr, The cubic Dirac equation: small initial data in H 1 .R3 /. Commun.
Math. Phys. 335 (2015), 43–82.
1
[5] I. Bejenaru and S. Herr, The cubic Dirac equation: small initial data in H 2 .R2 /. Commun.
Math. Phys. 343 (2016), 515–562.
[6] I. Bejenaru and S. Herr, On global well-posedness and scattering for the massive Dirac–Klein–
Gordon system. J. Eur. Math. Soc., 19 (2017), 2445–2467.
[7] I. Bejenaru, S. Herr, J. Holmer and D. Tataru, On the 2D Zakharov system with L2 -
Schrödinger data. Nonlinearity 22 (2009), 1063–1089.
[8] I. Bejenaru, S. Herr and D. Tataru, A convolution estimate for two-dimensional hypersurfaces.
Rev. Mat. Iberoam. 26 (2010), 707–728.
[9] I. Bejenaru, A. Ionescu, C. Kenig and D. Tataru, Global Schrödinger maps in dimensions
d > 2: small data in the critical Sobolev spaces. Ann. of Math. 173(2) (2011), 1443–1506.
[10] J. Bourgain, On ƒ.p/-subsets of squares. Israel J. Math. 67 (1989), 291–311.
[11] J. Bourgain, Fourier transform restriction phenomena for certain lattice subsets and applica-
tions to nonlinear evolution equations. I. Schrödinger equations, Geom. Funct. Anal. 3 (1993),
107–156.
[12] J. Bourgain and C. Demeter, The proof of the l 2 decoupling conjecture. Ann. of Math. 182(2)
(2015), 351–389.
[13] N. Bournaveas and T. Candy, Global well-posedness for the massless cubic Dirac equation.
Int. Math. Res. Not. (IMRN) (2016), 6735–6828.
[14] N. Burq, P. Gérard and N. Tzvetkov, Strichartz inequalities and the nonlinear Schrödinger
equation on compact manifolds, Amer. J. Math. 126 (2004), 569–605.
[15] N. Burq, P. Gérard and N. Tzvetkov, Multilinear eigenfunction estimates and global existence
for the three dimensional nonlinear Schrödinger equations. Ann. Sci. École Norm. Sup. 38(4)
(2005), 255–301.
[16] N. Burq, P. Gérard and N. Tzvetkov, Bilinear eigenfunction estimates and the nonlinear
Schrödinger equation on surfaces. Invent. Math. 159 (2005), 187–223.
[17] N. Burq, P. Gérard and N. Tzvetkov, Global solutions for the nonlinear Schrödinger equation
on three-dimensional compact manifolds. In Mathematical Aspects of Nonlinear Dispersive
Equations (J. v. Bourgain, C.E. Kenig, and S. Klainerman, eds.), Ann. of Math. Stud. 163,
Princeton Univ. Press, Princeton, NJ, 2007, 111–129.
[18] T. Candy, Global existence for an L2 critical nonlinear Dirac equation in one dimension. Adv.
Diff. Equations 16 (2011), 643–666.
[19] T. Candy and S. Herr, Transference of bilinear restriction estimates to quadratic variation
norms and the Dirac–Klein–Gordon system. Anal. PDE 11 (2018), 1171–1240.
[20] T. Candy and H. Lindblad, Long range scattering for cubic Dirac equation on R1C1 . Differ.
Integral Equ., 31 (2018), 507–518.
[21] T. Cazenave, Semilinear Schrödinger Equations. Courant Lecture Notes in Mathematics 10.
AMS, Providence, RI, 2003.
[22] Y. Colin de Verdière, Sur le spectre des opérateurs elliptiques a bicaractéristiques toutes péri-
odiques. Comment. Math. Helv. 54 (1979), 508–522.
Nonlinear Dispersive equations at critical regularity 181
[23] J. Colliander, M. Keel, G. Staffilani, H. Takaoka and T. Tao, Global well-posedness and scat-
tering for the energy-critical nonlinear Schrödinger equation in R3 . Ann. of Math. 167(2)
(2008), 767–865.
[24] J. Colliander, M. Keel, G. Staffilani, H. Takaoka and T. Tao, Transfer of energy to high fre-
quencies in the cubic defocusing nonlinear Schrödinger equation. Invent. Math. 181 (2010),
39–113.
[25] J. Duistermaat and V. Guillemin, The spectrum of positive elliptic operators and periodic
bicharacteristics. Invent. Math. 29 (1975), 39–79.
[26] M. Escobedo and L. Vega, A semilinear Dirac equation in H s .R3 / for s > 1. SIAM J. Math.
Anal. 28 (1997), 338–362.
[27] A. Grünrock and S. Herr, The Fourier restriction norm method for the Zakharov–Kuznetsov
equation. Discr. Cont. Dynam. Syst. 34 (2014), 2061–2068.
[28] M. Hadac, S. Herr and H. Koch, Well-posedness and scattering for the KP-II equation in a
critical space. Ann. Inst. Henri Poincaré (C) Anal. Non Linéaire 26 (2009), 917–941. Erratum
ibid. 27 (2010), 971–972.
[29] S. Herr, The quintic nonlinear Schrödinger equation on three-dimensional Zoll manifolds.
Amer. J. Math. 135 (2013), 1271–1290.
[30] S. Herr and E. Lenzmann, The Boson star equation with initial data of low regularity. Nonlin-
ear Anal. 97 (2014), 125–137.
[31] S. Herr and J.L. Marzuola, On discrete rarefaction waves in an NLS toy model for weak
turbulence. Indiana Univ. Math. J. 65 (2016), 753–777.
[32] S. Herr and K. Schratz, Trigonometric time integrators for the Zakharov system. IMA J. Nu-
mer. Anal. 37 (2017), 2042–2066.
[33] S. Herr and V. Sohinger, The Gross–Pitaevskii hierarchy on general rectangular tori. Arch.
Ration. Mech. Anal. 220 (2016), 1119–1158.
[34] S. Herr and N. Strunk, The energy-critical nonlinear Schrödinger equation on a product of
spheres, Math. Res. Lett. 22 (2015), 741–761.
[35] S. Herr, D. Tataru and N. Tzvetkov, Global well-posedness of the energy-critical nonlinear
Schrödinger equation with small initial data in H 1 .T3 /. Duke Math. J. 159 (2011), 329–349.
[36] S. Herr, D. Tataru and N. Tzvetkov, Strichartz estimates for partially periodic solutions to
Schrödinger equations in 4d and applications. J. Reine Angew. Math. (Crelle) 690 (2014),
65–78.
[37] S. Herr and A. Tesfahun, Small data scattering for semi-relativistic equations with Hartree
type nonlinearity. J. Diff. Eqs. 259 (2015), 5510–5532.
[38] A.D. Ionescu and B. Pausader, The energy-critical defocusing NLS on T3 , Duke Math. J. 161
(2012), 1581–1612.
[39] C.E. Kenig, Recent developments on the global behavior to critical nonlinear dispersive equa-
tions. In Proceedings of the International Congress of Mathematicians. Volume I, Hindustan
Book Agency, New Delhi, 2010, 326–338.
[40] C.E. Kenig, Critical non-linear dispersive equations: global existence, scattering, blow-up and
universal profiles. Jpn. J. Math. 6 (2011), 121–141.
[41] R. Killip and M. Visan, Scale invariant Strichartz estimates on tori and applications. Math.
Res. Lett. 23 (2016), 445–472.
182 S. Herr
[42] K. Kirkpatrick, B. Schlein and G. Staffilani, Derivation of the cubic nonlinear Schrödinger
equation from quantum dynamics of many-body systems: the periodic case. Amer. J. Math.
133 (2011), 91–130.
[43] S. Machihara, K. Nakanishi and T. Ozawa, Small global solutions and the nonrelativistic limit
for the nonlinear Dirac equation. Rev. Mat. Iberoam. 19 (2003), 179–194.
[44] S. Machihara, K. Nakanishi and T. Ozawa, Endpoint Strichartz estimates and global solutions
for the nonlinear Dirac equation. J. Funct. Anal. 219 (2005), 1–20.
[45] L. Molinet and D. Pilod, Bilinear Strichartz estimates for the Zakharov–Kuznetsov equation
and applications. Ann. Inst. H. Poincaré Anal. Non Linéaire 32 (2015), 347–371.
[46] B. Pausader, N. Tzvetkov and X. Wang, Global regularity for the energy-critical NLS on S3 .
Ann. Inst. H. Poincaré Anal. Non Linéaire 31 (2014), 315–338.
[47] H. Pecher, Local well-posedness for the nonlinear Dirac equation in two space dimensions.
Commun. Pure Appl. Anal. 13 (2014), 673–685; corrigendum ibid. 14 (2015), 737–742.
[48] F. Pusateri, Modified scattering for the boson star equation. Commun. Math. Phys. 332 (2014),
1203–1234.
[49] J.V. Ralston, On the construction of quasimodes associated with stable periodic orbits. Com-
mun. Math. Phys. 51 (1976), 219–242.
[50] J.V. Ralston, Approximate eigenfunctions of the Laplacian. J. Diff. Geom. 12 (1977), 87–100.
[51] S. Selberg and A. Tesfahun, On the radius of spatial analyticity for the 1D Dirac–Klein–
Gordon equations. J. Diff. Eqs. 259 (2015), 4732–4744.
[52] M. Soler, Classical, Stable, Nonlinear Spinor Field with Positive Rest Energy. Phys. Rev. D 1
(1970), 2766–2769.
[53] E.M. Stein, Harmonic Analysis: Real-variable Methods, Orthogonality, and Oscillatory Inte-
grals. With the assistance of Timothy S. Murphy. Princeton University Press, Princeton, NJ,
1993.
[54] N. Strunk, Strichartz estimates for Schrödinger equations on irrational tori in two and three
dimensions. J. Evol. Eq. 14 (2014), 829–839.
[55] N. Strunk, Critical Well-posedness Results for Nonlinear Schrödinger Equations on Compact
Manifolds. Dissertation, Bielefeld University, 2015.
[56] N. Strunk, Global Well-posedness of the Energy-Critical Defocusing NLS on rectangular tori
in three dimensions. Differential Integral Eqs. 28 (2015), 1069–1084.
[57] T. Tao, Some recent progress on the restriction conjecture, in: Fourier Analysis and Convexity.
Appl. Numer. Harmon. Anal., (L. Brandolini, L. Colzani, A. Iosevich and G. Travaglini, eds.),
Birkhäuser, MA, 2004, 217–243.
[58] D. Tataru, On global existence and scattering for the wave maps equation. Amer. J. Math. 123
(2001), 37–77.
[59] A. Tesfahun, Local well-posedness of Yang–Mills equations in Lorenz gauge below the energy
norm. Nonlinear Differ. Equ. Appl. 22 (2015), 849–875.
[60] L. Thomann, The WKB method and geometric instability for nonlinear Schrödinger equations
on surfaces. Bull. Soc. Math. France 136 (2008), 167–193.
[61] A. Zygmund, On Fourier coefficients and transforms of functions of two variables. Stud. Math.
50 (1974), 189–201.
Chapter 8
Variational solutions to nonlocal problems
M. Kaßmann
8.1 Introduction
An operator L acting on functions from Rd to R is called local if, for every function
u in the domain of L, the support of Lu is contained in the support of u. Examples
of local operators are u 7! jruj and u 7! u. An operator L is called nonlocal if
it is not local. Convolution operators provide simple examples of nonlocal operators.
Here, we will study a small class of nonlocal operators only. We require them to be
linear and unbounded on L2 .Rd /. Furthermore, we assume the operators L under
consideration to satisfy the global maximum principle, i.e., we assume Lu.x/ 6 0
to hold, whenever the function u has a global maximum in x 2 Rd . A prominent
example of such operators is the fractional Laplace operator . /˛=2 for 0 < ˛ < 2.
For functions u 2 Cc1 .Rd /, it can be defined via
4
. /˛=2 u./ D jj˛b
u./ . 2 Rd /: (8.1.1)
For our purposes, it is very convenient that there is a different representation of
. /˛=2 as an integrodifferential operator. For u 2 Cb2 .Rd / and x 2 Rd , one has
Z
˛=2 u.x C h/ u.x/
. / u.x/ D C˛;d lim dh
"!0 Rd nB" jhjd C˛
Z
u.y/ u.x/
D C˛;d p: v: dy
Rd jy xjd C˛
C˛;d Z u.x C h/ 2u.x/ C u.x h/
D dh :
2 Rd jhjd C˛
1 Projects A8, A10
184 M. Kaßmann
The constant C˛;d can be chosen such that (8.1.1) holds true. It turns out that C˛;d
˛.2 ˛/ for ˛ 2 .0; 2/. This representation formula has been known for a long time;
[34].
We mainly will be concerned with nonlocal operators of the form
Z
Lu.x/ D p: v: .u.y/ u.x//k.x; y/ dy .x 2 Rd /; (8.1.2)
Rd
Note that Œu; uH ˛=2 is the seminorm in the Sobolev–Slobodeckij space H ˛=2 .Rd /
with differentiability order ˛2 . This bilinear form is a very special instance of forms
introduced in (3.2.1) of Chapter 3. We will not Rinvestigate the limit ˛ % 2, but we
remark that Œu; uH ˛=2 converges to Œu; uH 1 D Rd jruj2 if u is sufficiently regular.
The analogous result holds for subsets of Rd as shown in [5], [29] and [12].
The nonlocal operators considered here have a strong connection to Markov jump
processes. We do not dwell on this connection in this chapter but note that Markov
jump processes of the same type are central objects in Chapter 3 and Chapter 4.
In short, the nonlocal operators arise as generators on L2 .Rd / or Cb .Rd / of the
semigroups that are generated by the stochastic process. In the translation invariant
case, e.g., in the case of (8.1.2) with k.x; y/ D e k.x y/ for some function e k, the
corresponding jump processes are Lévy processes [39]. An important subclass is
provided by ˛-stable processes, which—in the simplest case—are generated by the
fractional Laplace operator as defined above. Basic definitions and contributions to
the potential theory can be found in [3]; see also the references in [4].
Beside Markov jump processes, there are many topics which would fit well into
the framework of this report but are not covered, e.g., stationary linear nonlocal sym-
metric operators in variational form. We do not mention questions related to minimal
Variational solutions to nonlocal problems 185
surfaces and the fractional perimeter. Moreover, we do not study semilinear equa-
tions, which have attracted attention recently; see the references in [40, 42, 33, 16,
32]. Last, let us mention that nonlocal variational problems arise also in the study of
models in peridynamics; see [30, 13, 31].
is finite. The corresponding bilinear form is denoted by Œ; . The constant ˛.2 ˛/
in (8.2.1) is important only when considering the limiting behaviour as ˛ & 0 or
˛ % 2. When considering domains Rd , the following subspaces are of interest.
(i) The function space H˛=2 .Rd / is defined as the subspace of all v 2 H ˛=2 .Rd /
such that vjRd n D 0.
(ii) The linear space V ˛=2 .jRd / consists of all v 2 L2 .Rd ; dm/ such that
Z Z
.v.y/ v.x//2
dy dx < 1:
Rd jx yjd C˛
The space has been used in several papers and has systematically been intro-
duced in [18], see the references in [18], [6] and in [1]. It is not hard to see that
H˛=2 .Rd / and V ˛=2 .jRd / are separable Hilbert spaces. If a function u belongs to
V ˛=2 .jRd /, then the bilinear form
Z Z
.u.y/ u.x//.v.y/ v.x//
.u; v/ 7! dy dx
Rd R d jx yjd C˛
is finite for every test function v 2 H˛=2 .Rd /. In a certain sense, the space H˛=2 .Rd /
is a nonlocal resp. fractional analogue of H01 ./, whereas R V
˛=2
.jRd / corresponds
1
to H ./ when considering classical local energy forms jruj2 . A particular prop-
erty of V ˛=2 .jRd / is that it encodes regularity across the boundary @.
186 M. Kaßmann
Note that the condition v 2 H˛=2 .Rd / carries some regularity information about
the behaviour of u across the boundary, too. The finiteness of the term
Z Z
v 2 .x/ jx yj d ˛ dy dx
Rd n
This resembles the classical Hardy inequality for functions v 2 H01 ./.
We are now in the position to formulate the concept of weak solutions for nonlocal
operators. Assume that k W Rd Rd n diag ! Œ0; 1/ is a measurable function that
satisfies k.x; y/ D k.y; x/ for all x; y 2 Rd together with
.2 ˛/ƒ 1 .2 ˛/ƒ
6 k.x; y/ 6 .x; y 2 Rd / (8.2.2)
jx yj d C˛ jx yjd C˛
for some constant ƒ > 1. We define an operator u 7! Lu by
Z
Lu.x/ D p: v: .u.y/ u.x//k.x; y/ dy
d
ZR (8.2.3)
D lim .u.y/ u.x//k.x; y/ dy
"!0 Rd nB .x/
"
for all functions u W Rd ! R for which the principal value integral exists. In the case
k.x; y/ D jx yj d ˛ , the integral exists for every x 2 Rd if u 2 Cb2 .Rd /. In the
same informal way, we define the carré du champ operator and the corresponding
bilinear form E as follows,
1
.u; v/ D .L.uv/ uLv v Lu/
2Z
1
D .u.y/ u.//.v.y/ v.//k.; y/ dy;
2 Rd
Z
E .u; v/ D .u; v/.x/ dx:
Rd
Note that symmetry of k and (8.2.2) do not imply that the expression Lu.x/, as
defined above, exists if u is smooth, say u 2 Cc1 .Rd /.
Variational solutions to nonlocal problems 187
Definition 8.2.3. Assume that Rd is open. Assume f belongs to the dual space
˛=2 ˛=2
H .Rd / . We say that a function u 2 H .Rd / is a weak solution to
Lu D f in
if
Lu D f in
(8.2.4)
u D 0 on Rd n
is called Dirichlet problem with complement data zero. The definition above pro-
vides the notion of weak solutions for this problem. On the one hand, the concept of
weak solutions is important because it allows to study the equation Lu D f even if
Lu.x/ does not exist. On the other hand, weak solutions naturally appear when study-
ing minimisers of variational functionals. Good early sources on the Dirichlet prob-
lem for nonlocal operators in domains include [21, 23]. Define I ˛=2 W H˛=2 .Rd / !
R by
Z Z
1
˛=2
I .v/ D .v.y/ v.x//2 k.x; y/ dy dx .f; v/ :
4 Rd Rd
Proposition 8.2.4.
(i) Every minimiser of I ˛=2 satisfies (8.2.4) in the weak sense.
(ii) If is bounded, then there is a unique minimiser of I ˛=2 .
The proof of this result is standard. For the proof of (i), one considers the map
t 7! I ˛=2 .u C t'/ for a minimiser u and some ' 2 H˛=2 .Rd /. When rewriting
d ˛=2
I .u C t'/jt D0 D 0;
dt
one obtains E .u; '/ D .f; '/, which is a weak formulation of (8.2.4). The proof of (ii)
makes use of the fact that the bilinear form E is coercive on H˛=2 .Rd /. This property
˛=2
itself follows from a nonlocal Poincaré–Friedrichs inequality. For v 2 H .Rd /,
one has
Z Z Z Z
2 2
E .v; v/ D .v.y/ v.x/// k.x; y/ dy dx C 2 v .x/ k.x; y/dy dx
Rd n
Z
> c1 v 2 .x/ dx;
188 M. Kaßmann
where c1 > 0 is a constant, which depends on and on ˛ 2 .0; 2/. One can modify
the proof such that c1 is independent of ˛ for ˛ > ˛0 > 0, but that is a bit tricky.
Let us formulate the weak solution concept for the inhomogeneous Dirichlet prob-
lem, i.e., with prescribed data on the complement of the domain Rd under
consideration.
Lu D f in ;
u D g on Rd n
˛=2
if u g 2 H .Rd / and
We refer the reader to [18] and to [15] for the setup of the Dirichlet problem and the
study of related nonlocal extension and trace theorems. The issue of trace theorems
is a bit special because, in the nonlocal context, “traces” are not supported on lower
dimensional subsets. The “trace space” corresponding to the space V ˛=2 .jRd / is
the space of all functions v 2 L2 .Rd n ; dm/ such that the quantity
Z Z
.v.x/ v.y//2
dx dy
c c .jx yj C ıx C ıy /d C˛
@t u Lu D f;
can be dealt with in a similar way. In this report, however, we concentrate on station-
ary equations. Hölder regularity results for variational solutions to parabolic nonlocal
problems have been established in [7] and [17], where the first article also treats non-
linear problems and the second one provides a robust result, which is truly local. For
corresponding results for fully nonlinear equations and viscosity solutions, see [9]
and the references in [9] and [28].
Variational solutions to nonlocal problems 189
where k was assumed to be symmetric and to satisfy (8.2.2). We have chosen this
framework because it is quite simple but still wide enough for several interesting
phenomena to occur. In this section, we comment on possible modifications of this
setup.
Condition (8.2.2) can be understood as imposing “bounded uniformly elliptic co-
efficients”. For given ˛ 2 .0; 2/, the condition implies
i.e., the ratio of the two quantities is bounded from above and below by some positive
constants independent of u. The operator L as given by (8.3.1) is special because,
for fixed x, the integral is taken with respect to a measure k.x; y/dy, which has a
density with respect to Lebesgue measure. There are several applications of nonlocal
operators, where one studies more general measures. Thus, it is desirable to consider
measures .x; dy/ instead of k.x; y/dy. We consider only the symmetric case. Re-
call that a family of measures .x; dy/ is called symmetric if for all measurable sets
A; B Rd
Z Z Z Z
.x; dy/ dx D .x; dy/ dx:
A B B A
Let us formulate a stronger condition than (8.3.2) that takes into account all spatial
scales. Given .x; dy/, a set M Rd and a function v 2 L2 .M /, we set
Z Z
EM .v; v/ D .v.y/ v.x//2 .x; dy/dx :
M M
The following condition turns out to be important: There is a constant A > 1 such
that for every ball B Rd and every v 2 L2 .B/
Z Z
.v.y/ v.x//2
A 1 EB .v; v/ 6 dy dx 6 A EB .v; v/ : (8.3.3)
B B jy xj d C˛
Property (8.3.3) ensures several important regularity results for solutions to nonlo-
cal equations such as Hölder regularity estimates. In this section, we will study the
question whether and how (8.2.2) can be relaxed without losing the property (8.3.3).
Since the quantities of (8.3.3) are defined with the help of double integrals, there is
ample space for generalisations. Here is a first example.
190 M. Kaßmann
ˇ P
Example 8.3.1. Assume V D fx 2 Rd ˇ jxd j2 > idD11 jxi j2 g. Let k be symmetric
with
ƒ 1 .2 ˛/ ƒ.2 ˛/
1 .x y/ 6 k.x; y/ 6 .x; y 2 Rd a:e:/
jx yjd C˛ V jx yjd C˛
for some ƒ > 1; ˛ 2 .0; 2/. Set .x; dy/ D k.x; y/dy. Then, (8.3.3) holds true.
One can prove the assertion in the example as follows. For every pair .x; y/ for which
k.x; y/ equals zero, there is a point z 2 Rd with jz xj jz yj such that both
k.x; z/ and k.z; y/ are comparable with jx yj d ˛ . Thus, one can prove (8.3.3) in
this case using a simple chaining argument and the Fubini theorem. It is possible to
relax the bound in (8.2.2) significantly further, still keeping (8.3.3). For example, the
cone V could be assumed to depend on the respective pair .x; y/. Then, the problem
becomes related to the study of the Boltzmann equation; compare with [22]. The
connection of kinetic equations and regularity questions for nonlocal operators seems
to be very promising.
Many interesting cases of operators
Z
L u.x/ D p: v: .u.y/ u.x// .x; dy/; (8.3.4)
Rd
d
R .x; dy/ .dy fxg/, where is a Borel measure on R n f0g such
satisfy
that min.1; jhj2 / .dh/ is finite. Here, the symbol denotes comparability after
integrating arbitrary non-negative measurable functions. Note that, for every operator
u 7! L u of the form
Z
L u.x/ D p: v: .u.x C h/ u.x// .dh/;
Rd
there is a corresponding Lévy jump process. The quantity .A/ describes the number
of jumps of this process from x into the set x C A within a unit time interval; see [39].
An interesting example is given by .dh/ D m.dh/ with
d
X Y
1 ˛
m.dh/ D jhi j dhi ıf0g .dhj / : (8.3.5)
i D1 j ¤i
Note that m only charges those sets which have an intersection with one of the axes
in Rd . If L is as in (8.3.4) with .x; dy/ D m.dy fxg/ and m as in (8.3.5),
the corresponding energy form E again can be shown to satisfy the comparability
condition (8.3.3), cf. Theorem 8.3.2. The weaker condition (8.3.2) can be verified
by looking at the Fourier transform. For the above choice of , for functions u 2
Cc1 .Rd / and 2 Rd ,
X
d
b
.Lm u/./ D c˛ ji j˛ b
u./ (8.3.6)
i D1
Variational solutions to nonlocal problems 191
P
for some appropriate constant c˛ > 0. Since the multiplier diD1 ji j˛ is compara-
ble with jj˛ , the comparability result (8.3.2) for the quadratic forms follows. The
approach by the Fourier transform is not helpful if one wants to verify (8.3.3). Let
us mention that the general
R case .x; dy/ .dy fxg/ with a Borel measure on
Rd n f0g satisfying min.1; jhj2 / .dh/ < 1 is challenging and several interesting
questions are open until now.
We have seen two examples of families ..x; dy//x2Rd that are not absolutely
continuous with a density satisfying (8.2.2), but they still lead to (8.3.3). It is a chal-
lenging task to find a natural condition on , which is as general as possible, still
allowing for (8.3.3). Let us explain a recent result in this direction. We recall that a
measure on Rd is called an ˛-stable measure for ˛ 2 .0; 2/, if
Z Z 1
.E/ D ˛.2 ˛/ 1E .r /r 1 ˛ dr .d /
Sd 1 0
for every Borel set E Rd , where is some finite measure on S d 1 ; see [39]
for details. The measure is called nondegenerate if the span of supp ./ equals the
whole space Rd . The following theorem provides satisfactory sufficient conditions
for (8.3.3). It is a special case of one of the main results in [14].
holds true for every x 2 Rd and every measurable non-negative function f . Then
(8.3.3) holds true.
is the operator u 7! @x@ .aij ./ @x@ u/, where the functions aij W Rd ! R are bounded
i j
and the matrix .aij / is uniformly positive definite. For purposes of regularity theory, a
weaker statement than (8.4.1) is often sufficient. The weak Harnack inequality asserts
that supersolutions u to an elliptic equation in satisfy
Z 1=p0
p0
inf00 u > c0 juj ; (8.4.2)
0
where p0 and c0 are positive constants and 00 is another domain whose closure
is contained in 0 . An important consequence of the Harnack inequality resp. its
weaker version (8.4.2) are regularity estimates for u in Hölder spaces, cf. [19].
Inequalities (8.4.1) and (8.4.2) cannot be expected to hold for solutions to nonlocal
equations without further changes. Let u W Rd ! R satisfy . /˛=2 u D 0 in
B2 D B2 .0/ Rd for some ˛ 2 .0; 2/. The additional assumption u > 0 in B2
is not sufficient for (8.4.1) or (8.4.2) to hold, because negative values of u in regions
outside of B2 influence the values of u inside of B2 . It is not complicated to construct
an example of a bounded function u 2 C.Rd / that solves . /˛=2 u D 0 in B2 , is
positive in B2 n f0g, and satisfies u.0/ D 0, cf. Theorem 3.3.1 in [6] or [26].
Let us formulate some recent results that make clear how one can deal with the in-
fluence of nonlocal terms. In the case of the fractional Laplace operator, the Harnack
inequality can be formulated as follows:
Proposition 8.4.1. There is a constant c > 1 such that, for 0 < ˛ 6 2 and u 2
C.Rd / with . /˛=2 u D 0 in B2 and u > 0 in B2 , the following inequality holds,
Z
u .z/
u.x/ 6 c u.y/ C ˛.2 ˛/ dz .x; y 2 B1 /: (8.4.3)
Rd jzj
d C˛
Here, u D min.u; 0/. The proof of this result is straightforward and uses the
representation of the Poisson kernel for . /˛=2 ; see [14]. Note that the influence of
the nonlocal term can nicely be seen in (8.4.3). This influence vanishes for ˛ ! 2 ,
as it should.
Next, let us look at the general case, i.e., we define L as in (8.2.3) and assume
(8.2.2) to hold. If one aims at a more general result, one can assume L to be as in
(8.3.4) with condition (8.3.3) in place. Given a function f 2 L2 ./ on a bounded
open subset Rd , we say that u 2 V ˛=2 .jRd / is a weak subsolution to Lu D
f in if, for every ' 2 H˛=2 .Rd / with ' > 0, one has
“
E .u; '/ WD .u.y/ u.x//.'.y/ '.x//k.x; y/ dy dx 6 .f; '/:
In this case, we say that Lu 6 f in holds in the weak sense. The notion of
supersolution and solution is defined accordingly. The nest result states that the weak
Harnack inequality holds true for functions u that satisfy Lu > f in the weak
sense. It is one of the main results of [14].
Variational solutions to nonlocal problems 193
Theorem 8.4.2. Assume 0 < ˛0 6 2 and ˛0 6 ˛ < 2. Let p0 > 0. Assume further
f 2 L2 .Rd / and u 2 V ˛=2 .B2 jRd /, u > 0 in B2 and E .u; '/ > .f; '/ for every
' 2 HB˛=2
2
.Rd /, ' > 0. Then,
Z 1=p0 Z
u .z/
p0
inf .u/ > c1 u.x/ dx c2 kf kL2 .B /
;
B1 B1 Rd nB2 jzjd C˛ 15
16
2
There are two important features of this result. The result is truly local in the sense
that Lu > f is required to hold only in a ball, not in the whole space. Moreover, the
constants c1 ; c2 do not depend on ˛. At least informally, one can consider the limit
˛ % 2 and recover the weak Harnack inequality for local differential operators of
second order with bounded measurable coefficients. The proof of the weak Harnack
estimate is very similar to the one in the classical case developed by Moser. One
uses test functions of the form ' D u pk k , where .pk / is a sequence of numbers
tending to 1 and .k / is a sequence of localising or cutoff functions with supp.k / &
B 1 . The critical case pk D 1 can be dealt with using a modification of the John–
2
Nirenberg lemma or using ideas of Bombieri–Giusti. Note that [10] establishes two
similar results: (a) the weak Harnack inequality for nonlocal operators of the form
(8.2.3) under the condition (8.2.2), and (b) a Harnack inequality with a tail term in
the same context. Both results are established in the more general nonlinear context
of p fractional Laplace operators. The proof in [10] is based on the method of De
Giorgi and Stampacchia rather than on the method of Moser. The main contribution
of [10] is the concise study of the nonlocal influence on the resulting inequalities such
as the Caccioppoli inequality.
The most important implication of Theorem 8.4.2 are regularity estimates in
Hölder spaces. Let us formulate this result in a simple case.
Theorem 8.4.3. Assume 0 < ˛0 6 ˛ < 2. Assume that u 2 V ˛=2 .B2 jRd / is a weak
solution to Lu D 0 in B2 . Then, the following Hölder estimate holds for almost every
x; y 2 B1 ,
where c > 1 and ˇ 2 .0; 1/ are constants which depend only on d; ˛0 and ƒ from
(8.2.2). In particular, they are independent of ˛.
Theorem 8.4.3 is proved in [25]. The nonlinear case of operators, which are
comparable to the p-Laplace operator, is covered in [11]. Note that, in the latter
article, the concept of solutions is slightly different and not all estimates are robust.
The main idea of the proof in both articles is to deduce the regularity estimate from the
weak Harnack inequality. Note that (8.4.4) is robust in the sense that the constants
stay bounded for ˛ % 2. Several extensions of this result have been established.
194 M. Kaßmann
Theorem 1.3 of [14] proves (8.4.4) for L as in (8.3.4) and .x; dy/ .dy fxg/
for some nondegenerate stable measure .
Let us briefly mention related results for nonlocal operators that are in non-diver-
gence form. Linear examples of such operators u 7! Au are provided by
Z
u.x C h/ 2u.x/ C u.x h/
Au.x/ D a.x; h/ dh;
Rd jhjd C˛
where a W Rd Rd ! .0; 1/ is bounded between two positive constants and satisfies
a.x; h/ D a.x; h/ for all x; h. Regularity conditions on a are not imposed. Note
that, unlike L as in (8.2.3), the operator A is not symmetric in L2 .Rd / in general.
The condition a.x; h/ D a.x C h; h/ would imply that A can be represented as in
(8.2.3) with k.x; y/ D 21 a.x; y x/. However, assuming this condition in addition to
a.x; h/ D a.x; h/ is too restrictive. First regularity results for non-divergence form
operators in the spirit of Theorem 8.4.3 are proved in [43]. The proof is rather short
and uses comparison functions. A robust version of this result for a large class of lin-
ear and nonlinear operators in non-divergence form is established in [8]. Anisotropic
cases have been studied in [27]. The reader is referred to [41] for ultimate results and
references in this direction.
References
[1] N. Abatangelo and E. Valdinoci, Getting acquainted with the fractional Laplacian, preprint,
arXiv:1710.11567.
[2] K. Bogdan, The boundary Harnack principle for the fractional Laplacian, Studia Math. 123
(1997), 43–80.
[3] K. Bogdan and T. Byczkowski, Potential theory for the ˛-stable Schrödinger operator on
bounded Lipschitz domains, Studia Math. 133 (1999), 53–92.
[4] K. Bogdan, T. Byczkowski, T. Kulczycki, M. Ryznar, R. Song and Z. Vondraček, Potential
Analysis of Stable Processes and its Extensions, (P. Graczyk and A. Stos, eds.), LNM 1980
Springer, Berlin, 2009, 194.
[5] J. Bourgain, H. Brezis and P. Mironescu, Limiting embedding theorems for W s;p when s " 1
and applications, J. Anal. Math. 87 (2002), 77–101.
[6] C. Bucur and E. Valdinoci, Nonlocal diffusion and applications. In Lecture Notes of the
Unione Matematica Italiana 20, Springer, Cham, 2016.
[7] L. Caffarelli, C.H. Chan and A. Vasseur, Regularity theory for parabolic nonlinear integral
operators, J. Amer. Math. Soc. 24 (2011), 849–869.
[8] L. Caffarelli and L. Silvestre, Regularity theory for fully nonlinear integro-differential equa-
tions, Commun. Pure Appl. Math. 62 (2009), 597–638.
[9] H.A. Chang-Lara and D. Kriventsov, Further time regularity for nonlocal, fully nonlinear
parabolic equations, Commun. Pure Appl. Math. 70 (2017), 950–977.
[10] A. Di Castro, T. Kuusi and G. Palatucci, Nonlocal Harnack inequalities, J. Funct. Anal. 267
(2014), 1807–1836.
Variational solutions to nonlocal problems 195
[11] A. Di Castro, T. Kuusi and G. Palatucci, Local behavior of fractional p-minimizers, Ann. Inst.
H. Poincaré Anal. Non Linéaire 33 (2016), 1279–1299.
[12] J. Dávila, On an open question about functions of bounded variation, Calc. Var. Part. Diff.
Equations 15 (2002), 519–527.
[13] Q. Du, M. Gunzburger, R.B. Lehoucq and K. Zhou, Analysis and approximation of nonlocal
diffusion problems with volume constraints. SIAM Rev. 54 (2012), 667–696.
[14] B. Dyda and M. Kassmann, Regularity estimates for elliptic nonlocal operators, Analysis of
PDEs, to appear, arXiv:1509.08320.
[15] B. Dyda and M. Kassmann, Function spaces and extension results for nonlocal Dirichlet prob-
lems, J. Funct. Anal. (2018), in press.
[16] M.M. Fall and T. Weth, Monotonicity and nonexistence results for some fractional elliptic
problems in the half-space, Commun. Contemp. Math. 18 (2016), 1550012 (25 pp.).
[17] M. Felsinger and M. Kassmann, Local regularity for parabolic nonlocal operators, Commun.
Partial Diff. Eq. 38 (2013), 1539–1573.
[18] M. Felsinger, M. Kassmann and P. Voigt, The Dirichlet problem for nonlocal operators, Math.
Z. 279 (2015), 779–809.
[19] D. Gilbarg and N.S. Trudinger, Elliptic Partial Differential Equations of Second Order,
Springer, Berlin, 2001.
[20] G. Grubb, Fractional Laplacians on domains, a development of Hörmander’s theory of -
transmission pseudodifferential operators, Adv. Math. 268 (2015), 478–528.
[21] W. Hoh and N. Jacob, On the Dirichlet problem for pseudodifferential operators generating
Feller semigroups, J. Funct. Anal. 137 (1996), 19–48.
[22] C. Imbert and L. Silvestre, The weak Harnack inequality for the Boltzmann equation without
cut-off, to appar in J. Europ. Math. Soc., arXiv:1608.07571
[23] N. Jacob, Various approaches to the Dirichlet problem for non-local operators satisfying the
positive maximum principle, in: Potential theory—ICPT 94 (Kouty, 1994), (J. Král, J. Lukeš,
I. Netuka and J. Vselý, eds.), de Gruyter, Berlin (1996), pp. 367–375.
[24] N. Jacob and R.L. Schilling, Lévy-type processes and pseudodifferential operators. In Lévy
Processes (O.E. Barndorff-Nielsen, S.I. Resnick, and T. Mikosch eds.), Birkhäuser Boston,
MA, 2001, 139–168.
[25] M. Kassmann, A priori estimates for integro-differential operators with measurable kernels,
Calc. Var. Partial Diff. Eq. 34 (2009), 1–21.
[26] M. Kassmann, The classical Harnack inequality fails for nonlocal operators Sonder-
forschungsbereich 611 preprint no. 360, University of Bonn, 2007.
[27] M. Kassmann, M. Rang and R.W. Schwab, Integro-differential equations with nonlinear di-
rectional dependence, Indiana Univ. Math. J. 63 (2014), 1467–1498.
[28] M. Kassmann and R.W. Schwab, Regularity results for nonlocal parabolic equations, Riv.
Math. Univ. Parma (N.S.) 5 (2014), 183–212.
[29] V. Maz’ya and T. Shaposhnikova, On the Bourgain, Brezis, and Mironescu theorem concern-
ing limiting embeddings of fractional Sobolev spaces, J. Funct. Anal. 195 (2002), 230–238.
[30] T. Mengesha and Q. Du, The bond-based peridynamic system with Dirichlet-type volume
constraint, Proc. Roy. Soc. Edinburgh Sect. A 144 (2014), 161–186.
196 M. Kaßmann
[31] T. Mengesha and Q. Du, On the variational limit of a class of nonlocal functionals related to
peridynamics, Nonlinearity 28 (2015), 3999–4035.
[32] G. Molica Bisci and J. Mawhin, A Brezis–Nirenberg type result for a nonlocal fractional
operator, J. London Math. Soc. 95 (2017), 73–93.
[33] G. Molica Bisci, V.D. Radulescu and R. Servadei, Variational Methods for nonlocal Frac-
tional Problems, Cambridge University Press, Cambridge, 2016.
[34] M. Riesz, Intégrales de Riemann–Liouville et potentiels, Acta Litt. Sci. Szeged 9 (1938), 1–42.
[35] X. Ros-Oton and J. Serra, The Dirichlet problem for the fractional Laplacian: regularity up to
the boundary, J. Math. Pures Appl. 101 (2014), 275–302.
[36] X. Ros-Oton and J. Serra, Boundary regularity for fully nonlinear integro-differential equa-
tions, Duke Math. J. 165 (2016), 2079–2154.
[37] X. Ros-Oton and J. Serra, Boundary regularity estimates for nonlocal elliptic equations in C 1
and C 1;˛ domains, Ann. Mat. Pura Appl. 196 (2017), 1637–1668.
[38] X. Ros-Oton and E. Valdinoci, Dirichlet problem for nonlocal operators with singular kernels:
convex and nonconvex domains, Adv. Math. 288 (2016), 732–790.
[39] K. Sato, Lévy Processes and Infinitely Divisible Distributions, Cambridge University Press,
Cambridge, 2013.
[40] O. Savin and E. Valdinoci, Density estimates for a nonlocal variational model via the Sobolev
inequality, SIAM J. Math. Anal. 43 (2011), 2675–2687.
[41] R.W. Schwab and L. Silvestre, Regularity for parabolic integro-differential equations with
very irregular kernels, Anal. PDE 9 (2016), 727–772.
[42] R. Servadei and E. Valdinoci, Mountain pass solutions for non-local elliptic operators, J. Math.
Anal. Appl. 389 (2012), 887–898.
[43] L. Silvestre, Hölder estimates for solutions of integro-differential equations like the fractional
Laplace, Indiana Univ. Math. J. 55 (2006), 1155–1174.
[44] R. Song and J.-M. Wu, Boundary Harnack principle for symmetric stable processes, J. Funct.
Anal. 168 (1999), 403–427.
Chapter 9
Spectral and arithmetic structures in aperiodic
order
M. Baake, F. Gähler, C. Huck and P. Zeiner
Systems with aperiodic order can display a variety of arithmetic, combinatorial and
spectral phenomena, some of which are reviewed and discussed here. At the same
time, the underlying compact tiling spaces can be compared via their topological and
spectral invariants. The latter are explicitly computable for substitution systems and
provide an important tool for their classification.1
9.1 Introduction
This contribution covers selected results from the theory of aperiodic order, with
special emphasis on spectral structures. Also covered are topological aspects of tiling
spaces with mixed spectrum and combinatorial problems around lattice enumeration
problems. A running theme will be that and how systems without any periodicity
extend the notions and results that are known from systems of classic crystallography,
including perfect (periodic) crystals.
The chapter is organised as follows. We begin, in Section 9.2, with the diffraction
spectra of weak model sets, under a rather natural condition on their density, which
extends the classic theory of regular model sets. Our focus is on systems with pure
point spectrum. Particular interest in such systems emerges from the connection with
dynamical systems of number-theoretic origin.
In Section 9.3, we go one step in another direction, by considering a decorated
version of the silver mean chain. This is the first example of a tiling with incommen-
surate tile lengths and mixed singular spectrum. It demonstrates that this structure,
hitherto only known from constant-length substitutions, is more general, and opens
an alley to the systematic study of finite-index covers of regular model sets.
Next, Section 9.4 discusses a paradigmatic example of a system with mixed spec-
trum (pure point and absolutely continuous) that is generated by a random inflation
rule. This system was originally suggested in [30], and has turned out to be an inter-
esting element in a versatile class of random dynamical systems.
Finally, with a more algebraic and number-theoretic approach, we discuss, in Sec-
tion 9.5, the structure of various lattice enumeration problems and their generalisa-
tions to systems that are needed for an extension to the theory of aperiodic order. We
1 Projects A1, B2
198 M. Baake, F. Gähler, C. Huck, P. Zeiner
2
Recently, and in parallel to our work, Keller and Richard [37] have developed an alternative view on model
Spectral and arithmetic structures in aperiodic order 199
Let us start with a cut and project scheme (CPS), which is a triple .G; H; L/ as
introduced by Meyer in [44]. Explicitly, it is given by a diagram
int
G G H ! H
[ [ [ dense
1-1
.L/ L ! int .L/ (9.2.1)
k k
?
L ! L?
with locally compact Abelian groups G; H and a lattice L G H (a discrete
and co-compact subgroup of G H ); see also [46, 47, 5] for background.3 Denote
by G and H the Haar measures on G and H , respectively, which we assume to
be consistently normalised to facilitate explicit calculations around densities of the
involved point sets, lattices (L in particular) and their sublattices.
Then, for a relatively compact set W H with H .W / > 0, usually refered to as
the window of the construction, one obtains a weak model set as
f.W / WD fx 2 L j x ? 2 W g:
Subsequently, we shall always assume that G is -compact and H is compactly gen-
erated. Here, -compactness of G is needed for the existence of van Hove averag-
ing sequences and their properties, while H compactly generated is no restriction,
as can be shown constructively [56]. In general, the lower and upper densities of
D f.W / with respect to a given van Hove averaging sequence A [56, 5] obey a
chain of inequalities [36],
dens.L/ H .W ı / 6 dens./ 6 dens./ 6 dens.L/ H .W /: (9.2.2)
The following result characterises, in terms of the autocorrelation of , the case when
the upper bound is attained. Note that, for a CPS .G; H; L/ according to Eq. (9.2.1)
and some c 2 Cc .H / which is a positive definite function on H , the weighted Dirac
comb X
!c WD c.x ? / ıx
x2L
is a Fourier transformable, translation bounded pure point measure.
This observation is useful for the autocorrelation measure of model sets as fol-
lows. Given a uniformlyP discrete point set , one considers the corresponding Dirac
comb, which is ı WD x2 ıx . With respect to a given van Hove averaging se-
quence A D .An /n2N , one defines the autocorrelation
of relative to A as the
Eberlein convolution [5, Sec. 8.8]
ın ı n
D ı ~ f
ı WD lim ;
n!1 vol.An /
sets via a systematic exploitation of the torus parametrisation for such systems; compare [8, 32, 53, 13]. Their
work includes the class of weak model sets and provides an independent way to derive several of our key results.
3
Note that our use of the notation A B includes the case that A D B.
200 M. Baake, F. Gähler, C. Huck, P. Zeiner
where n D \ An , provided the limit exists. The latter can always be achieved
by going to a suitable subsequence of A. The autocorrelation is a positive definite
measure, and thus Fourier transformable. In the setting of weak model sets, one has
the following result.
Proposition 9.2.1 ([10, Prop. 5]). Let .G; H; L/ be a CPS as in Eq. (9.2.1), with G
being -compact and H compactly generated, and let ¿ ¤ W H be relatively
compact. Next, consider the weak model set D f.W / and assume that a van
Hove averaging sequence A is given relative to which the density dens./ and the
autocorrelation measure
are to be defined. Then, the following statements are
equivalent.
(i) The lower density of is maximal, dens./ D dens.L/ H .W /;
(ii) The density of exists and is maximal, dens./ D dens.L/ H .W /;
(iii) The autocorrelation of exists and satisfies
D dens.L/ !c .
W
Here, cW D 1W 1f
W
is the covariogram function of W .
Definition 9.2.2. For a given CPS .G; H; L/ with -compact G and compactly gen-
erated H , the set f.W / is called a weak model set of maximal density relative to a
given van Hove averaging sequence A if the window W H is relatively compact
with H .W / > 0, if the density of f.W / relative to A exists, and if the density
condition dens.f.W // D dens.L/ H .W / is satisfied.
One obtains the following general result on the diffraction of weak model sets of
maximal density.
Theorem 9.2.3 ([10, Thm. 7]). Let D f.W / be a weak model set of maximal den-
sity for the CPS .G; H; L/, in the setting of Proposition 9.2.1. Then, the autocorrela-
tion
is a strongly almost periodic, pure point measure. It is Fourier transformable,
and c
is a translation bounded, positive, pure point measure on the dual group G.b
It is explicitly given by
X ˇ ˇ dens./ c
D
c ˇa.u/ˇ2 ı ; with amplitude a.u/ D 1W . u? /;
u
H .W /
u2L0
where 1c
W
b and, with L 0 being the annihilator
is a bounded, continuous function on H
1
of L G H in G H , the group L0 D .L 0 / G b is the corresponding Fourier
module in additive notation.
These sets are Meyer sets, and therefore perhaps less interesting than their coun-
terparts with maximal density. Nevertheless, Theorem 9.2.3 still holds for weak
model sets of minimal density if one replaces W by W ı in the formulas. In the spirit
of the chain of density inequalities in (9.2.2), one can derive the following ‘sandwich
result’ for an arbitrary autocorrelation of a weak model set.
Corollary 9.2.5 ([10, Cor. 10]). Let be a weak model set for the CPS .G; H; L/
from above, with relatively compact window W. If
is any autocorrelation of , it
satisfies
0 6 dens.L/ !c ı 6
6 dens.L/ !c ;
W W
We can now proceed with the general spectral theory, aiming at a result on the
dynamical spectrum of the hull of weak model sets of extremal density. In order to
do so, we first need to construct a suitable measure and establish its ergodicity. Let
D f.W / with compact W H be a weak model set of maximal density, relative
to a fixed van Hove averaging sequence A. The (geometric) hull of G is the
orbit closure G C in the local topology; compare [5, Sec. 5.4] for background. Note
that our point set is of finite local complexity, so that the local topology suffices (it
is a special case of a Fell topology [11]). The group G acts continuously on the hull
by translations. P
For our further reasoning, we represent by its Dirac comb ı WD x2 ıx ,
which is a translation bounded, positive pure point measure with support . Its hull
is
X WD fıt ı j t 2 Gg;
where the closure is taken in the vague topology. By standard arguments, X is
vaguely compact, with a continuous action of G on it. Clearly, ıt ı D ıt C , so
that the topological dynamical systems G C; G and .X ; G/ are topologically
conjugate, wherefore we tacitly identify them from now on.
The compactness of W together with the regularity of the Haar measure H im-
plies the existence of a non-empty compact set K H and a net of Œ0; 1-valued
functions g˛ 2 Cc .H / such that 1K > g˛ > 1W holds for all ˛ together with
lim˛ H .g˛ / D H .W /. Consider the weighted Dirac combs
X
!g˛ D g˛ .x ? / ıx :
x2f.K/
202 M. Baake, F. Gähler, C. Huck, P. Zeiner
Since each !g˛ , as well as ı , is supported in the same Meyer set f.K/, we have
pointwise convergence, lim˛ !g˛ D ı . This also implies norm convergence, for the
norm defined by kkK D supt 2G jj.t CK/. Note that the topology induced by k:kK
does not depend on the choice of K as long as it has non-empty interior. Moreover,
for each weighted comb !g˛ , there is a hull
X˛ D fıt !g˛ j t 2 Gg
that is compact in the vague topology and defines a topological dynamical system
.X˛ ; G/. In fact, one has more; see [39, Thm. 3.1] as well as [41].
Fact 9.2.6. Each dynamical system .X˛ ; G/ of the above type is minimal and admits
precisely one G-invariant probability measure, ˛ say, and is thus strictly ergodic.
Moreover, the system is topologically conjugate to its maximal equicontinuous factor,
wherefore it has pure point diffraction and dynamical spectrum, and the hull pos-
sesses a natural structure as a compact Abelian group.
The Dirac comb ıf.K/ clearly is a translation bounded measure. Thus, there is
a compact set K 0 G with non-empty interior and a constant C > 0 such that
kıf.K/ kK 0 6 C . By construction, we also have f.K/. It follows that both our
Dirac comb ı and the measures !g˛ are elements of
˚ ˇ
Y WD 2 M1 .G/ ˇ kkK 0 6 C ;
Note that this is the standard approach to lift continuous functions on G with compact
support to continuous functions on a compact measure space such as Y. It underlies
the fundamental relation between diffraction and dynamical spectra via the Dworkin
argument; compare [11, 14] and references therein. One now obtains the following
result.
Theorem 9.2.7 ([10, Thm. 17]). The net .˛ / of ergodic, G-invariant probabil-
ity measures converges in the vague topology, and the limit, say, is an ergodic,
Spectral and arithmetic structures in aperiodic order 203
For our approach, we started with an individual weak model set of maximal
density, which is then pure point diffractive by Theorem 9.2.3. Now, we obtained a
measure-theoretic dynamical system .X ; G; / with an ergodic measure as con-
structed above. Note that, relative to the (fixed) van Hove sequence A, it is the cluster
frequency measure.
Moreover, our weak model set of maximal density is generic for this measure
by Theorem 9.2.7, hence the individual autocorrelation
of is also the auto-
correlation of the dynamical system, and its Fourier transform, c
, is the diffraction
measure both of and of our dynamical system [11].
By the general equivalence theorem between diffraction and dynamical spectrum
in the pure point case [11, 14], the following consequence is now immediate.
Corollary 9.2.8. Let be a weak model set of maximal density, relative to a fixed
van Hove averaging sequence A, for a CPS .G; H; L/ as above. Then, is pure
point diffractive, and the dynamical system .X ; G; / with the measure from The-
orem 9.2.7 has pure point dynamical spectrum.
Note that regular model sets are special cases of weak model sets of maximal
density, and thus covered as well. Repeating the above analysis for weak model sets
of minimal density, one also obtains the following analogous result [10].
Corollary 9.2.9. Let be a weak model set of minimal density, relative to a fixed
van Hove averaging sequence A, for a CPS .G; H; L/ as above. Then, the auto-
correlation of relative to A exists, and is pure point diffractive. Moreover, the
dynamical system .X ; G; /, where is the cluster frequency measure relative to
A, has pure point dynamical spectrum.
Let us turn our attention to concrete systems with mixed spectrum, where we
begin with certain covers of regular model sets.
a preimage of the collared tile t0 t1 , the lifts of all other tiles to the right are fixed, and
analogously to the left as well. Consequently, ! 0 has precisely two preimages, and
the mapping ' W Y ! Y0 is globally 2-1.
As we have already remarked, wiping out all bars from the tiles of the original
inflation also induces a factor map, but it need not be globally 2-1. The image Y00
of this factor map must also be a factor of Y0 , so that we actually have a sequence of
factor maps
' '0
Y ! Y0 ! Y00 ;
2 -1 1-1 a.e.
where Y0 is obtained by identifying collared tiles related by a bar swap, whereas Y00
is obtained by identifying original tiles related by a bar swap. The second map, ' 0 ,
is 1-1 a.e., because the composition of ' and ' 0 is a.e. 2-1. Almost all tilings in Y00
consist of a single, infinite order supertile, and these have exactly two preimages in
Y, which differ by a bar swap. Only tilings consisting of two adjacent infinite order
supertiles may have more than two preimages, but these are of measure zero. Note
that Y0 and Y00 may coincide, but they are different in general.
Each of the translation dynamical systems has a maximal equicontinuous factor
(MEF), which is the topological counterpart of the Kronecker factor from measurable
dynamics, so that the above sequence of factor maps leads to
' '0
Y ! Y0 ! Y00
2-1 1-1 a.e.
? ? ?
? ?0 ? 00 (9.3.1)
y y y
Fact 9.3.1. The map in the commuting diagram (9.3.1) is a group homomorphism,
and can thus be either 1-1 or 2-1. If is 1-1, this implies that is 2-1 a.e., so that
Y must have mixed spectrum, whereas, if is 2-1, the mapping is 1-1 a.e., and Y
has pure point spectrum.
In order to compare YMEF and Y0MEF , we need to compare their respective return
modules. Recall that r is a return vector of a tiling dynamical system Y if there exist
two tiles t1 and t2 of the same type in some tiling ! 2 Y such that the distance of
their left endpoints is r. Let RY be the Z-span of all return vectors, which is a finite
index submodule of the Z-module generated by all tile lengths, TY . The (additive)
pure point spectrum of Y now consists of all those k 2 R such that, for any return
n
vector r, one has e2i kr ! 1 as n ! 1 by [55]. This pure point spectrum can
206 M. Baake, F. Gähler, C. Huck, P. Zeiner
Clearly, Y and Y0 have the same MEF if and only if MY D MY0 . For constant-length
substitutions, the module MY is known as the height lattice; compare [26].
Let us now look at a concrete example. Our goal is to construct an almost 2-1
extension of the silver mean inflation [5, Ex. 4.5]
sm W a 7! aba ; b 7! a;
in such a way that we gain a bar swap symmetry. In line with our previous comments,
we consider the geometric inflation rule here, forptwo prototiles. The scaling factor
(or inflation multiplier) of D sm is D 1 C 2, and the natural tile lengths for
a (long) and b (short) are and 1, respectively. Here, is a PV unit, and it is well
known that generates tilings with pure point spectrum; see [5, Chs. 7 and 9] for
details.
We now add a barred version of each prototile, thus enlarging the prototile set to
A D fa; b; a; N and look for an inflation which is primitive, commutes with the bar
N bg,
swap involution P , and reduces to under the identification of a with aN and b with
N It is easy to see that the inflation
b.
N W a 7! ab aN ; b 7! aN ; N ;
aN 7! aN ba bN 7! a: (9.3.2)
Figure 9.3.1. Covering windows of the inflation 2 . For each pair of prototiles, such as C and C ,
the lift of their left endpoints give disjoint point sets in internal space with the same closure; see
text for details.
such that, for each tile type i , there exists a window Wi that produces the point set
i D f.x/ j x 2 L; x ? 2 Wi g of left endpoints of tiles of type i in ! 0 . Choosing a
preimage ! of ! 0 under ' induces a decomposition of i into a disjoint union of two
subsets, C i and i , coming from the left endpoints of the unbarred resp. barred p tiles
C
of !. These subsets i and i are no model sets, but as subsets of ZŒ D ZŒ 2
we can still lift them to L, project them by int to internal space, and take the closure.
In this way, we can determine a covering window for each tile type of !. Of course,
we cannot expect these covering windows for the different tile types to be disjoint.
The result is shown in Figure 9.3.1.
We see that a barred and an unbarred tile always share the same covering window.
Of course, every left endpoint of a tile is either the left endpoint of a barred or an
unbarred tile, but not both. Consequently, the identical covering windows emerge
as the closure of disjoint point sets. This means that one cannot determine the bar
status of a tile by looking at its internal space coordinate. The bars represent a sort of
chemical modulation of the original silver mean tiling, where the modulation cannot
be described as a function of internal space coordinates in a reasonable way.
Another interesting point is that the hull Y00 of the original silver mean tiling is
indeed different from Y0 as generated by the inflation (9.3.3). It is not
p even MLD to
00 0 ?
Y . Even though all window boundaries of Y are in ZŒ D ZŒ 2 , there is an
additional accumulation point not contained in ZŒ? . Therefore, in contrast to Y00 ,
the corresponding fibre over the MEF has size 2. Two tilings correspond to this point,
both of which project to the same silver mean tiling. This degeneracy also shows up
208 M. Baake, F. Gähler, C. Huck, P. Zeiner
Lemma 9.3.2. The spectral measure of the translation action on Ytsm , confined to
either of the two sectors HC and H , is spectrally pure, so that it has only one
non-vanishing component in its Lebesgue decomposition. In particular, it is a pure
point measure on HC , and a continuous one on H , where the latter is either purely
singular continuous or purely absolutely continuous.
The nature of the spectrum in the odd sector now depends on the decay or non-decay
of tsm .z/ as z ! 1.
Spectral and arithmetic structures in aperiodic order 209
p
Lemma 9.3.3. One has limn!1 tsm .zn / D 1 2, where zn D .1 C /n .
Proof. We only sketch the strategy of the proof here; compare [2]. The contributions
to tsm .1 C / come from pairs of tiles where the second tile t2 is at distance 1 C
to the right of tile t1 . For any such pair, t2 is the second neighbour to the right of t1 .
We can therefore determine all possible triples of three consecutive tiles in the tiling,
and add up their contributions to the correlation at distance 1 C , weighted with the
relative frequency of each triple. The latter can be determined by Perron–Frobenius
theory [52, 5].
For the correlation at distance .1 C /n , we do the same with triples of supertiles
of order n. These have the same relative frequencies as the triples of tiles. The pairs
of tiles contributing to the correlation then consist of a left tile in the left supertile of
the triple, and a corresponding right tile at distance .1 C /n , which is contained
in one of the other two supertiles. Since the underlying silver mean tiling has pure
point spectrum, the density of tiles t1 not having a corresponding tile t2 with the
same geometry at distance .1 C /n asymptotically vanishes as n ! 1, so that
we just have to add up the contributions of matching and anti-matching pairs of tiles,
the latter ones having opposite bar status, but matching geometry. The details of the
computation are given in [2].
The main result can now be formulated as follows.
Theorem 9.3.4. The TSM dynamical system has two spectral components, both of
which are singular. The dynamical spectrum from the even sector under the bar swap
is pure point, whereas that from the odd sector is purely singular continuous.
Proof. By Lemma 9.3.2, the dynamical spectrum in the even sector is pure point,
whereas in the odd sectorPit is continuous and of pure spectral type. By Lemma 9.3.3,
the correlation measure z2 tsm .z/ ız does not decay to zero towards infinity,
so that, by the Riemann–Lebesgue lemma and the fact that is uniformly discrete
for a Pisot inflation, its Fourier transform must be singular. This implies that there
must be a singular continuous component in the diffraction spectrum, and hence in
the dynamical spectrum of the odd sector. As that spectrum is of pure type, it must
be purely singular continuous.
Remark 9.3.5. Instead of splitting each tile type of a pure point tiling into a barred
and an unbarred variant, we could have split it into k copies each, and modified the
inflation such that a permutation symmetry acting on these k copies results. Such
a procedure would lead to a mixed spectrum tiling, too, but a simple spectral purity
result for the continuous spectrum sector, as in Lemma 9.3.2, can only be expected if
k D 2.
Remark 9.3.6. The set C of left endpoints of all unbarred tiles of a TSM tiling is
a Meyer set (it is a relatively dense subset of a model set), which is linearly repeti-
tive (it is a component of a primitive inflation tiling), and which has mixed spectrum
210 M. Baake, F. Gähler, C. Huck, P. Zeiner
of singular type. This shows that there are highly ordered Meyer sets with mixed
spectrum. In fact, C has zero entropy, and must be distinguished from Meyer sets
arising from model sets with an (a posteriori) thinning disorder of positive entropy.
The latter would have a mixed spectrum with a non-trivial pure point part and an
absolutely continuous component in the spectrum. The simplest example would be
an uncorrelated thinning disorder of Bernoulli type, but more interesting are random
inflation systems [30, 45], as discussed below. These have a highly correlated thin-
ning disorder, but still positive entropy and an absolutely continuous component in
the spectrum.
%0 W a 7! ba; b 7! a;
and one can check explicitly that both % and % 0 define the same hull [5, Rem. 4.5].
As a consequence, both % and % 0 map XF into itself, as does any concatenation of
them. In fact, to any one-sided binary sequence, we can attach a sequence in these
Spectral and arithmetic structures in aperiodic order 211
two substitutions. The hull defined by the resulting concatenation would still be XF .
The topological dynamical system defined that way, .XF ; Z/ with the continuous shift
action of Z D hS i, has a number of interesting properties as follows; see [52, 5] for
details.
Proposition 9.4.1. The topological dynamical system .XF ; Z/ is strictly ergodic, and
has pure point dynamical spectrum. Moreover, it is deterministic, which is to say that
its topological entropy vanishes.
There is a natural way to turn this symbolic system into a geometric one, which is
analogous to what we did in the previous
p section. Here, one chooses the two prototiles
1
as intervals of length D 2 1 C 5 for type a and 1 for type b. This maps each
sequence from XF to a tiling of R by two intervals, where the marker is always sent
to 0. Taking now the closure of all translates by t 2 R gives a new hull, denoted
by YF , which is known as the continuous hull. This step, which can be formalised
by a standard suspension argument from ergodic theory, leads to another topological
dynamical system, .YF ; R/. Via standard results from suspension theory [24], one has
the following consequence of Proposition 9.4.1.
Corollary 9.4.2. The topological dynamical system .YF ; R/ is strictly ergodic, with
pure point dynamical spectrum and vanishing topological entropy.
Things change drastically if one allows to mix % and % 0 on a local level. Indeed,
let us consider the (Fibonacci) random substitution [30, 45] defined by
8̂ (
<a 7! ba; with probability p;
#W ab; with probability q D 1 p;
:̂
b 7! a;
where p 2 Œ0; 1 is arbitrary, but fixed. Clearly, the term ‘random’ is only justified
for 0 < p < 1. In that case, for any given p, the substitution # defines a much
larger hull, Xp , that contains the previous deterministic hull as a proper subset. Note
that we have X0 D X1 D XF , while Xp is larger, and actually independent of p, for
0 < p < 1. Our first result concerns the topological entropy.
Lemma 9.4.3 ([30, 50, 45]). For any p 2 .0; 1/, the topological entropy of the dy-
P
namical system .Xp ; Z/ is given by s D 1 log.`/
`D2 `C2 0:444398725.
Next, let us once again move from the symbolic picture to a geometric one, with
prototiles of length and 1 as above in the deterministic case. This gives a new hull,
Yp , whose elements are once again either viewed as tilings or as point sets (via left
endpoints). If these endpoints carry a ‘type’ (derived from the tile they belong to), one
obtains a unique decomposition of each point set in the hull as D a [ P b . These
212 M. Baake, F. Gähler, C. Huck, P. Zeiner
0.5
ac
pp
2 4 6 8 10
Figure 9.4.1. Illustration of the two components of the random Fibonacci diffraction measure. The
pure point part (black) is closely related to that of the perfect Fibonacci tiling. The absolutely
continuous part (grey) is represented by a bounded, continuous function.
point sets have a number of remarkable properties. In particular, for any 0 < p < 1,
one finds that
1. every 2 Yp is a Meyer set, and thus a subset of (a translate of) a model set;
2. the MEF of .Yp ; R/ is trivial, because the eigenfunctions (except the constant
one) fail to be continuous;
3. the Kronecker factor of .Yp ; R/ is non-trivial, and is given by the MEF of a
covering model set;
4. the topological entropy is positive.
The Meyer set property means that, despite having positive entropy, there is still
strong long-range order, and thus a non-trivial pure point part of the spectrum. Let us
present one result in this direction, formulated for the diffraction spectrum of Yp with
equal weight 1 on both types of points; see [5, Ch. 9] for background on diffraction.
2
with I.k/ D p5 sinc. k ? / being the intensity fuction from the deterministic
Fibonacci tiling and p the Radon–Nikodym density of b
ac , as computed previously
in [45, Prop. 6.18].
Spectral and arithmetic structures in aperiodic order 213
where the n are uniformly bounded, smooth functions. They are defined recursively
ˇ n 1 ˇ2
by nC1 .k/ D ˇp Cqe 2ik ˇ n .k/ for n > 2 together with the initial condition
2 .k/ D 1 cos 2 k , and the above sum converges uniformly.
where OK is the maximal order of K and thus contains O . Finally, the group
SOS. / is the same for all lattices in sim. /, the lattice similarity class of .
The multiplier ring is known for arbitrary planar lattices . In addition, the
Dirichlet series can be given explicitly for all that are similar to a maximal or-
der OK with class number 1; see [18] for details.
The analogous problem of counting CSLs is more difficult. This is partly due to
the fact that coincidence isometries are generally not in one-to-one correspondence
with CSLs, which gives rise to different counting functions. This can be illustrated
nicely in the case of the A4 lattice, where the Dirichlet series generating function of
the coincidence isometries [7] is simpler than the corresponding generating function
of the CSLs [34, 33, 61]. Here, it turns out that the counts are all multiples of 120, due
to an underlying action of the icosahedral group. In particular,p we have the following
result for the number of coincidence rotations, where K D Q. 5 / is a real quadratic
number field and K its Dedekind zeta function. Also, denotes Riemann’s zeta
function.
Theorem 9.5.2 ([7, Thm. 4]). Let 120 cArot4 .m/ be the number of coincidence rotations
of the root lattice A4 of index m. Then, the Dirichlet series generating function for
cArot4 .m/ is given by
X cArot .n/ K .s 1/ .s/.s 2/
‰Arot4 .s/ D 4
D
ns 1 C 5 s .2s/.2s 2/
n2N
1 C 51 s Y .1 C p s /.1 C p1 s / Y 1Cp s
D
1 52 s .1 p1 s /.1 p2 s / 1 p2 s
p˙1.5/ p˙2.5/
5 10 20 30 50 50 80 90 150 144
D 1C 2s
C 3s
C 4s
C 5s
C 6s
C 7s
C 8s
C 9s
C 10s
C 11s
C ;
and the spectrum of coincidence indices is N.
Spectral and arithmetic structures in aperiodic order 215
This allowed us to determine the asymptotic behaviour of cArot4 .m/; compare [7].
p
450 5
Corollary 9.5.3. With Arot4 D ressD3 ‰Arot4 .s/ D 6 .3/, the summatory asymp-
totic behaviour of cArot4 .m/ is given by
X x3
cArot4 .m/ Arot4 0:419375 x 3;
m6x
3
as x ! 1.
Analogous results hold for the number of CSLs of A4 . However, the formulas
are more complicated. Still, we have been able to calculate the Euler factors of the
corresponding generating function ‰A4 .s/ explicitly, but there seems to be no easy
way to write ‰A4 .s/ as product of Riemann -functions; compare [33, 61, 20].
Another result establishes the link between CSMs and SSMs for arbitrary embed-
ded Z-modules. Let
which means that scalM .R/ is the set of all scaling factors ˛ such that ˛RM is com-
mensurate to M , by which we mean that M \˛RM is a submodule of M of full rank.
Then, G D fscalM .R/ j R 2 OS.M /g has a natural group structure, which allows to
define a homomorphism OS.M / ! G. In particular, we have the following result,
where RC WD fx 2 R j x > 0g.
Theorem 9.5.4 ([61, Thm. 3.2.2]). Let M be an embedded Z-module. Then, the
kernel of the homomorphism
is the group OC.M /. Thus, OC.M / is a normal subgroup of OS.M /, and the factor
group OS.M /= OC.M / is Abelian.
This result was first proved for lattices in [28] and later generalised to S -lattices
in [29]. Here, we call a Z-module M an S -lattice, if S is a ring with identity that is
also a finitely generated free Z-module and if M is the S -span of d linearly indepen-
dent vectors in Rd .
If M Rd is a lattice or an S -lattice, all elements of OS.M /= OC.M / have finite
order. In particular, their order is a divisor of d ; see [28, 29] for details.
216 M. Baake, F. Gähler, C. Huck, P. Zeiner
Let us continue with a resultp on the icosian ring, I, which is a maximal order in
the quaternion algebra H.Q. 5 //. It is both a Z-module of rank 8, embedded in
R4 , and a ZŒ -module of rank 4; see [49] or [5, Ex. 3.9] for an explicit basis and
the relation to the root lattice E8 , which is the Minkowski embedding of I into R8 .
In particular, I is an S -lattice in the above sense, with S D ZŒ . Since I has class
number 1, all SSMs of I can be determined through the ideal theory of I and their
prime factorisation; see [16] for more. The result reads as follows, where K stands
for the Dedekind zeta function of a quadratic field K and I for that of the icosian
ring [58].
Theorem 9.5.6 ([16, Thm. 3]). The possible indices of similar submodules of the
icosian ring are precisely the squares of rational integers that can be represented by
the quadratic form x 2 C xy y 2 with x; y 2 Z. The number of SSMs of a given index
is a multiplicative arithmetic function, whose Dirichlet series generating function ˆI
reads 2 2
I .s/ K .2s/ K .2s 1/
ˆI .s/ D D
K .4s/ K .4s/
p
with K D Q. / D Q. 5 /.
Again, the cases of coincidence isometries and CSMs are more complicated; see
[20, 34, 61] for the known results in this direction.
Let us close this section with a result on well-rounded lattices in the plane. A
lattice in Euclidean d -space is called well-rounded if it contains d linearly indepen-
dent vectors of minimal (positive) length. In d D 2, the well-rounded sublattices of
a lattice are in close connection to its CSLs [19], though counting well-rounded
sublattices is generally more difficult than counting SSLs and CSLs. However, we
obtained precise results for the asymptotic growth rates for certain planar lattices. In
particular, we have proved the following result [19].
log.3/ L.1; 4/
A .x/ D x log.x/ 1 C c x C O x 3=4 log.x/
3 .2/
log.3/ log.3/
D x log.x/ C c x C O x 3=4 log.x/
2 2
Spectral and arithmetic structures in aperiodic order 217
0:6272237
1
P s
is the coefficient of .s 1/ in the Laurent series of n>1 a .n/n around s D 1.
References
[1] J.E. Anderson and I.F. Putnam, Topological invariants for substitution tilings and their asso-
ciated C -algebras, Ergodic Th. & Dynam. Syst. 18 (1998) 509–537.
[2] M. Baake and F. Gähler, Pair correlations of aperiodic inflation rules via renormalisation:
Some interesting examples, Topol. & Appl. 205 (2016) 4–27, arXiv:1511.00885.
218 M. Baake, F. Gähler, C. Huck, P. Zeiner
[3] M. Baake, F. Gähler and U. Grimm, Examples of substitution systems and their factors J. Int.
Seq. 16 (2013) art. 13.2.14 (18 pp), arXiv:1211.5466.
[4] M. Baake, F. Gähler and U. Grimm, Spectral and topological properties of a family of gener-
alised Thue–Morse sequences, J. Math. Phys. 53 (2012) 032701 (24pp), arXiv:1201.1423.
[5] M. Baake and U. Grimm, Aperiodic Order. Vol. 1: A Mathematical Invitation, Cambridge
University Press, Cambridge, 2013.
[6] M. Baake and U. Grimm, Squirals and beyond: Substitution tilings with singular continuous
spectrum, Ergodic Th. & Dynam. Syst. 34 (2014) 1077–1102, arXiv:1205.1384.
[7] M. Baake, U. Grimm, M. Heuer and P. Zeiner, Coincidence rotations for the root lattice A4 ,
Europ. J. Combin. 29 (2008) 1808–1819, arXiv:0709.1341.
[8] M. Baake, J. Hermisson and P.A.B. Pleasants, The torus parametrization of quasiperiodic LI
classes, J. Phys. A: Math. Gen. 30 (1997) 3029–3056, mp_arc/02-168.
[9] M. Baake and C. Huck, Ergodic properties of visible lattice points, Proc. V.A. Steklov Inst.
Math. 288 (2015) 184–208, arXiv:1501.01198.
[10] M. Baake, C. Huck and N. Strungaru, On weak model sets of extremal density, Indag. Math.
28 (2017) 3–31, arXiv:1512.07129.
[11] M. Baake and D. Lenz, Dynamical systems on translation bounded measures: Pure point
dynamical and diffraction spectra, Ergodic Th. & Dynam. Syst. 24 (2004) 1867–1893,
arXiv:math.DS/0302231.
[12] M. Baake and D. Lenz, Deformation of Delone dynamical systems and topological conjugacy,
J. Fourier Anal. Appl. 11 (2005) 125–150, arXiv:math.DS/0404155.
[13] M. Baake, D. Lenz and R.V. Moody, Characterization of model sets by dynamical systems,
Ergodic Th. & Dynam. Syst. 27 (2007) 341–382, arXiv:math.DS/0511648.
[14] M. Baake, D. Lenz and A.C.D. van Enter, Dynamical versus diffraction spectrum for struc-
tures with finite local complexity, Ergodic Th. & Dynam. Syst. 35 (2015) 2017–2043,
arXiv:1307.7518.
[15] M. Baake and R. V. Moody, Similarity submodules and root systems in four dimensions, Can.
J. Math. 51 (1999) 1258–1276, arXiv:math/9904028.
[16] M. Baake and R.V. Moody, Weighted Dirac combs with pure point diffraction, J. reine angew.
Math. (Crelle) 573 (2004) 61–94, arXiv:math.MG/0203030.
[17] M. Baake, R.V. Moody and P.A.B. Pleasants, Diffraction from visible lattice points and k-th
power free integers, Discr. Math. 221 (2000) 3–42, arXiv:math.MG/9906132.
[18] M. Baake, R. Scharlau and P. Zeiner, Similar sublattices of planar lattices, Can. J. Math. 63
(2011) 1220–1237, arXiv:0908.2558.
[19] M. Baake, R. Scharlau and P. Zeiner, Well-rounded sublattices of planar lattices, Acta Arithm.
166.4 (2014) 301–334, arXiv:1311.6306.
[20] M. Baake and P. Zeiner, Geometric enumeration problems for lattices and embedded Z -
modules, In Aperiodic Order. Vol. 2: Crystallography and Almost Periodicity (M. Baake and
U. Grimm, eds.), Cambridge University Press, Cambridge, 2017, 73–172, arXiv:1709.07317.
[21] M. Barge, Factors of Pisot tiling spaces and the Coincidence Rank Conjecture, Bull. Soc.
Math. France 143 (2015) 357–381, arXiv:1301.7094.
[22] A. Bartlett, Spectral theory of Z d substitutions, Ergodic Th. & Dynam. Syst. 38 (2018), 1290–
1341.
Spectral and arithmetic structures in aperiodic order 219
[23] N.G. de Bruijn, Algebraic theory of Penrose’s non-periodic tilings of the plane. I & II, Kon.
Nederl. Akad. Wetensch. Proc. Ser. A 84 (1981) 39–52 and 53–66.
[24] M. Einsiedler and T. Ward, Ergodic Theory with a View Towards Number Theory, GTM 259,
Springer, London (2011).
[25] N.P. Frank, Substitution sequences in Z d with a nonsimple Lebesgue component in the spec-
trum, Ergodic Th. & Dynam. Syst. 23 (2003) 519–532.
[26] N.P. Frank, Multidimensional constant-length substitution sequences, Topol. & Appl. 152
(2005) 44–69.
[27] F. Gähler, J. Hunton and J. Kellendonk, Integral cohomology of rational projection method
patterns, Alg. & Geom. Top. 13 (2013) 1661–1708, arXiv:1202.2240.
[28] S. Glied and M. Baake, Similarity versus coincidence rotations of lattices, Z. Krist. 223 (2008)
770–772, arXiv:0808.0109.
[29] S. Glied, Similarity and coincidence isometries for modules, Can. Math. Bull. 55 (2011) 98–
107, arXiv:1005.5237.
[30] C. Godréche and J.-M. Luck, Quasiperiodicity and randomness in tilings of the plane, J. Stat.
Phys. 55 (1989) 1–28.
[31] H. Helson, Cocycles on the circle, J. Operator Theory 16 (1986) 189–199.
[32] J. Hermisson, C. Richard and M. Baake, A guide to the symmetry structure of quasiperiodic
tiling classes, J. Phys. I (France) 7 (1997) 1003–1018, mp_arc/02-180.
[33] M. Heuer, Combinatorial Aspects of Root Lattices and Words, PhD thesis, The Open Univer-
sity, Milton Keynes, UK (2010).
[34] M. Heuer and P. Zeiner, CSLs of the root lattice A4 , J. Phys: Conf. Ser. 226 (2010) 012024 (6
pp), arXiv:1301.2001.
[35] A. Hof, On diffraction by aperiodic structures, Commun. Math. Phys. 169 (1995) 25–43.
[36] C. Huck and C. Richard, On pattern entropy of weak model sets, Discr. Comput. Geom. 54
(2015) 741–757, arXiv:1423.6307.
[37] G. Keller and C. Richard, Dynamics on the graph of the torus parametrisation, Ergodic Th. &
Dynam. Syst. 38 (2018), 1048–1085.
[38] J.-Y. Lee, R.V. Moody and B. Solomyak, Pure point dynamical and diffraction spectra, Ann.
Henri Poincaré 3 (2002) 1003–1018, arXiv:0910.4809.
[39] D. Lenz and C. Richard, Pure point diffraction and cut and project schemes for measures: The
smooth case, Math. Z. 256 (2007) 347–378, arXiv:math.DS/0603453.
[40] D. Lenz and N. Strungaru, Pure point spectrum for measure dynamical systems on locally
compact Abelian groups, J. Math. Pures Appl. 92 (2009) 323–341, arXiv:0704.2498.
[41] D. Lenz and N. Strungaru, On weakly almost periodic measures, Trans. Amer. Math. Soc. 371
(2019) 6483–6881, arXiv:1609.08219.
[42] M.J. Loquias and P. Zeiner, The coincidence problem for shifted lattices and crystallographic
point packings, Acta Cryst. A 70 (2014) 656–669, arXiv:1301.3689.
[43] M.J. Loquias and P. Zeiner, Coincidence indices of sublattices and coincidences of colorings,
Z. Krist. 230 (2015) 749–759, arXiv:1506.00028.
[44] Y. Meyer, Algebraic Numbers and Harmonic Analysis, North Holland, Amsterdam, 1972.
220 M. Baake, F. Gähler, C. Huck, P. Zeiner
[45] M. Moll, Diffraction of random noble means words, J. Stat. Phys. 156 (2014) 1221–1236,
arXiv:1404.7411.
[46] R.V. Moody, Meyer sets and their duals. In The Mathematics of Long-Range Aperiodic Order
(R.V. Moody, ed.), NATO ASI Series C 489, Kluwer, Dordrecht, 1997, 403–441.
[47] R.V. Moody, Model sets: a survey. In From Quasicrystals to More Complex Systems (F. Axel,
F. Dénoyer, and J.P. Gazeau, eds.), Springer, Berlin and EDP Sciences, Les Ulis, 2000, 145–
166, arXiv:math.MG/0002020.
[48] R.V. Moody, Uniform distribution in model sets, Can. Math. Bulletin 45 (2002) 123–130.
[49] R.V. Moody and J. Patera, Quasicrystals and icosians, J. Phys. A: Math. Gen. 26 (1993) 2829–
2853.
[50] J. Nilsson, On the entropy of a family of random substitutions, Monatsh. Math. 166 (2012)
1–15, arXiv:1103.4777.
[51] P.A.B. Pleasants and C. Huck, Entropy and diffraction of the k-free points in n-dimensional
lattices, Discr. Comput. Geom. 50 (2013) 39–68, arXiv:1112.1629.
[52] M. Queffélec, Substitution Dynamical Systems – Spectral Analysis, 2nd ed., LNM 1294,
Springer, Berlin, 2010.
[53] M. Schlottmann, Generalised model sets and dynamical systems. In Directions in Mathemat-
ical Quasicrystals (M. Baake and R.V. Moody, eds.), CRM Monograph Series vol. 13, AMS,
Providence, RI, 2000, 143–159.
[54] J.-P. Schreiber, Approximations diophantiennes et problemes additifs dans les groupes
abéliens localement compacts, Bull. Soc. Math. France 101 (1973) 297–332.
[55] B. Solomyak, Dynamics of self-similar tilings, Ergod. Th. & Dynam. Syst. 17 (1997) 695–738
and Ergod. Th. & Dynam. Syst. 19 (1999) 1685 (erratum).
[56] N. Strungaru, Almost periodic measures and Meyer sets. In Aperiodic Order. Vol. 2: Crys-
tallography and Almost Periodicity (M. Baake and U. Grimm, eds.), Cambridge University
Press, Cambridge, 2017, 271–342, arXiv:1501.00945.
[57] G. Tenenbaum, Introduction to Analytic and Probabilistic Number Theory, Cambridge Uni-
versity Press, Cambridge, 1995.
[58] M.-F. Vignéras, Arithmétique des Algebres de Quaternions, LNM 800, Springer, Berlin, 1980.
[59] P. Zeiner, Well-rounded sublattices and coincidence site lattices. In Aperiodic Crystals
(S. Schmid, R.L. Withers, and R. Lifshitz, eds.), Springer, Dordrecht, 2013, 43–48.
[60] P. Zeiner, Similar submodules and coincidence site modules, Acta Phys. Pol. A 126 (2014)
641–645, arXiv:1402.5013.
[61] P. Zeiner, Coincidence Site Lattices and Coincidence Site Modules, Habilitation thesis, Biele-
feld University, 2015.
Chapter 10
Affine special Kähler structures in real
dimension two
M. Callies and A. Haydys
10.1 Introduction
The notion of a special Kähler structure has its roots in physics [15, 10] and appears in
supersymmetric field theories, -models, and supergravity. The following definition
of an affine special Kähler structure is due to Freed [12].
Remark 10.1.2. In this contribution, we only consider affine special Kähler struc-
tures, which are simply abbreviated in what follows as special Kähler ones. These
should not be confused with projective special Kähler structures, which are not con-
sidered in this article.
There are many reasons for a mathematician to care about special Kähler struc-
tures. Perhaps one of the most important is the so called c-map construction [1,
6, 8, 12, 24], which associates to a special Kähler structure on M a hyperKähler
metric on the total space of T M . Moreover, each cotangent space is a complex
1 Project C11
222 M. Callies, A. Haydys
Such coordinate systems always exist in a neighbourhood of any point [12]. More-
over, for any j; k we have
@Wj @Wk
D
@Zk @Zj
and therefore there is a holomorphic function F such that
@F
Wk D :
@Zk
i X @2 F
! D Im dZj ^ dZN k : (10.2.1)
2 @Zj @Zk
j;k
One more useful object, which can be attached to a special Kähler structure, is
the so called holomorphic cubic form, which is defined as follows. Consider the
fiberwise projection .1;0/ onto the T 1;0 M TC M as a 1-form with values in TC M
Since this form vanishes on vectors of type .0; 1/, we can think of .1;0/ as an element
of 1;0 .M I TC M /. Then, the holomorphic cubic form is
„ WD ! .1;0/ ; r .1;0/ 2 H 0 M I Sym3 T M :
In terms of the holomorphic prepotential, the holomorphic cubic form can be ex-
pressed as follows:
1X @3 F
„ D dZj ˝ dZk ˝ dZl :
4 @Zj @Zk @Zl
j;k;l
One can show that „ measures the difference between the flat connection r, which
is part of the special Kähler structure, and the Levi–Civita connection [12].
224 M. Callies, A. Haydys
10.2.2 An extrinsic description In [1], the local description of special Kähler man-
ifolds in terms of special holomorphic coordinates was reformulated as an extrinsic
description of simply connected special Kähler manifolds:
Given an n-dimensional special Kähler manifold, then locally, special conjugate
coordinate systems fZj g and fWj g define an immersion
D .Z1 ; : : : ; Zn ; W1 ; : : : ; Wn /
into T Cn D C2n . Thinking of .Z1 ; : : : ; Zn ; W1 ; : : : Wn / as a canonical coordinate
system on T Cn , the standard complex symplectic form on T Cn is
X
D dZj ^ dWj :
j
Example 10.2.1. The basic example in this context is given by a closed holomorphic
P @F
1-form # D j Fj dZj on M D Cn with invertible real matrix Im. @Zjk /. Then, the
image of
#
M ,! T Cn D C2n
is a special Kähler manifold. Locally, # D dF is the differential of a holomorphic
function F, the holomorphic prepotential.
Using the global trivialisation of T provided by the real coordinates .x; y/ the con-
nection r is described by its connection 1-form !r 2 1 I gl.2; R/ . A computa-
tion shows that r is torsion-free and satisfies (10.1.1) if and only if !r can be written
in the form
!11 !11
!r D : (10.2.2)
!22 !22
Here denotes the Hodge star operator with respect to the flat metric. Moreover, r
preserves the symplectic form ! D 2e u dx ^ dy if and only if tr !r D !11 C !22 D
du. Thus, r is parameterised by a single 1-form, say !11 .
Furthermore, by a direct computation one obtains that the flatness of r implies
that WD e u !11 is closed. Hence, r is in fact parameterised by a single closed
1-form ; Moreover, r is flat if and only if
Here D @2xx C @2yy is the Laplacian with respect to the flat metric.
Assume that any class in H 1 .I R/ can be represented by a co-closed—hence,
harmonic—1-form. Then, choosing a co-closed representative of Œ, we can write
D C 21 d.h C e u /. A computation shows that (10.2.3) is equivalent to
the metric g D e u jdzj2 is special Kähler. Moreover, the connection 1-form of the
flat connection r is given by (10.2.2) with
In particular, for the punctured disc B1 WD B1 nf0g the first de Rham cohomology
group is generated by the following harmonic 1-form:
y dx x dy
'D :
x2 C y2
Hence, we have the following result.
Remark 10.2.4. The last equation of (10.2.5) is the celebrated Kazdan–Warner equa-
tion [20].
Theorem 10.2.5 ([17, Thm. 1.1]). Let g D wjdzj2 be a special Kähler metric on
B1 . Assume that „ is holomorphic on the punctured disc and the order of „ at the
origin is N 2 Z. Then,
w D jzjN C1 log jzj C C o.1/ or w D jzjˇ C C o.1/ (10.2.8)
Remark 10.2.6. In [17], the first formula of (10.2.8) appears in the form w D
jzjN C1 log jzj eO.1/ , which follows from McOwen’s analysis of solutions of the
Kazdan–Warner equation [27]. The asymptotics as stated in Theorem 10.2.5 can be
Affine special Kähler structures in real dimension two 227
obtained from [25, Prop. 3.1], which in fact provides even more refined asymptotics
near the origin.
Theorem 10.2.7 ([5]). Let g D wjdzj2 be a special Kähler metric on B1 such that
w D jzjˇ C C o.1/ or w D jzjN C1 log jzj C C o.1/ ;
where ˇ < N C 1 (in the second case, we put by definition ˇ D N C 1). Let Hol.r/
denote the monodromy of r along a loop that goes once around the origin. Then, the
following holds:
ˇ sin ˇ
If ˇ … Z, Hol.r/ is conjugate to cos
sin ˇ cos ˇ
;
11
If ˇ 2 2Z, Hol.r/ is trivial or conjugate to 01 ;
If ˇ 2 2Z C 1, Hol.r/ is id or conjugate to 0
1 1
1 .
Proof. Since Hol.r/ 2 Sp.2; R/ D SL.2; R/, the characteristic polynomial of Hol.r/
has integer coefficients if and only if tr Hol.r/ 2 Z. This implies that Hol.r/ is con-
jugate to a matrix lying in Sp.2; Z/ if and only if cos ˇ 2 f0; ˙ 12 ; ˙1g.
10.2.4 A link between two local descriptions Our next goal is to obtain a link be-
tween the two descriptions of special Kähler structures in terms of solutions of (10.2.7)
and in terms of special holomorphic coordinates. Notice that special holomorphic co-
ordinates always exist in a neighbourhood of a point, where the special Kähler struc-
ture is regular. However, in a neighbourhood of a singular point there may be no
special holomorphic coordinates. More precisely, we have the following.
Proof. Observe first, that for any negative harmonic function h the triple .h; u; a/ D
.h; log. h/; 0/ is a solution of (10.2.7). Moreover, in this case by (10.2.6) we have
!11 D 0, which implies that rdx D 0. In other words, z is a special holomorphic
coordinate.
228 M. Callies, A. Haydys
@2 F
Im D 2h:
@z 2
If is a disc, this equality determines F up to a polynomial of degree 2, cf. [12,
Prop. 1.38(c)].
If D B1 , by [3, Thm. 3.9] there is A > 0 such that h D A log jzj C h0 , where
h0 is a smooth harmonic function on B1 . Hence, we have the following result.
In particular, a special Kähler structure on B1 such that ord0 „ 6 2 does not
admit a special holomorphic coordinate in a neighbourhood of the origin.
10.2.6 Examples
with @F
@z
D w, as in Example 10.2.1.
This special Kähler structure is related to the Ooguri–Vafa metric [29], see [7].
The monodromy of r along the circle of radius 1 centered at the origin can be
computed explicitly and equals
2A
Hol.r/ D 1 B :
0 1
Example 10.2.13. Apply Proposition 10.2.11 to the Poincaré metric on the punctured
2
disc gQ D jzj 2 log jzj jdzj2 to obtain that the metric
is special Kähler.
jf 0 .z/j2
w D 4 2 ; (10.2.9)
1 C Kjf .z/j2
230 M. Callies, A. Haydys
where f is a meromorphic function on with at most simple poles such that f 0 .z/ ¤
0 on . Conversely, for any (not necessarily simply connected) and any f as
above, (10.2.9) determines a metric of constant negative curvature K. Hence, by
Proposition 10.2.11 the metric
ˇ ˇ
1 ˇ1 C Kjf .z/j2 ˇ
g D jdzj2
2 jf 0 .z/j
Example 10.2.15. By a classical result of Picard [30], for any given n > 3 pairwise
P points .z1 ; : : : ; zn / in C and any n real numbers .˛1 ; : : : ; ˛n / such that ˛j < 1
distinct
and ˛j > 2, there exists a unique metric of constant negative curvature gQ on
C n fz1 ; : : : ; zn g satisfying gQ D jz zj j 2˛j .c C o.1//jdzj2 near zj . Hence, the
corresponding special Kähler metric g has a conical singularity near zj ,
g D jz zj j˛j .c C o.1//jdzj2:
reads
Z k
1 1X
1
K D .P / C ˇj :
2 2
j D1
P1
Here, K is the curvature of g and is the Euler characteristic. Since K > 0, com-
pare [12, Rem. 1.35], we obtain
k
X
ˇj > 2.P 1 / D 4: (10.3.1)
j D1
10.3.2 Families of special Kähler metrics on P 1 Just like in Example 10.2.15, for
any k > 3 points z1 ; : : : ; zk on P 1 and any ˛1 ; : : : ; ˛k such that
k
X
˛j < 1 and ˛j > 2;
j D1
Proposition 10.3.1. For any k > 3 points on P 1 and any ˛1 ; : : : ; ˛k such that
k
X
˛j < 1 and ˛j < 2;
j D1
for all j D 1; k. Moreover, near 1, this metric satisfies Eq. (10.3.2), which corre-
sponds to
g D jj 6 c C o.1/ jdj2
in a local coordinate near 1.
232 M. Callies, A. Haydys
Applying a Möbius transformation, we can move .z1 ; z2 ; z3 / into any given triple
of points. Hence, Proposition 10.3.1 yields a family of special Kähler metrics with
singularities at k C 1 > 4 points parameterised by k C 2.k 3/ D 3k 6 real
parameters.
Remark 10.3.2. Restriction (10.3.1) does not apply to the special Kähler metrics
constructed in Proposition 10.3.1, since such metrics always have singularities of
order 3.
References
[1] D.V. Alekseevsky, V. Cortés and C. Devchand, Special complex manifolds, J. Geom. Phys. 42
(2002), 85–105.
[2] D.V. Alekseevsky, V. Cortés, M. Dyckmanns and T. Mohaupt, Quaternionic Kähler metrics
associated with special Kähler manifolds, J. Geom. Phys. 92 (2015), 271–287.
[3] Sh. Axler, P. Bourdon and W. Ramey, Harmonic Function Theory, GTM 137, 2nd ed.,
Springer, New York, 2001.
[4] O. Baues and V. Cortés, Realisation of special Kähler manifolds as parabolic spheres, Proc.
Amer. Math. Soc. 129 (2001), 2403–2407.
[5] M. Callies and A. Haydys, Local models of isolated singularities for affine special Kähler
structures in dimension two, Int. Math. Res. Notices, arXiv:1711.09118.
[6] S. Cecotti, S. Ferrara and L. Girardello, Geometry of type II superstrings and the moduli of
superconformal field theories, Int. J. Mod. Phys. A 4 (1989), 2475–2529.
[7] K. Chan, The Ooguri–Vafa metric, holomorphic discs and wall-crossing, Math. Res. Lett. 17
(2010), 401–414.
[8] V. Cortés, On hyper-Kähler manifolds associated to Lagrangian Kähler submanifolds of
T Cn , Trans. Amer. Math. Soc. 350 (1998), 3193–3205.
[9] V. Cortés, Special Kähler manifolds: a survey, in: Proceedings of the 21st Winter School
“Geometry and Physics” (Srní, 2001) Circolo Matematico die Palermo 69 (2002), 11–18.
[10] B. de Wit and A. Van Proeyen, Potentials and symmetries of general gauged N D 2
supergravity-Yang-Mills models, Nuclear Phys. B 245 (1984), 89–117.
[11] R. Donagi and E. Markman, Cubics, integrable systems, and Calabi–Yau threefolds. In Pro-
ceedings of the Hirzebruch 65 Conference on Algebraic Geometry (Ramat Gan, 1993) (M. Te-
icher, ed.), Bar-Ilan Univ., Ramat Gan, 1996, 199–221.
[12] D. Freed, Special Kähler manifolds, Commun. Math. Phys. 203 (1999), 31–52.
[13] D. Gaiotto, G.W. Moore and A. Neitzke, Four-dimensional wall-crossing via three-
dimensional field theory, Commun. Math. Phys. 299 (2010), 163–224.
[14] D. Gaiotto, G.W. Moore and A. Neitzke, Wall-crossing, Hitchin systems, and the WKB ap-
proximation, Adv. Math. 234 (2013), 239–403.
[15] S.J. Gates, Jr., Superspace formulation of new nonlinear sigma models, Nuclear Phys. B 238
(1984), 349–366.
Affine special Kähler structures in real dimension two 233
[16] A. Haydys, HyperKähler and quaternionic Kähler manifolds with S 1 -symmetries, J. Geom.
Phys. 58 (2008), 293–306.
[17] A. Haydys, Isolated singularities of affine special Kähler metrics in two dimensions, Commun.
Math. Phys. 340 (2015), 1231–1237.
[18] N. Hitchin, The moduli space of complex Lagrangian submanifolds, Asian J. Math. 3 (1999),
77–91.
[19] N. Hitchin, The self-duality equations on a Riemann surface, Proc. Lond. Math. Soc., III. Ser.
55 (1987), 59–126.
[20] J. Kazdan and F. Warner, Curvature functions for compact 2-manifolds, Ann. of Math. (2) 99
(1974), 14–47.
[21] D. Kraus, O. Roth and T. Sugawa, Metrics with conical singularities on the sphere and sharp
extensions of the theorems of Landau and Schottky, Math. Z. 267 (2011), 851–868.
d 2 log
[22] J. Liouville, Sur l’équation aux différences partielle dudv
˙ 2a2 D 0, J. Math. Pures Appl.
1re série 18 (1853), 71–72.
[23] Zh. Lu, A note on special Kähler manifolds, Math. Ann. 313 (1999), 711–713.
[24] O. Macia and A. Swann, Twist geometry of the c-map, Commun. Math. Phys. 336 (2015),
1329–1357.
[25] R. Mazzeo, Y. Rubinstein and N. Sesum, Ricci flow on surfaces with conic singularities, Anal.
PDE 8 (2015), 839–882.
[26] R. Mazzeo, J. Swoboda, H. Weiss and F. Witt, Ends of the moduli space of Higgs bundles,
Duke Math. J. 165 (2016), 2227–2271.
[27] R. McOwen, Prescribed curvature and singularities of conformal metrics on Riemann sur-
faces, J. Math. Anal. Appl. 177 (1993), 287–298.
[28] A. Neitzke, Notes on a new construction of hyperkahler metrics. In Homological mirror sym-
metry and tropical geometry, Springer, Cham, 2014, 351–375.
[29] H. Ooguri and C. Vafa, Summing up Dirichlet instantons, Phys. Rev. Lett. 77 (1996), 3296–
3298.
[30] E. Picard, De l’équation U D ke U sur une surface de Riemann fermée, J. de Math. 9
(1893), 273–291.
[31] M. Troyanov, Metrics of constant curvature on a sphere with two conical singularities. In
Differential Geometry (F.J. Carreras, O. Gil-Medrano, and A.M. Naveira, eds.), LNM 1410,
Springer, Berlin, 1989, 296–306.
Chapter 11
Non-crossing partitions
B. Baumeister, K.-U. Bux, F. Götze, D. Kielak and H. Krause
Non-crossing partitions have been a staple in combinatorics for quite some time.
More recently, they have surfaced (sometimes unexpectedly) in various other con-
texts from free probability to classifying spaces of braid groups. Also, analogues
of the non-crossing partition lattice have been introduced. Here, the classical non-
crossing partitions are associated to Coxeter and Artin groups of type An , which
explains the tight connection to the symmetric groups and braid groups. We shall
outline those developments.1
The authors would like to thank Holger Kösters for carefully reviewing parts of
this text and for his very helpful comments.
The subsets Bi are called the blocks of the partition p. Another way to look at this
is to consider p as an equivalence relation on U . In this perspective, the subsets
Bi are the equivalence classes. Let q be another partition of the same set U . We
say that q is a refinement of p if each block of q is contained in a block of p. In
terms of equivalence relations, if two elements of U are q-equivalent, they are also
p-equivalent. We also say that q is finer than p or that p is coarser than q; and we
write q p.
Let P.U / be the set of all partitions on the underlying set U . The refinement
relation is a partial order on the set P.U /, which is therefore a poset. Moreover,
it is a lattice, i.e., every non-empty finite subset P P.U / has a least upper bound
and a greatest lower bound. We remark that the partition lattice is complete, i.e., even
arbitrary infinite subsets have least upper and greatest lower bounds.
Remark 11.1.1. It is interesting that the definition of a complete lattice can be weak-
ened by breaking the symmetry between upper and lower bounds. If a poset has
upper bounds and greatest lower bounds, it is already a complete lattice (i.e. it also
has lowest upper bounds).
1 Projects A4, C3, C13
236 B. Baumeister, K.-U. Bux, F. Götze, D. Kielak, H. Krause
v3
v4 v2
v5 v1
v6 v8
v7
Figure 11.1.1. Visualisation of the partition f f 1 g; f 2; 6; 7 g; f 3; 5 g; f 4 g; f 8 g g.
Sketch of proof. Let P be a non-empty subset of the poset. We consider the the set
B C .P / of all common upper bounds for the non-empty subset P . Since the poset
has upper bounds, B C .P / is non-empty. Hence it has a greatest lower bound, which
turns out to be the lowest upper bound of P .
Consider the following reflexive and symmetric relations on U :
v3 w3 v3 w2
v4 v2 v4 v2
w4 w1
v5 v1 v5 v1
w5 w8
v6 v8 v6 v8
v7 w6 v7 w7
However, the noncrossing partition lattice is not a sublattice of the whole parti-
tion lattice: the join operation in both structures differ, i.e., the finest partition coarser
than some given non-crossing partitions does not need to be non-crossing; see Re-
mark 11.1.3 for a counterexample.
The complement map
NC.m/ ! NC.m/
p 7 ! pc
is an anti-automorphism of the lattice NC.m/: it reverses the refinement relation and
interchanges the roles of meet and join. It is, however, not an involution. In the
picture, taking the Kreweras complement twice seems to get you back to the original
partition. This is true; however, the indexing of the vertices shifts by one. Thus, the
square of the Kreweras complement is given by cyclically rotating the element of the
underlying set f 1; : : : ; m g.
The bottom (finest) element ? of NC.m/ is the partition with m blocks, each of
size one. The top (coarsest) element > of NC.m/ is the partition with a single block.
For each non-crossing partition p, we define its rank rk.p/ in terms of its number of
blocks:
rk.p/ WD m #f blocks of p g
For any non-crossing partition p, all maximal chains from the bottom element ? to
p have the same length, which coincides with the rank rk.p/. Let us summarise the
properties and non-properties of the poset of non-crossing partitions:
Fact 11.1.2. The set NC.m/ of non-crossing partitions of an m-element is partially
ordered by refinement. This poset is a lattice and self-dual with respect to the Krew-
eras complement, i.e.,
.p ^ q/c D pc _ q c
.p _ q/c D pc ^ q c
for any two p; q 2 NC.m/.
238 B. Baumeister, K.-U. Bux, F. Götze, D. Kielak, H. Krause
Remark 11.1.3. For m > 4, the non-crossing partition lattice NC.m/ is not a sub-
lattice of the partition lattice: the join operations do not coincide. A counterexample
for m D 4 is p D f f 1; 3 g; f 2 g; f 4 g g and q D f f 1 g; f 2; 4 g; f 3 g g. The join of
these partitions in the partition lattice is f f 1; 3 g; f 2; 4 g g whereas the join in NC.4/
is the top element. These two partitions also show that the non-crossing partition
lattice NC.m/ is not semi-modular, i.e., the following inequality does not hold for all
partitions p and q,
rk.p/ C rk.q/ > rk.p _ q/ C rk.p ^ q/:
Kreweras also determined the Möbius function for the lattice of non-crossing par-
titions. Recall that, for a finite poset P , the Möbius function
W f .u; v/ 2 P P j u 6 v g ! Z
is defined by the following recursion:
.u; u/ D 1;
X
.u; v/ D .u; w/:
u6w<v
Note that the value .u; v/ is completely determined by the isomorphism type (as a
poset) of the interval Œu; v WD f w 2 P j u 6 w 6 v g.
Fact 11.1.5 (see [41, Thm. 6] or [13, Cor. 3.2]). For the non-crossing partition poset
NC.m/, the Möbius function satisfies
.2m 2/Š
.?; >/ D . 1/m 1Cm 1 D . 1/m 1
(11.1.1)
.m 1/ŠmŠ
Non-crossing partitions 239
v4 w4 w3
v5 v3
w5 w2
v6 v2
w6 w1
v7 v1
w7 w12
v8 v12
w8 w11
v9 v11
v10 w9 w10
Figure 11.1.3. Two nested partitions p q and their blockwise complement. For the dual vertices
w4 and w10 , different conventions are possible to determine which dual vertex is to be used with
which block of q.
Remark 11.1.7. To every poset .P; 6/, one associates the order complex. This is
the simplicial complex .P; 6/ whose vertices are the elements of P and whose
simplices are chains in P , i.e., non-empty subsets of P on which 6 is a total order. By
a theorem of P. Hall, one can interpret the Möbius function as the Euler characteristic
of order complexes [49, Prop. 3.8.6],
for all k; l 2 N0 .
Let A1 and A2 denote unital sub-algebras in A, for instance generated by ele-
ments a and b respectively. They are called ‘free’ if the expectations of all products
with factors alternating between elements from A1 and A2 vanish whenever the ex-
pectations of all factors vanish. Hence the elements a; b 2 A are called free if
' .aj1 '.aj1 //.b k1 '.b k1 // .ajm '.ajm //.b km '.b km // D 0
(11.2.2)
that is, all adjacent elements in a1 : : : ak belong to different sub-algebras Aji . This
notion has similar properties as classical independence. For instance, polynomials
P .aj / of free self-adjoint elements aj (generating a sub-algebra) are free again.
The density .x/ D p1 exp. x 2 =2/ defines the standard Gaussian distribu-
2
tion. Hence, the classical central limit theorem (CLT) may be stated for independent
random elements ai ; i 2 N from a commutative C -probability space .A; '/ with
identical distribution such that '.ai / D 0; '.ai2 / D 1 (such variables are called
standardised).
242 B. Baumeister, K.-U. Bux, F. Götze, D. Kielak, H. Krause
For a partition p we have AN;p < N jpj . If all parts of p satisfy jBj j > 2 and one
block is of size at least three, the corresponding contribution in (11.2.5) is of order
jmp jAN;p N k=2 6 jmp jN 1=2 , that is all these terms are asymptotically negligible
as N tends to infinity.
k
Hence, computing the asymptotic limit of '.SN / reduces to considering all mixed
moments of k factors with each random element occurring precisely twice, a conse-
k
quence being that limN !1 '.SN / D 0 for k odd.
Recall that NC.n/ denoted the lattice of all non-crossing partitions on the set
Œn D f1; : : : ; ng. Furthermore, let NC2 .2k/ denote the subset of non-crossing par-
titions with blocks of size 2 only, called ’non-crossing pair partitions’ on a set of 2k
elements.
Now consider as an example three free standardised variables a, b, c. Then
the product abc 2ab corresponds to a pair partition with a crossing, that is p D
ff1; 5g; f3; 4g; f2; 6gg. Hence '.abc 2 ab/ D '.abab/'.c 2/ D 0 by freeness, that
is (11.2.2). Otherwise, for a non-crossing pair partition like ca2 b 2 c, we have
Non-crossing partitions 243
p
This means e.g. that the rescaled sum .a1 C a2 /= 2 of two free elements a1 ; a2
of a non-commutative probability space .A; '/ which both have density w.x/ again
has a Wigner distribution. In free probability an element s of .A; '/ with density
w.x/ is called semi-circular and its moments are given by
(
1 2k
n ; if n D 2k;
'.s / D kC1 k (11.2.8)
0; if n odd:
Recall that a 2 .A; '/ is called positive if there exists an c 2 .A; '/ with
a D c c. Thus a is self-adjoint. Define the free multiplicative convolution of two
compactly supported measures a ; b , of positive free elements a; b 2 .A; '/, say
a b , as follows by specifying its moments. Since in a C -probability space A
244 B. Baumeister, K.-U. Bux, F. Götze, D. Kielak, H. Krause
positive square roots a1=2 resp. b 1=2 of a resp. b as well as the positive element
pa;b WD a1=2 ba1=2 are again in A, we may define a b by:
Z
x k da b .x/ WD '.pa;b
k
/; k 2 N: (11.2.9)
k k
Since '.pa;b / D '.pb;a /; k 2 N, because ' is tracial, i.e. '.ba/ D '.ab/, we
conclude that the free convolution is commutative. By the same tracial property
k
and the relation of freeness, we show that '.pa;b / D '..ab/k / and this implies the
associativity of . Moreover it follows from this representation that the multiplicative
convolution measure a b is uniquely determined by the distributions of a and
b .
In order to effectively compute both additive and multiplicative convolution of
measures, one needs more properties of the lattice of partitions of 1; : : : ; n into blocks
and the subset of non-crossing partitions together with the notion of multi-linear cu-
mulant functionals. As above let Bj ; j D 1; : : : s denote the blocks of a partition
p 2 NC.n/ of 1; : : : ; n.
For p 2 NC.n/, the free mixed cumulants are multi-linear functionals p W An !
C defined in terms of a moment decomposition using the Möbius function .q; p/ of
the lattice of non-crossing partitions NC.n/. We define the general mixed cumulant
functionals p as follows:
X
p Œa1 ; : : : ; an D 'q Œa1 ; : : : ; an .p; q/; where (11.2.10)
q2NC.n/;pq
0 1 0 1
Y Y
'q Œa1 ; : : : ; an WD ' @ ak A ' @ ak A ;
k2B1 k2Bs
Q
and the products k2Bj ak repeat the order of indices within the block Bj . Note that
by Hall’s theorem, the coefficient .p; q/ can also be written as .q; p/ using the
relation of reversed refinement (see Remark 11.1.7).
Then one shows, see [44, Prop. 11.4], that
X
'.a1 an / D p Œa1 ; : : : ; an : (11.2.11)
p2NC.n/
Lemma 11.2.4 ([44, Thm 11.20]). The elements a1 ; : : : ; an 2 A are free if and only
if all mixed cumulants satisfy
n Œaj1 ; : : : ; ajk D 0;
whenever aj1 ; : : : ajk , 1 6 jl 6 n; 1 6 k 6 n contains at least two different ele-
ments.
Non-crossing partitions 245
In contrast to (11.2.3), this characterisation of freeness holds even if the '.aj / are
non-zero.
For a partition p 2 NC.n/, recall that pc denotes its Kreweras complement in
NC.n/. Then, one shows that for free elements a; b the following recursion involving
the Kreweras complement holds:
X
n Œab; : : : ; ab D p Œa; : : : ; apc Œb; : : : ; b: (11.2.12)
p2NC.n/
See [44, Rem. 14.5]. This entails that the cumulants of ab and thus by (11.2.11) the
moments of ab are indeed determined by multi-linear functionals of a and b alone
which again by virtue of (11.2.10) are determined by the moments of a together with
the moments of b.
The recursive Equation (11.2.12) and the Definition (11.2.10) of cumulants may
be conveniently encoded as algebraic relations
P1 between the following formal gen-
n n
erating series. For a 2 A let Ma .z/ D nD1 '.a /z denote
Pthe moment gen-
1 n
erating series and with n .a/ WD n Œa; : : : ; a let Ra .z/ WD nD1 n .a/z and
1
Ra .z/ WD z Ra .z/ denote cumulant generating series. In particular, for free self-
adjoint a; b 2 A we get by binomial expansion of n .a C b/ and Lemma 11.2.4 that
n .a C b/ D n .a/ C n .b/ and furthermore, as shown in [44, Lect. 12],
Lemma 11.2.6. For two free self-adjoint positive elements a; b 2 A, one has
Sab .z/ D Sa .z/Sb .z/ (11.2.16)
!
k 1 lk C 1
'.a / D DW Ck;l ; for all k > 1: (11.2.19)
lk C 1 k
The proof is based on combinatorial properties of non crossing partitions, see [5].
Since .G2 Gl Gl G2 / and G1 G1 are asymptotically free (see Section 1.1 of
this volume), we get by induction for the asymptotic distribution of l the recursion
l D l 1 1 , where 1 can be identified with the limiting Marchenko–Pastur dis-
tribution of G1 G1 . For arbitrary N N independent Wigner matrices (which are Her-
mitian matrices with entries which are independent random variables unless restricted
by symmetry) the relation (11.2.20) has been shown by combinatorial techniques af-
ter an appropriate regularisation in [2]. For more details on the asymptotic spec-
tral distribution of products of so-called Girko–Ginibre matrices (having independent
and identically distributed random entries) and their inverses using the free proba-
bility calculus, see [30]. Strictly speaking one needs to extend the non-commutative
C -probability spaces to spaces of unbounded operators to include distributions with
non-compact support like those of Gaussian matrices see e.g. [22].
Remarkably, the same results hold for powers instead of products. Since
G1l 1 .G1l 1 / and G1 G1 are also asymptotically free, a similar argument as above
shows that the asymptotic distribution of .G1l /.G1l / is also given by l . Similarly as
above, these results also extend to powers of non-Gaussian random matrices.
The calculus of S -transforms may even be used to describe the asymptotic spectral
measure of W W when some of the factors in W D G1 Gl are inverted, after ap-
propriate regularisation of the inverse matrices [30]. For instance, for W D G1 G2 1 ,
the limiting distribution of W W is given by the square of a Cauchy distribution.
Moreover, the calculus of R-transforms makes it possible, at least in principle,
to deal with the case where W is a sum of independent products as above [40]. For
instance, for W D G1 G2 1 C G3 G4 1 , the limiting distribution of W W is also given
by the square of a Cauchy distribution. This is related to the Cauchy distribution
being “stable” under free additive convolution.
Xn WD f f z1 ; : : : ; zn g D j zi ¤ zj for i ¤ j g
248 B. Baumeister, K.-U. Bux, F. Götze, D. Kielak, H. Krause
v3
v4 v2
v5 v1
v6 v8
v7
Figure 11.3.1. The path (braid) that is associated to the partition f f 1 g; f 2; 6; 7; 8 g; f 3; 5 g; f 4 g g.
describes a loop in the configuration space Xn , which does not depend (up to ho-
motopy relative to the basepoint) on the order of factors. We identify it with the
corresponding element of the fundamental group Bn .
Fact 11.3.1. The braid group Bn is generated by the braids i corresponding to the
counter-clockwise rotations vi 7! vi C1 7! vi for i D 1; : : : ; n 1.
In terms of these generators, the braid group Bn admits the following presenta-
tion:
i j D j i for ji j j > 2
Bn D 1 ; : : : ; n 1
i j i D j i j for ji j j D 1
W Bn ! Sn
Non-crossing partitions 249
v1 v5
v2 v4
v3
Figure 11.3.2. The generator 2 in the braid group on five strands. On the left, the “top view”
representation is shown whereas and on the right we have the “front view” given by a strand
diagram.
from the braid group on n strands to the symmetric group on n letters. A braid
corresponds to a motion of the n points v1 ; : : : ; vn , and at the end of this motion, the
dots may have changed positions. This way, each braid induces a permutation.
Strand diagrams are another frequently used visual representation of braids. Re-
call that a braid is given by a path in configuration space, i.e. the simultaneous motion
of n points in the disk D. Parametrizing time by a real number in Œ0; 1, each of those
moving points traces out a “strand” in D Œ0; 1. The diagrams we have used so far
can be regarded as a “top view” onto the cylinder D Œ0; 1. A strand diagram is
a view from the front. Here, it is useful to put the initial configuration U with the
hemicircle fully visible from the front. Figure 11.3.2 shows the two representations
of the generator 2 in B5 . Here, the generator i corresponds to a crossing of the i th
and the .i C 1/th strands. The left strand runs over the right strand. We call such a
crossing positive. The inverses of the generators correspond to negative crossings.
11.3.1 A classifying space for the braid group Tom Brady [15] has given a con-
struction of a classifying space for braid groups that is strongly related to non-crossing
partitions and has found some interesting applications.
Recall that the Cayley graph CG† .G/ of a group G relative to a specified generat-
ing set † is the graph with vertex set G and edges connecting g to gx for any g 2 G
and x 2 † n f 1 g. Note that the requirement x ¤ 1 rules out loops. Obviously, there
250 B. Baumeister, K.-U. Bux, F. Götze, D. Kielak, H. Krause
is more structure here: the edge is oriented from g to gx and should be regarded as
labeled by the generator x.
Observation 11.3.3. Since † is a generating set for G, the Cayley graph CG† .G/ is
connected: if we can write an element g as a word
"
g D x1"1 xkk
is an edge path connecting the identity element 1 to g. Note that the exponents of the
generators tell us whether to traverse edges with or against their orientation.
There are two generating sets for the braid group (and the symmetric group) of
particular interest to us. First, we consider the digon generators ij corresponding
to the counter-clockwise rotation vi 7! vj 7! vi . Let Bn be the Birman–Ko–Lee-
monoid [12, Section 2], i.e., the monoid generated by all the ij . We remark that Bn
is strictly larger than the submonoid of positive braids (those that can be drawn using
positive crossings only), which is the monoid generated by the i . We define a partial
order on the braid group by:
ˇ 6 ˇ0 W” ˇ 1 0
ˇ 2 Bn
The image sij 2 Sn of ij in the symmetric group is a transposition. Consider the
Cayley graph of the symmetric group Sn with respect to the generating set T Sn
of all transpositions. We define a partial order, called the absolute order, on Sn as
follows: For permutations ; 2 Sn we declare 6T if there is a geodesic (i.e.,
shortest possible) path in the Cayley graph connecting the identity 1 to and passing
through .
Our largest generating set is:
n WD f p j p 2 NC.n/ g Bn
Fact 11.3.4. [see [11, 15]] Let p; q 2 NC.n/. Then the following are equivalent:
1. In NC.n/, we have p q.
2. In n , the element p is a left-divisor of q , i.e., there exists r 2 NC.n/ such
that
q D p r
Non-crossing partitions 251
of posets.
p WD f f 1; 2; 8 g; f 3; 5 g; f 4 g; f 6 g; f 7 g g and q WD f f 1; 2; 6; 7; 8 g; f 3; 5 g; f 4 g g
D ı
D ı
The braid group Bn has a particularly nice presentation over the generating set
n :
1 2 D 3 for 1 ; 2 ; 3 2 n n f 1 g (11.3.1)
are a defining set of triangular relations for the braid group Bn with respect to the
generating set n n f 1 g.
Let Q n be the Cayley graph of the braid group Bn with respect to the generating
set n n f 1 g. A clique in Q n is a set of vertices that are pairwise connected via an
edge. As a directed graph, Q n does not have oriented cycles and each clique is totally
ordered by the orientation of edges. Thus, a clique is of the form
Observation 11.3.7. All maximal chains in NC.n/ have length n. Hence, all maxi-
mal simplices in YQn have dimension n.
Theorem 11.3.8 ([15, Thm. 6.9 and Cor. 6.11]). The clique complex YQn is con-
tractible, and the braid group Bn acts freely on it. Consequently, the orbit space
Yn WD Bn n YQn
nI : one from intrinsic divisibility with quotients again in nI and one induced from
the ambient poset n , i.e., divisibility where quotients are allowed to be anywhere
in n . However, since nI D n \ BnI , the two poset structures coincide. Then,
n #I Š nI is an isomorphism of posets.
Moreover, the order preserving bijection f 1; : : : ; n #I g ! f 1; : : : ; n g n I
induces an isomorphism NC.n #I / Š NCI .n/. This isomorphism is compatible
with the poset isomorphism from Fact 11.3.4, and we have a commutative square of
poset isomorphisms:
n #I nI
NC.n #I / NCI .n/
The identity nI D n \ BnI has another consequence:
Observation 11.3.9. Let YQnI be the full subcomplex spanned by BnI as a set of ver-
tices in YQn . Then, YQnI is isomorphic to YQn #I , whence it is contractible by Theo-
rem 11.3.8. For any coset ˇ BnI , regarded as a set of vertices in YQn , the full subcomplex
spanned by ˇ BnI is the translate ˇ YQnI and also contractible.
Observation 11.3.10. Assume that two coset complexes ˇ YQnI and ˇ 0 YQnJ intersect,
N Then ˇ YQ I D ˇN YQ I and ˇ 0 YQ J D ˇN YQ J . In this case, the intersection
say in ˇ. n n n n
is contractible.
N. U / WD f A j ¿ ¤ U g
of all index sets whose associated intersection is non-empty is called the nerve of the
family U . If U is a family of subcomplexes in a CW complex, one has the following:
Theorem 11.3.11 (Nerve Theorem, see [35, Cor. 4G.3]). Suppose U D .U˛ /˛2A
is a covering of a simplicial complex X by a family of contractible subcomplexes.
Suppose further that, for each 2 N. U /, the intersection U is contractible. Then,
the nerve N. U / is homotopy equivalent to X .
254 B. Baumeister, K.-U. Bux, F. Götze, D. Kielak, H. Krause
We deduce:
Proposition 11.3.12. The complex XQn is homotopy equivalent to the nerve N of the
family
f ˇ Bnk j ˇ 2 Bn ; 1 6 k 6 n g
of cosets.
Definition 11.3.13 ([1, 2.1]). Let G be a group and let H be a family of subgroups.
We say that H is m-generating for G if the coset nerve
NG .H/ WD N.f gH j g 2 G; H 2 H g/
is .m 1/-connected.
Corollary 11.3.14. The family Bn WD f Bn1 ; : : : ; Bnn g is m-generating for the braid
group Bn if and only if XQ n is .m 1/-connected.
Recall that Bn acts freely on the simplicial complex YQn . The projection YQn ! Yn
is a covering space map. In fact, YQn is the universal cover of Yn and the braid group
Bn acts as the group of deck transformations. The subcomplex XQ n is Bn -invariant.
Let Xn be its image in Yn .
Proposition 11.3.15. The family Bn WD f Bn1 ; : : : ; Bnn g is m-generating for the braid
group Bn if and only if the pair .Yn ; Xn / is m-connected.
Proof. First, consider the long exact sequence of homotopy groups for the inclusion
XQ n 6 YQn :
! 1 .XQ n / ! 1 .YQn / ! 1 .YQn ; XQ n / ! 0 .XQ n / ! 0 .YQn /
Since YQn is contractible, we obtain isomorphisms:
d C1 .YQn ; XQ n / Š d .XQ n /
On the other hand, YQn ! Yn is a covering space projection and therefore enjoys the
homotopy lifting property. Moreover, XQ n is the full preimage of Xn . Therefore any
map
Bd C1 ; Sd ; ! .Yn ; Xn ; 1/
Non-crossing partitions 255
inducing a map
d C1 .Yn ; Xn / ! d C1 .YQn ; XQ n /
which is inverse to the map
d C1 .YQn ; XQ n / ! d C1 .Yn ; Xn /
Remark 11.3.16. For n > 3, the family Bn is 1-generating for Bn , and for n > 4, it
is 2-generating.
S
Proof. A family H is 1-generating for G if and only if H 2H H generates G. It
is 2-generating for G if G is the product of the H 2 H amalgamated along their
intersections [1, 2.4].
Note that the braid group Bn is generated by counter-clockwise rotations
ˇij WD vi 7! vj 7! vi
around digons. Thus, Bn WD f Bn1 ; : : : ; Bnn g generates as long as n > 3 since then
each digon-generator is contained in some Bnk .
Considering the digon-generators for Bn , defining relations are given by braid
relations, visible in isomorphic copies of B3 inside Bn , and commutator relations,
visible in isomorphic copies of B4 inside Bn . Hence all necessary defining relations
are visible in the amalgamated product of the Bnk Š Bn 1 provided n > 5.
For n D 4, the challenge is to derive the commutator relations:
We do the first, the second is done analogously. Calculating with only three strands
at a time, we find:
Remark 11.3.17. The little computation at the end of the preceeding proof shows
that the commutator relations are redundant in the braid group presentation given in
[15, Lem. 4.2]. Accordingly, they are also redundant in the analoguous presentation
from [12, Prop. 2.1].
Theorem 11.3.18. For n > 4, the family Bn is m-generating for Bn if and only if
the homology groups Hd .Yn ; Xn / are trivial for 1 6 d 6 m.
Proof. As n > 4, the pair .Yn ; Xn / is 1-connected by Propositions 11.3.15 and 11.3.16.
Thus, it follows from the relative Hurewicz theorem that m-connectivity of the pair is
equivalent to m-acyclicity. By Proposition 11.3.15, this translates into higher genera-
tion of Bn by Bn .
As the pair .Yn ; Xn / consists of finite complexes that can be described explicitly,
Theorem 11.3.18 implies that it is a finite problem to determine the higher connec-
tivity properties of Bn relative to the family Bn . In particular, the question whether
the bounds derived in Example 13.5.4 for higher generation in braid groups are sharp
becomes amenable to empirical investigation.
Here, mij D 1 indicates that there is no defining relation for si and sj . We will refer
to the relations appearing above as braid relations (even though some authors reserve
this term for the relation with mij D 3).
If one additionally forces the generators si into being involutions, one obtains
the associated Coxeter group. A pair consisting of a Coxeter group together with
the generating set fs1 ; : : : ; sn g is called a Coxeter system; its rank is defined to be the
cardinality of the generating set. If the Coxeter group is spherical, the Coxeter system
is said to be spherical as well.
A Coxeter group is spherical if it is finite; an Artin group is spherical if the
corresponding Coxeter group is spherical.
Note that the braid group Bn is an Artin group and the symmetric group Sn is the
associated Coxeter group. Here, mij D 3 for ji j j D 1 and mij D 2 otherwise.
See Fact 11.3.1
Artin groups form a rich class of groups of importance in geometric group theory
and beyond. From geometric group theory perspective they remain in focus largely
due to the following conjecture.
Non-crossing partitions 257
Conjecture 11.3.20 (Charney). Every Artin group is CAT(0), i.e. it acts properly and
cocompactly on a CAT(0) space.
A CAT(0) space is a metric space with curvature bounded from above by 0; for
details see the book by Bridson–Haefliger [19]. From the current perspective let us
list some properties of CAT(0) groups: algorithmically, such groups have quadratic
Dehn functions and hence soluble word problem; geometrically, all free-abelian sub-
groups thereof are undistorted; algebraically, the centralisers of infinite cyclic sub-
groups thereof split; topologically, the space witnessing CAT(0)-ness of a group G is
a finite model for EG and thus, for example, allows to compute the K-theory of the
reduced C -algebra Cr .G/ provided the Baum–Connes conjecture is known for G.
Conjecture 11.3.20 has been verified by Charney–Davis for right-angled Artin
groups (RAAGs), that is for Artin groups with each mij equal to 2 or 1. Outside of
this class, the conjecture is mostly open. In particular, it is open (in general) for the
braid groups Bn .
To prove that a group G is CAT(0), one has to first construct a space X on which
G acts properly and cocompactly, and then prove that the space is indeed CAT(0).
We shall use the space YQn from above, on which Bn acts freely and with compact
quotient.
What is missing, however, is a metric structure on YQn . Such a metric can be
specified by realising the simplices in euclidean space, i.e., by endowing each simplex
in YQn with the metric of a euclidean polytope. Instead of the standard one, we will
follow Brady–McCammond [16].
We now endow each maximal simplex in YQn with the orthoscheme metric. Let
† D f ˇ; ˇ1 ; : : : ; ˇn g
Lemma 11.3.22. YQn is CAT(0) if and only if the link of each vertex in YQn is CAT(1).
258 B. Baumeister, K.-U. Bux, F. Götze, D. Kielak, H. Krause
Here CAT(1) means that the curvature of the space is bounded above by that of the
unit sphere; again, for details see [19].
The poset n Š NC.n/ has a unique maximal element, which is the braid
corresponding to the full counter-clockwise rotation:
v1 7! v2 7! 7! vm 7! v1
The nth power
n is central in the braid group Bn . In fact, it generates the infinite
cyclic center of Bn . Brady–McCammond observed in [16] that this algebraic fact has
a geometric counterpart: YQn splits as a cartesian product of the real line R and another
metric space. The R-factor inside YQn points in the direction of the edges labelled by
.
Because of this, instead of looking at the link of a vertex u in YQn , one can look at
the link of a midpoint of the (long) edge .u; u
/; every two such links are isometric
(since Bn acts transitively on the vertices of YQn ), and so let L denote any such link.
To compute the curvature of L, it is enough to study the subcomplex of YQn
spanned by all simplices containing the edge .u; u
/. Clearly, this is the subcomplex
spanned by L and u with 1 6 6
, with simplices defined by the chain condition
as before. Thus, such a link is isomorphic as a simplicial complex to the realisation
of NC.n/; the subcomplex also comes with a metric, and it is clear that this coincides
with the realisation of NC.n/ being endowed with its own orthoscheme metric defined
as before by identifying each maximal simplex with the n-orthoscheme. We will re-
fer to the realisation of NC.n/ with this metric simply as the orthoscheme complex of
NC.n/.
Note that if the orthoscheme complex of NC.n/ is CAT(0), then L, isometric to the
link of the midpoint of the main diagonal, is CAT.1/, which implies that YQn , and so
Bn , is CAT(0).
In view of the above, Brady–McCammond formulate the following conjecture.
Conjecture 11.3.23 ([16, Conj. 8.4]). For every n, the orthoscheme complex of NC.n/
is CAT(0), and so the braid group Bn is CAT(0).
Theorem 11.3.24 ([16, Thm. B]). For n 6 5, the braid group Bn is CAT(0).
Non-crossing partitions 259
Theorem 11.3.25 ([32, Cor. 4.18]). For n 6 6, the braid group Bn is CAT(0).
Note that their proof is not computer assisted. The crucial improvement in the
work of Haettel–Kielak–Schwer is to use the observation (present already in [16]),
that the link L can be embedded into a spherical building, in the following way.
First observe that the vertices of L are non-trivial proper partitions; let p be such
a partition with blocks B1 ; : : : ; Bk . Let F be the field of two elements; we associate
to p the subspace of Fn D hb1 ; : : : ; bn i which is the intersections of the kernels of
the characters X
bj D 0
j 2Bi
where 1 6 i 6 k, and bj is the j -th character in the basis dual to the bj .
It is easy to see that this gives
Pn a map
sending each vertex of L to a proper non-
trivial subspace of V WD ker b
j D1 j . But these subspaces are precisely the ver-
tices of the spherical building of SLn 1 .F/, and it turns out that our bijection extends
to a map sending each maximal simplex in L onto a chamber (i.e. maximal simplex)
in the building in an isometric way. Thus we may view L as a subcomplex of the
building.
The spherical building is CAT(1), and this information gives the extra leverage
used to prove Theorem 11.3.25.
(R1) ˆ generates V ;
(R2) ˆ \ R˛ D f˙˛g for all ˛ 2 ˆ;
(R3) s˛ .ˇ/ is in ˆ for all ˛; ˇ 2 ˆ.
The spherical Coxeter groups W are precisely the finite real reflection groups.
Coxeter classified the finite root systems which then also gives a classification of
the spherical Coxeter systems: there are the infinite families of type An ; Bn ; Cn and
Dn and some exceptional groups. For instance .Sn ; Str / is of type An 1 . Note that
the groups of type Bn and Cn are isomorphic; and also that the root systems of type
An ; Bn ; Cn and Dn are all crystallographic that is
.˛; ˇ/
2 Z for all ˛; ˇ 2 ˆ:
.˛; ˛/
Definition 11.4.1. For a dual Coxeter system .W; T / and a Coxeter element c in W
the set of non-crossing partitions is
NC.W; c/ D fu 2 W j u 6T cg:
This definition is conform with the definition in type An , see Fact 11.3.4.
The length function lT yields a grading on NC.W; c/ and the map
1
d W NC.W; c/ ! NC.W; c/; x 7! x c
The dual Matsumoto property for a Coxeter element c 2 W is the statement that
if we have two shortest factorisations
c D t1 tm D u1 um with ti ; ui 2 T
is a presentation of W .
We obtain the dual Matsumoto property for an arbitrary element w 2 W by re-
placing c by w in the definition of the dual braid relations and of the dual Matsumoto
property above.
For an element w 2 W , let
in order to distinguish between W and A.W /. We call an Artin group A.W / spher-
ical if the Coxeter group is spherical. And in the rest of this section, we always
consider spherical Artin groups.
Notice that the Matsumoto property implies that one can lift every w 2 W to an
element in A.W / just by mapping w to .s1 /a .sk /a 2 AW whenever w D s1 sk
is a reduced factorisation of w into elements of S . We denote this section of W in
A.W / by W .
The non-crossing partitions are a good tool for the better understanding of the
spherical Artin groups; for instance they can be used to construct a finite simplicial
classifying space for the spherical Artin groups (see Section 11.3.1), or to solve the
word or the conjugacy problem in them, see [17, 10].
The basic idea of this solution of the word and the conjugacy problem in the
spherical Artin group A.W / is to give a new presentation of A.W / as follows. Let
Non-crossing partitions 263
NC.W; c/a be a copy of the set of non-crossing partitions NC.W; c/ with respect to a
standard Coxeter element c, that is there is a bijection
a W NC.W; c/ ! NC.W; c/a :
Then the new generating set is NC.W; c/a ; and the new relations are the expressions
.w1 /a .wr /a whenever w1 ; w2 ; : : : ; wr are the vertices of a circuit in
Œid; c6T CGNC.W;c/ .W /:
Then this presentation can be used to obtain a new normal form for the elements in
A.W / [10]. Notice that this presentation generalises the presentation of the braid
group given by Birman, Ko and Lee [12] to all the spherical Artin groups, see also
Fact 11.3.6 in Section 11.3.1.
Next, we explain this new presentation. Denote the group given by the presenta-
tion above by A.W; c/. The strategy to prove that A.W; c/ and A.W / are isomorphic
is to use Garside theory. As a first step the presentation above can be transformed into
a presentation with set of generators a copy Ta D fta j t 2 T g of T and set of re-
lations the dual braid relations with respect to c. The next step is to consider the
monoid A.W; c/ generated by Ta and the dual braid relations, and to show that this
is a Garside monoid. Then using Garside theory one shows that the group of fractions
Frac.A.W; c// of A.W; c/ equals A.W; c/. The last step is to prove that the group
of fractions Frac.A.W; c// and the Artin group A.W / are isomorphic.
Note also that a basic ingredient in the proof of Theorem 11.4.3 is the dual Mat-
sumoto property for c, that is the transitivity of the Hurwitz action of the braid group
BlT .c/ on RedT .c/.
The isomorphism between A.W; c/ and AW given by Bessis is difficult to under-
stand explicitly. So an immediate question is what the elements of NC.W; c/a are
expressed in the generating set Sa ?
The rational permutation braids, that is, the elements xy 1 where x; y 2 W , are
also called Mikado braids as they satisfy in type An 1 a topological condition and are
therefore easy to recognise. This condition on an element in the Artin group A.W /
of type An 1 , that is on a braid in the braid group Bn , is that we can lift and remove
continuously one strand after the next of the braid without disturbing the remaining
strands until we reach an empty braid [26].
Proof. This is [26] for those groups of type different from Dn and [8] for those of
type Dn .
264 B. Baumeister, K.-U. Bux, F. Götze, D. Kielak, H. Krause
Figure 11.4.1. A Mikado braid in AB8 whose image in AB8 is a Mikado braid in AD8 .
Notice that Licata and Queffelec [43] have a proof of Theorem 11.4.4 in types
A,D,E with a different approach using categorification.
In order to be able to find a topological property that characterises the Mikado
braids as in type An 1 topological models for the series of spherical Artin groups
AW are needed. There is an embedding of Artin groups of type Bn into those of
type A2n 1 . The situation in type Dn is as follows [8]: The root system of type Dn
embeds into the root system of type Bn , which implies that the Coxeter system of
type Dn is a subsystem of that one of type Bn . But there is not an embedding of
the Artin group of type Dn into that one of type Bn that satisfies a certain natural
condition. Let .W; S / be a Coxeter system of type Bn . Then there is precisely one
element s 2 S that is a reflection corresponding to a short root. Let
where hhs 2 ii is the normal closure of s 2 in ABn . Then the following holds.
Proposition 11.4.5 ([8, Lem. 2.5 and Prop. 2.7]). There is a natural embedding of
ADn onto an index-2 subgroup of ABn . More precisely, there is the following com-
mutative diagram
Š
A Bn ! A Bn ht1 ; : : : ; tn i ADn
? ?
? ?
y B D y
A Bn ! WBn WDn
B
The embedding of ADn into ABn makes it possible to associate braid pictures to
the ADn -elements and to characterise Mikado braids in type Dn geometrically.
A reader familiar with Hecke algebras will find it interesting that the Mikado
braids satisfy a positivity property involving the canonical Kazhdan-Lusztig basis
C WD fCw j w 2 W g of the Iwahori–Hecke algebra H.W / related to the Coxeter
Non-crossing partitions 265
Theorem 11.5.1 ([6, Thm. 1.3]). Let .W; T / be a (finite or infinite) dual Coxeter
system of finite rank n and let c D s1 sm be a parabolic Coxeter element in W. The
Hurwitz action on RedT .c/ is transitive.
Theorem 11.5.1 is also more general than Theorem 1.4 in [37], as in [6] dual
Coxeter systems are considered while in [37] Coxeter systems, and in general the set
of Coxeter elements is in a dual system larger than that one in a Coxeter system.
The proof of Thereom 11.5.1 is based on a study of the Cayley graphs CGS .W /
and CGT .W /. Using the same methods one can also show that every reflection oc-
curring in a reduced T -factorisation of an element of a parabolic subgroup P of W is
already contained in that parabolic subgroup.
Theorem 11.5.2 ([6, Thm. 1.4]). Let .W; S / be a (finite or infinite) Coxeter system,
P a parabolic subgroup and w 2 P . Then RedT .w/ D RedT \P .w/.
This basic fact was not known before and can be seen as a founding stone towards
a general theory for ‘dual’ Coxeter systems.
Hurwitz action in the spherical Coxeter systems and quasi-Coxeter elements. In
the rest of the section, .W; T / is a finite dual Coxeter system.
In order to understand the dual Coxeter systems .W; T / one also needs to know
for which elements in W the Hurwitz action is transitive. The answer to that question
is as follows [7].
A parabolic quasi-Coxeter element is an element w 2 W that has a reduced
factorisation into reflections such that these reflections generate a parabolic subgroup
of W .
Note if one reduced T -factorisation of w 2 W generates a parabolic subgroup P
then every reduced T -factorisation of w is in P by Theorem 11.5.2. It also follows
that every such factorisation generates P [7, Thm. 1.2].
266 B. Baumeister, K.-U. Bux, F. Götze, D. Kielak, H. Krause
Theorem 11.5.3 ([7, Thm. 1.1]). Let .W; S / be a spherical Coxeter system and let
w 2 W . The Hurwitz action is transitive on RedT .w/ if and only if w is a parabolic
quasi-Coxeter element.
Recently, Wegener showed that the dual Matsumoto property holds for quasi-
Coxeter elements in affine Coxeter systems as well [52]. These two results have the
following consequence.
be the root and the coroot lattices, respectively. Quasi-Coxeter elements are also
intrinsic in the dual Coxeter systems as they generate the root as well as the coroot
lattice: Let w D t1 tn be a reduced T -factorisation of w 2 W and let ˛i 2 ˆ be
the root related to the reflection ti for 1 6 i 6 n.
Theorem 11.5.5 ([9, Thm. 1.1]). Let ˆ be a finite crystallographic root system of
rank n. Then w is a quasi-Coxeter element if and only if
1. f˛i j 1 6 i 6 ng is a Z-basis of the root lattice L.ˆ/, and
2. f˛i_ j 1 6 i 6 ng is a Z-basis of the coroot lattice L.ˆ_ /.
Thus if all the roots in ˆ are of the same length, then L.ˆ/ D L.ˆ_ / and the
quasi-Coxeter elements correspond precisely to the basis of the root lattice.
Quasi-Coxeter elements and Coxeter elements share further important properties
beyond Hurwitz transitivity.
Finally, Gobet observed that, in a spherical Coxeter system, every parabolic quasi-
Coxeter element can be uniquely written as a product of commuting parabolic quasi-
Coxeter elements [31]. This factorisation of a quasi-Coxeter element can be thought
of as a generalisation of the unique disjoint cycle decomposition of a permutation.
.a; x/
sx W K0 .A/ ! K0 .A/; a 7! a 2 x; (11.6.1)
.x; x/
assuming that .x; x/ ¤ 0 divides .ei ; x/ for all i . Let us denote by W .A/ the group
of automorphisms of K0 .A/ that is generated by the set of simple reflections S.A/ D
fse1 ; : : : ; sen g; it is called the Weyl group of A.
From now on, assume that A is hereditary, that is, of global dimension at most
one. Then, one can show that the Weyl group W .A/ is actually a Coxeter group.
For example, the path algebra kQ of any quiver Q is hereditary and in that case
kQ-modules identify with k-linear representations of Q.
Proposition 11.6.1 ([36, Thm. B.2]). A Coxeter system .W; S / is of the form
.W .A/; S.A// for some finite dimensional hereditary algebra A if and only if it is
crystallographic in the following sense:
1. mst 2 f2; 3; 4; 6; 1g for all s ¤ t in S , and
2. in each circuit of the Coxeter graph not containing the edge label 1, the num-
ber of edges labelled 4 (resp. 6) is even.
We may assume that the simple A-modules are numbered in such a way that
hei ; ej i D 0 for i > j , and we set c D se1 sen . Note that c D c.A/ is a Coxeter
268 B. Baumeister, K.-U. Bux, F. Götze, D. Kielak, H. Krause
The rest of this article is devoted to explaining this result. In particular, the crucial
notion of an exceptional sequence will be discussed.
This result goes back to beautiful work of Ingalls and Thomas [38]. It was then
established for arbitary path algebras by Igusa, Schiffler, and Thomas [37], and we
refer to [36] for the general case. Observe that path algebras of quivers cover only the
Coxeter groups of simply laced type (via the correspondence A 7! W .A/); so there
are further hereditary algebras.
We may think of Theorem 11.6.2 as a categorification of the poset of non-crossing
partitions. There is an immediate (and easy) consequence which is not obvious at all
from the original definition of non-crossing partitions; the first (combinatorial) proof
required a case by case analysis.
Proof. Any finite Coxeter group can be realised as the the Weyl group W .A/ of a
hereditary algebra of finite representation type. In that case any thick subcategory is
coreflective. On the other hand, it is clear from the definition that the intersection
of any collection of thick subcategories is again thick. This yields the join, but also
the meet operation; so the poset of thick and coreflective subcategories is actually a
lattice; see Remark 11.1.1
This categorification provides some further insight into the collection of all posets
of non-crossing partitions. This is based on the simple observation that any thick
and coreflective subcategory C mod A (given by an exceptional sequence E D
.E1 ; : : : ; Er /) is again the module category of a finite dimensional hereditary algebra,
say C D mod B. Then the inclusion mod B ! mod A induces not only an inclusion
K0 .B/ ! K0 .A/, but also an inclusion W .B/ ! W .A/ for the corresponding Weyl
groups, which identifies W .B/ with the subgroup of W .A/ generated by sE1 ; : : : ; sEr ,
Non-crossing partitions 269
and identifies the Coxeter element c.B/ with the non-crossing partition sE in W .A/.
Moreover, the inclusion W .B/ ! W .A/ induces an isomorphism
NC.W .B/; c.B// ! fx 2 NC.W .A/; c.A// j x 6 sE g:
The following result summarises this discussion; it reflects the fact that there is
a category of non-crossing partitions. This means that we consider a poset of non-
crossing partitions not as a single object but look instead at the relation with other
posets of non-crossing partitions.
Corollary 11.6.4 ([36, Cor. 5.8]). Let NC.W; c/ be the poset of non-crossing parti-
tions given by a crystallographic Coxeter group W . Then, any element x 2 NC.W; c/
is the Coxeter element of a subgroup W 0 6 W that is again a crystallographic Cox-
eter group. Moreover,
NC.W 0 ; x/ D fy 2 NC.W; c/ j y 6 xg:
Theorem 11.7.1 ([36, Thm 5.6]). The poset of subobjects of a generalised Cartan
lattice is isomorphic to the poset of non-crossing partitions NC./. The iso-
morphism sends a monomorphism W 0 ! to s.e1 / s.er / where cox. 0 / D
se1 ser . Moreover, the assignment w 7! wj 0 induces an isomorphism
W ./ hs.e1 / ; : : : ; s.er / i ! W . 0 /:
Theorem 11.8.1 (Happel). A hereditary abelian category with a tilting object is, up
to a derived equivalence, either of the form mod A for some finite dimensional hered-
itary algebra A or of the form coh X for some weighted projective line X.
Non-crossing partitions 271
The proof is straightforward. But a priori it is not clear that the product sE1 sEr
is a non-crossing partition. In fact, the proof of Theorem 11.6.2 hinges on the transi-
tivity of the Hurwitz action on factorisations of the Coxeter element. So the analogue
of Theorem 11.6.2 for categories of type A D coh X remains open. A proof would
provide an interesting extension of the theory of crystallograpic Coxeter groups and
non-crossing partitions, which seems very natural in view of Happel’s theorem since
the Grothendieck group K0 .A/ is a derived invariant.
Partial results were obtained recently by Wegener in his thesis [51]. In fact, when
a weighted projective line X is of tubular type (that is, the weight sequence is up to
permutation of the form .2; 2; 2; 2/; .3; 3; 3/; .2; 4; 4/ or .2; 3; 6/), then the Grothen-
dieck group gives rise to a tubular elliptic root system [47, 48]. Wegener showed the
transitivity of the Hurwitz action in this case. Thus, one has in particular the analogue
of Theorem 11.6.2 for coh X in the tubular case.
272 B. Baumeister, K.-U. Bux, F. Götze, D. Kielak, H. Krause
References
[1] H. Abels and S. Holz, Higher generation by subgroups, J. Algebra 160 (1993), 310–341.
[2] N. Alexeev, F. Götze and A. Tikhomirov, Asymptotic distribution of singular values of powers
of random matrices, Lith. Math. J. 50 (2010), 121–132.
[3] L. Angeleri Hügel, D. Happel and H. Krause (eds.), Handbook of Tilting Theory, Cambridge
University Press, Cambridge, 2007.
[4] D. Armstrong, Generalized noncrossing partitions and combinatorics of Coxeter groups, Mem.
Amer. Math. Soc. 202 (2009), No. 949.
[5] T. Banica, S. Belinschi, M. Capitaine and B. Collins, Free Bessel laws, Canad. J. Math. 63
(2011), 3–37.
[6] B. Baumeister, M. Dyer, C. Stump and P. Wegener, A note on the transitive Hurwitz action
on decompositions of parabolic Coxeter elements, Proc. Amer. Math. Soc. Ser. B 1 (2014),
149–154.
[7] B. Baumeister, T. Gobet, K. Roberts and P. Wegener, On the Hurwitz action in finite Coxeter
groups, J. Group Th. 20 (2017), 103–131.
[8] B. Baumeister and T. Gobet, Simple dual braids, noncrossing partitions and Mikado braids of
type Dn , Bull. London Math. Soc., 49 (2017), 1048–1065.
[9] B. Baumeister and P. Wegener, A note on Weyl groups and root lattices, Archiv der Mathe-
matik 111 (2018), 469–477.
[10] D. Bessis, The dual braid monoid, Ann. Sci. École Normale Sup. 36 (2003), 647–683.
[11] P. Biane, Some properties of crossings and partitions, Discr. Math. 175 (1997), 41–53.
[12] J. Birman, K.H. Ko and S.J. Lee, A new approach to the word and conjugacy problems in the
braid groups, Adv. Math. 139 (1998), 322–353.
[13] A. Blass and B.E. Sagan, Möbius functions of lattices, Adv. Math. 127 (1997), 94–123.
[14] B.H. Bowditch, Notes on locally CAT(1) spaces. In Geometric Group Theory (R. v. Charney,
M. Davis, and M. Shapiro, eds.), de Gruyter, Columbus, OH (1995), 3, 1–48.
[15] T. Brady, A partial order on the symmetric group and new K.1/’s for the braid groups, Adv.
Math. 161 (2001), 20–40.
[16] T. Brady and J. McCammond, Braids, posets and orthoschemes, Algebr. Geom. Topol., 10
(2010), 2277–2314.
[17] T. Brady and C. Watt, K.; 1/’s for Artin groups of finite type, Geom. Dedicata 94 (2002),
225–250.
[18] T. Brady and C. Watt, Noncrossing partitions lattices in finite real reflection groups, Trans.
Amer. Math. Soc. 360 (2008), 1983—2005.
[19] M.R. Bridson and A. Haefliger, Metric Spaces of Non-Positive Curvature, Springer, Berlin,
1999.
[20] G.P. Chistyakov and F. Götze, Limit theorems in free probability theory. I, Ann. Probab. 36
(2008), 54–90.
[21] G.P. Chistyakov and F. Götze, Limit theorems in free probability theory. II, Cent. Eur. J. Math.
6 (2008), 87–117.
Non-crossing partitions 273
[22] G.P. Chistyakov and F. Götze, The arithmetic of distributions in free probability theory, Cent.
Eur. J. Math. 9 (2011), 997–1050.
[23] G.P. Chistyakov and F. Götze, Asymptotic expansions in the CLT in free probability, Probab.
Th. Rel. Fields 157 (2013), 107–156.
[24] W. Crawley-Boevey, Exceptional sequences of representations of quivers In Proceedings of
the Sixth International Conference on Representations of Algebras (Ottawa, 1992) (V. Dlab
and H. Lenzing, eds.), Carleton Univ., Ottawa, ON, 1992, 117–124.
[25] P. Deligne, Letter to E. Looijenga, 9/3/1974. Available at:
http://homepage.univie.ac.at/christian.stump/Deligne_Looijenga_Letter_09-03-1974.pdf.
[26] F. Digne and T. Gobet, Dual braid monoids, Mikado braids and positivity in Hecke algebras,
Math. Z. 285 (2017), 215–238.
[27] M.J. Dyer, Reflection subgroups of Coxeter systems, J. Algebra 135 (1990), 57–73.
[28] M.J. Dyer and G.I. Lehrer, On positivity in Hecke algebras, Geom. Dedicata 25 (1990), 115–
125.
[29] W. Geigle and H. Lenzing, A class of weighted projective curves arising in representation
theory of finite-dimensional algebras. In Singularities, representation of algebras, and vector
bundles (Lambrecht, 1985) (G.-M. Greuel and G. Trautmann, eds.), LNM 1273, Springer,
Berlin, 1987, 265–297.
[30] F. Götze, H. Kösters and A. Tikhomirov, Asymptotic spectra of matrix-valued functions of
independent random matrices and free probability, Random Matrices Theory Appl. 4 (2015),
1550005 (85 pp).
[31] T. Gobet, On cycle decompositions in Coxeter groups, SLC 78B (2017), Article # 45, 12 pp.
[32] T. Haettel, D. Kielak and P. Schwer, The 6-strand braid group is CAT(0), Geom. Dedicata 182
(2016), 263–286.
[33] D. Happel, A characterization of hereditary categories with tilting object, Invent. Math. 144
(2001), 381–398.
[34] D. Happel and I. Reiten, Hereditary abelian categories with tilting object over arbitrary base
fields, J. Algebra 256 (2002), 414–432.
[35] A. Hatcher, Algebraic Topology, Cambridge University Press, Cambridge, 2002.
[36] A. Hubery and H. Krause, A categorification of non-crossing partitions, J. Eur. Math. Soc.
(JEMS) 18 (2016), 2273–2313.
[37] K. Igusa and R. Schiffler, Exceptional sequences and clusters, J. Algebra 323 (2010), 2183-
2202.
[38] C. Ingalls and H. Thomas, Noncrossing partitions and representations of quivers, Compos.
Math. 145 (2009), 1533–1562.
[39] D. Kazhdan and G. Lusztig, Representations of Coxeter groups and Hecke algebras, Invent.
Math. 53 (1979), 165-184.
[40] H. Kösters and A. Tikhomirov, Limiting spectral distributions of sums of products of non-
Hermitian random matrices, Probab. Math. Statist. 38 (2018), 359–384.
[41] G. Kreweras, Sur les partitions non croisees d’un cycle, Discr. Math. 1 (1972), 333–350.
[42] D. Kussin and H. Meltzer, The braid group action for exceptional curves, Arch. Math. (Basel)
79 (2002), 335–344.
274 B. Baumeister, K.-U. Bux, F. Götze, D. Kielak, H. Krause
[43] A. Licata and H. Queffelec, Braid groups of type ADE, Garside monoids, and the categorified
root lattice, preprint, arXiv:1703.06011.
[44] A. Nica and R. Speicher, Lectures on the Combinatorics of Free Probability, Cambridge Uni-
versity Press, Cambridge, 2006.
[45] V. Reiner, Non-crossing partitions for classical reflection groups, Discr. Math 177 (1997),
195–222.
[46] C.M. Ringel, The braid group action on the set of exceptional sequences of a hereditary
Artin algebra. In Abelian group theory and related topics (Oberwolfach, 1993) (R. Göbel,
P. Hill, and W. Liebert, eds.), Contemp. Math. 171, Amer. Math. Soc., Providence, RI (1994),
pp. 339–352.
[47] K. Saito, Extended affine root systems. I. Coxeter transformations, Publ. Res. Inst. Math. Sci.
21(1) (1985), 75–179.
[48] Y. Shiraishi, A. Takahashi and K. Wada, On Weyl groups and Artin groups associated to
orbifold projective lines, J. Algebra 453 (2016), 249–290.
[49] R.P. Stanley, Enumerative Combinatorics, Volume 1, Cambridge University Press, Cam-
bridge, 1997.
[50] D. Voiculescu, Lectures on free probability theory. In Lectures on Probability Theory and
Statistics (P. Bernard, ed.), LNM 1738, Springer, Berlin, 2000, 279–349.
[51] P. Wegener, Hurwitz Action in Coxeter Groups and Elliptic Weyl Groups, PhD thesis, Univ.
Bielefeld, 2017.
[52] P. Wegener, On the Hurwtiz action in affine groups, preprint, arXiv:1710.06694.
Chapter 12
The derived category of the projective line
H. Krause and G. Stevenson
12.1 Introduction
Ostensibly, this chapter is about the projective line over a field, but secretly it is an
invitation to a discussion of some open questions in the study of derived categories.
More specifically, we are thinking of localising subcategories and to what extent one
can hope for a complete classification. The localising subcategories are of interest
for various reasons. For instance, one can localise a derived category in the sense of
Verdier [26] by annihilating the objects of a localising subcategory; it is a formal ana-
logue of localisation in ring theory. The case of noetherian affine schemes is by now
quite well understood, having been settled by Neeman in his celebrated chromatic
tower paper [19]. However, surprisingly little is known in the simplest non-affine
case, namely the projective line over a field. We seek to begin to rectify this state of
affairs and to advertise this and similar problems.
Let us start by recalling what is known. We write QCoh P 1k for the category of
quasi-coherent sheaves on the projective line P 1k over a field k, and Coh P 1k denotes
the full subcategory of coherent sheaves. There is a complete description of the ob-
jects of Coh P 1k , due to Grothendieck [9]; see [5, §2] for a detailed discussion. The
colimit closed Serre subcategories of QCoh P 1k are known by work of Gabriel [8],
and are parametrised by specialisation closed collections of points of P 1k . When one
passes to the derived category D.QCoh P 1k /, the situation becomes considerably more
complicated. Several new phenomena appear as a result of the fact that one can no
longer non-trivially talk about subobjects and it remains a challenge to provide a
complete classification of localising subcategories, which are the derived analogue of
colimit closed Serre subcategories.
An enticing aspect of this problem is that it not only represents the first stum-
bling block for those coming from algebraic geometry, but also for the representation
1 Project C10
276 H. Krause, G. Stevenson
where Mod A denotes the module category of the finite dimensional k-algebra
h i
2
A D k0 kk
given by 2 2 matrices with the obvious addition and multiplication [6]. The algebra
A is isomorphic to the path algebra of the Kronecker quiver ! ! and is known to
be of tame representation type. The ring A is one of the simplest algebras that is not
of finite representation type, and so understanding its derived category is also a key
question from the point of view of representation theory. In particular, it is known by
work of Ringel [21, 22] that Mod A, the category of all representations, is wild and so
it is very natural to ask if, as in the case of commutative noetherian rings, localisations
can nonetheless be classified.
In this chapter we make a contribution toward this challenge in two different ways.
First of all, one of the main points of this work is to highlight this problem, provide
some appropriate background, and set out what is known. To this end the first part of
the chapter discusses the various types of localisation one might consider in a com-
pactly generated triangulated category and sketches the localisations of D.QCoh P 1k /
which are known.
Our second contribution is to provide a new perspective and new tools. The main
new result is that the subcategories we understand admit a natural intrinsic characteri-
sation: it is shown in Theorem 12.4.12 that they are precisely the cohomological ones.
In the final section, we provide a discussion of the various restrictions that would have
to be met by a non-cohomological localising subcategory. Here, our main results are
that such subcategories come in Z-families and consist of objects with full support
on P 1k .
12.2 Preliminaries
This section contains some background on localisations, localising subcategories, pu-
rity, and the projective line. It also serves to fix notation and recall general terminol-
ogy, and thus may be safely skipped, especially by experts, and referred back to as
needed.
We assume familiarity with the notion of a triangulated category and will not
give the basic definitions. But we seek to provide a brief glossary of some of the
specialised terminology, belonging to the world of triangulated categories with small
coproducts, that will be used throughout. For more fundamental definitions and facts
we refer the reader to the book of Neeman [20].
The derived category of the projective line 277
Some remarks on this are in order. First of all, it follows that i Š is a Verdier
quotient because it has a fully faithful left adjoint, and that there is a localisation
sequence
i j
L o / T o / T=L
iŠ j
to a strictly localising subcategory. Further details can be found in [16]. We will use
the language of (strictly) localising subcategories and localisations interchangeably.
12.2.2 Purity Let T be a compactly generated triangulated category and let Tc de-
note the thick subcategory of compact objects. We denote by Mod Tc the Grothen-
dieck category of modules over Tc , i.e. the category of contravariant additive functors
Tc ! Ab. There is a restricted Yoneda functor
H W T ! Mod Tc defined by HX D Hom. ; X /jTc ; (12.2.1)
which is cohomological, conservative, and preserves both products and coproducts.
12.2.3 The projective line Throughout, we will work over a fixed base field k
which will be supressed from the notation. For instance, P 1 denotes the projective
line P 1k over k. We will denote by the generic point of P 1 . The points of P 1 that are
different from are closed. A subset V P 1 is specialisation closed if it is the union
of the closures of its points. In our situation, this just says that V is specialisation
closed if 2 V implies V D P 1 .
As usual, QCoh P 1 is the Grothendieck category of quasi-coherent sheaves on P 1
and Coh P 1 is the full abelian subcategory of coherent sheaves.
We use standard notation for the usual ‘distinguished’ objects of QCoh P 1 . The
i th twisting sheaf is denoted O .i / and for a point x 2 P 1 we let k.x/ denote the
residue field at x. In particular, k./ is the sheaf of rational functions on P 1 . For an
object X 2 D.QCoh P 1 / or a localising subcategory L we will often write X.i / and
L.i / for X ˝ O.i / and L ˝ O.i / respectively.
All functors, unless explicitly mentioned otherwise, are derived. In particular, ˝
denotes the left derived tensor product of quasi-coherent sheaves and Hom the right
derived functor of the internal hom in QCoh P 1 .
For an object X 2 D.QCoh P 1 / we set
supp X D fx 2 P 1 j k.x/ ˝ X ¤ 0g:
The derived category of the projective line 279
This agrees with the notion of support one gets as in [4] by allowing D.QCoh P 1 / to
act on itself; the localising subcategories generated by k.x/ and x O coincide, where
x denotes the local cohomology functor with respect to x.
Remark 12.2.5. Let A be a hereditary abelian category, for example QCoh P 1 . Then
Extn .X; Y / vanishes for all n > 1 and therefore every object of the derived cate-
gory D.A/ decomposes into complexes that are concentrated in a single degree. It
follows that the functor H 0 W D.A/ ! A induces a bijection between the localising
subcategories of D.A/ and the full subcategories of A that are closed under kernels,
cokernels, extensions, and coproducts.
The smashing subcategories always form a set. Amongst the smashing subcat-
egories, there is a potentially smaller distinguished set of localising subcategories.
Unfortunately, there is no standard way to refer to such categories; the snappy nomen-
clature only exists for the corresponding localisations.
280 H. Krause, G. Stevenson
f g h
X / Y / Z / †X
the sequence
is exact in A.
We can extend this definition to an analogue for arbitrary regular cardinals, with
Definition 12.3.3 being the @0 or ‘base’ case. The idea is to relax the exactness con-
dition on the target abelian category. This requires a little terminological preparation.
Let J be a small category and ˛ a regular cardinal. We say that J is ˛-filtered if for
every category I with jI j < ˛, i.e. I has fewer than ˛ arrows, every functor F W I ! J
has a cocone. For instance, this implies that any collection of fewer than ˛ objects of
The derived category of the projective line 281
J has an upper bound and any collection of fewer than ˛ parallel arrows has a weak
coequaliser. If ˛ D @0 we just get the usual notion of a filtered category.
Let A be an abelian category. We say it satisfies (AB5˛ ) if it is cocomplete and
has exact ˛-filtered colimits.
Proof. Suppose L is smashing. Then T=L is compactly generated and for H we can
take the composite
T ! T=L ! Mod.T=L/c
where the latter functor is the restricted Yoneda functor (12.2.1).
Proof. This follows by applying [16, Thm. 7.1.1] and then [16, Thm. 7.4.1].
Proof. We have just seen that an ˛-cohomological localising subcategory has a gen-
erating set. On the other hand if L is generated by a set of objects then L is well-
generated, and so is strictly localising, and the quotient T=L is also well-generated
(see [16, Thm. 7.2.1]). One can then compose the quotient T ! T=L with the uni-
versal functor from T=L to an (AB5˛ ) abelian category, for a sufficiently large ˛, to
get the required cohomological functor.
Let us now restrict to cohomological localisations and make the connection to
purity in triangulated categories.
282 H. Krause, G. Stevenson
12.3.3 When things are sets As has been alluded to in the previous sections, it
is a significant subtlety that one does not usually know the class of all localising
subcategories forms a set. In fact there is no example where one knows that there are
a set of localising subcategories by ‘abstract means’; all of the examples come from
classification results.
If one does know there is a set of localising subcategories then life is much eas-
ier. The purpose of this section is to give some indication of this, and record some
other simple observations. Everything here should be known to experts, but these
observations have not yet found a home in the literature.
As previously let T be a compactly generated triangulated category.
Lemma 12.3.10. If the localising subcategories of T form a set then every localising
subcategory is generated by a set of objects (and hence by a single object).
Proof. Suppose, for a contradiction, that L is a localising subcategory of T which
is not generated by a set of objects. We define a proper chain of proper localising
subcategories
L0 ¨ L1 ¨ ¨ L˛ ¨ L˛C1 ¨ ¨ L;
each of which is generated by a set of objects, by transfinite induction. For the base
case pick any object X0 of L and set L0 D Loc.X0 /. This is evidently generated by
a set of objects, namely fX0 g. By assumption L is not generated by a set of objects
so L0 ¨ L. Suppose we have defined a proper localising subcategory L˛ of L which
is generated by a set of objects. Since L˛ is proper we may pick an object X˛C1 in L
but not in L˛ and set
L˛C1 D Loc.L˛ ; X˛C1 / © L˛ :
This is clearly still generated by a set of objects and hence is still a proper subcategory
of L. For a limit ordinal we set
L D Loc.L j < /:
Again this is generated by a set of objects (and so is still not all of L).
The derived category of the projective line 283
Remark 12.3.11. The above argument does not use that T is compactly generated.
It is valid for any triangulated category with infinite coproducts.
One then deduces that all localisations are cohomological for an appropriate car-
dinal.
Lemma 12.3.12. If the localising subcategories of T form a set then every localising
subcategory of T is ˛-cohomological for some regular cardinal ˛.
Proof. By the previous lemma, the hypothesis imply that every localising subcate-
gory of T is generated by a set of objects. It then follows from Corollary 12.3.8 that
they are all cohomological.
One can, to some extent, also work in the other direction. Then the following
result is of interest.
Proposition 12.3.13. For every regular cardinal ˛ the collection of localising sub-
categories of T that are ˛-cohomological forms a set.
forms a set.
3. There exists a regular cardinal such that every localising subcategory of T
is -cohomological.
Remark 12.3.15. Corollary 12.3.14 and the preceding lemmas apply, verbatim, to a
well-generated triangulated category. In fact the proofs go through, essentially with-
out modification. The results we use from [16] apply to well-generated categories.
12.3.4 State of the art Let us conclude this section with a brief description of what
is known concerning the various conditions we have discussed. It is perhaps more
honest to say that we pose a number of questions concerning these definitions.
Again the answers are not known. The former question is known to have an
affirmative answer (in even greater generality than we have asked it), as proved in
[7], provided Vopěnka’s principle is assumed to hold. For the uninitiated, Vopěnka’s
principle is a very strong large cardinal axiom. This is a remarkable achievement
The derived category of the projective line 285
and little is known in the absence of strong set-theoretic assumptions outside of cases
where we have a classification.
As far as we are aware there is no example of a localising subcategory which is
not cohomological with respect to some cardinal and nothing is known concerning
our final question outside of cases where there is a classification of localising subcat-
egories.
We refer to [2] for details about tilting. Note that A is isomorphic to the path algebra
of the Kronecker quiver ! ! and this algebra is known to be of tame representation
type. In fact, the representation theory of this algebra amounts to the classification
of pairs of k-linear maps, up to simultaneous conjugation. The finite-dimensional
representations were already known to Kronecker [18].
12.4.1 Thick subcategories of the bounded derived category The structure of the
lattice of thick subcategories of Db .Coh P 1 /, which we recall in this section, has been
known for some time; it can be computed by hand using the fact that Coh P 1 is tame
and hereditary.
The structure of the coherent sheaves on P 1 is well known: there is a Z-indexed
family of indecomposable vector bundles and a 1-parameter family of torsion sheaves
for each closed point on P 1 .
286 H. Krause, G. Stevenson
♥♥ P
❅PP
♥♥♥♥♥⑦⑦⑦⑦ ❅❅❅PPPPP
♥
PPP ❅
PPP❅❅❅ ⑦⑦⑦♥♥♥♥♥
PP ⑦♥♥♥
This is a special case of a general result because the indecomposable vector bun-
dles are precisely the exceptional objects of Db .Coh P 1 /. For any hereditary artin
algebra A the thick subcategories of Db .mod A/ that are generated by exceptional
objects form a poset which is isomorphic to the poset of non-crossing partitions given
by the Weyl group W .A/; see [10, 11] and Theorem 11.6.2. Note h thati W .A/ is an
Q 2
affine Coxeter group of type A1 for the Kronecker algebra A D k0 kk , keeping in
mind the derived equivalence
Db .Coh P 1 / ! Db .mod A/:
The thick tensor ideals are classified by Spc Db .Coh P 1 / Š P 1 , where the space
Spc Db .Coh P 1 / is meant in the sense of Balmer [3], and its computation is a special
case of a general result of Thomason [25]. What all this boils down to is that for any
set of closed points V of P 1 there is a thick tensor ideal
DbV .Coh P 1 / WD fE j supp E V g D Thick.k.x/ j x 2 V /
consisting of complexes of torsion sheaves supported on V . Moreover, together with
0 and Db .Coh P 1 /, this is a complete list of thick tensor ideals. One can make this uni-
form by considering subsets of P 1 which are specialisation closed. In this language,
by extending the above notation to allow Db¿ .Coh P 1 / D 0 and DbP 1 .Coh P 1 / D
Db .Coh P 1 /, we have a lattice isomorphism
fthick tensor ideals of Db .Coh P 1 /g ! fspc subsets of P 1 g;
where ‘spc’ is an abbreviation for ‘specialisation closed’, which is given by
[
I 7! supp I D supp E and V 7! DbV .Coh P 1 /
E 2I
We know every object of Db .Coh P 1 / is a direct sum of shifts of line bundles and
torsion sheaves and so one can readily combine these classifications to obtain a lattice
isomorphism2
fthick subcategories of Db .Coh P 1 /g ! fspc subsets of P 1 g q Z:
The verification that the evident bijection is indeed a lattice map as claimed is
elementary: the twisting sheaves are supported everywhere so are not contained in
any proper ideal, and any twisting sheaf and a torsion sheaf, or any pair of distinct
twisting sheaves, generate the category. Thus for i ¤ j and V proper non-empty and
specialisation closed in P 1 we have
and
12.4.2 Ideals and smashing subcategories We now describe the localising sub-
categories that one easily constructs from our understanding of the compact objects
Db .Coh P 1 / in D.QCoh P 1 /.
By [17], the smashing conjecture holds for D.QCoh P 1 / (our computations will
also essentially give a direct proof of this fact). Thus the finite localisations one
obtains by inflating the thick subcategories listed above exhaust the smashing locali-
sations i.e.
fthick subcategories of Db .Coh P 1 /g !
fsmashing subcategories of D.QCoh P 1 /g
as lattices.
The localising ideals are also understood. Again this is known more generally
(there is such a classification for any locally noetherian scheme, see [1]) but can
easily be computed by hand for P 1 . The precise statement is that there is a lattice
isomorphism
1
flocalising tensor ideals of D.QCoh P 1 /g ! 2P
1
where 2P denotes the powerset of P 1 with the obvious lattice structure. The bijection
is given by the assignments
L 7! fx 2 P 1 j k.x/ ˝ L ¤ 0g
2 Let L0 ; L00 be a pair of lattices with smallest elements 00 ; 000 and greatest elements 10 ; 100 . Then L0 qL00
denotes the new lattice which is obtained from the disjoint union L0 [ L00 (viewed as sum of posets) by
identifying 00 D 000 and 10 D 100 .
288 H. Krause, G. Stevenson
ix W Spec OP 1 ;x ! P 1 :
Remark 12.4.5. The Prüfer sheaves and k./ are precisely the indecomposable in-
jective quasi-coherent sheaves.
290 H. Krause, G. Stevenson
supp E .x/ D fxg; supp k./ D fg; and supp A.x/ D fx; g:
Proof. All of these sheaves are pushforwards along the inclusions of the spectra
of local rings at points, and so their supports are contained in the relevant subset
Spec OP 1 ;x . Having reduced to computing the support over OP 1 ;x this is then a stan-
dard computation.
As one would expect the localisations Loc.O .i // are particularly simple.
where the adjoints exist since O .i / is exceptional and the computation of the right
orthogonal follows from the computation of the cohomology of P 1 . Applying Thoma-
son’s localisation theorem shows that
Lemma 12.4.8. Let V be a set of closed points with complement U . Then, the inde-
composable pure-injective sheaves in V D.QCoh P 1 /? are:
the indecomposable coherent sheaves supported at closed points in U ;
the Prüfer sheaves E .x/ for x 2 U ;
the adic sheaves A.x/ for x 2 U ;
the sheaf of rational functions k./.
The derived category of the projective line 291
Lemma 12.4.9. Let V be a subset of P 1 containing the generic point and denote
its complement by U . Then the indecomposable pure-injective sheaves in
V D.QCoh P 1 /? are:
the indecomposable coherent sheaves supported at closed points in U ;
the adic sheaves A.x/ for x 2 U .
Proof. The sheaf of rational functions k./ has a map to every indecomposable injec-
tive sheaf and so no E .x/ is contained in the right perpendicular category (and clearly
k./ is not). The category V D.QCoh P 1 / contains the torsion and adic sheaves for
points in V so the only indecomposable pure-injective sheaves which could lie in the
right perpendicular are those indicated; it remains to check they really don’t receive
maps from objects of V D.QCoh P 1 /.
This is clear for the residue fields k.x/ for x 2 U , as they cannot receive a map
from any of the residue fields generating V D.QCoh P 1 /. Since the right perpendicu-
lar is thick it thus contains all the indecomposable coherent sheaves supported on U .
Moreover, the right perpendicular is closed under homotopy limits and so contains
the corresponding adic sheaves A.x/.
We now know which subsets of indecomposable pure-injectives occur in the right
perpendiculars of the localising subcategories we understand. It’s natural to ask for
the minimal set giving rise to one of these categories, as in Corollary 12.4.3. Let us
make the convention that for an object E 2 D.QCoh P 1 /
?
E D fF 2 D.QCoh P 1 / j Hom.F; †j E/ D 0 8j 2 Zg:
We can, without too much difficulty, compute all of the left perpendiculars of the
indecomposable pure-injectives.
Lemma 12.4.10. The left perpendicular categories to the suspension closures of the
indecomposable pure-injectives are as follows:
?
1. F D P 1 nfxg D.QCoh P 1 / for any F 2 Coh P 1 supported at x 2 P 1 ;
?
2. O.i / D Loc.O.i C 1//;
?
3. E .x/ D P 1 nfx;g D.QCoh P 1 /;
?
4. A.x/ D P 1 nfxg D.QCoh P 1 /;
?
5. k./ D P 1 nfg D.QCoh P 1 /.
Knowing this it is not hard to write down minimal sets of pure-injectives deter-
mining the ideals.
We also now have enough information to confirm that we have a complete list of
the cohomological localising subcategories.
That is, the cohomological localising subcategories are precisely the localising ideals
and the Loc.O .i // for i 2 Z.
Lemma 12.5.1. Let y be a closed point of P 1 and let X 2 D.QCoh P 1 / be such that
y … supp X . Then, L D Loc.X / is an ideal.
Hom.k.y/; X / Š 0:
L Loc.k.y//? Š D.QCoh A1 /:
Lemma 12.5.2. The category V D.QCoh P 1 / is generated by its tensor unit and
hence every localising subcategory contained in it is an ideal in it, and thus a sub-
module for the D.QCoh P 1 / action. In particular, every localising subcategory of
D.QCoh P 1 / contained in V D.QCoh P 1 / is an ideal of D.QCoh P 1 /.
Proof. The subset V is discrete, in the sense that there are no specialisation relations
between any distinct pair of points in it. It follows from [24] that V D.QCoh P 1 /
decomposes as Y
V D.QCoh P 1 / Š x D.QCoh P 1 /:
x2V
L
With respect to this decomposition, the monoidal unit V O is just x2V x O , which
clearly generates. It then follows that every localising subcategory of
V D.QCoh P 1 / is an ideal (see for instance [23, Lemma 3.13]) and from this the
remaining statements are clear.
294 H. Krause, G. Stevenson
Proof. Since the property of being generated by the tensor unit is preserved under
taking quotients it is enough to verify the statement when L has support a single
point. If supp L is a closed point, we can identify T with the derived category of
the open complement, which is isomorphic to A1 . Having made this observation the
conclusion follows immediately.
It remains to verify the lemma in the case that supp L D fg. In this situation
there is a recollement
/ /
V D.QCoh P 1 / o 1
/ D.QCoh P / o / L
where, as above, V denotes the set of closed points of P 1 . The bottom four arrows
identify V D.QCoh P 1 / with the quotient T and the desired conclusion is given by
Lemma 12.5.2.
Combining all of this we arrive at the following proposition.
Example 12.5.6. The non-ideals we know, namely the Loc.O .i //, are of course com-
patible with the proposition: every object of Loc.O .i // is just a sum of suspensions
of copies of O .i /, and these are all supported everywhere.
12.5.2 Twisting sheaves and avoiding compacts We next make a few comments
concerning the interactions between localising subcategories and the twisting sheaves.
Proof. Without loss of generality we may assume i > 0. Suppose, for a contradic-
tion, that X 2 L \ L.i / is non-zero. Pick a closed point y and consider a triangle
O. i / ! O ! Z.y/ ! †O. i /
where Z.y/ is the cyclic torsion sheaf of length i supported at y. We can tensor with
X to get a new triangle
X. i / ! X ! X ˝ Z.y/ ! †X. i /;
where both X and X. i / lie in L by hypothesis. Thus, since L is localising, we
see that X ˝ Z.y/ lies in L. By Proposition 12.5.5 we know that X is supported
everywhere and so X ˝Z.y/ ¤ 0. But on the other hand, X ˝Z.y/ is supported only
at y which, by the same Proposition, implies that L is an ideal yielding a contradiction.
Proof. The indecomposable compact objects are just the indecomposable torsion
sheaves at each point and the twisting sheaves. By Lemma 12.5.1, we know L cannot
contain a torsion sheaf and, by the last lemma, it cannot contain a twisting sheaf.
References
[1] L. Alonso Tarrío, A. Jeremías López and M.J. Souto Salorio, Bousfield localization on formal
schemes, J. Algebra 278 (2004), 585–610.
[2] L. Angeleri Hügel, D. Happel and H. Krause (eds.) Handbook of tilting theory, London Math-
ematical Society Lecture Note Series 332, Cambridge University Press, Cambridge, 2007.
[3] P. Balmer, The spectrum of prime ideals in tensor triangulated categories, J. Reine Angew.
Math. (Crelle) 588 (2005), 149–168.
[4] P. Balmer and G. Favi, Generalized tensor idempotents and the telescope conjecture, Proc.
Lond. Math. Soc. 102 (2011), 1161–1185.
[5] P. Baumann and C. Kassel, The Hall algebra of the category of coherent sheaves on the pro-
jective line, J. Reine Angew. Math. (Crelle) 533 (2001), 207–233.
[6] A.A. Beı̆linson, Coherent sheaves on Pn and problems in linear algebra, Funktsional. Anal. i
Prilozhen. 12 (1978), 68–69.
[7] C. Casacuberta, J. Gutiérrez and J. Rosický, Are all localizing subcategories of stable homo-
topy categories coreflective?, Adv. Math. 252 (2014), 158–184.
[8] P. Gabriel, Des catégories abéliennes, Bull. Soc. Math. France 90 (1962), 323–448.
[9] A. Grothendieck, Sur la classification des fibrés holomorphes sur la sphere de Riemann, Amer.
J. Math. 79 (1957), 121–138.
[10] A. Hubery and H. Krause, A categorification of non-crossing partitions, J. Eur. Math. Soc.
(JEMS) 18 (2016), 2273–2313.
[11] C. Ingalls and H. Thomas, Noncrossing partitions and representations of quivers, Compos.
Math. 145 (2009), 1533–1562.
[12] S.B. Iyengar and H. Krause, The Bousfield lattice of a triangulated category and stratification,
Math. Z. 273 (2013), 1215–1241.
[13] C.U. Jensen and H. Lenzing, Model-Theoretic Algebra with Particular Emphasis on Fields,
Rings, Modules, Gordon and Breach, New York, 1989.
[14] B. Keller, A remark on the generalized smashing conjecture, Manuscripta Math. 84 (1994),
193–198.
[15] H. Krause, Smashing subcategories and the telescope conjecture—an algebraic approach, In-
vent. Math. 139 (2000), 99–133.
The derived category of the projective line 297
The properties of a group to be finitely generated or finitely presentable are the first
two instances in a sequence of so called higher finiteness properties defined in terms
of skeletons of classifying spaces. The study of higher finiteness properties is a prime
example of how one can use a nice action of a group on a topological space to better
understand the group. We shall illustrate this method in detail using the braided
Thompson group V br as an example.1
Fact 13.1.1 ([20, Prop. 7.1.5 and Cor. 7.1.7]). Let G be a group. There exists a
connected CW-complex Y with contractible universal cover such that G D 1 .Y /.
Such a cell complex Y is called a classifying space for the group G. A classifying
space for G is unique up to homotopy equivalence.
invariants can therefore be obtained from it. In particular, the group (co)homology of
G can be regarded as the (co)homology of its classifying space. See [10, Prop. II.4.1]
or [20, Prop. 8.1.4] for homology and [10, Sec. III.1, page 59] or [20, Prop. 13.1.1]
for cohomology.
Numerical measures of complexity for groups can be obtained by leveraging the
fact that the classifying space is only unique up to homotopy equivalence.
So, if G has geometric dimension d , then its (co)homology coincides with the
(co)homology of a CW-complex of dimension d . In particular, the (co)homology of G
is trivial in dimensions above d . Similarly, if G is of type Fm , its (co)homology can be
computed from a CW-complex with finite m-skeleton. In particular, the (co)homology
is finitely generated up to dimension d . Often, but not always, the converses hold.
So, one a heuristic level, one might think of the dimension of a group G as marking
the dimension where the homology collapses for good, whereas .G/ C 1 marks the
first dimension where the homology explodes. Note that such an explosion may very
well be followed by an ultimate collaps. So, finiteness length and dimension may
both be finite.
Since a CW-complex with finite 5-skeleton automatically has a finite 4-skeleton,
we observe that a group of type F5 automatically has type F4 . Hence, the types form
a sequence of finiteness properties of groups of increasing strength. The first few of
these finiteness properties are familiar.
Fact 13.1.3. Every group has type F0 . A group is of type F1 if and only if it is finitely
generated; and is it of type F2 if and only if it is finitely presented.
Observation 13.2.1. The group G is of type Fm if and only it acts freely and cocom-
pactly on a CW-complex X whose homotopy groups d .X / vanish in dimensions
d < m.
A space whose first homotopy groups vanish is called highly connected. More
precisely, X is called l-connected if d .X / is trivial for d 6 l. We adopt the conven-
tion that every non-empty space is considered . 1/-connected. Note the difference
between the strict inequality in the criterion (13.2.1) and the non-strict inequality in
the definition of connectivity. This unfortunate difference causes a shift by one in
many theorems.
Brown’s criterion improves upon Observation 13.2.1 in two ways. First, he re-
moves the restriction to free actions. Instead, cell stabilisers are just required to have
suitable finiteness properties in their own right. In addition, the action is not required
to have a compact quotient. This condition is replaced by considering a filtration by
invariant and cocompact subcomplexes.
So let G act on a CW-complex X by cell-permuting homeomorphisms. We call X
m-good if the following conditions are satisfied:
1. The space X is .m 1/-connected, i.e., the homotopy groups d .X / are trivial
for d < m.
2. The stabiliser G of any d -cell of dimension d 6 m is of type Fm d.
Note that X is not assumed to have a compact quotient mod G. Instead, we consider
a filtration
X0 6 X1 6 X2 6
S
of X by G-invariant subcomplexes X˛ 6 X with X D ˛ X˛ where each X˛ has a
compact quotient mod G.
302 K.-U. Bux
Of course, some conditions on homotopy groups will be relevant. We call the fil-
tration .X˛ /alpha essentially homotopically trivial in dimension d if for every filtration
index ˛ 2 N there is a filtration index ˇ > ˛ so that the induced homomorphism
d .X˛ / ! d .Xˇ /
Theorem 13.2.2 ([11, Thms 2.2 and 3.2]). Assume that X is an m-good G-complex
with a filtration by G-invariant cocompact subcomplexes X˛ as above. The group G
is of type Fm if and only if the filtration .X˛ /˛ is essentially .m 1/-connected.
Corollary 13.2.3 ([11, Cor. 3.3]). Assume that X is a contractible G-complex with
cells stabilisers of type F1 and a filtration by G-invariant cocompact subcomplexes
X˛ . Suppose the connectivity of the pair of spaces .X˛C1 ; X˛ / tends to infinity as ˛
tends to 1. Then, G is of type F1 .
The criterion clearly suggests a strategy for establishing higher finiteness proper-
ties for a group G. An established pattern runs as follows:
1. Find a highly connected, usually a contractible, space X upon which G acts
with cell stabilisers that are of type F1 . Not only finiteness properties can be
deduced this way. Often, more detailed (co)homological information is hid-
den in this action. Important examples include: symmetric spaces and affine
buildings for arithmetic groups [6, 7, 13], Teichmüller space and Harer’s arc
complex for mapping class groups [24, 22, 23], outer space for outer automor-
phism groups of free groups [14]. As these examples show, often a good deal
of work is spent on the construction of a good space for a given group to act on
and establishing its topological structure.
2. Find a filtration by G-invariant subspaces X˛ that have compact quotient mod
G. This often takes the form of finding a G-invariant height function h W X !
R, mimicking those on locally-symmetric spaces defined by (14.3.3). The fil-
tration is then given by sublevel complexes, i.e., full subcomplexes spanned by
the set of vertices u 2 X with height h.u/ 6 s.
3. Show that the filtration .X˛ /˛ is essentially .m 1/-connected. If the filtration
is given by a height function h W X ! R this part of the argument usually
Higher finiteness properties of braided groups 303
Example 13.3.1. A piecewise Euclidean complex where all cells are unit cubes is
called a cube complex. The link of any vertex in a cube complex is a piecewise
spherical complex where all cells are right angled spherical simplices.
Fact 13.3.3. The sublevel set f h 6 s g deformation retracts onto the sublevel com-
plex Xs . Hence, both are homotopy equivalent.
Lemma 13.3.5 (Morse Lemma [3, Lemma 2.5]). Assume that there is no critical
value in the interval .s; t. Then the inclusion of sublevel sets f h 6 s g f h 6 t g
is a homotopy equivalence. Therefore, the same holds for the inclusion Xs Xt of
sublevel complexes.
Assume that there is precisely one critical value t in the interval .s; t, then the
sublevel set f h 6 t g is homotopy equivalent to the sublevel set f h 6 s g with the
descending links of vertices of height t coned off.
As the Morse Lemma 13.3.5 allows one to control how the homotopy type of
sublevel complexes changes, one can use it to deduce connectivity properties of a
filtration by sublevel complexes.
Corollary 13.3.6 ([3, Cor. 2.6]). Let h W X ! R be a Morse function and sup-
#
pose that the descending link lkX .u/ is l-connected for each vertex u 2 X with
s < h.u/ 6 t. Then, the inclusion of sublevel complexes Xs 6 Xt induces isomor-
phisms in d for d 6 l and an epimorphism in lC1 .
Example 13.3.7. Let be a tripod. We define the Morse function on sending the
central vertex to 0 one terminal vertex to 1 and the other two to 1. The product
X WD of two tripods is a square complex shown in Figure 13.3.1. On X , we
consider the Morse function obtained by adding the heights of the two coordinates.
Some preimages of non-critical values are shown as well as the link and descending
link of the central vertex.
Higher finiteness properties of braided groups 305
h
!
Figure 13.3.1. The product of two tripods is a square complex. Here it is is drawn according to a
height function. Point preimages are horizontal sections, shown on the left for some non-critical
values. The sublevel sets start at the bottom a union of four contractible components. Between the
lowest two critical values, sublevel sets are homotopy equivalent to a circle. This circle is coned
off as the descending link of the middle vertex (shown as solid arcs; the dotted arcs show the
non-descending parts of the link). Above this critical value, sublevel sets are contractible.
Example 13.4.1 (e.g. [5, Thm. 4.1]). For the complete graph D Kn on n ˘nodes,
the matching complex Mn WD M./ is .n/-connected where .n/ D nC1 3
2.
Example 13.4.2 (e.g. [5, Thm. 1.1]). For the complete bipartite graph D Km;n on
m white and n black nodes, the matching complex Mm;n WD M./, called the chess
board complex, is .m; n/-connected where
mCnC1
.m; n/ D min m; n; 2:
3
Remark 13.4.3. Matching complexes are just one example of complexes based upon
graphs. A construction similar in spirit is to consider sub-forests of , i.e., vertices
are non-loop lines and a collection of lines forms a simplex if their union does not
306 K.-U. Bux
Now, we turn to arc matching complexes, which encode the possible ways of
drawing a matching onto a given surface. Let † be a connected surface possibly with
punctures and boundary components. Let N be a set of marked points, possibly on
the boundary, and let be a graph with N as its set of nodes. An arc is a curve
connecting to marked points in N . A drawing of onto † is continuous injective
map
'W !†
such that ' restricts to the identity on N and '.nN / does not contain boundary
points nor marked points. Each line of is mapped to an arc on †. The injectivity
condition says that arcs do not self-intersect and do not meet each other apart from
their endpoints. We consider arcs and drawings up to isotopy relative to marked
points, punctures and boundary.
As soon as its Euler characteristic is negative, the surface † can be endowed with
a hyperbolic metric with geodesic boundary components. Since we consider isotopy
relative to marked points, we may also think of them as punctures. Then, a hyperbolic
metric exists in any non-boring situation, e.g., as soon as we have three marked points
(now thought of as punctures metrically turned into cusps). In the examples below,
three marked points are necessary to have more than one arc.
In the presence of a hyperbolic metric, we can represent each arc by the unique
geodesic in its homotopy class (relative endpoints). This uniqueness implies that
two different geodesic arcs between the same points cannot bound a disk. Also
geodesic arcs coincide or intersect transversally as geodesic germs extend uniquely
in both directions. By the bigon criterion [17, 1.2.4], geodesic representatives of arcs
are in minimal position, i.e., the geometric intersection number of their homotopy
classes coincides with the number of intersection points of the geodesic representa-
tives. Thus:
The arc matching complex A.; †/ is the simplicial complex whose simplices are
drawings (up to relative homotopy) of matchings in onto †. In this definition, Ob-
servation 13.4.4 is implicitly used: in a simplicial complex, a simplex is determined
by its vertices. If drawings were not uniquely determined by their arcs, we could only
define a CW-complex whose cells have the geometric shape of simplices. We remark
that A. Hatcher and K. Vogtmann consider very similar complexes in [21]. They do
not require the arcs to be fully disjoint but allow that arcs meet at their end points.
For the arc matching complex on a complete graph, one can deduce the same
connectivity as for its matching complex.
Higher finiteness properties of braided groups 307
Figure 13.4.1. An arc matching in A† .5/ on the disk with its corresponding matching in M5 .
Example 13.4.5 (see [12, Prop. 3.6]). Let WD Kn be the complete graph on n
nodes, all of which shall be interior points of the connected surface †. Then,
the˘ arc
matching complex A† .n/ WD A.; †/ is .n/-connected, where .n/ D nC1 3
2
as in Example 13.4.1.
Remark 13.4.6. One might consider other arc matching complexes also based upon
the complete graph Kn , the difference being the positioning of the nodes in the sur-
face. We place all marked points in the interior of the surface. One can also put
some (or even all) marked points on the boundary, and connectivity of the resulting
complex will in general be different (and usually worse).
Example 13.4.7. Let † be a connected surface with one boundary component. Let
D Km;n be the complete bipartite graph on m white and n black nodes. Let
A† .m; n/ be the arc matching complex in † based on where all white nodes are
placed on the boundary of † and all black nodes are placed in
the ˘interior of †. Then
A† .m; n/ is .m; n/-connected, where .m; n/ WD min.m; nC1 2
/ 2.
Figure 13.4.2. An arc matching in A† .4; 6/ on the disk with its corresponding matching in M4;6 .
arc matching complex has the same connectivity as its non-surface counterpart from
Example 13.4.2.
Remark 13.4.9. The arc matching complex A.; †/ projects simplicially to the
matching complex M./ simply by ignoring the way the matching is drawn. It
is tempting to make use of this projection in establishing connectivity of arc match-
ing complexes. Unfortunately, the fibers of this projection are badly behaved even in
those cases where connectivity of M./ and A.; †/ agree. So far, it appears that
this approach is not feasible.
Proposition 13.5.1 (see [31, A.5]). Let G act on a simplicial complex X with a
maximal simplex C as a strict fundamental domain. Then, X is isomorphic (as a
simplicial complex) to the coset complex CC.F / where
Example 13.5.2. The symmetric Sn group on n letters acts on the vertex set of Kn
and therefore on the associated matching complex Mn and on its barycentric subdi-
vision M V n . A simplex of the barycentric subdivision M V n is a flag of simplices in
Mn .
Note that up to an action of Sn , a matching is uniquely characterised by the num-
ber of lines it uses. Hence the action on M V n has a maximal simplex as a strict
fundamental domain, e.g., we can choose the "obvious" flag:
f 1; 2 g
f 1; 2 g f 3; 4 g
f 1; 2 g f 3; 4 g f 5; 6 g
::
:
Example 13.5.3. The symmetric group Sn on n letters acts on the bipartite complete
graph Km;n on m white nodes w1 ; : : : ; wm and n black nodes b1 ; : : : ; bn by permuting
the black nodes. We assume m 6 n. In this case, the matching
w1 b1 w2 b2 wm bm
is a fundamental domain for the action on the chess board complex Mm;n .
The stabiliser of the vertex wi bi is the subgroup StabSn .i / isomorphic to Sn 1.
Thus, we conclude from Proposition 13.5.1 and Example 13.4.2:
The family f StabSn .i / j 1 6 i 6 m g is ..m; n/ C 1/-generating for
Sn .
Example 13.5.4. Now consider the complete bipartite graph Km;n on m white and
n black nodes. Let † be a disk. Consider the arc matching complex A† .m; n/ from
Example 13.4.7.
Assume m 6 n. Then, the braid group Bn acts on A† .m; n/ with a maximal
simplex (a complete matching) as a strict fundamental domain. This action of the
braid group Bn arises as follows.
A braid records a motion of the n punctures within the disk. We think of the disk
being made out of elastic material so that the moving punctures drag ambient points
310 K.-U. Bux
along. We keep points on the boundary fixed. This way, a braid defines a mapping
class of the n-punctured disk relative to the boundary. This point pushing map induces
an isomorphism of the braid group Bn and the mapping class group of an n-punctured
disk, where the boundary is fixed pointwise and the punctures are fixed as a set. The
mapping class group clearly acts on the arc matching complex A.m; n/.
Stabilisers of its vertices (matchings between a single white node and a single
black node) are isomorphic copies of braid groups Bn 1 inside Bn . A mapping class
that stabilises an arc can be regarded as a mapping class on the surface obtained by
cutting along the arc. This cut yields another dist, but we are missing a puncture.
Let Bm;n be the family of these vertex stabilisers. Now, Proposition 13.5.1 applies,
and we have:
The family Bm;n is .l C 1/-generating for Bn if and only if A† .m; n/ is
l-connected.
As we know a lower bound on the connectivity of A† .m; n/ from Example 13.4.7,
we conclude that Bm;n is ..m; n/ C 1/-generating for the braid group Bn .
D D
Of course, these rules are taken from the well known presentation of Bn .
Now, we consider diagrams where strands are also allowed to split and merge.
Since we do not want strands to twist (invisibly in the picture) we require that splits
and merges only take place in the xz-plane. Also, a strand can only split in two; and
one of the resulting strands is the left branch whereas the other is the right branch.
The same rule applies to merges. Equivalence is defined by ambient isotopy with two
additional rules. A split followed immediately by a merge of its resulting branches
is as good as not splitting at all. That is, one may remove an eye. An eye is a disk
lying entirely in the xz-plane bounded by two branches following a split and merging
afterwards that is not intersected by any other strand of the diagram. Similarly, a
Higher finiteness properties of braided groups 311
Figure 13.6.1. These diagrams do not have eyes. In the left picture, the disk bounded by the
branches twists; in the right example, another strand passes through the disk.
merge can be undone by a split that follows. The two allowed moves are called eye
removal and X removal:
D D
With splits and merges interfering, the number of strands in a diagram is not con-
stant. An .m; n/-diagram has m strands at the top (heads) and n strands at the bottom
(feet). Clearly, .n; n/-diagrams can be stacked. They form a group, whose isomor-
phism type does not depend on the number of heads and feet. More generally, we can
consider the category B whose objects are counting numbers 1; 2; 3; : : : and where
the .m; n/-diagrams are the morphisms from m to n. The braided Thompson group
V br is defined as the automorphism group of the object 1 in this category. Its ele-
ments are thus represented by .1; 1/-diagrams. An example is shown in Figure 13.6.2.
This braided version of Thompson’s group V has been introduced independently by
M. Brin [9] and P. Dehornoy [16].
Any .1; 1/-diagram can be put into standard form. By applying eye and X re-
movals, one can achieve that the diagram consists of a sequence of splits (forming
a binary tree in the xz-plane) followed by a braid and concluded by a sequence of
merges (again forming a binary tree in the xz-plane).
As in the case of braid diagrams, one can restrict the form of allowed diagrams
and replace the equivalence of ambient isotopy by purely combinatorial requirements.
In addition to the braid moves and removal of eye and X sub-diagrams, we then need
moves that allow us to move splits and merges past one another and past crossings.
Here is a pictorial representation of some new moves one needs to consider:
D D
Whereas in the case of braid groups the moves correspond to the relations of a finite
presentation, it is not obvious that one can extract a finite presentation for V br from
312 K.-U. Bux
these rules. Not even a finite generating set suggests itself: diagrams can grow to
arbitrarily many strands; and splits, merges and crossings far to the right need to be
expressed in terms of moves closer to the left. Nonetheless, M. Brin [8, Thm. 2] gave
a finite presentation for V br . We can strengthen his result.
Theorem 13.6.1 (see [12, Main Thm.]). The braided Thompson group V br is of
type F1 , i.e., the group V br admits a classifying space with finite skeleta in each
dimension.
We also remark that V br has torsion and therefore cannot admit a finite classifying
space. In fact, it does not have a torsion free subgroup of finite index. Thus, a
classifying space that is finite in each dimension is the best one can hope for.
In the remainder of this paper, we shall outline how Brown’s criterion can be
used to prove Theorem 13.6.1. For various Thompson groups, this method has been
carried out successfully. The complexes we shall introduce below are inspired by the
complexes used by M. Stein [27] for Thompson groups and D. Farley [18] for the
larger class of diagram groups.
Bn -orbits of MorB .1; n/. Informally, modding out the action of Bn on MorB .1; n/
amounts to consider .1; n/-diagrams up to dangling feet.
Observation 13.7.1. The group V br acts simply transitively on MorB .1; n/. The
induced action on Dn is transitive and its point stabilisers are isomorphic to Bn .
We shall now construct a cube complex X with the set of all diagrams with dan-
gling feet
D WD D1 t D2 t D2 t
as its set of vertices. We glue in edges corresponding to the splitting of feet. We call
the splitting of a single foot in a diagram from Dn an elementary expansion. As the
number of feet increases by one, an elementary expansion yields a diagram in DnC1 .
We fill in all the edges from a diagram to its elementary expansions. A diagram with
n feet allows for exactly n different elementary expansions, thus n edges issue from
it. Of course, it might also have incoming edges as it can be obtained as an elementary
expansion from other diagrams.
Higher dimensional cubes arise from commuting elementary expansions. Con-
sider a diagram ‚ with n feet. For any selection of them, the diagram obtained from
‚ by splitting all the chosen feet does not depend on the order in which the strands
are split. This means that the total of
! ! ! !
n n n n
2n D C C C C
0 1 n 1 n
diagrams that can be obtained from ‚ by commuting elementary expansions are con-
nected by edges that form the 1-skeleton of a cube. We call ‚ the bottom corner of
the cube. Of course, this terminology foreshadows a Morse function where diagrams
with more feet will be higher than those with fewer feet. We obtain X by filling in all
those cubes. Figure 13.7.1 shows how a square in X arises.
Since V br acts on D from above whereas elementary expansions take place at the
bottom, the action of V br on D induces an action on X . Cell stabilisers of this action
are braid groups. More precisely, we make the following:
Observation 13.7.2. The stabiliser of a cube coincides with the stabiliser of its bot-
tom corner. Thus, cell stabilisers of the V br -action on X are isomorphic to braid
groups on finitely many strands and therefore of type F1 .
In order to derive finiteness properties from the action on X , we also need that X
has vanishing homotopy groups.
Sketch of proof. Consider the trivial .1; 1/-diagram j and let Yn be the subcomplex
spanned by all diagrams with at most n feet that can be obtained from j by (iterated)
314 K.-U. Bux
expansions.
S Note that YnC1 deformation retracts onto Yn . It follows that the union
Y WD n Yn is a contractible subcomplex of X . The vertices in Y can be represented
by diagrams that are just binary planar trees, i.e., free of braiding and merges.
Splitting all feet of a diagram simultaneously takes the bottom corner of each
cube to its top corner. By interpolation, we can extend this to a continuous map.
This map, however, does not preserve the cell structure of X : for instance, it takes
edges to square diagonals. In fact, it is even possible to extend it to a continuous
flow within X . Note that iterated splitting of all feet will ultimately undo any merge
that happened somewhere up in the diagram; and as diagrams are considered up to
dangling feet, everything untangles at the bottom when all merges are gone. Thus,
we end in the subcomplex Y . By compactness, a sphere in X involves only finitely
many diagrams. Thus, flowing a sphere up the complex, it will eventually be moved
completely into the subcomplex Y that consists of merge-free diagrams. Once inside
the contractible subspace Y , the sphere can be crushed.
Proof. The vertex ‚n is a .1; n/-diagram. Sticking this diagram on top of each dia-
gram in Zn realises the subcomplex of X consisting of all cubes with ‚n as their top
corner.
straighten all the strands and record what this does to the horizontal arcs. Formally,
the first step is just looking at a particularly obvious k-simplex in C .n/ whose arcs
keep track which pairs of feet merge. The second step uses the interpretation of the
braid group Bn as the mapping class group of the n-punctured plane. So undoing the
braid can be realised as a mapping class and we consider how this mapping class acts
on the simplex obtained in the first step.
We shall refer to this construction as unwinding. The process is best understood
visually:
7!
Intuition instead of proof. We have to address two issues. The first is the problem
of ˆ being well-defined. The unwinding construction takes place at the level of dia-
#
grams, but cells in Ln are given by diagrams up to dangling of feet. As indicated in
Figure 13.9.2, any additional dangling (drawn with thick strokes) is undone automat-
ically during the unwinding procedure.
The remaining question is compatibility with the simplicial structure, i.e., if is
a simplicial cell in L#n and 6 is a face, then unwinding yields a face of the arc
matching obtained by unwinding . This, however, is obvious: undoing a merge in
amounts to deleting the corresponding arc in ˆ. /.
In the same way that dangling is undone during unwinding, twisting strands before
merging is undone during unwinding, as shown in Figure 13.9.3. Consequently, the
318 K.-U. Bux
Figure 13.9.2. Automatic undangling during unwinding. The top row shows in thick strokes
additional (compared to the bottom row) dangling and how it is undone at the beginning of the the
unwinding procedure.
unwinding map is not an isomorphism. However, this phenomenon is the only source
of non-injectivity.
Fact 13.9.2. The fiber ˆ 1 .˛/ of a single arc is an infinite discrete set (a Z worth of
diagrams). The preimage ˆ 1 . / of a k-simplex is the full join of its vertex fibers:
ˆ 1
. / D ˆ
˛2
1
.˛/
1
In particular, the fiber ˆ . / is .k 1/-connected.
D
#
in the arc matching complex C .5/. The edges in L5 unwinding to are given by
diagrams of the form shown in Figure 13.9.4.
The fiber over the closed simplex is therefore the complete bipartite graph over
two countable infinite sets. Hence, it is connected.
Higher finiteness properties of braided groups 319
k l
Figure 13.9.4. The numbers k; l 2 Z count how many times the pair of strands is twisting. Each of
#
these edges in L5 unwinds to the edge .
In [26], D. Quillen proved several very useful results that relate connectivity prop-
erties of the domain of a projection to connectivity of the base, provided one has good
control over the fibers of the map. We shall make use of the following criterion, which
Quillen phrases in terms of maps between posets. We specialise it for simplicial com-
plexes (representing their poset of cells). Recall that a complex X is d -spherical if it
has dimension d and is .d 1/-connected.
Theorem 13.9.4 ([26, Thm. 9.1]). Let ' W X ! Y be a simplicial map. As-
sume that Y is d -spherical, and for each simplex Y that the link lkY . / is
.d dim. / 1/-spherical. About the fibers assume that the preimage ' 1 . / of a
closed simplex is dim. /-spherical. Then, X is d -spherical.
Our complexes are not spherical: they are highly connected but have cells above
the allowed dimension. Removing the cells above the critical dimension yields the
following:
Corollary 13.9.5. Let ' W X ! Y be a simplicial map. Assume that Y is l-
connected, and for each simplex Y that the link lkY . / is .l dim. / 1/-
connected. About the fibers, assume that the preimage ' 1 . / of a closed simplex is
.dim. / 1/-connected. Then X is l-connected.
˘
Recall that we defined .n/ WD nC13
2.
Proposition 13.9.6. The cell complex L#n is .n/-connected. Consequently, the de-
scending link lk# .‚/ of any n-feet vertex is .n/-connected.
Proof. We consider the projection ˆ W L#n ! C .n/. The base space C .n/ is .n/-
connected. Moreover, the link of a k-simplex in C .n/ is isomorphic to the arc
matching complex C .n 2k 2; k C 1/, which is .n 2k 2/-connected. Since
Remark 13.10.1. We did not use the fact that the inclusion Xn 1 Xn induces an
epimorphism in d for d D .n/ C 1. From this, we obtain one additional degree of
connectivity for the pair .Xn ; Xn 1 /.
Remark 13.10.2. The arc matching complexes for complete bipartite graphs studied
in Example 13.4.7 can be used to prove a conjecture of F. Degenhardt that the braided
Houghton groups, introduced in [15], have the same finiteness properties as their
unbraided counterparts, which were already treated by Brown in [11, Sec. 5]. These
groups have been given an alternative description as mapping class groups of certain
infinite surfaces in [19]. Details on their finiteness properties as well as a complete
treatment of the related arc matching complexes will be given elsewhere.
References
[1] H. Abels and S. Holz, Higher generation by subgroups, J. Algebra 160 (1993), 310–341.
[2] C.A. Anastasiadis, Decompositions and connectivity of matching and chessboard complexes,
Discrete Comput. Geom. 31 (2004), 395–403.
[3] M. Bestvina and N. Brady, Morse theory and finiteness properties of groups, Invent. Math.
129 (1997), 445–470.
[4] A. Björner, The homology and shellability of matroids and geometric lattices. In [30], 226–
283.
[5] A. Björner, L. Lovász, S.T. Vrećica and R.T. Živaljević, Chessboard complexes and matching
complexes, J. London Math. Soc. 49 (1994), 25–39.
[6] A. Borel and J-P. Serre, Corners and arithmetic groups, Comment. Math. Helv. 48 (1973),
436–491.
[7] A. Borel and J.-P. Serre, Cohomologie d’immeubles et de groupes S-arithmétiques, Topology
15 (1976), 211–232.
Higher finiteness properties of braided groups 321
[8] M. Brin, The algebra of strand splitting II: A presentation for the braid group on one strand,
Int. J. Algebra Comput. 16 (2006), 203–219.
[9] M. Brin, The algebra of strand splitting I: A braided version of Thompson’s group V , J. Group
Th. 10 (2007), 757–788.
[10] K.S. Brown, Cohomology of Groups, GTM 87, Springer, New York, 1982.
[11] K.S. Brown, Finiteness properties of groups, J. Pure Appl. Alg. 44 (1987), 45–75.
[12] K.-U. Bux, M. Fluch, M. Schwandt, S. Witzel and M. Zaremsky, The braided Thompson’s
groups are of type F1 , J. reine angew. Math. (Crelle) 718 (2016), 59–101.
[13] K.-U. Bux, R. Köhl and S. Witzel, Higher finiteness properties of reductive arithmetic groups
in positive characteristic: the rank theorem, Ann. of Math. 177 (2013), 311–366.
[14] M. Culler and K. Vogtmann, Moduli of graphs and automorphisms of free groups, Invent.
Math. 84 (1986), 91–119.
[15] F. Degenhardt, Endlichkeitseigenschaften gewisser Gruppen von Zöpfen unendlicher Ord-
nung, PhD thesis, Univ. Frankfurt, 2000.
[16] P. Dehornoy, The group of parenthesized braids, Adv. Math. 2005 (2006), 354–409.
[17] B. Farb and D. Margalit, A Primer on Mapping Class Groups, Princeton University Press,
Princeton, 2012.
[18] D. Farley, Finiteness and CAT.0/ properties of diagram groups, Topology 42 (2003), 1065–
1082.
[19] L. Funar, Braided Houghton groups as mapping class groups, An. Ştiinţ. Univ. Al. I. Cuza Iaşi.
Mat. (N.S.) 53 (2007), 229–240.
[20] R. Geoghegan, Topological Methods in Group Theory, GTM 243, Springer, New York, 2008.
[21] A. Hatcher and K. Vogtmann, Tethers and homology stability for surfaces, Algebr. Geom.
Topol. 17 (2017), 1871–1916
[22] J.L. Harer, Stability of the homology of the mapping class groups of orientable surfaces, Ann.
of Math. 121 (1985), 215–249.
[23] J.L. Harer, The virtual cohomological dimension of the mapping class group of an orientable
surface, Invent. Math. 84 (1986), 157–176.
[24] W.J. Harvey, Geometric structure of surface mapping class groups. In [29], 255–269.
[25] J.S. Povran, Decompositions, shellings and diameters of simplicial complexes and convex
polytopes, Thesis, Cornell University, 1977.
[26] D. Quillen, Homotopy properties of the poset of nontrivial p-subgroups of a group, Adv. Math.
28 (1978), 101–128.
[27] M. Stein, Groups of piecewise linear homeomorphisms, Trans. Amer. Math. Soc. 332 (1992),
477–514.
[28] K. Vogtmann, Local structure of some OUT(Fn )-complexes, Proc. Edinburgh Math. Soc. 33
(1990), 367–379.
[29] C.T.C. Wall (ed.), Homological Group Theory, LMS Lecture Notes 35, Cambridge University
Press, Cambridge, 1979.
[30] N. White, Matroid Applications, Encyclopedia Math. Appl., volume 40, Cambridge Univer-
sity Press, Cambridge, 1990.
[31] M. Zaremsky, Higher generation for pure braid groups, Appendix to arXiv version of [12],
arXiv:1210.2931.
Chapter 14
Zeta functions and the trace formula
W. Hoffmann
Prehomogeneous vector spaces provide a framework for the method of analytic con-
tinuation of zeta integrals due to Hecke and Tate. We will describe instances where
convergence can only be achieved by truncation. Special values of such zeta integrals
appear in the Arthur-Selberg trace formula, and their study is relevant in connection
with recent ideas in the Langlands program.1
14.1 Introduction
The initial motivation for introducing zeta functions was the study of the asymptotic
distribution of a sequence of real numbers, e.g., the sequence of primes, of closed
geodesics on a Riemann surface, or of the eigenvalues of a self-adjoined operator.
The larger the half-plane to which a zeta function can be meromorphically continued,
the smaller the error term in that asymptotics. The best one can expect in this respect
is meromorphic continuation to the entire complex plane. Zeta functions with this
property often satisfy a functional equation. The classical prototype is the Riemann
zeta function or, more generally, the Dedekind zeta function of a number field. In
this case, the functional equation is a consequence of the Poisson summation for-
mula, as was shown by Hecke and Tate. The most general zeta functions which owe
their functional equation to the Poisson summation formula are those associated with
prehomogeneous vector spaces. They will be the subject of the first section.
A seemingly unrelated topic is the Arthur–Selberg trace formula, which plays an
important role in the theory of automorphic forms. Roughly speaking, that formula
computes the trace of a certain intgral operator by integrating its kernel function over
the diagonal. In order to ensure convergence, that kernel function has to be truncated
in most cases.
The two topics are related because truncation is also necessary in some cases
to ensure the convergence of zeta functions associated with prehomogeneous vector
spaces. In fact, we expect that the so-called unipotent contribution to the trace formula
can be expressed in terms of such zeta functions. So far, this has been proved for
groups of rank up to two.
1 Project C7
324 W. Hoffmann
Now the next basic object enters the stage (cf. [18, §5.3]).
Definition 14.2.2. The zeta integral associated with a prehomogeneous vector space
over the field Q is
Z X
Z.; !; ƒ/ D !.g/ .g/ dg;
G.R/=
2ƒ\Vreg .Q/
We may assume that G acts (almost) faithfully, so that is discrete. Then the
G.R/-invariant measure dg on the homogeneous space G.R/= is determined by a
fixed Haar measure on G.R/ and the counting measure on . Often one chooses
!.g/ D j.g/js , where s 2 C and the character W G ! GL1 corresponds to
a regular relatively invariant function p on V . Then the integral-sum is absolutely
convergent for s in some right half-plane, which may be empty.
Zeta-functions arise as follows. We write the contribution from a -orbit Œ D
f
j
2 g in ƒ as a sum over
2 = , where denotes the stabiliser of
in . Assuming absolute convergence, we interchange the sum over the set of orbits
with the integral and combine the latter with the inner sum to get
X Z
Z.; !; ƒ/ D !.g/.g/dg:
G.R/=
Œ ƒ\Vreg .Q/
Now we split the resulting integral into an integral over G.R/=G .R/ and one over
G .R/= . The integrand does not depend on the variable in G .R/ and can be taken
out of the inner integral. On each G.R/-orbit O in Vreg .R/, we fix a G.R/-invariant
measure dG x and define a nonvanishing function q up to a nonzero factor by the
condition
q.gx/ D !.g/q.x/:
In fact, if q0 corresponds in this sense to the character !0 .g/ D jdetV gj, we have
dG x D q0 .x/dx for a Lebesgue measure dx on V .R/. The chosen measure on O
fixes those on both subfactors of G.R/ for each , and the remaining integral can be
reduced by a substitution to the weighted orbital integral
Z
JO .; q/ D .x/q.x/dG x:
O
326 W. Hoffmann
Definition 14.2.3. The adelic zeta integral associated with a prehomogeneous vector
space over a number field F is
Z X
Z.; !/ D !.g/ .g/ dg;
G.A/=G.F /
2Vreg .F /
Unramified characters of G.A/ are those of the form g 7! j.g/js for a rational
character , where we now use the idele norm defined by vol.aB/ D jaj vol.B/ for
Zeta functions and the trace formula 327
every a 2 A and every Borel set B A. The zeta integral of the product of such a
character, for as above, and a general character ! is denoted by Z.; s; !/.
In order to obtain zeta functions, we have to choose a finite set S of valuations
of F containing
Q all archimedean
Q ones. There is a splitting of A into F -algebras
FS D v2S Fv and AS D 0v…S Fv and correspondingly X.A/ D X.FS / X.AS /
Q
for every variety X over F . If U is a lattice in V .AS / over the ring v…S Ov , the
set ƒ D f 2 V .F / j S 2 U g is a lattice over the ring OS of S -integers in F and,
conversely, U is the closure of ƒ embedded diagonally into V .AS /. For a Schwartz-
Bruhat function of the form
.x/ D S .xS /1U .x S /;
where 1U is the characteristic function of U , we get a decomposition
X
Z.; !/ D S .q S ; O \ ƒ/JO .S ; qS / (14.2.2)
OVreg .FS /
into local zeta integrals. The automorphic L-function L.s; / is then defined as an
Euler product, see [11].
and is valid for any Schwartz–Bruhat function on V .A/, whose Fourier transform
is the Schwartz–Bruhat function on V .A/ given by
Z
O
.y/ D .x/ .hx; yi/ dx:
V .A/
In this case, the dual space V is also prehomogeneous for the contragredient
action characterised by hgx; gyi D hx; yi, and the map in the definition sets up
an equivariant isomorphism between the regular sets. The basic relative invariant
p of V pulls back to a constant times 1=p, hence its corresponding character is
D 1=. All the examples above are regular prehomogeneous vector spaces, while
a vector space V with the tautological action of GL.V / is not regular for dim V > 1,
having no relative invariants at all.
We will now recall how the functional equation emerges, whose proof also pro-
vides the meromorphic continuation. If we decompose
Z.; !/ D ZC .; !/ C Z .; !/;
where Z X
Z˙ .; !/ D !.g/ .g/dg;
j.g/j>
<1 2Vreg .F /
converges as well, where S and SO are the singular subvarieties in V and V . Note that
O ! ! / has the same integrand up to sign, but a different domain of integration.
I.; 0
Replacing !.g/ by its product with an unramified character j.g/js resp. j .g/js
and writing !0 .g/ D j.g/j , we get the integrals
I.; s; !/; O
I.; s; !/;
Zeta functions and the trace formula 329
which converge for Re s in some right resp. left half-plane. If one shows that their
meromorphic continuations exist and coincide, one gets the functional equation
O
Z.; s; !/ D Z.; s; !/;
O ! !0 /:
Z.; !/ D Z.; (14.2.4)
Be warned that many authors identify V with V in an equivariant way with respect
to some involutive automorphism of G rather than using the contragredient represen-
tation. This results in an argument ! 1 resp. ! 1 !0 on the right-hand side.
In order to derive a global functional equation for the collection of zeta func-
tions S in (14.2.2), one needs local functional equations connecting the distributions
JO on V .FS / with their Fourier transforms. M. Sato, the inventor of prehomogeneous
vector spaces, called such a statement the fundamental theorem in his theory. He
proved it, under some assumptions, for archimedean places himself (see [18, §4.1]),
and for non-archimedean places this was done by Igusa [17]. It is a consequence
of the equivariance of the Fourier transform and the fact that a distribution on each
open orbit O with the same relative invariance as JO is unique up to a constant fac-
tor. Denef and Meuser proved local functional equations without the assumption of
prehomogeneity, see (15.2.1).
Now we return to the question of convergence of I.; !/, where we restrict to the
case of unramified characters ! D jjs . We denote the kernel of the homomorphism
jj W G.A/ ! RC by G.A/1 and split the integral into one over 0; 1Œ and one
over G.A/1 =G.F /. The case when the inner integral-sum is absolutely convergent
was considered in [23]. This covers Example 1 for n D 1 (considered by Hecke
and, adelically, by Tate) and Example 2 for non-split quadratic forms (considered
by Epstein in the case of definite forms and by Siegel in general). One can then
decompose the integral into the contributions from the geometric orbits Si and SOj in S
and SO . Let us assume for simplicity that G.A/1 acts transitively on Si .A/ and SOj .A/.
Then Z X
†i ./ D .g/dg
G.A/1 =G.F /
2Si .F /
valid for Re s > , which provides the meromorphic continuation. The contributions
from the zero orbits S0 D f0g in V and SO0 D f0g in V are given by s0 D sO0 D 0
and
†0 ./ D †0 ./ O D vol.G.A/1 =G.F //:
ˇ
If denotes the characteristic function of the set fg 2 G.A/ ˇ j.g/j < 1g, then
Z Z !
O
†0 ./ X
Z.; s/ D j.g/js .g/ .g/ .gx/ dx dg:
s G.A/=G.F / V .A/
2Vreg .F /
(14.2.5)
In many cases (e.g., if G is anisotropic modulo center, S D f0g and SO D f0g), it
is known that the contribution from SO0 is the principal part in the Laurent expansion
of Z.; s/ at the abscissa of convergence s D and that the integral on the right-
hand side is convergent for s in a half-plane strictly larger than the half-plane of
convergence of Z.; s/. Its value at s D gives a formula for the constant term in
the Laurent expansion. Using (14.2.2), that constant term can also be expressed in
terms of the principal parts of the zeta functions and the Taylor coefficients of the
weighted orbital integrals.
In preparation for the next example, we recall the notion of a height function. For
a free module V over a commutative ring A, we denote by VP the scheme-theoretic
complement of the origin. Then VP .A/ consists of all x 2 V for which there exists
an A-linear form l on V such that l.x/ is invertible. If V is a vector space over the
number field F , a height function on VP .A/ is defined as
Y
kxk D kxv kv ;
v
the algebras Sym W and Sym W obtained by applying the elements of the latter, in-
terpreted as differential operators on W , to the elements of the former and evaluating
at zero. In particular, we may identify V with Sym2 W . For each r 6 n D dim W ,
there is a geometric orbit Sr V consisting of the elements of rank r. For n ¤ 2,
the zeta integral Z.; s/ is convergent for Re s > D nC1 2
and has a meromorphic
continuation to the complex plane satisfying the functional equation (14.2.4). For
each 0 6 r < n, the orbit Sr (resp. SOr ) causes a simple pole at 2r (resp. 2r ), and
formula (14.2.5) remains valid. However, the individual contributions of the orbits to
I.; !/ do not converge. Shintani circumvented this problem by choosing a function
that vanishes on S.A/ and for which O vanishes on SO .A/, while the orbital inte-
grals in (14.2.1) do not. The residues were fully determined only later in [16] using
different methods.
In the case n D 2, the zeta integral Z.; s/ is divergent due to the presence of split
elements D 1 2 2 Vreg .F / with 1 ¤ 2 2 W . Indeed, the i are eigenvectors
of each g 2 G , and the ratio of the eigenvalues is a rational character of that group.
In order to salvage convergence, one fixes a height function on W .A/ and chooses a
family t of truncation parameters tU > 0 for all one-dimensional subspaces U W
in such a way that the characteristic functions U of the sets
ˇ
fg 2 G.A/ ˇ kgk > tU kk for 2 U g
satisfy U .g
/ D
U .g/ for all
2 G.F /. Then the truncated zeta integral
Z X X
Z t .; s/ D j.g/js 1 U .g/ .g/dg; (14.2.6)
G.A/=G.F /
2Vreg .F / U j
where U runs through the two or none lines whose elements divide the given , con-
verges for Re s > D 32 . Since G.F / acts transitively on the set of split elements,
we can choose one such element 0 with corresponding subspaces U1 , U2 and write
Z X
Z t .; s/ D j.g/js .g/dg
G.A/=G.F /
2Vreg .F /
irred.
Z
C j.g/js 1 U .g/ U .g/ .g0 /dg:
1 2
G.A/=G0 .F /
For tU1 D tU2 D 0, Yukie [27] called this the adjusted zeta integral. It extends
meromorphically to the complex plane and satisfies the functional equation (14.2.4).
It has poles at 0 and 32 coming from zero orbits S0 , SO0 and poles at 12 and 1 from the
orbits S1 , SO1 of nonzero squares, see [27].
332 W. Hoffmann
Theorem 14.2.5. The truncated zeta integral for the space of binary quadratic forms
satisfies
O
†0 ./
Z t .; s/
s 32
!
Z X X Z
s
D j.g/j 1 U .g/ .g/ .g/ .gx/dx dg;
G.A/=G.F / V .A/
2Vreg .F / U j
(14.2.7)
where the integral on the right-hand side converges for Re s > 1.
Vi D Symi 1
W Sym4 i
U0
where Z denotes the analytically continued zeta integral associated to the one-dimen-
sional prehomogeneous vector space Vi =Vi 1 with the group GL1 and
Z
K
.x/ D .kx/dk
K
Zeta functions and the trace formula 333
is the average over a maximal compact subgroup K G.A/ as above. Proposition 6.1
of [8] asserts that
O
†0 ./ †0 ./ O 2/
† .; †1 .; 32 / O 1/
† .; †2 .; 1/
I.; s/ D C 1 3
C 2
s 1 s 6s 5 6s 1 2s 2 2s
for our parametrisation of unramified characters and a measure on G.A/ specified be-
low. The proof uses Shintani’s method of multiplying the integrand by certain func-
tions that make the contribution from each orbit convergent and so that the original
integral may be recovered. A more transparent proof using partial Fourier transforms,
which exhibits the cancellations between the contributions from the orbits, is given
in [19].
Although the elements of Vreg .F / which split over F into two factors do not
hamper the convergence of Z.; s/ for Re s > D 1, they do make the right-hand
side of (14.2.5) divergent for s D . This can be fixed by truncation:
Theorem 14.2.6. The zeta integral for the space of binary cubic forms satisfies
O C 1 †2 .;
O 1/ Z X
†0 ./ 2
Z.; s/ D j.g/js .g/
s 1 G.A/=G.F /
2Vreg .F /
Z Z !
X X
.g/ .g.x//dx .g/ .gx/dx dg
U 2Vreg;U .F / Sym3 U.A/ V .A/
where Vreg;U D f 2 Vreg W U j g= Sym3 U . The integral on the right-hand side is
convergent for Re s > 65 .
The subspaces U are the translates of U0 by elements of G.F /. The associated sub-
spaces Sym3 U and f 2 V W U j g D U Sym2 W are the translates of V1 and V3 ,
resp., and V3 \ Vreg D V3 n V2 , so the right-hand side becomes
Z X Z
j.g/js .g/ .g.x//dx dg;
G.A/=P .F / V1 .A/
2.V3.F /nV2 .F //=V1 .F /
which equals Z.3 ; 2/=.2s 2/. Now the asserted equality follows from the func-
tional equation (14.2.4) and the fact that O2 is the Fourier transform of 3 . The wider
range of convergence of the integral was proved in [9].
Definition 14.3.1. A function on G.R/ has moderate growth if there exist a faithful
rational representation over Q and positive constants c and n such that j.g/j 6
ck.g/kn for some norm on End W .R/. A smooth function of moderate growth is
called an automorphic form with respect to an arithmetic subgroup and a maximal
compact subgroup K of G.R/ if it is left -invariant, right K-finite and ZG -finite,
where ZG is the algebra of biinvariant differential operators on G.R/.
Zeta functions and the trace formula 335
The name is due to the fact that, for torsion-free , the space of alternating dif-
ferential forms on the locally symmetric space nG.R/=K can be embedded into the
space of left -invariant and right K-finite functions. Here K-finiteness means that
the right translates of by elements of K span a finite-dimensional subspace. Due
to tradition, and K have switched the sides in comparison with the last section.
Everything can be generalised to any number field F in place of Q, in which case R
has to be replaced by F1 .
The arithmetic importance of automorphic forms is due to Hecke operators. They
exist in abundance for congruence subgroups and can be best described when one
considers all congruence subgroups at the same time. As in Definition 14.2.2, this
requires the ring A of adeles of F .
Definition 14.3.2. A function on G.A/ has moderate growth if there exist a faithful
rational representation over F and positive constants c and n such that j.g/j 6
ck.g/kn for some height function on VP .A/, where V D End W . A smooth function
of moderate growth is called an automorphic form with respect to a maximal compact
subgroup K of G.A/ if it is left G.F /-invariant, right K-finite and ZG -finite, where
ZG is the algebra of biinvariant differential operators on G.F1 /.
The K-finiteness amounts to K1 -finiteness and Kfin -finiteness, where Kfin is as-
sumed to be open in G.Afin /. Finiteness under the totally disconnected group Kfin
means that is right invariant under some open compact subgroup U of G.Afin /,
hence can be viewed as a function on G.F /nG.A/=U. On this space, G.F1 / acts
from the right, with finitely many orbits if G 0 .F1 / is noncompact for every simple
factor G 0 of G over F . The orbit of a point G.F /gU with g 2 G.Afin / is isomor-
phic to g nG.F1 /, where the congruence subgroup g is the projection of G.F / \
G.F1 / gUg 1 . This relates the adelic picture to the classical one and can also be
interpreted as a cut and project scheme in the spirit of (9.2.1) with window U.
In order to get a representation R of the group G.A/ by setting
.R.g//.x/ D .xg/;
one has to complete the spaces of automorphic forms because K1 -finiteness is not
preserved under right translations. This takes place within the space of functions of
uniformly moderate growth, i. e., functions for which D is of moderate growth for
every left-invariant differential operator D with an exponent n independent of D. An
automorphic form generates a subrepresentation of finite length, and its subquotients
are called automorphic representations. Usually, however, one avoids this complica-
tion and is content with the action of the Lie algebra g of G.F1 / and of the algebra
of compactly supported left and right K-finite functions f on G.A/ given by
Z
R.f / D f .g/R.g/dg
G.A/
336 W. Hoffmann
for a Haar measure dg, which preserves the space of automorphic forms. One can
write R.f / as an integral operator
Z
.R.f //.x/ D K.x; y/.y/dy
G.F /nG.A/
If f .g/ D f1 .g1 /ffin .gfin / according to the decomposition G.A/ D G.F1 /G.Afin /,
then R.f / is the product of commuting operators R1 .f1 / and Rfin .ffin /, the latter
one being called a Hecke operator.
The version of R acting on the Hilbert space L2 .G.F /nG.A// is a unitary repre-
sentation. If G is anisotropic over F , then G.F /nG.A/ is compact, R.f / is of trace
class, and one has Z
tr R.f / D K.x; x/dx: (14.3.1)
G.F /nG.A/
In this case, R is a direct sum of automorphic representations, and the trace formula
becomes X X
a./J.; f / D a.
/J.
; f /; (14.3.2)
Œ Œ
where Œ runs through the equivalence classes of automorphic representations and
Œ
through the conjugacy classes in G.F /. Moreover,
Z
J.; f / D tr .f /; J.
; f / D f .g 1
g/dg
G
.A/nG.A/
denote the distributional character of and the orbital integral of
. Finally, a./ de-
notes the multiplicity of in R, and a.
/ D vol.G
.F /nG
.A//. The computation
of right-hand side of (14.3.2) is quite similar to that of (14.2.1).
If G is not anisotropic, it has nontrivial parabolic subgroups P , i. e. algebraic
subgroups for which the variety G=P is complete and hence P .A/nG.A/ is compact.
They have Levi decompositions P D MN over F , where N is the unipotent radical
of P and M is some reductive subgroup. Langlands has obtained a decomposition
of R into automorphic representations, cf. [21]. It consists of a discretely decompos-
able part Rdis and direct integrals of representations induced from representations of
M
parabolic subgroups P .A/. The latter are components of the analogue Rdis of Rdis
for M , pulled back under the natural homomorphism P .A/ ! M.A/. The connec-
tion between the induced representations and R is set up by Eisenstein series.
If the F -split component AG of the centre of G is nontrivial, then (14.3.1) is di-
vergent due to the invariance of the integrand under AG .A/. (One says that a torus
is split if it is isomorphic to a product of groups GL1 .) Thus one has either to con-
sider the space of functions satisfying .ax/ D !.a/.x/ for a fixed character !
Zeta functions and the trace formula 337
of AG .A/, or one has to restrict the functions to G.F /nG.A/1, where G.A/1 is now
the intersection of the kernels of all unramified characters introduced after Defini-
tion 14.2.3. Incidentally, as a complementary subgroup one can take the image of
the trivial connected component of AG .R/ in AG .A/ under the diagonal embedding
R ! F1 A. Its Lie algebra is denoted by aG , and any element g 2 G.A/ can be
written in the form g 1 exp HG .g/ with g 1 2 G.A/1 and HG .g/ 2 aG .
For general reductive groups G, this does not yet suffice to make (14.3.1) con-
vergent. One fixes a maximal compact subgroup K such that Kv is special for every
place v of F , so that G.A/ D P .A/K, and sets for m 2 M.A/, n 2 N.A/ and k 2 K
HP .mnk/ D HM .m/ 2 aP WD aM : (14.3.3)
We have h$; HP .g/i D log k.g 1 /vk for a suitable F -rational representation
of G, a vector v of highest weight $ and a K-invariant height function, whence HP
is sometimes also called a (logarithmic) height function. The modular character for
the action of P .A/ on N.A/ is eh2P ;HP .p/i for a linear functional P on aP . A
truncation parameter for G is a family of points TP 2 aP chosen compatibly with the
projections aP ! aP 0 for P P 0 and so that
H
P
1 .
x/ T
P
1 D HP .x/ TP
for
2 G.F /. Since the minimal parabolic subgroups over F are conjugate un-
der G.F /, T is determined by TP for any minimal P .
For each P , we have the positive chamber aC P consisting of elements H 2 aM
such that h˛; H i > 0 for all roots ˛ of AM in N . It is contained in the obtuse cone
C
aP defined by hP 0 ; H i > 0 for all (maximal) P 0 P . We denote by PT resp. OPT
be the characteristic function of the set of all x 2 G.A/ such that HP .x/ TP is in
aC C
P resp. aP . The function
X
F .x; T / D . 1/dim AP =AG OPT .x/ (14.3.4)
P
1
on G.A/ has compact support modulo G.F /, and if TP is sufficiently deep inside
aC G
P for one minimal (and hence for each) P , then F .x; T / it is the characteristic
function of a set that exhausts G.A/1 as T ! 1 in the obvious sense. So it would be
natural to integrate K.x; x/F .x; T /. Arthur realised that the integral is asymptotic to
a polynomial that can be described explicitly.
He considers the induced representation RP coming from the analogue RM of R
for a Levi component M of any parabolic subgroup P defined over F . Now RP .f /
is an integral operator with kernel function
X Z
P
K .x; y/ D f .x 1
ny/dn:
N.A/
2M.F /
The sum on the left-hand side is taken over the conjugacy classes of pairs consisting
of a Levi subgroup M and an element
of M.F /, while the integral on the right-hand
side is taken over the conjugacy classes of pairs consisting of a Levi subgroup M and
a (virtual) automorphic representation of M.A/ \ G.A/1 .
Inside the contributions from M D G to the right-hand side, one finds the trace
of Rdis .f /. Thus, the trace formula can be applied to the study of the multiplicities of
automorphic representations in this space. The general distributions JM .; f / come
from induced representations. They are weighted characters defined with the help of
intertwining operators.
The general distributions JM .
; f / on the left-hand side are weighted orbital
integrals over conjugacy classes. Inside the adelic points of a geometric conju-
gacy class, there may be infinitely many G.A/-conjugacy classes. Thus, strictly
speaking, one has to fix a finite set S of places of F and consider test functions
f .g/ D fS .gS /f S .g S /, where f S is the characteristic function of a maximal com-
pact subgroup of G.AS /. This leads to a modified notion of conjugacy depending
on S .
For advanced applications, e.g. the classification of automorphic representations
of classical groups in terms of those of GLn (cf. [3]), the trace formula has to be
generalised to the twisted trace formula and undergo a number of metamorphoses
(invariant form, stable form). There are potential applications which require an even
deeper understanding of the terms in the trace formula, see [4].
where the exterior sum runs through the geometric conjugacy classes D in M whose
induced conjugacy class is C . It was conjectured in [14] and proved in [10] that
the integral-sum defining JCT .f / is absolutely convergent, even for suitable non-
compactly supported functions f .
Arthur reduced the computation of J T .f / to that of the unipotent contribution
(cf. [2], section 19), hence we consider only unipotent conjugacy classes C now. A
unipotent element of G.F / is of the form
D exp X for a nilpotent element X of the
Lie algebra g of G over F . According to the theorem of Jacobson–Morozov, there
exists another nilpotent element Y and a semisimple element H such that ŒX; Y D
H , ŒH; X D 2X and ŒH; Y D 2Y . Under the endomorphism Z 7! ŒH; Z, the
Lie algebra decomposes into eigenspaces gn , n 2 Z. If we denote by q, u, u0 and u00
the sum of eigenspaces with n > 0, n > 0, n > 2 and n > 2, respectively, then
q is a parabolic subalgebra with ideals u, u0 and u00 , and Kostant proved that they
are independent of the choice of Y and H . Moreover, the normaliser Q of q in G
is a parabolic subgroup over F , called the canonical subgroup of X , with unipotent
radical U and normal subgroups U 0 and U 00 obtained from their analogues in g by
the exponential map, and the centraliser L of H in G is a Levi component of Q.
The subalgebra spanned by X , Y and H is isomorphic to sl2 .F /, and it follows from
the representation theory of the latter that V D U 0 =U 00 is a regular prehomogeneous
vector space under the action of L by conjugation.
A general approach to the evaluation of JCT .f / for a unipotent class C , which
generalises the original method of Selberg, was described in [14]. The first step is
to group the contributions from the various elements
of C.F / according to their
canonical subgroups. The problem is that we actually have contributions from cosets
N P , whose elements have different canonical subgroups. The trick is to re-
place the integral over
N.A/ for
in an inflated class D 0 .F / by a sum of integrals
0 0
over cosets
0 N D .A/ for a sufficiently small normal subgroup N D of P so that
0
the elements of
0 N D .F / \ D 0 .F / have the same canonical subgroup. This uses a
mean-value formula of Siegel, Weil and Ono, whose applicability relies on the con-
0
dition that N D is also sufficiently large so that every rational function on the variety
0
N=N D relatively invariant under the stabiliser of
N in P is constant. So far, no
0
case has been found in which such N D would not exist. We can now fix the canon-
ical subgroup Q of some element of C.F / and obtain those of the other ones by
conjugating with G.F /. If we combine the resulting sum over Q.F /nG.F / with the
integral, we obtain under certain convergence assumptions
Z X X Z
T
JC .f / D f .x 1
n0 x/ dn0 T
N 0 .x/dx;
Q.F /nG.A/1 N 0 Q N 0 .A/
2..C \Qcan /N 0 =N 0 /.F /
340 W. Hoffmann
with P running through the parabolic subgroups such that
is in some inflated con-
0
jugacy class D 0 P and N 0 D N D . According to Cor. 1 in [13], this sum is
finite.
The upshot is that, after introducing a simple exponential factor under the integral,
T 0
the contribution JC;N 0 .f; / from each subgroup N , given by
Z X Z
eh;HQ .x/i f .x 1
n0 x/ dn0 T
N 0 .x/dx;
Q.F /nG.A/1 N 0 .A/
2..C \Qcan /N 0 =N 0 /.F /
where Vreg D C \ U 0 =U 00 ,
Z Z
1
fV .x/ D f .k xuk/ dx dk
K U 00 .A/
and ehıU=U 00 ;HL .l/i is the modular character for the action of L.A/ on U.A/=U 00 .A/.
There are many cases in which the assumptions have been verified, and we can at
least claim the following.
Theorem 14.4.1. For connected reductive groups of F -rank 1 or absolute rank 2, the
contribution from a unipotent geometric conjugacy class C to the geometric side of
the trace formula is the value at D 0 of
X
JCT .f; / D T
JC;N 0 .f; /;
P 0 P
where P 0 runs through the parabolic subgroups with unipotent radical N 0 , and
T
JC;f1g .f; / is a truncated zeta integral convergent for Re in a chamber with apex
ıU=U 0 .
The case of F -rank one has been known since long (cf. [1, 12]). Here, the groups
L are anisotropic modulo centre, and the only singular orbit in V is f0g. Groups of
type A2 and B2 D C2 have been treated in [15], groups of type G2 in [9]. In the last
two cases, for the subregular unipotent orbits, the prehomogeneous vector spaces are
Zeta functions and the trace formula 341
those of binary quadratic resp. cubic forms, cf. Examples 3 and 4. The first case not
covered by the above result is the subregular unipotent orbit in the group GL4 , where
we encounter the prehomogeneous vector space from Example 5.
Note that for odd unipotent conjugacy classes, i. e. those with U ¤ U 0 , we
have convergence at D 0. Otherwise, this point is at the verge of the chamber of
convergence, and JCT .f; 0/ is related to the constant term in the Laurent expansion
of the zeta integral. The terms with N 0 ¤ f1g correspond to the principal part in
that expansion. The zoo of phenomena presented in Theorems 14.2.5 and 14.2.6 is
reproduced by the distribution JCT .f / with its uniform definition (14.3.5).
There are conjugacy classes in general groups that can be handled with the present
methods, e.g. the principal unipotent orbit. In this case, V Š g2 is a sum of root
spaces, but the truncation does not respect the product structure. This leads to correc-
tion terms that can be handled using a generalisation of Arthur’s notion of .G; M /-
families, and an explicit formula for the weight factor in JM .
; f / was obtained (see
section 5.3 in [14]).
Finally, we review a different approach to the computation of JCT .f / using piece-
wise exponential factors. The exponents form a family of complex linear func-
tionals P on aP for each parabolic P compatibly with the embeddings aP 0 ! aP
for P P 0 and such that
P
1 ı Ad.
/ D P for
2 G.F /. Again, is
determined by P for a single minimal P . The pieces come from Arthur’s parti-
tion of G.F /nG.A/, which is constructed as follows. For each parabolic subgroup
P D MN over F , we have an analogue F M .m; T M / of the function F .x; T /
from (14.3.4), where TPM0 \M D TP 0 for every parabolic subgroup P 0 P of G.
If we set F P .mnk/ D F M .m; T M / for m 2 M.A/, n 2 N.A/ and k 2 K, then
X
F P .x; T /PT .x/ D 1:
P
P T;]
under the intergral (14.3.5), we obtain a distribution J T;].f; / D C JC .f; /;
C
which certainly converges if Re P is nonpositive on aP for each P and which coin-
cides with J T .f / for D 0.
At first glance, these distributions seem more complicated than JCT .f; /. In [6],
however, the analogous distributions on the Lie algebra of the group GLn , with the
choice P D sP for s 2 C, were expressed in terms of mean values of truncated
Eisenstein series and zeta integrals for prehomogeneous vector spaces of the form
End W1 End Wm under two assumptions on the test function f —it must be
almost invariant and have small support in a suitable sense. In the proof, truncation
for GLn is expressed in terms of stability. It relies on an observation by Schiffmann
that, for test functions with small support, a certain condition on the slopes of all
subquotients of the canonical filtration is equivalent to the condition on the slope of
342 W. Hoffmann
the socle only. There is a special case of orbits in GLn , which are called regular by
blocks, where the invariance condition is automatically satisfied. In this case, the zeta
integrals are evaluated at a parameter in the range of convergence, see [6]. Another
case that has been completely solved by the same author is that of orbits induced from
the trivial orbit of Levi subgroups with two-by-two blocks, see [5]. In his case, the
result can be expressed as a weighted orbital integral with a global weight factor.
References
[1] J. Arthur, The Selberg trace formula for groups of F -rank one, Ann. of Math. 100 (1974),
326–385.
[2] J. Arthur, An introduction to the trace formula. In Harmonic Analysis, the Trace Formula, and
Shimura Varieties (J. Arthur, D. Ellwood and R. Kottwitz, eds.), Amer. Mathematical Society,
Providence, RI, 2005, 1–263.
[3] J. Arthur, Classifying automorphic representations. In Current Developments in Mathematics
(D. Jerison, M. Kisin, T. Mrowka, R. Stanley, H.-T. Yau and S.-T. Yau, eds.), Int. Press,
Somerville, MA, 2013, 1–58.
[4] J. Arthur, Problems beyond endoscopy. In Representation Theory, Number Theory, and In-
variant Theory (J. Cogdell, J.-L. Kim, and C.-B. Zhu, eds.), Birkhäuser, Cham, 2017, 23–45.
[5] P.-H. Chaudouard, Sur la contribution unipotente dans la formule des traces d’Arthur pour les
groupes généraux linéaires, Israel J. Math. 218 (2017), 175–271.
[6] P.-H. Chaudouard, Sur certaines contributions unipotentes dans la formule des traces d’Arthur,
Amer. J. Math. 140 (2018), 699–752.
[7] P.-H. Chaudouard, Sur une variante des troncatures d’Arthur. In Geometric Aspects of the
Trace Formula (W. Müller, S. Shin, and N. Templier, eds.), Springer, Cham, 2018, 88–120.
[8] B. Datskovsky and D. Wright, The adelic zeta function associated to the space of binary cubic
forms. II. Local theory, J. Reine Angew. Math. (Crelle) 367 (1986), 27–75.
[9] T. Finis, W. Hoffmann and S. Wakatsuki, The subregular unipotent contribution to the ge-
ometric side of the Arthur trace formula for the split exceptional group G2 . To appear in
Geometric Aspects of the Trace Formula. (W. Müller, Sug Woo Shin, and N. Templier, eds.)
Springer, New York, 2018, arXiv:1706.00964.
[10] T. Finis and E. Lapid, On the continuity of the geometric side of the trace formula, Acta Math.
Vietnam 41 (2016), 425–455.
[11] R. Godement and H. Jacquet, Zeta Functions of Simple Algebras, LNM 260, Springer, New
York, 1972.
[12] W. Hoffmann, The non-semisimple term in the trace formula for rank one lattices, J. Reine
Angew. Math. (Crelle) 379 (1987), 1–21.
[13] W. Hoffmann, Induced conjugacy classes, prehomogeneous varieties, and canonical parabolic
subgroups, preprint; arXiv:1206.3068.
[14] W. Hoffmann, The trace formula and prehomogeneous vector spaces, In Families of Auto-
morphic Forms and the Trace Formula. (W. Müller, Sug Woo Shin, and N. Templier, eds.),
Springer, New York, 2016, 175–215.
Zeta functions and the trace formula 343
[15] W. Hoffmann and S. Wakatsuki, On the geometric side of the Arthur trace formula for the
symplectic group of rank 2. In Mem. Amer. Math. Soc., Nr. 1244, vol. 255, 2018, American
Mathematical Society, Providence, RI.
[16] T. Ibukiyama and H. Saito, On zeta functions associated to symmetric matrices, II: Functional
equations and special values. Nagoya Math. J. 208 (2012), 265–316.
[17] J. Igusa, Some results on p-adic complex powers, Amer. J. Math. 106 (1984), 1013–1032,
[18] T. Kimura, Introduction to Prehomogeneous Vector Spaces, American Mathematical Society,
Providence, RI, 1998.
[19] T. Kogiso, Simple calculation of the residues of the adelic zeta function associated with the
space of binary cubic forms, J. Number Th. 51 (1995), 233–248.
[20] G. Lusztig and N. Spaltenstein, Induced unipotent classes, J. London Math. Soc. (2) 19 (1979),
41–52.
[21] C. Moeglin and J.-L. Waldspurger, Spectral Decomposition and Eisenstein Series, Cambridge
University Press, Cambridge, 1995.
[22] H. Saito, Convergence of the zeta functions of prehomogeneous vector spaces, Nagoya Math.
J. 170 (2003), 1–31.
[23] M. Sato and T. Shintani, On zeta functions associated with prehomogeneous vector spaces,
Ann. of Math. 100 (1974), 131–170.
[24] T. Shintani, On Dirichlet series whose coefficients are class numbers of integral binary cubic
forms, J. Math. Soc. Japan 24 (1972), 132–188.
[25] T. Shintani, On the zeta-functions associated with the vector space of quadratic forms, J. Fac.
Sci. Univ. Tokyo, Sect. IA Math. 22 (1975), 25–65.
[26] D. Wright, The adelic zeta function associated to the space of binary cubic forms. I. Global
theory, Math. Ann. 270 (1985), 503–534.
[27] A. Yukie, On the Shintani zeta function for the space of binary quadratic forms, Math. Ann.
292 (1992), 355–374.
Chapter 15
Zeta functions of groups and rings—functional
equations and analytic uniformity
C. Voll
Zeta functions are widely used tools in the study of asymptotic properties of infinite
groups and rings, in particular their subobject and representation growth. We survey
recent results on arithmetic and asymptotic features of such functions, focussing on
various classes of subobject zeta functions, in particular submodule zeta functions as-
sociated with nilpotent algebras of endomorphisms, and representation zeta functions
associated to arithmetic groups, specifically finitely generated nilpotent groups.1
that it converges absolutely for all s 2 C with Re.s/ > , gives the precise degree of
polynomial growth of sn :
log sn
˛ D lim sup :
n!1 log n
Example 15.1.1. A prime example of a zeta function of the type discussed is the
Dedekind zeta function K .s/ of a number field K, enumerating the ideals of finite
additive index (i.e. norm) in O , the ring of integers of K:
X 1
X
K .s/ D N.a/ s
D anG .O / n s ; (15.1.2)
aGO nD1
where N.a/ D jO W aj is the norm of the ideal a and anG .O / D #fa G O j N.a/ D
ng. The well-known fact that the abscissa of convergence of K .s/ is equal to 1 is
equivalent to the fact that the number snG .O / of ideals of O of norm at most n grows
linearly with n. In fact, K .s/ allows for meromorphic continuation to the whole
complex plane, with a simple pole at s D 1. It follows that
snG .O / cK n as n ! 1 (15.1.3)
for some constant cK . The famous analytic class number formula explains how cK
depends on arithmetic key invariants of the number field K. Note that, in contrast to
this subtle invariant, the abscissa of convergence ˛ of K .s/ and the order of the pole
arising at s D ˛ are independent of K.
Ideals in the Dedekind ring O factorise—essentially uniquely—as products of
prime ideals. This basic arithmetic fact is reflected analytically in the Euler product
decomposition Y
K .s/ D K;p .s/; (15.1.4)
p2Spec.O /n.0/
where Spec.O / is the set of prime ideals of O and, for a prime ideal p of O , the Euler
factor at p is defined as
1
X 1
K;p .s/ D N.pi / s
D s
: (15.1.5)
1 q
i D0
Here, we write q for the cardinality of the residue field O =p. In the special case
K D Q, we write .s/ D K .s/ for Riemann’s zeta function.
am . E Õ L / D
# fH 6 L j H is an .E C RidL /-submodule of L with jL W H j D mg :
This definition may be seen as an analogue of Solomon’s zeta function; cf. [32].
Submodule zeta functions generalise ideal zeta functions, enumerating (one- or two-
sided) ideals of finite index in rings of finite additive rank over R. In particular,
Dedekind’s zeta function K .s/ (cf. Example 15.1.1) is an example of a submodule
zeta function.
348 C. Voll
satisfied by all but finitely many factors of the Euler product (15.2.1) for integers
a; b; c. Here, “inverting q” is in general to be interpreted carefully in terms of the
inversion of certain Frobenius eigenvalues in suitable formulae for the local factors in
terms of the numbers of O =p-rational points of certain algebraic varieties associated
to .L; E /. In many special cases, however, the local factors are rational functions in
both q s and q, and the inversion of q may be interpreted naively in terms of these
rational functions.
Local functional equations of the form (15.2.2) are ubiquitous in the realm of
nilpotent submodule zeta functions, but not universal. In Theorem 15.2.4, we shall
describe a general, sufficient criterion for functional equations of the form (15.2.2),
together with an interpretation of the data .a; b; c/.
Zeta functions of groups and rings 349
Example 15.2.1. Consider the Lie ring M4 of (maximal) nilpotency class 4 with
presentation
M4 D hz; x1 ; x2 ; x3 ; x4 j Œz; x1 D x2 ; Œz; x2 D x3 ; Œz; x3 D x4 iZ : (15.2.3)
Here and in the sequel, we follow the convention that Lie brackets between generators
which do not follow from the specified ones by antisymmetry or the Jacobi identity
G
are assumed to be trivial. An explicit, uniform formula for the Euler factors M 4 ;p
.s/
G
of the ideal zeta function M4 .s/, enumerating ideals of M4 —viz. ad.M4 /-invariant
sublattices of Z 5 , the additive group underlying M4 —is given in [14, Thm. 2.37];
the formula—a rational function in p and p s —is involved, its proof is nontrivial.
Inspection reveals that it satisfies the functional equation
ˇ
G .s/ˇ
M4 ;p p!p 1 D p10 14s G .s/;
M4 ;p (15.2.4)
matching the template (15.2.2) with .a; b; c/ D .5; 52 ; 5 C 4 C 3 C 2/; our chosen
way of writing this data will fall into place with Theorem 15.2.4.
Example 15.2.2. Consider now the Lie ring Fil4 with presentation
Fil4 D hz; x1 ; x2 ; x3 ; x4 jŒz; x1 D x2 ; Œz; x2 D x3 ;
(15.2.5)
Œz; x3 D x4 ; Œx1 ; x2 D x4 iZ ;
differing from (15.2.3) only in the underlined additional relation. An explicit, uniform
G G
formula for the Euler factors Fil 4 ;p
.s/ of the ideal zeta function Fil 4
.s/, enumerating
ad.Fil4 /-invariant sublattices of Z 5 , is given in [14, Thm. 2.38]; it is of similar com-
G
plexity as the one yielding the Euler factors of M 4
.s/. Remarkably, however, it does
not satisfy a functional equation of the form (15.2.2).
The given presentations of M4 and Fil4 only differ in a single relation. As we shall
explain, it is this additional relation that destroys a certain “homogeneity condition”
(cf. Condition (15.2.3)) satisfied by M4 but not by Fil4 . Indeed, note that the five-
dimensional Z-module L underlying both M4 and Fil4 may be decomposed as
L D hz; x1 i ˚ hx2 i ˚ hx3 i ˚ hx4 i ; (15.2.6)
„ ƒ‚ … „ƒ‚… „ƒ‚… „ƒ‚…
DWL1 DWL2 DWL3 DWL4
with summands complementing the upper central series of M4 resp. Fil4 . The as-
sociative algebras generated by ad.M4 / and ad.Fil4 / are both generated by the two
elements ad.z/ and ad.x1 /. Note, however, that only M4 satisfies the condition
8c 2 fad.z/; ad.x1 /g; 8j 2 f1; 2; 3g W Lj c Lj C1 : (15.2.7)
Indeed, in Fil4 we have .x2 / ad.x1 / 2 L4 .
350 C. Voll
Lj ck Lj C1 : (15.2.8)
Theorem 15.2.4 ([38, Thm. 1.2]). Assume that .L; E / satisfies Condition 15.2.3.
Then, for almost all p 2 Spec.O /, the following functional equation holds:
ˇ P
ˇ n
n . 2/ s
c 1
i D0 Ni
Ep ÕLp .s/ˇ 1
D . 1/ q Ep ÕLp .s/:
q!q
paper [26] contains explicit formulae for local submodule zeta functions enumerat-
ing submodules invariant under a single endomorphism; that this endomorphism is
nilpotent is an important special case, to which the general case is reduced. Like-
wise, Condition 15.2.3 always holds whenever E 2 D 0, viz. c 6 2. In this case,
Theorem 15.2.4 is a mild generalisation of [36, Thm. C]. While the Lie ring Fil4 in
Example 15.2.2 has class c D 4, there are examples of nilpotent Lie rings of class
c D 3 whose Euler factors do not satisfy a functional equation of the form (15.2.2);
see, for instance, [14, Thm. 1.1].
Remark 15.2.6. [38, Thm. 1.2] is more general than Theorem 15.2.4 in as much as
it applies uniformly to the local submodule zeta functions associated to pairs of the
form .L.O/; E .O//, where O is any finite extension of the local ring Op and L.O/ D
L ˝O O and E .O/ D E ˝O O. These zeta functions enumerate E .O/-invariant
submodules of the O-modules L.O/. To control the variation of the distribution of
the E .Op /-invariant submodules of L.O/, viewed as Op -modules after restriction of
scalars, is a much more delicate task. Local zeta functions obtained in this way from
the Heisenberg (Z-) Lie lattice of strictly upper-triangular 33-matrices are studied
in [29, 30]; see also [38, Sec. 5.2].
Rather than study the subcomplex SubModEp directly, it proved advantageous to de-
352 C. Voll
3. the function „.s/ WD Ep ÕLp .s/=AG .s/ is a “simple” rational function in q s
,
whose coefficients only depend on the rank data .Ni /ciD1 , which satisfies
Pc 1
s Ni
„.s/jq!q 1 D q i D0 „.s/:
The key idea in the construction of the weight function m is to define an equiva-
lence relation on Vn whose equivalence classes C form posets naturally isomorphic
to .Z; 6/ with the property that the intersection C>0 WD C \SubModEp coincides with
the “non-negative part” of C . Thus
1 X X
s
Ep ÕLp .s/ D ns
jLp W ƒmax j :
1 q
C 2Vn = Œƒ2C>0
„ ƒ‚ …
DW„C .s/
>0
More precisely, we say that homothety classes Œƒ1 and Œƒ2 are -equivalent if there
exists M 2 N0 such that ƒ1 ıM 2 Œƒ2 . It is easy to check that—provided c > 1,
as we may assume without loss of generality—every -class C contains a (unique)
homothety class Œƒ0 such that
˚
C>0 D Œƒ0 ıM j M 2 N0 :
In other words, exactly half of C consists of Ep -invariant sublattices, viz. the “half-
ray” C>0 generated by Œƒ0 . For any weight function m satisfying (1), the generating
function
1 1 X X
ns
AG .s/ D ns
jLp W ƒmax j s q sm.Œƒ/
1 q 1 q
C 2Vn = Œƒ2C
„ ƒ‚ …
DW„C .s/
Zeta functions of groups and rings 353
overcounts, namely exactly along the “negative half-rays” C<0 WD C n C>0 consisting
of non-Ep-invariant lattices classes. The critical (and most technical) part of the proof
of Theorem 15.2.4 is now to design m such that both (1) and (2) hold and, crucially,
that correcting this overcounting is easy. More precisely, m is designed so that both
sums „C>0 .s/ and „C<0 .s/ are simple geometric progressions which only mildly
depend on C , via the index of ƒ0;max in Lp . “Stripping off” the summation over C<0
may thus be done uniformly over all C 2 Vn = by multiplying AG .s/ by „.s/,
a rational function combining the two relevant geometric progressions in a simple
manner; cf. (3).
The feasibility of this approach rests crucially on the following elementary con-
sequence of Condition 15.2.3: conjugating the generators ck of E with the staggered
homothety map (15.2.9) amounts to multiplying them by the uniformiser . This
allows us to achieve property (2), by ensuring that AG .s/ is amenable to the p-adic
integration templates provided in [36].
Corollary 15.2.8. For almost all primes p, the following functional equation holds:
ˇ N0
P
ˇ c 1
p. 2 /
s Ni
FGc;d ;p .s/ˇ 1
D . 1/ N0 i D0
FGc;d ;p .s/:
p!p
These symmetries had been conjectured by du Sautoy and Woodward; cf. [14,
Thm. 1.3]. While homogeneity follows, for the groups Fc;d , essentially from their
freeness, we are not aware of a general group-theoretic criterion that would imply lo-
cal functional equations for local normal zeta functions of finitely generated nilpotent
groups.
354 C. Voll
15.2.4 Connections with Igusa’s local zeta function Labelling Condition 15.2.3
one of “homogeneity” is a nod towards one of the earliest results establishing local
functional equations for p-adic integrals which are related to algebraic enumeration
problems. Recall that Igusa’s local zeta function associated to a polynomial f 2
Zp ŒX1 ; : : : ; Xn (where p is a rational prime and Zp the ring of p-adic integers) is
the p-adic integral Z
Zf;p .s/ D jf .x/jsp d; (15.2.10)
Zn
p
where j jp denotes the p-adic absolute norm, s is a complex variable, and de-
notes the additive Haar measure on Znp , normalised so that .Znp / D 1. The integral
Zf;p .s/ encodes the same information as the Poincaré series enumerating the num-
bers n n o
x 2 Z=.pN / j f .x/ 0 mod .pN /
for all N 2 N; cf. [8]. To a polynomial f 2 ZŒX1 ; : : : ; Xn one may associate a
local zeta function Zf;p .s/ for each prime p, by considering the coefficients of f
p-adically. In the case that f is homogeneous of degree d , say, Denef and Meuser
established a (local) functional equation for Zf;p .s/ of the form
ˇ
Zf;p ˇp!p 1 .s/ D p ds Zf;p .s/; (15.2.11)
valid for all but finitely many p; cf. [9]. Broadly speaking, homogeneity of f allows
one to rewrite the affine integral (15.2.10) as an integral over projective space, even-
tually bringing to bear the Weil conjectures on zeta functions of smooth projective
(!) algebraic varieties. Denef and Meuser’s seminal work served as a template for
the proof of various local functional equations in asymptotic group and ring theory,
notably the results in [36, 1], and—maybe least but likely not last—[38].
Functional equations of the form (15.2.11) may or may not hold for Igusa’s lo-
cal zeta functions Zf;p .s/ associated to an inhomogeneous polynomial f . In con-
trast, I am not aware of a submodule zeta function satisfying the conclusion of Theo-
rem 15.2.4 but not the homogeneity Condition 15.2.3.
15.2.5 Variations The concept of the submodule zeta function associated to a pair
.L; E / has numerous analogues and refinements.
The subring zeta function of a (not necessarily associative or unital) ring L of
finite additive rank (viz. a finite-rank Z-module with some bi-additive multiplication)
is the Dirichlet generating function enumerating the subrings (viz. multiplicatively
closed sublattices) of L of finite index in L. Functional equations for almost all of
the Euler factors of these zeta functions were established in [36, Thm. A]. It strikes me
as remarkable that local functional equations for the “quadratic” problem of counting
subrings hold regardless of conditions such as Condition 15.2.3 required for local
functional equations for the “linear” problem of counting invariant submodules.
An interesting generalisation of both classes of counting problems are graded
Lr functions, introduced in [25]. Here, one fixes an additive decomposi-
subobject zeta
tion L D i D1 Li of a lattice L and enumerates only subobjects (say subrings or
Zeta functions of groups and rings 355
nD1
A first hint at the value of the point of view taken comes from the fact that the
“coarse” Euler product (15.3.2) generalises to an Euler product of the form
Y
e
irr e
irr
G .OL / .s/ D G .OL;p / .s/I (15.3.3)
p2Spec.OL /n.0/
cf. [33, Prop. 2.2]. The Euler factors enumerate the finite-dimensional irreducible
continuous representations of the pro-p groups G .OL;p / up to twists by continuous
one-dimensional representations. The “fine” Euler product (15.3.3) reflects the Con-
gruence Subgroup Property and strong approximation for unipotent group schemes.
Theorem 15.3.1 ([11, Thm. A]). Let ƒ be a nilpotent OK -Lie lattice with associated
unipotent group scheme G D G ƒ as above. There exist constants a.G /, ı.G / 2
Q >0 , and ˇ.G / 2 N such that, for all finite extensions L=K, the following hold.
1. The abscissa of convergence of e irr
.s/ is the rational number a.G /.
G .OL /
e
irr
2. The zeta function G .s/ has meromorphic continuation (at least) to the
.OL /
half-plane fs 2 C j Re.s/ > a.G / ı.G /g. On the line fs 2 C j Re.s/ D
a.G/g the continued zeta function is holomorphic except for a pole at s D
a.G/, of order ˇ.G /.
There exists a constant c.G .OL // 2 R>0 such that
N
X
r n G .OL / c.G .OL //N a.G / .log N /ˇ.G /
e 1
as N ! 1: (15.3.5)
nD1
Example 15.3.3. Consider the group scheme H associated to the Heisenberg (Z-)
Lie lattice of strictly upper-triangular 3 3-matrices. If OL is the ring of integers
of a number field L, then H .OL / is the group of upper-unitriangular 3 3-matrices,
a finitely generated torsion-free nilpotent group of Hirsch length 3ŒL W Q. By [33,
Thm. B], the representation zeta function of H .OL / is given by the formula
L .s 1/ Y 1 q s
e
irr
H .OL / .s/ D D I (15.3.6)
L .s/ 1 q1 s
p2Spec.OL /n.0/
cf. Eqs. (15.1.4) and (15.1.5). By the other well-known properties of the Dedekind
zeta function K .s/ recalled in Example 15.1.1, one reads off from (15.3.6) that
Zeta functions of groups and rings 359
1
a.H / D 2, ˇ.H / D 1, and c.H .OL // D 2L .2/
, whence
N
X 1
r n .H .OL //
e N2 as N ! 1: (15.3.7)
nD1
2L .2/
Note that the special value L .2/ depends on L in a subtle way; cf. [39, Thm. 1]. In
the special case that L D Q, the formula
X1
e .s 1/
irr
H .Z/ .s/ D D '.n/ n s ;
.s/ nD1
(where ' is Euler’s totient function) is implicit in the work [24] by Magid and Nunley,
who worked out explicit representatives of the twist-isoclasses of irreducible repre-
sentations of the integral Heisenberg group. The special case
N
X 3 2
'.n/ N as N ! 1;
nD1
2
2
obtained from (15.3.7) using the well-known identity .2/ D 6 , due to Euler, is
an elementary exercise in analytic number theory. For quadratic number fields, the
formula (15.3.6) is due to Ezzat; cf. [15].
Thm. 7.5]) or A2 ([2, Thm. A]), where explicit formulae for almost all of the repre-
sentation zeta functions of the non-archimedean completions G .Op / of G .OS / are
available. In particular, it is not known whether the feasibility of (uniform) mero-
morphic continuation beyond the (rational) abscissa of convergence is a fluke for
these “small” root systems or a general feature. In [40], M. Zordan computes the
representation zeta functions enumerating continuous representations of the princi-
pal congruence subgroups SL14 .Op /; an explicit formula for the representation zeta
functions of the p-adic analytic groups SL4 .Op / seems currently out of reach.
15.3.4.2 Heisenberg groups over truncated polynomial rings Theorem 15.3.1 de-
scribes the uniform variation of analytic key invariants of representation zeta func-
tions associated to groups of rational points of certain unipotent group schemes G
under base change with number rings OL . While these kinds of base extensions
may be natural from a number-theoretic perspective, extensions by other, more exotic
rings may look just as natural from a group theorist’s perspective. In [10], Duong
studies Heisenberg groups over rings of the form O Œx=.x n/. The following result
generalises (15.3.6):
Theorem 15.3.4 ([10]). Let OL be the ring of integers of a number field L. For
n 6 3, one has
n
Y
e
irr L .i s 2i C 1/
H n
.OL Œx=.x // .s/ D :
L .i s 2i C 2/
i D1
Note that the three zeta functions for n D 1; 2; 3 share the abscissa of convergence
(viz. ˛ D 2) and may be continued meromorphically to the whole of C, but that the
order of the pole at s D 2 is n. The simple “multiplicative” form of (15.3.4) belies
the complicated “additive” computations carried out in their proofs, viz. the hands-on
computations of specific p-adic integrals.
Duong conjectures that (15.3.4) holds for all n. If it does, it is tempting to in-
terpret the limit of the right hand of the identity in Theorem 15.3.4 as n ! 1 as a
representation zeta function associated to the group H .OL Jx K/. There is, as yet, no
general theory describing the behaviour of representation zeta functions of groups of
the form G .O Œx=.x n//—let alone the completions G .OJx K/—for general unipotent
group schemes G and varying O and n. A few examples for n D 2 (i.e., base change
to “dual numbers”) are discussed in [27, Sec. 6.4]; cf. also [27, Qst. 7.3].
for n D 1 and m 2 f1; 2g, defined in [20, Ex. 2.13] (in notation differing from [20] but
consistent with [18, p. 165]). For m D 2, the resulting formulae cannot be described
as a finite product of translates of Dedekind zeta functions and their inverses, but still
invite a Coxeter group-theoretic interpretation.
References
[1] N. Avni, B. Klopsch, U. Onn and C. Voll, Representation zeta functions of compact p-adic
analytic groups and arithmetic groups, Duke Math. J. 162 (2013), 111–197.
[2] N. Avni, B. Klopsch, U. Onn and C. Voll, Similarity classes of integral p-adic matrices and
representation zeta functions of groups of type A2 Proc. Lond. Math. Soc. 112(3) (2016),
267–350.
[3] N. Avni, B. Klopsch, U. Onn and C. Voll, Arithmetic groups, base change, and representation
growth, Geom. Funct. Anal. 26 (2016), 67–135.
[4] M.N. Berman, Uniformity and functional equations for local zeta functions of K-split alge-
braic groups, Amer. J. Math. 133 (2011), 1–27.
[5] M.N. Berman and B. Klopsch, A nilpotent group without local functional equations for pro-
isomorphic subgroups, J. Group Th. 18 (2015), 489–510.
[6] F. Brenti and A. Carnevale, Proof of a conjecture of Klopsch-Voll on Weyl groups of type A,
Trans. Amer. Math. Soc. 369 (2017), 7531–7547.
[7] R. Cluckers and F. Loeser, Constructible exponential functions, motivic Fourier transform and
transfer principle, Ann. of Math., 171(2) (2010), 1011–1065.
[8] J. Denef, Report on Igusa’s local zeta function, Sém. Bourbaki 43 (1990–91), 359–386.
[9] J. Denef and D. Meuser, A functional equation of Igusa’s local zeta function, Amer. J. Math.
113 (1991), 1135–1152.
[10] H.D. Duong, Representation growth of the Heisenberg group over OŒx=.x 3 /, J. Algebra
Appl. 16 (2017), 1750077 (13 pages).
[11] H.D. Duong and C. Voll, Uniform analytic properties of representation zeta functions of
finitely generated nilpotent groups, Trans. Amer. Math. Soc. 369 (2017), 6327–6349.
[12] M.P.F. du Sautoy and F.J. Grunewald, Analytic Properties of zeta functions and subgroup
growth, Ann. of Math., 152(2) (2000), 793–833.
362 C. Voll
[13] M.P.F. du Sautoy and F.J. Grunewald, Zeta functions of groups and rings. In Proceedings of
the International Congress of Mathematicians, Madrid, August 22–30, 2006, Vol. II, European
Mathematical Society, Zürich, 2006, 131–149.
[14] M.P.F. du Sautoy and L. Woodward, Zeta Functions of Groups and Rings, Springer, Berlin,
2008.
[15] S. Ezzat, Counting irreducible representations of the Heisenberg group over the integers of a
quadratic number field, J. Algebra 397 (2014), 609–624.
[16] F.J. Grunewald, D. Segal and G.C. Smith, Subgroups of finite index in nilpotent groups, In-
vent. Math. 93 (1988), 185–223.
[17] E. Hrushovski, B. Martin, S. Rideau and R. Cluckers, Definable equivalence relations and zeta
functions of groups, J. Eur. Math. Soc. (JEMS) 20 (2018), 2467–2537.
[18] J.-I. Igusa, An Introduction to the Theory of Local Zeta Functions, AMS/IP Studies in Ad-
vanced Mathematics vol. 14, American Mathematical Society, Providence, RI, 2000.
[19] A. Jaikin-Zapirain, Zeta function of representations of compact p-adic analytic groups,
J. Amer. Math. Soc. 19 (2006), 91–118.
[20] T. Kimura, Introduction to Prehomogeneous Vector Spaces, Translations of Mathematical
Monographs 215, American Mathematical Society, Providence, RI (2003).
[21] B. Klopsch, Representation growth and representation zeta functions of groups, Note Mat. 33
(2013), 107–120.
[22] S. Lee and C. Voll, Enumerating graded ideals in graded rings associated to free nilpotent Lie
rings, Math. Z. (2018), 1–28.
[23] A. Lubotzky, A. Mann and D. Segal, Finitely generated groups of polynomial subgroup
growth, Israel J. Math. 82 (1993), 363–371.
[24] A.R. Magid and A. Lubotzky, Varieties of representations of finitely generated groups,
Mem. Amer. Math. Soc. 58 (1989).
[25] T. Rossmann, Computing local zeta functions of groups, algebras, and modules, Trans. Amer.
Math. Soc. 370 (2018), 4841–4879.
[26] T. Rossmann, Enumerating submodules invariant under an endomorphism, Math. Ann. 368
(2016), 391–417.
[27] T. Rossmann, Topological representation zeta functions of unipotent groups, J. Algebra 448
(2016), 210–237.
[28] T. Rossmann, Zeta version 0.3.2, https://www.math.uni-bielefeld.de/~rossmann/Zeta/.
[29] M.M. Schein and C. Voll, Normal zeta functions of the Heisenberg groups over number rings
I — the unramified case, J. Lond. Math. Soc. 91 (2014), 19–46.
[30] M.M. Schein and C. Voll, Normal zeta functions of the Heisenberg groups over number rings
II — the non-split case, Israel J. Math. 211 (2016), 171–195.
[31] R. Snocken, Zeta Functions of Groups and Rings, PhD thesis, University of Southampton
(2014), https://eprints.soton.ac.uk/372833/.
[32] L. Solomon, Zeta functions and integral representation theory, Adv. Math. 26 (1977), 306–326.
[33] A. Stasinski and C. Voll, Representation zeta functions of nilpotent groups and generating
functions for Weyl groups of type B, Amer. J. Math. 136 (2014), 501–550.
Zeta functions of groups and rings 363
[34] A. Stasinski and C. Voll, Representation zeta functions of some nilpotent groups associated to
prehomogeneous vector spaces, Forum Math. 29 (2017), 717–734.
[35] A. Stasinski and C. Voll, A new statistic on the hyperoctahedral groups, Electron. J. Combin.
20 (2013), paper 50 (23 pages).
[36] C. Voll, Functional equations for zeta functions of groups and rings, Ann. of Math. 172(2)
(2010), 1181–1218.
[37] C. Voll, Zeta functions of groups and rings—recent developments. In Groups St Andrews
2013, London Math. Soc. Lecture Note Ser. 422, Cambridge University Press, Cambridge,
2015, 469–492.
[38] C. Voll, Local Functional Equations for Submodule Zeta Functions Associated to Nilpotent
Algebras of Endomorphisms, Int. Math. Res. Not. IMRN (2017);
https://doi.org/10.1093/imrn/rnx186.
[39] D. Zagier, Hyperbolic manifolds and special values of Dedekind zeta-functions, Invent. Math.
83 (1986), 285–301.
[40] M. Zordan, Representation Zeta Functions of Special Linear Groups, PhD thesis, Bielefeld
University, 2016, https://pub.uni-bielefeld.de/publication/2902722.
Chapter 16
Conjectures of Brumer, Gross and Stark
A. Nickel
Introduction
Let L=K be a finite Galois extension of number fields with Galois group G. To each
finite set S of places of K containing all the archimedean places, one can associate
a so-called ‘Stickelberger element’ S in the centre of the group algebra CŒG. This
element is constructed from values at s D 0 of the S -truncated Artin L-series attached
to the complex irreducible characters of G. In particular, S is analytic in nature. By
a result of Siegel [34] one knows that S always has rational coefficients.
Let L and clL be the roots of unity and the class group of L, respectively. Sup-
pose that S also contains all places of K which ramify in L=K. Then it was inde-
pendently shown by Cassou-Nogues [8], Deligne–Ribet [14] and Barsky [1] that for
abelian G one has
AnnZŒG .L /S ZŒG;
where AnnR .M / denotes the annihilator ideal of M regarded as a module over the
ring R. In other words, the coefficients of S are almost integral. Now Brumer’s
conjecture simply asserts that AnnZŒG .L /S annihilates clL . In the case K D Q
Brumer’s conjecture is just Stickelberger’s theorem from the late 19th century [36].
Roughly speaking, the conjecture predicts that an analytic object gives constraints on
the structure of an arithmetic object. It is this kind of conjecture which is often called
a ‘Stark-type conjecture’.
In fact, Harold Stark suggested the following refinement of Brumer’s conjecture.
Let wL be the cardinality of L and fix a fractional ideal a in L. We will denote the
action of G on a and on its class in clL by exponents on the right as usual. Then the
Brumer–Stark conjecture not only predicts that awL S becomes principal, but also
gives precise information about a generator of that ideal.
In this survey article we explain recent generalisations of these conjectures to
arbitrary, not necessarily abelian Galois extensions; these are due to the author [27]
1 Project C7
366 A. Nickel
and, independently and in even greater generality, due to Burns [3]. A further, slightly
different approach has been developed by Dejou and Roblot [13].
We discuss the relation to further conjectures in the field such as the equivariant
Tamagawa number conjecture of Burns and Flach [6] and the main conjecture of
equivariant Iwasawa theory which (under a suitable condition) has been proven by
Ritter–Weiss [33] and, independently, by Kakde [25]. A conjecture of Gross [18] on
the behaviour of p-adic Artin L-series at s D 0 also plays a pivotal role.
Roughly speaking, the latter conjecture asserts that (i) the order of vanishing at
s D 0 of the p-adic L-series coincides with the order of vanishing at s D 0 of a
corresponding complex L-series and (ii) the special values of the p-adic and the com-
plex L-series at s D 0 coincide up to some explicit p-adic regulator. Considerable
progress has been made in the recent years by Spiess [35] and Burns [4] on part (i),
and by Dasgupta, Kakde and Ventullo [12] on part (ii). We also explain their results
and the relation to the equivariant Tamagawa number conjecture due to Burns [4] as
well as the relation to the (non-abelian) Brumer–Stark conjecture due to Johnston and
the author [23].
We provide no proofs unless they are short and we feel that it might help for
a better understanding. Instead we include some examples to illustrate occurring
obstacles and the underlying ideas how to overcome them.
Notation and conventions All rings are assumed to have an identity element and all
modules are assumed to be left modules unless otherwise stated. Unadorned tensor
products will always denote tensor products over Z. We fix the following notation:
R the group of units of a ring R
.R/ the centre of a ring R
AnnR .M / the annihilator ideal of the R-module M
Mmn .R/ the set of all m n matrices with entries in a ring R
K1 the cyclotomic Z p -extension of the number field K
K the roots of unity of a field K
clK the class group of a number field K
Kc an algebraic closure of a field K
IrrF .G/ the set of F -irreducible characters of the (pro)-finite group G
(with open kernel) where F is a field of characteristic 0
16.1 Preliminaries
16.1.1 Ray class groups Let L=K be a finite Galois extension of number fields with
Galois group G. For each place v of K we fix a place w of L above v and write Gw
and Iw for the decomposition group and inertia subgroup of L=K at w, respectively.
When w is a finite place, we choose a lift w 2 Gw of the Frobenius automorphism
at w; moreover, we write Pw for the associated prime ideal in L and j jw for the
corresponding absolute value. We denote the cardinality of the residue field of K at
v by N.v/.
Conjectures of Brumer, Gross and Stark 367
For any set S of places of K, we write S.L/ for the set of places of L which
lie above those in S . Now let S be a finite set of places of K containing the set
S1 D S1 .K/ of archimedean places and let T be a second finite set of places of
K such that S \ Q T D ;. We write clTL for the ray class group of L associated to
the ray MTL WD w2T .L/ Pw and OL;S for the ring of S.L/-integers in L. Let
OL WD OL;S1 be the ring of integers in L. Let Sf be the set of all finite places in
S ; then there is a natural map ZŒSf .L/ ! clTL which sends each place w 2 Sf .L/
to the corresponding class ŒPw 2 clTL . We denote the cokernel of this map by clTL;S .
When T is empty, we abbreviate cl;L;S to clL;S so that in particular clL;; D clL is
the usual class group
˚ of L. Moreover, we denote
the S.L/-units of L by EL;S and
T
define EL;S WD x 2 EL;S W x 1 mod MTL . All these modules are equipped with
a natural G-action and we have an exact sequence of ZŒG-modules
T
0 ! EL;S ! EL;S ! .OL;S =MTL / ! clTL;S ! clL;S ! 0;
where the map lifts an element x 2 .OL;S =MTL / to x 2 OL;S and sends it to the
ideal class Œ.x/ 2 clTL;S of the principal ideal .x/.
Each element in CŒG may be viewed as a complex valued function on G. The ir-
reducible characters constitute
Q a basis of the centre and we thus have a canonical
isomorphism .CŒG/ ' 2IrrC .G/ C. We define the equivariant S -truncated Artin
L-function to be the meromorphic .CŒG/-valued function
LS .s/ WD LS .s; / 2IrrC .G/ :
and
ıT .s/ WD ıT .s; / 2IrrC .G/
:
We set
‚S;T .s/ WD ıT .s/ LS .s/] ;
where ] W CŒG ! CŒG denotes the anti-involution induced by g 7! g 1 for g 2 G.
Note that LS .s/] D .LS .s; //
L 2IrrC .G/ where L denotes the character contragredient
368 A. Nickel
to . The functions ‚S;T .s/ are the so-called .S; T /-modified G-equivariant L-
functions, and we define Stickelberger elements
Note that a priori we only have ST 2 .CŒG/, but by a result of Siegel [34] we know
that ST has rational coefficients. If T is empty, we abbreviate ST to S .
Remark 16.1.1. Let 2 IrrC .G/ and let rS ./ be the order of vanishing of LS .s; /
at s D 0. Then by [37, Chapitre I, Proposition 3.4] one has
X
rS ./ D dimC .VGw / dimC .VG /: (16.1.1)
v2S
Thus if either is non-trivial and S contains an (infinite) place v such that VGw ¤ 0
or is trivial and jS j > 1 then the -part of ST vanishes. Now suppose that S con-
tains all ramified primes. Then if ST is non-trivial, precisely one of the following
possibilities occurs: (i) K is totally real and L is totally complex, (ii) K is an imag-
inary quadratic field, L=K is unramified and S D S1 or (iii) L D K D Q and
S D S1 .
whenever S contains the set Sram of all places of K that ramify in L=K. We now state
Brumer’s conjecture as discussed by Tate [37].
Conjecture 16.2.1 (B.L=K; S /). Let L=K be an abelian extension of number fields
with Galois group G and let S be a finite set of places of K containing the set S1
and all places of K that ramify in L=K. Then one has
Remark 16.2.3. Consider the three cases of Remark 16.1.1. In case (iii),
Brumer’s conjecture (and also the Brumer–Stark conjecture below) is trivial. Case
(ii) follows from the fact that each ideal of L becomes principal in the Hilbert class
field of L (see [17, Remark 6.3]). Finally, case (i) can be reduced to the case that L
is a CM-field (see [17, Proposition 6.4]).
Conjecture 16.2.5 (BS.L=K; S /). Let L=K be an abelian CM-extension with Ga-
lois group G and let S be a finite set of places of K containing the set S1 and all
places of K that ramify in L=K. Then, for every non-zero fractional ideal a in L,
there is an anti-unit 2 L such that
1. awL S D ./.
p
2. The extension L. wL /=K is abelian.
Here, the first part almost follows from B.L=K; S /, and Harold Stark originally
suggested that the second condition might hold. The conjecture was first stated in
published form by Tate [37].
T
Remark 16.2.7. Note that EL;S is torsionfree whenever T contains at least two
places of different residue characteristic or at least one place of sufficiently large
norm.
370 A. Nickel
Lemma 16.2.8. Let S be a finite set of places of K containing Sram [ S1 . Then, the
elements 1 N.v/w 1 , where v runs through all the finite places of K such that the
sets S and Tv WD fvg satisfy Hyp.S; Tv /, generate AnnZŒG .L /. Moreover, if we
restrict to totally decomposed primes v, the greatest common divisor of the integers
1 N.v/ equals wL .
Corollary 16.2.9. Let L=K be an abelian extension of number fields and let S be
a finite set of places of K containing the set S1 and all places of K that ramify in
L=K. Then, Brumer’s conjecture B.L=K; S / holds if and only if
ST 2 AnnZŒG .clL /
for all finite sets T of L such that Hyp.S; T / is satisfied.
Proof. We have ST D ıT .0/S and ıT .0/ 2 AnnZŒG .L / whenever Hyp.S; T / is
satisfied. As ıTv .0/ D 1 N.v/w 1 , the result follows from Lemma 16.2.8.
Let NL=K W L ! K be the field-theoretic norm map. For 2 L we define
S WD fv finite place of K W v j NL=K ./g:
Remark 16.2.11. In many cases, one can omit the condition that is an anti-unit:
Suppose that the order of a in the class group is odd. Then, we may write a D b2 .u/
for some u 2 L . Now assume that bwL S is principal and generated by some
ˇ 2 L such that (16.2.2) holds with replaced with ˇ. As .1 j /S D 2S , we
then have awL S D ./, where WD uwL S ˇ 1 j is an appropriate anti-unit.
We still have the Stickelberger elements ST at our disposal. In view of Corollary
16.2.9 one is tempted to conjecture that ST still annihilates the class group whenever
Hyp.S; T / is satisfied. Although the Stickelberger elements always belong to the
centre of QŒG, it is, however, in general not true that ST has integral coefficients.
Thus ST does in general not even act on the class group!
3
p ispdue to Nomura [32, §6]. Let ˛ be a root of x
Example 16.3.1. This example
11x C 7 and set L D Q. 3; 4001; ˛/. Then, L=Q is a Galois CM-extension
with Galois group G isomorphic to Z=2Z S3 , where j generates the first factor and
S3 denotes the symmetric group on 3 letters. We write
S3 D h; j 3 D 2 D 1; 1
D 1
i:
Then Sram D f3; 4001g and for S D Sram [ S1 and T D f7g one has
1
ST D .1 j / 3410 1774. C 2 / C 44. C C 2 /
3
which visibly does not belong to ZŒG.
The idea is to replace the centre of ZŒG by a larger ring I .G/ such that ST always
belongs to I .G/ and such that I .G/ D ZŒG when G is abelian. In order to achieve
annihilators, one then has to multiply by a certain ‘denominator ideal’ H.G/. We
next introduce these purely algebraic objects.
16.3.2 Denominator ideals and the integrality ring Let R be a Noetherian inte-
grally closed domain with field of fractions E. Let A be a finite-dimensional sepa-
rable E-algebra and let A be an R-order in A. Our main examples are group rings
A D RŒG and A D EŒG, where R and E are either Z and Q or Z p and Q p for
a prime p, respectively. The reduced norm map nr D nrA W A ! .A/ is defined
component-wise on the Wedderburn decomposition of A and extends to matrix rings
over A (see [10, §7D]). We choose a maximal R-order M such that A M A.
Following [20, §3.6], for every matrix H 2 Mnn .A/ there is a generalised adjoint
matrix H 2 Mnn .M/ such that H H D HH D nr.H / 1nn (note that the
conventions in [20, §3.6] slightly differ from those in [26]). If HQ 2 Mnn .A/ is a
second matrix, then .H HQ / D HQ H . We define
Hence H.A/ is an ideal in the commutative R-order I .A/. We will refer to H.A/ and
I .A/ as the denominator ideal and the integrality ring of the R-order A, respectively.
372 A. Nickel
Remark 16.3.2. The integrality ring is the smallest subring of .A/ that contains
.A/ and the image of the reduced norm of all matrices with entries in A. The de-
nominator ideal measures the failure of the generalised adjoint matrices having coef-
ficients in A.
Proposition 16.3.3. Let p be prime and G be a finite group. Then, one has Hp .G/ D
.Z p ŒG/ if and only if p does not divide the order of the commutator subgroup of
G. Moreover, in this case, we have Ip .G/ D .Z p ŒG/.
1
where D .E./=E/ denotes the inverse different of the extension
E./ WD E..g/ W g 2 G/ over E D Quot.R/
and the sum runs over all irreducible characters of G modulo Galois action. It is clear
from the definition that we always have F .A/ H.A/. As above we set
F .G/ WD F .ZŒG/; Fp .G/ WD F .Z p ŒG/:
Example 16.3.4. Let p and ` be primes with ` odd. We compute the denominator
ideals of Z p ŒD2` , where D2` denotes the dihedral group of order 2`. In the case
` D 3, one has D6 ' S3 , the symmetric group on three letters. We let Mp .D2` / be
a maximal Z p -order containing Z p ŒD2` . Then
(
.Z p ŒD2` /; if p ¤ `,
Hp .D2` / D
Fp .D2` /; if p D `;
(
.Z p ŒD2` /; if p ¤ `,
Ip .D2` / D
.Mp .D2` // if p D `.
In fact, in the case that p ¤ `, the result follows from Proposition 16.3.3. In the case
p D `, the result is established in [20, Ex. 6].
Conjectures of Brumer, Gross and Stark 373
As the integrality ring I .G/ is always contained in the centre of the maximal
order M.G/, we may also state the following considerably weaker conjecture.
the integrality conjectures of this section naturally decompose into local conjectures
at each prime p.
Example 16.3.10. Consider the Galois CM-extension L=Q from Example 16.3.1.
As before let S D Sram [ S1 and T D f7g. We have seen that in this case ST does
not lie in .Z 3 ŒG/. However, one has
71 1 13 37
ST D nr .1 j / C 11 2 C 19 C C 2 2 I3 .G/:
2 2 2 2
Theorem 16.3.11. Both Conjecture 16.3.7 and Conjecture 16.3.8 hold when L=K is
abelian.
Proof. Lemma 16.2.8 implies that AS D AnnZŒG .L /. Then, the result follows
from (16.2.1) and the fact that I .G/ D ZŒG in this case.
Recall that a finite group is called monomial if each of its irreducible characters is
induced by a linear character of a subgroup. The class of monomial groups includes
all nilpotent groups [10, Thm. 11.3] and, more generally, all supersoluble groups [38,
Ch. 2, Cor. 3.5].
Theorem 16.3.12. Let L=K be a Galois extension with Galois group G ' H C ,
where H is monomial and C is abelian. Let S be a finite set of places of K containing
Sram [ S1 . Then, we have an inclusion
AS S .M.H //ŒC :
Proof. This is due to the author [30, Thm. 1.2]. The proof heavily relies on the
abelian case and functoriality of Artin L-functions.
For non-abelian extensions, unconditional results on Conjecture 16.3.7 are
rather sparse. Here we only mention the following special case of [30, Cor. 5.12].
Corollary 16.3.13. Let ` be an odd prime. Let L=K be a Galois CM-extension with
Galois group isomorphic to D4` , the dihedral group of order 4`. Then, Conjecture
16.3.7 holds.
Proof. We first note that D4` ' D2` C2 with C2 WD Z=2Z and that dihedral
groups are monomial. Taking Example 16.3.4 into account, we see that the p-part of
Conjectures of Brumer, Gross and Stark 375
Conjecture 16.3.7 directly follows from Theorem 16.3.12 if p is odd. Now consider
the case p D 2. Let N be the commutator subgroup of D2` so that D2` =N ' C2 .
It follows from [21, Proposition 2.13] that the group ring Z 2 ŒD2` is ‘N -hybrid’
meaning that it decomposes into a direct product of Z 2 ŒD2` =N ' Z 2 ŒC2 and some
maximal order (see Definition 16.4.23 below). As
!L S 2 I .G/:
!L S 2 .M.G//:
Proof. It suffices to show that !L S 2 Ap .G/ for each prime p, where Ap .G/ D
Ip .G/ in case (1) and Ap .G/ D .Mp .G// in case (2). By Lemma 16.2.8, there is
a totally decomposed place v0 of K (in fact infinitely many places) such that jL j D
.1 N.v0 // c, where c is a unit in Z p , and such that Hyp.S; T0 / is satisfied with
T0 WD fv0 g. As nr.c/ belongs to Ap .G/, we have
!L S D nr.c/ST0 2 Ap .G/
as desired.
Remark 16.3.19. Burns [3] has also formulated a conjecture which generalises many
refined Stark conjectures to the non-abelian situation. In particular, it implies Con-
jecture 16.3.15 (see [3, Prop. 3.5.1]). A further approach to non-abelian Brumer and
Brumer–Stark conjectures is due to Dejou and Roblot [13].
Remark 16.3.23. When we restrict to ideals whose classes in clL have p-power or-
der, we again obtain local conjectures BS.L=K; S; p/ for each prime p.
16.3.5 The weak Brumer and Brumer–Stark conjectures Since H.G/ always
contains the central conductor F .G/, we can state the following weaker versions
of Conjectures B.L=K; S / and BS.L=K; S /.
Conjectures of Brumer, Gross and Stark 377
Conjecture 16.3.24 (Bw .L=K; S /). Let S be a finite set of places of K containing
Sram [ S1 . Then, for each x 2 F .G/ we have
x AS S Ann .ZŒG/ .clL /:
Conjecture 16.3.25 (BSw .L=K; S /). Let S be a finite set of places of K containing
Sram [ S1 . Then for each x 2 F .G/ we have x !L S 2 .ZŒG/. Moreover, for
each non-zero fractional ideal a of L, there is an anti-unit D .x; a; S / 2 L such
that
ax!L S D ./
and, for each finite set T of primes of K such that Hyp.S [ S ; T / holds, there is an
T 2 EST such that
zıT .0/ D Tz!L
for each z 2 F .G/.
Remark 16.3.26. Suppose that Conjecture 16.3.8 holds. Then both x AS S and
x !L S lie in .ZŒG/ (for the latter use Proposition 16.3.14).
Lemma 16.3.28. Let S and S 0 be two finite sets of places of K such that S contains
Sram [ S1 . If S S 0 , one has
B.L=K; S / H) B.L=K; S 0 /
Bw .L=K; S / H) Bw .L=K; S 0 /
BS.L=K; S / H) BS.L=K; S 0 /
BSw .L=K; S / H) BSw .L=K; S 0 /:
Proof. We only give the proof in the case of Brumer’s conjecture; the other cases
followQsimilarly. So assume that B.L=K; S / holds. We have S 0 D nr./S , where
D v2S 0 nS .1 w 1 / 2 ZŒG. If x lies in H.G/, so does xQ WD x nr./. Hence
we see that xAS 0 S 0 xA
Q S S belongs to .ZŒG/ and annihilates clL .
The relation between the Brumer–Stark conjecture and Brumer’s conjecture is
slightly more subtle.
Proof. We give the proof for the strong conjectures. Let a be a non-zero fractional
ideal of L and let x 2 H.G/. Then ax!L S D ./ and ./zıT .0/ D .T /z!L for all
z 2 H.G/. Hence
T
axz!L S D ./zıT .0/ D .T /z!L : (16.3.3)
Since !L 2 .QG/ , we find N 2 N such that N !L 1 2 .ZŒG/. Moreover,
jGj .ZŒG/ F .G/ H.G/ such that we may choose z D jGj N !L 1 .
However, the group of fractional ideals has no Z-torsion such that equation (16.3.3)
implies that x ST belongs to .ZŒG/ (take a to be a totally decomposed prime) and
T
axS D .T /.
Proposition 16.3.33. Let S be a finite set of places of K containing Sram [S1 . Then,
SBS.L=K; S / implies BS.L=K; S /.
Remark 16.3.35. Replacing the denominator ideal H.G/ by the central conductor
F .G/ one can formulate a weaker variant SBSw .L=K; S / such that SBSw .L=K; S /
implies BSw .L=K; S / (we refrain from calling this the “weak strong Brumer–Stark
property” for obvious reasons).
Theorem 16.4.1. Let L=K be a Galois CM-extension of number fields with Galois
group G. Let M.G/ be a maximal order in QŒG containing ZŒG. Then, the mi-
nus part of the ETNC for the pair .h0 .Spec.L//; M.G// implies BSw .L=K; S / and
Bw .L=K; S / for all finite sets S of places of K containing S1 [ Sram .
There is also a prime-by-prime version of Theorem 16.4.1 (see [27, Thm. 4.1]).
Combined with [28, Cor. 2] this leads to the following unconditional result (see also
[27, Cor. 4.2]). We denote the maximal totally real subfield of L by LC and let L0 be
the Galois closure of L over Q. For a natural number n we let n be a primitive n-th
root of unity.
Remark 16.4.3. We stress that for a given extension L=K the hypotheses on p in
Theorem 16.4.2 are fulfilled by all primes that do not ramify in L0 . In particular, the
hypotheses are satisfied by all but finitely many primes.
Theorem 16.4.4. Let L=K be a Galois CM-extension of number fields with Galois
group G. Let p be an odd prime. Then, the minus p-part of the ETNC for the pair
.h0 .Spec.L//; ZŒG/ implies BS.L=K; S; p/ and B.L=K; S; p/ for all finite sets S
of places of K containing S1 [ Sram .
Remark 16.4.5. The proofs of Theorem 16.4.4 actually show that a refinement of
SBS.L=K; S; p/ holds and then use an argument similar to Proposition 16.3.33.
There are meanwhile quite a few cases where the ETNC has been verified for cer-
tain non-abelian extensions. Here we only mention the following result of Johnston
and the author [21, Thm. 4.6].
Theorem 16.4.6. Let L=Q be a Galois extension with Galois group G ' Aff.q/,
where q D `n is a prime power. Then, the ETNC for the pair .h0 .Spec.L//; M.G//
holds and the p-part of the ETNC for the pair .h0 .Spec.L//; ZŒG/ holds for every
prime p ¤ `.
Remark 16.4.8. The `-part of the ETNC in the situation of Theorem 16.4.6 is con-
sidered in recent work with Henri Johnston [24]. Suppose in addition that L is to-
tally real and Leopoldt’s conjecture holds for L at `. Then, the ETNC for the pair
.h0 .Spec.L//.1/; ZŒG/ holds. Moreover, the ETNC for the pair .h0 .Spec.L//; ZŒG/
holds if ` is at most tamely ramified (see [24, Cor. 10.6]). For the proof one has to
verify the ‘`-adic Stark conjecture at s D 1’ for L=Q which might be seen as an
analogue at s D 1 of Gross’ conjecture 16.4.15 below.
16.4.2 p-adic Artin L-functions Let p be an odd prime and let K be a totally real
field. Let L=K be a Galois extension of K such that L is totally real and contains the
cyclotomic Z p -extension K1 of K and ŒL W K1 is finite. We put G WD Gal.L=K/
and K WD Gal.K1 =K/ ' Z p such that G ' H Ì , where H WD Gal.L=K1/ and
Conjectures of Brumer, Gross and Stark 381
defined by ./ D cyc ./ for any 2 Gal.L.p /=K/ and any p-power root of
unity . Let ! and denote the composition of cyc with the projections onto the
first and second factors of the canonical decomposition Z p D Q p .1 C pZ p /,
respectively; thus ! is the Teichmüller character. We note that factors through K
and put u WD .
K /.
Fix a character 2 IrrCp .G / and let S be a finite set of places of K containing all
archimedean places and all places that ramify in L=K. Note that S in particular con-
tains the set Sp of all p-adic places. Each topological generator
K of K permits the
definition of a power series G ;S .T / 2 Q cp ˝Q p Quot.Z p ŒŒT / by starting out from
the Deligne–Ribet power series for one-dimensional characters of open subgroups of
G (see [8, 14, 1]) and then extending to the general case by using Brauer induction
(see [15]). One then has an equality
s
G ;S .u 1/
Lp;S .1 s; / D ;
H .us 1/
16.4.3 The interpolation property and Gross’ conjecture Let p be an odd prime
and choose a field isomorphism W C ' Cp . For a character 2 IrrCp .G / we put
WD 1 ı 2 IrrC .G /. If is a linear character and r > 1 is an integer then for
every choice of field isomorphism W C ' Cp one has the interpolation property
r
Lp;S .1 r; / D .LS .1 r; . ! / // : (16.4.1)
This can be extended to characters of arbitrary degree provided that r > 2 (see [15,
§4]). However, the argument fails in the case r D 1. Nevertheless, it seems plausible
to conjecture the following.
Remark 16.4.10. As both sides in (16.4.9) are well-behaved with respect to direct
sum, inflation and induction of characters, it is easy to see that Conjecture 16.4.9
holds when is a monomial character (also see the discussion in [18, §2]).
does in fact not depend on the choice of . The first part of Gross’ conjecture [18]
concerns the order of vanishing of p-adic Artin L-functions and asserts the following.
Proof. For being a linear character, this has been proved by Spiess [35] using
Shintani cocycles (his approach actually allows p to be equal to 2). The general case
has recently been settled by Burns [4, Thm. 3.1].
The rest of this subsection is mainly devoted to (the second part of) Gross’ con-
jecture. This may be skipped by the reader who is only interested in the Brumer and
Brumer–Stark conjectures.
Fix a character 2 IrrCp .G / and choose a Galois CM-extension L over K such
that ! 1 factors through G WD Gal.L=K/. We denote the kernel of the natural
augmentation map ZŒS.L/ ! Z that maps each w 2 S.L/ to 1 by XL;S . The usual
Dirichlet map
here, Lab
w denotes the maximal abelian extension of Lw , the first arrow is the natural
inclusion, the second arrow is the reciprocity map of local class field theory and the
last map is the p-adic cyclotomic character. We define a homomorphism of Z p ŒG-
modules
p;L;S W Z p ˝ EL;S ! Z p ˝ XL;S
X
1 ˝ 7! logkkw;p w:
w2S.L/
1
Now, choose a field isomorphism W C ' Cp . Then, L;S and p;L;S induce an
endomorphism
1
.Cp ˝Z p p;L;S / ı .Cp ˝ L;S / W Cp ˝ XL;S ! Cp ˝ XL;S :
We define a p-adic regulator
./ 1
Rp;S . / WD detCp ..Cp ˝Z p p;L;S /ı.Cp ˝ L;S / j HomCp ŒG .V ! 1 ; Cp ˝XL;S //:
The following recent result due to Dasgupta, Kakde and Ventullo [12] generalises
the approach developed in [11].
Corollary 16.4.18. Suppose that Conjecture 16.4.11 holds for all 2 IrrCp .G /.
Then, Conjecture 16.4.15 is also true for all 2 IrrCp .G /.
16.4.4 Conditional Results We can now state the following result which has been
proved by Johnston and the author [23, Thm. 5.2 and Cor. 5.4]. We refer to the
‘equivariant Iwasawa main conjecture’ (EIMC) for totally real fields (see [25] or [33],
for instance).
Theorem 16.4.19. Let L=K be a Galois CM-extension with Galois group G and let
p be an odd prime. Suppose that the EIMC for the extension L.p /C
1 =K holds. Sup-
pose further that Conjecture 16.4.9 holds for all irreducible characters of
Gal.L.p /C C
1 =K/ which factor through Gal.L =K/. Then, SBS.L=K; S; p/ and
thus BS.L=K; S; p/ and B.L=K; S; p/ are true for every finite set S of places of K
containing Sp [ Sram [ S1 .
Remark 16.4.20. We write p .F / for the p-adic -invariant attached to the cy-
clotomic Z p -extension of a number field F (see [22, Rem. 4.3] for details). When
p .L.p /C / vanishes, then the EIMC has been proved independently by Kakde [25]
and Ritter and Weiss [33]. Without assuming the vanishing of -invariants consider-
able progress has been made in [22]. This includes the case p − jGj.
Remark 16.4.21. Suppose that L=K is abelian. Then, Conjecture 16.4.9 holds by
(16.4.1). Under the somewhat stronger condition that vanishes, Theorem 16.4.19
has been shown by Greither and Popescu [17] by an entirely different method. This
method has been generalised to arbitrary Galois CM-extensions by the author in [29].
In order to get rid of the p-adic places one has to assume the full strength of
Gross’ conjecture.
Theorem 16.4.22. Let L=K be a Galois CM-extension with Galois group G and
let p be an odd prime. Suppose that p .LC / vanishes or that p − jGj. Suppose
further that Gross’ Conjecture 16.4.11 holds for all 2 IrrCp .Gal.LC
1 =K//. Then,
the minus p-part of the ETNC for the pair h0 .Spec.L/; ZŒG/ holds. In particular,
both BS.L=K; S; p/ and B.L=K; S; p/ are true for all finite sets S of places of K
containing Sram [ S1 .
Proof. This is [4, Cors. 3.8 and 3.11]. Note that the last part follows from Theorem
16.4.4.
16.4.5 Unconditional results We now discuss certain cases where the Brumer–
Stark conjecture holds unconditionally. Let p be aP
prime and let G be a finite group.
For a normal subgroup N E G, let eN D jN j 1 2N be the associated central
trace idempotent in the group algebra Q p ŒG.
Definition 16.4.23. Let N E G. We say that the p-adic group ring Z p ŒG is N -
hybrid if (i) eN 2 Z p ŒG (i.e. p − jN j) and (ii) Z p ŒG.1 eN / is a maximal
Z p -order in Q p ŒG.1 eN /.
Conjectures of Brumer, Gross and Stark 385
Theorem 16.4.24. Let L=K be a finite Galois CM-extension of number fields. Let N
be a normal subgroup of G WD Gal.LC =K/ and let F D .LC /N . Let p be an odd
prime and let P be a Sylow p-subgroup of G WD Gal.F=K/ ' G=N . Suppose that
Z p ŒG is N -hybrid, G is monomial, and F P =Q is abelian. Let S be a finite set of
places of K such that Sp [ Sram .L=K/ [ S1 S . Then, both BS.L=K; S; p/ and
B.L=K; S; p/ are true.
Proof. It follows from the theory of hybrid Iwasawa algebras [22] that the relevant
case of the EIMC holds. We also recall that Conjecture 16.4.9 holds for monomial
characters. Then the result follows from Theorem 16.4.19. See [23, Thm. 10.5] for
details.
We recall that a Frobenius group is a finite group G with a proper nontrivial
subgroup V such that V \ gVg 1 D f1g for all g 2 G V , in which case V is called
a Frobenius complement. A Frobenius group G contains a unique normal subgroup
U , known as the Frobenius kernel, such that G is a semidirect product U Ì V .
Corollary 16.4.25. Let L=K be a finite Galois CM-extension of number fields and
let G D Gal.LC =K/. Suppose that G D U Ì V is a Frobenius group with Frobenius
kernel U and abelian Frobenius complement V . Suppose further that .LC /U =Q is
abelian (in particular, this is the case when K D Q). Let p be an odd prime and let
S be a finite set of places of K such that Sp [ Sram .L=K/ [ S1 S . Suppose that
either p − jU j or U is a p-group (in particular, this is the case if U is an `-group
for any prime `.) Then, both BS.L=K; S; p/ and B.L=K; S; p/ are true.
Proof. This is [23, Cor. 10.7]. We recall the proof for convenience. First note that
G is monomial by [23, Lemma 9.7] since V is abelian. Suppose that p − jU j. Let
N D U and F D .LC /N . Then Z p ŒG is N -hybrid by [23, Prop. 9.4]. Hence, the
desired result follows from Theorem 16.4.24 in this case since F=Q is abelian, which
forces F P =Q to be abelian. Suppose that U is a p-group. Taking N D f1g and
F D LC , we apply Theorem 16.4.24 with G D G and P D U to obtain the desired
result.
Theorem 16.4.27. Let L=Q be a finite Galois CM-extension of the rationals. Sup-
pose that Gal.L=Q/ ' hj i G, where G D Gal.LC =Q/ D N Ì V is a Frobe-
nius group with Frobenius kernel N and abelian Frobenius complement V . Suppose
further that N is an `-group for some prime `. Then, both BS.L=Q; S; p/ and
B.L=Q; S; p/ are true for every odd prime p and every finite set S of places of Q
such that Sram .L=Q/ [ S1 S .
Theorem 16.4.28. Let ` be an odd prime. Let L=K be a Galois CM-extension with
Galois group G ' D4` . Let p be a prime and suppose that p does not split in Q.` /.
Then, BS.L=K; S; p/ and B.L=K; S; p/ both hold for every finite set S of places of
K such that Sram [ S1 S .
References
[1] D. Barsky, Fonctions zeta p-adiques d’une classe de rayon des corps de nombres totalement
réels. In Groupe d’Etude d’Analyse Ultramétrique (5e année: 1977/78) (Y. Amice, D. Barsky,
and P. Robba, eds.), Secrétariat Math., Paris, 1978, Exp. No. 16, pp. 1–23.
[2] D. Burns, Equivariant Tamagawa numbers and Galois module theory. I, Compos. Math. 129
(2001), 203–237.
[3] D. Burns, On derivatives of Artin L-series, Invent. Math. 186 (2011), 291–371.
[4] D. Burns, On derivatives of p-adic L-series at s D 0, J. Reine Angew. Math. (Crelle) (to
appear).
[5] D. Burns and M. Breuning, Leading terms of Artin L-functions at s D 0 and s D 1, Compos.
Math. 143 (2007), 1427–1464.
[6] D. Burns and M. Flach, Tamagawa numbers for motives with (non-commutative) coefficients,
Doc. Math. 6 (2001), 501–570.
[7] D. Burns and M. Flach, Tamagawa numbers for motives with (non-commutative) coefficients.
II, Amer. J. Math. 125 (2003), 475–512.
[8] P. Cassou-Nogues, Valeurs aux entiers négatifs des fonctions zêta et fonctions zêta p-adiques,
Invent. Math. 51 (1979), 29–59.
[9] T. Chinburg, On the Galois structure of algebraic integers and S-units, Invent. Math. 74
(1983), 321–349.
[10] C.W. Curtis and I. Reiner, Methods of Representation Theory with Applications to Finite
Groups and Orders, Vol. 1, Wiley, New York, 1981.
[11] S. Dasgupta, H. Darmon and R. Pollack, Hilbert modular forms and the Gross–Stark conjec-
ture, Ann. of Math. 174 (2011), 439–484.
[12] S. Dasgupta, M. Kakde and K. Ventullo, On the Gross–Stark conjecture, Ann. of Math. 188
(2018), 833–870.
Conjectures of Brumer, Gross and Stark 387
[13] G. Dejou and X.-F. Roblot, A Brumer–Stark conjecture for non-abelian extensions, J. Number
Th. 142 (2014), 51–88.
[14] P. Deligne and K.A. Ribet, Values of abelian L-functions at negative integers over totally real
fields, Invent. Math. 59 (1980), 227–286.
[15] R. Greenberg, On p-adic Artin L-functions, Nagoya Math. J. 89 (1983), 77–87.
[16] C. Greither, Determining Fitting ideals of minus class groups via the equivariant Tamagawa
number conjecture, Compos. Math. 143 (2007), 1399–1426.
[17] C. Greither and C.D. Popescu, An equivariant main conjecture in Iwasawa theory and appli-
cations, J. Algebraic Geom. 24 (2015), 629–692.
[18] B.H. Gross, p-adic L-series at s D 0, J. Fac. Sci. Univ. Tokyo Sect. IA Math. 28 (1981),
979–994.
[19] K.W. Gruenberg, J. Ritter and A. Weiss, A local approach to Chinburg’s root number conjec-
ture, Proc. London Math. Soc. 79 (1999), 47–80.
[20] H. Johnston and A. Nickel, Noncommutative Fitting invariants and improved annihilation
results, J. Lond. Math. Soc. 88 (2013), 137–160.
[21] H. Johnston and A. Nickel, On the equivariant Tamagawa number conjecture for Tate motives
and unconditional annihilation results, Trans. Amer. Math. Soc. 368 (2016), 6539–6574.
[22] H. Johnston and A. Nickel, Hybrid Iwasawa algebras and the equivariant Iwasawa main con-
jecture, Amer. J. Math. 140 (2018), 245–276.
[23] H. Johnston and A. Nickel, On the non-abelian Brumer–Stark conjecture and the equivariant
Isawana main conjecture, Math. Z. (to appear).
[24] H. Johnston and A. Nickel, On the p-adic Stark conjecture at s D 1 and applications, preprint,
arXiv:1703.06803.
[25] M. Kakde, The main conjecture of Iwasawa theory for totally real fields, Invent. Math. 193
(2013), 539–626.
[26] A. Nickel, Non-commutative Fitting invariants and annihilation of class groups, J. Algebra
323 (2010), 2756–2778
[27] A. Nickel, On non-abelian Stark-type conjectures, Ann. Inst. Fourier 61 (2011), 2577–2608.
[28] A. Nickel, On the equivariant Tamagawa number conjecture in tame CM-extensions, Math. Z.
268 (2011), 1–35.
[29] A. Nickel, Equivariant Iwasawa theory and non-abelian Stark-type conjectures, Proc. Lond.
Math. Soc. 106 (2013), 1223–1247.
[30] A. Nickel, Integrality of Stickelberger elements and the equivariant Tamagawa number con-
jecture, J. Reine Angew. Math. (Crelle) 719 (2016), 101–132.
[31] J. Nomura, On non-abelian Brumer and Brumer–Stark conjecture for monomial CM-
extensions, Int. J. Number Th. 10 (2014), 817–848.
[32] J. Nomura, The 2-part of the non-abelian Brumer–Stark conjecture for extensions with group
D4p and numerical examples of the conjecture. In Algebraic Number Theory and Related
Topics 2012 (A. Shiho, T. Ochiai and N. Otsubo, eds.), Res. Inst. Math. Sci. (RIMS), Kyoto,
2014, 33–53.
[33] J. Ritter and A. Weiss, On the “main conjecture” of equivariant Iwasawa theory, J. Amer.
Math. Soc. 24 (2011), 1015–1050.
388 A. Nickel
[34] C.L. Siegel, Über die Fourierschen Koeffizienten von Modulformen, Nachr. Akad. Wiss. Göt-
tingen Math.-Phys. Kl. II 1970 (1970), 15–56.
[35] M. Spiess, Shintani cocycles and the order of vanishing of p-adic Hecke L-series at s D 0,
Math. Ann. 359 (2014), 239–265.
[36] L. Stickelberger, Ueber eine Verallgemeinerung der Kreistheilung, Math. Ann. 37 (1890),
321–367.
[37] J. Tate, Les conjectures de Stark sur les fonctions L d’Artin en s D 0, Lecture notes edited
by D. Bernardi and N. Schappacher, Progress in Mathematics 47, Birkhäuser Boston, Boston,
1984.
[38] M. Weinstein (ed.), Between Nilpotent and Solvable, Polygonal Publ. House, Passaic, 1982.
Chapter 17
Displays and p-divisible groups
T. Zink
The theory of displays is a Dieudonné theory for formal p-divisible groups which
is an equivalence of categories over an arbitrary p-adic ring. Over a more restricted
class of rings one obtains a classification of all p-divisible groups. We explain ba-
sic ideas and some recent results of this theory. The last paragraph ameliorates the
discussion of isogenies of displays found in the literature.1
17.1 Introduction
Let p be a fixed prime number. Let A be an abelian variety over a field k of charac-
tristic p. The formal group of A is a very interesting and subtle invariant of A. This
is in contrast to the case of characteristic 0 where the formal group is determined by
dim A. Dieudonné gave a classification of the formal groups over k entirely in terms
of linear algebra. This classification was extended to cases where k is replaced by a
p-adic ring R. In the generalisation one has to replace the formal group of A by a
finer invariant, the p-divisible group. To avoid confusion, it must be said that this is
not an abstract group but an inductive limit of commutative finite flat group schemes.
Many of the most important results in arithmetic proved in the last years depend
on a profound konwledge of p-divisible groups. Ever since the introduction of this
notion by J. Tate in 1967, very surprising results on these groups were proved contin-
uously. Some of the main contributors are A. Grothendieck, W. Messing, B. Mazur,
P. Berthelot, J.-M. Fontaine, Ch. Breuil, M. Kisin, F. Oort, A.J. de Jong, A. Vasiu,
L. Fargues, P. Scholze, J. Weinstein.
The classification of p-divisible groups is by objects of linear algebra over a p-
adic ring R which are called displays. In the case of a perfect field R, they coincide
with the classical Dieudonné modules. The notion was developed to study the bad
reduction of Shimura varieties. Some important recent results on p-divisible groups
have been explained in terms of displays.
We review here some linear algebra of displays which is buried in the literature
and we add some supplements. This is essential in order to apply the main theorems
of the classification. This article complements the overview [16]. We mention the
duality theory for displays only briefly. For a concise review of these results we refer
to [5], Chapter 3. In the papers [7] and [8] it is proved that the crystalline cohomology
1 Project B5
390 T. Zink
17.2 Frames
We make the following conventions. A ring will be a commutative ring with unit
element unless otherwise stated. Let S be a ring and let W S ! S be a ring
endomorphism. If M is an S -module, we write
M ./ D S ˝;S M;
where the right S -module structure on S is modified by restriction of scalars via . If
W M ! N is a -linear homomorphism, we define its linearisation ] W M ./ !
N by ] .s ˝ m/ D s.m/. We call a -linear epimorphism if ' ] is surjective, and
we call a -linear isomorphism if ] is bijective.
Frames were introduced in [21] and [11]. Let F=Qp be a finite field extension
and O D OF its ring of integers. We fix a prime element 2 O. Let D O=O be
the residue class field and q D pf D ].
Lemma 17.2.2. For each O-frame F , there is a unique element 2 S such that
.a/ D .a/
P for a 2 I:
We have 2 I C S and in particular belongs to the radical of S .
Proof. The uniqueness is immediate from condition (iii). Moreover, by (iii) we may
write: X
1 D si P .ai / (17.2.1)
i
for suitable elements si 2 S and ai 2 I . Then, the element
X
D si .ai /
i
P ! F 0 D .S 0 ; I 0 ; R0 ; 0 ; P 0 /
˛ W F D .S; I; R; ; /
P 0 .˛.a// D ˛..a//
P for a 2 I:
as required.
Example 17.2.4. Let R be a p-adic ring. We denote by W .R/ the ring p-typical
Witt vectors [2]. It is endowed with the Frobenius endomorphism F and with the
Verschiebung V . The ring W .R/ is again a p-adic ring [20]. We define a Z p -frame
where W W .R/ ! W .R/ is the Frobenius endomorphism ./ D F, and P .V/ D
, for 2 W .R/. We call this the Witt frame.
Example 17.2.5. Let R be an O-algebra. Then the Witt vectors WO .R/ are defined
[1]. As a set, WO .R/ Š RN . There is a unique O-algebra structure which is functorial
in R and such that the maps wn W WO .R/ ! R given by
n n 1 n 2
wn .x0 ; x1 ; x2 ; : : :/ D x0q C x1q C 2 x2q C : : : C n 1 q
xn 1 C n xn
where D F , and P D V 1 as before. We call this the small Witt frame relative
to O. In the case O D Z p we have WO .R/ D W .R/ and we find the Z p -frame of
Example 17.2.4.
WO .a/ Š aN : (17.2.2)
Displays and p-divisible groups 393
The image of a Witt vector on the right hand side is called its logarithmic coordi-
nates Œu0 ; u1 ; : : :, ui 2 a. We have
F V
Œu0 ; u1 ; u2 ; : : : D Œu1 ; u2 ; u3 ; : : :; Œu0 ; u1 ; u2 ; : : : D Œ0; u0 ; u1 ; : : ::
P W JO .S / ! WO .S / (17.2.4)
by setting
P 0 ; u1 ; u2 ; : : :/ D Œu1 ; u2 ; u3 ; : : :;
.Œu ui 2 a:
With respect to the maps (17.2.3), (17.2.4) we obtain a O-frame
17.3 Displays
Definition 17.3.1. Let F be a O-frame. An F -display (or window) P D .P; Q;
F; FP / consists of the following data:
A finitely generated projective S -module P , a submodule Q P , and two -
linear maps
F W P ! P; FP W Q ! P:
The following conditions are required.
(i) IP Q:
(ii) The factor module P =Q is a finitely generated projective R-module.
394 T. Zink
FP .ax/ D .a/F
P x:
We call rankWO .R/ P the height of P and rankR P =Q the dimension of P . Mor-
phisms of F -displays are defined in the obvious way.
Remarks: Let 2 S be the element in Lemma 1. Then, we have
It follows from the axioms that there exist S -submodules L and T of P such that
P D T ˚ L; Q D IT ˚ L (17.3.1)
Indeed, choose a finitely generated projective S -module T which lifts P =Q. Then,
we find an S -module homomorphism which makes the following diagram commuta-
tive,
P❉ /T
❉❉
❉❉
❉❉
❉"
P =Q
The homomorphism P ! T is surjective by the lemma of Nakayama. Let L be the
kernel. Taking a splitting of the exact sequence
0 !L !P !T ! 0;
F ˚ FP W T ˚ L ! P
.F ˚ FP /] W .T ˚ L/./ ! P: (17.3.2)
Displays and p-divisible groups 395
Since we have projective modules of the same rank on both sides, it is enough to show
that (17.3.2) is surjective. By the axioms, the elements of the form
FP .at C l/; for a 2 I; t 2 T; l 2 L
generate the S -module P . Since FP .at C l/ D .a/F
P .t/ C FP .l/, the surjectivity of
(17.3.2) follows.
Assume conversely that we are given a finitely generated projective
S -module P , a decomposition P D T ˚ L, and an isomorphism
ˆ W P ./ ! P:
We may write !! !
t0 A B t0
ˆ D : (17.3.3)
l0 C D l0
Here, A W T ./ ! T , B W L./ ! T , C W T ./ ! L, B W L./ ! L are
S -linear homomorphisms and t 0 2 T ./ , l 0 2 L./ .
We use the notation and P also for the following maps:
WT ! S ˝;S T;
t 7 ! 1 ˝ t;
where T can be any S -module, and
P W I ˝S T ! S ˝;S T;
a ˝ t 7 ! P .a/ ˝ t:
The F -display obtained from ˆ is then defined as follows: We set
Q D I T ˚ L; P D T ˚ L:
Let t 2 T , l 2 L and y 2 I T D I ˝S T be elements. We define the maps F and FP
as follows: !!
P y A B P .y/
F D ;
l C D .l/
!! (17.3.4)
t A B .t/
F D :
l C D .l/
Let T be a free module with basis e1 ; : : : ; ed and L a free module with basis ed C1 ; : : :
ed Cc , then, T ./ has the basis 1 ˝ e1 ; : : : ; 1 ˝ ed and L./ has the basis 1 ˝
ed C1 ; : : : ; 1 ˝ ed Cc . Now the linear map A W T ./ ! T may be simply regarded as
a d d -matrix with coefficients in S . In this sense, we may regard
A B
(17.3.5)
C D
as an element of GLd Cc .S /.
396 T. Zink
F 0 .s 0 ˝ x/ D 0 .s 0 / ˝ F .x/; s 0 2 S 0 ; x 2 P;
FP .s ˝ y/ D .s / ˝ FP .y/; FP .a ˝ x/ D P .a / ˝ F .x/; a0 2 I 0 ; y 2 Q:
0 0 0 0 0 0 0 0
P D T ˚ L; P 0 D T 0 ˚ L0 :
The matrix (17.3.8) defines a homomorphism iff the following diagram is commuta-
tive
˛
IT ˚ L ! I T 0 ˚ L0
? ?
? ? P0
FP y yF
˛
T ˚L ! T 0 ˚ L0 :
Displays and p-divisible groups 397
This diagram expresses the condition that ˛ commutes with FP . But then ˛ also
commutes with F because of the equation FP .ax/ D .a/FP .x/ for a 2 I , x 2 P and
the axiom that the elements P .a/ generate the unit ideal.
.p-div=R/ ! .W .R/-displays/
This functor commutes with base change with respect to a ring homomorphism R !
R0 which is defined for p-divisible groups and for displays with respect to the mor-
phism of frames W .R/ ! W .R0 / (17.3.6).
O Z .R/-displays/:
BT W .p-div=R/ ! .W p
The functor BT has an explicit description and commutes with the duality theory
which exists for both categories [10].
Let R be a complete regular local ring with perfect residue class field k of char-
acteristic p > 0. We consider a ring homomorphism
Sd WD W .k/ ŒT1 ; : : : ; Td ! R; (17.4.1)
Theorem 17.4.4 ([11]). Let p > 3. Then, the category of p-divisible groups over
the complete regular local ring R is equivalent to the category of B -displays.
This theorem was conjectured by Breuil in the case dim R D 1. The conjecture
was proved by Kisin [4]. It was proved by Vasiu and Zink [18] if E is an Eisenstein
polynomial. The theorem implies a classification of finite flat group schemes of order
a power of p over R in the same way as explained after Lemma 17.6.5.
Let X be a smooth and projective scheme over a ring R. One may ask whether
i
there is a display structure on the crystalline cohomology Hcris .X=W .R//. If X=R
is an abelian variety, one can use Theorem 17.4.1 to define a display structure on
1
Hcris .X=W .R//. There is a category of higher W .R/-displays [7, 14]. It is an exact
tensor category which contains the W .R/-displays as a full subcategory. Examples of
i
schemes X such that Hcris .X=R/ admits a display structure are given in [7, 8, 3, 14].
This is for instance the case if X is a smooth complete intersection in P m
R relative to R.
The display structure sometimes determines the deformation theory of X completely
[8]. The theory of higher displays is based on the relative de Rham–Witt complex [6].
such that
V ] .FP .y// D 1 ˝ y; V ] .F x/ D ˝ x; y 2 Q; x 2 P:
which is given by the image of the matrix (17.5.1) under the morphism S ! R=R,
AM0 BM 0
:
CM 0 DM 0
The first row of this matrix is zero. Therefore, the nilpotency condition says that, for
large e,
. e 1 /
DM 0 ı : : : ı DM 0 ı DM 0 D 0:
./
(17.5.2)
Here, DM 0./ D R=R ˝Frob;R=R DM 0 . Assume that the S -modules T and L are free
and that we have chosen a basis in each of them. Then the matrix DM 0./ is obtained
from DM 0 by raising all entries to the p-th power. The condition is equivalent to saying
that the map
e 1
DM . / ı : : : ı DM ./ ı DM (17.5.3)
400 T. Zink
e
has image in .I C S /L. / . If p is nilpotent in R, we can iterate (17.5.3) until the
image is zero modulo I .
Therefore, for nilpotent in R, we may reformulate the nilpotency condition as
follows:
e
(NP) For large numbers e, the image of the map (17.5.3) is contained in IL. / .
Equivalently we can say:
(NP)’ For large numbers e, the map V e] of Definition 17.5.2 is zero modulo I .
The first matrix on the right hand side has coefficients in c. To solve this equation for
X; U; Y; Z is the same as solving the following equation for Y :
Y P /DM 1 D B :
A2 .Y (17.5.4)
A2 .A2 / e 1
.A2 / P e .Y / e 1
.DM 1 / : : : .DM 1 / DM 1 : (17.5.5)
By the nilpotency condition we may assume that M D e 2 .DM 1 /: : : .DM 1 / DM 1 has
coefficients in the ideal I . We set N D P e 1 .Y /. This is a matrix with coefficients
in c and is therefore annihilated by . We have
P / .M / D P .NM / D .N /.M
.N P / D 0:
But this shows that (17.5.5) is equal to 0, which means that U is nilpotent. We
remark that, in the case P .c/ D 0, Equation (17.5.4) is trivial. Therefore, we need no
nilpotency condition to solve it. This proves the last part of the proposition.
The following theorem was first proved in [20] for O D Z p and an additional
assumption on R. The additional assumptions were removed in [9].
17.6 Isogenies
Let F be a frame as above. Let ˛ W P ! P 0 be a morphism of F -displays of the
same height. We assume that P and P 0 are free S -modules. If we choose a basis in
each of these modules, then det ˛ 2 S is defined. Up to a unit, it is independent of
the choice of the basis.
P / D .Y /. Therefore,
The right hand side of (17.6.1) is .det ˛/ because .Y
the claim follows from (17.3.9).
det ˛ D u :
Proof. We begin with the case R D 0. We set D det ˛. By the last proposition,
we find
F
D for some 2 WO .R/: (17.6.2)
Vt
We write D where w0 ./ ¤ 0. We claim that .17:6:2/ implies
F Ft
D : (17.6.3)
D.x/ [ D.x q 1
e/ D Spec R
q 1
D.x/ \ D.x e/ D ¿:
For an arbitrary block matrix with entries in a ring of Witt vectors WOF .R/ of the
type above, we define F
s X Y
M˛ D :
FU FZ
We will denote by Ed the unit matrix of size d d . If is invertible in R, we have
the relation
s 1Ed 0 F 1Ed 0
M˛ D M˛ :
0 Ec 0 1 Ec
By reduction to a universal case, we conclude that
The first equation is clear because F is a ring homomorphism and the second equation
is a consequence of the first one.
We write FP W IR T ˚ L ! T ˚ L in matrix form
!! !
V
t A B t
FP D :
` C D F`
The quadratic block matrix on the right hand side will be denoted by ˆ. In the same
way, we define with respect to P 0 the matrix
0
A B0
ˆ0 D :
C 0 D0
406 T. Zink
M˛ ı NQ ˛ D NQ ˛ ı M˛ D 1 1 u Eh : (17.6.10)
Since all entries of the Witt vector WD u are nilpotent, the equation D F
implies that D 0. Therefore, the right hand side of (17.6.10) is u Eh . The proof of
the proposition is finished by the following lemma.
Displays and p-divisible groups 407
uN F N D N :
u F D :
0 ! G ! X 0 ! X 1 ! 0;
where X 0 and X 1 are formal p-divisible groups. One obtains a fully faithful functor
from the category of finite flat local group schemes to the derived category of bounded
complexes of formal p-divisible groups [13]. By Theorem 17.5.5, the last category
is equivalent to the derived category D b .nilDsp/ of bounded complexes of nilpotent
W .R/-displays. The resulting functor
References
[1] T. Ahsendorf, C. Cheng and T. Zink, O-displays and -divisible formal O-modules, J. Alge-
bra 457 (2016), 129–193.
[2] N. Bourbaki, Algébre Commutative, Chapitre 9, Masson 1983.
[3] O. Gregory and A. Langer, Higher displays arising from filtered de Rham–Witt complexes,
preprint, 2016.
[4] M. Kisin, Crystalline representations and F -crystals, Algebraic geometry and number theory,
Progr. Math. 253 (2006), 459–496.
[5] S. Kudla, M. Rapoport and T. Zink, On Cherednik uniformization, in preparation.
[6] A. Langer and T. Zink, De Rham–Witt cohomology for a proper and smooth morphism, J.
Inst. Math. Jussieu 3 (2) (2004), 231–314.
[7] A. Langer and T. Zink, De Rham–Witt cohomology and displays, Docum. Math. 12 (2007),
147–191.
408 T. Zink
[8] A. Langer and T. Zink, Grothendieck–Messing deformation theory for varieties of K3-type,
TJM 1 (2019), 455–517.
[9] E. Lau, Displays and formal p-divisible groups. Invent. Math. 171 (2008), 617–628.
[10] E. Lau, A duality theorem for Dieudonné displays, Annales ENS 4e série t.42 (2009), 241–
259.
[11] E. Lau, Frames and finite group schemes over complete regular local rings. Docum. Math. 15
(2010), 545–569.
[12] E. Lau, Smoothness of the truncated display functor, J. Amer. Math. Soc. 26 (2013), 129–165.
[13] E. Lau, Relations between Dieudonné displays and crystalline Dieudonné theory. Algebra
Number Theory 8 (2014), 2201–2262.
[14] E. Lau, Higher frames and G-displays, preprint, 2016.
[15] W. Messing, The Crystals Associated to Barsotti–Tate Groups, LNM 264, Springer, Berlin,
1972.
[16] W. Messing, Travaux de Zink. Séminaire Bourbaki 2005/2006, exp. 964, Astérisque 311
(2007), 341–364.
[17] M. Rapoport and T. Zink, On the Drinfeld moduli problem of p-divisible groups, Cambridge
J. Math. 5 (2017), 229–279.
[18] A. Vasiu and T. Zink, Breuil’s classification of p-divisible groups over regular rings of ar-
bitrary dimension. In Proceedings of Algebraic and Arithmetic Structures of Moduli Spaces
(I. Nakamura and L. Weng, eds.), Advanced Studies in Pure Mathematics 58 (2010), 461–479.
[19] A. Vasiu and T. Zink, Purity results for p-divisible groups and abelian schemes over regular
bases of mixed characteristic, Docum. Math. 15 (2010), 571–599.
[20] T. Zink, The display of a formal p-divisible group. Astérisque 278 (2002), 127–248.
[21] T. Zink, A Dieudonné theory for p-divisible groups, Class field theory — its centenary and
prospect, Adv. Stud. Pure Math 30 (2001), 139–160.
[22] T. Zink, De Jong–Oort purity for p-divisible groups. In Algebra, Arithmetic, Geometry
(Y. Tschinkel, and Y. Zarhin, eds.), Birkhäuser, Basel, 2009, 693–701.
List of contributors
ISBN 978-3-03719-197-2
www.ems-ph.org