Professional Documents
Culture Documents
150001 (2023)
www.papersinphysics.org
ISSN 1852-4249
In statistical mechanics, measuring the number of available states and their probabilities,
and thus the system’s entropy, enables the prediction of the macroscopic properties of
a physical system at equilibrium. This predictive capacity hinges on the knowledge of
the a priori probabilities of observing the states of the system, given by the Boltzmann
distribution. Unfortunately, the successes of equilibrium statistical mechanics are hard
to replicate out of equilibrium, where the a priori probabilities of observing states are,
in general, not known, precluding the naı̈ve application of common tools. In the last
decade, exciting developments have occurred that enable direct numerical estimation of
the entropy and density of states of athermal and non-equilibrium systems, thanks to
significant methodological advances in the computation of the volume of high-dimensional
basins of attraction. Here, we provide a detailed account of these methods, underscoring
the challenges present in such estimations, recent progress on the matter, and promising
directions for future work.
150001-1
Papers in Physics, vol. 15, art. 150001 (2023) / M Casiulis & S Martiniani
occur with equal probability (i.e., pC = 1/Ω with generic dynamical system. To each of these struc-
Ω the volume of the region of accessible states), tures, we can associate a basin of attraction, i.e.,
it reduces to the well-known Boltzmann entropy, the set of all initial conditions leading to the same
SB = log Ω + const. [117]. The entropy also allows structure via the dynamics. See Fig. 1 for an illus-
us to predict the direction of spontaneous thermo- tration.
dynamic transformations, as the entropy of an iso- In this perspective we show how we can make use
lated system can only increase with time: put sim- of these facts to arrive at a general protocol for es-
ply, heat flows from hot to cold [1, 118]. timating the entropy of systems out of equilibrium,
The state of affairs is not as simple far from equi- from granular to generic dynamical systems. The
librium. Let us consider an arbitrary system evolv- key observation is that while Shannon’s entropy,
ing according to some well-defined, but in general Eq. 2, provides a universal definition of entropy,
non-equilibrium, dynamics. In such a system, there we do not know the a priori probabilities, pi , of
is no guarantee that objects like temperature or observing a given state of the system, suggesting
energy can be defined, and there is generally no that its evaluation may not be possible. To the
clear prescription for the probability distribution contrary, we show that the problem of evaluating
of microscopic configurations. Is it still possible to these probabilities, pi , or alternatively the prob-
find analogies with equilibrium statistical mechan- lem of enumerating the number of possible states,
ics? To address this question, we must introduce Ω (e.g., if one wishes to compute a Boltzmann-like
two theoretical notions. entropy, SB = log Ω + const.), can be reduced to a
First, entropy can be universally defined as the tractable sampling problem. Indeed, as discussed
average amount of surprise (or uncertainty) in- at length in Sec. II, one can compute the a priori
herent to a random variable (rare observations probability of observing a given stable structure by
are more surprising, or informative, than common measuring the volume of its associated basin of at-
ones) [120]. Mathematically, Shannon [2] showed traction [119] – and it is therefore possible to esti-
that this amounts exactly to a rewriting of the mate the Shannon entropy for an arbitrary system.
Gibbs entropy (1), typically referred to as the Shan- In the following, we start by presenting the basin-
non entropy, sampling approach for entropy measurements in
Sec. II. Then, in Sec. III, we highlight its recent
Ω
X successes in the context of granular packings. Fi-
SS = − pi log pi , (2)
nally, in Sec. IV, we discuss exciting extensions of
i=1
this idea that we believe shall shape the future of
where pi is the probability of observing state i and computational non-equilibrium physics.
the sum is over all accessible states, this time for an
arbitrary statistical system with a generic (station-
ary) probability distribution. This information-
II Entropy from basins
theoretic interpretation of entropy can be used to
i Turning intractable counting into sam-
define, measure, and interpret entropies in any
pling
physical system, whatever its dynamics might be.
This is why there has been recent interest in direct While the probabilities of observing the states of
measurements of this quantity using various algo- a nonequilibrium system are in general not known
rithms, in no way restricted to the one we present a priori, it is a generic feature of dynamical sys-
here: for instance, the Shannon entropy of physi- tems that for a given initial condition they will
cal systems can be estimated using ideas from data end up trapped in small regions of their state
compression [3–7]. space, be they actual basins of a high-dimensional
Second, the states of interest of a system can energy landscape, or more general dynamical at-
often be described by the stable structures of its tractors. Assuming that there are Ω such basins,
dynamics, be they extrema of a high-dimensional one can readily compute a Boltzmann-like entropy,
function that can be reached by steepest descent SB = log Ω + const. In a thermodynamic system,
(e.g., energy minima of a potential energy land- it simply amounts to the entropy at zero temper-
scape), or the fixed points and limit cycles of a ature. If we restrict ourselves to a set of typical
150001-2
Papers in Physics, vol. 15, art. 150001 (2023) / M Casiulis & S Martiniani
150001-3
Papers in Physics, vol. 15, art. 150001 (2023) / M Casiulis & S Martiniani
0.4
2.0 2.0 2.0 2.0
0.2
Sd (r )/2 d
Sd (r )/2 d
Sd (r )/2 d
Sd (r )/2 d
w (r )
w (r )
w (r )
w (r )
1.5 1.5 1.5 1.5
0.8
0.6
1
r r r r
Figure 2: High-dimensional volumes. √ Top: 2d sketches of hypercubes in d = 2, 3, 4, and 5. The outer black line
links vertices located at a distance d from the center of the cube to midpoints located on the unit hypersphere
with hyperbolic lines. The inside of each shape is colored with a density plot of the relative contribution w(r)
of each radial shell inside the cube to the full volume, as obtained from Monte Carlo integration using 106
points uniformly drawn inside each cube. Bottom: Histograms of relative weights of radial shells w(r) for the
hypercube (orange) and for the hyperball (purple). The lines highlight how large the intersection of the cube
and a hyperspherical shell is at the distance that contains most of the cube’s volume. As reminded in the insets,
this intersection here represents 1, 1, 0.78, and 0.62 times the surface of the full hyperspherical shell, respectively.
Rout [127]. In other words, in large d, one corner To complete this picture, in the second row of
of a hypercube has a vanishing volume over surface Fig. 2, we plot in orange the distribution of mass
ratio, a property shared with hyperbolic objects, of hypercubes with a = 1 along the radial direc-
like Gabriel’s horn (also called Torricelli’s trumpet) tion, w(r), and in purple the corresponding line for
in 3d [129]. the hyperball, which is simply the√surface area Sd
This is an illustration of the fact that, in high of the (d − 1)-sphere with radius da, normalised
dimensions, even simple compact objects present like w(r), i.e. by the volume of the cube, (2a)d . As
tendril-, or tentacle-like regions [10, 15, 16, 18, 119] the dimensionality of space increases, the volume
that extend very far but get very thin. Yet, some- of the hypercube concentrates deeper and deeper
what counter-intuitively, most of the volume of into disconnected tendril-like objects, at distances
high-dimensional objects is contained in these ex- such that the intersection between the hypercube
tended objects! This is shown in Fig. 2 by the den- and the hypersphere becomes very small: in Fig. 2,
sity plot inside each shape, that represents the ac- as highlighted by lines and inset texts, the ratio
tual weight of each hyperspherical shell within each between the value of the orange curve at its maxi-
hypercube. While in d = 2, the shell that con- mum and the value of the purple curve at the same
tributes the most to the volume is the unit circle, distance decreases with d. Furthermore, the con-
as the dimensionality increases, it shifts p to larger tribution of the unit ball to the total volume of
and larger radial distances, at Rmax ∼ d/3 in
the limit of large d.1 The 2
P 2squared distance of this point from the center, r =
xi is a sum of independent, identically drawn uniform
1 Thiscan be shown by a simple statistical argument: random variables so that, per the central limit theorem,
suppose that we uniformly draw a random vector within its distribution tends to a Gaussian with mean
√ value
p µ =
a d−dimensional hypercube, with coordinates (x1 , . . . , xd ). d⟨x2 ⟩ = d/3, which asymptotically yields E[ r2 ] ∼ d/3.
150001-4
Papers in Physics, vol. 15, art. 150001 (2023) / M Casiulis & S Martiniani
d /3
These last two properties make practical volume
estimations extremely complicated: a naı̈ve Monte 1.2
Carlo integration using uniformly sampled points in
R=
a hyperball is bound to fail since the vast majority 1.0
of points will fall outside of the shape of interest,
V (R )/V
while an integration within a ball close to the center 0.8
will yield only a tiny fraction of the overall volume, d
an example of the so-called curse of dimensional- 0.6 2
ity [10, 16, 18]. 4
To more concretely illustrate how naı̈ve random 0.4 8
sampling methods fail at estimating volumes in
16
high dimensions, let us consider the volume estima- 0.2
tion problem for a simple hypercube, using Monte
32
Carlo integration. We sample Ns points uniformly 64
0.0
drawn in a hyperball, B(R), with radius R cen- 0.0 0.2 0.4 0.6 0.8 1.0
1/2
tered at the same point as the hypercube, measure R /d
the fraction fM C of points that fall within the hy-
percube, and compute the corresponding volume,
V̂ (R) = fM C VB (R). In general, one does not nec- Figure 3: Naı̈ve random sampling shortcomings. Direct
essarily know the largest linear size of the object Monte Carlo evaluation V̂ (R) of the volume of a hy-
of interest, so R should, in principle, be varied. percube with actual volume V , using Ns = 105 points
We use this strategy to estimate the volume of hy- uniformly drawn from a hyperball with radius R, for di-
percubes with a = 1 by varying the ball radius, mensions going from 2 to 64. Here the half-sidelength
R, between 0 and√the length of the longest diago- of the hypercube is a = 1. The dashed line indicates
nal of the cube, d. In Fig. 3, we plot the mea- the asymptotic
p mode of the mass distribution of the
cube, d/3.
sured volume divided by the true √ volume of the
hypercube, V = 2d , against R/ d, for d between
2 (mauve) and 64 (red), for Ns = 105 . At low di- simple example here, in d = 64, the best estimate
mensions of space, this√method converges smoothly with our choice of Ns would be roughly 50% off!
to VM C = V as R → d. As d increases, the vol- This shows that simple Monte Carlo integration,
ume concentrates more and more aroundpthe mode while efficient in small dimensions, fails at captur-
of the distribution of mass of the cube, d/3, and ing volumes in high dimensions. Note that the case
as a result the curves become step-like. However, illustrated here is actually fairly ideal: we know
as d increases, the measurement also √ becomes less the location of the center of the hypercube, which
reliable upon approaching R → d: it first dis- makes the measurement easier, and the hypercube
plays increasingly large fluctuations (d ≤ 16) then is a rather regular object. In real-world situations,
violently falls to 0 (d ≥ 32). The reason is that a Monte Carlo integration of high-dimensional vol-
most points in the ball are actually sampled at radii umes is likely to be far worse.
such that the hypercube is already made of narrow
spikes, as illustrated in Fig. 2. Indeed, the ratio iii Free energy methods for volume compu-
of the volume of the√hypercube to that of the hy- tations
perball with radius √ d decays
√ exponentially with
dimension, V /VB ( d) ∼ d exp(−d). It has been shown that volume estimations and enu-
If we did not know that we were measuring a sim- meration problems can be treated in a way that
ple cube, and ball-picked from spheres of increas- amounts to a free energy calculation in the spirit
ing radius to produce the same kind of curve as in of the Frenkel-Ladd method [11]. The idea is that
Fig. 3, in high dimension our estimate would be far the volume v of a domain Γ can be rewritten as
off as the mode of the mass distribution of the cube
Z Z
moves further and further into the corners. In the v= drOΓ (r) = dre−βUΓ (r) , (4)
Rd Rd
150001-5
Papers in Physics, vol. 15, art. 150001 (2023) / M Casiulis & S Martiniani
r0 r0 r0 r0
Figure 4: Probing basin volumes with random walks. Different random walkers are linked by springs with different
rigidities to the configuration at the minimum. The first three panels represent the probability density of the
trajectories of such walkers, with rigidities growing from left to right. At very small (possibly negative) rigidities,
the walker explores the outer rim of the basins, following tentacle-like structures, while at large rigidities the
random walk typically explores a hypersphere around the minimum. Putting together and properly reweighing
the information sampled by these walkers, one can faithfully recover the full volume as if it had been uniformly
sampled (last panel).
where OΓ is the characteristic function of the do- a control parameter that allows us to go continu-
main to integrate over, or “oracle”, which can be ously from an integral of unknown volume to one
exponentiated to yield the potential UΓ which is 0 of known volume. For instance, one can tether ran-
inside the domain and ∞ outside it. Written in this dom walkers to a reference point, r0 , inside the
form, the volume can be interpreted as the parti- basin of attraction (e.g., the minimum energy con-
tion function of a free Brownian walker exploring figuration) using harmonic springs of varying stiff-
the domain Γ, with a hard wall at the boundary, at ness k, see Fig. 4. For a random walk constrained
an inverse temperature β. to remain within the domain of interest Γ, one can
The most popular and earliest class of meth- compute a volume vk weighted by the Boltzmann
ods for the computation of partition functions is factor (viz., the corresponding partition function)
based on thermodynamic integration (TI) [21, 22, as Z
1 2
121], which consists of parameterizing the system’s vk = dre− 2 k|r−r0 | (5)
Hamiltonian in such a way that we can “morph” Γ
an unknown partition function into one that we and define the dimensionless basin free energy as
know how to compute analytically. In practice, this the negative log-volume fk = − log vk . When k =
amounts to replacing a high-dimensional integral 0, the walker is completely free to explore the basin
over phase space volume with a low-dimensional volume, while for k → ∞ the walk is reduced to a
integral over one or more Hamiltonian parame- small region surrounding r0 that fits entirely within
ters (each point in the integrand is obtained from the basin. Computing the basin volume amounts to
an equilibrium simulation with a given choice of measuring the dimensionless free energy difference
Hamiltonian parameters), or to estimating ratios between the walkers with k = 0 and k → ∞ so that
of partition functions from equilibrium samples ob-
fk=0 = fk→∞ + (fˆk=0 − fˆk→∞ ) (6)
tained from simulations with different Hamiltonian
parameters [23, 24], as we show presently. where we added the analytical reference free en-
In the context of volume estimations, the poten- ergy fk→∞ to the numerical estimate (denoted by
tial UΓ that encodes the oracle is athermal, so that a “hat”) of the free energy difference between k = 0
we can take β = 1 without loss of generality. Then, and k → ∞. This is necessary because free ener-
in the spirit of umbrella sampling [25, 121], we can gies can only be computed numerically up to an
introduce simple biasing potentials that depend on additive constant equal for all k’s. The analytical
150001-6
Papers in Physics, vol. 15, art. 150001 (2023) / M Casiulis & S Martiniani
reference can be computed by a Gaussian integral values of k that are constrained to “live” closer to
at the largest stiffness value, kmax , provided that the hyperspherical core, as illustrated in Fig. 4.
kmax is large enough that the corresponding ran- The dimensionless free energy difference in Eq. 6
dom walk is essentially unaffected by the boundary can be computed by a simple thermodynamic in-
of the domain. tegration over k [121] or using more sophisticated
Intuitively, this importance sampling method estimators like the Multi-Bennet Acceptance Ra-
should outperform brute-force Monte Carlo sam- tio Method (MBAR) [19], which should yield more
pling because the steps of the walks are chosen to statistically accurate results. The MBAR estima-
remain close to one another, so that even within tor hinges on the idea that one can always relate the
the high-dimensional tentacles, the rejection rate of free energy at one value of k with the free energy
the steps will be much smaller than what we would at every other value of k, via
get by ball-picking within a sphere that contains
the shape of interest. In other words, this method K XNm
X exp[−βUk (ra )]
samples points compactly within the volume being fˆk = − ln ,
P ˆ
Nk′ exp[fk′ − βUk′ (ra )]
estimated, rather than throwing darts at random m=1 a=1
k′
in a huge volume around it. (7)
Nevertheless, running independent biased ran- where Uk (r) = −k|r−r0 |2 /2 is the biasing potential
dom walks remains a poor strategy in high dimen- with spring stiffness k (but can in principle assume
sions. Take for instance the example of the hyper- any shape), K is the total number of biased ran-
cube: Due to the rd−1 scaling for the surface of a dom walks, and Nm is the number of uncorrelated
hypersphere, most points of the walk will be con- equilibrium samples obtained from the m-th ran-
centrated far away from the center of the cube, and dom walk. This system of implicit equations can be
instead will live closep to the heaviest shell of the solved numerically, typically using a self-consistent
mass distribution at d/3. In high dimensions, Newton-Raphson scheme [19]. Ideally, for the qual-
this maximum lies far into the corners, so that a ity of the MBAR solution to be as good as possible,
single random walk would typically spend very long one should measure a similar amount of equilibrium
times in one of the 2d corners. As a result, attempt- samples in every region of the volume being mea-
ing a direct estimation of v with independent free sured. This problem boils down to the choice of Uk
random walks would typically require either expo- and we discuss it briefly in the next subsection.
nentially many (2d ) realizations, or exponentially Finally, MBAR yields the optimal reweighing of
long equilibration times so that the walk can es- the set of histograms hk (r), where r ≡ |r − r0 |,
cape a given corner, reach the (typically tiny) con- allowing for the reconstruction of the full density
vex core, and explore another corner, 2d times. of states, which, in the context of volume mea-
To overcome this problem, one can take inspi- surements, amounts to the mass distribution, h(r),
ration in the large body of work on free energy within the object of interest, that verifies
estimations at low temperature in rugged energy
landscapes (see for instance [121, 122]). A typical X h i
ln h(r) = wk (r) ln hk (r) + βUk (r) − (fˆk − fˆ0 ) ,
strategy is to use parallel tempering [15, 23, 24], k
which amounts to running a collection of simula- (8)
tions with different control parameters (typically h′k (r) being a set of nor-
P
with wk (r) = hk (r)/ k′
different temperatures), each called a “replica”, malised weights.
and allowing for configuration exchanges between
high and low temperature replicas so that the low
iv Measuring hypercubes
temperature replicas do not become trapped in a lo-
cal region of the energy landscape, all while respect- To illustrate how effective this technique is com-
ing the detailed balance condition. In the context of pared to brute-force Monte Carlo, we now apply
basin volume calculations, we exchange coordinates it to the measurement of high-dimensional hyper-
d
between random walks with different stiffness, k, so cubes, r ∈ [−a; a] . For each dimensionality, d, we
that walkers at low stiffness can escape the tenta- run a collection of Kd random walks – that we re-
cles by swapping coordinates with walkers at high fer to as replicas – associated to spring constants
150001-7
Papers in Physics, vol. 15, art. 150001 (2023) / M Casiulis & S Martiniani
(k1 = 0, k2 , . . . , kKd ), for Ns = 5 × 105 steps, at- Finally, to get absolute free energy values, we use
tempting a parallel tempering coordinate exchange the negative log-volume of the largest ball inscribed
between replicas with neighboring values of k every in the hypercube as the known reference in Eq. 6,
10 steps. The number of replicas at a given dimen- instead of the free energy for a tight harmonic trap,
sionality, Kd , is set so that the histogram of sam- so that the free energy becomes
pled distances from the center, hki (r), overlaps sig-
nificantly with the histogram of replicas with neigh- fk=0 = fB(a) + (fˆk=0 − fˆB(a) ) (9)
boring values of k, namely h(r)ki±1 .
The scaling of the number of required replicas where fˆB(a) is the estimated free energy for a set of
with dimensionality, d, can be estimated from the points sampled directly from the largest inscribed
mean and variance of the radial distribution for a d- ball, B(a), whose volume is known analytically.
dimensional Gaussian, hk (r) ∼ rd−1 exp (−kr2 /2). The black line overlayed onto the histograms in
We start by noting that for parallel tempering to Fig. 5 is the reconstructed mass distribution, or
be efficient, and for the MBAR estimation to be density of states, of the cube obtained using MBAR
reliable, these distributions must display signifi- according to Eq. 8. Notice √ that, as expected, the
cant overlap. Choosing the replicas so that their region of large r leading to da = 5 contains very
modes are separated by the standard deviation of little volume.
the narrowest walk, one can show that the required
number of replicas, Kd , scales linearly with d (see
App. A). Recall, for context, that naı̈ve Monte 10
d =100
Carlo sampling typically requires a number of sam-
ples that grows exponentially with d.
We show the output histograms, hk (r), for a 8
practical implementation of these random walks in
a d = 100-dimensional hypercube in Fig. 5. In this
6
h(r )
150001-8
Papers in Physics, vol. 15, art. 150001 (2023) / M Casiulis & S Martiniani
V /V
squares). The results are compared with a brute-
force Monte Carlo method using the same order of 0.6
magnitude of total number of samples, Ns = 108
(gray disks). Notice that the x axis is in log scale. 0.4
In spite of the finite number of samples, and
of the rather crude choice of Kd , the error of the 0.2
MBAR measurement grows roughly linearly with d MC
and remains below 10% even as d reaches 500, while 0.0
5 10 50 100 500 1000
the Monte Carlo estimate essentially measures zero
volume when dimensionality reaches d = 30, the d
point at which the ratio of the volume of the cube
to that of the smallest circumscribed ball is roughly Figure 6: Volume estimates. Ratio of measured volume
V̂ to expected volume V of the unit cube against the
1/Ns . Notice that in d = 500, that ratio is of the
dimensionality d, using brute-force Monte Carlo with
order of 10−156 , meaning that naı̈ve Monte Carlo Ns = 108 points in the smallest ball containing the
would be absolutely unfeasible. Here, using our ap- cube (gray disks) and MBAR (red squares) with K =
proach instead of naı̈ve MC sampling, with compa- max(64, d/5) random walks linearly spaced in r, each
rable numbers of samples, increases the maximum with Ns = 5 × 105 proposed steps. Error bars are 95%
dimension for which a volume can be measured, confidence intervals for the mean, using statistics across
albeit with some statistical error, from 30 to thou- 10 runs.
sands of dimensions. Since the number of samples
used per walk in this example is still rather modest
(5 × 105 ), the statistical error can be reduced by the shell of the cube that contributes most to its
making the random walks longer. mass. In d = 1000, we find that this number is
Our approach also yields the mass distribution, of the order of 1076 . Even if one could somehow
as reconstructed by unbiasing the histograms of guess the radius of the shell that contributes most
individual random walks, according to Eq. 8. In to a high-dimensional volume, direct Monte Carlo
Fig. 7a we show the hypercube mass distribution, sampling of that shell alone is in general impossi-
h(r), for a few dimensionalities up to d = 1000. As ble. In fact, we manage to reconstruct the density
expected, asp d increases, h(r) tends to a sharp peak of states reliably up to points that would require
centered at d/3a, indicated by a red dashed line 10120 Monte Carlo shots on a sphere, using only
for d = 1000. 108 points across all walks.
To illustrate the difficulty of measuring not just Altogether, we have shown for a simple exam-
the volume, but also the mass distribution of the ple that using importance sampling and a free en-
cube in such high dimensionality, in Fig. 7b we plot ergy estimation method (here, MBAR) enables us
h(r)/rd−1 for d = 1000, which amounts to the prob- to measure high-dimensional volumes much more
ability of landing inside the cube when sampling efficiently than brute-force Monte Carlo by virtue
uniformly on the surface of the hypersphere with of the correlations between successive positions of
radius r. Because the intersection between the cube the random walkers, and by avoiding getting stuck
and a sphere becomes very small, we plot the base- in singular features of the domain thanks to parallel
10 log of that ratio. A value of particular interest tempering.
is that obtained for the mode of h(r), as it gives Going beyond the simple example given here, the
an estimate of how many trials would be needed contrast in efficiency can be made even starker by
to find a single point inside the cube if we were refining the precise design of the algorithm we pre-
performing brute-force Monte Carlo, restricted to sented. First, one could choose a different set of
150001-9
Papers in Physics, vol. 15, art. 150001 (2023) / M Casiulis & S Martiniani
2
walk, and requiring that the values go up to the
mode of the mass distribution
p of the cube
p minus
the standard deviation, d/3 − 1/15 − 4/(45d),
1 some simple algebra leads to the asymptotic scaling
Kd ∼ d1/2 . Since MBAR does not require the bias-
ing potentials to be harmonic, it is likely that this
0 scaling could be brought down even further with
0 2 4 6 8 10
an altogether different choice of biasing potentials,
r e.g. by choosing a series of potentials of the form
0 Ui (r) = r1−d exp[−k(r − ri )2 ], where r = |r − r0 |, k
(b ) is a fixed width, and ri is a tunable scalar distance
from r0 .
- 20
- 40
random walks by “cloud sampling”, which amounts
to biasing the Monte Carlo sampling on the average
- 60 weight of a larger number of trial points at each
pMC ≈ 10 -76 step [119]. Alternatively, one could introduce some
- 80 deterministic drift in the random walks, so as to
make them closer to recently-proposed piecewise-
- 100
deterministic processes [26], or so-called Galilean
Monte Carlo [27,28] methods, where particles travel
d = 1000 following straight lines and bounce on walls instead
0 2 4 6 8 10 of diffusing around.
r
The higher exploration efficiency afforded to us
Figure 7: Mass distribution estimate. (a) Mass dis- by this method is far from being free in general
tribution of the unit-sidelength hypercube, a = 0.5, basin volume computations: in order for the ran-
as estimated from MBAR, using the same runs as in dom walk to remain within the basin, we need to
Fig. 6, in d = 2, 10, 50, 100, 200, 500, 1000 dimensions. query an “oracle” at every step to determine if
(b) Decimal log of the ratio between the mass distribu- we are inside or outside the shape of interest [17].
tion and that of the smallest ball containing the whole While this oracle is a simple geometric condition in
cube for pd = 1000. The dashed red lines indicates the the case of the hypercube, for basins of attraction
location d/3a of the maximum for d = 1000, and the in a many-body energy landscape we must solve for
value of the ratio it corresponds to in (b). We indicate the path of steepest descent at every step of the ran-
the probability pM C of landing a single point inside the
dom walk [123], meaning that a single Monte Carlo
cube when drawing points uniformly from that specific
hyperspherical shell.
step typically requires hundreds of energy (func-
tion) evaluations. In liquids, for instance, the cost
of one such energy evaluation scales at best like the
number of particles, N , which needs to be large
biasing potentials than the one used above, which for thermodynamic properties to be measured ac-
reproduces the strategy of Ref. [14]. For instance, curately. While costly, we will show in Sec. III that
the scaling we used for the number of replicas is this cost is manageable in a practical example.
150001-10
Papers in Physics, vol. 15, art. 150001 (2023) / M Casiulis & S Martiniani
150001-11
Papers in Physics, vol. 15, art. 150001 (2023) / M Casiulis & S Martiniani
30 240
SB ●
■
25
■■ SG ●
●
●
●
● ●
■
● ●
●
●
■
● ●
235 ●
● ●
●
●
●●
●●
●
●
■
● ● ● ●● ● ●
● ●
●● ●● ● ●●
S
20 ■
● ● ● ●●
■
● ● ●
● ● ●
● ●● ●
●● ● ● ● ● ● ● ● ●
●
■
●● ● ● ● ● ● ● ● ●● ●
● ● ● ●● ●
■
● ● ● ●
● ●● ● ● ●● ● ●
■
●● ● ●● ● ● ●● ● ●● ●
■
● ● ● ● ● ●●
●● ● ● ●● ● ●
●● ● ●● ● ●● ●
●● ● ● ●● ● ● ● ● ●● ● ● ●●● ● ● ●● ●
●
● ●● ●● ● ●
■
● ●● ● ● ●
● ● ● ● ●● ● ●● ● ● ● ● ●
● ● ●● ●● ●●●● ●● ●● ● ● ●● ●●
●●● ● ● ● ●●
●● ● ● ● ● ●
● ● ●● ●● ●● ●
●●
●●●● ●● ● ● ●● ●●
● ● ●
■
●
15 ● ●● ●
● ● ● ●
● ● ● ●●●● ● ● ●● ●
■
● ● ● ● ● ●
●● ● ● ● ●●● ●● ●● ●● ● ●●● ● ● ●
230 ● ● ● ● ●● ●
● ●
● ●
●● ● ●
● ●●
● ● ● ●●
●●
● ●
● ● ●● ●
●● ●●● ● ●● ●● ●
● ●
● ● ●●
●
●
●●
●●●
● ●
● ●
●
■
● ● ●● ● ● ●
ϕJ
● ●
● ● ●● ●●● ● ●● ●● ● ●●● ●
● ●●
● ● ●●● ● ● ● ●● ● ● ●
● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ●● ● ● ●
● ●● ● ●●●
● ● ● ●●
● ●● ●● ● ● ● ●● ●● ● ●● ●●●● ● ● ● ● ● ● ● ● ● ● ●
● ● ● ● ●●●●● ● ●●●● ● ●
(a )
● ●● ● ● ●●●● ●●●●● ● ● ●●
● ● ● ●● ● ●●●● ● ● ●
●
●●
● ●● ● ●● ●
●● ●
● ● ● ●
● ● ● ●
● ● ● ● ● ●●●● ● ●●● ●● ●●
● ● ● ●●●● ● ● ●
●●
● ●
● ● ● ●● ● ●
●● ● ●●● ● ●●● ●
●● ●● ●●●
●● ●●●●●● ● ●●● ● ● ● ● ● ●● ●●● ● ●
● ● ●
● ●
● ● ● ● ●● ● ● ● ●● ●
● ● ●● ● ●● ●● ●●●●●● ●● ●●● ● ● ● ● ● ●
● ● ● ● ● ● ● ● ●●●●●● ● ● ● ●
● ● ●● ●● ●● ●● ● ●●●● ●
● ●● ●● ●●● ●● ●● ●● ● ●● ●● ● ● ●●● ●
●● ● ●● ●
● ●
●
● ● ● ● ●● ● ●● ● ● ● ●● ● ●● ● ●●● ● ●● ● ● ●●● ● ● ● ● ● ● ● ● ● ●●
● ●
●●● ●● ●● ●● ●●●●●● ●● ●●
● ● ●●● ● ● ●● ● ● ● ● ● ● ●●
10 ●●
●● ●●●●● ●●● ●● ● ● ●● ●
●
● ●
●
● ●● ● ● ● ● ● ● ● ●●● ● ● ● ●●●
● ●● ● ●●● ● ● ●●●●● ● ●●
–ln pi
● ●●
●●●● ●●●●
●● ● ●●● ●●● ● ●● ● ● ● ● ●●
● ● ●●●● ●● ● ●● ● ●● ● ●
ϕ
● ● ●● ●
● ●●●● ● ● ● ●● ● ●● ● ●●● ● ●●●● ● ●●● ● ●● ●
● ● ● ●●
● ●●●●● ● ● ●
●● ● ● ●●
● ● ●
● ● ● ● ● ●● ● ● ● ●●● ● ● ● ● ●
●
ϕ
●● ● ● ● ● ●
●● ●●● ●●● ● ● ● ●
● ●● ● ● ●● ● ● ●●● ●● ●● ●●
225 ●
●
●●
●● ● ● ●● ● ● ●● ●●
●
● ● ●● ● ● ●●●● ●●
●●●● ●● ●●● ●● ● ●●● ●●
●●●●
●●● ●●
●●●
●● ●●● ●●●●●
●● ● ● ●●
●● ●●
● ●●
● ●
● ● ●●● ●● ●
● ●
●●●●
● ●●
●●●●●● ●● ● ● ●●
●
0.828
●
● ● ●● ●● ●●●
● ●● ●●●
●
●● ●●● ●
● ●
● ●
●
●
●● ● ●●● ●●● ●●● ●● ●● ●● ● ●● ● ● ●● ●●●
● ●●
●
● ● ●●● ● ● ● ●● ● ●● ●● ● ●●● ●● ●
● ● ● ● ● ●● ● ●● ●
●
● ●●
●●●
●
● ●●●● ● ● ●●●
● ●● ●●●
● ●
●●● ●● ● ● ●● ●● ●
● ● ● ●● ●● ●●●
●
● ●●● ●● ● ●●●● ● ●
● ● ● ● ● ● ● ● ●●●●
● ●● ● ● ● ●
● ●●● ● ●● ● ●
●● ● ●● ● ● ● ●● ● ● ●● ● ● ● ● ●● ●● ● ●● ●● ● ●● ● ● ● ●● ●●
●
●●●
●
●● ● ●●● ●● ●●●● ●●● ●● ● ● ● ●● ● ●●●
● ● ● ●● ● ● ● ●●● ●●●●● ●● ● ● ●●●●● ●●●● ● ●● ●● ●● ● ● ● ●● ●● ●● ● ●●●
● ●
●● ● ●●●● ● ●
● ●● ● ●
●●● ● ● ●●●●●
●
● ● ● ● ● ●● ●● ●●● ●● ●● ●●●
●●
● ●● ●●
● ● ● ● ● ● ● ● ● ● ● ●● ● ● ●
●●●●
●● ● ● ●●● ●●● ● ● ●● ●
● ● ●●● ● ●●
●● ●● ●● ● ● ● ● ● ●● ● ● ●
● ●● ● ●●● ●●●●● ●● ●●● ● ● ●● ●●●● ● ●● ●● ● ● ●●●
● ●● ●● ● ● ●●●●●● ● ●●● ● ●● ● ●●● ● ● ●●●● ●●●●●●●● ●●
● ●● ● ● ●
●●●● ●● ● ● ●
0.830
● ● ●● ●●
●●●●●
0.10 ●
● ●
●
●
● ● ●● ●● ● ●
● ●
●●
● ● ●● ●
● ● ● ●
● ●● ● ●●●●●●●●
●● ●● ●
●●● ● ●●
● ●● ●
●●●
●● ●●
●
● ●
●●● ●
●● ● ●●●●
●● ● ● ●●
●● ● ●● ● ●● ●●
●● ●
●● ●
●●●
●●
●●
● ●●● ●
● ●●
●●● ● ●● ●● ●
●●●● ●●
● ●● ●
●●●●●
● ●●
● ●●●
● ●●●●●● ●
● ● ●● ●
● ●●● ●● ●
●●
●●●●●●
●●
●
● ●
●●●
● ●● ●●
● ●
●● ●●●●●●
●●●●
●
●●● ● ● ● ●
●
●
● ●● ●●● ● ● ● ●
●
●
●
●
●●●
● ●
● ●
●● ●●●
●●●
●●
●
■
● ● ● ● ●● ●● ● ●● ●
●● ●●●● ●●
●● ● ●● ●●●● ●●● ●● ● ●●●● ● ● ● ●●●
● ● ● ●●●● ●●● ●● ●●●● ●●● ●●● ●●● ● ● ●● ● ●● ● ●●● ● ●
●●●● ●● ●●●● ● ●● ●●●●● ●● ●
●●● ● ●
●●●● ●●
●
● ● ● ● ● ●●●● ● ●
●
● ●●● ● ●●
●
● ●
●● ●
●●●● ●●●
●● ●● ●●●●
● ●● ●● ● ● ● ● ● ●●●●●● ● ● ●●●● ●●
●●● ● ●
● ●●● ●
●
●
● ● ● ● ● ●● ●
● ● ●● ● ● ● ● ●●● ●● ● ● ● ● ● ● ● ●●
●● ●● ●●●● ● ●● ● ● ● ●●● ●
●●● ●● ● ● ● ●●●●●
●●●● ●●● ● ●●● ● ●● ● ●●● ●● ● ● ●● ●●●●● ●
● ●
●● ● ●● ●● ● ● ●●●●● ●●●
●●
■
● ● ●● ●●● ● ● ●● ●● ● ●● ● ● ●●● ● ● ●●
●
●● ●●● ●
0.835
● ● ● ●● ●● ● ● ●● ●● ● ● ●●●● ●●
● ●● ●
● ● ●● ● ●● ●
●●
● ●●
●● ●
●
● ● ● ●
●●●●●●●●
● ●●●
●●
●● ●● ●●● ●●●●●●● ●●●
●● ● ●● ●● ● ●
● ● ●● ●
●● ● ● ● ●● ● ●● ● ●●●● ●●●● ●●●●● ● ● ●● ● ●●● ● ●
●● ● ● ●● ●●●●●●● ● ●● ●● ●●●●●●
● ●● ● ● ● ●● ● ● ●● ●●● ● ●●● ● ●●● ●●● ●●● ●●●●●● ●●●● ●
● ●●●●●
●
●●●● ●
●● ●●●● ●
● ● ●● ● ●● ●● ● ●●● ●● ●●●
●●●●●● ●●●● ● ●●● ●●
●● ●● ● ●●● ●●●●
●
●●●●●● ●●● ●● ●●●● ● ● ● ● ●
●● ●●●●●● ●
0.08 ●● ● ●● ● ● ● ●●●● ● ●●● ●●●●●
●●●●● ●●
● ● ● ●● ●● ● ● ● ●● ●●●● ●●● ●
220 ● ● ● ●●
●
●●●●
●● ●●● ● ●
● ●●
● ●●
●●
●●●● ● ●
●
●● ●●●● ●●●
●● ● ● ●
●● ●●● ●●
●
● ● ●●●
●
●●●●
●●●
● ● ●●
●●
●●● ●●●
●●●●
●●●
●● ●●●● ●
● ●●●●● ●●●●●
● ● ● ●
●●
●
●●
●●●●●
●
●●●● ●●●● ●● ●
●
● ●●●● ● ●
●
● ●●●
●
●
●
●
■
● ● ● ● ●● ● ●● ●
● ● ●● ●●●● ● ●●●●
●●● ●
●
● ● ●● ●● ● ● ●
● ● ● ● ● ●● ● ● ● ● ● ●●●●● ● ●● ●●
●●●●●
●● ●●●●● ●●●● ●●
●●●
●●
● ●● ●● ●
● ●●●● ● ● ● ●
● ●●● ●
0.840
● ●● ● ● ●●
● ●● ●●● ●
●● ●
● ● ● ● ● ● ● ●● ● ● ● ●● ●●
●● ●●● ●● ●●●● ●●● ● ●●●● ●
● ●● ● ● ●● ●● ●● ●
● ● ●●●●●● ●●●●●● ●● ●●● ●●
● ● ●●● ● ● ●●● ● ●●●●● ●● ● ●●
●●●●●
● ●● ● ● ●● ●●
●●
● ●●
●●●
● ●●
● ●●
● ● ●● ● ● ● ● ●●● ●
●● ● ● ●●●● ● ●●●●●●● ●●●● ● ● ●● ● ●● ●
● ●●● ●●
●●●● ● ● ● ● ●
● ● ● ● ●
● ●●●● ● ● ●●● ●● ●●
● ●● ● ●●●●● ●●●●
● ●●● ●●● ●
● ● ●● ● ● ● ● ●● ● ●●● ● ●●● ●● ●●
● ●
●● ●● ● ● ●
●●●●●●● ● ●● ● ●●●●●● ●●● ●●●●●● ●
● ● ● ●● ● ●● ●●● ●● ●●● ●●●●●●●●● ●● ● ● ●
● ●
●● ● ● ● ●●● ● ● ●● ● ●●●●● ●● ●●● ●● ●●●
●● ● ● ● ●● ● ● ●● ● ● ●●●● ● ●●●●●●● ●●● ● ● ●●
●
● ●●● ●● ●●
● ●●
●●●● ● ●● ●● ● ●●●● ● ●● ●● ●●●●●● ● ●
0.06
● ● ●● ●● ●●●● ● ● ● ●● ●
●● ●● ●● ●● ●●● ● ●●● ●● ● ● ● ● ●●●● ●● ●●
● ●● ● ●● ●●● ●●●● ●● ● ●●●●●
● ●● ●●● ●●●● ●●
0.845
● ●● ●● ● ● ●●● ●● ● ● ●●● ●●●
● ●●●●●●
■
● ●● ●● ● ●● ●●● ● ● ●● ●●●●
●● ● ● ●● ●●●●
● ● ● ●● ●●●● ●
● ●●● ●●●●
● ●●● ●● ●● ● ●● ●●●●●
●
● ● ● ● ● ● ● ●
●
●
● ● ●●
●
● ●
●●● ●●
●
● ●● ● ●●●●● ●●●
●●
●● ● ●
● ● ●● ●● ● ●
●
●
● ●
● ● ● ● ● ●
● ● ● ●
● ●●● ● ● ● ●● ●●●● ●●●●●● ● ●●●● ●● ●●● ● ● ● ●●●
●●● ● ● ●●
●●
λ
0.850
● ● ● ● ● ● ● ● ● ● ●● ● ●
● ● ● ● ●● ●●●●●● ●●● ● ● ● ● ● ● ●
■
●● ●●●●● ●●
●●●● ● ●● ●●● ● ●●● ● ●●●●
0.04 215 ●
● ● ●
●
●
●
●●
● ● ● ● ●
●
●●
●
●● ●● ●
● ● ●
● ●● ● ● ●
●● ●●
●●● ●
●●
●●●
●
● ●●
●●● ●
● ● ● ●●●
●●●
●
● ●●●
●● ●● ●
●●●
●
●●●
●●● ●
●● ●
●●
● ●
● ●
● ●● ● ●
● ● ●●
● ●●● ● ● ●● ● ●
● ● ● ● ●● ● ● ●● ●●●●
●●
●● ● ● ●
■ 0.855 ●
●
● ●
● ● ●
●
●●
●
● ●
● ●
●●
●
●●●
●● ●● ●
●●●●
● ●
●
●● ●
● ● ● ● ●● ●● ●
●● ●
●●● ●● ● ●
●● ●
●
●
●
0.02
■■
●
(b )
● ● ●
(c ) 0.860 ●
●●
●
● ●
●
●
●
●●● ● ●
0.00 210
0.82 0.83 0.84 0.85 0.86 -6 -4 -2 0 2 4 6
ϕ ln P
Figure 9: Checking Edwards’ hypothesis. (a) Gibbs-Shannon entropy SG (blue) and Boltzmann entropy SB
(orange) as a function of ϕ, as obtained from a parameter-free kernel-density estimate (KDE) fit of the distribution
of basin volumes. The dashed curves are second-order polynomial fits. As ϕ approaches the unjamming density
for N = 64 particles ϕJ = 0.82 (dashed gray line), SG tends to SB , implying equiprobability of all states precisely
at unjamming. (b) Scatter plot of the log of the probability of landing in a given basin, pi (proportional to the
volume vi ) against the log of the pressure P measured in that basin. Power-law scaling relations (solid lines) are
found for several densities indicated in the inset. (c) Exponents found for the power laws of (b), plotted against
ϕ. The dashed black line is a linear fit, and the dashed gray line indicates ϕJ . The raw data from Ref. [14] was
used here and re-analyzed independently.
soft potential (see Fig. 8). Consider N such parti- semble of collectively jammed states [41]. A smaller
cles, put them into a periodic box with sidelength subset of minima is that of states stable not only
L at a packing fraction ϕ. The system is endowed against compression, but also shear: the condi-
with a many-body energy landscape that, at high tion of non-zero shear modulus this time imposes
enough densities, contains many different basins. Nc ≥ d(Nnr − 1) + 1, which defines the ensemble of
At the bottom of each basin lies a single local mini- strictly jammed states [41]. In practice, we restrict
mum, that can be interpreted as a zero-temperature ourselves to the latter case [14], which is a more
configuration of the system. At the characteristic broadly accepted definition of jamming. There-
unjamming density ϕJ (the point at which the sys- fore, by implementing the machinery of Sec. II to
tem goes from a liquid to a disordered solid), each the energy landscape of hard-WCA particles, and
minimum corresponds to a packing of non-frictional restricting the set of relevant basins to mechani-
hard particles. Among these minima, there is a cally stable configurations, one can estimate the
subset of mechanically stable configurations. De- Boltzmann and Gibbs-Shannon entropy of granu-
pending on the system of interest, it is common lar packings.
to define mechanical stability through a non-zero The main result, originally obtained in Ref. [14]
bulk modulus, which is equivalent to saying that for N = 64 hard-WCA disks, is reproduced in
the number of contacts verifies Nc ≥ d(Nnr − 1), Fig. 9a. This plotP shows that the Gibbs-Shannon
Ω
where Nnr is the number of particles that are not entropy, SG = − i=1 pi log pi + const., and the
rattlers (mobile particles) [40], thus defining the en- Boltzmann entropy SB = log Ω + const., both eval-
150001-12
Papers in Physics, vol. 15, art. 150001 (2023) / M Casiulis & S Martiniani
uated using basin volume measurements, converge IV The road ahead: from packings
precisely at ϕJ ≈ 0.82, implying that only at this to generic dynamical systems
density is Edwards’ hypothesis, pi = 1/Ω, verified.
It can be checked through finite-size analysis [14] In the rest of the paper, we propose perspectives for
that this packing fraction is consistent with the the basin-volume method, ranging from extensions
lower range of values for which the system unjams of the problem of granular packings to completely
at N = 64 following the same preparation protocol, unrelated problems.
so that the two entropies coincide only at unjam-
ming. Note that, while the value of the unjam- i Future work on granular packings
ming density observed in compression experiments
depends on the precise preparation protocol, what As discussed in Sec. III, the basin-volume method
we report is a property of the energy landscape of led to a direct observation that Edwards’ hypothe-
jammed packings, which is a generic feature of the sis is only valid strictly at jamming in d = 2. There
system independent of preparation recipes. are however still several aspects of this problem
that remain unchecked, and could be addressed by
Furthermore, as shown in Fig. 9b, the authors
the very same method.
also found that above jamming, there exists a ro-
First, these results were obtained within the iso-
bust power law relationship, pi ∼ P −N λ(ϕ) , be-
choric ensemble, approaching unjamming where the
tween the probability of observing a packing and
measured pressure vanishes, P → 0+ . It has been
its pressure. This power law suggests a hierar-
argued that the equiprobability of packings could
chical structure of the energy landscape of hard-
break down when switching to the isobaric ensem-
WCA particles, where low-energy minima have
ble [42], viz., when allowing volume fraction fluctu-
large basin volumes and high-energy minima have
ations through particle inflation and deflation to
small volumes, see Fig. 10 for an illustration of this
maintain constant hydrostatic pressure, or more
property.
generally in the isostress ensemble that also con-
Finally, as shown in Fig. 9c, the exponent λ de-
strains the shear stresses.
creases roughly linearly as the packing fraction ap-
The isobaric and isostress ensembles can be ex-
proaches jamming from above, and reaches 0 at ϕJ .
plored within a basin volume framework. To do so,
In other words, at jamming, the volume of basins
in the spirit of Parrinello-Rahman barostats [43–
becomes a flat distribution that does not depend on
45], the trick is to allow for deformations of the
the pressure in the system, adding to the evidence
simulation box (both isotropic compressions and
that all basins are equiprobable at ϕJ . As shown
constant-volume deformations), and to replace the
in App. B, this argument on λ can be made more
energy landscape, E, by an enthalpy-like landscape,
formal, and the difference between the Boltzmann
H [40], where
and Gibbs entropies can be written as
H = E − σ : εV0 (13)
1
lim [SB (V ) − SG (V )] = O(λ2 ). (12) explicitly contains the dependence on the full stress
N →∞ N
and strain tensors σ and ε through their Frobenius
P
In other words, in order for the two entropies to inner product A : B = i,j Aij Bij , as well as a
agree at a given density in the thermodynamic reference volume V 0 of the simulation box. Note
limit, the exponent λ must go to 0 as observed. that a subcase of this strategy is that in which only
All in all, a first application to granular packings isotropic compression is allowed, ε = V /V0 I, with
has shown the potential of the method proposed I the identity tensor. In that case, only the trace
in Sec. II. By replacing an intractable enumera- T r[σ] ≡ −P participates in the box deformation
tion problem with a simpler sampling problem, the term, leading to the usual definition of enthalpy
basin-volume method allowed us to test a hypoth- with respect to the hydrostatic pressure P , namely
esis that had been left unchecked for 30 years. It H = E + P V . Basins are then defined through
also yielded insight into new physics at the jamming steepest descent paths that lead to the same en-
transition, by revealing a hierarchical structure of thalpy minimum, in a configuration space compris-
the basins of attraction above jamming. ing not only particle positions, but also d(d + 1)/2
150001-13
Papers in Physics, vol. 15, art. 150001 (2023) / M Casiulis & S Martiniani
● ●● ●
�
●
●●
●
●
●
●
●
●
● ● ●● ●
●
●
●
●●
●
�� ��/
●●● ●
�~
●● ●● ●●● ●●
● ●● ●●
● ●
● ● ● ● ●
● ● ● ●●● ● ●● ●● ●● ●
●● ● ●
●● ●●●●●● ●● ● ●● ●●●●● ● ●
● ● ●● ● ●
● ● ●●●●
●●● ● ● ●
●●● ● ●●● ●●●●
● ●
●
●●
●
●
ϕ
●● ●●
●
●● ●● ● ● ● ●● ●
●●●●● ● ●●
● ● ●●●● ●● ● ● ●● ●
● ● ●●
●●●●●●●●● ●●●
●●
●●
● ● ●
● ● ● ●
●
● ●●●●
● ●●● ●
●●●
●●●
● ● ● ●● ●
●●
●
●●●
●
●
●
●●●●
●
●●●●●●
●●
●●●
● ●
●
●
● ●●
●●
●● ●●●●
●
●
●
●
●
●●
●●
●
●
●●
●
●
●
●
●
●
●
●
● ● ●●
●●
●
●
●● ● ●
●
●
●
●
● ●● ●●
●
●
●●●
●
●
●●●
●
●
●
●
● ●
● ●●
●● ●●
●●●
● ●●
●●●
● ● ●
●●●●
●●●●●
●
●
●
●●
●
●
● ● ●●● ● ● ●
●●● ●●●●●●●
●
●●●
●
● ● ●●●●●●●● ●
● ●
●
●
●●●● ●●
●●●
●
● ●●● ●●
● ●●
●●●●●● ●
●
●
● ●● ●
●●● ●●
●
●●●
●●
●●● ●
●● ●
●●●●
●●●●●
● ●
●
●●
●●
●
●
●●
●
●
●●
●●● ● ●● ● ● ● ●
●●● ● ● ●●
●●● ●●● ● ●● ●●●● ●
● ● ● ●● ●
�� �
●● ● ● ●
● ●● ●●
● ● ●● ●●●
●●● ●●● ●●● ● ●●●●
●●● ● ●●●●
●●● ●●●
●●●●
●●
●●● ●● ●●● ●● ● ● ● ● ● ● ●
● ●●● ●
●
●●
● ●● ● ● ●● ●● ● ●●●
● ●●● ●●●
●
●
●
●
●
●
●●
●
●●
● ● ●
●●●
●●
●
●
●●
●
●●
●
● ●●●
●
● ●●●
●●
●
●●●
●●●
●●
●
● ●
●
●●●
●●
●●
● ●
●
●●●●
● ●●●
●
●
● ●●●●●●●
● ●
●
●
●
●
●
● ●●●●●
●●●●●
● ●
● ●
●
●
●●
●
●
●
● ● ● ●
●● ● ●●●●● ●●●
● ●●● ●●●
●●●●● ●●●●●
●●●● ●●●●●● ●● ●
●
●●● ●●
●●●●
●●
●●●●●●●
●●● ●
●● ●●●
● ●●
●●
●● ●
● ●●
●
●●●●
●
●●●
●●
●●●●●●●
●●●
●●● ●●●● ●
●
●●
●
●●
●
●
●
●
●
●
● ● ●● ● ●● ●● ●●● ●●●● ● ●●●
● ● ●●●●● ●●
●
●
-��� ����� ●
● ●● ● ●● ●●
● ● ●● ●●●● ● ● ●● ●●●●●●
●●
●●●
●● ● ●●●● ●●
●● ●●● ● ● ●
●●
●
●●●
●
●
●
●●
●
●●●●●
●
●
●
●●
●●
●
●
●●●
●
●●
●●
●
●● ●
●●●●●
●●●●
●●●●
●
●●
●
●●●●●
●●●
●
●●
●●●● ●● ●
●
●●● ●
●●
●
●
●
●
●●
●
●● ●● ●
● ● ●● ●●●●
● ●●● ●
●●●●● ●●
●●
●●● ●●● ●●
●
●●
● ●● ●●● ●
●●
●●
● ●●
● ●● ● ●
● ●●
● ● ● ●●
●
●●
●
●●●●
●
●
●● ●● ●
●● ●● ●
●●●●●
●●
●●
●● ●
●● ● ● ●●
●●● ●
●
●●● ● ● ●●
● ●
●
●●
●
●●●● ● ●●
●●● ●● ●
● ●●
●●●● ●●
●●
●
●
●
●●
●
●
●
●
●●
●●
● ● ●●●●● ●●●●● ●● ●●● ●● ●● ●● ●● ●
●●●●●●
●● ●●●●
●
●● ●●●● ● ●
● ●●● ●●●
●●
●
●
●
●
-� ●●
● ●●
●
● ●
● ●● ●● ●
● ● ● ● ● ●●● ● ●
● ●
● ●
●
●●
●
●● ● ●● ●
● ●
●
●●
● ● ● ●●
●
●●
●●
●
●
●
●
●
●●
�����
● ●
● ●
● ●
(�) ( �)
one needs to sample configurations with constant -��� �����
-�� -� �
●
●
� ��
-�
-� � �
N and P , with an oracle defined through a mini- �� � �� �
mum of the enthalpy, and using umbrella sampling
not only on the particles’ positions, but also on the Figure 10: Hierarchical energy landscape of soft sphere
degrees of freedom of the box shape. This requires packings. (a) Sketch: the power law between basin vol-
the inclusion of independent shearing and stretch- ume, vi , and pressure suggests that a minimum lying
ing (viz., box deformations) moves in our Monte at Ei > 0 has a basin with volume vi ∝ Ei−γ with γ
Carlo sampling [46–49] as the building block of the a positive exponent. We verify this scaling relationship
in (b), using the same data as in Fig. 9. Panel (c) shows
random walks. Each of these moves is accepted or
a smooth dependence
√ of P as a function of E, with an
rejected according to a Monte Carlo criterion set
asymptote P ∝ E at unjamming.
by a combination of the oracle, which checks that
the new point still falls to the same minimum of
the enthalpy, Eq. 13, and the umbrella sampling Fig. 10(b), using the same data as in Fig. 9, and
biasing potentials that contain box deformations. report a density-dependent power-law between the
Implementing this algorithm as part of a basin vol- volume vi of a basin and the energy at the corre-
ume calculation, while numerically costlier than the sponding minimum. This power law results from P
usual isochoric strategy, would constitute an impor- being a smooth function of the energy, as shown in
tant check on the validity of the Edwards hypoth- Fig. 10(c), where we emphasize that √ unjamming is
esis. accompanied by an asymptote P ∝ E. This be-
Second, one may reasonably wonder whether the haviour is expected near unjamming, where over-
results are still observed in higher dimensions of laps are small so that the pair potential is well
space, notably in d = 3, where many open ques- captured by a harmonic approximation, U (r) ∼
tions remain to be answered to fully understand (r − r0 )2 [119], and the pressure contribution from
the ensemble of jammed states, thus constituting a one interacting pair, as obtained√ from a virial ex-
topic of current research [50]. pression, reads p ∼ r0 (r−r0 ) ∼ √ U which, summed
Finally, we observe that, in the isochoric en- over, can be shown to yield P ∼ E as E → 0.
semble and above jamming, the basin volumes are
linked to the pressure of the packings via a power-
ii Beyond granular jams: glassy systems
law suggesting that configuration space is tiled by
a hierarchy of basins (see Fig. 10(a)), which has Based on our discussion thus far, a reader may
neither been fully appreciated, nor explored. We think that basin volume calculations can only be
check that such a hierarchical dependence exists in used to measure granular entropies. In fact, these
150001-14
Papers in Physics, vol. 15, art. 150001 (2023) / M Casiulis & S Martiniani
methods could be used to measure a number of in- tion (with or without anharmonic corrections), av-
dicators for the shapes of the basins, in addition eraged over many IS [68]. Using this and an-
to the radial mass distribution, as well as their other approach based on a Frenkel-Ladd-like com-
neighborhoods. Such measurements could have putation, Berthier and coworkers recently produced
deep fundamental implications, if one does not con- compelling results suggesting that the change in
sider a granular (or zero-temperature) system, but Sconf for supercooled liquids in d = 3 is consis-
a jammed liquid at a finite temperature. Indeed, tent with the existence of an ideal glass transition
in a finite-temperature glassy system, the shape at TK > 0 [65, 69–71].
of basins controls transition rates and relaxation Through basin volume calculations, it is possible
times [51–54], which has prompted a large body of to compute qi directly, and thereby obtain a general
recent work on the enumeration and characteriza- approach to estimating the configurational entropy
tion of minima of rough landscapes [52, 55–58]. of supercooled liquids, without resorting to any ap-
Achieving a clear picture of the basins of at- proximations. In particular, this method does not
traction in the configuration space of a dense liq- need to assume large system sizes, and could al-
uid in physical dimensions would therefore be very low to precisely study finite-size effects in complex
exciting, as it would open avenues for fundamen- free energy landscapes, a problem that is nontrivial
tal checks regarding glassy dynamics. In an ex- even in simple theories in 1d [66].
act mean-field treatment of hard sphere glasses, Finally, in the context of glassy systems, free en-
a Kauzmann transition known as random first- ergy methods could in principle be used to obtain
order transition (RFOT) is attained [62–64]. In not just the (Boltzmann weighted) volume of the
this framework the configurational entropy Sconf = basins of attraction of inherent structures, but also
log ΩF is defined in terms of the number of free en- more complicated information about their geome-
ergy minima ΩF , so that the notion and number of try, topology, and connectivity (e.g., what is the
“glassy states” is well defined and the ideal glass chromatic number for the tiling of basins in the en-
transition results from the population of glassy ergy landscape? [130]). Understanding the proper-
states becoming subextensive in system size. In ties of the most probable paths between minima of
finite dimensions, free energy minima are not in- the energy landscape is of particular interest to un-
finitely long-lived and the precise definition of a derstand the dynamics of relaxation in glasses, and
“glassy state” remains debated [65]. A popular take could help bridge the gap between microscopic dy-
on the problem was proposed by Stillinger and We- namics and mesoscopic models that recently shed
ber [60,67,124], who suggested that the supercooled new light on the interpretation of the relaxation
glassy states would reside close to the minima of the spectrum of glassy materials [72, 73].
potential energy surface, known as inherent struc-
tures (IS) of the liquid. For each IS there is an iii Generic dynamical systems: ecosys-
associated basin of attraction, that is, the set of all tems, neural networks, and more.
initial conditions leading to the i-th IS by steepest
descent. The partition function can then be ex- In the remainder of this paper, we discuss what we
pressed believe to be some of the most exciting opportu-
PΩclassically as a sum over the individual IS,
Q = i=1 qi , where, for β = 1/kB T , nities for basin-volume methods, namely their ap-
Z plications to generic dynamical systems. Indeed,
going back to the method presented in Sec. II, we
qi = e−βU (r) dr, (14)
Γi never explicitly relied on the existence of an un-
derlying energy function. In principle, one can de-
and U(r) is the interaction potential. Until re- fine any dynamics of their choice, let them evolve,
cently, qi could not be computed, and the ac- and classify initial conditions according to which
cepted operational definition for the configura- steady-state or dynamical attractor they eventu-
tional entropy was Sconf = Sliq − Sharm , where ally fall into, as sketched in Fig. 11. One can then
Sliq is the total entropy computed by thermody- always define the Shannon entropy associated with
namic integration, and Sharm is the vibrational the volumes of these attractors, and use it as a
entropy computed from the harmonic approxima- generic descriptor of the system for a given set of
150001-15
Papers in Physics, vol. 15, art. 150001 (2023) / M Casiulis & S Martiniani
150001-16
Papers in Physics, vol. 15, art. 150001 (2023) / M Casiulis & S Martiniani
150001-17
Papers in Physics, vol. 15, art. 150001 (2023) / M Casiulis & S Martiniani
pi = vi /V, one gets SG = −⟨ln vi ⟩B − ln N ! + ln V, In order to simplify this expression, we use the re-
where the average is biased proportionally to the sult of Ref. [14] that ln P is approximately normal-
volume of the basins. The power-law relation distributed, so that
shown in Fig. 9b entails a linear law between logs,
1 (ln P −µ)2
−⟨ln vi ⟩B = N (λ(ϕ)⟨ln P ⟩B + c(ϕ)). Therefore, the B(P |V )P N λ ≈ √ e− 2σ2 +N λ ln P . (24)
Gibbs entropy reads P 2πσ 2
150001-18
Papers in Physics, vol. 15, art. 150001 (2023) / M Casiulis & S Martiniani
[5] R Avinery, M Kornreich, R Beck, Univer- [17] D Frenkel, K J Schrenk, S Martiniani, Monte
sal and Accessible Entropy Estimation Using Carlo sampling for stochastic weight func-
a Compression Algorithm, Phys. Rev. Lett. tions, Proc. Nat. Ac. Sci. 114, 6924, (2017).
123, 178102, (2019).
[18] S S Ashwin, J Blawzdziewicz, C S O’Hern, et
[6] A Cavagna, P M Chaikin, D Levine, et al., al., Calculations of the structure of basin vol-
Vicsek model by time-interlaced compression: umes for mechanically stable packings, Phys.
A dynamical computable information density, Rev. E 85, 061307, (2012).
Phys. Rev. E 103, 062141, (2021).
[19] M R Shirts, J D Chodera, Statistically opti-
[7] S Ro, B Guo, A Shih, et al., Model-Free Mea-
mal analysis of samples from multiple equi-
surement of Local Entropy Production and
librium states, J. Chem. Phys. 129, 124105,
Extractable Work in Active Matter, Phys.
(2008).
Rev. Lett. 129, 220601, (2022).
[8] D A Wiley, S H Strogatz, M Girvan, The size [20] X Ding, J Vilseck, C Brooks, Fast Solver for
of a sync basin, Chaos 16, 015103, (2006). Large Scale Multistate Bennett Acceptance
Ratio Equations, J. Chem. Theory Comput.
[9] S P Cornelius, W L Kath, A E Motter, Real- 2019, 802, (2019).
istic control of network dynamics, Nat. Com-
mun. 4, 1, (2013). [21] J G Kirkwood, Statistical Mechanics of Fluid
Mixtures, J. Chem. Phys. 3, 300, (1935).
[10] Y Zhang, S H Strogatz, Basins with tentacles,
Phys. Rev. Lett. 127, 194101, (2021). [22] A Gelman, X-L Meng, Simulating normaliz-
[11] D Frenkel, A J Ladd, New Monte Carlo ing constants: from importance sampling to
mehod to compute the free energy of arbitrary bridge sampling to path sampling, Statist. Sci.
solids. Application to the fcc and hcp phases 13, 185, (1998).
of hard spheres, J. Chem. Phys. 81, 3188,
[23] A Bunker, B Dünweg, Parallel excluded vol-
(1984).
ume tempering for polymer melts, Phys. Rev.
[12] N Xu, D Frenkel, A J Liu, Direct determi- E 63, 016701, (2000).
nation of the size of basins of attraction of
jammed solids, Phys. Rev. Lett. 106, 245502, [24] H Fukunishi, O Watanabe, S Takada, On the
(2011). hamiltonian replica exchange method for effi-
cient sampling of biomolecular systems: Ap-
[13] D Asenjo, F Paillusson, D Frenkel, Numerical plication to protein structure prediction, J.
calculation of granular entropy, Phys. Rev. Chem. Phys. 116, 9058, (2002).
Lett. 112, 098002, (20114).
[25] J M Torrie, J P Valleau, Nonphysical sam-
[14] S Martiniani, K J Schrenk, K Pamola, et
pling distributions in Monte Carlo free-energy
al., Numerical test of the Edwards conjecture
estimation: Umbrella sampling, J. Comput.
shows that all packings are equally probable at
Phys. 23, 199, (1977).
jamming, Nat. Phys. 13, 848, (2017).
[15] S Martiniani, K J Schrenk, J D Stevenson, [26] A Chevallier, F Cazals, P Fearnhead, Effi-
et al., Turning intractable counting into sam- cient computation of the volume of a polytope
pling: computing the configurational entropy in high dimensions using Piecewise Deter-
of three-dimensional jammed packings, Phys. ministic Markov Processes, ArXiv Preprint,
Rev. E 93, 012906, (2016). 2202.09129, (2022).
150001-19
Papers in Physics, vol. 15, art. 150001 (2023) / M Casiulis & S Martiniani
[36] E S Bililign, J E Kollmer, K E Daniels [47] S Yashonath, C Rao, A monte carlo study of
Protocol dependence and state variables in crystal structure transformations, Mol. Phys.
the force-moment ensemble, Phys. Rev. Lett. 54, 245, (1985).
122, 038001, (2019).
[48] R J N Baldock, L B Pártay, A P Bartók, et
[37] X Sun, W Kob, R Blumenfeld, et al., al., Determining pressure-temperature phase
Friction-controlled entropy-stability competi- diagrams of materials, Phys. Rev. B 93,
tion in granular systems, Phys. Rev. Lett. 174108, (2016).
125, 268005, (2020).
[49] R J N Baldock, N Bernstein, K Michael
[38] Y Yuan, Y Xing, J Zheng, et al., Experi- Salerno, et al., Constant-pressure nested sam-
mental test of the Edwards volume ensemble pling with atomistic dynamics, Phys. Rev. E
for tapped granular packings, Phys. Rev. Lett. 96, 043311, (2017).
127, 018002, (2021).
[50] C Anzivino, M Casiulis, T Zhang, et al., Esti-
[39] J D Weeks, D Chandler, H C Andersen, Role mating random close packing in polydisperse
of repulsive forces in determining the equi- and bidisperse hard spheres via an equilibrium
librium structure of simple liquids, J. Chem. model of crowding, J. Chem. Phys., In Press
Phys. 54, 5237, (1971). (2022).
150001-20
Papers in Physics, vol. 15, art. 150001 (2023) / M Casiulis & S Martiniani
[51] T Castellani, A Cavagna, Spin-glass theory [63] S Franz, G Parisi, Recipes for metastable
for pedestrians, J. Stat. Mech. 2005, P05012, states in spin glasses, Journal de Physique
(2005). I 5, 1401, (1995).
[52] V Ros, G Ben Arous, G Biroli, et al., Com- [64] G Parisi, F Zamponi, Mean-field theory of
plex Energy Landscapes in Spiked-Tensor and hard sphere glasses and jamming, Rev. Mod.
Simple Glassy Models: Ruggedness, Arrange- Phys. 82, 789, (2010).
ments of Local Minima, and Phase Transi-
[65] L Berthier, M Ozawa, C Scalliet, Configu-
tions, Phys. Rev. X 9, 011003, (2019).
rational entropy of glass-forming liquids, J.
[53] T Rizzo, Path integral approach unveils role Chem. Phys. 150, 160902, (2019).
of complex energy landscape for activated dy-
[66] C Rulquin, P Urbani, G Biroli, G Tarjus,
namics of glassy systems, Phys. Rev. B 104,
M Tarzia, Nonperturbative fluctuations and
094203, (2021).
metastability in a simple model: from observ-
[54] G Folena, A Manacorda, F Zamponi, Intro- ables to microscopic theory and back, J. Stat.
duction to the dynamics of disordered sys- Mech. 2016, 023209, (2016).
tems: equilibrium and gradient descent, Lec-
[67] F H Stillinger, A topographic view of super-
tures Notes for the Fundamental Problems
cooled liquids and glass formation, Science
in Statistical Physics XV Summer School
267, 1935, (1995).
(2021).
[68] F Sciortino, Potential energy landscape de-
[55] V Ros, G Biroli, C Cammarota, Complexity
scription of supercooled liquids and glasses, J.
of energy barriers in mean-field glassy sys-
Stat. Mech. 2005, P05015, (2005).
tems, EPL 126, 20003, (2019).
[69] L Berthier, P Charbonneau, D Coslovich,
[56] V Ros, G Biroli, C Cammarota, Dynamical
A Ninarello, M Ozawa, S Yaida, Configura-
Instantons and Activated Processes in Mean-
tional entropy measurements in extremely su-
Field Glass Models, SciPost Phys. 10, 002,
percooled liquids that break the glass ceiling,
(2021).
Proc. Nat. Ac. Sci. 114, 11356, (2017).
[57] B Lacroix-à-chez-Toine, Y Fyodorov, Count-
[70] M Ozawa, G Parisi, L Berthier, Configura-
ing equilibria in a random non-gradient dy-
tional entropy of polydisperse supercooled liq-
namics with heterogeneous relaxation rates, J.
uids, J. Chem. Phys. 149, 154501, (2018).
Phys. A: Math. Theor. 55, 144001, (2022).
[71] Y Nishikawa, M Ozawa, A Ikeda, P Chaud-
[58] J Kent-Dobias, J Kurchan, Complex complex
huri, L Berthier, Relaxation Dynamics in the
landscapes, Phys. Rev. Research 3, 023064,
Energy Landscape of Glass-Forming Liquids,
(2021).
Phys. Rev. X 12, 021001, (2022).
[59] E Bitzek, P Koskinen, F Gähler, et al., Struc-
[72] C Scalliet, B Guiselin, L Berthier, Excess
tural relaxation made simple, Phys. Rev.
wings and asymmetric relaxation spectra in
Lett. 97, 170201, (2006).
a facilitated trap model, J. Chem. Phys. 155,
[60] F H Stillinger, T A Weber, Hidden structure 064505, (2021).
in liquids, Phys. Rev. A 25, 978, (1982).
[73] B Guiselin, C Scalliet, L Berthier, Micro-
[61] S D Cohen, A C Hindmarsh, P F Dubois, scopic origin of excess wings in relaxation
CVODE, a stiff/nonstiff ODE solver in C, spectra of supercooled liquids, Nat. Phys. 18,
Computers in Physics 10, 138, (1996). 468, (2022).
[62] P Charbonneau, J Kurchan, G Parisi, et al., [74] A Chakraborty, P Seiler, G J Balas, Sus-
Glass and jamming transitions: From ex- ceptibility of F/A-18 flight controllers to the
act results to finite-dimensional descriptions, fallingleaf mode: Nonlinear analysis, J. Guid.
Ann. Rev. Cond. Mat. Phys. 8, 265, (2017). Control Dyn. 34, 73, (2011).
150001-21
Papers in Physics, vol. 15, art. 150001 (2023) / M Casiulis & S Martiniani
[75] M Kac, On the average number of real roots [86] Y V Fyodorov, High-dimensional random
of a random algebraic equation, Bull. Am. fields and random matrix theory, ArXiv
Math. Soc. 49, 314, (1943). Preprint, arXiv:1307.2379, (2013).
[76] K Farahmand, On the average number of real [87] Y V Fyodorov, A Lerario, E Lundberg, On
roots of a random algebraic equation, Ann. the number of connected components of ran-
Probab. 14, 702, (1986). dom algebraic hypersurfaces, J. Geom. Phys.
95, 1, (2015).
[77] E Bogomolny, O Bohigas, P Leboeuf, Distri-
bution of roots of random polynomials, Phys. [88] D Cheng, A Schwartzman, On the explicit
Rev. Lett. 68, 2726, (1992). height distribution and expected number of
local maxima of isotropic Gaussian random
[78] A Edelman, E Kostlan, How many zeros of
fields, ArXiv Preprint, arXiv:1503.01328,
a random polynomial are real?, Bull. Am.
(2015).
Math. Soc. 32, 1, (1995).
[89] F Krzakala, J Kurchan, Landscape analysis of
[79] J M Rojas, On the average number of real
constraint satisfaction problems, Phys. Rev.
roots of certain random sparse polynomial
E 76, 021122, (2007).
systems, Lectures in Applied Mathematics-
American Mathematical Society 32, 689, [90] F Krzakala, L Zdeborová, Hiding Quiet So-
(1996). lutions in Random Constraint Satisfaction
Problems, Phys. Rev. Lett. 102, 238701,
[80] Y V Fyodorov, Complexity of random en-
(2009).
ergy landscapes, glass transition, and abso-
lute value of the spectral determinant of ran- [91] L Zdeborová, M Mézard, Constraint Satis-
dom matrices, Phys. Rev. Lett. 92, 240601, faction Problems with Isolated Solutions are
(2004). Hard, J. Stat. Mech. 2008, P12004, (2008).
[81] G Malajovich, J M Rojas, High probability [92] M-H Tayarani-Narajan, A Prügel-Bennett,
analysis of the condition number of sparse On the Landscape of Combinatorial Optimi-
polynomial systems, Theor. Comput. Sci. sation Problems, IEEE Transactions on Evo-
315, 525, (2004). lutionary Computation 18, 420, (2013).
[82] J-M Azaı̈s, M Wschebor, On the roots of a [93] T C Bachlechner, K Eckerle, O Janssen, et
random system of equations. The theorem of al., Axion landscape cosmology, Journal of
Shub and Smale and some extensions, Found. Cosmology and Astroparticle Physics 2019,
Comput. Math. 5, 125, (2005). 062, (2019).
[83] D Armentano, M Wschebor, Random systems [94] A J Lotka, Contribution to the Theory of
of polynomial equations. The expected number Periodic Reaction, J. Phys. Chem. 14, 271,
of roots under smooth analysis, Bernoulli 15, (1910).
249, (2009).
[95] A J Lotka, Analytical Note on Certain Rhyth-
[84] Y V Fyodorov, G A Hiary, J Keating, mic Relations in Organic Systems, Proc.
Freezing transition, characteristic polynomi- Natl. Acad. Sci. U.S.A. 6, 410, (1920).
als of random matrices, and the Riemann
zeta function, Phys. Rev. Lett. 108, 170601, [96] V Volterra, Variazioni e fluttuazioni del nu-
(2012). mero d’individui in specie animali conviventi,
Mem. Acad. Lincei Roma 2, 31, (1926); Na-
[85] Y V Fyodorov, C Nadal, Critical behavior of ture 118, 558 (1926).
the number of minima of a random landscape
at the glass transition point and the Tracy- [97] G Bunin, Ecological communities with Lotka-
Widom distribution, Phys. Rev. Lett. 109, Volterra dynamics, Phys. Rev. E 95, 042414,
167203, (2012). (2017).
150001-22
Papers in Physics, vol. 15, art. 150001 (2023) / M Casiulis & S Martiniani
[102] G Bunin, Directionality and community-level [115] P Attard, Thermodynamics and statistical
selection, Oikos 130, 489, (2018). mechanics: equilibrium by entropy maximi-
sation, Academic Press, London (2002).
[103] M Barbier, C de Mazancourt, M Loreau, et
al., Fingerprints of High-Dimensional Coex- [116] J W Gibbs, Elementary principles in statisti-
istence in Complex Ecosystems, Phys. Rev. X cal mechanics, Charles Scribner’s Sons, New
11, 011009, (2021). York (1902).
[104] A J Ballard, R Das, S Martiniani, et al., En- [117] M Kardar, Statistical Physics of Parti-
ergy landscapes for machine learning, Phys. cles, Cambridge University Press, Cambridge
Chem. Chem. Phys. 19, 12585-12603, (2017). (2007).
[105] S Hochreiter, J Schmidhuber, Simplify- [118] S Carnot, Réflexions sur la puissance motrice
ing neural nets by discovering flat minima, du feu et sur les machines propres à
NeurIPS 7, (1994). développer cette puissance, Annales Scien-
tifiques de l’É.N.S., Bachelier (1824).
[106] S Hochreiter, J Schmidhuber, Flat minima,
Neur. Comput. 9, 1, (1997). [119] S Martiniani, On the complexity of energy
landscapes: algorithms and a direct test of
[107] C Baldassi, F Pittorino, R Zecchina, Shap- the Edwards conjecture, University of Cam-
ing the learning landscape in neural networks bridge, Cambridge (2017).
around wide flat minima, Proc. Natl. Acad.
Sci. U.S.A. 117, 161, (2020). [120] T M Cover, Elements of information theory,
John Wiley & Sons, New York (1999).
[108] Y Feng, Y Tu, The inverse variance–flatness
relation in stochastic gradient descent is criti- [121] D Frenkel, B Smit, Understanding molecular
cal for finding flat minima, Proc. Natl. Acad. simulation: from algorithms to applications,
Sci. U.S.A. 118, e2015617118, (2021). Vol. I, Elsevier (2001).
[109] S Zhang, I Reid, G Pérez, et al., Why [122] D Landau, K Binder, A Guide to Monte
Flatness Correlates With Generalization For Carlo Simulations in Statistical Physics, Sec-
Deep Neural Networks, Arxiv Preprint ond Edition, Cambridge University Press,
arXiv:2103.06219, (2021). Cambridge (2005).
150001-23
Papers in Physics, vol. 15, art. 150001 (2023) / M Casiulis & S Martiniani
[125] E Kostlan, On the expected number of real [129] E W Weisstein, Gabriel’s Horn, In:
roots of a system of random polynomial equa- MathWorld–A Wolfram Web Resource,
tions, Foundations of Computational Mathe- mathworld.wolfram.com/GabrielsHorn.html
matics, World Scientific, Singapore (2002).
[130] D Frenkel, Private communication.
[126] A J Lotka, Elements of Physical Biology,
Williams and Wilkins, Baltimore (1925).
150001-24