Professional Documents
Culture Documents
€ rs Szathma
Contributed by Eugene V. Koonin; received November 2, 2021; accepted January 3, 2022; reviewed by Steven Frank and Eo ry
We outline a phenomenological theory of evolution and origin of It is therefore no surprise that many attempts have been made
life by combining the formalism of classical thermodynamics with to apply concepts of thermodynamics to problems of biology,
a statistical description of learning. The maximum entropy princi- especially to population genetics and the theory of evolution. The
ple constrained by the requirement for minimization of the loss basic idea is straightforward: Evolving populations of organisms
function is employed to derive a canonical ensemble of organisms fall within the domain of applicability of thermodynamics inas-
(population), the corresponding partition function (macroscopic much as a population consists of a number of organisms suffi-
counterpart of fitness), and free energy (macroscopic counterpart ciently large for the predictable collective effects to dominate over
of additive fitness). We further define the biological counterparts unpredictable life histories of individual (where organisms are
of temperature (evolutionary temperature) as the measure of sto- analogous to particles and their individual histories are analogous
chasticity of the evolutionary process and of chemical potential to thermal fluctuations). Ludwig Boltzmann prophetically
(evolutionary potential) as the amount of evolutionary work espoused the connection between entropy and biological evolu-
required to add a new trainable variable (such as an additional tion: “If you ask me about my innermost conviction whether our
gene) to the evolving system. We then develop a phenomenologi- century will be called the century of iron or the century of steam
cal approach to the description of evolution, which involves or electricity, I answer without hesitation: it will be called the cen-
modeling the grand potential as a function of the evolutionary tury of the mechanical view of nature, the century of Darwin”
temperature and evolutionary potential. We demonstrate how (11). More specifically, the link between thermodynamics and
this phenomenological approach can be used to study the “ideal evolution of biological populations was clearly formulated for the
EVOLUTION
mutation” model of evolution and its generalizations. Finally, we first time by Ronald Fisher, the principal founder of theoretical
show that, within this thermodynamics framework, major transi- population genetics (12). Subsequently, extended efforts aimed at
tions in evolution, such as the transition from an ensemble of mol- establishing detailed mapping between the principal quantities
ecules to an ensemble of organisms, that is, the origin of life, can analyzed by thermodynamics, such as entropy, temperature, and
be modeled as a special case of bona fide physical phase transi- free energy, and those central to population genetics, such as
tions that are associated with the emergence of a new type of effective population size and fitness, were undertaken by Sella
grand canonical ensemble and the corresponding new level of and Hirsh (13) and elaborated by Barton and Coe (14). The par-
description.
Downloaded from https://www.pnas.org by 45.163.220.115 on May 16, 2023 from IP address 45.163.220.115.
allel is indeed clear: The smaller the population size the stronger
are the effects of random processes (genetic drift), which in phys-
entropy j laws of thermodynamics j major transitions in evolution j origin ics associates naturally with temperature increase. It should be
of life j theory of learning
noted, however, that this crucial observation was predicated on
the specific model of independent, relatively rare mutations (low
mutation limit) in a constant environment or the so-called ideal
C lassical thermodynamics is probably the best example of
the efficiency of a purely phenomenological approach for
the study of an enormously broad range of physical and chemi-
mutation model. Among other attempts to conceptualize the rela-
tionships between thermodynamics and evolution, of special note
is the work of Frank (15, 16) on applications of the maximum
cal phenomena (1, 2). According to Einstein, “It is the only
entropy principle, according to which the distribution of any quan-
physical theory of universal content, which I am convinced, that
tity in a large ensemble of entities tends to the highest entropy
within the framework of applicability of its basic concepts will
distribution subject to the relevant constraints (6). The nature of
never be overthrown” (3). Indeed, the basic laws of thermody-
namics were established at a time when the atomistic theory of
matter was only in its infancy, and even the existence of atoms Significance
has not yet been demonstrated unequivocally. Nevertheless,
these laws remained untouched by all subsequent developments We employ the conceptual apparatus of thermodynamics to
in physics, with the important qualifier “within the framework develop a phenomenological theory of evolution and of the
of applicability of its basic concepts.” This framework of appli- origin of life that incorporates both equilibrium and none-
cability is known as the “thermodynamic limit,” the limit of a quilibrium evolutionary processes within a mathematical
large number of particles when fluctuations are assumed to be framework of the theory of learning. The threefold corre-
small (4). Moreover, the concept of entropy that is central to spondence is traced between the fundamental quantities of
thermodynamics was further generalized to become the corner- thermodynamics, the theory of learning, and the theory of
stone of information theory [Shannon’s entropy (5)] and is cur- evolution. Under this theory, major transitions in evolution,
rently considered to be one of the most important concepts in including the origin of life, represent specific types of physi-
all of science, reaching far beyond physics (6). The conventional cal phase transitions.
presentation of thermodynamics starts with the analysis of ther-
Author contributions: V.V., Y.I.W., E.V.K., and M.I.K. designed research; V.V. and
mal machines. However, a more recently promoted and appar-
M.I.K. performed research; and V.V. and E.V.K. wrote the paper.
ently much deeper approach is based on the understanding of
Reviewers: S.F., University of California, Irvine; and E.S., Parmenides Foundation.
entropy as a measure of our knowledge (or, more accurately,
The authors declare no competing interest.
our ignorance) of a system (6–8). In a sense, there is no entropy
This open access article is distributed under Creative Commons Attribution License 4.0
other than information entropy, and the loss of information (CC BY).
resulting from summation over a subset of the degrees of free- 1
To whom correspondence may be addressed. Email: vitaly.vanchurin@gmail.com,
dom is the only necessary condition to derive the Gibbs distri- koonin@ncbi.nlm.nih.gov, or m.katsnelson@science.ru.nl.
bution and hence all the laws of thermodynamics (9, 10). Published February 7, 2022.
For further details, see the text and refs. 17 and 18.
We postulate that a system under consideration obeys the (Table 1), after entropy, which emerges as the inverse of the
maximum entropy principle but is also learning or evolving by Lagrange multiplier β that imposes a constraint on the average
minimizing the average loss function UðqÞ [2.2]. The corre- loss function [2.2], that is, defines the extent of stochasticity of
EVOLUTION
sponding maximum entropy distribution can be calculated using the process of evolution. Roughly, free energy F is the macro-
the method of Lagrange multipliers, that is, by solving the fol- scopic counterpart of the loss function H or additive fitness
lowing variational problem: (18), whereas, as shown below, partition function Z is the mac-
Ð Ð roscopic counterpart of Malthusian fitness,
δ S β dN y H ðy, qÞpðyjqÞ U ν dN y pðyjqÞ 1
¼ 0, φ ≡ exp ðβH ðx, qÞÞ: [2.9]
δpðxjqÞ
The relation between the loss function and fitness is discussed
[2.4]
Downloaded from https://www.pnas.org by 45.163.220.115 on May 16, 2023 from IP address 45.163.220.115.
EVOLUTION
adaptable variables depends on the coefficient b that can be In such a limit, the average loss function contains all the rele-
thought of as the measure of stochasticity caused by factors vant information on the learning system in equilibrium, which
independent of the population size. The smaller b, the more can be derived from a theoretical model, such as the one devel-
genes can be involved in adaptation. In the biological context, oped in the accompanying paper (18) using the mathematical
this implies that the entire adaptive potential of the population framework of neural networks, or a phenomenological model
is determined by mutations in a small fraction of the genome, (such as the one developed in the previous section), or recon-
which is indeed realistic. It has been shown that in prokaryotes structed from observations or numerical simulations. In this
effective population size estimated from the ratio of the rates section, we adopt a phenomenological approach to model the
Downloaded from https://www.pnas.org by 45.163.220.115 on May 16, 2023 from IP address 45.163.220.115.
of nonsynonymous vs. synonymous mutations (dN/dS), indeed, average loss function of a population of noninteracting organ-
positively correlates with the number of genes in the genome, isms (that is, selection only affects individuals), and in the fol-
and, presumably, with the number of genes that are available lowing section we construct a more general phenomenological
for adaptation (44–46). model, which will also be relevant for the analysis of MTE in
To study the state of learning equilibrium for a grand canoni- Major Transitions in Evolution and the Origin of Life.
cal ensemble of organisms, it is convenient to express the aver- Consider a population of organisms described by their geno-
age loss function as types q1 , …, qNe . There are rare mutations (on time scales ∼ τ)
from one genotype to another that are either quickly fixed or
U ðS, K Þ ¼ T ðS, K ÞS þ μðS, KÞK, [4.2]
eliminated from the population (on shorter time scales ≪ τ),
where the conjugate variables are, respectively, evolutionary but the total number of organisms Ne remains fixed. In addi-
temperature tion, we assume that the system is observed for a long period of
time ≫ τ so that it has reached a learning equilibrium (that is,
∂U
T≡ [4.3] an evolutionarily stable configuration). In this simple model, all
∂S organisms share the same qðcÞ , whereas all other variables have
and evolutionary potential already equilibrated, but their effect on the loss function
depends on the type of the variable, that is, qðaÞ vs. qðnÞ vs. x. In
∂U
μ≡ : [4.4] particular, the trainable variables of individual organisms qn ’s
∂K evolve in such a way that entropy is minimized on short time
Once again, evolutionary temperature is a measure of disorder, scales ≪ τ due to fixation of beneficial mutations but maximized
that is, stochasticity in the evolutionary process, whereas evolu- on long time scales ≫ τ due to equilibration, that is, exploration
tionary potential is the measure of adaptability. For a given of the entire nearly neutral mutational network (50, 51). Thus,
phenomenological expression of the loss function [4.2], all the same variables evolve toward maximizing free energy on
other thermodynamic potentials, such as free energy FðT, K Þ short time scales but toward minimizing free energy on longer
and grand potential ΩðT, μÞ, can be obtained by switching to time scales. This evolutionary trajectory is similar to the phe-
conjugate variables using Legendre transformations, i.e., nomenon of broken ergodicity in condensed matter systems,
S $ T, K $ μ. where the short time and ensemble (or long-time) averages can
The difference between the grand canonical ensembles in phys- differ. The prototype of nonergodic systems in physics are
ics and in evolutionary biology should be emphasized. In physics, (spin) glasses (52–54). The glass-like character of evolutionary
the grand canonical ensemble is constructed by constraining the phenomena was qualitatively examined previously (55, 56).
average number of particles (2). In contrast, for the evolutionary Nonergodicity unavoidably involves frustrations that emerge
grand canonical ensemble the constraint is imposed not on the from competing interactions (57), and such frustrations are
number of organisms Ne per se but rather on the number of adapt- thought to be a major driving force of biological evolution (55).
able variables in organisms of a given species K∝log ðNe Þ, which In terms of the model we discuss here, the most fundamental
depends on the effective population size. This key statement frustration that appears to be central to evolution is caused by
EVOLUTION
of microbial genomes. Indeed, bacteria that inhabit high- in the accompanying paper (18), the origin of life is not equal
entropy environments, such as soil, typically possess more genes to the origin of learning or selection. Instead, we associate the
than those that live in low-entropy environments, for example origin of life with a phase transition that gave rise to a distinct,
sea water (67). Furthermore, the number of genes in bacterial highly efficient form of learning or a learning algorithm known
genomes increases with the estimated effective population size as natural selection. Neither the nature of the statistical ensem-
(44–46), which also can be interpreted as taking advantage of ble of molecules that preceded this phase transition nor that of
diverse, high entropy environments. the statistical ensemble of organisms that emerged from the
Given a phenomenological expression for the average loss phase transition [referred to as the Last Universal Cellular
Downloaded from https://www.pnas.org by 45.163.220.115 on May 16, 2023 from IP address 45.163.220.115.
function [6.1], the corresponding grand potential is given by Ancestor, LUCA (69, 70)] are well understood, but at the phe-
the Legendre transformation, nomenological level we can try to determine which statistical
ensembles yield the most biologically plausible results.
μ The origin of life can be identified with a phase transition
ΩðT, μÞ ¼ ð1 nÞSðTÞ , [6.3]
b from an ideal gas of molecules that is often considered in the
where entropy should be expressed as a function of evolution- analysis of physical systems to an ideal gas of organisms that is
ary temperature and evolutionary potential, discussed in the previous section. Then, during such a transi-
tion, the grand canonical ensemble of subsystems changes from
μ ab being constrained by a fixed average number of subsystems (or
T¼ n þ log þ ðn 1Þlog S : [6.4]
b μ molecules),
By solving [6.4] for S and plugging into [6.3], we obtain the Ne ¼ Ne , [7.1]
grand potential
μ n1
n
to being constrained by a fixed average number of adaptable
bT variables associated with the subsystems (or organisms),
ΩðT, μÞ ¼ aðn 1Þ exp : [6.5]
eb ðn 1Þμ
K ¼ K: [7.2]
We shall refer to the ensemble described by [6.5] as an “ideal
gas” of organisms. Immediately before and immediately after the phase transition,
In principle, the grand potential can also be reconstructed we are dealing with the very same system, but the ensembles
phenomenologically, directly from numerical simulations or are described in terms of different ensembles of thermody-
observations of time-series of the population size Ne ðtÞ and fit- namic variables. Formally, it is possible to describe an organism
ness distribution Zðq, tÞ. Given such data, evolutionary temper- by the coordinates of all atoms of which it is comprised, but
this is not a particularly useful language (71). Atoms (and mol-
ature T can be calculated using [5.3] and the distributions
ecules) behave in a collective manner, that is, coherently, and
pT ð K Þ ¼ plog Ne of the number of adaptable variables K can be
therefore the appropriate language to describe their behavior is
estimated for a given temperature T. Then, the grand potential
the language of collective variables similar to, for example, the
is reconstructed from the cumulants κn ðT Þ of the distributions
“dual boson” approach to many-body systems (72).
pT ð K Þ:
According to [4.1], the total number of organisms (population
∞ κ ðT Þ μ n size) and the number of adaptable variables are related,
n
ΩðT, μÞ ¼ T ∑ [6.6] K ∝ log ðNe Þ, but the choice of the constraint, [7.1] vs. [7.2],
n¼1 n! T
determines the choice of the statistical ensemble, which describes
and the average loss function U ðS, K Þ is obtained by Legendre the state of the system. In particular, an ensemble of molecules
transformation from variables ðT, μÞ to ðS, K Þ. Obviously, the can be described by the grand potential Ωp ðT, M Þ, where T is
phenomenological reconstruction of the thermodynamic poten- the physical temperature, M is the chemical potential, and an
tials ΩðT, μÞ and U ðS, K Þ is feasible only if the evolving learning ensemble of biological subsystems can be described by the grand
each other. For example, consider the following relations (or stantial selection (36). The essential role of gene sharing via
dual mappings between physical and biological quantities): horizontal gene transfer at the earliest stages in the evolution
of life is thought of as a major factor underlying the universality
α of the translation system and the genetic code across all life
T0 ¼ μ0 log T0 [7.6]
b forms (73). Strictly speaking, the transition from an ensemble
and of molecules to an ensemble of organisms could correspond to
the emergence of protocells that lacked genomes but neverthe-
c
M 0 ¼ T0 log μ0 : [7.7] less displayed collective behavior and were subject to primitive
γ form of selection for persistence (18). The origin of genomes
Plugging [7.6] and [7.7] into [7.4] gives would be a later event that kicked off natural selection. How-
α ever, under the phenomenological approach adopted here, Eq.
γM 0 bT0 7.3 covers both these stages.
Ωp ðT0 , M 0 Þ ∝ Tα0 exp ¼ exp expðclog μ0 Þ
T0 αμ0 The subsequent MTE, such as the origin of eukaryotic cells
as a result of symbiosis between archaea and bacteria, or the
bT0 c
¼ exp μ0 ∝Ωb ðT0 , μ0 Þ, origin of multicellularity, or of sociality, in principle, follow the
μ0 same scheme: One has to switch between two alternative (or
[7.8] dual) descriptions of the same system, that is, the grand poten-
which is in agreement with [7.3]. The relations [7.6] and [7.7] tials in the dual descriptions should be equal at the MTE point,
were used here to illustrate the conditions under which the similar to Eq. 7.3. Here we only illustrated how the phase tran-
phase transition might occur, but it is also interesting to exam- sition associated with the origin of life could be modeled
ine whether these relations actually make sense qualitatively. phenomenologically and argue that essentially the same phe-
Eq. 7.6 implies that energy/loss associated with learning dynam- nomenological approach would generally apply to the other
ics, T0 , is logarithmically smaller compared to the energy/loss MTEs.
associated with stochastic dynamics, T0 , but depends linearly
on the energy/loss required to add a new adaptable variable to Discussion
the learning system, that is, the evolutionary potential μ0 . This Since its emergence in the Big Bang about 13.8 billion y ago,
dependency makes sense because the learning dynamics is far our universe has been evolving in the overall direction of
more stringently constrained than the stochastic dynamics and increasing entropy, according to the second law of thermody-
its efficiency critically depends on the ability to engage new namics. Locally, however, numerous structures emerge that are
adaptable degrees of freedom. Eq. 7.7 implies that the energy/ characterized by readily discernible (even if not necessarily eas-
loss, M 0 , that is required to incorporate an additional nontrain- ily described formally) order and complexity. The dynamics of
able variable into the evolving system is logarithmically smaller such structures was addressed by nonequilibrium thermody-
μ0 but depends linearly on the energy/loss, T0 , associated with namics (74) but traditionally has not been described as a pro-
stochastic dynamics. This also makes sense because it is much cess involving learning or selection, although some attempts in
EVOLUTION
been previously described as a first-order phase transition,
construction is the interplay between the entropy increase in
albeit within the framework of a specific model of replicator
the environment dictated by the second law of thermodynamics
evolution (78). Furthermore, the transition associated with the
and the entropy decrease in evolving systems (such as organ-
origin of life corresponds to the transition from infrabiological
isms or populations) dictated by the second law of learning
entities to biological ones, the first organisms, as formulated by
(17). Thus, the evolving biological systems are open from the
viewpoint of classical thermodynamics but are closed and reach Szathmary (79) following Ganti’s chemoton concept. Accord-
equilibrium within the extended approach that includes ther- ing to Ganti, life is characterized by the union of three
essential features: membrane compartmentalization, autocata-
Downloaded from https://www.pnas.org by 45.163.220.115 on May 16, 2023 from IP address 45.163.220.115.
modynamics of learning.
Under the statistical description of evolution, Malthusian lytic metabolic network, and informational replicator (80, 81).
fitness is naturally defined as the negative exponent of the The pretransition, infrabiological (protocellular) systems only
average loss function, establishing the direct connection encompass the first two features, and the emergence of the
between the processes of evolution and learning. Further, informational replicators precipitates the transition, at which
evolutionary temperature is defined as the inverse of the point all quantities describing the system have a dual meaning
Lagrange multiplier that constrains the average loss func- according to Eq. 7.3 [see also the accompanying paper (18)].
tion. This interpretation of evolutionary temperature is The phenomenological theory of evolution outlined here is
related to that given by Sella and Hirsh (13), where evolu- highly abstract and requires extensive further elaboration, spec-
tionary temperature was represented by the inverse of the ification, and, most importantly, validation with empirical data;
effective population size, but is more general, reflecting the we indicate several specific predictions for which such valida-
degree of stochasticity in the evolutionary process, which tion appears to be straightforward. Nevertheless, even in this
depends not only on the effective population size, but also general form the theory achieves the crucial goal of merging
on other factors, in particular interaction of organisms with learning and thermodynamics into a single, coherent frame-
the environment. It should be emphasized that here we work for modeling biological evolution. Therefore, we hope
adhere to a phenomenological thermodynamics approach, that this work will stimulate the development of new directions
under which the details of replicator dynamics are irrele- in the study of the origin of life and other MTE. By incorporat-
vant, in contrast, for example, to the approach of Sella and ing biological evolution into the framework of learning pro-
Hirsh (13). cesses, this theory implies that the emergence of complexity
Within our theoretical framework, adaptive evolution commensurate with life is an inherent feature of learning that
involves primarily organisms learning to predict their envi- occurs throughout the history of our universe. Thus, although
ronment, and accordingly, the entropy of the environment the origin of life is likely to be rare due to the multiple con-
with respect to the organism is one of the key determinants straints on the learning/evolutionary processes leading to such
of evolution. For illustration, we consider a specific phenom- an event (including the requirement for the essential chemicals,
enological model, in which the rate of adaptive evolution concentration mechanisms, and more), it might not be an
reflected in the value of the loss function depends exponen- extremely improbable, lucky accident but rather a manifestation
tially on the number of adaptable variables and also shows a of a general evolutionary trend in a universe modeled as a
power law dependence on the entropy of the environment. learning system (18, 68).
The number of adaptable variables, or in biological terms the
number of genes or sites that are available for positive selec- Data Availability. There are no data underlying this work.
tion in a given evolving population at a given time, is itself
proportional to the entropy of the environment and to the ACKNOWLEDGMENTS. V.V. is grateful to Dr. Karl Friston and E.V.K. is grate-
ful to Dr. Purificacion Lopez-Garcia for helpful discussions. V.V. was supported
log of the effective population size. Thus, high-entropy in part by the Foundational Questions Institute and the Oak Ridge Institute
environments promote adaptation, and then success breeds for Science and Education. Y.I.W. and E.V.K. are supported by the Intramural
success, that is, adaptation is most effective in large populations. Research Program of the NIH.
21. C. Jeler, Explanatory goals and explanatory means in multilevel selection theory. 58. M. Kimura, The Neutral Theory of Molecular Evolution (Cambridge University Press,
Hist. Philos. Life Sci. 42, 36 (2020). Cambridge, UK, 1983).
22. D. Cze gel, I. Zachar, E. Szathma ry, Multilevel selection as Bayesian inference, major 59. E. Fermi, Thermodynamics (Dover, New York, 1956).
transitions in individuality as structure learning. R. Soc. Open Sci. 6, 190202 (2019). 60. M. Lynch, The frailty of adaptive hypotheses for the origins of organismal complex-
23. S. Okashi, Evolution and the Levels of Selection (Oxford Univrsity Press, Oxford, 2006). ity. Proc. Natl. Acad. Sci. U.S.A. 104 (suppl. 1), 8597–8604 (2007).
24. T. D. Brunet, W. F. Doolittle, Multilevel selection theory and the evolutionary func- 61. S. V. Vonsovsky, Magnetism (Wiley, New York, 1974).
tions of transposable elements. Genome Biol. Evol. 7, 2445–2457 (2015). 62. M. I. Katsnelson, Y. I. Wolf, E. V. Koonin, On the feasibility of saltational evolution.
25. S. A. Frank, The Price equation program: Simple invariances unify population dynam- Proc. Natl. Acad. Sci. U.S.A. 116, 21068–21075 (2019).
ics, thermodynamics, probability, information and inference. Entropy (Basel) 20, 63. R. S. Galhardo, P. J. Hastings, S. M. Rosenberg, Mutation as a stress response and the
E978 (2018). regulation of evolvability. Crit. Rev. Biochem. Mol. Biol. 42, 399–435 (2007).
26. S. A. Frank, F. J. Bruggeman, The fundamental equations of change in statistical 64. S. M. Rosenberg, Evolving responsively: Adaptive mutation. Nat. Rev. Genet. 2,
ensembles and biological populations. Entropy (Basel) 22, E1395 (2020). 504–515 (2001).
27. M. I. Katsnelson, V. Vanchurin, Quantumness in neural networks. Found. Phys. 51, 94 65. P. Tarazona, Error thresholds for molecular quasispecies as phase transitions: From
(2021). simple landscapes to spin-glass models. Phys. Rev. A 45, 6038–6050 (1992).
28. M. I. Katsnelson, V. Vanchurin, T. Westerhout, Self-organized criticality in neural net- 66. P. J. Gerrish, A. Colato, P. D. Sniegowski, Genomic mutation rates that neutralize
works. arXiv [Preprint] (2021). https://arxiv.org/abs/2107.03402 (Accessed 20 January adaptive evolution and natural selection. J. R. Soc. Interface 10, 20130329 (2013).
2021). 67. L. M. Bobay, H. Ochman, The evolution of bacterial genome architecture. Front.
29. E. Schroedinger, What Is Life? The Physical Aspect of the Living Cell (Trinity College Genet. 8, 72 (2017).
Press, Dublin, 1944). 68. V. Vanchurin, The world as a neural network. Entropy (Basel) 22, E1210 (2020).
30. A. Wagner, Robustness, evolvability, and neutrality. FEBS Lett. 579, 1772–1778 69. E. V. Koonin, Comparative genomics, minimal gene-sets and the last universal com-
(2005). mon ancestor. Nat. Rev. Microbiol. 1, 127–136 (2003).
31. A. Wagner, Neutralism and selectionism: A network-based reconciliation. Nat. Rev. 70. N. Glansdorff, Y. Xu, B. Labedan, The last universal common ancestor: Emergence,
Genet. 9, 965–974 (2008). constitution and genetic legacy of an elusive forerunner. Biol. Direct 3, 29 (2008).
32. E. V. Koonin, The meaning of biological information. Philos. Trans.- Royal Soc., Math. 71. R. B. Laughlin, D. Pines, The theory of everything. Proc. Natl. Acad. Sci. U.S.A. 97,
Phys. Eng. Sci. 374, 20150065 (2016). 28–31 (2000).
33. A. F. Palazzo, E. V. Koonin, Functional long non-coding RNAs evolve from junk tran- 72. A. N. Rubtsov, M. I. Katsnelson, A. I. Lichtenstein, Dual boson approach to collective
scripts. Cell 183, 1151–1161 (2020). excitations in correlated fermionic systems. Ann. Phys. 327, 1320–1335 (2012).
34. M. Lynch, Streamlining and simplification of microbial genome architecture. Annu. 73. K. Vetsigian, C. Woese, N. Goldenfeld, Collective evolution and the genetic code.
Rev. Microbiol. 60, 327–349 (2006). Proc. Natl. Acad. Sci. U.S.A. 103, 10696–10701 (2006).
35. M. Lynch, The Origins of Genome Architecture (Sinauer Associates, Sunderland, MA, 2007). 74. I. R. Prigogine, I. Stengers, Order Out of Chaos (Bantam, London, 1984).
36. Y. I. Wolf, K. S. Makarova, A. E. Lobkovsky, E. V. Koonin, Two fundamentally different 75. L. Smolin, Did the universe evolve? Class. Quantum Gravity 9, 173–191 (1992).
classes of microbial genes. Nat. Microbiol. 2, 16208 (2016). 76. L. Smolin, The Life of the Cosmos (Oxford University Press, Oxford, 1999).
37. P. Puigbo , A. E. Lobkovsky, D. M. Kristensen, Y. I. Wolf, E. V. Koonin, Genomes in tur- 77. W. J. Ewens, Mathematical Population Genetics (Springer Verlag, Berlin, 1979).
moil: Quantification of genome dynamics in prokaryote supergenomes. BMC Biol. 78. C. Mathis, T. Bhattacharya, S. I. Walker, The emergence of life as a first-order phase
12, 66 (2014). transition. Astrobiology 17, 266–276 (2017).
38. H. Tettelin, D. Riley, C. Cattuto, D. Medini, Comparative genomics: The bacterial pan- 79. E. Szathma ry, Life: In search of the simplest cell. Nature 433, 469–470 (2005).
genome. Curr. Opin. Microbiol. 11, 472–477 (2008). 80. T. Ga nti, The Principles of Life (Oxford University Press, Oxford, 2003).
39. I. Sela, Y. I. Wolf, E. V. Koonin, Selection and genome plasticity as the key factors in 81. T. Ga nti, Chemoton Theory: Theory of Living Systems (Kluwer Academic, New York,
the evolution of bacteria. Phys. Rev. X 9, 031018 (2019). 2004).