Professional Documents
Culture Documents
net/publication/49751085
CITATIONS READS
6 55
4 authors, including:
Yannis Almirantis
National Center for Scientific Research Demokritos
55 PUBLICATIONS 614 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
BioASQ: large-scale biomedical semantic indexing and question answering competition View project
All content following this page was uploaded by K. Karamanos on 04 April 2014.
Scaling properties and fractality in the distribution of coding segments in eukaryotic genomes
revealed through a block entropy approach
Labrini Athanasopoulou, Stavros Athanasopoulos, Kostas Karamanos, and Yannis Almirantis*
Institute of Biology, NRCPS “Demokritos,” 15310 Athens, Greece
共Received 28 July 2010; revised manuscript received 19 September 2010; published 11 November 2010兲
Statistical methods, including block entropy based approaches, have already been used in the study of
long-range features of genomic sequences seen as symbol series, either considering the full alphabet of the four
nucleotides or the binary purine or pyrimidine character set. Here we explore the alternation of short protein-
coding segments with long noncoding spacers in entire chromosomes, focusing on the scaling properties of
block entropy. In previous studies, it has been shown that the sizes of noncoding spacers follow power-law-like
distributions in most chromosomes of eukaryotic organisms from distant taxa. We have developed a simple
evolutionary model based on well-known molecular events 共segmental duplications followed by elimination of
most of the duplicated genes兲 which reproduces the observed linearity in log-log plots. The scaling properties
of block entropy H共n兲 have been studied in several works. Their findings suggest that linearity in semiloga-
rithmic scale characterizes symbol sequences which exhibit fractal properties and long-range order, while this
linearity has been shown in the case of the logistic map at the Feigenbaum accumulation point. The present
work starts with the observation that the block entropy of the Cantor-like binary symbol series scales in a
similar way. Then, we perform the same analysis for the full set of human chromosomes and for several
chromosomes of other eukaryotes. A similar but less extended linearity in semilogarithmic scale, indicating
fractality, is observed, while randomly formed surrogate sequences clearly lack this type of scaling. Genomic
sequences always present entropy values much lower than their random surrogates. Symbol sequences pro-
duced by the aforementioned evolutionary model follow the scaling found in genomic sequences, thus cor-
roborating the conjecture that “segmental duplication-gene elimination” dynamics may have contributed to the
observed long rangeness in the coding or noncoding alternation in genomes.
051917-2
SCALING PROPERTIES AND FRACTALITY IN THE… PHYSICAL REVIEW E 82, 051917 共2010兲
FIG. 4. 共Color online兲 Examples of human chromosomes’ block entropy H共n兲 plots. In all cases s.f. is taken equal to 30. Random
surrogates are also included. In 共a兲–共d兲 chromosomes 2, 12, 20, and X are shown, respectively. The complete set of human chromosome plots
is given in the auxiliary material 关27兴, see also Table I.
in the case of the Feigenbaum attractor for the logistic map generate additional “noisy” sequences after making a number
and for n = 2k 共k = 2 , 3 , 4 , . . .兲, Grassberger 关12兴 共see also of random insertions and deletions of symbols. The reason
关15,19兴兲 showed that for reading the sequence by gliding, the for the construction of such sequences is that random inser-
following scaling holds: tions and deletions 共termed collectively: indel兲 are wide-
spread phenomena in molecular dynamics with important
H共n兲 = log2共3n/2兲. 共3兲 consequences for the genomic architecture 共see, e.g., 关22兴兲.
In this system, it is admitted that linearity in semilogarithmic The entropy scaling for those sequences is shown in Figs. 1
plot holds 共see 关20兴 and references given therein兲, which in and 2. Notice that in a different context, entropic analysis in
terms of Eq. 共2兲 corresponds to g ⫽ 0, h = 0, 0 = 0, and 1 symbol sequences after the addition of several forms of noise
⬎ 0 共see 关21兴兲. This type of scaling is conjectured to hold for has been done initially by Freund et al. 关23,24兴. In the figures
a large class of symbol sequences with fractal properties. presented in the sequel, surrogate random sequences are also
Thus, the H共n兲 − log n linearity is related to the scale-free included, which are generated in the following way: for each
structure of such sequences entailing the existence of long- sequence its surrogate is of equal length and equal number of
range correlations. symbols 共0 and 1兲 that are positioned randomly.
In order to test the suitability of this approach in the study In Figs. 1共a兲 and 1共b兲 the block entropy H共n兲 is plotted
of genomic sequences 共estimated to present fractality in the against the block 共word兲 length n for a deterministic and a
alternation of their coding and/or noncoding segments 关6兴兲, probabilistic Cantor-like symbol sequence, respectively,
in Sec. II B, we proceed with a preliminary study of se- alongside with their noisy counterparts. In both cases, curves
quences which are “by construction” fractal. Here, we con- for surrogate random sequences are included. First, we verify
sider a deterministic and a probabilistic Cantor-like symbol the nonmonotonicity for the graphs of the unaltered se-
sequence. quences as expected on the basis of analogous findings 关17兴.
Most importantly for the analysis of genomic sequences
which follows, we observe that a weak noise suffices for the
B. Case of Cantor-like symbol sequences
disappearance of this nonmonotonic pattern and the transfor-
For the construction of a ternary deterministic Cantor se- mation of the curve into its upper envelop. This could be
quence of length 3R our starting symbol is 1. We then apply expected intuitively because the nonmonotonicity is due to a
for R consecutive times the substitutions of every 1 by 101 “stroboscopic” combination of the scale-free structure of the
and of every 0 by 000. In the probabilistic case we substitute sequence with increasing values of n 共see explanation in
again every 0 by 000, while 1 is substituted by 110, 101, and 关15–17兴兲. This may tolerate probabilistic features as shown
011 with probabilities p1, p2, and p3, respectively. Then for in our Fig. 1共b兲, but it is killed out due to multiple frame
both cases 共considered as “perfect” Cantor sequences兲 we shifts caused by random insertions and deletions. We also
051917-3
ATHANASOPOULOU et al. PHYSICAL REVIEW E 82, 051917 共2010兲
FIG. 5. 共Color online兲 Examples of chromosomes’ block entropy H共n兲 plots from several organisms. In all cases random surrogates are
included. In 共a兲–共f兲 R. norvegicus, chromosome 2, s.f. = 100; G. gallus, chromosome 4, s.f. = 30; A. mellifera, chromosome LG6, s.f. = 30; T.
castaneum, chromosome LG2, s.f. = 5; O. sativa, chromosome 2, s.f. = 10; and S. cerevisiae, chromosome I, s.f. = 1 are shown, respectively.
The plots of all examined nonhuman chromosome are given in the auxiliary material 关27兴, see also Table II.
remark the linearity of the H共n兲-n curve for the random sur- taken equal to n0 = log2 N if N is the length of a binary se-
rogate sequence. This form of scaling has been shown for the quence in order to have assured the presence of all possible
Bernoulli systems 共see, e.g., 关13,25兴兲. words of length n in the sequence. This is done on the basis
In Fig. 2 the noisy probabilistic Cantor-like symbol se- of two assumptions: 共i兲 the equiprobability of all possible
quence 关shown in Fig. 1共b兲兴 is depicted in semilogarithmic words in random sequences and 共ii兲 that the sequence has to
scale, alongside with its surrogate random sequence. The include each possible word at least once 关26兴. However, the
corresponding figure for the deterministic Cantor-like con- approximative estimation that a sequence of length 2n in-
struction is omitted, as it is qualitatively similar and almost cludes each n word once is compromised, especially when
identical with the presented plot. We verify here that Cantor- nonequiprobability of symbols and fractality hold. There, the
like sequences exhibit the expected linearity in semilogarith- number of “frequent” words 共words contributing the most in
mic scale in H共n兲-n graphs, which is shown to occur in other the value of entropy; see 关14兴兲 depends strongly on the un-
cases of fractal-like symbol sequences, as discussed in Sec. derlying scale-free pattern. Consequently, in the present
II A. study, we have chosen to include high values of n. We ob-
Here, we have to emphasize the inclusion of high values serve a long extent of linearity in the semilogarithmic plot
of word length n in the graphs presented in Fig. 2 and in the for the fractal-like symbol sequences. This applies both in
figures including genomic data presented in Sec. III. Thus, the Cantor-like constructions, like that of Fig. 2 and in the
the range of n is extended much further than the limit usually genomic sequences presented in Sec. III. Notice the distor-
set in order to guarantee a good statistics in a random symbol tion 共leading to the appearance of a maximum兲 of the ran-
sequence and avoid finite-size effects. This limit is typically dom surrogate symbol sequence, which, however, occurs at
051917-4
SCALING PROPERTIES AND FRACTALITY IN THE… PHYSICAL REVIEW E 82, 051917 共2010兲
TABLE I. Quantitative information for all human chromosomes. The extent of the linear region in
semilogarithmic scale and the slope are included. The square of the correlation coefficient resulting from a
logarithmic regression analysis is in all cases 共in this table and in Table II兲 higher than 0.98. s.f. equals 30 in
all chromosomes. Chromosome lengths are given in Mbp 共millions of nucleotides兲.
values much higher than the aforementioned limit n = n0. Now, notice that the smallest coding segments 共coding
This limit does not apply here due to the strong inequality of exons兲 are of a length of a few tens of nucleotides. In order
zero and one populations, although surrogate data lack by to avoid the study of short words formed in the region of a
construction any internal order. The inclusion of lengthy few nucleotides not contributing meaningfully to the entropy
words in our analysis 共n Ⰷ n0兲 is important for the study of scaling 共because alternation of coding and noncoding is not
genomic sequences where linearity appears in an intermedi- expected there兲, we introduce a “shrinkage factor” 共s.f.兲 al-
ate range of word lengths, as we discuss further in Sec. III. lowing compression of the genomic sequence. Thus, we pro-
ceed in the following way: for s.f. equal to, e.g., 30 symbols,
we start from the beginning of the chromosome and we sub-
III. SCALING PROPERTIES OF BLOCK ENTROPY IN
stitute every 30 zeros 共0兲 by one zero and every 30 units 共1兲
GENOMIC SEQUENCES
by one unit. When we meet a 30-letter string of mixed com-
For the study of the coding or noncoding segments’ alter- position we substitute it by a single 1. Notice that the coding
nation in entire eukaryotic chromosomes we proceed in the segments consist of a population of small dispersed parts, in
following way. We downloaded data of genomic annotation comparison to noncoding intervening sequences in the eu-
for several genomes from the National Center for Biotech- karyotic genome 共coding space spans only ⬃1.5% of the
nology Information ftp site 共for further details see in the human genome兲. Thus, in our approach ones correspond to
auxiliary material 关27兴兲. Using these data, we pass from the the almost “zero-measure” component of a Cantor-like con-
sequence written in the four letter alphabet A, G, C, and T to struction. In this way, we perform a “coarse graining,”
the two numbers 共0, 1兲 where “1” stands for every coding mainly retaining the alternation of the coding and noncoding
nucleotide and “0” for every noncoding one. The down- segments. We have tested a variety of values of s.f., ranging
loaded data include coordinates of every coding segment in a from 10 to 10 000 共see example curves in the auxiliary ma-
chromosome, thus allowing the described construction. terial 关27兴兲. We chose to present our results of H. sapiens for
051917-5
ATHANASOPOULOU et al. PHYSICAL REVIEW E 82, 051917 共2010兲
TABLE II. Quantitative information for all nonhuman chromosomes treated herein. Column content is
similar to that of Table I, while in the extra column are given the used shrinkage factors.
s.f. = 30, length which roughly corresponds to the shortest Cantor structures depicted in previous figures. Due to this
coding exons in the human genome. property, the nonmonotonicity discussed in Sec. II B is not
In Fig. 3 we plotted H共n兲 versus n for the human chro- found in genomic sequences even if part of the underlying
mosome 21 alongside with a “noisy” Cantor-like construc- genome dynamics is perhaps characterized by some sort of
tion, the length, and percentages of ones of which almost multiplicative processes.
coincide with those of the chromosomal sequence. We chose Inspection of Fig. 3 shows that the genomic sequence
to compare entropic scaling of genome sequences with the presents linearity in a semilogarithmic plot, although shorter
one of noisy Cantor-like sequences. This is done because than that observed for the Cantor-like sequence, as it deviates
insertions and deletions occur regularly in the noncoding re- from linearity for both limits of short and very long words.
gions, while coding space is highly conserved due to its role, Linearity starts at n ⬵ 15 and ends at n ⬵ 900, which corre-
which is crucial for the organism’s survival. Insertions and sponds 共given the s.f. value, which here equals to 100兲 to
deletions considered here comprise several molecular 共ge- noncoding spacers of lengths ⬃1500 and ⬃90 000 nucle-
nomic兲 events, such as formation of clusters of similar nucle- otides, respectively. This result has its counterpart in the size
otides 共mostly in the noncoding兲 关28,29兴, usually due to slip- distribution of the distances between coding segments in eu-
page errors during replication 关30,31兴, as well as insertions karyotic genomes, which are found to be power-law-like 关8兴.
共and less often deletions兲 of the so-called repeated elements Notice that these distances correspond to runs of zeros in our
关32,33兴 which, in many genomes, consists of a large part of symbol sequences, which in the case of the Cantor-like se-
the genome 共⬃45% in human genome兲. These genomic quences follow strictly a power-law statistics. The curve of
modifications happen in the slow evolutionary time 共slow if the surrogate random sequence lacks a considerable linear
compared to the “fast” time of individual organism’s life part, while the finite-size effects start at relatively low n val-
span兲. Thus, a continuous short-distance 共irregular兲 shift of ues, as already mentioned.
the coordinates of coding segments underlies genome struc- In Figs. 4 and 5 examples of the block entropy scaling for
ture generating a “noise” analogous to the one added in the chromosomes of H. sapiens and of a collection of several
051917-6
SCALING PROPERTIES AND FRACTALITY IN THE… PHYSICAL REVIEW E 82, 051917 共2010兲
051917-7
ATHANASOPOULOU et al. PHYSICAL REVIEW E 82, 051917 共2010兲
important component for the generation of long-range corre- ing powerful experimental techniques combined with com-
lations and fractal features in the eukaryotic genome at the putational treatment 共the Hi-C method; see 关41兴兲 shed light to
coding or noncoding level. the spatial arrangement of the genome in the very confined
condition of the eukaryotic nucleus and show that the ge-
nome very probably adopts the so-called form of the “fractal
V. DISCUSSION globule.” Such a structure for the genome inside the nucleus
In the present paper we study how the block 共Shannon兲 has been predicted theoretically 关42,43兴 and offers important
entropy scales with the word length in genomic sequences benefits to cellular functioning. More specifically, it facili-
when focusing on the coding or noncoding structure of eu- tates the quick and repetitive transcriptional switching on
karyotic chromosomes. Initially, we studied binary symbol and off of specific genes, possibly in a coordinated way
sequences constructed following a Cantor pattern. It is con- when genes cooperate in the same cellular task even if they
cluded that the Cantor structure entails nonmonotonic scaling are separated by large distances intrachromosomally or they
of the block entropy when the sequence is read by lumping. belong to different chromosomes. In addition, the structure
However, this nonmonotonicity does not persist when a weak of the fractal globule allows the repetitive winding and un-
noise, having the form of symbols’ insertions and deletions, winding of the genomic thread during the consecutive cell
is added. Furthermore, linearity in semilogarithmic scale is cycles, each cycle mediated by complete replication of the
observed in H共n兲-n plots, extended in the region of high genomic material. A knot-free structure 关44兴 would enor-
values of n. For reasons of comparison, random surrogate mously facilitate this function. Notice that quantitative re-
symbol sequences have been included in each case. No lin- sults from the application of the Hi-C and three-dimensional
earity in semilogarithmic scale is observed in the random fluorescence in situ hybridization methods verify the pre-
surrogates and their block entropy values are always consid- dicted scaling features of the fractal globule 关41,45兴.
erably higher than the values of their genomic counterparts. A large amount of results about fractality and long-range
Instead, block entropy scales linearly in the original plot in order in the genome are actually available. Only indicatively
accordance with the cited literature until finite-size effects we could refer to 共i兲 the clustering of similar nucleotides
appear. Then, we examined genomic sequences of eukaryotic generating long-range correlations in the nucleotide constitu-
origin, transformed to binary symbol sequences, with one tion of large noncoding eukaryotic sequences 关1–3,46,47兴;
and zero denoting the protein-coding or noncoding charac- 共ii兲 the findings presented herein and in related works about
ters of each nucleotide, respectively. Resemblance has been the structure generating by coding or noncoding alternation
found between the scaling properties of genomic sequences 关5–8兴; 共iii兲 the pattern observed in the distribution of repeats
and of sequences which are fractal by construction. Our find- in eukaryotic genomes which is shown to be the product of
ings indicate that the block entropy of symbol sequences is a genomic dynamics in evolutionary time 关48兴; 共iv兲 the power-
suitable tool for the analysis of fractality or self-similarity law size distribution of the isochores 共regions of relatively
features, even in cases of sequences highly “imperfect” and homogeneous G + C content兲 共see 关49兴 and supplementary
noisy, as is the genomic DNA. Using a model based on well- Fig. 2 of Ref. 关50兴兲; and 共v兲 long-range correlations and frac-
known events of genomic dynamics 关8兴, we have reproduced tality due to the nucleosomal structure, strand-asymmetry,
the qualitative features of the entropic scaling of genomic replication origins’ distribution, and localization of other ge-
sequences. The proposed evolutionary scenario implies that nomic functional or structural sites 共see 关51–54兴 and other
fractality and long rangeness, at least at the level of coding or works of the same group兲.
noncoding structure, can emerge as results of genomic dy- These geometrical genomic features, often found at the
namics. level of the sequence primary structure, have been probably
The deviation of the genomic H共n兲-n curve from linearity used by natural selection for the formation and maintenance
in semilogarithmic plot at the limit of low word length 共see of the overall fractal structure of the nuclear content 共the
Figs. 3–5兲 may be understood given that short introns or fractal globule兲 which itself bears concrete functional roles.
intergenic regions are denser in regulatory sequences than The recruitment 共exaptation兲 of structures and features,
other regions and therefore have to be under evolutionary which initially emerged accidentally, into functions and roles
constraints. This means that the organism’s viability would crucial for the survival and development of organisms, oc-
be affected if these short spacers are subject to the molecular curs frequently with important repercussions for biological
dynamics which eventually drives the rest of the genome to evolution.
the quasifractal structure generating the linear entropic scal- ACKNOWLEDGMENTS
ing adopted by the lengthier spacers. At the limit of the very
lengthy words we observe again deviation from linearity, The authors would like to thank Dr. A. Provata and D.
which can be related ultimately to finite-size effects. Sellis for helpful discussions, Professor K. Eftaxias and Dr.
The question of existence of eventual benefits for the or- C. Nikolaou for critical reading of the paper, and the anony-
ganism from a fractal-like genomic organization as the one mous referees for suggestions which have considerably im-
described herein remains open. Recent findings obtained us- proved the presented work.
051917-8
SCALING PROPERTIES AND FRACTALITY IN THE… PHYSICAL REVIEW E 82, 051917 共2010兲
关1兴 W. Li and K. Kaneko, Europhys. Lett. 17, 655 共1992兲. 关31兴 H. Gragg, B. D. Harfe, and S. Jinks-Robertson, Mol. Cell.
关2兴 C.-K. Peng, S. V. Buldyrev, A. L. Goldberger, S. Havlin, F. Biol. 22, 8756 共2002兲.
Sciortino, M. Simons, and H. E. Stanley, Nature 共London兲 关32兴 J. Jurka, O. Kohany, A. Pavlicek, V. V. Kapitonov, and M. V.
356, 168 共1992兲. Jurka, Proc. Natl. Acad. Sci. U.S.A. 101, 1268 共2004兲.
关3兴 R. F. Voss, Phys. Rev. Lett. 68, 3805 共1992兲. 关33兴 J. Jurka, V. V. Kapitonov, O. Kohany, and M. V. Jurka, Annu.
关4兴 A. Arneodo, E. Bacry, P. V. Graves, and J. F. Muzy, Phys. Rev. Rev. Genomics Hum. Genet. 8, 241 共2007兲.
Lett. 74, 3293 共1995兲.
关34兴 W. Li, Phys. Rev. A 43, 5240 共1991兲.
关5兴 Y. Almirantis and A. Provata, J. Stat. Phys. 97, 233 共1999兲.
关35兴 S. V. Buldyrev, A. L. Goldberger, S. Havlin, C.-K. Peng, H. E.
关6兴 A. Provata and Y. Almirantis, Fractals 8, 15 共2000兲.
Stanley, M. H. R. Stanley, and M. Simons, Biophys. J. 65,
关7兴 A. Provata and T. Oikonomou, Phys. Rev. E 75, 056102
共2007兲. 2673 共1993兲.
关8兴 D. Sellis and Y. Almirantis, Gene 447, 18 共2009兲. 关36兴 J. des Cloizeaux, J. Phys. 共France兲 41, 223 共1980兲.
关9兴 A. O. Schmitt and H. Herzel, J. Theor. Biol. 188, 369 共1997兲. 关37兴 H. Takayasu, M. Takayasu, A. Provata, and G. Huber, J. Stat.
关10兴 M. A. Jiménez-Montaño, W. Ebeling, T. Pohl, and P. E. Rapp, Phys. 65, 725 共1991兲.
BioSystems 64, 23 共2002兲. 关38兴 J. A. Bailey et al., Science 297, 1003 共2002兲.
关11兴 J. Kim, S. Kim, K. Lee, and Y. Kwon, Chaos, Solitons Fractals 关39兴 M. Lynch and J. S. Conery, Science 290, 1151 共2000兲.
39, 1565 共2009兲. 关40兴 A. McLysaght, K. Hokamp, and K. H. Wolfe, Nat. Genet. 31,
关12兴 P. Grassberger, Int. J. Theor. Phys. 25, 907 共1986兲. 200 共2002兲.
关13兴 W. Ebeling and G. Nicolis, Europhys. Lett. 14, 191 共1991兲. 关41兴 E. Lieberman-Aiden, N. L. van Berkum, L. Williams et al.,
关14兴 W. Ebeling and G. Nicolis, Chaos, Solitons Fractals 2, 635 Science 326, 289 共2009兲.
共1992兲. 关42兴 A. Yu. Grosberg, S. K. Nechaev, and E. I. Shakhnovich, J.
关15兴 K. Karamanos and G. Nicolis, Chaos, Solitons Fractals 10, Phys. France 49, 2095 共1988兲.
1135 共1999兲. 关43兴 A. Grosberg, Y. Rabin, S. Havlin, and A. Neer, Europhys. Lett.
关16兴 K. Karamanos, J. Phys. A 34, 9231 共2001兲. 23, 373 共1993兲.
关17兴 K. Karamanos, Kybernetes 38, 1025 共2009兲. 关44兴 O. A. Vasilyev and S. K. Nechaev, Theor. Math. Phys. 134,
关18兴 W. Ebeling, J. Freund, and K. Rateitschak, Int. J. Bifurcation 142 共2003兲.
Chaos Appl. Sci. Eng. 6, 611 共1996兲. 关45兴 J. Mateos-Langerak, M. Bohn, W. deLeeuw et al., Proc. Natl.
关19兴 W. Ebeling, in ICCS 2002, LNCS 2331, edited by P. M. A. Acad. Sci. U.S.A. 106, 3812 共2009兲.
Sloot et al. 共Springer-Verlag, Berlin, 2002兲, p. 1209. 关46兴 S. V. Buldyrev, A. L. Goldberger, S. Havlin, C.-K. Peng, M.
关20兴 W. Ebelings and K. Rateitschak, Discrete Dyn. Nat. Soc. 2, Simons, and H. E. Stanley, Phys. Rev. E 47, 4514 共1993兲.
187 共1998兲. 关47兴 R. N. Mantegna, S. V. Buldyrev, A. L. Goldberger, S. Havlin,
关21兴 G. Nicolis and P. Gaspard, Chaos, Solitons Fractals 4, 41 C.-K. Peng, M. Simons, and H. E. Stanley, Phys. Rev. E 52,
共1994兲. 2939 共1995兲.
关22兴 D. A. Petrov, Theor Popul. Biol. 61, 531 共2002兲. 关48兴 D. Sellis, A. Provata, and Y. Almirantis, Mol. Biol. Evol. 24,
关23兴 J. A. Freund and K. Rateitschak, Int. J. Bifurcation Chaos 2385 共2007兲.
Appl. Sci. Eng. 8, 933 共1998兲. 关49兴 N. Cohen, T. Dagan, L. Stone, and D. Graur, Mol. Biol. Evol.
关24兴 J. A. Freund, W. Ebeling, and K. Rateitschak, Phys. Rev. E 54, 22, 1260 共2005兲.
5561 共1996兲. 关50兴 M. Costantini, O. Clay, F. Auletta, and G. Bernardi, Genome
关25兴 H. Herzel, A. O. Schmitt, and W. Ebeling, Chaos, Solitons Res. 16, 536 共2006兲.
Fractals 4, 97 共1994兲. 关51兴 B. Audit, C. Thermes, C. Vaillant, Y. d’Aubenton-Carafa, J. F.
关26兴 A. O. Schmitt, H. Herzel, and W. Ebeling, Europhys. Lett. 23, Muzy, and A. Arneodo, Phys. Rev. Lett. 86, 2471 共2001兲.
303 共1993兲. 关52兴 B. Audit, C. Vaillant, A. Arneodo, Y. d’Aubenton-Carafa, and
关27兴 See supplementary material at http://link.aps.org/supplemental/ C. Thermes, J. Mol. Biol. 316, 903 共2002兲.
10.1103/PhysRevE.82.051917 for auxiliary material. 关53兴 S. Nicolay, E. B. Brodie of Brodie, M. Touchon, B. Audit, Y.
关28兴 Y. Almirantis and A. Provata, Bull. Math. Biol. 59, 975 d’Aubenton-Carafa, C. Thermes, and A. Arneodo, Phys. Rev.
共1997兲. Lett. 93, 108101 共2004兲.
关29兴 Y. Almirantis, J. Theor. Biol. 196, 297 共1999兲. 关54兴 S. Nicolay, E. B. Brodie of Brodie, M. Touchon, B. Audit, Y.
关30兴 K. J. Dechering, K. Cuelenaere, R. N. Konings, and J. A. Leu- d’Aubenton-Carafa, C. Thermes, and A. Arneodo, Phys. Rev.
nissen, Nucleic Acids Res. 26, 4056 共1998兲. E 75, 032902 共2007兲.
051917-9