Professional Documents
Culture Documents
Dynamics of DNA Replication in Yeast
Dynamics of DNA Replication in Yeast
week ending
5 AUGUST 2011
Alessandro de Moura
Institute of Complex Systems and Mathematical Biology, University of Aberdeen, Aberdeen, AB24 3UE United Kingdom
(Received 15 December 2010; published 4 August 2011)
We present a mathematical model for the spatial dynamics of DNA replication. Using this model we
determine the probability distribution for the time at which each chromosomal position is replicated. From
this we show, contrary to previous reports, that mean replication time curves cannot be used to directly
determine origin parameters. We demonstrate that the stochastic nature of replication dynamics leaves a
clear signature in experimentally measured population average data, and we show that the width of the
activation time probability distribution can be inferred from this data. Our results compare favorably with
experimental measurements in Saccharomyces cerevisae.
DOI: 10.1103/PhysRevLett.107.068103
DNA replication starts at specific locations in the chromosome called replication origins. Most bacterial genomes
are replicated from a single origin, but the much greater
size of eukaryotic genomes requires multiple origins per
chromosome to ensure that the replication process does
not take too long. Genome replication has been comprehensively studied in the model organism S. cerevisiae
(brewers yeast). Origin locations in S. cerevisiae are determined by specific DNA sequences and are thus fixed in
every yeast cell [1]. The positions of origins in yeast have
been comprehensively catalogued [2]. However, a given
origin may not be active during replication, because origins
must be licensed before the start of S phase (the part of the
cell cycle where DNA replication takes place). Licensing
consists of a series of specific protein complexes binding
at origin locations, culminating in the loading of pairs of
Mcm2-7 molecules. If in a given cell licensing of a certain
origin is not completed by the time S phase starts, the
origin is unable to function [3].
High-throughput methods have allowed the measurement of replication times as a function of chromosomal
position for the whole genome [4]. These experiments
yield average replication times over large cell populations
(typically >107 cells) and therefore can mask the cell-tocell variability present in the system [5]; to date single
cell and single molecule studies are not able to measure
the kinetics of whole genome replication [5,6]. The low
abundance of the molecules involved in triggering origin
activation strongly suggests that origins have stochastic
activation times [7]. This is often ignored in the biological
0031-9007=11=107(6)=068103(4)
068103-1
(1)
N
X
i1
Pi x; t:
(2)
week ending
5 AUGUST 2011
Z 1
1
tPx; tdt;
1 s1 s2 sN 1
(3)
R
where 1 s1 s2 sN 1
1 Px; tdt is the probability
that at least one of the origins will activate. The average
replication time across whole chromosomes [Tx curves]
has been measured in a number of organisms. However,
caution is required when interpreting Tx curves. In some
of the biological literature, Tx curves are used to directly
infer origin parameters [4]. For example, it is widely
accepted that the values of T at xi are the average activation
times of origins. However, Eq. (3) shows that Tx is
determined collectively by all origins [8]. This suggests
that simple interpretations of Tx are not justifiable.
We want to use the general theory presented above to
study replication dynamics in a simple setting. From now
on we focus on the case of a hypothetical linear chromosome with just two origins. We define the chromosomal
coordinates so that one of the origins has position x1 0;
the other origin has position x2 D. We assume for simplicity that each origin can activate within a time window
t with uniform probability; we will argue later that our
conclusions are largely independent of the precise shape of
the probability distribution. We select origin activation
times so that the average activation time of the first origin
is 0. The other origin has an average activation time , and
we assume without loss of generality that 0. Thus the
activation time distributions are
pi t
qi
;
t
t av t
;
t
;
if t 2 tav
i
2 i
2
(4)
av
where i 1, 2, tav
1 0, and t2 . p1 and p2 are set to
zero outside the stated intervals.
Using Eqs. (3) and (4), we can write analytical expressions for the probability density Px; t and the average
replication time Tx. From Eq. (4) and (2), Px; t vanishes
outside the intervals I1 and I2 given by
Ii
jxi xj
t jxi xj
t
av
tav
;
t
;
i
i
v
2
v
2
(5)
where x1 0 and x2 D. In total there are five scenarios (depending on the relative values of , t, D, and v)
that differ in the dynamics of how the chromosome is
replicated. From here on we will consider just the case
where the condition t < D=v is satisfied, since this
is the case for many pairs of origins in real chromosomes. This means that the variations in the activation
time t are small enough that a fork from one origin can
only replicate the other origin if that origin is not
competent. The expression for Tx is then
068103-2
week ending
5 AUGUST 2011
8 Dq s q s x
>
v2 1 2v1 q2 s1 q1vjxj=1 s1 s2 ;
>
>
>
2 t2 v2 v2x2
>
D2xvfq1 Dvtq2 s1 Dvtg
>
2D tv 2x vq1 q2 D12t
>
2 v3 1s s
>
2tv2 1s1 s2
1 2
>
>
tvf4q
q
x3q
q
3q
vg
Dft3q
q
3q
v2q
q
v2xg
>
1
2
1
1
2
1
1
2
1
2
<
;
2
3
2
3
12t v 1s1 s2
12t v 1s1 s2
Tx D2xvfq
2 t2 v2 v2x2
>
2 Dvtq1 s2 Dvtg
>
2D
2x v tq1 q2 D12t
2 v3 1s s
>
2tv2 1s1 s2
>
1 2
>
>
tv4q1 q2 x3q1 3q2 5q1 q2 v
Dt3q1 3q2 5q1 q2 v2q1 q2 2xv
>
>
;
2
3
2
3
>
12t v 1s1 s2
12t v 1s1 s2
>
> qsx
:
=1
s
s
;
1v2 q2 q2 jDxj
1
2
v
if x < Dvt
;
2
if Dvt
x < Dv
2 ;
2
Dvt
if Dv
;
2
2 x
if x > Dvt
:
2
(6)
(7)
q1 q2 s1
:
v1 s1 s2
(8)
q1 0.3
q1 0.5
T (x)
t 1
q1 0.7
ORI1
t 2
Tx
q1 0.9
t 4
ORI2
t 8
ORI1
ORI2
068103-3
30
Fitted parabola
25
20
ARS1012/13 ARS1014
15
360
370
380
390
400
410
420
430
Chromosomal position, kb
FIG. 3. Replication time curves [4] for S. cerevisiae chromosome X with a fitted parabola [375420 kilobase pairs (kb)].
values of t. In order to investigate this more quantitatively, we use the modulus of the second derivative of Tx
at the maxima to measure how broad the maxima islow
values of jT 00 j correspond to broad peaks. We now use
Eq. (6) to find the relationship between jT 00 j and the origin
parameters. At the maximum of Tx, we find
q
4q1 q2 1 j q12 q11 j
jT 00 j
:
(9)
v2 t1 s1 s2
Thus jT 00 j is inversely proportional to t. Notice also
that jT 00 j does not depend on , which means it is independent of the origins average activation times. This expression can be used to calculate t from an experimental
replication time profile Tx, if the origin competencies
and the fork velocity are known. This is a very useful result
because it allows the determination of a quantity characterizing stochastic properties of the system t from Tx,
which is defined by a population average. This is valuable
because experiments to directly measure t are technically
difficult, and there are few results available [5,6]. We note
that this does not require assuming that all cells in the
population are synchronized, since in each individual cell
in an asynchronous population, the statistics of the relative
activation times of origins remain unaltered [8].
Equation (9) was obtained using a simple uniform distribution for pi t. However, we expect it to be a good
approximation for any single-peaked distribution function
pi t, since Eq. (9) only involves the second moment (the
variance) of the distribution, and the replication dynamics
are mostly determined by the average activation time and
the width of the activation distributionthe first and second moments of pi t. To test this assumption we used
Eqs. (1)(6) to numerically compute Tx for pairs of
origins with either a Gaussian or a skewed distribution
that lead to sigmoidal cumulative distributions [14].
Choosing parameters such that all these distributions
have the same mean and variance, we find that in all cases
T 00 never differs between distributions by more than 10%.
Despite the fact that we have been considering a hypothetical two-origin chromosome, we expect Eq. (6) to be a
good approximation for chromosomes with many origins
when two neighboring origins are relatively isolated from
week ending
5 AUGUST 2011
068103-4