Professional Documents
Culture Documents
15
0022-538X/00/$04.00⫹0
Copyright © 2000, American Society for Microbiology. All Rights Reserved.
The eukaryotic replicative DNA polymerases are similar to those of large DNA viruses of eukaryotic and
bacterial T4 phages but not to those of eubacteria. We develop and examine the hypothesis that DNA virus
replication proteins gave rise to those of eukaryotes during evolution. We chose the DNA polymerase from
phycodnavirus (which infects microalgae) as the basis of this analysis, as it represents a virus of a primitive
eukaryote. We show that it has significant similarity with replicative DNA polymerases of eukaryotes and
certain of their large DNA viruses. Sequence alignment confirms this similarity and establishes the presence
of highly conserved domains in the polymerase amino terminus. Subsequent reconstruction of a phylogenetic
tree indicates that these algal viral DNA polymerases are near the root of the clade containing all eukaryotic
DNA polymerase delta members but that this clade does not contain the polymerases of other DNA viruses. We
consider arguments for the polarity of this relationship and present the hypothesis that the replication genes
of DNA viruses gave rise to those of eukaryotes and not the reverse direction.
Divergence of the bacterial and eukaryotic lineages appears of RNA genomes and the error-prone nature of their replica-
to represent the deepest split in the tree of life (22). Because tion (14). It therefore appears more likely that the common
the DNA replication proteins of these groups are of funda- ancestor had a DNA genome, which leaves unexplained how
mental importance and interact through complex mechanisms, the replication systems underwent the transition during the
it seems likely that the genome replication system, like the divergence of bacteria from archaea and eukaryotes.
translational system, would contain the most conserved co- DNA viruses, however, also possess a full set of independent
evolved genes among all related lineages. DNA replication and repair proteins that include members of
Obvious functional homologues of replication genes are family A and B DNA Pols (12). When first sequenced, it was
found in bacteria, eukaryotes, and archaea, including proteins noteworthy how similar phage T4 DNA Pol was to DNA Pols
involved in origin recognition, helicases, DNA-binding pro- alpha and delta of eukaryotes, Epstein-Barr virus, human cy-
teins, DNA synthesis, sliding clamp processivity factors tomegalovirus, and other DNA viruses of eukaryotes, but not
(PCNA), ligation, and primer removal (see reference 7 and adenoviruses or E. coli Pol I or III (23). This similarity includes
references therein). However, there are clear differences in the conservation of five of six sequential domains (31), as well
sequence similarity that separate the replication proteins of as resistance to various family B-specific inhibitors (3). Other
bacteria from those of the archea and eukaryotes (7). The phage DNA Pols, however, such as T7, show similarity to
bacterial replication genes thus appear evolutionarily unre- bacterial DNA Pol I but not to Pols of eukaryotes. With the
lated to those of eukaryotes and archaea. For example, the sequencing of the entire T4 genome, it was additionally sur-
replicative DNA polymerase (Pol) III of Escherichia coli be- prising to see that this strictly lytic bacteriophage had more
longs to the family C DNA Pol group and does not have genes similar to those of eukaryotes (including genes for self-
similarity to either of the two mammalian replicative DNA splicing RNA [13]) than to bacterial genes (4). Viruses are
family B DNA Pols (alpha priming and delta extending; see usually thought to impose negative selection on their hosts. In
reference 30). As such, phylogenetic analysis of these replica- addition, recombination between host and viral genomes is a
tive DNA Pols results in polyphyletic groupings that are con- commonly observed phenomenon, such as with retroviruses
trary to accepted species trees (6). Such wide existence of acquiring cellular protooncogenes (5, 28). Yet viruses are
functionally identical yet nonorthologous genes presents a di- rarely considered a source of host genes, and hence viral se-
lemma when they are being used for connecting the universal quences are not taken into account when reconstructing the
tree of life, and this has led some to propose that the cenan- tree of life. However, a viral genome can evolve up to a million
cestor of bacteria, archea, and eukaryotes had an RNA ge- time faster than that of its host. If a DNA virus could impose
nome (7, 17). However, it is now clear that between bacteria a stable persistent (or genomic) infection on its host, it might
and eukaryotes, perhaps several hundred functional genes are then also provide genes altering host evolution, as we have
homologous (e.g., DNA synthesis genes). This suggests that previously reasoned (29). This raises the question: Could a
the putative prokaryotic-eukaryotic ancestor possessed many DNA virus have been the origin of replicative eukaryotic DNA
genes inherited by both lineages (for references, see references Pols?
14 and 8). Proper replicative transmission of such a large In this report, we consider the hypothesis for the viral origin
number of essential genes seems unlikely given the small size of eukaryotic replication proteins in the context of DNA vi-
ruses that infect host species which are likely representative of
* Corresponding author. Mailing address: Department of Molecular the earliest eukaryotes. We examine DNA Pols from two fam-
Biology and Biochemistry, 3205 Bio Sci II, University of California, ilies of DNA viruses prevalent as acute infections of parasitic
Irvine, Irvine, CA 92697. Phone: (949) 824-6074. Fax: (949) 824-8551. microalgae (Chlorella-like viruses) (27) and persistent infec-
E-mail: lpvillar@uci.edu. tions of filamentous brown algae (Feldmania species virus) (9,
7079
Downloaded from http://jvi.asm.org/ on July 18, 2018 by UNIVERSIDAD AUTONOMA DE SLP
7080
VOL. 74, 2000 VIRAL ORIGIN OF EUKARYOTIC REPLICATION PROTEINS 7081
15, 16, 21, 27). These algal species represent some of the RESULTS
earliest eukaryotes for which clear archaeological data exist
(11). We perform sequence similarity and phylogenetic analy- The results suggest that the relationships are robust: 68% of
ses which indicate that these viral proteins appear related to the nodes had ⬎90% bootstrap frequency support, and all
the progenitor of all eukaryotic Pol delta sequences and con- nodes were ⬎50%. The unrooted tree shows DNA Pol se-
sider arguments that a DNA virus may have been the origin of quences falling into seven clades that correspond to biologi-
the eukaryotic DNA replication system. cally coherent gene sets. The two largest clades correspond to
variants of DNA Pol alpha (pink) and DNA Pol delta, respec-
MATERIALS AND METHODS tively. In the DNA Pol delta clade (black), the Feldmania
species virus (which causes a prevalent persistent infection of
FIG. 1. Amino acid alignment of four highly conserved DNA Pol protein regions. Taxon names are color coded according to clade as in Fig. 3 and are labeled A0
to L5 according to the branch tips therein. Gaps inserted to improve the alignment are indicated by a dash (—). Amino acids are color coded according to side group
properties using the following scheme: red, negatively charged (D or E); orange, positively charged (H, K, or R); light green, amide (N or Q); blue, alcohol (S or T);
purple, aliphatic (L, I, or V); gray, aromatic (F, Y, or W); brown, small (A or G); dark green, sulfur-containing (M or C); white, proline (P). Abbreviations: Hu, human;
VZV, varicella-zoster virus; HSV, herpes simplex virus; cytomeg., cytomegalovirus; HHV, human herpesvirus.
7082 VILLARREAL AND DEFILIPPIS J. VIROL.
rise to similar patterns, the authors did not attempt to explain naviruses are clearly evolutionarily exceptional DNA viruses.
this result (6). Since it is unrooted, the phylogeny does not The simplest way to account for these observations is to pro-
directly establish the polarity or direction of evolutionary pose that host Pol delta genes are derived from an early DNA
change. It therefore remains formally possible that the phy- viral gene that resembles that present in Feldmania virus.
codnaviruses acquired DNA Pol genes from their algal hosts Trees of life have been generated using different genes,
and maintained similarity to them for unknown reasons. As the yielding multiple evolutionary histories (8). Phylogenetic anal-
algal host DNA Pol genes have not been sequenced, we cannot ysis of DNA Pol sequences presents patterns inconsistent with
place them on this tree. Even if they were subsequently to be accepted organismal phylogenies. These phylogenetic dispari-
placed phylogenetically near the phycodnavirus genes, this ties are difficult to explain if most genetic variation during
would still be unlikely to resolve the issue of evolutionary evolution of species occurs by random genetic change and
direction. However, we believe several considerations argue vertical gene transmission. Genomic analysis has suggested
that the direction of transmission was from virus to host. First, that horizontal transfer of gene sets may have been more
only under this circumstance could the dilemma of dissimilar prevalent then previously believed, especially in bacterial spe-
replication genes now present in bacteria and eukaryotes be cies. Horizontal transmission of DNA replication genes, how-
resolved. In addition, all the other viral DNA Pols examined ever, would suggest the transfer of fundamental, complex, cel-
form distinct monophyletic groups (i.e., herpesviruses, poxvi- lular components and the involvement of a DNA virus. We
ruses, and baculoviruses) that do not include host Pols. There- have argued that the persistence of a genetic parasite (a virus
fore, these other viruses did not appear to acquire their Pol or its defective derivatives) is a life strategy that can allow the
genes from a host species. The DNA delta clade is clearly superimposition of complex molecular genetic control systems
monophyletic yet includes all the diverse phycodnavirus Pols of onto its host (29). As such, a persistent agent (like Feldmania
both microalgal and filamentous algal hosts. Thus, the phycod- virus) can potentially provide new systems of genetic control,
VOL. 74, 2000 VIRAL ORIGIN OF EUKARYOTIC REPLICATION PROTEINS 7083
including genome replication, to its host, particularly if it is gruent with their host species over long periods of time (29).
integrated into the genome. We suggest at least in the case of Thus, at least for these regulatory genes, they are viral, not
DNA Pol delta an evolutionary link of the bacteria and eu- host, creations. Viral genomes can evolve much faster than
karyota (and archaea) via the DNA Pol of an ancient DNA host genomes, and populations are known to exhibit much
virus, not the replicative host genes. Our analysis also suggests greater genetic variability, as demonstrated by the frequent
that DNA Pol alpha may share an ancestor with DNA Pol II of occurrence of mutants and defectives. Thus, viral systems have
archaea that diverged after the initial divergence of bacteria an enhanced capacity to produce genetic novelty. Although
from eukaryotes and archaea. Two other DNA Pols resemble some examples of virus-mediated horizontal gene transfer
the family B replicative Pols of eukaryotes and archaea. One is have recently been proposed (2), in most of these proposals it
the nonessential Pol II of E. coli, and the other is the Pol from is suggested that the host, not the virus, is the original source
lytic phages T4 and RB69. Both branch from the largely un- of the transferred gene. We now suggest that such infectious
resolved center of the tree. As the phages represent a much and/or persisting agents may be a general source for acquisi-
more transmissible system than E. coli Pol II, and as T-like tion of complex molecular systems and phenotypes.
phages infect both bacteria and archaea (Euryachaeota king-
dom [32]), it is easier to envision substitution of functional ACKNOWLEDGMENT
homologues for DNA replication genes if such a virus was This research was supported by the Irvine Research Unit in Animal
involved. Other DNA replication genes may also fit this pat- Virology.
tern, since it is known that DNA viruses also code for various
ligases, helicases, and PCNA-like genes as well as “repair-like” REFERENCES
DNA Pols, such as DNA Pol beta, found in entomopoxvirus 1. Afonso, C. L., E. R. Tulman, Z. Lu, E. Oma, G. F. Kutish, and D. L. Rock.
(1). 1999. The genome of Melanoplus sanguinipes entomopoxvirus. J. Virol. 73:
533–552.
Many of the crucial regulatory genes of DNA viruses, such 2. Baldo, A. M., and M. A. McClure. 1999. Evolution and horizontal transfer of
as the T antigens of polyomaviruses, have no known host an- dUTPase-encoding genes in viruses and their hosts. J. Virol. 73:7710–7721.
alogues, even though these viruses are phylogenetically con- 3. Bernad, A., A. Zaballos, M. Salas, and L. Blanco. 1987. Structural and
7084 VILLARREAL AND DEFILIPPIS J. VIROL.
functional relationships between prokaryotic and eukaryotic DNA poly- 19. Pisani, F. M., C. De Martino, and M. Rossi. 1992. A DNA polymerase from
merases. EMBO J. 6:4219–4225. the archaeon Sulfolobus solfataricus shows sequence similarity to family B
4. Bernstein, H., and C. Bernstein. 1989. Bacteriophage T4 genetic homologies DNA polymerases. Nucleic Acids Res. 20:2711–2716.
with bacteria and eucaryotes. J. Bacteriol. 171:2265–2270. 20. Saitou, N., and M. Nei. 1987. The neighbor-joining method: a new method
5. Bishop, J. M. 1983. Cellular oncogenes and retroviruses. Annu. Rev. Bio- for reconstructing phylogenetic trees. Mol. Biol. Evol. 4:406–425.
chem. 52:301–354. 21. Sengco, M. R., M. Braeutigam, M. Kapp, and D. G. Mueller. 1996. Detection
6. Braithwaite, D. K., and J. Ito. 1993. Compilation, alignment, and phyloge- of virus DNA in Ectocarpus siliculosus and E. fasciculatus (Phaeophyceae)
netic relationships of DNA polymerases. Nucleic Acids Res. 21:787–802. from various geographic areas. Eur. J. Phycol. 31:73–78.
7. Edgell, D. R., H. P. Klenk, and W. F. Doolittle. 1997. Gene duplications in 22. Sogin, M. L., J. H. Gunderson, H. J. Elwood, R. A. Alonso, and D. A. Peattie.
evolution of archaeal family B DNA polymerases. J. Bacteriol. 179:2632–2640. 1989. Phylogenetic meaning of the kingdom concept: an unusual ribosomal
8. Forterre, P. 1999. Displacement of cellular proteins by functional analogues RNA from Giardia lamblia. Science 243:75–77.
from plasmids or viruses could explain puzzling phylogenies of many DNA 23. Spicer, E. K., J. Rush, C. Fung, L. J. Reha-Krantz, J. D. Karam, and W. H.