Professional Documents
Culture Documents
E
volutionary associations The association between two or between organisms, such as be-
among genes, organisms more lineages over evolutionary time istween hosts and their parasites9
and geographical areas have a recurrent theme spanning several (including viruses10), endosym-
traditionally been studied bionts and their hosts11, and
different fields within biology, from
by biologists from different disci- molecular evolution to coevolution insects and plants12,13, can have a
plines, with little interaction be- and biogeography. In each long evolutionary history, which
tween them. Consequently, recog- is reflected in similarities between
‘historical association’, one lineage is
nition of the fundamental similarity associated with another, and can be their evolutionary trees14. At a
of the problem faced by molecular thought of as tracking the other overlarger scale still, organisms can
systematists, parasitologists and evolutionary time with a greater or track geological history such that
biogeographers has been slow in lesser degree of fidelity. Examplessequences of geological events
coming1–3. This is particularly true include genes tracking organisms, (e.g. continental break-up) are di-
of the parallels between the relation- parasites tracking hosts and organisms rectly reflected in the phylogenies
ship between gene and organismal tracking geological and geographical of those organisms15.
phylogeny, and the macroevolution- changes. Parallels among these In each association, one entity
ary associations studied by para- problems raise the tantalizing prospect(the ‘associate’) tracks the other
sitologists and biogeographers. that each is a special case of a more(the ‘host’) with a degree of fidelity
The analogy between vicariance general problem, and that a single that depends on the relative fre-
biogeography (organisms tracking analytical tool can be applied to quency of four categories of events:
areas) and host–parasite cospeci- all three kinds of association. codivergence, duplication, hori-
ation (parasites tracking hosts) has zontal transfer and sorting (Box
been recognized for some time4; for 1). Joint cladogenesis of host and
a parasite the host can be thought Roderic Page is at the associate is codivergence. If the
of as an ‘area’, hence host speci- Division of Environmental and Evolutionary Biology, associates undergo cladogenesis
Institute of Biomedical and Life Sciences,
ation is equivalent to a vicariance University of Glasgow, Glasgow,
independently and both descend-
event (Fig. 1). The suggestion that UK G12 8QQ (r.page@bio.gla.ac.uk); ants remain associated with the
these macroevolutionary patterns Michael Charleston is at the host then we have a duplication of
are analogous to the relationship Dept of Zoology, University of Oxford, associate lineages. Cladogenesis
between gene and species trees is Oxford, UK OX1 3PS accompanied by one descendant
a more recent development1,3. (michael.charleston@zoology.oxford.ac.uk). colonizing a new host is horizontal
transfer. ‘Sorting event’ is a gen-
Types of historical association eric term for the apparent absence
Historical associations can be of an associate from a host.
divided into three basic categories (Table 1): genes and The analogies among the categories of events for the dif-
organisms, organisms and organisms, and organisms and ferent kinds of association (Table 1) need not imply close
areas. At the molecular level, each gene has a phylogenetic analogy among the processes; rather, the analogy is among
history that is intimately connected with, but not necessar- the patterns these processes produce. For example, although
ily identical to, the history of the organisms in which the the processes of gene duplication and allele divergence are
gene resides5,6. Processes such as gene duplication, lineage different, the resulting pattern is the same – more than one
sorting and horizontal transfer can produce complex gene gene lineage in the same organismal lineage.
trees that differ from organismal trees3,7,8. Associations
Reconstructing the history of an association
Despite the relative lack of interaction among these dif-
ferent disciplines, strikingly similar concepts have arisen in-
(a) (b) (c) dependently from them. Parasitologists16,17 recognized the
problem of multiple parasite lineages decades before
Fitch’s8 analogous distinction between paralogous and
orthologous genes18 (Box 1). Molecular systematists19 and
cladistic biogeographers20 independently developed similar
methods for interpreting the history of gene trees and bio-
Time geographic patterns, respectively.
Organism Host Area
One implication of the parallels among the different kinds
Gene Parasite Organism
of association is that they can be studied using the same
Fig. 1. Historical associations among genes, organisms and areas: (a) a gene tree methods. Reconciled trees (Box 2) originated in molecular
embedded in a species tree; (b) a parasite cospeciating with its host; and (c) a systematics19 but have been applied to both host–parasite
clade of organisms diverging in concert with geological events (vicariance). In each coevolution21 and biogeography22. As well as visualizing the
case one entity (the ‘associate’) can be thought of as tracking the other (the ‘host’). relationship between host and associate, reconciled trees
provide a quantitative measure of the extent to which the
356 Copyright © 1998, Elsevier Science Ltd. All rights reserved. 0169-5347/98/$19.00 PII: S0169-5347(98)01438-4 TREE vol. 13, no. 9 September 1998
REVIEWS
Organism–gene Interspecific coalescence40 Gene duplication, deep coalescence5 Gene transfer Gene loss, lineage sorting
Host–parasite Cospeciation Within host speciation Host-switch Parasite extinction, ‘missing the boat’
Organism–areas Vicariance Sympatry Dispersal Extinction
host and its associate’s phylogenetic histories have diverged. Multiple lineages
Hence, reconciled trees have been employed in two differ- Duplications result in multiple lineages of associates on
ent ways: to document the history of an associate where the same host lineages. The implications of this for inferring
both the host and associate relationships are presumed to host phylogenies have been recognized by molecular sys-
be known, and to infer host relationships based on the as- tematists dealing with gene families8 or multiple mitochon-
sociate phylogeny. The inference of species trees from gene drial lineages24. If some gene lineages go extinct or are in-
trees is the paradigm instance of the latter, but there is a long completely sampled, gene trees might not faithfully reflect
history of parasitologists attempting to infer host phylogeny organismal history (Fig. 2). In contrast, parasitologists and
from parasite phylogeny23, and cladistic biogeographers aim biogeographers have (with a few exceptions17,20) attributed
to infer geological history from organismal phylogeny15. discordance between host and associate trees to horizontal
transfer, rather than to a combination of multiple lineages
and subsequent loss of associates. Host-switching and dis-
Box 1. Terminology of historical associations persal do occur but, as they might not be the sole cause of
Given the parallels among different kinds of historical association it is discordance, their prevalence can be overestimated18.
desirable to have a generic set of terms that are applicable to genes,
organisms and areas33. The terminology used here is illustrated below.
Box 2. Reconciled trees
The concept of reconciled trees dates from Goodman et al.’s19 attempts to recon-
cile disagreements between then accepted mammalian evolutionary relationships
Horizontal and those obtained from haemoglobin genes. Largely neglected until recently, rec-
transfer onciled trees are now receiving renewed attention from biologists and mathemati-
cians26,27,30,31,33,36. Suppose we have a phylogeny for four species, and four se-
quences sampled from those species, and that the two trees, which we believe to
Sorting be correct, disagree, as illustrated in (a).
event
(a)
Organism 1 a Gene
Associate
Codivergence 2 b
Host Duplication 3 c
4 d
Associate: a lineage that tracks another lineage, or set of historically The question is, how can the trees both be true and yet be discordant? One expla-
related entities (such as geographical areas). nation is to embed the gene tree in the species tree (b), which requires us to pos-
Codivergence: joint divergence of both host and associate. Examples tulate a number of gene duplications and subsequent losses (in this instance one
include host–parasite cospeciation, and vicariance. duplication and three losses). This embedding can also be represented using a
Duplication: independent divergence of the associate, with both de- reconciled tree (c), which simply takes the embedded gene tree and ‘unfolds’ it so
scendants remaining associated with the host (e.g. gene duplication). that it lies flat on the page. The reconciled tree would depict the complete history
Horizontal transfer: transfer of an associate lineage from one host of the gene if there had been no gene losses (i.e. the three sorting events). As a
(‘source’) to another host (‘destination’) that is not itself the immedi- consequence of the gene duplication in the ancestor of species 2, 3 and 4, we would
ate descendant of the source host. Examples include horizontal gene expect those species to each have two copies of the gene. Because they do not, we
transfer and host switching. must postulate three gene losses. Alternatively, the gene copies could be present
Host: the lineage or entities being tracked, such as organisms har- but undetected. Hence, the reconciled tree makes predictions about the existence
bouring a lineage of parasites. of undiscovered genes. It also suggests that genes b and c are paralogous to gene
‘Missing the boat’: if an associate is distributed over only part of the d, which is not apparent from the gene phylogeny alone.
host’s distribution (e.g. a patchily distributed parasite), divergence
within the host lineage may yield one or more descendants that lack (b) (c)
the associate. The associate has not gone extinct from those hosts, it Missing genes
simply was never there21. a b c d
1 2 3 4 a b c d
Orthologous: a pair of genes that are descendants of the same copy of
a gene are orthologous. 1 2 3 (4) (2) (3) 4
Paralogous: a pair of genes separated by at least one gene duplication
are paralogous. Sorting
Reconciled tree: the simplest embedding of an associate tree inside Sorting events
its host tree (Box 2). events
Sorting event: the (apparent) absence of an associate in the descend- Gene
Duplication
ants of a host that had previously had that associate. Sorting events
include extinction of the associate and ‘missing the boat’. Associates Duplication
can also be present but undetected, such as with rare parasites or Organism
undiscovered gene loci. Reconciled tree