Professional Documents
Culture Documents
Received 17 May 2020; reviews returned 20 December 2020; accepted 21 December 2020
Associate Editor: Claudia Solis-Lemus
Abstract.—Hybridization between species occurs more frequently in vertebrates than traditionally thought, but distinguish-
ing ancient hybridization from other phenomena that generate similar evolutionary patterns remains challenging. Here,
we used a comprehensive workflow to discover evidence of ancient hybridization between the Komodo dragon (Varanus
komodoensis) from Indonesia and a common ancestor of an Australian group of monitor lizards known colloquially as
sand monitors. Our data comprise >300 nuclear loci, mitochondrial genomes, phenotypic data, fossil and contemporary
records, and past/present climatic data. We show that the four sand monitor species share more nuclear alleles with
V. komodoensis than expected given a bifurcating phylogeny, likely as a result of hybridization between the latter species
and a common ancestor of sand monitors. Sand monitors display phenotypes that are intermediate between their closest
relatives and V. komodoensis. Biogeographic analyses suggest that V. komodoensis and ancestral sand monitors co-occurred
in northern Australia. In agreement with the fossil record, this provides further evidence that the Komodo dragon once
inhabited the Australian continent. Our study shows how different sources of evidence can be used to thoroughly characterize
evolutionary histories that deviate from a treelike pattern, that hybridization can have long-lasting effects on phenotypes,
and that detecting hybridization can improve our understanding of evolutionary and biogeographic patterns.[Biogeography;
introgression; Komodo dragon; phylogenetic networks; phylogenomics; reticulation; Varanus.]
Natural hybridization, the successful crossing of indi- Another commonly used approach is the calculation
viduals from distinct populations or groups of pop- of the D statistic in a test known as ABBA-BABA,
ulations (Harrison and Larson 2014), has long been which indicates whether the patterns of allele sharing
of interest to evolutionary biologists (Mallet 2008). between four taxa are more congruent with ILS or intro-
Interspecific hybridization has long been recognized as gression (Green et al 2010; see Materials and Methods
playing an important role in the evolutionary process section).
(Stebbins 1959), and Charles Darwin even devoted an Hybridization poses practical challenges to evolution-
entire chapter to it in the Origin of Species (Darwin 1859). ary biology and conservation. There is a broad body
Recently, technical and methodological advances have of literature devoted on how to alleviate the negative
revealed numerous instances of ancient hybridization (Li effects it can have on phylogenetic inference (Page and
et al 2016; Meier et al 2017; Taylor and Larson 2019), often Charleston 1997; Meyer et al 2017). Concern exists over
by detecting introgression (i.e., the transfer of alleles introgression involving endangered and/or evolution-
between species through hybridization and backcross- arily unique taxa, which can result in decreased fitness
ing). This has shown that hybridization is more wide- (Mallet 2005). However, evaluation of the phenotypic
spread in nature than traditionally considered and that and genotypic signatures of hybridization can also shed
it can have important evolutionary consequences (Taylor light on the historic processes responsible for observable
and Larson 2019), affecting key features such as genetic biological patterns. The study of natural hybrid zones
variability, fitness, diversification rates, and reproduct- has been crucial for understanding the nature of species
ive mode (e.g., Mallet 2005; Lampert and Schartl 2008; boundaries, the speciation process, and the heritability
Meier et al 2017; Patton et al 2020). However, detecting of organismal traits (Stebbins 1959; Barton and Hewitt
ancient hybridization remains practically challenging. It 1989; Taylor and Larson 2019). Furthermore, hybridiz-
often requires the generation of large molecular data sets, ation can play a fundamental role in the creation of
and methods used to infer ancient hybridization based novel genetic combinations on which natural selection
on molecular data need to account for other evolutionary can act upon, as has been documented in textbook
phenomena that generate similar patterns, particularly examples of adaptive radiation (Lamichhaney et al 2015;
incomplete lineage sorting (ILS; i.e., the retention of Meier et al 2017; Taylor and Larson 2019). Hybridization
ancestral genetic polymorphism across species). Explicit can therefore play a central role in the sorting of
phylogenetic networks depict phylogenetic relationships phenotypic variation, often resulting in increased fitness
as a reticulate network instead of a bifurcating tree (The Heliconius Genome Consortium 2012; Arnold and
whenever ILS is insufficient to explain the distribution Kunte 2017; Karimi et al 2020). Finally, the detection of
of genetic variation. In those cases, hybridization or hybridization can impose temporal and spatial bounds
other processes that cause deviation from a treelike that can yield insight into the timing and geographic
pattern such as horizontal gene transfer and admix- location of evolutionary events, as hybridization requires
ture may better explain the data (Cao et al 2019). sympatry. This has been the case with hominin evolution
1
2 SYSTEMATIC BIOLOGY
(Green et al 2010; Villanea and Schraiber 2019; Chen et al hybridization. We found molecular and phenotypic
2020). evidence for ancient hybridization between the Komodo
V. giganteus
51
0.93
V. rosenbergi
34.4 24.6 Sand 15 ka 900–750 ka
monitors N
V. spenceri
0.07 3.6 Ma
–20°
LAMc 450–330 ka
V. komodoensis
64.7 150 ka
3.5 Ma
V. varius
V. salvadorii
0 500 1,000 km
Pleistocene
Holocene
Pliocene
Miocene
120° 140°
Neogene Quat
Fossil V. komodoensis
20 10 0 Ma
FIGURE 1. Phylogeny and distribution of large Australasian monitors. The phylogeny is a species tree based on nuclear data, obtained in
ASTRAL, and time-calibrated with MCMCTree. Blue bars represent the 95% credibility interval for divergence dates. Numbers in nodes indicate
gene concordance factors. Bootstrap values were 100 for all nodes. The red dotted line indicates the most likely reticulation event as suggested by
our results, with the proportion of alleles inherited from each of the involved taxa specified by the red number (results from SNaQ; see Fig. 2b).
Representatives of each clade are illustrated (scaled roughly proportionally to each other). The map depicts the current distribution of living
taxa and fossil records (stars; approximate age shown next to each) of V. komodoensis (based on Ciofi et al (1999) and IUCN (2019)). Colors in the
map and illustrations follow the circles at the tips of the phylogeny.
for each sample is specified in Supplementary Table S1 sets, the development of approaches based on the MSC
available on Dryad. For the mitochondrial introgression that summarize gene trees to estimate species trees has
test (see below), we downloaded additional sequences improved the scalability of coalescent methods (Mirarab
of the NADH dehydrogenase subunit 2 gene (ND2) for et al 2014; Rabiee et al 2019). In order to maximize
the LAMc and an outgroup (Varanus acanthurus) from the sampling of loci while accounting for ILS, we used
GenBank (GB in Supplementary Table S2 available on the summary method ASTRAL-III 5.6.3 (Zhang et al
Dryad). 2018) to estimate the species tree of the LAMc. We
specified the maximum likelihood gene trees of Subset
1 obtained with IQ-TREE as input for ASTRAL and
Phylogenomics performed multilocus bootstrapping (Seo 2008) based on
Many of the methods we employed to test for hybrid- the ultrafast bootstrap trees of each locus.
ization rely on individual gene trees and a species-level To temporally frame our results and perform mor-
phylogeny. Thus, we inferred individual trees for each phological analyses that take an ultrametric tree as
locus in Subset 1. To evaluate mitonuclear discordance, input we time-calibrated the ASTRAL species tree in
we obtained the maternal phylogeny based on the MCMCTree as implemented in the PAML 4.8 package
mitochondrial genomes. Tree inference was performed (Yang 2007). MCMCTree is a Bayesian method that
using maximum likelihood in IQ-TREE 1.7 (Nguyen estimates divergence times on a fixed topology. Its
et al 2015; see Supplementary Appendix available on use of approximate likelihood calculation permits the
Dryad). Node support was obtained by performing efficient analysis of large alignments (dos Reis and
1000 ultrafast bootstrap replicates (Hoang et al 2018). Yang 2011), compared to computationally demanding
In the presence of ILS, it is generally accepted that Bayesian methods such as BEAST2 (Bouckaert et al 2014)
methods based on the multispecies coalescent (MSC) or MrBayes (Ronquist et al 2012). We used a concatenated
recover interspecific relationships more accurately than alignment of Subset 1 as input for MCMCTree, choosing
concatenation approaches (Edwards et al 2007; Mendes the individuals with the lowest proportion of missing
and Hahn 2018). While full Bayesian implementation of data for those species with multiple individuals. We
the MSC can be prohibitively inefficient for large data first estimated the branch lengths of the ASTRAL tree
4 SYSTEMATIC BIOLOGY
in BASEML as implemented in PAML. To calibrate the The possibility of nuclear introgression was further
tree, we specified a minimum age of 3.6 Ma for the evaluated using ABBA-BABA tests (Green et al 2010).
priors and parameters are found in the Supplementary the Akaike Information Criterion (AIC), Akaike weights
Appendix available on Dryad. (AICw), and difference in AIC (AIC).
6 SYSTEMATIC BIOLOGY
intermediate between the alleged parental lineages in the Brennan et al (2021) (Fig. 1, Supplementary Fig. S3
putative hybrid taxa (Wilson 1992). Additional details available on Dryad). The New Guinean V. salvadorii was
a)
10 48 V. panoptes 40 V. rosenbergi 38 V. mertensi
V. rosenbergi V. spenceri V. giganteus
0
25 V. gouldii 24 V. panoptes 24 V. panoptes
V. panoptes V. gouldii V. spenceri
–5 V. rosenbergi V. rosenbergi V. rosenbergi
V. giganteus V. spenceri V. gouldii
V. spenceri V. mertensi V. mertensi
–10 V. mertensi V. giganteus V. komodoensis
V. komodoensis V. komodoensis V. varius
–5 0 5 10
V. varius V. varius V. giganteus
Dimension 2
V. salvadorii V. salvadorii V. salvadorii
V. timorensis V. timorensis V. timorensis
b)
SNaQ: Subset 1 SNaQ: Subset 2
V. timorensis V. timorensis
V. salvadorii V. salvadorii
V. varius V. varius
V. komodoensis V. komodoensis
0.07 V. spenceri 0.18 V. panoptes
V. rosenbergi V. gouldii
0.93 0.82
V. gouldii V. rosenbergi
V. panoptes V. spenceri
V. giganteus V. giganteus
V. mertensi V. mertensi
FIGURE 2. Molecular evidence of hybridization in the large Australasian monitors clade. a) Clusters of gene trees identified through
multidimensional scaling of phylogenetic tree space; different colors are used for each cluster; individual gene trees and 95% confidence ellipses
for each cluster are plotted to the left, and the median tree for each cluster is plotted to the right; the number of trees assigned to each cluster is
shown next to each median tree; red dotted lines indicate trees where V. komodoensis is more closely related to sand monitors than to its sister
species, while blue dotted lines indicate trees where V. komodoensis disrupts the monophyly of the Australian clade. b) Phylogenetic networks
inferred for the large Australasian monitors clade; reticulation events are represented by the dotted lines, with numbers indicating the proportion
of alleles inherited from each of the involved taxa; analyses were based on either 337 loci with all species represented (Subset 1) or 40 informative
loci with little missing data (Subset 2).
8 SYSTEMATIC BIOLOGY
Morphology
–1
The PCA and phyloPCA were mostly congruent with
–1 –0.5 0 0.5
each other (Fig. 3; see Supplementary Table S8 available
Principal component 1 (35.1%) on Dryad for loadings). The highly arboreal and phylo-
b) genetically divergent V. salvadorii is morphologically
–0.5 divergent from the other members of the LAMc. Sand
monitors are morphologically intermediate between the
other members of the Australian clade (V. giganteus and
Principal component 2 (23.2%)
robust to branch-length uncertainty (Supplementary in the large Australasian monitors clade (LAMc). Discus-
Table S10 available on Dryad). sion on the diversification and systematics of the LAMc
10 SYSTEMATIC BIOLOGY
15 10 5 0 Ma
0m
−4,000 m
−8,000 m
FIGURE 4. Geographic distribution of the large Australasian monitors in space and time. a) Ancestral ranges inferred from current distribution
in discrete areas (left); ancestral ranges are shown by the colored rectangles on each node of the tree including only contemporary records
(middle) and fossil records of V. komodoensis (right); arrows indicate shifts into particular ranges; uncertainty in ancestral ranges is presented
in Supplementary Fig. S6 available on Dryad. b) Ancestral ranges based on polygonal distributions; results are shown in pairs for the analysis
including only contemporary records (left) and fossil records of V. komodoensis (right); different shades of red are used for 5% (darkest), 50%,
and 95% (lightest) posterior kernel density; sea depth/elevation is indicated in order to highlight areas that are currently underwater but that
can become exposed during glacial periods.
polyploidy, parthenogenesis, and unisexuality (Fujita the sand monitor V. panoptes, and other varanids outside
and Moritz 2009; Neaves and Baumann 2011). How- the LAMc (Lenk et al 2005; Watts et al 2006; Iannucci
ever, prior evidence for hybridization in Varanidae is et al 2019). Vertebrate parthenogenesis is often associated
limited. Brennan et al (2021) found that an individual with polyploidy, which is in turn often associated with
of V. bengalensis showed mixed ancestry with a dis- hybridization (Mallet 2008). However, all the partheno-
tantly related Varanus clade and suggested that the genetic varanid individuals for which ploidy has been
conserved chromosomal number in Varanidae could tested are diploid (Iannucci et al 2019). Nonetheless,
facilitate hybridization (Kinga and King 1975). Here, polyploidy has been recently reported for one individual
we show compelling evidence for hybridization in of a normally diploid species, V. albigularis (Iannucci
monitor lizards. The relatively low proportion of alleles et al 2019). Thus, it seems possible that future studies
coming from the Komodo dragon that was inherited with wide sampling of individuals could reveal a link
by sand monitors is more consistent with introgressive between parthenogenesis, polyploidy, and hybridization
hybridization than with hybrid speciation (Fig. 2). The in Varanus.
apparent absence of mitochondrial introgression could Patterns of morphological variation in the LAMc are
be due to our limited sampling, but it could also suggest consistent with hybridization involving V. komodoensis
that backcrossing was male biased. This could be due and sand monitors. Sand monitors are morphologically
to selection against hybrid females (Mitchell et al 2016). intermediate between their sister clade and the Komodo
Alternatively, it has been suggested that hybridization dragon (Fig. 3; Supplementary Appendix available on
will commonly involve males of the larger species Dryad). While support was not decisive, evolutionary
and females of the smaller species, reflecting female models involving a contribution of V. komodoensis in the
preferences or mechanical limitations to mating (Grant phenotype of sand monitors were preferred for most
and Grant 1997; Wirtz 1999). This is consistent with the characters. In most cases, the preferred model was that
inferred role of the large Komodo dragon as the source of implying intermediacy. Furthermore, the “search.conv”
gene flow (Supplementary Fig. S5d available on Dryad). analyses revealed several instances of greater than
Facultative parthenogenesis (nonobligate asexual expected similarity between sand monitors and the
reproduction) has been reported for the Komodo dragon, Komodo dragon. Convergence cannot be ruled out as the
single explanation for the intermediacy of sand monitors most of our biogeographic analyses. The exception is the
and their similarity with the Komodo dragon. However, “BioGeoBEARS” analysis which does not incorporate
the fossil record. Thus, we agree with Hocknull et al
12 SYSTEMATIC BIOLOGY
ACKNOWLEDGMENTS Chen L., Wolf A.B., Fu W., Li L., Akey J.M. 2020. Identifying and
interpreting apparent Neanderthal ancestry in African individuals.
We thank A. Allison, A.P. Amey, W. Böhme, U. Bott, Cell 180:677–687.
Hocknull S.A., Piper P.J., van den Bergh G.D., Due R.A., Morwood M.J., Mallet J. 2008. Hybridization, ecological races and the nature of species:
Kurniawan I. 2009. Dragon’s paradise lost: palaeobiogeography, empirical evidence for the ease of speciation. Philos. T. R. Soc. B
evolution and extinction of the largest-ever terrestrial lizards 363:2971–2986.
14 SYSTEMATIC BIOLOGY
Price G.J., Louys J., Cramb J., Feng Y., Zhao J., Hocknull S.A., Webb G.E., Stebbins G.L. 1959. The role of hybridization in evolution. Proc. Am.
Nguyen A.D., Joannes-Boyau R. 2015. Temporal overlap of humans Philos. Soc. 103:231–251.
and giant lizards (Varanidae; Squamata) in Pleistocene Australia. Taylor S.A., Larson E.L. 2019. Insights from genomes into the