Professional Documents
Culture Documents
Lectura Taller Sesion 6
Lectura Taller Sesion 6
Report
Phylogenomic insights into
the early diversification of fungi
Jürgen F.H. Strassert1,3,4,* and Michael T. Monaghan1,2
1Department of Evolutionary and Integrative Ecology, Leibniz Institute of Freshwater Ecology and Inland Fisheries (IGB), Berlin, Germany
2Institut €t Berlin, Berlin, Germany
für Biologie, Freie Universita
3Twitter: @jfh_strassert
4Lead contact
*Correspondence: juergen.strassert@igb-berlin.de
https://doi.org/10.1016/j.cub.2022.06.057
SUMMARY
Phylogenomic analyses have boosted our understanding of the evolutionary trajectories of all living forms
by providing continuous improvements to the tree of life.1–5 Within this tree, fungi represent an ancient
eukaryote group,6 having diverged from the animals 1.35 billion years ago.7 Estimates of the number
of extant species range between 1.5 and 3.8 million.8,9 Recent reclassifications and the discovery of
the deep-branching Sanchytriomycota lineage10 have brought the number of proposed phyla to 20,11
21 if the Microsporidia are included.12–14 Uncovering how the diverse and globally distributed fungi are
related to each other is fundamental for understanding how their lifestyles, morphologies, and metabolic
capacities evolved. To date, many of the proposed relationships among the phyla remain controversial
and no phylogenomic study has examined the entire fungal tree using a taxonomically comprehensive
dataset and suitable models of evolution. We assembled and curated a 299-protein dataset with a
taxon sampling broad enough to encompass all recognized fungal diversity with available data, but
selective enough to run computationally intensive analyses using best-fitting models. Using a range of
reconstruction methods, we were able to resolve many contested nodes, such as a sister relationship
of Chytridiomyceta to all other non-Opisthosporidia fungi (with Chytridiomycota being sister to
Monoblepharomycota + Neocallimastigomycota), a branching of Blastocladiomycota + Sanchytriomycota
after the Chytridiomyceta but before other non-Opisthosporidia fungi, and a branching of Glomeromycota
as sister to the Dikarya. Our up-to-date fungal tree of life will serve as a springboard for future investiga-
tions on the early evolution of fungi.
3628 Current Biology 32, 3628–3635, August 22, 2022 ª 2022 Elsevier Inc.
ll
Report
A B
branching of Chytridiomyceta (Chytridiomycota, Neocallimasti- of CATGTR compared to BI-C60 (Table S2). We also removed
gomycota, and Monoblepharomycota) as sister to all other non- 25% and 50% of the most compositionally heterogeneous sites
Opisthosporidia fungi; the sister relationship of Chytridiomycota from the alignment in order to reduce compositional heterogene-
to a clade comprising Neocallimastigomycota + Monoblepharo- ity, and additional trees were reconstructed from these new
mycota; the branching of Blastocladiomycota + Sanchytriomy- alignments with CATGTR (Figure 2). The inferred trees were in
cota, which diverged after the Chytridiomyceta but before other agreement with the CATGTR tree reconstructed from the full-
non-Opisthosporidia fungi; the sister relationship of Olpidium to length alignment (Figures 1A and 2).
all other non-zoosporic fungi; and the sister relationship of Pucci- In order to understand the extent to which fast-evolving sites,
niomycotina to Agaricomycotina/Wallemiomycotina + Ustilagino- which are more likely to be homoplasic, led to systematic bias in
mycotina. Aphelidiomycota was fully supported as sister taxon to the ML-C60 tree inference due to model misspecification,21 the
the entire non-Opisthosporidia clade and distinct from Rozello- fastest-evolving sites were identified using the -wsr function in
mycota and Microsporidia, although Aphelidiomycota was repre- IQ-Tree under the LG+C60+G+F model (STAR Methods) and
sented by only one species in our analysis (Figure 1). progressively removed from the alignment in increments of
Ambiguity remained concerning the diverging pattern of the 10,000 sites. Each sub-alignment was then subjected to further
Glomeromycota (arbuscular mycorrhizal fungi), either as sister ML tree inferences using the LG+C60+G+F model, and node
to Mucoromycota + Mortierellomycota as reconstructed in the support was inferred by ultrafast bootstrap approximation.
MSC and ML-C60 trees or as sister to the Dikarya (Ascomycota + Because of the higher computational demand of BI, this analysis
Basidiomycota, and possibly Entorrhizomycota, which lack was run for ML inferences only. Interestingly, whereas the boot-
genomic/transcriptomic data12) as reconstructed in the CATGTR strap support for Dikarya remained maximal throughout all the
tree (Figure 1). Another discrepancy between the inferred tree to- analyses, the support for the sister relationship of Glomeromy-
pologies was the placement of the opisthosporid Mitosporidium cota to Mucoromycota + Mortierellomycota decreased rapidly
as either sister to only Paramicrosporidium (MSC and ML-C60; while the support for their closer relationship to the Dikarya, as
Methods S1C and S1D) or sister to all other Microsporidia seen in the CATGTR trees reconstructed from complete and
including Paramicrosporidium (CATGTR; Figure 1A). pruned alignments, increased, reaching 92% after the removal
of 30,000 sites (Figure 3). Further removal of sites either led to
Tree evaluation an unsupported branching or recovered the Glomeromycota +
The conflicts in the branching of the Glomeromycota and Mito- Dikarya topology again (100,000 sites removed; Figure 3). For
sporidium were fully supported by bootstrap (ML-C60) and pos- the fast-evolving microsporidians,22 the sister relationship of Mi-
terior probability (CATGTR) analyses. Both alternative branching tosporidium to all other Microsporidia, as inferred by CATGTR,
orders in the CATGTR topology were also rejected by ML in was only supported when the 120,000 fastest-evolving sites
approximately unbiased tests (Glomeromycota + Dikarya, p- were removed from the alignment (i.e., 8,450 sites remained;
AU = 0.022; Mitosporidium + all other Microsporidia, p-AU = Figure 3).
0.011). To test whether the conflicts were due to the usage of Finally, to assess whether the here-received topologies were
different tree reconstruction methods (ML or BI), a further tree skewed due to long-branch attraction (LBA) effects,23 further
was derived using BI under the same C60 model that was ML-C60 and CATGTR trees were calculated but with the San-
used for the ML tree. This tree (hereafter BI-C60; Methods chytriomycota and the nine fastest-evolving Microsporidia spe-
S1E) recapitulated the topology obtained with ML-C60 indicating cies removed. In both trees, the branching order of the remaining
that differences were due to the evolutionary model rather than fungal groups was fully congruent to the branching order that
the inference method. To test which model better minimized in- had been recovered before under the same evolutionary model,
adequacy in describing compositional heterogeneity, posterior using the alignment that includes the fast-evolving taxa
predictive analyses were conducted that resulted in a better fit (compare Figure 1 with Figure S1).
with an increasing diversity of fungi for which genomic/transcrip- novel eukaryotic super-group. Genome Biol. Evol. 10, 427–433. https://
tomic data will be available, several clades will be reshuffled. doi.org/10.1093/gbe/evy014.
Likewise, the increase of sequence data in general will allow the 2. Strassert, J.F.H., Jamy, M., Mylnikov, A.P., Tikhonenkov, D.V., and Burki,
inference of substitution matrices from concatenated protein F. (2019). New phylogenomic analysis of the enigmatic phylum Telonemia
further resolves the eukaryote tree of life. Mol. Biol. Evol. 36, 757–765.
sequence alignments (as opposed to pre-defined matrices used
https://doi.org/10.1093/molbev/msz012.
here), which have a better signal-to-noise ratio than DNA align-
3. Irisarri, I., Strassert, J.F.H., and Burki, F. (2022). Phylogenomic insights
ments and are thus favored when studying ancient nodes like in
into the origin of primary plastids. Syst. Biol. 71, 105–120. https://doi.
this present study. org/10.1093/sysbio/syab036.
4. Gawryluk, R.M.R., Tikhonenkov, D.V., Hehenberger, E., Husnik, F.,
STAR+METHODS Mylnikov, A.P., and Keeling, P.J. (2019). Non-photosynthetic predators
are sister to red algae. Nature 572, 240–243. https://doi.org/10.1038/
Detailed methods are provided in the online version of this paper s41586-019-1398-6.
and include the following: 5. Schön, M.E., Zlatogursky, V.V., Singh, R.P., Poirier, C., Wilken, S., Mathur,
V., Strassert, J.F.H., Pinhassi, J., Worden, A.Z., Keeling, P.J., and et al.
d KEY RESOURCES TABLE (2021). Single cell genomics reveals plastid-lacking Picozoa are close rel-
d RESOURCE AVAILABILITY atives of red algae. Nat. Commun. 12, 6651. https://doi.org/10.1038/
s41467-021-26918-0.
B Lead contact
B Materials availability 6. Voigt, K., James, T.Y., Kirk, P.M., Santiago, A.L.C.M., Waldman, B.,
Griffith, G.W., Fu, M., Radek, R., Strassert, J.F.H., Wurzbacher, C., et al.
B Data and code availability
(2021). Early-diverging fungal phyla: taxonomy, species concept, ecology,
d EXPERIMENTAL MODEL AND SUBJECT DETAILS distribution, anthropogenic impact, and novel phylogenetic proposals.
d METHOD DETAILS Fungal Divers. 109, 59–98. https://doi.org/10.1007/s13225-021-00480-y.
B Phylogenomic dataset construction 7. Strassert, J.F.H., Irisarri, I., Williams, T.A., and Burki, F. (2021). A molecular
B Phylogenomic analyses timescale for eukaryote evolution with implications for the origin of red
B Sites removal analyses algal-derived plastids. Nat. Commun. 12, 1879. https://doi.org/10.1038/
B Long branch attraction test s41467-021-22044-z.
d QUANTIFICATION AND STATISTICAL ANALYSIS 8. Hawksworth, D.L. (2001). The magnitude of fungal diversity: the 1.5 million
species estimate revisited. Mycol. Res. 105, 1422–1432. https://doi.org/
10.1017/s0953756201004725.
SUPPLEMENTAL INFORMATION
9. Hawksworth, D.L., Lücking, R., Joseph, H., and James, T.Y. (2017). Fungal
Supplemental information can be found online at https://doi.org/10.1016/j. diversity revisited: 2.2 to 3.8 million species. Microbiol. Spectr. 5. 5-4.10.
cub.2022.06.057. https://doi.org/10.1128/microbiolspec.funk-0052-2016.
10. Galindo, L.J., López-Garcı́a, P., Torruella, G., Karpov, S., and Moreira, D.
ACKNOWLEDGMENTS (2021). Phylogenomics of a new fungal phylum reveals multiple waves of
reductive evolution across Holomycota. Nat. Commun. 12, 4973.
J.F.H.S. acknowledges support from the German Research Foundation (DFG; https://doi.org/10.1038/s41467-021-25308-w.
grant STR1349/2-1, project no. 432453260), and research was partially funded 11. Wijayawardene, N.N. (2020). Outline of fungi and fungus-like taxa.
by the German Federal Ministry of Education and Research (BMBF, Förder- Mycosphere 11, 1060–1456. https://doi.org/10.5943/mycosphere/11/1/8.
kennzeichen 033W034A). The authors thank the High-Performance 12. James, T.Y., Stajich, J.E., Hittinger, C.T., and Rokas, A. (2020). Toward a
Computing Service of ZEDAT, Freie Universita €t Berlin, for computing time.
fully resolved fungal tree of life. Annu. Rev. Microbiol. 74, 291–313. https://
We also thank four anonymous reviewers for their helpful comments on the doi.org/10.1146/annurev-micro-022020-051835.
manuscript. Finally, we thank David Moreira and Luis Javier Galindo for sharing
13. James, T.Y., Pelin, A., Bonen, L., Ahrendt, S., Sain, D., Corradi, N., and
sequence data for Amoeboradix gromovi and Sanchytrium tribonematis.
Stajich, J.E. (2013). Shared signatures of parasitism and phylogenomics
unite cryptomycota and microsporidia. Curr. Biol. 23, 1548–1553.
AUTHOR CONTRIBUTIONS https://doi.org/10.1016/j.cub.2013.06.057.
14. Keeling, P.J., Luker, M.A., and Palmer, J.D. (2000). Evidence from beta-
J.F.H.S. conceived the study, assembled and curated the data, performed all
tubulin phylogeny that microsporidia evolved from within the fungi. Mol.
analyses, and drafted the manuscript. M.T.M. contributed to the final manu-
Biol. Evol. 17, 23–31. https://doi.org/10.1093/oxfordjournals.molbev.
script. Both authors read and approved the final version.
a026235.
DECLARATION OF INTERESTS 15. López-Escardó, D., López-Garcı́a, P., Moreira, D., Ruiz-Trillo, I., and
Torruella, G. (2018). Parvularia atlantis gen. et sp. nov., a nucleariid filose
The authors declare no competing interests. Amoeba (Holomycota, Opisthokonta). J. Eukaryot. Microbiol. 65,
170–179. https://doi.org/10.1111/jeu.12450.
Received: March 23, 2022 16. Schuler, G.A., and Brown, M.W. (2019). Description of Armaparvus lan-
Revised: May 10, 2022 guidus n. gen. n. sp. confirms ultrastructural unity of Cutosea
Accepted: June 16, 2022 (Amoebozoa, Evosea). J. Eukaryot. Microbiol. 66, 158–166. https://doi.
Published: July 12, 2022 org/10.1111/jeu.12640.
17. Tice, A.K., Shadwick, L.L., Fiore-Donno, A.M., Geisen, S., Kang, S.,
REFERENCES Schuler, G.A., Spiegel, F.W., Wilkinson, K.A., Bonkowski, M., Dumack,
K., et al. (2016). Expansion of the molecular and morphological diversity
1. Brown, M.W., Heiss, A.A., Kamikawa, R., Inagaki, Y., Yabuki, A., Tice, of Acanthamoebidae (Centramoebida, Amoebozoa) and identification of
A.K., Shiratori, T., Ishida, K.I., Hashimoto, T., Simpson, A.G.B., and a novel life cycle type within the group. Biol. Direct 11, 69. https://doi.
Roger, A.J. (2018). Phylogenomics places orphan protistan lineages in a org/10.1186/s13062-016-0171-0.
STAR+METHODS
RESOURCE AVAILABILITY
Lead contact
Further information and requests for resources should be directed to and will be fulfilled by the lead contact, Jürgen F.H. Strassert
(juergen.strassert@igb-berlin.de).
Materials availability
This study did not generate new unique reagents.
d All data needed to evaluate the conclusions of this study are presented in the paper and the Supplemental information. Raw
sequence data are available under the following web-links: https://doi.org/10.6084/m9.figshare.12417881.v2, https://fungi.
ensembl.org/index.html, https://metazoa.ensembl.org/index.html, https://protists.ensembl.org/index.html, https://www.
imicrobe.us/#/projects/104, https://mycocosm.jgi.doe.gov/mycocosm/home, and https://www.ncbi.nlm.nih.gov/.
d The script alignment_pruner.pl is available at https://github.com/novigit/davinciCode/tree/master/perl.
d Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.
Genomes and transcriptomes were downloaded from EukProt, Ensembl, Imicrobe, Mycocosm, and NCBI. Sequences of outgroup
taxa were added from Strassert et al.7 For detailed source information, see Table S1. Taxa were selected such that all major fungal
groups were represented, neglecting taxa with more missing data. The lineage information in this study was assigned according to
the NCBI taxonomy (https://ncbi.nlm.nih.gov/taxonomy) with some modifications especially regarding the phylum assignments,
which, if not otherwise noted, were adopted in order to match those recently proposed by Wijayawardene et al.11 Analyses were car-
ried out using the Zedat high-performance computing infrastructure at Freie Universita€t Berlin.63
METHOD DETAILS
Phylogenomic analyses
The taxon-reduced matrix was used to reconstruct a ML tree with IQ-Tree using the best-fitting site-heterogeneous model
LG+C60+G+F with the PMSF approach20 to obtain nonparametric bootstrap support (100 replicates; Figure 1B; Methods S1C).
Copies of this tree were manually edited to test alternative topologies with the approximately unbiased test (AU test66). The
taxon-reduced matrix was also subjected to Bayesian analyses using PhyloBayes-MPI v. 1.855 with the site-heterogeneous
CAT+GTR+G4 model and the -dc option to remove constant sites. Three independent Markov Chain Monte Carlo (MCMC) chains
were run for 1,600 generations (all sampled). For each chain, the burnin period was estimated by monitoring evolution of the log-likeli-
hood (Lnl) at each sampled point, and 750 generations were removed from all chains. A consensus tree of all three chains (Figure 1A)
was built and global convergence between chains was assessed with PhyloBayes’ built-in tools. As frequently recognised in
PhyloBayes, global convergence was not received in all combinations of the chains with maxdiff = 1 and meandiff = 0.0167504.
To discriminate whether potential discrepancies between this tree and the tree calculated by ML are due to the usage of different
software packages or due to different evolutionary models, a further Bayesian tree (Methods S1E) was inferred under the LG+C60
model (-dc -catfix C60 -lg -dgam 4; three chains, each with 2,000 generations; convergence was not received; maxdiff = 1,
meandiff = 0.0268007). Posterior predictive analyses were then run for both tree inferences in order to informatively select the
best topology.
The single-protein ML trees of both the full and the taxon-reduced datasets (Data S1) were also analyzed under the multi-species
Coalescent (MSC) model in ASTRAL-III v. 5.7.747 to incorporate protein tree uncertainties (Figure 1; Methods S1B and S1D). Prior to
that, branches with bootstrap support <10 were collapsed as recommended by the developers. Quadripartition supports were calcu-
lated from quartet frequencies among the set of input trees.67 The single-protein ML trees of the reduced dataset (with un-collapsed
branches) and the ML tree inferred from the taxon-reduced matrix with LG+C60+G+F-PMSF were additionally used to calculate
quartet-based internode certainty scores56 (Data S1).
Phylogenetic model selection was based on IQ-Tree’s Bayesian information criterion.52 Statistical support for phylogenies was ob-
tained using non-parametric bootstraps, ultrafast bootstraps,65 Bayesian posterior probabilities in BI,55 and quadripartition support
values in ASTRAL.47,67 Alternative topologies were tested using the approximately unbiased test implemented in IQ-Tree,52 and the
fit of evolutionary models (CATGTR and BI-C60) was tested using posterior predictive analyses implemented in the PhyloBayes soft-
ware package.55