Professional Documents
Culture Documents
Diversity and Functional Landscapes in The Microbiota of Animals in The Wild
Diversity and Functional Landscapes in The Microbiota of Animals in The Wild
Molecular Cell Biology, Weizmann Institute of Science, Rehovot 7610001, Israel. 4Department of Developmental Biology, Stanford University, Stanford, CA 94305, USA.
5Department of Genetics, Stanford University, Stanford, CA 94305, USA. 6Augmanity, Rehovot, Israel. 7Center for Studies in Physics and Biology, Rockefeller University,
Animals in the wild are able to subsist on pathogen-infected and poisonous food and show immunity to
various diseases. These may be due to their microbiota, yet we have a poor understanding of animal
Compared to the human microbiota, which has been exten- suggest that despite previous studies of the human gut mi-
sively studied (1, 2), the microbiota of animals has received crobiome, a large portion of the human microbiome remains
less focus. Microbiome research was pioneered with several unexplored. Given that less resources have been invested in
well accepted methodologies; such as metagenomic profiling wildlife microbiome studies, it can be assumed that the po-
applied to several animal species (3–13). Metagenomics anal- tential for discovery of previously undescribed microbiota is
ysis has been combined with 16S ribosomal RNA sequencing particularly high.
to investigate how diet shapes the gut microbiota within var- It is becoming increasingly evident that animal microbi-
ious host species (14), in particular primates (15, 16). Diet omes are a rich source of biological functions that may have
metabarcoding, in particular, was useful in the study of die- biotechnological impact. For example, recent microbiome
tary niche partitioning and composition (17–23). 16S riboso- studies have reported the discovery of antimicrobials (41–45),
mal RNA sequencing alone has provided compositional data and immunomodulators (46), as well as industrial enzymes
and taxonomy-related insights about the microbiota in many (47–51). Moreover, animals in the wild exhibit adaptations
animal species [(24–27) just to cite a few]. Other studies ad- such as the safe consumption of rotting, pathogen-infected
dressed behavioral aspects and their relation to animal mi- meat and poisonous plants (52, 53); production of highly po-
crobiota [for example (28–33)]. Still, despite these studies, a tent toxins (54); bioluminescence (55); specific immunity to
comprehensive catalog and unified analysis of the microbiota various diseases (56) and microbial pathogens (57); regener-
of animals across different geographies, feeding behaviors, ative capabilities (58); and, in some species, extreme longev-
and other traits, is still lacking. ity (59, 60). Some of these adaptations, such as toxin
With advances in shotgun metagenomic sequencing tech- production and bioluminescence, are conferred, at least in
nologies and algorithms to build metagenomic assembled ge- part, by microbial symbionts living in and on the animal (54,
nomes (MAGs), several studies aimed to better capture the 55). However, despite these examples, a comprehensive view
diversity of gut and other microbes by reconstruction of mi- of the association between animal traits and its microbiota is
crobial genomes (34–36). By utilizing metagenomic assembly still lacking.
methods to build genomes of uncultured bacteria, several Beyond its biotechnological potential, studying animal
studies successfully reconstructed hundreds of thousands of microbiomes improves our understanding of host-microbe
MAGs from the human microbiome (37–39). Additional ef- ecology and bi-directional interactions. The microbiome af-
forts were made to create catalogs of species-level microbial fects several factors in host physiology [immunity, digestion,
genomes, resulting in an estimation of over 4,500 distinct mi- energy metabolism, development etc. (27, 30)]. In the other
crobial species in the human gut (39, 40). These studies direction, host diet and phylogeny are major forces that
First release: 25 March 2021 www.sciencemag.org (Page numbers not final at time of first release) 1
shape the gut microbiome composition (25, 61–63). This rela- an air-breathing catfish (C. gariepinus) (Fig. 1, A to C). We
tionship between host phylogeny and diet ignited studies of designed a standardized sample collection process, managed
the co-evolution of host-microbiome, but most of these stud- by a custom-designed, publicly-available mobile application
ies have focused on mammals and/or were limited to differ- (Zerion Software iFormBuilder). The application was de-
ences in microbiome composition (27). signed by reviewing the collection procedure, to enable effi-
The microbiota of wild animals is also a natural reservoir cient documentation of the metadata. The application
for pathogens of both animals and humans, whose mapping documents sampling time and geographical coordinates,
could elucidate the timing and routes of their transmission gathers manually-provided details such as animal species
into the human population (64). Recent studies have mapped identity, sex, health state, and other traits (when discernible),
pathogen reservoirs in wild animals (64, 65), demonstrating and captures photographs of the animal and/or its fecal mat-
the potential of this approach in surveillance and forecast of ter, when possible. Fecal matter was collected from identified
human pathogen dynamics. Metagenomics have been used to animals and as close to excretion as possible without disturb-
map antibiotic resistance genes (resistomes) in human, ing the animal in its natural habitat (table S1). All fecal mat-
chicken, pigs, migratory birds, and non-human primates (13, ter was collected with the same protocol and using the same
66, 67). Many pathogenic microbe strains differ from non- collection kit, and then shipped to our laboratory. We then
First release: 25 March 2021 www.sciencemag.org (Page numbers not final at time of first release) 2
previously unsampled bacterial species (Fig. 2B). As a control, any known phylum (Fig. 3A). The number of novel bacteria
we applied the same metagenome assembly pipeline to 240 varies across different phyla (Fig. 3B), with Verrucomicrobia
human gut microbiota samples that were not used in the con- being the most enriched (1/27 known rSGBs). Fusobacteria
struction of the comparison reference database (80), and exhibit a unique distribution of maximum ANI values, imply-
found that over 99% of the 962 species-level genomic bins ing a taxonomic continuity between species and genus levels.
that we constructed from these control human gut samples As an illustrative example, we examined a specific clade
were classified as known (Fig. 2B). of 11 novel species (star, Fig. 3C) that belong to the Verru-
To assess the completeness of our database, we performed comicrobia phylum, for which we reconstructed a distance-
a taxa accumulation analysis. Namely, we randomly ordered based phylogenetic tree (Fig. 3C) and performed a Kyoto en-
our samples as well as the above control human gut microbi- cyclopedia of genes and genomes (KEGG) pathways enrich-
ome samples, and computed the total number of bacterial ge- ment analysis (82). We found that this clade is enriched with
nomes that accumulate in these sample sets as a function of a unique subset of pathways, providing an initial framework
samples added in this random order. As expected, the accu- for understanding the role of this unknown bacterial clade.
mulation rate of bacterial species of mammalian origin was This demonstrates the importance of applying comparative
higher than that from human samples, likely because the methods to describe novel clades, for which taxonomic anno-
First release: 25 March 2021 www.sciencemag.org (Page numbers not final at time of first release) 3
same) class. For example, as seen in clusters 1, 2 and 4 (Fig. of each rSGB for all Aves and Mammalia from the wild (Fig.
4D), the clusters are associated with Mammalia, Reptilia and 5C). Indeed, some rSGBs exhibit stronger association with the
Aves, respectively, and belong to at least two bacterial phyla. diet rather than with the host class. For example, rSGB 435,
Such a strong host association suggests that these clusters an unknown species of the bacterial class Clostridia, which
may have a role in host-microbiome interactions. Cluster 1, we found exclusively in carnivores, is known to be related to
for example, consists of the bacterial phyla Firmicutes and toxin production (85). This association may reveal mecha-
Bacteroidetes, consistent with a co-occurrence network re- nisms of toxin metabolism in the guts of carnivores. In con-
ported in a recent study (25), but also contains members of trast, rSGB 292, an unknown species of the bacterial phylum
other, less common, phyla. In some cases, as demonstrated in Firmicutes, was found solely in mammals, while having no
cluster 3, the members of the cluster are found in various an- effect size related to the examined lifestyle traits. It was con-
imal classes, but all belong to the same bacterial phyla. structed from the microbiome of a southern sea lion (O. fla-
vescens), but was also found in lower abundance in several
Microbiota composition associates with host class and species from the Carnivora, Primates and Artiodactyla orders,
traits having very different lifestyles and habitats. Interestingly,
We next investigated how microbiota composition is associ- such microbial species may have a strong interaction with a
First release: 25 March 2021 www.sciencemag.org (Page numbers not final at time of first release) 4
confoundment between these traits and taxonomy is a key effects. We found numerous orthologs that are enriched in
difficulty for properly analyzing these associations, we specific diet groups (Fig. 6G).
showed that in some cases (both at the microbial species and Next, we aimed to examine whether the enriched
the phylum level) the abundance is associated exclusively orthologs correspond to specific physiological processes. To
with either the taxonomy or a trait. These findings suggest this end, we tested for overrepresentation of these orthologs
that some microbial niches may have a specific role in the in KEGG pathways. Our analysis revealed several intriguing
adaptation to environmental conditions (regardless of the associations. For example, carnivore and omnivore microbi-
taxonomy). In contrast, other microbes may have a co-evolu- omes exhibited very significant enrichment in components of
tionary role in specific taxonomy groups regardless of life- the ABC transporter group (P=2.5x10−3 and P=3.4x10−23, re-
style. spectively, FDR corrected Fisher exact test for the enriched
orthologs in these groups relative to herbivores, Fig. 6H), and
The functional landscape of the microbiota of animals the phosphotransferase system (PTS) (P=2.5x10−49 and
in the wild P=6.5x10−12, Fig. 6I), in agreement with previous reports (91,
To obtain a deeper functional view of the wild animal micro- 92). Specifically, carnivore/omnivore-enriched orthologs in-
biota, we studied its gene functional content by performing cluded ABC transporters for amino acids (e.g., methionine,
First release: 25 March 2021 www.sciencemag.org (Page numbers not final at time of first release) 5
associations and may produce biological insights into the Discussion
contributions to the hosts that are made by microbiomes. Organism-associated microbial communities have a com-
plex and profound impact on the biology of their hosts. They
Discovery of novel toxin-metabolizing genes in carrion critically contribute to host adaptation to its habitat, and are
eaters linked in many cases, as has been recently shown in humans,
Finally, as a proof of concept for the potential of our animal to the health and disease state of the host. However, our un-
microbiota resource to serve as a reservoir of bacterial func- derstanding of these phenomena in non-human hosts is
tions, we searched for genes that metabolize bacterial toxins much less developed. Metagenomic studies of animal micro-
within the microbiota of carrion eaters. Our underlying hy- biota typically probed animals in captivity, which has likely
pothesis was that microbial symbionts within the gut micro- been altered compared to the microbiota of counterpart ani-
biota of carrion eaters confer at least part of the ability to mals from their natural habitat. Other works have focused
neutralize toxins in the food of their hosts. This hypothesis more on microbial composition, using such methods as 16S
was previously tested in new world vultures (10). For the val- ribosomal RNA sequencing, providing mainly taxonomy-re-
idation study, we focused on microbiota from the gut of grif- lated insights (24, 25). Yet, a large-scale view at the meta-
fon vultures (Gyps fulvus), which we sampled weekly from genomic level is missing. Since the metagenomic approach
First release: 25 March 2021 www.sciencemag.org (Page numbers not final at time of first release) 6
Finally, our experimental study identified that much poten- among various studies].
tial remains for discovery inside animal microbiomes. Mapping the microbiota of animals in the wild could shed
Our results enable a glimpse into potentially functional light on the natural reservoir of microbial pathogens. In par-
roles of gut microbiomes in the biology of their hosts. For ex- ticular, it could enable the surveillance and tracking of the
ample, we found enrichment in orthologs associated with an- movements of specific pathogens from their animal hosts
tibiotic biosynthesis and bacterial motility. This combination into the human population, and enable early warning of po-
highlights the possibility that carnivore microbiomes are flex- tential outbreaks. In an initial screen, we identified an esti-
ible functionally and anatomically, having been adapted to mated 100 bacterial species that are known human
rapidly responding to the active and dynamic environment in pathogens, such as Klebsiella, Enterobacter, Shigella, and
which they subsist. This pattern is absent from herbivore mi- Clostridium species, including several pathogens of plants,
crobiomes, as their hosts’ gastrointestinal tracts are signifi- such as Pantotea. Further studies should elucidate the feasi-
cantly longer and structurally complex, and their diet is less bility of reservoir mapping relying on our database, for the
likely to be contaminated with animal pathogens. The ob- benefit of public health.
served enrichment in orthologs associated with xenobiotic Our study demonstrates the importance and potential of
detoxification may, hypothetically, reflect the bioaccumula- complementing the computational work with large synthetic
First release: 25 March 2021 www.sciencemag.org (Page numbers not final at time of first release) 7
the phylogenetic tree of all rSGBs. Novelty of rSGBs was as- Sicheritz-Pontén, S. J. Sørensen, M. T. P. Gilbert, G. R. Graves, L. H. Hansen, The
sessed by comparison to a collection of publicly available ref- microbiome of New World vultures. Nat. Commun. 5, 5498 (2014).
doi:10.1038/ncomms6498 Medline
erences. For each sample, relative abundance of each rSGB 11. J. Quan, Z. Wu, Y. Ye, L. Peng, J. Wu, D. Ruan, Y. Qiu, R. Ding, X. Wang, E. Zheng, G.
was estimated and followed by diversity, co-existence and dif- Cai, W. Huang, J. Yang, Metagenomic characterization of intestinal regions in pigs
ferential abundance analyses. Functional annotation (includ- with contrasting feed efficiency. Front. Microbiol. 11, 32 (2020).
ing COG and KEGG) was inferred to each gene. To avoid doi:10.3389/fmicb.2020.00032 Medline
12. J. L. Chun, S. Y. Ji, S. D. Lee, Y. K. Lee, B. Kim, K. H. Kim, Difference of gut
phylogenetic confoundment, various phylogenetic correction microbiota composition based on the body condition scores in dogs. J. Anim. Sci.
methods were applied. For synthetic validation, genes were Technol. 62, 239–246 (2020). Medline
selected, expressed in E. coli and purified using standard 13. T. P. Campbell, X. Sun, V. H. Patel, C. Sanz, D. Morgan, G. Dantas, The microbiome
methods. Proteases were screened for cleavage activity using and resistome of chimpanzees, gorillas, and humans across host lifestyle and
geography. ISME J. 14, 1584–1599 (2020). doi:10.1038/s41396-020-0634-2
peptide arrays containing 120 solvent-accessible peptide se- Medline
quences, ranging from 2 to 6 amino acids, derived from 12 14. B. D. Muegge, J. Kuczynski, D. Knights, J. C. Clemente, A. González, L. Fontana, B.
bacterial toxins with solved structures in the PDB. All statis- Henrissat, R. Knight, J. I. Gordon, Diet drives convergence in gut microbiome
tical analyses were performed using hypothesis testing and functions across mammalian phylogeny and within humans. Science 332, 970–
974 (2011). doi:10.1126/science.1198719 Medline
the resulting P-values were corrected to account for multiple
First release: 25 March 2021 www.sciencemag.org (Page numbers not final at time of first release) 8
25. N. D. Youngblut, G. H. Reischer, W. Walters, N. Schuster, C. Walzer, G. Stalder, R. sponges. Mar. Drugs 10, 1192–1202 (2012). doi:10.3390/md10061192 Medline
E. Ley, A. H. Farnleitner, Host diet and evolutionary history explain different 42. N. Akbar, R. Siddiqui, K. A. Sagathevan, N. A. Khan, Gut bacteria of animals/pests
aspects of gut microbiome diversity among vertebrate clades. Nat. Commun. 10, living in polluted environments are a potential source of antibacterials. Appl.
2200 (2019). doi:10.1038/s41467-019-10191-3 Medline Microbiol. Biotechnol. 103, 3955–3964 (2019). doi:10.1007/s00253-019-09783-
26. K. J. McDermid, R. P. Kittle III, A. Veillet, S. Plouviez, L. Muehlstein, G. H. Balazs, 2 Medline
Identification of gastrointestinal microbiota in Hawaiian green turtles (Chelonia 43. M. G. Chevrette, C. M. Carlson, H. E. Ortega, C. Thomas, G. E. Ananiev, K. J. Barns,
mydas). Evol. Bioinform. Online 16, 1176934320914603 (2020). A. J. Book, J. Cagnazzo, C. Carlos, W. Flanigan, K. J. Grubbs, H. A. Horn, F. M.
doi:10.1177/1176934320914603 Medline Hoffmann, J. L. Klassen, J. J. Knack, G. R. Lewin, B. R. McDonald, L. Muller, W. G.
27. T. J. Colston, C. R. Jackson, Microbiome evolution along divergent branches of the P. Melo, A. A. Pinto-Tomás, A. Schmitz, E. Wendt-Pienkowski, S. Wildman, M.
vertebrate tree of life: What is known and unknown. Mol. Ecol. 25, 3776–3800 Zhao, F. Zhang, T. S. Bugni, D. R. Andes, M. T. Pupo, C. R. Currie, The antimicrobial
(2016). doi:10.1111/mec.13730 Medline potential of Streptomyces from insect microbiomes. Nat. Commun. 10, 516
28. G. Bennett, M. Malone, M. L. Sauther, F. P. Cuozzo, B. White, K. E. Nelson, R. M. (2019). doi:10.1038/s41467-019-08438-0 Medline
Stumpf, R. Knight, S. R. Leigh, K. R. Amato, Host age, social group, and habitat 44. J. G. Ganley, G. Carr, T. R. Ioerger, J. C. Sacchettini, J. Clardy, E. R. Derbyshire,
type influence the gut microbiota of wild ring-tailed lemurs (Lemur catta). Am. J. Discovery of antimicrobial lipodepsipeptides produced by a Serratia sp. within
Primatol. 78, 883–892 (2016). doi:10.1002/ajp.22555 Medline mosquito microbiomes. ChemBioChem 19, 1590–1594 (2018).
29. M. Gacias, S. Gaspari, P.-M. G. Santos, S. Tamburini, M. Andrade, F. Zhang, N. doi:10.1002/cbic.201800124 Medline
Shen, V. Tolstikov, M. A. Kiebish, J. L. Dupree, V. Zachariou, J. C. Clemente, P. 45. J. L. Ochoa, L. M. Sanchez, B.-M. Koo, J. S. Doherty, M. Rajendram, K. C. Huang, C.
Casaccia, Microbiota-driven transcriptional changes in prefrontal cortex override A. Gross, R. G. Linington, Marine mammal microbiota yields novel antibiotic with
First release: 25 March 2021 www.sciencemag.org (Page numbers not final at time of first release) 9
lack of organ regeneration in amniotes. J. Exp. Zool. B Mol. Dev. Evol. 336, 145– 76. W. Guo, S. Mishra, C. Wang, H. Zhang, R. Ning, F. Kong, B. Zeng, J. Zhao, Y. Li,
164 (2021). doi:10.1002/jez.b.22901 Medline Comparative study of gut microbiota in wild and captive giant pandas (Ailuropoda
59. V. Briggs-Gonzalez, C. Bonenfant, M. Basille, M. Cherkiss, J. Beauchamp, F. melanoleuca). Genes 10, 827 (2019). doi:10.3390/genes10100827 Medline
Mazzotti, Life histories and conservation of long-lived reptiles, an illustration with 77. E. W. Sayers, M. Cavanaugh, K. Clark, J. Ostell, K. D. Pruitt, I. Karsch-Mizrachi,
the American crocodile (Crocodylus acutus). J. Anim. Ecol. 86, 1102–1113 (2017). GenBank. Nucleic Acids Res. 48, D84–D86 (2020). Medline
doi:10.1111/1365-2656.12723 Medline 78. F. A. B. von Meijenfeldt, K. Arkhipova, D. D. Cambuy, F. H. Coutinho, B. E. Dutilh,
60. V. Quesada, S. Freitas-Rodríguez, J. Miller, J. G. Pérez-Silva, Z.-F. Jiang, W. Tapia, Robust taxonomic classification of uncharted microbial sequences and bins with
O. Santiago-Fernández, D. Campos-Iglesias, L. F. K. Kuderna, M. Quinzin, M. G. CAT and BAT. Genome Biol. 20, 217 (2019). doi:10.1186/s13059-019-1817-x
Álvarez, D. Carrero, L. B. Beheregaray, J. P. Gibbs, Y. Chiari, S. Glaberman, C. Ciofi, Medline
M. Araujo-Voces, P. Mayoral, J. R. Arango, I. Tamargo-Gómez, D. Roiz-Valle, M. 79. C. Jain, L. M. Rodriguez-R, A. M. Phillippy, K. T. Konstantinidis, S. Aluru, High
Pascual-Torner, B. R. Evans, D. L. Edwards, R. C. Garrick, M. A. Russello, N. throughput ANI analysis of 90K prokaryotic genomes reveals clear species
Poulakakis, S. J. Gaughran, D. O. Rueda, G. Bretones, T. Marquès-Bonet, K. P. boundaries. Nat. Commun. 9, 5114 (2018). doi:10.1038/s41467-018-07641-9
White, A. Caccone, C. López-Otín, Giant tortoise genomes provide insights into Medline
longevity and age-related disease. Nat. Ecol. Evol. 3, 87–95 (2019). 80. D. Zeevi, T. Korem, N. Zmora, D. Israeli, D. Rothschild, A. Weinberger, O. Ben-
doi:10.1038/s41559-018-0733-x Medline Yacov, D. Lador, T. Avnit-Sagi, M. Lotan-Pompan, J. Suez, J. A. Mahdi, E. Matot, G.
61. R. E. Ley, M. Hamady, C. Lozupone, P. J. Turnbaugh, R. R. Ramey, J. S. Bircher, M. Malka, N. Kosower, M. Rein, G. Zilberman-Schapira, L. Dohnalová, M. Pevsner-
L. Schlegel, T. A. Tucker, M. D. Schrenzel, R. Knight, J. I. Gordon, Evolution of Fischer, R. Bikovsky, Z. Halpern, E. Elinav, E. Segal, Personalized nutrition by
mammals and their gut microbes. Science 320, 1647–1651 (2008). prediction of glycemic responses. Cell 163, 1079–1094 (2015).
First release: 25 March 2021 www.sciencemag.org (Page numbers not final at time of first release) 10
evolution to a high purine and fat diet of carnivorans revealed by gut microbiomes 113. I. Letunic, P. Bork, Interactive Tree Of Life (iTOL) v4: Recent updates and new
and host genomes. Environ. Microbiol. 20, 1711–1722 (2018). doi:10.1111/1462- developments. Nucleic Acids Res. 47, W256–W259 (2019).
2920.14096 Medline doi:10.1093/nar/gkz239 Medline
93. C. C. Rimmer, E. K. Miller, K. P. McFarland, R. J. Taylor, S. D. Faccio, Mercury 114. D. Rothschild, S. Leviatan, A. Hanemann, Y. Cohen, O. Weissbrod, E. Segal, An
bioaccumulation and trophic transfer in the terrestrial food web of a montane atlas of robust microbiome associations with phenotypic traits based on large-
forest. Ecotoxicology 19, 697–709 (2010). doi:10.1007/s10646-009-0443-x scale cohorts from two continents. bioRxiv 2020.05.28.122325 [Preprint]. 18
Medline October 2020; https://doi.org/10.1101/2020.05.28.122325.
94. C. R. Dromard, Y. Bouchon-Navaro, S. Cordonnier, M. Guéné, M. Harmelin-Vivien, 115. S. C. Watts, S. C. Ritchie, M. Inouye, K. E. Holt, FastSpar: Rapid and scalable
C. Bouchon, Different transfer pathways of an organochlorine pesticide across correlation estimation for compositional data. Bioinformatics 35, 1064–1066
marine tropical food webs assessed with stable isotope analysis. PLOS ONE 13, (2019). doi:10.1093/bioinformatics/bty734 Medline
e0191335 (2018). doi:10.1371/journal.pone.0191335 Medline 116. J. Friedman, E. J. Alm, Inferring correlation networks from genomic survey data.
95. L. J. Revell, Size-correction and principal components for interspecific PLOS Comput. Biol. 8, e1002687 (2012). doi:10.1371/journal.pcbi.1002687
comparative studies. Evolution 63, 3258–3268 (2009). doi:10.1111/j.1558- Medline
5646.2009.00804.x Medline 117. J. Revell, phytools: An R package for phylogenetic comparative biology (and other
96. J. C. Uyeda, D. S. Caetano, M. W. Pennell, Comparative analysis of principal things). Methods Ecol. Evol. 3, 217–223 (2012). doi:10.1111/j.2041-
components can be misleading. Syst. Biol. 64, 677–689 (2015). 210X.2011.00169.x
doi:10.1093/sysbio/syv019 Medline 118. C. M. Mlewa, J. M. Green, Biology of the marbled lungfish, Protopterus aethiopicus
97. P. D. Polly, A. M. Lawing, A. Fabre, A. Goswami, Phylogenetic principal components Heckel, in Lake Baringo, Kenya. Afr. J. Ecol. 42, 338–345 (2004).
First release: 25 March 2021 www.sciencemag.org (Page numbers not final at time of first release) 11
package version. 5, 1–36 (2013). Published online 25 March 2021
141. D. Melo, G. Garcia, A. Hubbe, A. P. Assis, G. Marroig, EvolQG – An R package for 10.1126/science.abb5352
evolutionary quantitative genetics. F1000 Res. 4, 925 (2015).
doi:10.12688/f1000research.7082.2 Medline
142. K. Afiahayati, K. Sato, Y. Sakakibara, MetaVelvet-SL: An extension of the Velvet
assembler to a de novo metagenomic assembler utilizing supervised learning.
DNA Res. 22, 69–77 (2015). doi:10.1093/dnares/dsu041 Medline
143. H. Noguchi, T. Taniguchi, T. Itoh, MetaGeneAnnotator: Detecting species-specific
patterns of ribosomal binding site for precise gene prediction in anonymous
prokaryotic and phage genomes. DNA Res. 15, 387–396 (2008).
doi:10.1093/dnares/dsn027 Medline
144. B. Buchfink, C. Xie, D. H. Huson, Fast and sensitive protein alignment using
DIAMOND. Nat. Methods 12, 59–60 (2015). doi:10.1038/nmeth.3176 Medline
ACKNOWLEDGMENTS
The authors wish to thank the entire team of Wild Biotech and the Segal lab at the
Weizmann Institute of Science for valuable assistance with this study; Dr. M.
Kopel and Dr. I. Horowitz for providing valuable assistance with sampling; Ms. K.
First release: 25 March 2021 www.sciencemag.org (Page numbers not final at time of first release) 12
Downloaded from http://science.sciencemag.org/ on March 26, 2021
Fig. 1. Study cohort and design. (A) A collage of representative wild animal species profiled in the
present study. Animals were photographed in their natural habitats by the collection teams. Columns
cluster species based on sampling regions: (from left to right) Australia (Queensland), Falkland
Islands, Madagascar, Uganda, Israel. Animal names are color-coded based on class (see B for color
legend). (B) A pruned NCBI taxonomy tree of all species included in the cohort. Arc colors denote
animal class, dots outside arc denote sampling region. Branches represent taxonomic relations and
are not scaled to evolutionary distance. (C) World map showing sampling regions with sample size
from each region. (D) Study design.
First release: 25 March 2021 www.sciencemag.org (Page numbers not final at time of first release) 13
Fig. 2. Identification of unknown bacterial genomes
in the microbiota of wild animals. (A) Adding
genomes from our metagenomic constructed
bacterial genome database of wild animals
significantly improves the mapping rate of the
samples. A reference database was used [41,904
known bacterial genomes, collected from various
large-scale human microbiome studies and NCBI,
representing all known bacterial genomes (X-axis),
and using an expanded genome database that also
includes the 1,209 genomes that we assembled from
our cohort (Y-axis)]. Mapping rate median (across
samples) significantly increased from 7.2% to 21.0%
(inset) (P<4x10−8, Wilcoxon signed-rank test). (B) Wild
genomes introduce microbial members to the tree of
life. For each medium or higher-quality bin
(completeness >50%, contamination <5%), maximal
First release: 25 March 2021 www.sciencemag.org (Page numbers not final at time of first release) 14
Downloaded from http://science.sciencemag.org/ on March 26, 2021
First release: 25 March 2021 www.sciencemag.org (Page numbers not final at time of first release) 15
Fig. 3. Microbial genomes from wild animals expand the known phylogenetic diversity. (A) A
maximum-likelihood alignment based phylogenetic tree of the 1,209 rSGBs assembled in this study.
Inner color strip, bacterial phylum; Outer color strip, host class. Clades of novel genomes (novel
defined as having ANI lower than 83% compared to the reference database) are colored dark-red.
(B) Distribution of maximal ANI distance to the reference database is shown for the bacterial
genomes constructed from each phylum. Phyla are sorted by percentage of novel genomes within
the phyla. (C) Genomes that were taxonomically annotated as Verrucomicrobia were used to
construct a separate distance-based phylogenetic tree (left). A clade, consisting of only novel
species, was identified and denoted as a novel clade (star, top). Normalized counts of significantly
enriched (or depleted) KEGG pathways in the novel clade relative to the whole phylum are
represented by color intensity (bottom). (D) Genomes that were taxonomically annotated as
Spirochaete were used to construct a separate distance-based phylogenetic tree (left). The
Treponema genus is marked with a star. Normalized counts of significantly enriched (or depleted)
KEGG pathways of the Treponema genus relative to the whole phylum are represented by color
intensity (right).
First release: 25 March 2021 www.sciencemag.org (Page numbers not final at time of first release) 16
Downloaded from http://science.sciencemag.org/ on March 26, 2021
Fig. 4. Microbiota composition is associated with host class and exhibits co-existing bacterial clusters.
(A) Pairwise Bray-Curtis dissimilarity between pairs of samples within a host taxonomic rank shows that
bacterial similarity increases with host similarity (*P<0.05; ****P<0.0001, Mann-Whitney U test). (B) A
heatmap representing relative abundance of each rSGB (columns, colored by bacterial phylum) in each sample
(rows, colored by the host class). The heatmap was hierarchically clustered using average linkage based on
Euclidean distance, showing co-existence of groups of rSGBs within particular host groups. (C and D)
Correlation matrix calculated between vectors of abundances of rSGBs across all samples. rSGBs were
hierarchically clustered with average linkage based on Euclidean distance. Four clusters of co-existing bacteria
were visually selected and their abundance in the corresponding hosts is represented in (D). The colors in (D)
share the color key from (B).
First release: 25 March 2021 www.sciencemag.org (Page numbers not final at time of first release) 17
Downloaded from http://science.sciencemag.org/ on March 26, 2021
Fig. 5. Microbiota composition and richness are associated with host class and phenotypes.
(A) Alpha diversity of the microbiota of all samples from the wild, calculated using Shannon’s
diversity index, showing that diversity is significantly associated with dietary adaptations, while
considering the phylogenetic relationships between the hosts (***P<0.001, phyloANOVA with a
post-hoc test). (B) Differential abundance analysis of the rSGBs between groups of host classes.
Each panel represents a volcano plot of a pair of classes. The difference in average abundance per
animal class is shown on the X-axis and FDR corrected P-value (Mann-Whitney U test) is shown on
the Y-axis. (C) A heatmap representing the effect size for each rSGB (columns) per trait or taxonomy
(rows) for all samples from the wild. For each rSGB and trait/taxonomy pair (a heatmap cell), the
effect size was measured using Mann-Whitney U test. (D) Heatmap representing the effect size for
each microbial phylum (rows) per trait or taxonomy (columns) for all samples from the wild [similar
to (C)]. (E and F) Association between microbial abundance and average body mass (lower row) and
lifespan (upper row) in Mammalia (E) and Aves (F), while accounting for phylogenetic relationships
between the hosts using phylogenetic generalized least squares (PGLS).
First release: 25 March 2021 www.sciencemag.org (Page numbers not final at time of first release) 18
Downloaded from http://science.sciencemag.org/ on March 26, 2021
Fig. 6. Functional bacterial gene content is associated with host phenotypes. (A) Gene calling and
annotation were performed using eggNOG. Genes that mapped to a known COG category were
counted. The categorized functional annotation of 1,880,067 genes within medium or higher quality
bins (completeness>50%, contamination<5%) predicted in the cohort. Outer ring: COG category.
Inner ring: existence or absence of KEGG pathway annotation. (B to F) PCoA (Euclidean metric) of
sample-level KEGG pathway vectors (features are pathway-associated ortholog counts per sample).
Mammals from the wild colored by diet type (B) (R2=0.35; P<1.3x10−3), activity hours (C) (R2=0.31;
P<1.3x10−3), social structure (D) (R2=0.24; P<8x10−3), body mass (E) (R2=0.36; P<1.3x10−3) and
lifespan (F) (R2=0.41; P<1.3x10−3) (phyloMantel, FDR corrected). (G) A heatmap of log odds ratio of
KEGG orthologs that are enriched in diet groups. Rows represent orthologs, columns are different
diet groups compared. Only orthologs that are enriched (by means of Fisher exact test) with P<0.05
after FDR correction and phylogenetic P<0.05 are shown. (H and I) Existence map of KEGG orthologs
that are associated with ABC transporters (H) and phosphotransferase system (I), both enriched in
carnivores relative to herbivores. Rows represent orthologs, columns represent samples, colored by
dietary adaptations.
First release: 25 March 2021 www.sciencemag.org (Page numbers not final at time of first release) 19
Downloaded from http://science.sciencemag.org/ on March 26, 2021
Fig. 7. Discovery and experimental validation of unknown toxin-metabolizing genes in the
microbiota of carrion eaters. (A to D) Characterization of protease genes from the gut microbiota
of carrion eaters. Length distribution of the 693 proteases that were initially selected for screening
is shown in (A). Classification of the proteases by type (up to the third hierarchy of EC numbers) is
shown in (B). DIAMOND similarity score distribution (aligned to KEGG protease database) of a
deduplicated list of 531 proteases is shown in (C) and (D). (D) is a magnification of (C), with Y-axis
spanning to 60 instead of 300; X-axis remains unchanged. (E) Structures of 12 bacterial toxins from
which 120 peptides were extracted as targets for the screening, with isolated peptide substrate
positions marked in red. Structures are semi-transparent so peptides from all sides are visible.
Structures were drawn from PDB (a full list of PDB IDs can be found in table S7). (F) Protease
screening results. Protease genes were synthetized and tested on a printed microarray of
biotinylated peptides in order to identify cleavage events. Microarray staining was done post
cleavage by FITC-labeled streptavidin, such that cleaved peptides were not stained. Significant hits
(FDR of 0.01 or less) are marked by black squares. Each row in the heatmap is a specific protease
(enzyme) gene; each column is a specific peptide. (G) Validation of a selected cleavage event, a
protease termed WBT001, identified by the screening. Validation was done in solution, using the
specific peptide synthesized as a FRET substrate, modified with 6-carobyfluorescein (6-FAM) and
4-(dimethylaminoazo)benzene-4-carboxylic acid (DABCYL), and measured by fluorometry.
First release: 25 March 2021 www.sciencemag.org (Page numbers not final at time of first release) 20
Diversity and functional landscapes in the microbiota of animals in the wild
Doron Levin, Neta Raab, Yishay Pinto, Daphna Rothschild, Gal Zanir, Anastasia Godneva, Nadav Mellul, David Futorian, Doran
Gal, Sigal Leviatan, David Zeevi, Ido Bachelet and Eran Segal
SUPPLEMENTARY http://science.sciencemag.org/content/suppl/2021/03/24/science.abb5352.DC1
MATERIALS
REFERENCES This article cites 121 articles, 15 of which you can access for free
http://science.sciencemag.org/content/early/2021/03/24/science.abb5352#BIBL
PERMISSIONS http://www.sciencemag.org/help/reprints-and-permissions
Science (print ISSN 0036-8075; online ISSN 1095-9203) is published by the American Association for the Advancement of
Science, 1200 New York Avenue NW, Washington, DC 20005. The title Science is a registered trademark of AAAS.
Copyright © 2021, American Association for the Advancement of Science