Professional Documents
Culture Documents
net/publication/350501787
CITATIONS READS
0 44
4 authors:
Some of the authors of this publication are also working on these related projects:
Metagenomic analysis of Microbial Community in Fresh Water Lake in Saudi Arabia View project
اﻟﻤﺸﻜﻠﺔ واﻟﺤﻞ دراﺳﺔ ﺗﻄﺒﻴﻘﻴﺔ... اﻟﻤﻴﺎه اﻟﻤﺼﺎﺣﺒﺔ ﻟﻠﺒﺘﺮول اﻟﺨﺎمWater associated with crude oil … the problem and solutions Applied study View project
All content following this page was uploaded by Hend Hamedo on 03 April 2021.
RESEARCH ARTICLE
Na s h wa I . Ha g a g y
Na g l a a F. E l s h a f i
Samy A. Selim
He n d A . H a m e d o
ABSTRACT:
Shotgun metagenomic DNA sequencing is an
effective ecological sequencing approach that Na g l a a F. E l s h a f i 2
offers understanding of microbial community Samy A. Selim1,3
biodiversity and function. This study He n d A . H a m e d o 2
evaluated the phylogenetic diversity and 1
Botany and Microbiology Department,
metabolic potential of the haloalkaliphilic
Faculty of science, Suez Canal
microbial community of hypersaline and
University, Ismailia, 41522, Egypt.
alkaline lake in Wadi Al-Natrun, Egypt, by
2
using High-throughput sequencing technique. Botany and Microbiology Department,
Metagenome comprised of 32,149,168 reads, Faculty of science, Al -Arish University,
and total read bases were 4.0G bp, with GC AL-Arish, Egypt.
content 61.589% and Q30 87.847%. The brine 3
Department of Clinical Laboratory Sciences,
sample of Ga ’ ar lake have been proceeded College of Applied Medical Sciences,
with library construction and revealed an Jouf University, Sakaka, P.O. 2014,
enormous value for archaeal structure and Saudi Arabia.
functionality. The community investigation
indicated the predominance of domain
Archaea (81%), represented by phylum
Euryarchaeota, family halobacteriaceae.
Domain Eubacteria (2%) comprised total
reads 208 OTUs of microbial community and ACCEPTED: Mar 28, 2021
included five phyla; Bacteraidetes (1.2%),
Proteobacteria (0.3%), Firmicutes (0.1%), ARTICLE CODE: 03.02.21
Chloroflexi (0.2%) and Actinobacteria (0.2%).
Only 17% of total community was unassigned.
In functional analysis, Gene Ontology (GO) INTRODUCTION:
terms derived from InterPro matches to brine Further research into extremophilic
sample revealed that 42% was biological microorganisms may reveal their ability to
processes, 52% genes for molecular functions withstand the harsh environments that we are
and 6% was cellular components. most likely to encounter on other pl anets.
Metagenomic sequence analysis indicated Although high-throughput sequencing
that this lake is promising for explori ng of technologies have been expanded in past
novel genes and extremophilic decades (Uritskiy and DiRuggiero , 2019),
microorganisms for various purposes. understanding the ecology and functional
diversity of extremophilic community is still
KEY WORDS: limited. Many scientists of physical
environments, based on comparative
Shotgun metagenomic; Biodiversity,
sequencing data, demonstrated that high
Functional analysis, Extremopiles, Soda Lake,
salinity, as well as extremes of pH,
Egypt.
temperature, primarily determined microbial
community (Xie et al., 2017). One of the
CORRESPONDENCE: naturally occurring extreme alkaline
Na s h wa I . Ha g a g y environments are Soda Lakes, t hey are
characterized by their highly pH values (pH 8
Botany and Microbiology Department, Faculty
to 11) and presence of high concentration of
of science, Suez Canal University,
sodium carbonate, and other salts, formed by
Ismailia, 41522, Egypt.
evaporative concentration (Van den Burg,
E-mail: nashwa_ibrahim@science.suez.edu.eg 2003). They are colonized by a rich diversity
ISSN: 1687-7497 Online ISSN: 2090 - 0503 https://www.ejmanager.com/my/ejeb
22 Egypt. J. Exp. Biol. (Bot.), 17(1): 21 – 31 (2021)
Fig. 1. Schematic map of soda lakes of Wadi Al-Natrun, showing the position of the study Lake.
Physico-chemical characterization of brine : Faculty of agriculture, Suez Canal University
in according to standard procedures of
Physico-chemical parameters: pH,
Clesceri et al. (1998).
temperature, and salinity were measured by
pH and conductivity probes (V.tech, DNA extraction, sequencing and
pH/conductivity and temperature meter, bioinformatics analysis:
Taiwan) in the field. Red or pink brine sample Metagenomic DNA was extracted
was collected into 1000 ml bottles of sterile directly from 50 ml of brine, according to
Pyrex. The concentration of some ions, Ca 2 + , Mesbah et al. (2007). Genomic material was
Mg 2 + , Na + , K + , Cl - , HCO 3 - , CO 3 - , SO 4 , P + 2 , quantified by picogreen (Invitrogen, cat.
and NO 3 - 2 in the environmental samples #P7589) method using Victor 3 fluorometry .
under study was done in Soil Department,
ISSN: 1687-7497 Online ISSN: 2090 - 0503 https://www.ejmanager.com/my/ejeb
Hagagy et al., Shotgun Metagenomic Sequencing of Extremophilic Community from Soda Lake, Ga’ar Lake, in Wadi Al-Natrun, Egypt 23
The quality of the DNA was confirmed via Metagenomic mode. Further all genes
electrophoresis on a 1% agarose gel. predicted were searched for the functional
DNA sample of lake Ga , ar brine, was assignment using KEGG, COG and Pfam
delivered to MacroGen company databases. Different plots were generated
www.dna.macrogen.com for library using taxonomic abundance. Complete
construction and sequencing following taxonomic representation for brine sample
Illumina Hiseq2000 manual’s instructions. The using Krona
brine samples passed the QC results. The (https://github.com/marbl/Krona/wiki ) was
Illumina HiSeq2000 generates raw images generated.
utilizing HCS (HiSeq Control Software v 2.2) Data availability:
for system control and base calling through Metagenome sequence data are available on
integrated primary analysis software RTA EMBL Metagenomics under the accession no.
(Real Time Analysis. v1.18). The BCL (base PRJEEB18746
calls) binary is converted into FASTQ utilizing (https://www.ebi.ac.uk/ena/data/view/ERR1770058).
Illumina package bcl2fastq (v1.8.4).
Bioinformatics analysis of the databa ses: RESULTS:
The data of structural and functional In Water sample (brine) of Ga'ar Lake,
analysis of Ga’ar Lake was annotated by EBI the total number of bases sequenced was
Metagenome Training online EBI 4,037,788,302 pb (4.0 G). Total number of
Metagenomics reads, in illumina paired-end sequencing, was
(https://www.ebi.ac.uk/metagenomics ). As 39, 978,102 reads. Figure 2 shows the
well as, some metagenomics data w ere number of reads which pass the quality
analyzed by ArrayGen Technologies Pvt. Ltd., control steps in the pipeline. The GC content
India (www.arraygen.com). Metagenomics was 61.690%, and ratio of reads that have
reads were proceeded for quality check to passed quality score of over 30 (Q30) was
remove the bad quality base as well as any 83.912% (Fig. 3), the histograms show the
adapters present were removed using distributions of sequence lengths (Fig. 3A)
Cutadapt, and megahit and percentage GC content (Fig. 3B) for the
(https://github.com/voutcn/megahit) for the sequences having passed quality control. The
assembly. All the contigs less than 1200 were standard deviations are shown on each plot.
removed and only contigs with greater than The bar chart underneath each graph
1200 bp were further processed for gene indicates the minimum, mean and maximum
prediction and abundance for each sample length and mean GC and AT content,
against the assembled conti g sequence. respectively.
Gene/orf was predicted using PROKKA
(https://github.com/tseemann/prokka ).
Fig. 3. Reads length histogram: (A) distributions of sequence lengths, (B) percentage GC content for the sequences.
Taxonomic analysis: Archaea (81.7%), comprised 10798 OTUs of
prokaryotic community, domain Eubacteria
The brine sample showed an enormous
(18%) comprised 2328 OTUs of microbial
consequence for archaeal composition and
community and less than 0.5% of OTUs were
functionality in Ga ’ ar lake, the community
assigned to viruses (Fig. 4).
analysis revealed predominance of domain
Microbial diversity at the phylum and and Thaumarchaeota. Most of the OTUs
class levels, represented in contigs longer (98%) assigned to Euryarchaeota were
than 1200 pb, are shown in figure 5A and 5B, affiliated with class Halobacteria, 153 OTUs
respectively; 81% of the OTUs assigned to assigned to Euryarchaeota were affiliated with
Euryarchaeota, Proteobacteria (7%), Chlorobi class Methanomicrobia, 20 OTUs were
and Actinobacteria (4%), while 1% of the affiliated with class Methanopyri, 12 OTUs
OTUs assigned to Spirochaeota, Firmicutes were Thermoplasmata, only 5 and 3 OTUs
and Cyanobacteria. Less than 1% of OTUs were affiliated to classes Archaeoglobi and
assigned to phyla Crenarchaeota, Methanobacteria, respectively.
Bacteroidetes, Verrucomicrobia, Tenericutes
Fig. 5. Microbial diversity at (A) the phylum level and (B) at the class level represented in contigs longer than 1200 pb.
From the OTUs assigned to Haloterrigena, Halomicrobium, Haloferex,
Halobacteria (families Natrialbaceae, Halobacterium, haloarcula, Halobiforma,
Halorubraceae and Haloferacaceae), the most Halogeometrcium, Halococcus,
dominant genera were Halorubrum (32%), Halostagnicola, Haloplanus, Haladaptatus,
Halorhabdus (18%), Natronococcus (2%) and Natrialba, Natronorubrum and Haloquadrtum.
Halonotius, Natronomonas each representing Moreover, 36% of the archaeal OTUs were
(1%). As well as other species represented unassigned genera included in
less than (1%) including Halobaculum, Halobacteriaceae (Fig. 6).
Fig. 6. Archaeal community structure of Ga’ar Lake Metagenome showing the most dominant genera.
As shown in figure 7, the most Bacteriodetes, 157 OTUs of total bacterial
prominent phylum of Eubacteria was community. 155 OTUs affiliated with family
ISSN: 1687-7497 Online ISSN: 2090 - 0503 https://www.ejmanager.com/my/ejeb
26 Egypt. J. Exp. Biol. (Bot.), 17(1): 21 – 31 (2021)
Fig. 7. Bacterial community structure of Ga’ar Lake Metagenome showing the dominant phyla.
Functional analysis: inter promatch; (78.6%) contained predicted
proteins with unknown function, and 43.763
A total of 15,750,883 sequences have
(0.00028%) were ribosomal RNA genes (Fig.
passed quality control with predicted CDS,
8).
and out of them 2,692,838 (21.4%) contained
predicted proteins with known functions and
Fig. 8. The functional content of the sequences in the brine of Ga’ar Lake.
A summary of Gene Ontology (GO) (52%) were assigned to molecular functions,
terms derived from Inte rPro matches to brine and 1665583 annotations (6%) were assigned
sample is provided in the charts shown in to cellular components. As shown in figure 10
figure 9. The detection of genes encoded for the proportion of SEED subsystems
secondary metabolism and metabolism of (Overbeek et al., 2005) annotated in contigs
aromatic compounds indicates that the from the brine sample of the studied lake.
studied lake was prosperous with Most of genes related to central carbohydrate
commercially valuable enzymes. A tota l of metabolism, RNA processing and
1243352 annotations (42%) were assigned to modification, protein biosynthesis, folate and
biological processes, 1539319 annotations pterines, isoprenoids, lysine, threonine,
ISSN: 1687-7497 Online ISSN: 2090 - 0503 https://www.ejmanager.com/my/ejeb
Hagagy et al., Shotgun Metagenomic Sequencing of Extremophilic Community from Soda Lake, Ga’ar Lake, in Wadi Al-Natrun, Egypt 27
Fig. 9. A summary of Gene Ontology (GO) terms derived from InterPro matches to the lake brine. A. for biological
processes, B. genes for Molecular functions, and C. for cellular components.
ISSN: 1687-7497 Online ISSN: 2090 - 0503 https://www.ejmanager.com/my/ejeb
28 Egypt. J. Exp. Biol. (Bot.), 17(1): 21 – 31 (2021)
Fig. 10. Heatmap depicting the proportion of proteins related to SEED subsystems in the brine of Ga’ar Lake. Scale has
been applied by row.
REFERENCES:
Abd El Ghani M, Soliman A, Abdelfattah R. 2014.
Spatial distribution and soil characteristics of Keshri J, Mody K, Jha B. 2013. Bacterial community
the vegetation associated with common structure in a semi- arid haloalkaline soil using
succulent plants in Egypt. Turk. J. Bot., 38: culture independent method. Geomicrobiol. J.,
550-565. 30(6): 517–529.
Clescer S, Greenberg AE, Eaton AD. 1998. Standard Marco D. 2011. Metagenomics current innovations
methods for the examination of water and and future trends. Caister Academic Press,
waste water, 20 th Edition.” American Public pp. 296.
Health Association, Washington DC., pp.
1325. Mesbah NM, Abou-El-Ela SH, Wiegel J. 2007. Novel
and Unexpected Prokaryotic Diversity in Water
de la Haba RR, Sánchez-Porro C, Marquez MC, and Sediments of the Alkaline, Hypersaline
Ventosa A. 2011. Taxonomy of Halophiles. In: Lakes of the Wadi An Natrun, Egypt. Microb.
“Extremophiles Handbook. (Horikoshi K. eds)”. Ecol., 54(4): 598–617.
Springer, Tokyo, pp. 255-308.
Navarro-Noya YE, Valenzuela-Encinas C, Sandoval-
Dudhagara P, Ghelani A, Patel R, Chaudhari R, Yuriar A, Jiménez-bueno NG, Marsch R,
Bhatt S. 2015. Bacterial tag encoded FLX Dendooven L. 2015. Archaeal communities in
titanium amplicon pyrosequencing (bTEFAP) a heterogeneous hypersaline-alkaline soil.
based assessment of prokaryotic diversity in Archaea, 2015(2): 1-11
metagenome of Lonar soda lake, India.
Genomics Data, 4: 8-11. Overbeek R, Begley T, Butler RM, Choudhuri JV,
Chuang HY, Cohoon M, de Crécy-Lagard V,
Ghai R, Pašić L, Fernández AB, Martin-Cuadrado Diaz N, Disz T, Edwards R, Fonstein M, Frank
AB, Mizuno CM, McMahon KD, Papke RT, ED, Gerdes S, Glass EM, Goesmann A,
Stepanauskas R, Rodriguez-Brito B, Rohwer Hanson A, Iwata-Reuyl D, Jensen R, Jamshidi
F, Sánchez-Porro C, Ventosa A, Rodríguez- N, Krause L, Kubal M, Larsen N, Linke B,
Valera F. 2011. New abundant microbial McHardy AC, Meyer F, Neuweger H, Olsen G,
groups in aquatic hypersaline environments. Olson R, Osterman A, Portnoy V, Pusch GD,
Sci. Rep., 1: 135. Rodionov DA, Rückert C, Steiner J, Stevens
Grant S, Sorokin DY, Grant WD, Jones BE, Heaphy R, Thiele I, Vassieva O, Ye Y, Zagnitko O,
S. 2004. A phylogenetic analysis of Wadi el Vonstein V. 2005. The subsystems approach
Natrun soda lake cellulase enrichment to genome annotation and its use in the
cultures and identification of cellulase genes project to annotate 1000 genomes. Nucleic
from these cultures. Extremophiles, 8(5): 421 – Acids Res., 33(17): 5691–5702.
429. Pandit AS, Joshi MN, Bhargava P, Shaikh I, Ayachit
Grant WD, Sorokin DY. 2011. Distribution and GN, Raj SR, Saxena AK, Bagatharia SB.
diversity of soda lake alkaliphiles. In: 2015. A snapshot of microbial communities
“Extremophiles Handbook. (Horikoshi K, from the Kutch: One of the largest salt deserts
Antranikian G, Bull AT, Robb FT, Stetter KO. in the world. Extremophiles, 19(5): 973–987.
Eds)”. Springer, vol. 1, pp. 27–54. Selim AS, El-Alfy MS, Hagagy NI, Hassanin AAI,
Hahnke RL, Meier-Kolthoff JP, García-López M, Khattab RM, El-Meleigy EA, Abdel Aziz MH,
Mukherjee S, Huntemann M, Ivanova NN, Maugeri TL. 2012. Oil-biodegradation and
Woyke T, Kyrpides NC, Klenk HP, Göker M. biosurfactant production by haloalkaliphilic
2016. Genome-Based Taxonomic archaea isolated from Soda Lakes of the Wadi
Classification of Bacteroidetes. Front. An Natrun, Egypt. J. Pure Appl. Microbiol.,
Microbiol., 7: 2003. 6(3): 1011-1020.
doi: 10.3389/fmicb.2016.02003 Shi Y, Adams JM, Ni Y, Yang T, Jing X, Chen L, He
Hamedo HA, Hagagy NI, El-Shafi NF, AbdElaziz MH. JS, Chu H. 2016. The biogeography of soil
2017. Screening of hydrolytic extremozymes archaeal communities on the eastern Tibetan
in haloalkaliphilic Archaea by culture and Plateau. Sci. Rep., 6: 38893. doi:
molecular-based methods. Egypt. J. Exp. Biol. 10.1038/srep38893.
(Bot.), 13(1): 81-87.
ISSN: 1687-7497 Online ISSN: 2090 - 0503 https://www.ejmanager.com/my/ejeb
Hagagy et al., Shotgun Metagenomic Sequencing of Extremophilic Community from Soda Lake, Ga’ar Lake, in Wadi Al-Natrun, Egypt 31
Simon C, Wiezer A, Strittmatter AW, Daniel R. 2009. Saltmarshes (SW Spain). Genes (Basel). 9(3):
Phylogenetic diversity and metabolic potential 152. doi: 10.3390/genes9030152.
revealed in a glacier ice metagenome. Appl. Xie K, Deng Y, Zhang S, Zhang W, Liu J, Xie Y,
Environ. Microbiol., 75(23): 7519-7526. Zhang X, Huang H. 2017. Prokaryotic
Uritskiy U, DiRuggiero J. 2019. Applying genome- community distribution along an ecological
resolved metagenomics to deconvolute the gradient of salinity in surface and subsurface
halophilic microbiome. Genes (Basel). 10(3): saline soils. Sci. Rep., 7: 13332.
220. doi: 10.3390/genes10030220. doi.org/10.1038/s41598-017-13608-5
Van den Burg B. 2003. Extremophiles as a source for Xie W, Wang F, Guo L, Chen Z, Sievert SM, Men g J,
novel enzymes. Curr. Opin. Microbiol., 6(3): Huang G, Li Y, Yan Q, Wu S, Wang X, Chen
213-218. S, He G, Xiao X, Xu A. 2011. Comparative
Vera-Gargallo B, Ventosa A. 2018. Metagenomic metagenomics of microbial communities
insights into the phylogenetic and metabolic inhabiting deep-sea hydrothermal vent
diversity of the prokaryotic community chimneys with contrasting chemistries. ISME
dwelling in hypersaline soils from the Odiel J., 5(3): 414–426.
أما البكتريا " "Eubacteriaالمتواجدة تمثل ( )2%فقط من تقنية تسلسل الحمض النووي عالية اإلنتاجية
المجتمع الميكروبي وتضم خمس مجموعات تتألف من وا لمعروفة ب الميتاجينومية هي نهج تسلسل بيئي فعال
مجموع 208وحدات ” “OTUsللمجتمع الميكروبي وتضم ما للتنوع البيولوجي والوظيفي في المجتمع يوفر فه ً
خمس شعب؛)Proteobacteria ،Bacteraidetes (1.2% الميكروبي .قيمت هذه الدراسة التنوع الوراثي واإلمكانات
) Chloroflexi (0.2%) ،Firmicutes (0.1%) ،(0.3%و األيضية للمجتمع الميكروبي " "Haloalkaliphilicلبحيرة
) .Actinobacteria (0.2%وتبين ان 17%فقط من المجتمع شديدة الملوحة والقلوية في وادي النطرون في مصر،
الكلي لم يتم تعيينهم .في التحليل الوظيفي ،كشفت باستخدام تقنية التسلسل عالي اإلنتاجية ( (Illumina
مصطلحات علم الوجود الجيني ) (GOالمستمدة من تطابق .Hiseq200يتكون Metagenomeمن 32،149،168قراءة،
InterProمع العينة أن 42%كانت عمليات بيولوجية ،و وبلغ إجمالي قواعد القراءة ،4.0G pbمع محتوى 61.589٪
52%جينات للوظائف الجزيئية و 6%كانت مكونات خلوية. GCو .87.847% Q30تمت متابعة عينة المياه عالية
يشير تحليل تسلسل الميتاجينوم إلى أن هذه البحيرة الملوحة ) (Brineلبحيرة الجعار Ga'arفي بناء المكتبة
واعدة الستكشاف الجينات الجديدة والكائنات الحية الدقيقة الجينية وكشفت عن قيمة هائلة للتنوع التكويني
شديدة الحساسية ألغراض مختلفة. والوظيفي لالركيا " ،"Archaeaوتبين أن ( )81%تمثل االركيا
" "Archaeaمن طائفة ،Euryarchaeotaعائلة .alobacteriaceae