Professional Documents
Culture Documents
Textbook Dna Barcoding and Molecular Phylogeny Subrata Trivedi Ebook All Chapter PDF
Textbook Dna Barcoding and Molecular Phylogeny Subrata Trivedi Ebook All Chapter PDF
https://textbookfull.com/product/fundamentals-of-molecular-
structural-biology-1st-edition-subrata-pal/
https://textbookfull.com/product/dna-computing-and-molecular-
programming-24th-international-conference-dna-24-jinan-china-
october-8-12-2018-proceedings-david-doty/
https://textbookfull.com/product/probability-and-statistics-with-
reliability-queueing-and-computer-science-applications-trivedi/
How to Speak Tech 2nd Edition Vinay Trivedi
https://textbookfull.com/product/how-to-speak-tech-2nd-edition-
vinay-trivedi/
https://textbookfull.com/product/fortran-2018-with-parallel-
programming-1st-edition-subrata-ray-author/
https://textbookfull.com/product/hydrodynamics-and-transport-
processes-of-inverse-bubbly-flow-1st-edition-subrata-kumar-
majumder/
https://textbookfull.com/product/dna-engineering-properties-and-
applications-1st-edition-mizoguchi/
https://textbookfull.com/product/mitochondrial-dna-and-
diseases-1st-edition-hongzhi-sun/
Subrata Trivedi · Hasibur Rehman
Shalini Saggu · Chellasamy Panneerselvam
Sankar K. Ghosh Editors
DNA Barcoding
and Molecular
Phylogeny
DNA Barcoding and Molecular Phylogeny
Subrata Trivedi • Hasibur Rehman •
Shalini Saggu • Chellasamy Panneerselvam •
Sankar K. Ghosh
Editors
DNA Barcoding
and Molecular Phylogeny
Editors
Subrata Trivedi Hasibur Rehman
Faculty of Science, Department of Biology Faculty of Science, Department of Biology
University of Tabuk University of Tabuk
Tabuk, Saudi Arabia Tabuk, Saudi Arabia
Departments of Pathology
School of Medicine, The University of Alabama
at Birmingham (UAB)
Birmingham, AL, USA
Sankar K. Ghosh
University of Kalyani
Kalyani, West Bengal, India
This Springer imprint is published by the registered company Springer International Publishing AG part of
Springer Nature.
The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland
Contents
v
vi Contents
S. Daravath
Department of Biotechnology, Nizam College, Hyderabad, Telangana, India
R. Naik Bannoth
Department of Zoology, Osmania University, Hyderabad, Telangana, India
M. T. Selvi
Value Added Corporate Services Pvt Ltd, Chennai, Tamil Nadu, India
S. Ankanagari (*)
Department of Genetics, Osmania University, Hyderabad, Telangana, India
Biodiversity is the variability of life on Earth, including the genes which they carry
and complex ecosystems that create the environment (Primack et al. 2001). The
degree of biodiversity is constantly changing; the rate of extinction is 100–1000
times higher, almost exclusively as a consequence of human activity. In recent years,
the effort to map and to protect global biodiversity has been ever increasing. Thus the
global project aimed at the mapping and protection of biodiversity using a new
taxonomic method called “DNA barcoding” was started. DNA barcoding assists in
cataloging the list of species biodiversity and answers fundamental questions in
ecology, evolution, and conservation biology. Many authors have proposed DNA
barcoding as an integrated approach with classical taxonomy for species identifica-
tion and authentication in the postgenomics era (Kane and Cronk 2008; Newmaster
et al. 2009; Sahare and Srinivasu 2012; Vohra and Khera 2013). Since it has enough
nucleotide diversity, DNA barcodes are used to discover, describe, and understand
biodiversity of unknown species. The universal coded labels found by the
researchers are used as unique identification marker for the species on the planet
(Kress et al. 2015). Combination of molecular biology techniques, bioinformatics,
and DNA barcoding is giving an opportunity to change the existing use of biodi-
versity information for basic and practical applications.
DNA Barcoding Significance and Utilities 5
The Barcode of Life (2010) defines four aspects of DNA barcoding workflow.
Firstly, verified taxonomic identification and ideally voucher specimens are used
as critical sources for DNA library preparation from national parks, botanical
gardens, zoological gardens/zoos, seed banks, national herberia, and gene banks.
Secondly, these samples are subjected to analysis in the laboratory where extraction
of DNA is done as per the laboratory standard operating procedures. After extraction
of DNA, amplification is done by using PCR method and sequenced. Thirdly, the
obtained gene sequences are kept in the database, a reference library, which helps the
allocation of known sequence. Lastly, by using algorithm, best fit reference record is
identified from the database for the unknown sample data.
The criteria for identification of gene marker regions for univerisally applicable
DNA barcode standard include following: (1) significant species-level genetic var-
iability and divergence, i.e., DNA barcode region should have high interspecific and
low intraspecific variability, (2) short sequence length to facilitate DNA extraction
and amplification, (3) simple to sequence universal PCR primers, and (4) few easily
alignable insertions/deletions.
identifying animal species (Hebert et al. 2003b). Compared to previous 650 base long
segment of the cytochrome c oxidase gene, a 100 base fragment of the original
barcode was reported to be effective in identifying archival specimens and potentially
useful for all taxa of the eukaryotes DNA barcoding. COI region is now widely used
for molecular evaluation of diversity, as it has good potential for identifying cryptic
species and improving understanding of marine biodiversity. The other mitochon-
drial genes used as barcode markers include: cob, which encodes for apocytochrome
b; cox2 and cox3, which encode for the cytochrome oxidase subunits 2 and 3,
respectively; and nad1, which encodes for NADH dehydrogenase subunit1 and the
mitochondrial 16S-rDNA gene.
Plant DNA barcoding meant for mainly plastid DNA includes maturase K (matK)
and ribulose-bisphosphate carboxylase (rbcl) (CBOL Plant Working Group 2009;
Burgess et al. 2011). Several plastidial genes, such as the most conserved rpoB,
rpoC1, and rbcL or a section of matK, which shows a fast evolution rate, have been
proposed as barcode regions (Shaw et al. 2007). Similarly intergenic spacers such as
trnH-psbA, atpF-atpH, and psbK-psbI have been proposed as barcode regions
(Fazekas et al. 2009). In 2009, the CBoL (Consortium for the Barcode of Life)
Plant Working Group (Hollingsworth et al. 2009) suggested the use of two-locus
combination of rbcL and matK as core-barcode regions, due to recovery rate of rbcL
and the high resolution of matK. For a single primer pair sequencing, matK is
difficult to use (Dunning and Savolainen 2010), whereas, despite its limited resolu-
tion, rbcL of matk is considered in terms of amplification, sequencing, and alignment
and provides a useful backbone in the creation of plant DNA barcode datasets
(De Mattia et al. 2012). The trnH-psbA intergenic spacer is direct to amplify and
has a high genetic variability among closely related taxa (Bruni et al. 2010; Kress
et al. 2015; Shaw et al. 2007). The nuclear ITS region is also considered as a
supplementary DNA barcode region (Li et al. 2011). Although there is still debate
on the effectiveness of these markers especially when users are dealing with closely
related taxa, DNA barcoding showed consistent results when used to identify
unknown specimens based on the comparison with reference sequences (Burgess
et al. 2011; De Mattia et al. 2012).
The combining of barcodes from multiple loci has been used successfully. Plant
DNA barcodes will prove extremely useful for applications like ecological forensics,
identification of traded materials, undertaking identifications, and assisting species
discovery in some plant groups. The key step is assembling large DNA sample sets
representing the Earth’s botanical diversity, supported by voucher specimens and
indexed via DNA sequences which help to use plant DNA for identification of
species effectively (Hollingsworth et al. 2011).
DNA Barcoding Significance and Utilities 7
The green alga is an ancient Chlorophyta, which are used as bioindicator for monitor-
ing water and ecosystem. It is used in biotechnological applications such as the
production of fuels, chemicals, food, and animal feed. The identification of green
microalgae is very difficult by using traditional method in live cultured cells. DNA
barcoding method is helpful to identify the 5000 which is not explained earlier. rbcL
and nuITS1 and nuITS2 markers are used to identify the green microalgae species from
neotropical inland waters. Existing available universal primers for ITS1-5.8S-ITS2
region amplification were done for 92% of the samples. But by designing new set of
primers for rbcL, which sequenced for 96% of the samples, nuITS1 or nuITS2
sequences identified 35% of species using barcode gap calculations. Accuracy of
identification is done by nuITS2 compensatory base change (CBC) and ITS1-5.8S-
ITS2 region phylogenetic analysis along with morphological inspection. By using rbcL
sequences only 6% of right species could be identified, however, with the available
analytical tools and reference barcodes in the database nuITS1 or nuITS2 can be other
useful markers for DNA barcoding of freshwater green microalgae by using availability
of analytical tools and reference barcodes for this marker in the database. DNA
barcoding pipeline based on nuITS2 is useful for identification of green microalgae
species. It is clear that deposition of more taxonomically accurate reference barcodes in
curated databases (e.g., BOLD Systems) will help in lessening known tropical species.
COI is the default marker adopted by the Consortium for the Barcode of Life for all
groups of organisms, including fungi. Most fungi are microscopic and inconspicu-
ous, and many are unculturable. Hence universal primers are required to detect a
truly representative profile where COI seems to have many challenges from other
candidates like ITS. ITS region is the main barcode region in fungi (Schoch et al.
2012). Internal transcribed spacer (ITS) is used to identify Oomycota, but the default
COI marker is helpful to identify closely related species and in Penicillium and other
fungi. Across the fungal kingdom, ITS was generally considered as higher resolution
than LSU in species identification. ITS performance in species discrimination is as
close to RNA polymerase II largest subunit (RPB1) as the protein-coding marker.
However, ITS region fails to identify the species in certain groups of fungi and thus
requires secondary barcode loci for proper identification.
Protists are a diverse and loose group of disparate eukaryotic microorganisms. They
have very simple structural organization, unicellular or multicellular. This simple
cellular organization distinguishes the protists from other eukaryotes. They have
8 S. Daravath et al.
evolved before plants, animals, or fungi appeared on Earth. CBOL initiated Protist
Working Group and assessed the efforts to identify the barcode regions across all
protist lineages, and recently it has a two-step barcoding approach to assess protistan
biodiversity. Various protistan DNA barcodes have been proposed; D1–D2 and/or
D2–D3 regions at the 50 end of 28S rDNA have been positively tested in ciliates,
saprophytes, and acantharians and are also promising for diatoms. Highly variable
V4 subregion on the 18S rRNA gene may serve as a potential candidate for
barcoding diatoms (Zimmermann and his group). ITS1 and/or ITS2 rDNA is also
commonly utilized in oomycetes, chlorarachniophytes, and green algae. COI also
allows morphospecies identification in red and brown algae, dinoflagellates, some
raphid diatoms, Euglyphida, naked lobose and shelled amoebae, coccolithophorid
haptophytes, and some ciliates. Other group-specific barcodes include the large
subunit of the ribulose-1,5-biphosphate carboxylase–oxygenase gene (rbcL) and
the chloroplast 23S rRNA gene for photosynthetic protists and spliced leader RNA
genes for trypanosomatids. The resolution powers of different protistan DNA
barcodes are not compared.
markers can also be used. The data generated using metagenomics provides addi-
tional genomic information to identify the taxa and functional characterization of the
environment.
Sequencing methodologies for DNA barcoding have changed over time. More
recent HTS-based studies have employed one of three major sequencing library
preparation strategies: ligation-based “tagmentation” kits (Richardson et al. 2015a,
b), singly indexed barcoded primers (Valentini et al. 2010; Hawkins et al. 2015;
Keller et al. 2015; Kraaijeveld et al. 2015), or dual-indexed barcoded primers (Sickel
et al. 2015). The dual-indexing approach described in Sickel et al. (2015), adapted
from Kozich et al. (2013), shows great promise for facilitating library preparation for
large studies while reducing multiplexing cost and increasing laboratory efficiency.
Sickel et al. (2015) report 2000–3000 high-quality reads to be adequate to describe
bee-collected samples with up to 80 taxa. Lastly, pollen metabarcoding has been
successfully conducted using a variety of platforms including Ion Torrent
(Kraaijeveld et al. 2015), Roche 454 (Valentini et al. 2010; Hawkins et al. 2015;
Keller et al. 2015), and Illumina (Richardson et al. 2015a, b; Sickel et al. 2015);
however, with increased relative throughput, accurate homopolymer sequencing,
and increasing length capabilities, Illumina is the current platform of choice for
PCR-based approaches.
NGS is the most standard tool for characterizing the ITS2 (DNA marker for
mosquitoes) region in mosquitoes; this can be used in many other insect species and
genera. Multiplexing made NGS more efficient in sequencing polymorphic regions
successfully and in understanding large diversity of ITS2 alleles present in mosqui-
toes. DNA barcoding marker ITS2 was able to separate all of the species, apart from
members of the Culex pipiens complex, given similar resolution as cytochrome
oxidase I (COI). The best DNA marker could successfully separate all species and
provide a good linkage among species phylogenies. The gene with best resolution
can be used for bulk DNA barcoding, where large pools of mosquitoes could be
sequenced and identified (Hajibabaei et al. 2011).
Microbes in marine environments are studied by using NGS technology. Ana-
lyses of bacterial studies were done by using 18S rDNA (Huber et al. 2007) and 16S
rDNA (Sogin et al. 2006). The amplicons, microbe’s gene expression in surface
seawater (Frias-Lopez et al. 2008), transcriptomic sequencing analysis of cDNA
libraries, and functional assemblages within seawater were investigated in
bacterioplankton (Mou et al. 2008) were investigated. Marine eukaryotic microbiota
was studied through NGS analysis of 18S rDNA amplicons (Stoeck et al. 2010). A
shotgun sequencing approach was employed to investigate microbial diversity in
seawater (Williamson et al. 2008).
Metabarcoding is a tool to assess biodiversity. Very recently metabarcoding is
validated by testing it against three high-quality standard datasets from Malaysia
(tropical), China (subtropical), and the United Kingdom (temperate) and that has
55,813 arthropod and bird specimens identified to species level. It can be applied in
restoration ecology and systematic conservation planning, minimizing false-positive
assignments (Matsen et al. 2010; Zhang et al. 2012) linked with the end users of
biodiversity data (Cook et al. 2013). In recent years, using DNA metabarcoding with
DNA Barcoding Significance and Utilities 11
DNA extracts have a mixture of plastid and mitochondria and generate the sequence
data across all the mentioned organelle. At low sequence coverage, approx. 1 GB of
data, the genome can be “skimmed” and permits near completion assembly of the
high-copy plastid, mitochondria, and ribosomal RNA. Skimming approach has the
capacity to make highly fragmented nuclear genome assembly. Genome skimming
also helps to prepare single insert-size library for less number of samples, e.g.,
Illumina Nextera and TruSeq. Larger-size sample library preparation can be auto-
mated, e.g., Illumina NeoPrep. It also reduces the cost of analysis. Genome skim-
ming generates some low coverage data of single-copy nuclear regions and can be
used in combination with algorithms inspired by Maillet et al. (2014) or Fan et al.
(2015) to generate similarity indices between pairs of nuclear genomes which can
solve problems in hybridization and/or recent origins. Genome skimming has great
promise for extending the plant barcode, which can be used in highly degraded
DNAs from herbarium specimens. Sequence data is recoverable from herbarium
specimens which has degraded DNA in the absence of PCR, e.g., whole plastid
genome from 100-year-old herbarium (Besnard et al. 2014). This absence of PCR is
12 S. Daravath et al.
used when universal primers are ineffective (e.g., matK from various plant lineages
or plastid regions from many parasitic plants).
Several studies carried out to date have highlighted many major challenges, includ-
ing lack of reference libraries, unavailability of the vouchers to professionally
identified specimens archived in a herbarium corresponding to the reference DNA
sequences in the GenBank (consequently a GenBank reference sequence may be
from an incorrectly identified plant species with no way to verify its specific origin),
and variable rate of evolution corresponding to different loci (Newsmaster et al.
2013). Researchers have to establish the DNA barcoding library data base with
known sequences available for open access to identify unknown specimen.
Development of biological reference material (BRM) acts as a universal platform
for reference sequence database at species level to identify plant components in the
herbal products. This would consist of taxonomically validated herbarium vouchers
of known provenance. The barcode of an unknown sample from commercially
produced herbal products can be compared with the reference barcodes to identify
the related species. The BRM herbal barcode library presents a method for good
manufacturing practices (GMP) of herbal products.
Two central DNA databases are available; one is BOLD (Barcode of Life Data
Systems), through which researchers can upload and use it for their analysis and
assembling data. BOLD is connected to other databases of voucher specimens by
BARCODE Data Standard. Another one are GenBank, EMBL, and DDBJ which are
DNA Barcoding Significance and Utilities 13
3.2.1 BOLD
The Barcode of Life Data Systems (BOLD) is a web-based workbench and database
supporting the acquisition, storage, analysis, and publication of DNA barcode
records. By assembling molecular, morphological, and distributional data, it bridges
a traditional bioinformatics chasm.
BOLD contains now more than 4.2 million validated barcodes (http://www.
boldsystems.org/index.php/databases; Ratnasingham and Hebert 2007).
3.2.2 NCBI
Globally several DNA barcoding projects are launched to aim specific taxonomic
groups such as plants, fungi, protists, bacteria, and different entities of the animal
kingdom including fishes, brides, insects, nematodes, mammals, etc. The largest
consortia in this DNA barcoding projects are:
iBOL
A milestone in the field of DNA barcoding was achieved by launching of Interna-
tional Barcode of Life (iBOL) project. The iBOL is the largest biodiversity genomics
program. From 25 nations biodiversity scientists, genomics specialists, technolo-
gists, and ethicists are working together to construct a richly parameterized DNA
barcode reference library which is going to be the basis for a DNA-based identifi-
cation system for all multicellular organisms. In the initial phase of the operations,
iBOL collaborators are barcoding for 5 million specimens representing 500,000
14 S. Daravath et al.
species. During construction of the barcode library, iBOL participants will also be
building the infrastructure needed to use it in real-world situations such as conser-
vation, ecosystem monitoring, forensics, and control of agricultural pests and inva-
sive species.
CBOL
To support worldwide DNA barcoding and international online data management
system—Consortium for the Barcode of Life (CBOL) was established. Later Barcode
of Life Data Systems (http://www.barcodinglife.org) came into effect. Canada was the
first country to establish national network for DNA barcoding in Canada initially
(BOLNET.ca). Later most of the countries and regions have also established barcoding
networks as part of the iBOL, e.g., Europe (ECBOL; http://www.ecbol.org/), Norway
(NorBOL; http://dnabarcoding.no/en/), Mexico (MexBOL; http://www.mexbol.org/),
and Japan (JBOLI; http://www.jboli.org/).
ECBOL
Earlier times ECBOL was within EDIT, the European Distributed Institute of
Taxonomy, to complement CBOL’s activities from a European perspective. The
idea of ECBOL.org was created within EDIT work package 3.4 “DNA barcoding”
and is an information and coordination hub maintained by the Centraalbureau voor
Schimmelcultures (CBS) in Utrecht. As a result of EDIT and the DNA Barcoding in
Europe meeting in 2007, the ECBOL initiative was started, for a network of
European leading labs in the field of DNA barcoding.
Currently, as many as 1,371,809 DNA barcodes are identified and stored, which
correspond to approximately 113,435 denominated species (as of September
30, 2011). These projects enhance the information resources available in the genome
databases of GenBank, EMBL, DDBJ, and encyclopedia. The ultimate concept of
these entire project missions is to develop the DNA barcoding ease and affordable
molecular tool for taxonomic research but also to study, preserve, and protect the
biodiversity. The applications of this tool in turn benefits science and society.
activities including test reports and SOPs. Procedures are extensively employed to
assist with working safely. They are sometimes called safe work methods statements
(SWMS). They are usually preceded by various methods of analyzing tasks or jobs
to be performed in a workplace, including an approach called job safety analysis, in
which hazards are identified and their control methods described. Procedures must
be suited to the literacy levels of the user, and as part of this, the readability of
procedures is important. Thus, SOPs ensures the quality of uniformity in working
process with a rich outcome of the activity.
The procedures involved in DNA barcoding are sample collection, isolation of
DNA, amplification of the barcoding gene, and finally sequencing and analyzing the
results. Product obtained by PCR is sent for DNA sequencing and analyzed using
both the DNA BOLD and NCBI databases (Jacque Keele et al. 2014). It helps to
identify species from samples of DNA from fish, birds, mammals, plants, and
invertebrates. Due to poor DNA or failure of PCR, result might go wrong, but
PCR with RDLES is providing researchers a way to confirm through molecular
biology the identification of organisms.
To enhance the quality of these methods, several laboratories and organizations
around the world are getting ISO 17025 accreditation for the methods of DNA
sequencing, next-generation sequencing, and PCR ISO 15189. Main focus of ISO
17025 is on the accredited test and calibration method including (1) detailed sam-
pling procedures, (2) specific handling of test and calibration equipment, (3) ISO
17025-compliant result reports and result reporting, and (4) the participation in
proficiency testing programs/lab comparison tests. The compliance with ISO
17025 is secured through accreditation by a national accreditation body and inde-
pendent internal audits. Quality assurance perspectives are crucial. It is vital to that
working with small amounts of DNA is challenging, DNA is not visible to the naked
eye throughout the process, from sampling to analysis. Also, obtained results in one
study cannot be translated directly to other studies that use different primers,
sampling methods, lab and field protocols, etc. Thus the entire SOP is important
for each species from sampling to analysis, and also each SOP undergoes method
verification before implementation in to other projects.
DNA barcoding is a critical tool that helps to identify an animal species and gives its
genetic sequence database. On comparison of nucleotide distance between species of
different genera, it was found that Sarotherodon melanotheron and Coptodon zilli
are closely related. Using cytochrome oxidase subunit I (COI) confirmed accurate
identification of these fish species from Southwest Nigeria (Falade et al. 2016).
DNA barcoding is advance in accurate species identification and also allows for
the recognition and documentation of unknown taxa across alpha and beta diversity
estimation in the tropical bat. DNA barcoding surveys have shown that a more
number of species-level taxa are unnoticed by traditional methods. Bat tissue DNA
was extracted and the COI barcode region amplified and sequenced and identified
nine species-level taxa within samples, based on analysis of the DNA barcodes. The
study outcome has shown that high diversity of bats within Peninsular Malaysia
(9 species in 13 samples) was identified and demonstrated how DNA barcoding
helps and allows for cataloguing and documenting known taxa lacking formal
taxonomic status (Wilson et al. 2014).
Evolution Parallel evolution of species were identified at inter and intraspecific
level by using three aphid and two buchnera genes eg. Mollithrichosiphum and
Buchnera at intraspecific as well as the interspecific levels. It supports using
endosymbiont genes to study host evolutionary history and biogeographical pat-
terns. In addition, Buchnera gnd gene acts as a barcoding marker for aphid identi-
fication (Liu et al. 2013). The study indicates that the Buchnera gnd gene is as good
as COI as a barcoding marker for aphids (Lebonah et al. 2014).
Ecology Rapid habitat loss and degradation are responsible for population decline
in a growing number of species. Understanding the natural history of the species is
important for designing conservation strategies to enhance habitat or ex situ conser-
vation. Globally ecosystems have undergone rapid changes during the recent past,
especially anthropogenic climate change, biodiversity loss, and biological invasions.
The immense environmental degradation stresses the need for quick technique for
quantifying and monitoring the spatial and temporal dynamics of biodiversity.
In ecological studies DNA barcoding is used to survey the diets of animals by
analyzing the fecal matter to identify the remains of the plants (Valentini et al. 2009).
Many studies have used NGS technology in diet analysis and in the investigation of
gut microbial ecology. Several studies have been conducted on the effect of diet on
the gut microbiome of mice using 16S rDNA amplicons. Recently, diet of bats was
studied using short COI amplicons to identify the species which allows or permits to
understand the relationship of the diet of sympatric cryptic species (Razgour et al.
2011).
Conservation Unknown species identification helps industries of mining, fisheries,
and forestry to conserve and manage its environment which in turn gives us
economic benefits. To identify all species in the planet will take 2000 years by
using traditional taxonomy. By the help of DNA barcoding, it can increase the
20 S. Daravath et al.
identification rate, and in turn it gives scientific and economic benefits. Genetic
marker system in barcoding will help to identify accurately and to track endangered
valuable species. Applications of DNA barcoding in extinct species can identify
hunted wildlife for the traditional medicine, collections of rare species for private
parties, and harvest for other products of wildlife.
Biodiversity Genomic studies in conjugation with DNA barcoding can be very
effective in assessment of global biodiversity. Canada has a global hub for DNA
barcoding. Advanced technology in high-throughput sequencing is used in DNA
barcoding of large environmental samples to identify the species. DNA barcoding
was effective in analyzing zooplankton sample collected from Equatorial Pacific
Ocean (Machida et al. 2009; Ficetola et al. 2008). NGS approach with conventional
Sanger sequencing of cytochrome b amplicons is used to identify the presence of
bullfrogs in freshwater samples.
Microbial samples collected from the burned and unburned land soil sites and the
gene used for DNA barcoding analysis were the 16S rRNA gene (Natalie 2013).
Microbial samples collected from the burned and unburned land soil sites were
analysed for DNA barcoding with 16S rRNA gene. The data has shown that the
microbes found in the unburned samples are less prominent indicating that many of
soil Archaea microbes have moved quickly after the fire and stablized (Natalie
2013). Barcoding of selected enterobacterial species using fluorescent proteins pro-
vides a simple and speedy method for distinguishing species identity (Thao et al.
2013).
In oceans, microbial life is accountable for 98% of the primary production (Sogin
et al. 2006). Microbes cause numerous diseases (Galimberti et al. 2013). The
increasing community genomics studies and the metagenomics approaches assure
real insights on prokaryote biodiversity and molecular evolution. Microbes relied
heavily on gene-centric metagenomic profiling using two genes (16S rRNA and
60 kDa chaperonin protein (cpn60)) to recognize and identify the bacteria. Links
et al. (2012) evaluated DNA barcodes for bacteria from the 16S rRNA gene and the
protein coding cpn60 gene. Assembling consensus sequences for barcodes was
shown to be a reliable method for the tracking and identification of novel microbes
in metagenomic studies. The most commonly used barcode gene of bacterial and
archaeal community is 16S rRNA marker as a barcode to quantify microbial
community structure from environmental samples based on DNA sequences. To
examine protist diversity in freshwater samples, amplicons of 18S rDNA were used
(Medinger et al. 2010).
DNA Barcoding Significance and Utilities 21
Airborne Allergen Monitoring Plant pollen is one of the major allergens which
affects the economy of individual person and work (D’Amato et al. 2005; Davies
et al. 2015). Symptoms from this pollen vary based on its origin and its concentra-
tion. Pollen monitoring programs are available which are volumetric pollen sam-
plers, whirling arm samplers, or passive samplers. Hirst-type volumetric samplers
are used frequently by national pollen monitoring networks that immobilize air
particulates on sticky tape (Scheifinger et al. 2013). DNA metabarcoding has been
used to identify species of airborne pollen successfully for the monitoring of
allergens in the Netherlands (Kraaijeveld et al. 2015). Pollen samples were obtained
by volumetric samplers, and an increased taxonomic resolution among classifica-
tions was achieved using DNA metabarcoding, while microscopy could only iden-
tify pollen to family level in many cases. In addition, aerobiological monitoring not
only identifies pollen but also identifies its taxa for occurrences of certain bacteria or
fungi of health interest.
Epidemiology Epidemiologists can use DNA barcoding to zero the effects of mos-
quitoes, tsetse flies, or the feces of migratory birds most responsible for the spread of
infectious disease. This helps not only epidemiologists to take more effective action to
protect human health; it can also help limit negative side effects of activities of
spraying of pesticides. And it gives confidence to tropical countries combating
endemic diseases and gives caution to outbreak of possible epidemics (Muturi et al.
2011). To identify disease vectors, DNA barcoding allows nonecologists to identify
the vector species that can cause serious infectious diseases to animals and humans and
helps to understand the mentioned diseases and its cure. A global initiative of building
22 S. Daravath et al.
a mosquito barcoding reference library helps public health officials to control vector-
borne diseases more effectively and with very less use of insecticides.
Drug Resistance DNA barcoding helps to trace beneficial mutations in 25,000
yeast lineages in a larger community. Through NGS, scientists were able to know
which lineage and mutations were increased and decrease at 8th generation of yeasts.
Sherlock and his colleagues calculated that about 25,000 beneficial mutations with a
fitness effect of more than 2% were established by generation 112. The conclusion is
that with the time period taken for beneficial mutation, the population of yeast is
established and the fitness is due to the influence of mutation. A high-resolution
lineage-tracking tool can be used to track pathogenic microbes, cancer cell lines, and
animal tumor models. By combining whole genome sequencing, lineage and track-
ing a powerful method is offered to characterize the mutational spectrum underlying
evolution, disease progression, and drug resistance (Genome Web).
Food Provenance and Quality Pollen DNA metabarcoding has application in the
studies of food provenance and quality. Pollen is a nearly omnipresent environmen-
tal biomarker, and most foodstuffs are likely to contain pollen that can be used to
trace the product’s potential time of origin and its geography. This application is
used to trace the geographic and botanical origins of honey from flowers. Honey is a
high-value nutritional product and its taste, food quality, and safety differ depending
on the plants the honeybees have foraged upon (Crane 1975).
Product labeling guidelines are necessary to prevent mislabeling and often require
the floral source of commercially sold honey to be declared (Bruni et al. 2015).
Honeys are labeled as monofloral or multifloral by nectar and pollen from a single or
multi-plant species. Honeys are classified as monofloral if the pollen content of one
species is greater than 45% (Anklam 1998). Honey from one plant species often have
higher commerical value and therefore have a greater possibility of adulterations and
incorrect labeling (Persano Oddo and Bogdanov 2004). Safety and quality of honey
is very crucial because pollen from poisonous plants can sometimes be found within
honey, for example, Atropa belladonna (Bruni et al. 2015). Hepatotoxic
pyrrolizidine alkaloids (PAs) have been detected in honeys after bees have foraged
on plants within the Boraginaceae (Edgar et al. 2002) and grayanotoxins arising
from Rhododendron spp. (Koca and Koca 2007). Thus botanical profile of honey is
therefore very important to ensure that products are of high quality and safe for the
user (Olivieri et al. 2012).
Food Labeling, Food Safety, and Food Piracy In food industry and food supply
chains, cpDNA and mtDNA barcoding are used for food labeling, food safety, and
food piracy in processed products. Since several copies of extranuclear genomes are
within each cell, this technique helps to identify the information of degraded samples
or transformed materials derived from crops and livestock species. The technology is
used for routine analyses to maintain food safety and quality. DNA barcoding can be
DNA Barcoding Significance and Utilities 23
crucial because it can verify the presence/absence of the original species and to
identify the nature of the replaced species. One of the most striking substitution cases
ever revealed refers to fish meat (e.g., sold as slices, fillets, blocks, surimi, fish sticks,
and fins).
Adulterants in foods are or adding any substance to increase weight and lose its
original nature and appears to be better than original products. Most adulterants are
economically motivated misbranding and mislabeling, fakes based on simulation
processes and imitation products. When the adulterants are toxic or allergenic,
serious public health consequences may result. In this case, the food mislabeling
not only cheats the users, but it also causes intolerance or allergies to certain foods or
their components. The most frequent incidents as per the literature from 1980 were
grouped into 11 food categories: fish and seafood, dairy products, fruit juices, oils
and fats, grain products, honey and other natural sweeteners, spices and extracts,
wine and other alcoholic beverages, infant formula, plant-based proteins, and other
food products (Barcaccia et al. 2015).
5.2.4 Forensics
Forensic palynology is the use of pollen to link persons or objects which related to
particular place and time (Mathewes 2006). This technique is of great utility to
forensics because (1) pollen is a nearly ubiquitous feature of the environment;
(2) different geographic locations have different pollen signatures, allowing for
inference related to spatial tracking; (3) plants flower at different times, allowing
for temporal inference; and (4) pollen is extremely durable and thus the sample can
be studied any time (Walsh and Horrocks 2008). DNA metabarcoding application
could very likely increase its usage at broader range of situations (Bell et al. 2016).
For example, on combining DNA barcoding with a universal database of geographic
and temporal knowledge of plants (Goodman et al. 2015), the database could
potentially be useful in forensics. In any kind of genetic analysis, DNA barcoding
requires damaged pollen grain samples; this issue can be addressed to split pollen
samples into partitions for DNA extraction, morphological examination, and per-
manent storage. The results of the DNA barcoding and morphological identifications
are compared for its consistency and accuracy. In the future DNA metabarcoding
will use pollen as a biomarker, which provides forensic scientists to ensure global
security and justice.
5.2.5 Biomedicine
1.D. The copyright laws of the place where you are located also
govern what you can do with this work. Copyright laws in most
countries are in a constant state of change. If you are outside
the United States, check the laws of your country in addition to
the terms of this agreement before downloading, copying,
displaying, performing, distributing or creating derivative works
based on this work or any other Project Gutenberg™ work. The
Foundation makes no representations concerning the copyright
status of any work in any country other than the United States.
1.E.6. You may convert to and distribute this work in any binary,
compressed, marked up, nonproprietary or proprietary form,
including any word processing or hypertext form. However, if
you provide access to or distribute copies of a Project
Gutenberg™ work in a format other than “Plain Vanilla ASCII” or
other format used in the official version posted on the official
Project Gutenberg™ website (www.gutenberg.org), you must, at
no additional cost, fee or expense to the user, provide a copy, a
means of exporting a copy, or a means of obtaining a copy upon
request, of the work in its original “Plain Vanilla ASCII” or other
form. Any alternate format must include the full Project
Gutenberg™ License as specified in paragraph 1.E.1.
• You pay a royalty fee of 20% of the gross profits you derive from
the use of Project Gutenberg™ works calculated using the
method you already use to calculate your applicable taxes. The
fee is owed to the owner of the Project Gutenberg™ trademark,
but he has agreed to donate royalties under this paragraph to
the Project Gutenberg Literary Archive Foundation. Royalty
payments must be paid within 60 days following each date on
which you prepare (or are legally required to prepare) your
periodic tax returns. Royalty payments should be clearly marked
as such and sent to the Project Gutenberg Literary Archive
Foundation at the address specified in Section 4, “Information
about donations to the Project Gutenberg Literary Archive
Foundation.”
• You comply with all other terms of this agreement for free
distribution of Project Gutenberg™ works.
1.F.