Professional Documents
Culture Documents
Reagents (1) Primer anneal1ng and chain extension (2) ddNTP binding and chain terminaticr
dT cctTF dTP
TTTTT TTTTTTT
eSF .
TT T
TITITTTTITTT
3 Fuorescently labelled DNk 3ample (4) Capillary gel electroplhoresis and (5) Sequence analy sis and reconstruciicn
fluorescence deteciion
GAC
TTTTTTTTmTTTITTTTTT DNA Sanple
TTTTTTTTTTTTTTTU
liiTT
TTTTTTTTO3
Detector
TTT
generated. 4- Labeled DNA Jragments are separated bused on noleculur nass by capillarv gel
Reagents OPrimer annealing and chain extensan 3dNTP >ndng and chasin terntee
adTTF
Titliif
3 Fuorescenty labeled DN smple Capillan gel etectrcohoresis an Squenre anai, is and rconshciea
fiuorescence deteeivan
iitTiEiTTUTTTUTTTT
TTTITTTTTTTTTTT
at much lower
recurring cost.
Nexi-generation sequencing (NGS), also known as high-throughput sequencing, is the caich-
all term used to describe a
number of different modern
These technologies allow for sequencing technologies.
sequencing of DNA and RNA much more
quickly and
han the
previously used Sanger sequencing, and as such revolutionised the study of cheaply
and molecular
biology. Such technologies include: genomics
Advantages of NGS
NGS can be used to analyse DNA and RNA samples and is a popular tool in functional
genomics. In contrast to nmicroarray methods, NGS-based approaches have several advantages
including:
a
priori knowledge of the genome or genomic features is not required
it offerssingle-nucleotide resolution, making it possibie to detect related genes (or features).
altenatively spliced transcripts, allelic gene variants and single nucleotide polymorphisms
higher dynamic range of signal
requires less DNA/RNA as input (nanograms of materials are sufficient)
higher reproducibility
NGS technologies
Roche 454 sequencing: This method is based on pyrosequencing, a technique which detects
pyrophosphate release, using light signal (bioluminescence), atter nucleotides are incorporated
by polymerase to a new strand of DNA. Roche 454 sequencing platform has been discontinued
since 2016.
lon Torrent: Proton / PGM sequencing: lon Torrent scquencing measures the direct release of
H+ (protons) from tne incorporation of individual bases by DNA polymerase and therefore
differs from the previous two methods as it does not measure light.
2
1972:Sanger started work on DNA sequencing
sequencing
9r.Sanger developed Oindeoxy thain terminaton method of DNAof DIA SEqUercug
1977.Maxan and Gitbert developed chemic al degradation method
First 1977 First ONA based gename sequenc ed (x 1/4 harteriophage
9 9 5 First bacterium Maemophllus infionzoo was sequenced by sholgun methoc
h
Generation 996 Applied Biosystems developed automated DNA sequencing b a s e d on S a r g e r s i
Te:hndoges
21 Re'ease I : l © N ala!fom by No G Nan3 201e
GridON & Sm agiON X3 olarfonis by Ortorc
2017 Reease ct FrolathON.
Fourth Mancpo3 lcg.es
p'altt'm by -sord Nanopore ecnrc ag e
Generation 213 CorTerc alza: cn ci PrckletT-©A
y Enuls:on 6 Bndge
Sequencing by S) n!hess Soid Stte uleclu0n
CR Amphicaton
1
harpore
dcicciivii
>y >Lcr d i i i
Covai dppiiuaiio
icSuy,
3
End Repair 8
Fragmentaüon (Adaptve Adapter Ligation
SampeGencme Focused Acousbe Sheartng)
Clonal Amplificaton
by Bridge PCR
Cional Ampiificaticn
by Emusion PCR
DNA I s
G.,
L Lscferar
Schematic representation of the basic steps invoived in DNA sequencing using ditterent NGS
platforms
Introduction to NGS
Next-generation sequencing (NGS) is a technology for determining the sequence of DNA or
RNA too study genetic variation associated with diseases or other biologica!
phenomena. introduced for commerciai use in 2005, this method was initially calied
"massively-parallel sequencing", because it enabled the sequencing of many DNA strands at
the same time, instead of one at a time as with traditional Sanger sequencing by capillary
electrophoresis (CE).
Each of these technologies has utility in today's genetic analysis environment. Sanger
sequencing is best for analyzing small numbers of gene targets and samples and can be
accomplished in a single day. It is also considered the gold-standard sequencing technology.
so NGS results are often verified using Sanger sequencing. NGS enables the interrogation of
hundreds to thousands of genes at one time in multiple samnples, as well as discovery and
analysis of different types of genomic features in a single sequencing run, from single
nucleotide variants (SNVs), to copy number and structural variants, and even RNA fusions.
NGS pro, ides the ideal throughput per run, and studies can be performed quickly and cost-
effectively. Additional advantages of NGS include lower sampie input requireneuis, iigier
accuracy, and ability to detect variants at lower aliele frequencies than with Sanger sequencing
1. Construct library
is
sequencing "library must be created iîrom the sample. The DNA (or cDNA) sanple
the
processed into reiatively short double-stranded tragments (100-800 bp). Depending on
ronal m nlifingtinv
synthesis.
. Analyze data
of short DNA
of conmplex data consisting
Each NGS experiment generates large quantities and data anaiysis toois, they
has its own aigorithms
reads. Although each technology platform evaluate the quality of NGS
data
share a similar analysis 'pipeline
and use c o n m o n metrics to
seis.
nd tertiary analy sis (Figure
be divided into three steps: primary, secondary.
Anaiysis instrument detectors into digitized
can
PrmaryAnalisis pucFASTa
TerergAalys
Antarerotaandtresul
ouaudnedorinab Qput;3A
Aited
reads andRUiy
overview
Figure 3. NGS analysis pipeline
is typically
NGS data and associated
algorithms, NGS analysis
the complexity of don't have specialized
Due to users who
bioinformatics specialists. To empower intuitive
performed by ike lor "Torrent have created user-friendly.
bioinformatics training. platforms shiiis tw gel iesuiis. ioi a
6
Overview of methods
(NGS) methods lo cno
have ultiple next-generation sequencing
uenomes researchers methods include wn
their studies. General NGS
rom when designing and implementing subdivided into exome
which is futher
and targeted sequencing. they can all yiela
Sc e Scquencing panels (see table below). Although While whole
o r region-specific
Sequencing and gene far from interchangeable.
interest to a researcher. they targeted next
are
Highly tlexible,
Most comprehensive
1% of human customizable
Pros genome, much
genome coverage
of less data to designs
Detect widest range Data is focused
anaiyze than
features: SNVs, specitically on
indels. structural and WGS regions/genes of
copy nunmber variants,
Faster
workilow than interest
regulatory elements
Lowest sample input
PCR WGS
No bias from (10 ng)
amplification or probe Multiplexing Compatible with
small number
hybridization FFPE tissue
of samples
Best for discovery Multiplexing large
Medium sample
research numbers of samples
input (50 ng-1
Better for detecting
ug depending
rare alleles
on library prep
method)
SIlaii nuiber
of genes
Speed/ Slowest Medium Fastest
return of
results
Data
volume Largest Mediun Smallest
When to
use
Complete coverage of Disease- Clinical sequencing
genome needed specific
De oTO assembly Disease-specific
research research projects
scovery of projects
L.known genomic
IVD testing
Clinical LDT development
variants causing a
disease sequencing Inherited disease
Aneupioidy detectioon Oncology
preimplantation Immune repertoire
geneie testing) Liquid biopsy
SangerScquencing:
What s Sanger sequcncing2
Sanger sequcncng. also known as the "chain termination nnethod," was devetoped
by the Englisochcmist Frederick Sanger and his colleagues in 1977. This method
is based on amplitication ofthe IDNA fragment to be sequenced by DNA polymerase
and incorporation of modified nucleotides specifically, dideoxynucleotides
ddNTPs).
This method is designed for dctermining the sequence of nucleotide bases in a piece
of DNA (commonly less than 1.000 bp in length). Sanger sequencing with 99.99%
base accuracy is consicored the "gold standard" for validating DNA sequences,
including those already quenced through nexl-generalion sequencing (NGS).
Sanger sequencing was ised in the Human Genome Project to delermine the
sequences of relatively small fragments. of human DNA (900 bp or less). These
fragments were used assemble larger DNA fragments and, eventually, entire
chromosomes.
-oCH Base
H H
Dideoxynucleotide (ddNTP)
-OCHa Base
H
OH
Deoxynucleotide (dNTP)
Dideoxy nucleotides are similar to regular. or deoxy, nucleu.ides, but with one kcy
difference: they lack a hydroxyl group on the 3" carbon
of the sugar ring. In a regular
nucleotide, the 3' hydroxyl group acts as a "hook," allowing a new nucieotide to be
added to an existing chain.
SangerSequencing Steps
The Sanger sequencing method consists of 6 steps:
1) The double-stranded DNA (dsDNA) is denatured into two single-stranded DNA
(ssDNA).
(2) The DNA sample is divided into four separate sequencing reactions.
containing
ail four of the standard deoxynucleotides (dATP, dGiTP, dCTP and dTTP) and
the DNA polymerase. To each reaction is added only one of the four
dideoxynucleoides (ddATP, ddGTP, dCTP, or ddTP). The dúNTPs may be
radioactively or fluorescently labelled for detection in automated sequencing
machines.
(3) Primer that corresponds to one end of the sequence and poBymerase solution are
added to all four tubes.
(4) The DNA synthesis reaction initiates and the chain extends until a termination
nucleotide is randomly incorporated.
(6) The denatured fragments are separated by gel electrophoresis and the sequence
is determined. This is frequently performed using a denaturing polyacrylamide-
urca gel with each of the four reactions run in one of four individual lanes (lanes
A. T. G.C). The DNA bands may then be visualized by auloradiography or UV
light and the DNA sequence can be directly read off the X-ray film or gel image.
Heat
-
AGTCATGAGTCC5 Template
5 3 Punor
1 2 3 4
5 -AGTCATGAGTCCS
3
TGTT
ATP
GTP dATP ATP
DNA
ol Tol
DNA ToTPrimer
DNA
dCTP
CTCP
dGTP
dCTP dGTP Fol
dAT
PolDNA Polymeras
OTCP dTTP dATP
dTCP dCTP drEE. dTTP Eree
TdATP SdGTP doTp dcy?,7deoxynucleotid
dG72 dGTP
ddATP
ddGTP
DNA polymarase ddCTPDideoxynucleot
For example
3 AGTCATGAGTCC
5 TCAddG
3 AGTCATGAGTCC-5
5*TCAGTACTCAcdG
5 AGTCATGAGTCC
TCAGTACTCAGddG
Complementary
chain
Electrophoresis
labeled
lor radiolabelling, or using a primer system
Tor
Osphorus rcading in an optical
sequencing facilitates by
CScent dye. Dye-priner automation The
later development
analysis and
tStCr
more economical
and
labeled ddNTPs
and primers
set the
coworkers luorescently
of imethod
LCTO HoOd and Chain-termination
DNA sequcncing.
Stge automated. high-throughput
ior recently. dyc-terminator
scquencing
chain terminators is
the tour dideoxynucleotide
waveiengtns.
iwhich emit ight at different
of
10 iG
20 TC CGC..G 1GC
THOCCGCCG
OCCcTTTIC r TG ca
C1G
ol a Sanger sequencing
This is an example ot the output DNA bases are
Chromatograph: The tour
Figure: labclled dyc-icrminators. the
fluorescently the sottware to give
read using which are interpreted by
different colours
represented by
DNA sequence above.
up to 584
can sequence
instruments (DNA sequencers)
Automated DNA-sequencing DNA sequencers cary
a day.
(run) in up to 24
runs
STEPS:
poly1nerase chain reaction(PCR)
. Labelling the products of
4. In place ofradioactive labelling, luorescent lables are usedl. The luorescent lables
are attached to the four dideoxyribonucleotides or dedNTPs (ddATP, ddGTP, ddC IP
and ddlTP) used for chain
termination.
. lf
ditferent fhuorochrome is attached to cach of the four ddNTPs, all of them
a