Professional Documents
Culture Documents
• missed/weak associations
• available markers to far away from causal polymorphism for given population
• possible solutions:
• major challenges:
• throughput
• read length
Genome sequencing: classical Sanger sequencing
• same principle
• terminator is labeled
• for each terminator a different fluorescent label is used
• all four reactions are run on one lane of the gel
• fluorescence is measured at end of gel
(liquid capillary gel)
Sanger Sequencing – Sample Preparation
Whole Genome Sequencing Strategy – The Classics
• Sanger Sequencing
• transform E. coli
• re-isolate plasmid
• perform sequencing
reaction
• biggest challenge:
• shorter reads
• amount of data
à assembly
Sequencing by Synthesis – The Next Gen Principle
• two concepts:
• stop reaction after incorporating one
base, measure result
• monitor reaction ‘on the go’
Sequence capture:
6. add primer complementary to one of the
adapters
7. add reversible terminators (each base labeled
with a different fluorescent dye, fluorophore
prevents incorporation of another base)
8. add polymerase to add ONLY ONE base,
wash unincorporated nucleotides off
9. scan flow cell with fluorescent laser
scanner, record first base
10.cleave off fluorescent dye, reverts
incorporated terminator to regular base
11. repeat 7. to 10. to capture subsequent bases
current
• movement blocks channel, leads to
change in current
• induced current change differ
depending on base composition of
• sample prep:
• fragment DNA (200 kb plus)
• add leader to one side
à directs DNA to motor protein at pore
• add hairpin adapter to other side
à allows sequencing the same fragment twice
• still has very high error rate (100 times higher than Illumina)
• BUT:
• very long reads (10,000 times longer than Illumina, 10
times longer than SMRT)
• high throughput (up to 40 GB)
• very fast (up to 500 bp per second, two days per run)
• small instrument footprint
Summary
gene features are defined in the DNA sequence Zhang (2002) Nat. Rev.
resulting gene models are the first annotation of the genome Zhang (2002) Nat. Rev.
Genet. 3:698-709
Gene annotation – Similarity Based Approaches