Professional Documents
Culture Documents
1. Learning Outcomes
2. Introduction
3. Detection of SNPs
5. Summary
2. Introduction
SNPs are single base pair positions in genomic DNA at which different sequence alternatives
(alleles) exist in normal individuals (s), wherein the least frequent allele has an abundance of
1% or greater. The structure of a SNP is very simple. SNP loci are abundant in the human
genome and can be found about once every 1000 bp hence, each polymorphic SNP locus has
other polymorphic SNP loci nearby. Given that the human genome is 3.2 billion bps long, we
can estimate that there will be approximately 1 million differences between two genomes that
are due to SNPs: this represents approximately 85% of human genetic variation. The vast
majority of polymorphisms intrinsically limits the information that can be gained from the
analysis of any given SNP, and this has been the major factor limiting their application to
forensic analysis: 50 to 80 SNPs are required to achieve the same levels of discrimination as
the current STR based methods. However, the vast numbers of SNPs within the genome
(currently over 10 million SNPs have been placed in public databases can compensate for the
limited information carried by any individual SNP, and make them a tempting polymorphism
to exploit. The technology that is being used for SNP detection is evolving and SNP analysis
is becoming possible in many forensic laboratories.
3. Detection of SNPs
There are many techniques available for the resolution of SNPs. Restriction digestion can be
used to genotype SNPs when the SNP either creates or destroys a particular restriction
enzyme recognition sequence, but the method is limited for forensic casework because it
needs a large amount of DNA and is a long and laborious process.
Sanger sequencing
Sanger sequencing, also known as chain-termination sequencing, was developed in the late
1970s and is a milestone in the development of molecular biology. The sequencing takes
advantage of the biochemistry of DNA replication. The first stage of the analysis is to
amplify the target region using PCR; amplified products are then used as the template in a
sequencing reaction. The DNA sequencing reaction is similar to PCR amplification and the
reaction mixture is very similar, containing the thermophilic Taq DNA polymerase and
deoxynucleotide triphosphates (dNTPs). It differs from PCR in that only one primer is used
and, in addition to the dNTPs, there are four fluorescently labelled dideoxyribonucleotides
(ddNTPs); each ddNTP is labelled with a different coloured dye. The ddNTPs differs from
normal dNTP in that it has 3´-H rather than 3´-OH on the ribose sugar and the former has
hydroxyl group on the 3´carbon, which prevents any extension of the DNA molecule.
The concentration of dNTPs is higher than ddNTPs and therefore in most cases a dNTP is
added. The ddNTPs are incorporated at random intervals along the molecule. This produces a
range of different sized molecules. The products of the sequencing reaction are analysed
using capillary gel electrophoresis systems, such as the ABI PRISM Genetic Analysers, that
separates DNA to single base pair resolution and can simultaneously detect four different
fluorescent labels.
Sequencing is not a practical option for the analysis of SNPs in a forensic context. Most
SNPs are widely dispersed throughout the genome and a separate reaction is required for each
SNP. An exception is the mitochondrial genome, where a number of SNPs are concentrated
into a small area and can be analysed in a small number of reactions. Sequencing has also
been a powerful method to type SNPs within rapidly evolving regions of DNA in the HIV
virus.
Restriction digestion and sequence analysis are not viable methods to use for most forensic
cases that might require the analysis of 50 to 80 SNPs dispersed around the genome. A
number of methods have evolved that can be applied for the detection of SNPs.
Primer extension
Primer extension is a robust method for discriminating between different alleles and several
methodologies have been developed. One of the commonly used methods is the mini-
sequencing reaction. The basis of the reaction is very similar to Sanger sequencing.
By using different sized primers and different fluorescent tags for each of the four bases, a
large number of SNPs can be simultaneously detected.
Variations on the primer extension technique include pyro sequencing; microarrays, where
the extension primers are attached to a silicon chip; and allele specific extension, when the
primer is only extended if it is 100% complementary to the target sequence
Fig 3: The primer extension assay. (a) The target sequence is amplified using PCR and
the products are used as the template in the extension assay; (b) an internal primer
hybridizes to the target adjacent to the SNP and a single fluorescently labelled ddNTP is
added by Taq polymerase; (c) the reaction is analysed by capillary electrophoresis
Under stringent conditions, even one nucleotide mismatching between a template and primer
can differentiate between two alleles. There is a large number of methods that exploit the
hybridization of probes, including reverse dot blots; Taqman assays; Light Cycler assays;
molecular beacons; and Gene Chips.
A vast amount of data is available on the different SNPs in the human genome and one of the
biggest tasks when applying SNPs to forensic applications is to select the most appropriate
SNPs from the overwhelming numbers that are available. The choice of SNPs is very much
dependent on the application.
Forensic identification
The vast majority of forensic DNA analysis involves the characterization of biological
material recovered from the scene of a crime. Several panels of SNPs have been developed
that are designed to provide maximum discrimination powers for forensic identification.
These contain SNPs that are polymorphic in all major population groups. A panel, containing
52 SNPs is used for identification which produce match probabilities ranging from 5.0×10−
19 in Asian population. However, even with the high discrimination power, the effort
involved in analysing 50 SNPs is greater than when undertaking standard STR analysis. The
major attraction of using SNPs with the current technology is that SNP analysis can provide
results from highly degraded DNA when conventional STR profiling has failed.
5. Summary
SNPs are single base pair positions in genomic DNA at which different sequence
alternatives (alleles) exist in normal individuals in some population(s), wherein the
least frequent allele has an abundance of 1% or greater. SNPs are found in the human
genome about once in every 1000 bp.
Primer extension is a robust method for discriminating between different alleles The
basis of the reaction is very similar to Sanger sequencing. The first part of the
procedure is to amplify the target region using PCR. An internal primer then anneals
to the denatured PCR product; the 3´ end of the primer is adjacent to the polymorphic
site.
Under stringent conditions, even one nucleotide mismatching between a template and
primer can differentiate between two alleles. Several methods that exploit the
hybridization of probes, including reverse dot blots; Taqman assays; Light Cycler
assays; molecular beacons; and Gene Chips are available.
The great advantage of STRs over SNPs is their power of discrimination due to the
large number of alleles they have in comparison with biallelic SNPs.