Professional Documents
Culture Documents
425
MOLECULAR
GENETICS II
Overview of the Composition and Chemical
Structure of Nucleic Acids
THE FLOW OF GENETIC INFORMATION
A. The Central Dogma
2. Transcription 3 Translation
DNA RNA PROTEIN
1.Replication 1. Replication (DNA Synthesis)
2. Transcription (RNA Synthesis)
DNA 3. Translation (Protein Synthesis)
THE FLOW OF GENETIC INFORMATION
• Genetic diseases occur because of mutations in DNA
4. Reverse Transcription
2. Transcription 3 Translation
DNA RNA PROTEIN
1.Replication
DNA
Be familiar with
the structures of
the purine bases,
adenine (A) and
guanine (G); and
the pyrimidine Thymine (T)
bases, thymine (T)
and cytosine (C).
DNA methylation
Nucleotide
ii). Structure
Structure of the
of the DNA
DNA double
polynucleotide helix
chain
5’
3’
• polynucleotide chain
• 3’,5’-phosphodiester bond
Strength of A-T and G-C Base pairs
Chargaff’s rule: The content of A equals the content of T, and the content of G
equals the content of C in double-stranded DNA from any species.
. • Only 4 nucleotides (bases)
make up all DNA:
A (adenine) -T
C (cytosine) -G
T (thymine) -A
G (guanine) -C
Major groove
Minor groove
“B” DNA
3’ 5’ 3’ 5’
DNA is Double-Stranded
•Base-pairing between the complementary strands is
required for two important functions of DNA:
1) DNA replication involves an unwinding of the double
helix (right) followed by synthesis of a complementary
strand from each of the unpaired template strands,
and
2) DNA serves as a template for RNA synthesis by utilizing
the information in one strand to code for a
complementary RNA strand.
FORMS OF DNA
DNA in the "B" form has a major groove and a minor
groove, and has 10 base pairs per one turn of the double
helix. (Other forms include, A, C, D, E and Z-DNA)
DNA that is overwound (A-form – 11bp per turn with a
diameter of 2.3nm; Z-DNA -12bp; ) or underwound (C-
DNA, 9.3bp), with fewer than or more than 10 base pairs
per turn, is said to be "supercoiled".
Z-DNA is left-handed double helical structure in which the helix winds to the left
in a zigzag pattern, instead of to the right .
It should also be noted that the complementary strands in double helical DNA are
antiparallel with respect to each other.
Each polynucleotide chain has a 5' end and a 3' end running in opposite directions.
FORMS OF DNA
Form Direction of Helix Base Pairs Distance Diameter of
Per Turn Between bp helix (Å)
(Å)
A Right-handed 11 2.6 23
B Right-handed 10 3.4 19
C Right-handed 9.3 3.3 19
Z Left-handed 12 3.7 18
THE GENETIC
CODE
The genetic code is the set of rules by which information encoded in
genetic material (DNA or RNA sequences) is translated into proteins
(amino acid sequences) by living cells.
It defines a mapping between tri-nucleotide sequences called codons
and amino acids; every triplet of nucleotides in a nucleic acid
sequence specifies a single amino acid.
Properties of the Genetic Code
1. The code is a triplet codon: Each codon consists of three successive nitrogenous
bases.
2. The code is non-overlapping: A base in a mRNA is not used for different codons.
3. The code is commaless: after one amino acid is coded, the second amino acid will
be automatically, coded by the next three letters.
4. The code is non-ambiguous: Non-ambiguous code means that a particular codon
will always code for the same amino acid.
5. The code has polarity: The code is always read in a fixed direction, i.e., in the
5’→3′ direction.
6. The code is degenerate: More than one codon may specify the same amino acid;
this is called degeneracy of the code. E.g., UCU, UCC, UCA and UCG code for serine
7. Some codes act as start codons: AUG codon is the start or initiation codon, i.e.,
the polypeptide chain starts either with methionine (eukaryotes) or N- formylmethionine
(prokaryotes). In rare cases, GUG also serves as the initiation codon, e.g., bacterial
protein synthesis. Normally, GUG codes for valine, but when normal AUG codon is lost
by deletion, only then GUG is used as initiation codon.
8. Some codes act as stop codons: Three codons UAG, UAA and UGA are the
chain stop or termination codons
9. The code is universal: Same genetic code is found valid for all organisms.
DNASES
Deoxyribonucleases (or DNases) are enzymes that
cleave phosphodiester bonds.
Some are used for constructive purposes, such as
proofreading during DNA replication, whereas others
are used to degrade DNA.
There are two basic classes of DNases:
1. exonucleases and 2. endonucleases.
Exonucleases remove only the terminal nucleotide,
whereas Endonucleases cleave anywhere within the
DNA double helix.
Chemistry of DNA
Forces affecting the stability of the DNA double helix
• hydrophobic interactions - stabilize
- hydrophobic inside and hydrophilic outside
• stacking interactions – stabilize
- relatively weak but additive van der Waals forces
• hydrogen bonding - stabilize
- relatively weak but additive and facilitates stacking
• electrostatic interactions - destabilize
- contributed primarily by the (negative) phosphates
- affect intrastrand and interstrand interactions
- repulsion can be neutralized with positive charges
(e.g., positively charged Na+ ions or proteins)
Stacking interactions
Charge repulsion
Charge repulsion
Denaturation of DNA
Double-stranded DNA Strand separation
and formation of
single-stranded
random coils
Cooperative unwinding
of the DNA strands
When the strands of DNA separate, the DNA is said to
be denatured (when high temperature is used to
denature DNA, the DNA is said to be melted).
Absorbance maximum
for single-stranded DNA
Absorbance
Absorbance maximum for
double-stranded DNA
100
Percent hyperchromicity
50
50 70 90
Temperature oC
• Tm is the temperature at the midpoint of the transition
DNA Melting Curve
• Hyperchromicity can be used to follow the denaturation of DNA
as a function of increasing temperature. As the temperature of a
DNA solution gradually rises above 50⁰C, the A-T regions will melt
first giving rise to an increase in the UV absorbance. As the
temperature increases further, more of the DNA will become
single-stranded, further increasing the UV absorbance, until the
DNA is fully denatured above 90⁰C.
• The temperature at the mid-point of the melting curve is
termed "melting temperature" and is abbreviated Tm. The Tm for
a DNA depends on its average G+C content: the higher the G+C
content, the higher the Tm.
• Note: G+C content, G-C content, and GC content are equivalent
terms, that is, they are used interchangeably.
The 2+4 Rule: Tm = 2 X (A+T) + 4 X (G+C).
Tm is dependent on the G-C content of the DNA
Percent hyperchromicity
E. coli DNA is
50% G-C
50
60 70 80
Temperature oC
This slide shows the dependence of Tm on avg G+C content of 3 different DNAs. Under the conditions
used in this experiment, E. coli DNA which has an avg G+C content of about 50%, melted with a Tm of
69ºC. The curve on the left represents a DNA with a lower G+C content and the curve on the right
represents a DNA with a higher G+C content. Tm is dependent on the ionic strength of the solution. At
a fixed ionic strength there is a linear relation between Tm and G+C content.
DNA Renaturation…
• Denaturation of a DNA double helix produces single-stranded DNA.
•The reverse reaction (the formation of double-stranded DNA) can be
carried out if the complementary DNA strands are incubated under
conditions that will promote renaturation (reassociation).
• This process involves two steps.
•The first step is a slower, rate-limiting reaction in which the
complementary strands attempt to find each other. The single
strands randomly interact until complementary base pairing can
occur.
•This may occur over only a short region of each strand, but may be
enough to "nucleate" the reaction. Because the two reacting strands
are at the same concentration in solution, the reaction is second-
order, with the rate of the reaction being defined by the second-order
rate constant, k2. Once nucleation has occurred, the complementary
strands rapidly zipper up.
DNA reassociation (renaturation)
Double-stranded DNA
Denatured,
single-stranded
DNA
Faster,
zippering
reaction to
form long
k2 molecules
of double-
Slower, rate-limiting,
stranded
second-order process of
DNA
finding complementary
sequences to nucleate
base-pairing
Polymerase Chain Reaction (PCR)
DNA RENATURATION...
Secondly, the complexity of a DNA is a function of how many
base pairs it has. In other words, a low complexity DNA may
consist of a few hundred or a few thousand base pairs, in contrast
to a high complexity DNA that may contain millions of base pairs.
TTAGGGTTAGGGTTAGGGTTAGGG
Interspersed repeats are sequences that are repeated many times and scattered
throughout the genome. In contrast, tandem repeats are sequences that are
repeated many times adjacent to each other. The latter are usually found in the
centromeres and telomeres of chromosomes
Biology of the RNA
RNA Structure
The major bases found in DNA and RNA
DNA RNA
Adenine Adenine
Cytosine Cytosine
Guanine Guanine
Thymine Uracil (U)
Tertiary structure
RNAs frequently adopt secondary structure and tertiary structure in order to carry
out their functions. WHY?
Types And Functions Of RNA
Of the many types of RNA, the three most well-known and most commonly studied
are 1. messenger RNA (mRNA), 2. transfer RNA (tRNA), and 3. ribosomal RNA (rRNA).
In protein synthesis, mRNA carries genetic codes from the DNA in the nucleus to
ribosomes, the sites of protein translation in the cytoplasm. Ribosomes are
composed of rRNA and protein. The ribosome protein subunits are encoded by
rRNA and are synthesized in the nucleolus. Once fully assembled, they move to the
cytoplasm, where, as key regulators of translation, they “read” the code carried by
mRNA. A sequence of three nitrogenous bases in mRNA specifies incorporation of a
specific amino acid in the sequence that makes up the protein. Molecules of tRNA
(sometimes also called soluble, or activator, RNA), which contain fewer than 100
nucleotides, bring the specified amino acids to the ribosomes, where they are linked
to form proteins.
Types And Functions Of RNA
(Asgari 2011)
Circular RNA (circRNA)
Circular RNA (circRNA) is unique from other RNA types because its 5′ and 3′
ends are bonded together, creating a loop. The circRNAs are generated
from many protein-encoding genes, and some can serve as templates for
protein synthesis, similar to mRNA. They can also bind miRNA, acting as
“sponges” that prevent miRNA molecules from binding to their targets. In
addition, circRNAs play an important role in regulating the transcription
and alternative splicing of the genes from which circRNAs were derived.
+1
introns (between exons)
transcribed region
mRNA structure
5’ 3’
translated region
GENE STRUCTURE …
•RNA processing (we shall see this later) then removes the
intron sequences, "splicing" together the exon sequences to
produce the mature mRNA.
•Note: There are untranslated regions at the 5' and 3' ends of
mRNAs that are encoded by exon sequences but are not
directly translated.
GENE STRUCTURE …
• The next slide shows examples of the wide variety of gene
structures seen in the human genome. Some (very few) genes do
not have introns. One example is the histone genes, which encode
the small DNA-binding proteins, histones H1, H2A, H2B, H3, and
H4. Shown here is a histone gene that is only 400 base pairs (bp)
in length and is composed of only one exon.
• The beta-globin gene has three exons and two introns.
• The hypoxanthine-guanine phosphoribosyl transferase (HGPRT
or HPRT) gene has nine exons and is over 100-times larger than
the histone gene, yet has an mRNA that is only about 3-times
larger than the histone mRNA (total exon length is 1,263 bp).
•This is due to the fact that introns can be very long, while exons
are usually relatively short. An extreme example of this is the
factor VIII gene, which has numerous exons (the blue boxes and
blue vertical lines).
The (exon-intron-exon) structure of various genes
histone
-globin
HGPRT
(HPRT)
total = 42,830 bp; exons = 1263 bp
(hypoxanthine-guanine phosphoribosyl transferase)
factor VIII
Nuclear genome
• the haploidhuman genome has ~3 X 109 bp of DNA
• single-copy DNA comprises ~75% of the human genome
• the human genome contains ~30,000 to 40,000 genes
• most genes are single-copy in the haploid genome
• genes are composed of 1 to >75 exons
• genes vary in length from <100 to >2,300,000 bp
• Alu sequences are present throughout the genome
Mitochondrial genome
• circular genomeof ~17,000 bp
• contains <40 genes
Principles of Anotomy and Physiology (2006). John Wiley & Sons
TELOMERES AND AGING
•The chromosome contains a single, long molecule of
double stranded DNA, and thus has two ends.
•These ends create two problems:
•they are difficult to replicate and
•they have a tendency to fuse with other
chromosome ends causing karyotypic
rearrangements.
•To prevent these problems, chromosomes have
protective ends called "telomeres" that are
composed of tandemly repeated, 5-8 bp sequences
up to 12 kb in length.
TELOMERES AND AGING
In germline cells and in the cells of young individuals,
telomeres are of maximal length, but with every round of somatic
cell division telomeres get a little shorter.
<1 to >12 kb
telomere structure
(TTAGGG)many
young
(TTAGGG)few
senescent