Professional Documents
Culture Documents
An Overview of Nucleic Acid Chemistry, Structure, and Function
An Overview of Nucleic Acid Chemistry, Structure, and Function
An Overview of Nucleic Acid Chemistry, Structure, and Function
net/publication/227220425
CITATION READS
1 20,830
1 author:
William B Coleman
American Society for Investigative Pathology
144 PUBLICATIONS 4,203 CITATIONS
SEE PROFILE
All content following this page was uploaded by William B Coleman on 29 May 2014.
ix
02_Coleman 6/6/05 6:50 PM Page 13
WILLIAM B. COLEMAN
13
02_Coleman 6/6/05 6:50 PM Page 14
the genome (53,59). Refinement of the human genome sequence, the primary RNA transcripts contain both coding and noncoding
identification and characterization of the genes contained, and sequences. The noncoding sequences must be removed from the
description of the features of the genome (SNPs and other varia- primary RNA transcript during processing to produce a func-
tions) continues at a rapid pace (http://www.nhgri.nih.gov/). In tional mRNA molecule appropriate for translation.
early 2003, it was announced that the Human Genome Project 7. DNA FUNCTION
had completely sequenced 99% of the gene-containing portion of
DNA serves two important functions with respect to cellular
the human genome, and new plans and goals for genomics
homeostasis: the storage of genetic information and the trans-
research were described (60,61).
mission of genetic information. In order to fulfill both of these
6. ORGANIZATION OF GENOMIC DNA functions, the DNA molecule must serve as a template. The cel-
The human genomic DNA is packaged into discreet structural lular DNA provides the source of information for the synthesis
units that vary in size and genetic composition. The structural unit of all the proteins in the cell. In this respect, DNA serves as a
of DNA is the chromosome, which is a large continuous segment template for the synthesis of RNA. In cell division, DNA serves
of DNA (62). A chromosome represents a single genetically spe- as the source of information inherited by progeny cells. In this
cific DNA molecule to which are attached a large number of pro- case, DNA serves as a template for the faithful replication of the
tein molecules that are involved in the maintenance of genetic information that is ultimately passed into daughter cells.
chromosome structure and regulation of gene expression (63). 7.1. TRANSCRIPTION OF RNA Contained within the
Genomic DNA contains both “coding” and “noncoding” linear nucleotide sequence of the cellular DNA is the informa-
sequences. Noncoding sequences contain information that does tion necessary for the synthesis of all the protein constituents of
not lead to the synthesis of an active RNA molecule or protein a cell (Table 1). Transcription is the process in which mRNA is
(54,64). This is not to suggest that noncoding DNA serves no synthesized with a sequence complementary to the DNA of a
function within the genome. On the contrary, noncoding DNA gene to be expressed. The correct start and end points for tran-
sequences have been suggested to function in DNA packaging, scription of a specific gene are identified in the DNA by a pro-
chromosome structure, chromatin organization within the nucleus, moter sequence upstream of the gene and a termination signal
or in the regulation of gene expression (65,66). A portion of the downstream (Fig. 4). In the case of RNA transcription, only one
noncoding sequences represent intervening sequences that split strand of the DNA molecule serves as a template. This strand is
the coding regions of structural genes. However, the majority of referred to as the “sense” strand. Transcription of the sense
noncoding DNA falls into several families of repetitive DNA strand ultimately yields a mRNA molecule that encodes the
whose exact functions have not been entirely elucidated (67,68). proper amino acid sequence for a specific protein.
Coding DNA sequences give rise to all of the transcribed 7.2. REPLICATION OF DNA The double-stranded
RNAs of the cell, including mRNA. The organization of tran- model of the structure of DNA strongly suggests that replica-
scribed structural genes consists of coding regions that are inter- tion of the DNA can be achieved in a semiconservative manner
rupted by intervening noncoding regions of DNA (Fig. 4). Thus, (69–72). In semiconservative replication, each strand of the
02_Coleman 6/6/05 6:50 PM Page 16
Fig. 3. The chemical structure of repeating nucleotide subunits in DNA and RNA. Each panel shows the sugar-phosphate backbone of a single
polynucleotide strand of nucleic acid composed of four nucleotide subunits. The stippled area highlights a 3′–5′ phosphodiester bond.
DNA helix serves as a template for the synthesis of comple- organisms that reproduce sexually, recombination is initiated by
mentary DNA strands. The result is the formation of two com- formation of a junction between similar nucleotide sequences
plete copies of the DNA molecule, each consisting of one carried on the same chromosome from the two different parents.
strand derived from the parent DNA molecule and one newly The junction is able to move along the DNA helix through
synthesized complementary strand. The utilization of the DNA branch migration, resulting in an exchange of the DNA strands.
strands as the template for the synthesis of new DNA ensures 7.4. DNA REPAIR Maintenance of the integrity of the
the faithful reproduction of the genetic material for transmis- informational content of the cellular DNA is absolutely
sion into daughter cells (19). required for cellular and organismal homeostasis (75,76). The
7.3. GENETIC RECOMBINATION Genetic recombina- cellular DNA is continuously subjected to structural damage
tion represents one mechanism for the generation of genetic through the action of endogenous or environmental mutagens.
diversity through the exchange of genetic material between two In the absence of efficient repair mechanisms, stable mutations
homologous nucleotide sequences (73,74). Such an exchange of can be introduced into DNA during the process of replication at
genetic material often results in alterations of the primary struc- damaged sites within the DNA. Mammalian cells possess sev-
ture (nucleotide sequence) of a gene and, subsequently, alter- eral distinct DNA repair mechanisms and pathways that serve
ation of the primary structure of the encoded protein product. In to maintain DNA integrity, including enzymatic reversal repair,
02_Coleman 6/6/05 6:50 PM Page 17
Fig. 4. Basic organization of a structural gene in DNA and biogenesis of mature mRNAs.
nucleoside triphosphate through a 5′–5′ triphosphate linkage, through nuclear processing of precursor RNA transcripts. The
rather than a 3′–5′ phosphodiester bond. Thus, this guanine structure of the tRNA molecule reflects its function as an
residue occurs in the structure of the mRNA in reverse orienta- adapter between the mRNA and amino acids during protein
tion from all the other nucleotides. The modified 5′-terminus is synthesis. Specific tRNAs correspond to each of the amino
referred to as a “CAP” and is the site of additional modification acids utilized in protein synthesis in any particular cell type.
reactions involving the addition of methyl groups. These addi- Although the specific tRNAs differ from each other with
tional modifications are catalyzed by enzymes located in the respect to their actual nucleotide sequence, tRNAs as a class of
cytoplasm of the cell. The first methylation occurs in all mRNAs RNA molecules share several common structural features.
and consists of the addition of a methyl group to the 7-position Each tRNA contains information in the primary structure or
of the terminal guanine through the action of guanine-7-methyl- nucleotide sequence that dictates the higher-order structure of
transferase. Additional methylation reactions can occur involv- the molecule. The secondary structure of the tRNA resembles a
ing the additional bases at the 5′-terminus of the mRNA cloverleaf (97). The folding of the cloverleaf structure is main-
transcript, and less frequently, internal methylation of bases tained through intrastrand sequence complementarity and base-
within an mRNA molecule takes place. pairing interactions between nucleotides. In addition, each
The poly(A) tail possessed by most eukaryotic mRNAs is tRNA contains an ACC sequence at the 3′-terminus and an anti-
not encoded in the DNA; rather, it is added to the RNA in the codon loop (97). Amino acids are attached to their specific
nucleus after transcription of the structural gene is complete. tRNA through an ester bond to the 3′-hydroxyl group of the ter-
The addition of the poly(A) tail is catalyzed by the enzyme minal adenine of the ACC sequence. The anticodon loop recog-
poly(A) polymerase, which adds approx 200 adenosine nizes the triplet codon of the template mRNA during the
residues to the free 3′-OH terminus of the primary RNA tran- process of translation. With the exception of the codons encod-
script. The precise function of the poly(A) tail is unknown, but ing methionine and tryptophan, there are at least two possible
has been speculated to be involved in mRNA stability or in con- codons for each amino acid (Table 1). Nonetheless, each amino
trol of mRNA utilization (95). Although the removal of the acid has only one corresponding tRNA. Thus, the hydrogen-
poly(A) tail does precede degradation of certain mRNAs, a sys- bonding between nucleotides of the codon and anticodon often
tematic correlation between mRNA stability or survival, and involve “wobble” pairing (98). This form of base-pairing
the length or presence of the poly(A) tail has not been estab- allows mismatches in the third base of a codon triplet. In the
lished. Removal of the poly(A) tail can inhibit the initiation of overall structure–function relationship in tRNA, the nucleotide
translation in vitro, suggesting a potential role for this structure sequence of the anticodon loop dictates which amino acid will
in the control of mRNA translation. However, it is not clear be attached to the ACC sequence of the tRNA.
whether this effect is related to a direct influence of poly(A) Transfer RNAs serve as adapters between the mRNA tem-
structures on the initiation reaction or the result of some indi- plate and the amino acids of growing polypeptide chains during
rect cause. the process of protein translation (97); that is, the tRNA serves to
Some other forms of posttranscriptional RNA processing ferry the appropriate amino acid into the active site of the ribo-
occur with respect to a small subset of eukaryotic mRNAs. The some, where it becomes incorporated into the growing polypep-
process of RNA editing involves a posttranscriptional alteration tide being synthesized. Amino acids are coupled to their specific
of the informational content of a specific mRNA (91). The edit- tRNA through the action of enzymes called aminoacyl tRNA
ing of RNA is revealed when the linear sequence of the mRNA synthetases. The specificity of the “charging” reaction is critical
molecule differs from the coding sequence carried in the DNA. to the integrity of the translation process because the incorpora-
In mammalian cells, there are examples where substitution of a tion of amino acids at the level of the ribosome depends wholly
single base occurs in the mRNA, resulting in an alteration of an on the sequence of the anticodon portion of the tRNA molecule.
amino acid in the final protein product. Because no known tem- The charged tRNA the mRNA through the transient hybridiza-
plate source mediates the RNA editing reaction, the most likely tion of the codon and anti-codon RNA sequences in the ribosome
mechanism for mRNA editing would involve a specific enzyme complex as it moves along the mRNA. The entry of the charged
that can recognize the sequence or secondary structure of the tRNA into the active site of the ribosome brings its associated
specific target mRNA and catalyze the specific base substitu- amino acid into juxtaposition with the nascent polypeptide, facil-
tion. However, there are examples of RNA editing in some itating the formation of a peptide bond. In this manner, the tRNA
lower eukaryotes that utilize “guide RNA,” which directs the provides a link between the genetic information contained in the
RNA editing reaction (96). The final result of RNA editing is mRNA and the linear sequence of amino acids represented in the
the generation protein products representing more than one resulting polypeptide product.
polypeptide sequence from a single coding gene. The different 9.3. STRUCTURE AND FUNCTION OF RIBOSOMAL
protein products might possess different biological activities, RNA The ribosome is a nucleoprotein that serves as the pri-
suggesting that RNA editing might represent a mechanism for mary component of a cell’s protein synthesis machinery. The
controlling the functional expression of genes through a post- ribosome is a complex structure consisting of two subunit parti-
transcriptional process that does not impact the normal mecha- cle types (60S and 40S). The overall composition of the fully
nisms for controlling levels of gene expression. assembled ribosome includes at least 4 distinct rRNA molecules
9.2. STRUCTURE AND FUNCTION OF TRANSFER and nearly 100 specific protein subunits. The major rRNAs in
RNA Transfer RNAs are small molecules consisting of mammalian cells were named for their molecular size as deter-
approx 75–80 nucleotides. Like mRNAs, tRNA is generated mined by their sedimentation rates. Three of these rRNAs (5S,
02_Coleman 6/6/05 6:50 PM Page 20
Table 2
Secondary and Tertiary Structural Motifs in RNA
Structure Description Functions
Hairpin loops Single-stranded loop that bridges one end of a Component of more complex RNA structures;
double-stranded stem May serve as a nucleation site for RNA folding, or
recognition sites for protein–RNA interaction
Internal loops Interruptions in double-stranded RNA caused by Protein-binding sites and ribozyme cleavage sites
the presence of nucleotides on both strands that
cannot participate in Watson–Crick base-pairing
Bulges Double-stranded RNA molecules with unpaired Contribute to the formation of more complex RNA
nucleotides on only one strand structures; recognition sites for protein–RNA
interactions.
Nucleotide triples Triple helical structures that form through hydrogen Orient regions of secondary structure in large RNA
bonding between nucleotides of a single-stranded molecules and stabilize three-dimensional RNA
RNA molecule and nucleotides within a double- structures.
stranded RNA molecule; hydrogen bonds can
involve nucleotide bases, sugars, or phosphate groups
Pseudoknots Results from base-pairing between nucleotide RNA self-splicing, autoregulation of translational
sequences within an RNA loop structure and a processes, and ribosome frameshifting
complementary nucleotide sequence outside the
RNA loop
5.8S, and 28S) are components of 60S ribosomal particle. The can also serve as recognition sites for protein–RNA interaction.
smaller 40S ribosomal particle contains a single 18S rRNA. The Nucleotide triples occur when single-stranded RNA sequences
5.8S, 18S, and 28S rRNAs are the products of the processing of form hydrogen bonds with nucleotides that are already base-
a single 45S precursor RNA molecule. The 5S rRNA is independ- paired. These interactions serve to stabilize three-dimensional
ently transcribed and processed. The rRNAs assemble with ribo- RNA structures and to orient regions of RNA secondary struc-
somal protein subunits in the nucleus. The precise role of rRNA ture in large RNA molecules. Pseudoknots are tertiary struc-
in the function of the ribosome is not completely understood (99). tural elements that result from base-pairing between nucleotide
However, it is recognized that interactions between the rRNAs of sequences contained within a loop structure and sequences out-
the ribosomal subunits might be important in the overall structure side the loop structure. These are important in RNA self-splicing,
of the functioning ribosomal particle. In addition, rRNA translational autoregulation, and ribosomal frameshifting.
sequences can interact with ribosome-binding sequences of
mRNA during the initiation of translation. Likewise, it is likely 10. DNA DAMAGE, MUTAGENESIS, AND THE
that rRNAs bind to invariant tRNA sequences when these mole- CONSEQUENCES OF MUTATION
cules enter the active site of the ribosome (99). DNA damage can result from spontaneous alteration of the
9.4. SPECIAL RNA STRUCTURES Higher-order RNA DNA molecule or from the interaction of numerous chemical
structures exhibit hydrogen-bonding between A:U and G:C and physical agents with the structural DNA molecule (101).
basepairs. Several specific higher-order RNA structures have Spontaneous lesions can occur during normal cellular
been recognized (Table 2) and characterized in detail (100). processes, such as DNA replication, DNA repair, or gene
Hairpin loops consist of a double-stranded stem and a single- rearrangement (78), or through chemical alteration of the DNA
stranded loop that bridges one end of the stem. These structures molecule itself as a result of hydrolysis, oxidation, or methyla-
are essential components of more complex RNA structures and tion (102,103). In most cases, DNA lesions create nucleotide
probably serve as nucleation sites for RNA folding. Loops can mismatches that lead to point mutations. Nucleotide mismatches
also function as recognition sites for protein–RNA interactions. can result from the formation of apurinic or apyrimidinic sites
Internal loops represent interruptions in double-stranded RNA following depurination or depyrimidination reactions (103),
caused by the presence of nucleotides on both strands that can- nucleotide conversions involving deamination reactions (78),
not participate in Watson–Crick base-pairing. Several impor- or, in rare instances, from the presence of a tautomeric form
tant functions are associated with internal loops, including of an individual nucleotide in replicating DNA. Deamination
protein-binding sites and ribozyme cleavage sites. In many reactions can result in the conversion of cytosine to uracil,
cases, internal loops have been shown to represent highly adenine to hypoxanthine, and guanine to xanthine (78).
ordered structures maintained by the formation of non- However, the most common nucleotide deamination reaction
Watson–Crick basepairs. Bulges are structural motifs contained involves methylated cytosines, which can replace cytosine
within double-stranded RNA molecules with unpaired in the linear sequence of a DNA molecule in the form of
nucleotides on only one strand. RNA bulges contribute to the 5-methylcytosine. The 5-methylcytosine residues are always
formation of more complex, higher-order RNA structures and located next to guanine residues on the same chain, a motif
02_Coleman 6/6/05 6:50 PM Page 21
Table 3
Forms and Consequences of Molecular Mutation
translational stop codon. This results in the premature termina- 7. Franklin, R. E. and Gosling, R. G. Evidence for 2-chain helix in crys-
tion of translation and the production of a truncated protein talline structure of sodium deoxyribonucleate. Nature 172:156–157,
1953.
product. Small deletions and insertions can usually be classi- 8. Jacobson, B. Hydration structure of deoxyribonucleic acid and its
fied as frameshift mutations because the deletion or insertion of physicochemical properties. Nature 172:666–667, 1953.
a single nucleotide (for instance) alters the reading frame of the 9. Wilkins, M. H., Seeds, W. E., Stokes, A. R., and Wilson, H. R.
gene on the 3′ side of the affected site. This results in the syn- Helical structure of crystalline deoxypentose nucleic acid. Nature
thesis of a polypeptide product that might bear no resemblance 172:759–762, 1953.
10. Watson, J. D. and Crick, F. H. Genetical implications of the struc-
to the normal gene product (Table 3). In addition, small inser- ture of deoxyribonucleic acid. Nature 171:964–967, 1953.
tions or deletions can result in the premature termination of 11. Brenner, S., Jacob, F., and Meselson, M. An unstable intermediate
translation resulting from the presence of a stop codon in the carrying information from genes to ribosomes for protein synthesis.
new reading frame of the mutated gene. Deletions or insertions Nature 190:576–581, 1961.
12. Dounce, A. L. Duplicating mechanism for peptide chain and nucleic
that occur involving multiples of three nucleotides will not
acid synthesis. Enzymologia 15:251–258, 1952.
result in a frameshift mutation, but will alter the resulting 13. Lehman, I. R., Bessman, M. J., Simms, E. S., and Kornberg, A.
polypeptide gene product, which will exhibit either loss of spe- Enzymatic synthesis of deoxyribonucleic acid. I. Preparation of
cific amino acids or the presence of additional amino acids substrates and partial purification of an enzyme from Escherichia
within its primary structure. These types of alteration can also coli. J. Biol. Chem. 233:163–170, 1958.
14. Kornberg, A. Biologic synthesis of deoxyribonucleic acid. Science
lead to a loss of protein function. 131:1503–1508, 1960.
15. Nirenberg, M. W. and Matthaei, J. H. The dependence of cell-free
REFERENCES protein synthesis in E. coli upon naturally occurring or synthetic
1. Avery, O. T., MacLeod, C. M., and McCarty, M. Studies on the polyribonucleotides. Proc. Natl. Acad. Sci. USA 47:1588–1602,
chemical nature of the substance inducing transformation of 1961.
Pneumococcus types. Induction of transformation by a desoxyri- 16. Matthaei, H. and Nirenberg, M. W. The dependence of cell-free pro-
bonucleic acid fraction isolated from Pneumococcus Type III. tein synthesis in E. coli upon RNA prepared from ribosomes.
J. Exp. Med. 79:137–158, 1944. Biochem. Biophys. Res. Commun. 4:404–408, 1961.
2. McCarty, M. and Avery, O. T. Studies of the chemical nature of the 17. Kornberg, R. D. and Lorch, Y. Chromatin structure and transcrip-
substance inducing transformation of pneumococcal types II. Effect tion. Annu. Rev. Cell Biol. 8:563–587, 1992.
of desoxyribonulcease on the biological activity of the transforming 18. Kozak, M. Regulation of translation in eukaryotic systems. Annu.
substance. J. Exp. Med. 83:89–96, 1946. Rev. Cell Biol. 8:197–225, 1992.
3. Watson, J. D. and Crick F. H. Molecular structure of nucleic acids; a 19. Coverley, D. and Laskey, R. A. Regulation of eukaryotic DNA repli-
structure for deoxyribose nucleic acid. Nature 171:737–738, 1953. cation. Annu. Rev. Biochem. 63:745–776, 1994.
4. Wilkins, M. H., Stokes, A. R., and Wilson, H. R. Molecular struc- 20. Crick, F. H. C. On protein synthesis. Symp. Soc. Exp. Biol.
ture of deoxypentose nucleic acids. Nature 171:738–740, 1953. 12:548–555, 1958.
5. Franklin, R. E. and Gosling, R. G. Molecular structure of nucleic 21. Crick, F. Central dogma of molecular biology. Nature 227:561–563,
acids. Molecular configuration in sodium thymonucleate. 1953. 1970.
Ann. NY Acad. Sci. 758:16–17, 1995. 22. Chargaff, E., Vischer, E., Doniger, R., Green, C., and Misani, F. The
6. Watson, J. D. and Crick, F. H. The structure of DNA. Cold Spring composition of the desoxypentose nucleic acids of thymus and
Harbor Symp. Quant. Biol. 18:123–131, 1953. spleen. J. Biol. Chem. 177:405–416, 1949.
02_Coleman 6/6/05 6:50 PM Page 23
23. Chargaff, E. Structure and function of nucleic acids as cell con- 50. Venter, J. C., Smith, H. O., and Hood, L. A new strategy for genome
stituents. Fed. Proc. 10:654–659, 1951. sequencing. Nature 381:364–366, 1996.
24. Crick, F. H. C. and Watson, J. D. The complementary structure of 51. Venter, J. C., Adams, M. D., Sutton, G. G., et al. Shotgun sequenc-
deoxyribonucleic acid. Proc. R. Soc. A 223:80–96, 1954. ing of the human genome. Science 280:1540–1542, 1998.
25. Hunter, W. N., Brown, T., Anand, N. N., and Kennard. O. Structure 52. Lander, E. S., Linton, L. M., Birren, B., et al. Initial sequencing and
of an adenine–cytosine base pair in DNA and its implications for analysis of the human genome. Nature 409:860–921, 2001.
mismatch repair. Nature 320:552–555, 1986. 53. Venter, J. C., Adams, M. D., Myers, E. W., et al. The sequence of the
26. Liu, K., Miles, H. T., Frazier, J., and Sasisekharan, V. A novel DNA human genome. Science 291:1304–1351, 2001.
duplex. A parallel-stranded DNA helix with Hoogsteen base pair- 54. Nowak, R. Mining treasures from ‘junk DNA.’ Science 263:608–610,
ing. Biochemistry 32:11,802–11,809, 1983. 1994.
27. Dickerson, R. E., Drew, H. R., Conner, B. N., Wing, R. M., Fratini, 55. Antequera, F. and Bird, A. Predicting the total number of human
A. V., and Kopka, M. L. The anatomy of A-, B-, and Z-DNA. genes. Nat. Genet. 8:114, 1994.
Science 216:475–485, 1982. 56. Fields, C., Adams, M. D., White, O., and Venter, J. C. How many
28. Collins, F. S. The human genome project and the future of medicine. genes in the human genome? Nat. Genet. 7:345–346, 1994.
Ann. NY Acad. Sci. 882:42–55, 1999; discussion 56–65. 57. Ewing, B. and Green, P. Analysis of expressed sequence tags indi-
29. Futreal, P. A., Kasprzyk, A., Birney, E., Mullikin, J. C., Wooster, R., cates 35,000 human genes. Nat. Genet. 25:232–234, 2000.
and Stratton, M. R. Cancer and genomics. Nature 409:850–852, 2001. 58. Roest Crollius, H., Jaillon, O., Bernot, A. Estimate of human gene
30. Jimenez-Sanchez, G., Childs, B., and Valle, D. Human disease number provided by genome-wide analysis using Tetraodon
genes. Nature 409:853–855, 2001. nigroviridis DNA sequence. Nat. Genet. 25:235–238, 2000.
31. Sanger, F. and Coulson, A. R. A rapid method for determining 59. Sachidanandam, R., Weissman, D., Schmidt, S. C., et al. A map of
sequences in DNA by primed synthesis with DNA polymerase. human genome sequence variation containing 1.42 million single
J. Mol. Biol. 94:441–448, 1975. nucleotide polymorphisms. Nature 409:928–933, 2001.
32. Sanger, F., Nicklen, S., and Coulson, A. R. DNA sequencing with 60. Collins, F. S., Green, E. D., Guttmacher, A. E., and Guyer, M. S. A
chain-terminating inhibitors. Proc. Natl. Acad. Sci. USA 74:5463– vision for the future of genomics research. Nature 422:835–847,
5467, 1977. 2003.
33. Maxam, A. M. and Gilbert, W. A new method for sequencing DNA. 61. Collins, F. S., Morgan, M. and Patrinos, A. The Human Genome
Proc. Natl. Acad. Sci. USA 74:560–564, 1977. Project: lessons from large-scale biology. Science 300:286–290,
34. Smith, L. M., Sanders, J. Z., Kaiser, R. J., et al. Fluorescence detec- 2003.
tion in automated DNA sequence analysis. Nature 321:674–679, 62. Kavenoff, R., Klotz, L. C., and Zimm, B. H. One the nature of chro-
1986. mosome-sized DNA molecules. Cold Spring Harbor Symp. Quant.
35. Ansorge, W., Sproat, B. S., Stegemann, J., and Schwager, C. A non- Biol. 38:1–8, 1974.
radioactive automated method for DNA sequence determination. 63. Spector, D. L. The dynamics of chromosome organization and gene
J. Biochem. Biophys. Methods 13:315–323, 1986. regulation. Annu. Rev. Biochem. 72:573–608, 2003.
36. Collins, F. and Galas, D. A new five-year plan for the U.S. Human 64. Mattick, J. S. Non-coding RNAs: the architects of eukaryotic com-
Genome Project. Science 262:43–46, 1993. plexity. EMBO Rep. 2:986–991, 2001.
37. Collins, F. S., Patrinos, A., Jordan, E., Chakravarti, A., Gesteland, 65. Small, D., Nelkin, B., and Vogelstein, B. Nonrandom distribution
R., and Walters, L. New goals for the U.S. Human Genome Project: of repeated DNA sequences with respect to supercoiled loops
1998–2003. Science 282:682–689, 1998. and the nuclear matrix. Proc. Natl. Acad. Sci. USA 79:5911–5915,
38. Hudson, T. J., Stein, L. D., Gerety, S. S., et al. An STS-based map 1982.
of the human genome. Science 270:1945–1954, 1995. 66. Tsongalis, G. J., Coleman, W. B., Smith, G. J., and Kaufman, D. G.
39. Berry, R., Stevens, T. J., Walter, N. A., et al. Gene-based sequence- Partial characterization of nuclear matrix attachment regions from
tagged-sites (STSs) as the basis for a human gene map. Nat. Genet. human fibroblast DNA using Alu-polymerase chain reaction.
10:415–423, 1995. Cancer Res. 52:3807–3810, 1992.
40. Gyapay, G., Schmitt, K., Fizames, C., et al. A radiation hybrid map 67. Jelinek, W. R. and Schmid, C. W. Repetitive sequences in eukary-
of the human genome. Hum. Mol. Genet. 5:339–346, 1996. otic DNA and their expression. Annu. Rev. Biochem. 51:813–844,
41. Cheung, V. G., Nowak, N., Jang, W., et al. Integration of cytogenetic 1982.
landmarks into the draft sequence of the human genome. Nature 68. Schmid, C. W. and Jelinek, W. R. The Alu family of dispersed repet-
409:953–958, 2001. itive sequences. Science 216:1065–1070, 1958.
42. Olivier, M., Aggarwal, A., Allen, J., et al. A high-resolution radia- 69. Meselson, M. and Stahl, F. W. The replication of DNA. Cold Spring
tion hybrid map of the human genome draft sequence. Science Harbor Symp. Quant. Biol. 23:9–12, 1958.
291:1298–1302, 2001. 70. Meselson, M. and Stahl, F. W. The replication of DNA in
43. McPherson, J. D., Marra, M., Hillier, L., et al. A physical map of the Escherichia coli. Proc. Natl. Acad. Sci. USA 44:671–682, 1958.
human genome. Nature 409:934–941, 2001. 71. Franceschini, P. Semiconservative DNA duplication in human chro-
44. Adams, M. D., Dubnick, M., Kerlavage, A. R., et al. Sequence iden- mosomes treated with BUDR and stained with acridine orange. Exp.
tification of 2,375 human brain genes. Nature 355:632–634, 1992. Cell Res. 89:420–421, 1974.
45. Adams, M. D., Soares, M. B., Kerlavage, A. R., Fields, C., and 72. Rude, J. M. and Friedberg, E. C. Semi-conservative deoxyribonu-
Venter, J. C. Rapid cDNA sequencing (expressed sequence tags) cleic acid synthesis in unirradiated and ultraviolet-irradiated xero-
from a directionally cloned human infant brain cDNA library. Nat. derma pigmentosum and normal human skin fibroblasts. Mutat.
Genet. 4:373–380, 1993. Res. 42:433–442, 1977.
46. Adams, M. D., Kerlavage, A. R., Fields, C., and Venter, J. C. 3,400 73. West, S. C. Enzymes and molecular mechanisms of genetic recom-
new expressed sequence tags identify diversity of transcripts in bination. Annu. Rev. Biochem. 61:603–640, 1992.
human brain. Nat. Genet. 4:256–267, 1993. 74. Alberts, B. DNA replication and recombination. Nature 421:431–435,
47. Schuler, G. D., Boguski, M. S., Stewart, E. A., et al. A gene map of 2003.
the human genome. Science 274:540–546, 1996. 75. Friedberg, E. C. Biological responses to DNA damage: a perspec-
48. Caron, H., van Schaik, B., van der Mee, M., et al. The human tran- tive in the new millennium. Cold Spring Harb. Symp. Quant. Biol.
scriptome map: clustering of highly expressed genes in chromoso- 65:593–602, 2000.
mal domains. Science 291:1289–1292, 2001. 76. Friedberg, E. C. DNA damage and repair. Nature 421:436–440,
49. Venter, J. C., Adams, M. D., Martin-Gallardo, A., McCombie, W. 2003.
R., and Fields, C. Genome sequence analysis: scientific objectives 77. Sancar, A. and Sancar G. B. (1988) DNA repair enzymes. Annu.
and practical strategies. Trends Biotechnol. 10:8–11, 1992. Rev. Biochem, 57:29–67.
02_Coleman 6/6/05 6:50 PM Page 24
78. Friedberg, E. C., Walker, G. C., and Siede, W. DNA Repair and 95. Darnell, J. E., Philipson, L., Wall, R., and Adesnik, M. Polyadenylic
Mutagenesis, ASM, Washington, DC, pp. xvii, 1995. acid sequences: role in conversion of nuclear RNA into messenger
79. Lewin, B. Units of transcription and translation: sequence compo- RNA. Science 174:507–510, 1971.
nents of heterogeneous nuclear RNA and messenger RNA. Cell 96. Blum, B., Bakalara, N., and Simpson, L. A model for RNA editing
4:77–93, 1975. in kinetoplastid mitochondria: “guide” RNA molecules transcribed
80. Oren, M. The p53 cellular tumor antigen: gene structure, expression from maxicircle DNA provide the edited information. Cell
and protein properties. Biochim. Biophys. Acta 823:67–78, 1985. 60:189–198, 1990.
81. Hollstein, M., Hergenhahn, M., Yang, Q., Bartsch, H., Wang, Z. Q., 97. Schimmel, P. and Ribas de Pouplana, L. Transfer RNA: from mini-
and Hainaut, P. New approaches to understanding p53 gene tumor helix to genetic code. Cell 81:983–986, 1995.
mutation spectra. Mutat. Res. 431:199–209, 1999. 98. Crick, F. H. Codon–anticodon pairing: the wobble hypothesis. J.
82. Bennett, W. P., Hussain, S. P., Vahakangas, K. H., Khan, M. A., Mol. Biol. 19:548–555, 1966.
Shields, P. G., and Harris, C. C. Molecular epidemiology of human 99. Noller, H. F. Structure of ribosomal RNA. Annu. Rev. Biochem.
cancer risk: gene–environment interactions and p53 mutation spec- 53:119–162, 1984.
trum in human lung cancer. J. Pathol. 187:8–18, 1999. 100. Shen, L. X., Cai, Z., and Tinoco, I., Jr. RNA structure at high reso-
83. Bookstein, R., Lee, E. Y., To, H., et al. Human retinoblastoma sus- lution. FASEB J. 9:1023–1033, 1995.
ceptibility gene: genomic organization and analysis of heterozygous 101. Drake, J. W. and Baltz, R. H. The biochemistry of mutagenesis.
intragenic deletion mutants. Proc. Natl. Acad. Sci. USA 85:2210– Annu. Rev. Biochem. 45:11–37, 1976.
2214, 1988. 102. Lindahl, T. Instability and decay of the primary structure of DNA.
84. Lee, W. H., Bookstein, R., Hong, F., Young, L. J., Shew, J. Y., and Nature 362:709–715, 1993.
Lee, E. Y. Human retinoblastoma susceptibility gene: cloning, iden- 103. Ames, B. N., Shigenaga, M. K., and Gold, L. S. DNA lesions,
tification, and sequence. Science 235:1394–1399, 1987. inducible DNA repair, and cell division: three key factors in muta-
85. Hong, F. D., Huang, H. J., To, H., et al. Structure of the human genesis and carcinogenesis. Environ. Health Perspect. 101(Suppl
retinoblastoma gene. Proc. Natl. Acad. Sci. USA 86:5502–5506, 1989. 5):35–44, 1993.
86. Cech, T. R. Self-splicing of group I introns. Annu. Rev. Biochem. 104. Cooper, D. N. and Youssoufian, H. The CpG dinucleotide and
59:543–568, 1990. human genetic disease. Hum. Genet. 78:151–155, 1988.
87. Jurica, M. S. and Moore, M. J. Pre-mRNA splicing: awash in a sea 105. Rideout, W. M., 3rd, Coetzee, G. A., Olumi, A. F., and Jones,
of proteins. Mol. Cell 12:5–14, 2003. P. A. 5-Methylcytosine as an endogenous mutagen in the
88. Black, D. L. Mechanisms of alternative pre-messenger RNA splic- human LDL receptor and p53 genes. Science 249:1288–1290,
ing. Annu. Rev. Biochem. 72:291–336, 2003. 1990.
89. Padgett, R. A., Grabowski, P. J., Konarska, M. M., Seiler, S., and 106. Jones, P. A., Buckley, J. D., Henderson, B. E., Ross, R. K., and Pike,
Sharp, P. A. Splicing of messenger RNA precursors. Annu. Rev. M. C. From gene to carcinogen: a rapidly evolving field in molecu-
Biochem. 55:1119–1150, 1986. lar epidemiology. Cancer Res. 51:3617–3620, 1991.
90. Sharp, P. A. Splicing of messenger RNA precursors. Science 107. Bishop, J. M. Molecular themes in oncogenesis. Cell 64:235–248,
235:766–771, 1987. 1991.
91. Powell, L. M., Wallis, S. C., Pease, R. J., Edwards, Y. H., Knott, T. 108. Coleman, W. B. and Tsongalis, G. J. Multiple mechanisms account
J., and Scott, J. A novel form of tissue-specific RNA processing pro- for genomic instability and molecular mutation in neoplastic trans-
duces apolipoprotein-B48 in intestine. Cell 50:831–840, 1987. formation. Clin. Chem. 41:644–657, 1995.
92. Sollner-Webb, B. RNA editing. Curr. Opin. Cell Biol. 3:1056–1061, 109. Coleman, W. B. and Tsongalis, G. J. The role of genomic instabil-
1991. ity in human carcinogenesis. Anticancer Res. 19:4645–4664,
93. Landweber, L. F. and Gilbert, W. RNA editing as a source of genetic 1999.
variation. Nature 363:179–182, 1993. 110. Coleman, W. B. and Tsongalis, G. J. The role of genomic instability
94. Mattaj, I. W., Tollervey, D., and Seraphin, B. Small nuclear RNAs in the development of human cancer, in The Molecular Basis of
in messenger RNA and ribosomal RNA processing. FASEB J. Human Cancer, Coleman, W. B. and Tsongalis, G. J., eds., Humana,
7:47–53, 1993. Totowa, NJ, pp. 115–142, 2002.