Professional Documents
Culture Documents
We are now ready to examine the major kinds of biological macromolecules. In each
case, we will focus first on the chemical nature of the monomeric components and then on
the synthesis and properties of the polymer itself. As we will see shortly, most biological
macromolecules in cells are synthesized from about 30 common small molecules. Our
survey begins with proteins because they play such important and widespread roles in
cellular structure and function. We then move on to nucleic acids and polysaccharides.
The tour concludes with lipids, which do not quite fit the definition of a polymer but are
important cellular components whose synthesis resembles that of true polymers.
66
Kind of Molecules Number Present Names of Molecules Role in Cell Figure Number for Structures
Amino acids 20 See list in Table 3-2 Monomeric units of all proteins 3-2
Aromatic bases 5 Adenine Components of nucleic acids 3-15
Cytosine
Guanine
Thymine
Uracil
Sugars varies Ribose Component of RNA 3-15
Deoxyribose Component of DNA
Glucose Energy metabolism; component 3-24
of starch and glycogen
Lipids varies Fatty acids Components of phospholipids 3-27a
and membranes
Cholesterol 3-27e
Source: Adapted from Wald, G. The origins of life. Proc. Natl. Acad. Sci. USA 52 (1994): 595.
3.1 Proteins in Table 3-1. Some proteins contain more than 20 different
Chapter 3
kinds of amino acids, but the additional ones usually are a
Chemistry
Review–
Proteins are a class of extremely important and ubiquitous result of modifications that occur after the protein has been
Proteins: macromolecules in all organisms, occurring nearly everywhere synthesized. Although most proteins contain all or most of
Functions of
Proteins in the cell. In fact, their importance is implied by their name, the 20 amino acids, the proportions vary greatly, and no two
H H H H H
O O O O O
H3N+ C C H3N+ C C H3N+ C C H3N+ C C H3N+ C C
O- O- O- O- O-
H CH3 CH CH2 H3C CH
CH3 CH3 CH CH2
CH3 CH3 CH3
Glycine Alanine Valine Leucine Isoleucine
H H H H
O O O O
+ + + +
H3N C C H3N C C H3N C C H2N C C
O- O- O- O-
CH2 CH2 CH2 H2C CH2
CH2 CH2
NH
S
CH3
H H H H H H
O O O O O O
H3N+ C C H3N+ C C H3N+ C C H3N+ C C H3N+ C C H3N+ C C
O- O- O- O- O- O-
CH2 CH CH2 CH2 CH2 CH2
OH OH CH3 SH C CH2
NH2 O
C
NH2 O
OH
Serine Threonine Cysteine Tyrosine Asparagine Glutamine
Acidic Basic
H H H H H
O O O O O
H3N+ C C H3N+ C C H3N+ C C H3N+ C C H3N+ C C
O- O- O- O- O-
CH2 CH2 CH2 CH2 CH2
NH Chemistry
C CH2 CH2 CH2 Review–
-O Organic
O Molecules:
C CH2 CH2 NH+ Functional
-O
O Groups
CH2 NH Amino Acid
Functional
NH3+ C NH2+ Groups
NH2
Aspartate Glutamate Lysine Arginine Histidine
Figure 3-2 The Structures of the 20 Amino Acids Found in Proteins. All amino acids have a carboxyl group and
an amino group attached to the central (a) carbon, but each has its own distinctive R group (shaded boxes). Those in
Group A have nonpolar, hydrocarbon R groups and are therefore hydrophobic. The others are hydrophilic, either because
68 the R group is polar (Group B) or because the R group is acidic or basic and thus is charged at cellular pH (Group C).
Three-Letter One-Letter
R1 O R2 O
Amino Acid Abbreviation Abbreviation
Alanine Ala A H3N+ CH C O- + H3N+ CH C O-
Arginine Arg R Amino acid 1 Amino acid 2
Asparagine Asn N R1 O R2 O
Aspartate Asp D
Cysteine Cys C H3N+ CH C N CH C O- + H2O
Glutamate Glu E H
Glutamine Gln Q Peptide
Glycine Gly G
Histidine His H As each new peptide bond is formed by condensation, the
Isoleucine Ile I growing chain of amino acids is lengthened by one amino
Leucine Leu L acid. Peptide bond formation is illustrated schematically in
Lysine Lys K
Figure 3-3 using ball-and-stick models of the amino acids
glycine and alanine. Peptide bonds have a partial double-bond
Methionine Met M
character, and thus the six nearest atoms are nearly planar
Phenylalanine Phe F
(see the shaded rectangle in Figure 3-3).
Proline Pro P Notice that the chain of amino acids formed in this way
Serine Ser S has an intrinsic directionality because it always has an ami-
Chapter 3
Threonine Thr T no group at one end and a carboxyl group at the other end.
Tryptophan Trp W The end of the chain with the amino group is known as the
Tyrosine Tyr Y
N- (or amino) terminus, and the end with the carboxyl
HC CH2 SH + HS CH2 CH
2H
HC CH2 S S CH2 CH
a subunit
Chapter 3
projected to result in ~$1.2 trillion in annual payouts to health-
care and health-related services in the United States alone.
The symptoms of Alzheimer’s are caused by the degenera-
tion of brain cells due to excessive association of proteins both
71
Chapter 3
along the chain. Tertiary structure results from long-distance to the C-terminus of the polypeptide, which is also the direc-
interactions between stretches of amino acid residues from dif- tion in which the polypeptide is synthesized.
ferent parts of a polypeptide molecule. Quaternary structure The first protein to have its complete amino acid sequence
describes the interaction of two or more individual folded poly- determined was the hormone insulin. This important techni-
N
R (d) Quaternary structure. The
H quaternary structure describes
C
H the association of two or more
N
polypeptides as they interact
C
Chemistry R to form a functional multimeric
Review - C protein.
Proteins: H
Levels of O
Structure
C
Activity:
Protein
Structure
Levels of
Structure
in
Proteins
Figure 3-7 The Structure of Insulin. Insulin consists of two polypeptides, called the A and B subunits,
which are covalently linked by two disulfide bonds. (For abbreviations of amino acids, see Table 3-2, page 69.)
and 30 amino acid residues, respectively. Figure 3-7 shows The a helix structure was proposed in 1951 by Linus Paul-
the structure of insulin, illustrating the primary sequence of ing and Robert Corey. As shown in Figure 3-8a, an a helix is spi- Chemistry
each subunit in sequence from its N-terminus (left) to its C- ral in shape, consisting of a backbone of amino acids linked by Review–
Proteins:
terminus (right). Notice also the covalent disulfide (—S—S—) peptide bonds with the specific R groups of the individual amino Secondary
Structure
bond between two cysteine residues within the A chain and acid residues sticking out from it. In the a helix there are 3.6
the two disulfide bonds linking the A and B chains. Disulfide amino acids per turn, bringing the peptide bonds of every fourth
bonds play an important role in stabilizing the tertiary struc- amino acid in close proximity. The distance between these pep-
ture of many proteins. tide bonds is, in fact, just right for the formation of a hydrogen
Sanger’s techniques paved the way for the sequencing of bond between the NH group adjacent to one peptide bond and
hundreds of other proteins and led ultimately to the design the CO group adjacent to the other, as shown in Figure 3-8a.
of machines that can determine an amino acid sequence au- As a result, every peptide bond in the helix is hydrogen-
tomatically. However, with current technology, it is much bonded through its CO group to the peptide bond immediately
easier to purify a DNA molecule and determine its nucleotide “below” it in the spiral and through its NH group to the peptide
sequence than it is to purify a protein and analyze its amino bond just “above” it, even though the amino acid residues in-
acid sequence. Once a DNA nucleotide sequence has been deter- volved are not directly adjacent. These hydrogen bonds are all
mined, the amino acid sequence of the polypeptide encoded by nearly parallel to the main axis of the helix and therefore tend to
that DNA segment can be easily inferred using the genetic code. stabilize the spiral structure by holding successive turns of the he-
Computerized data banks are now available that contain thou- lix together. In addition, two or more a helices can coil together in
sands of polypeptide sequences, making it easy to compare se- a rope-like fashion to form a bundle of a helices called a coiled coil,
quences and look for regions of similarity between polypeptides. as we will see shortly in the keratin protein that makes up hair.
The primary structure of a protein is important both ge- Another form of common secondary structure in pro-
netically and structurally. Genetically, it is significant because teins is the b sheet, also initially proposed by Pauling and
the amino acid sequence of the polypeptide is determined Corey. As shown in Figure 3-8b, this structure is an extended
by the order of nucleotides in the corresponding messenger sheetlike conformation with successive atoms in the polypep-
RNA. The messenger RNA in turn reflects DNA sequences tide chain located at the “peaks” and “troughs” of the pleats.
in the gene that encodes the protein. Therefore, the primary The R groups of successive amino acids stick out on alternat-
structure of a protein is the result of the order of nucleotides ing sides of the sheet. Because the carbon atoms that make up
in the DNA of the gene. the backbone of the polypeptide chain are successively located
Of more immediate significance are the implications of the a little above and a little below the plane of the b sheet, such
primary structure for higher levels of protein structure. In essence, structures are sometimes called b-pleated sheets.
all three higher levels of protein organization are direct conse- Like the a helix, the b sheet is characterized by a maxi-
quences of the primary structure. Although protein denaturation mum of hydrogen bonding. In both cases, all of the CO
by heating unfolds a polypeptide and eliminates all but the pri- groups and NH groups adjacent to the peptide bonds are in-
mary structure, the information in the primary sequence specifies volved. However, hydrogen bonding in an a helix is invariably
these higher levels of structure, and often the protein can refold intramolecular (within the same polypeptide), whereas hydro-
into its native conformation, as we saw for ribonuclease (see Fig- gen bonding in the b sheet can be either intramolecular (be-
ure 2-17). Similarly, if synthetic polypeptides are made that cor- tween two segments of the same polypeptide) or intermolecular
respond in sequence to the a and b subunits of hemoglobin, they (linking two different polypeptides). The protein regions that
will assume the native three-dimensional conformations of these form b sheets can interact with each other in two different ways.
subunits and will then interact spontaneously to form the native If the two interacting regions run in the same N-terminus-to-C-
a2b2 tetramer that we recognize as hemoglobin (see Figure 3-4). terminus direction, the structure is called a parallel b sheet; if
the two strands run in opposite N-terminus-to-C-terminus di-
Secondary Structure. The secondary structure of a rections, the structure is called an antiparallel b sheet. In some
protein describes local regions of structure that result from proteins, several antiparallel b sheets associate symmetrically
hydrogen bonding between NH and CO groups along the poly- around a central axis in a structure known as a b propeller.
peptide backbone. These local interactions result in two major Whether a specific segment of a polypeptide will form an
structural patterns, referred to as the a helix and b sheet a helix, a b sheet, or neither depends on the amino acids pres-
74 conformations (Figure 3-8). ent in that segment. For example, leucine, methionine, and
Chapter 3
R H bonds in the adjacent polypeptide
C O C regions.
H N
O N H O C
H
C Hydrogen bonds
C R H
Figure 3-8 The A Helix and B Sheet. The a helix shown in (a) and the b sheet shown in (b) are both
stabilized by hydrogen bonds (blue dots), either within a local region of primary sequence (a helix) or between
two separate regions (b sheet).
glutamate are strong “a helix formers” and are commonly secondary structure, called motifs, consist of small seg-
found in a-helical regions. Isoleucine, valine, and phenyl- ments of an a helix and/or a b sheet connected to each other
alanine are strong “b-sheet formers” and are often found in by looped regions of varying length. Among the most com-
b-sheet regions. Proline is considered a “helix breaker” be- monly encountered motifs are the b-a-b motif shown in
cause its R group is covalently bonded to its amino nitrogen, Figure 3-9a and the hairpin loop and helix-turn-helix motifs
which therefore lacks the hydrogen atom needed for hydrogen depicted in Figure 3-9, parts b and c, respectively. When the
bonding. Proline is rarely found in an a helix and, when pres- same motif is present in different proteins, it usually serves
ent, introduces a bend in the helix. the same purpose in each. (For example, the helix-turn-helix
To depict localized regions of structure within a pro- motif is one of several secondary structure motifs that are
tein, biochemists have adopted the conventions shown in characteristic of the DNA-binding proteins we will encoun-
Figure 3-9. An a-helical region is represented as either a ter in Chapter 20 when we consider the regulation of gene
spiral or a cylinder, whereas a b-sheet region is drawn as a expression.)
flat ribbon or arrow with the arrowhead pointing in the di-
rection of the C-terminus. Depending on their relative orien- Tertiary Structure. The tertiary structure of a protein can
tation, b sheets can be parallel (Figure 3-9a) or antiparallel probably be best understood by contrasting it with the second- Chemistry
(Figure 3-9b). A looped segment that connects a-helical and/ ary structure (Figure 3-6b, c). Secondary structure is a predict- Review–
Proteins:
or b-sheet regions is called a random coil that is intrinsically able, repeating conformational pattern that derives from the Tertiary
Structure
disordered. This segment has no defined secondary structure repetitive nature of the polypeptide because it involves hydro-
and is depicted as a narrow cord. gen bonding between NH and CO groups adjacent to peptide
Certain combinations of a helices and b sheets have bonds—the common structural elements along every poly-
been identified in many proteins. These combinations of peptide chain. If proteins contained only one or a few kinds of 75
Chapter 3
proteins usually have multiple domains. The portions of the
immunoglobulin and hexokinase molecules shown in Figure
3-13, parts b and c, are, in fact, specific domains of these
Disulfide
bond
40
95
110 26
65 84
72
N-terminus
a helix
(spiral)
C-terminus
Chemistry
Review– Figure 3-12 The Three-Dimensional Structure of Ribonuclease. Ribonuclease is a monomeric globular
Proteins:
Models of
protein with significant a-helical and b-sheet regions connected by random coils. Its tertiary structure can be
Proteins represented by either (a) a ball-and-stick model or (b) a spiral-and-ribbon model.
E6V) causes enough of a difference in the tertiary structure of exactly how a given protein will fold, especially for large pro-
the b chain that the hemoglobin molecules tend to crystallize, teins (more than 100 amino acids). In fact, one of the most
deforming the cell into a sickle shape. Not all amino acid substi- challenging unsolved problems in structural biochemistry is
tutions cause such dramatic changes in structure and function, to predict the final folded tertiary structure of a protein from
but this example underscores the crucial relationship between its known primary structure. Even with all our knowledge of
the amino acid sequence of a polypeptide and the final shape the factors and forces involved in folding, and the availability
and biological activity of the molecule. of supercomputers to do billions of calculations per second,
Although we know that the primary sequence of a protein we cannot often predict the most stable conformation for a
determines its final folded shape, we still are not able to predict given protein.
78
(a) Predominantly a helix (b) Predominantly b sheet (c) Mixed a helix and b sheet
Figure 3-13 Structures of Several Globular Proteins. Shown here are proteins with different tertiary
structures: (a) the predominantly a-helical structure of the coat protein of tobacco mosaic virus (TMV); (b) the
mainly b-sheet structure of the V2 domain of immunoglobulin; and (c) the mixture of a helices and b sheets
seen in domain 2 of hexokinase.
Chapter 3
In fact, in every other year since 1994, protein modelers Although we can deduce the primary structure of a
worldwide test their predictive methods in a major modeling polypeptide from the nucleotide sequence of the DNA encod-
experiment known as CASP—the critical assessment of tech- ing it, determining its overall three-dimensional conforma-
niques for protein structure prediction. Their predictions are tion is much more complicated. Key Technique, page 80,
organized into a multiprotein complex, with each protein compound pyruvate, the product of glycolysis (Chapter 10).
involved sequentially in a common multistep process. An ex- Three individual enzymes and five kinds of molecules called
ample of such a complex is an enzyme system called the pyru- coenzymes constitute a highly organized multienzyme complex.
vate dehydrogenase complex. This complex catalyzes the oxida- The pyruvate dehydrogenase complex is one of the best under-
tive removal of a carbon atom (as CO2) from the three-carbon stood examples of how cells can achieve economy of function
80
Chapter 3
information on diffraction patterns, see the “X-ray Crystallogra-
phy” section of the Appendix.)
Model Construction: The diffraction data are converted into Figure 3B-2 Protein Crystals for X-ray Crystallography.
an electron density map for the protein using specialized math- These crystals of pure lysozyme are ready for analysis. Note the
Diffraction pattern
Protein crystal
X-ray beam
X-ray instrument
1 X-rays diffracted by a protein crystal 2 The resulting diffraction pattern is 3 An electron density map of the
produce a diffraction pattern on a detector. analyzed mathematically. molecule is deduced.
by ordering the enzymes that catalyze sequential reactions CONCEPT CHECK 3.1
into a single multienzyme complex. Other multiprotein com- Suppose you have a patient suffering from anemia and find
plexes we will encounter in our studies include ribosomes, that her hemoglobin is missing three amino acids from the
proteosomes, the photosystems, and the DNA replication primary sequence. How might this affect each of the three
complex. higher levels of protein structure, thus causing this condition?
81