Professional Documents
Culture Documents
Dimensional Structure
of Proteins
© 2017 W. H. Freeman and Company
CHAPTER 4
The Three-Dimensional Structure of Proteins
Learning goals: six themes
• The three-dimensional structure taken up by a protein are determined by its
amino acid sequence.
• The function of a typical protein depends on its structure.
• Most isolated proteins exist in one or a small number of stable structural forms.
• The most important forces stabilizing the specific structures maintained by a
given protein are noncovalent; the hydrophobic effect is particularly important.
• Some common structural patterns that help to organize our understanding of
protein architecture.
• Protein structures are not static. All proteins undergo changes in conformation
ranging from subtle to dramatic. Parts of many proteins have no discernible
structure (→ critical to their function).
Overview of Protein Structure
• The organization around the peptide bond, paired with the identity of the R
groups, determines the secondary structure of the protein.
Distribution of f and y Dihedral Angles
• The Ramachandran plot is a visual description of the combinations of ϕ and
ψ dihedral angles that are permitted in a peptide backbone and those that
are not permitted due to steric constraints.
• shows the common secondary structure elements
no steric overlap
• reveals regions with unusual backbone structure conformations that are
not allowed
• Some f and y combinations are very
unfavorable because of steric interference
of backbone atoms with other atoms in
the backbone or side chains.
• Some f and y combinations are more
favorable because of chance to form
favorable H-bonding interactions along
the backbone.
Structure f y
Helix –57˚ –47˚
Conformation
Turn type I
i + 1a –60˚ –30˚
i + 2a –90˚ 0˚
Turn type II
i+2 +80˚ 0˚
The Helix
• Helical backbone is held together by hydrogen
bonds between the backbone amides of an n n
(C=O) and n + 4 (N—H) amino acids.
• It is a right-handed helix with 3.6 residues (5.4 n+4
Å) per turn.
• Peptide bonds are aligned roughly parallel
with the helical axis.
• Side chains point out and are roughly
perpendicular with the helical axis.
Hydrophobic
residues
Amino Acid Sequence Affects Stability of the α Helix
Sources: Data (except proline) from J. W. Bryson et al., Science 270:935, 1995. Proline data from J. K.
Myers et al., Biochemistry 36:10,923, 1997.
aDDG° is the difference in free-energy change, relative to that for alanine, required for the amino acid
residue to take up the α-helical conformation. Larger numbers reflect greater difficulty taking up the α-
helical structure. Data are a composite derived from multiple experiments and experimental systems.
The Helix Dipole
• Recall that the peptide bond has a strong dipole
moment.
– C−O (carbonyl) negative
– N−H (amide) positive
• All peptide bonds in the helix have a similar
orientation.
• The helix has a large macroscopic dipole moment
that is enhanced by unpaired amides and carbonyls
near the ends of the helix.
• Negatively charged residues often occur near the
positive end of the helix dipole.
The β Conformation Organizes Polypeptide
Chains into Sheets
• Pauling and Corey predicted a second type of repetitive structure, a more
extended conformation of polypeptide chains, the β conformation.
• The arrangement of several segments (zigzag structure) side by side, all of
which are in the β conformation, is called a β sheet.
• The planarity of the peptide bond and tetrahedral geometry of the carbon
create a pleated sheet-like structure.
• Sheet-like structure are held together by hydrogen bonds between the amide
and carbonyl groups of the peptide backbone in different strands.
• The R groups protrude from the zigzag structure, alternating in an up-and-
down direction.
Parallel and Antiparallel Sheets
• Two major orientations of sheets are determined by the directionality of
the strands.
• In parallel sheets, the H-bonded strands are oriented in the same direction.
• Hydrogen bonds between strands are bent (weaker).
• In antiparallel sheets, the H-bonded strands are oriented in opposite
directions.
• Hydrogen bonds between strands are linear (stronger).
β Turns Are Common in Proteins
• turns occur frequently whenever strands in sheets change the direction.
• The structure is a 180° turn involving four amino acid residues.
• The turn is stabilized by a hydrogen bond from a carbonyl oxygen (1st residue)
to amide Hydrogen (4th residue) of the bends.
• Proline (cis configuration) in position 2 or glycine (small and flexible) in
position 3 are common in turns.
α Helix, cross-linked by Tough, insoluble protective structures α-Keratin of hair, feathers, nails
disulfide bonds of varying hardness and flexibility
β Conformation Soft, flexible filaments Silk fibroin
Collagen triple helix High tensile strength, without stretch Collagen of tendons, bone matrix
Structure of -Keratin in Hair
• When hair is exposed to moist heat, it can be stretched. At the molecular level,
the α helices in the α-keratin of hair are stretched out until they arrive at the
fully extended β conformation.
• On cooling, they spontaneously revert to the α-helix conformation. The
characteristic “stretchability” of α-keratins, as well as their numerous disulfide
crosslinkages, is the basis of permanent waving (or hair straightening).
Structure of Collagen
• Collagen is an important constituent of connective tissue: tendons, cartilage,
bones, cornea of the eye.
• The collagen helix is a unique secondary structure, quite distinct from the α
helix (left-handed and three amino acid residues per turn; called α chains).
• Collagen is a coiled coil with distinct tertiary and quaternary structures:
– Three collagen chains are supertwisted about each other to a right-handed
superhelical triple helix.
His
Residues (%)a
Protein (total residues) α Helix β Conformation
Chymotrypsin (247) 14 45
Ribonuclease (124) 26 35
Carboxypeptidase (307) 38 17
Cytochrome c (104) 39 0
Lysozyme (129) 40 12
Myoglobin (153) 78 0
SOURCE: Data from C. R. Cantor and P. R. Schimmel, Biophysical Chemistry, Part I: The
Conformation of Biological Macromolecules, p. 100, W. H. Freeman and Company, 1980.
aPortions of the polypeptide chains not accounted for by α helix or β conformation consist of bends and
irregularly coiled or extended stretches. Segments of α helix and β conformation sometimes deviate
slightly from their normal dimensions and geometry.
• The number of known three-dimensional protein structures is now
more than 100,000 and doubles every couple of years.
• Protein Data Bank (PDB; www.pdb.org).
• The PDB is an archive of experimentally determined three-dimensional
structures of biological macromolecules, containing virtually all of the
macromolecular structures (proteins, RNAs, DNAs, etc.).
– The data files in the PDB describe the spatial coordinates of each atom for
which the position has been determined (many of the cataloged structures
are not complete).
– Additional data files provide information on how the structure was
determined and its accuracy (using structure visualization software).
Motif (fold)
• A motif or fold is a recognizable folding pattern involving two or more
elements of secondary structure and the connection(s) between them.
– The terms “motif” and “fold” are often used interchangeably, although “fold” is
applied more commonly to somewhat more complex folding patterns.
– Also called a (more rarely) supersecondary structure.
– A motif is not a hierarchical structural element falling between secondary and
tertiary structure. It is simply a folding pattern.
• Motifs can be found as recurring structures in numerous proteins.
• Globular proteins are composed of different motifs folded together.
Simple motif
Calcium-binding domains
PDB ID
Topology diagram
Quaternary Structure
• A quaternary structure is formed by the assembly of individual polypeptides
into a larger functional cluster.
• A multisubunit protein is also referred to as a multimer.
• The repeating structural unit in such a multimeric protein, whether a single
subunit or a group of subunits, is called a protomer.
β subunits
A tetramer or
a dimer of α β protomers
X-Ray Crystallography (Diffraction)
Steps
• purify the protein
• crystallize the protein
• collect x-ray diffraction data
• calculate electron density
• fit known amino acid residues into density
Pros
• no size limits
• well established
Cons
• limited to molecules that can be crystallized
• difficult for membrane proteins
• cannot resolve (see) hydrogens
Nuclear Magnetic Resonance
Steps
• purify the protein
• dissolve the protein
• collect NMR data
• assign NMR signals
• calculate the structure
Pros
• no need to crystallize the protein
• can be carried out on
macromolecules in solution
• can see many hydrogens
• can also illuminate the dynamic
side of protein structure
Cons
• difficult for insoluble proteins
• works best with small proteins
Proteostasis: Synthesis, Assembly, and Degradation
of Proteins in vivo
GroEL/GroES complex
Protein Misfolding Is the Basis of Numerous Human
Genetic Disorders
• Protein misfolding is a substantial problem in all
cells, and a quarter or more of all polypeptides
synthesized may be destroyed because they do
not fold correctly.
• Misfolding causes or contributes to the
development of serious disease.
• Type 2 diabetes, Alzheimer disease, Huntington
disease, and Parkinson disease, are associated
with misfolding.
• a soluble protein that is normally secreted from
the cell is secreted in a misfolded state and
converted into an insoluble extracellular amyloid
fiber with a high degree of β-sheet structure
(characteristic of Alzheimer disease).
• Misfolded amyloid promotes aggregation at
newly exposed protein-protein interface.
• Correctly folded helices are lost and peptides
form strands, helices, and sheets.
• A misfolded brain protein seems to be the causative agent of several rare
degenerative brain diseases in mammals.
– BSE (bovine spongiform encephalopathy (holes in brain); also known as mad cow
disease)
– Kuru, Creutzfeldt-Jakob disease in human
– scrapie in sheep
• Disease-causing agents seemed to lack nucleic acids and was a protein.
• The infectious agent has been traced to a single protein (Mr 28,000), called prion
protein (PrP). (proteinaceous infectious → proin → prion)
• Prion protein is a normal constituent of brain tissue in all mammals.
• Illness occurs only when the normal cellular PrP, or PrPC, occurs in an altered
conformation called PrPSc (Sc denotes scrapie).
– The structure of PrPC has two α helices but PrPSc is very different, with much of the
structure converted to amyloidlike β sheets.
– The interaction of PrPSc with PrPC converts the latter to PrPSc, initiating a domino effect.
– The mechanism is not understood. And how prion protein affects brain function?
– Prionlike proteins may be responsible for additional neurodegenerative diseases
(Parkinson disease).