Professional Documents
Culture Documents
INTRODUCTORY BIOCHEMISTRY
AMINO ACIDS
General Structure
• Proteins are the most abundant organic molecules in cells, constituting 50% or more
of their dry weight
• There are many different kinds of proteins, each specialized for a different biological
function. And most of the genetic information is expressed by proteins
• Proteins are made up of 20 common α-amino acids
2
AMINO ACIDS
General Structure
• Under normal cellular conditions amino acids are zwitterions (dipolar ions):
vAmino group = -NH3+
vCarboxyl group = -COO-
3
AMINO ACIDS
General Structure
• 19 of the 20 common amino acids have a chiral α-carbon atom (Gly does not)
vChiral carbons - have four different groups attached
Glycine Alanine
4
AMINO ACIDS
General Structure
• Threonine and isoleucine have 2 chiral carbons each (4 possible stereoisomers each)
v Stereoisomers - compounds that have the same molecular formula but differ in
the arrangement of atoms in space
Threonine Isoleucine
5
AMINO ACIDS
General Structure
• Mirror image pairs of amino acids are designated L (levo) and D (dextro) and are called
enantiomers
• Proteins are assembled from L-amino acids (a few D-amino acids occur in nature)
6
AMINO ACIDS
Abbreviations
7
CLASSIFCATION OF AMINO ACIDS
Based on the Polarity of R Groups
① Nonpolar (hydrophobic) R groups Ala, Val, Leu, Ile, Pro, Trp, Phe, Met
② Uncharged Polar R groups Gly, Ser, Thr, Cys, Tyr, Asn, Gln
8
CLASSIFICATION OF AMINO ACIDS
Nonpolar R Groups
Alanine [A] Valine [V] Leucine [L] Isoleucine [I] Proline [P]
(Ala) (Val) (Leu) (Ile) (Pro)
Tryptophan [W]
(Trp) Phenylalanine [F] Methionine [M]
(Phe) (Met)
9
CLASSIFICATION OF AMINO ACIDS
Uncharged Polar R Groups
Glycine [G] Serine [S] Threonine [T] Tyrosine [Y] Cysteine [C]
(Gly) (Ser) (Thr) (Tyr) (Cys)
10
CLASSIFICATION OF AMINO ACIDS
Acidic Amino Acids
11
CLASSIFICATION OF AMINO ACIDS
Basic Amino Acids
+
H
12
CLASSIFICATION OF AMINO ACIDS
Hydrophobicity of Amino Acid Side Chains
• Hydropathy: the relative hydrophobicity of each
amino acid
• The larger the hydropathy, the greater the
tendency of an amino acid to prefer a hydrophobic
environment
• Hydropathy affects protein folding:
– hydrophobic side chains tend to be in the interior
– hydrophilic residues tend to be on the surface
• The free-energy change is for transfer of an amino
acid from the interior of a lipid bilayer to water
13
CLASSIFICATION OF AMINO ACIDS
Based on the Chemical Composition of R Groups
① Aliphatic Gly, Ala, Val, Leu, Ile
② Imino Pro
14
CLASSIFICATION OF AMINO ACIDS
Nonpolar, Aliphatic R Groups
Glycine [G] Alanine [A] Valine [V]
• R groups are hydrophobic (Gly) (Ala) (Val)
15
CLASSIFICATION OF AMINO ACIDS
Imino R Groups
• Pro has an aliphatic side chain and secondary amino Proline [P]
(Pro)
(imino) group.
16
CLASSIFICATION OF AMINO ACIDS
Side Chains with Alcohol Groups
17
CLASSIFICATION OF AMINO ACIDS
Sulfur-Containing R Groups
18
CLASSIFICATION OF AMINO ACIDS
Sulfur-Containing R Groups
Formation of cystine
19
CLASSIFICATION OF AMINO ACIDS
Aromatic R Groups
21
CLASSIFICATION OF AMINO ACIDS
Acidic R Groups
Amide Derivatives
• Asp and Glu form amide linkages to form Asn and Gln
22
CLASSIFICATION OF AMINO ACIDS
Basic R Groups
Lysine [K] Arginine [R] Histidine [H]
• Positively charged at pH6.0 (Lys) (Arg) (His)
• Side chains are nitrogenous bases which are substantially positively charged at pH 7
• His residues facilitate many enzyme-catalyzed reactions by serving as proton
donors/ acceptors
23
CLASSIFICATION OF AMINO ACIDS
Based on Human Nutrition
• Amino acids that cannot be synthesized by our bodies are essential
Nonessential Essential
Alanine Arginine*
Asparagine Histidine
Aspartate Isoleucine
Cysteine Leucine
Glutamate Lysine
Glutamine Methionine
Glycine Phenylalanine
Proline Threonine
Serine Tryptophan
Tyrosine Valine
*Essential in young, growing animals but not in adults 24
CLASSIFICATION OF AMINO ACIDS
Based on Human Nutrition
• Humans take essential amino acids from their diet (plants) or from bacteria living in
their intestines
• Bacteria such as E. coli can synthesize the entire common 20 amino acids, whereas
humans can make half of them
• Essential amino acids should be of particular concern to:
– Pregnant women
– Adults caring for children
– Vegetarians
– Weight watchers
25
OTHER AMINO ACIDS AND DERIVATIVES
Rare Amino Acids of Proteins
4-Hydroxyproline
• Found in plant cell wall proteins (extensins)
• Found in the fibrous protein collagen
5-Hydroxylysine
• Found in collagen
26
OTHER AMINO ACIDS AND DERIVATIVES
Non-protein Amino Acids
• Over 200 different amino acids are found in nature
Homocysteine
• Intermediates in aa metabolism HS – CH2 – CH2 – CH – COOH
NH2
Homoserine
• Intermediates in aa metabolism HO – CH2 – CH2 – CH – COOH
NH2
27
OTHER AMINO ACIDS AND DERIVATIVES
Non-protein Amino Acids
Ornithine
• Key intermediate in the biosynthesis of H2N – CH2 – CH2 – CH2 – CH – COOH
arginine and in the urea cycle NH2
Citrulline
• Key intermediate in the biosynthesis of H2N – C – N – CH2 – CH2 – CH2 – CH – COOH
arginine and in the urea cycle O H NH2
28
OTHER AMINO ACIDS AND DERIVATIVES
Additional Common Amino Acids
• N-formylmethionine, selenocysteine, and pyrrolysine are incorporated at specific codons
à additions to the standard repertoire of protein precursors
N-formylmethionine
• 21st amino acid residue
• Initial amino acid during protein synthesis
in bacteria
Selenocysteine
• 22nd amino acid (rare)
• Contains selenium rather than the sulfur of
cysteine
• Incorporated into a few proteins
29
OTHER AMINO ACIDS AND DERIVATIVES
Additional Common Amino Acids
Pyrrolysine
• 23rd amino acid
• Found in some species of archaebacteria
30
OTHER AMINO ACIDS AND DERIVATIVES
Compounds Derived from Common Amino Acids
γ-Aminobutyrate (GABA)
• Derived from glutamate
• Important neurotransmitter in mammalian
brains
Histamine
• Derived from histidine
• Controls constriction of blood vessel
and secretion of HCl by the stomach
• Involved in allergic reactions
31
OTHER AMINO ACIDS AND DERIVATIVES
Compounds Derived from Common Amino Acids
Epinephrine (adrenaline)
Norepinephrine (noradrenaline)
• Hormones that help regulate ( )
metabolism in mammals
Thyroxine / Triiodothyronine
• Derived from tyrosine
• Regulate metabolism
32
AMINO ACIDS
Ionization of Amino Acids
• Ionizable groups in amino acids function as weak acids and bases:
α-carboxyl
α-amino
some side chains
33
AMINO ACIDS
Ionization of Amino Acids
• Each ionizable group has a specific pKa
AH B + H+
• For a solution pH below the pKa, the
protonated form predominates (AH)
• For a solution pH above the pKa, the
unprotonated form predominates (B)
34
AMINO ACIDS
Ionization of alanine
Titration curve for alanine
• Titration curves are used to determine
pKa values
pK1 = 2.4
pK2 = 9.9
pIAla = isoelectric point
35
AMINO ACIDS
Ionization of histidine
Titration curve of histidine
pK1 = 1.8
pK2 = 6.0
pK3 = 9.3
36
AMINO ACIDS
Ionization of histidine
Deprotonation of imidazolium ring
37
PEPTIDES
Peptide Bonds
• Peptide bond - linkage between amino acids is a secondary amide bond
• Formed by condensation of the α-carboxyl of one amino acid with the α-amino of
another amino acid (loss of H2O molecule)
38
PEPTIDES
Polypeptide chain nomenclature
• Amino acid residues compose peptide chains:
àFormation of peptide bonds eliminates the ionizable α-carboxyl and α-amino
groups of the free amino acids
• Peptide chains consist of a backbone of repeating units that differ in the R groups
• Peptide chains are numbered from the N (amino) terminus to the C (carboxyl)
terminus
39
PEPTIDES
Polypeptide chain nomenclature
A pentapeptide:
à Serylglycyltyrosylalanylleucine
à Ser-Gly-Tyr-Ala-Leu
à SGYAL
40
PEPTIDES
Important Biological Compounds
• Insulin (regulates glucose metabolism)
• Endorphins (naturally occurring molecules that modulate pain in vertebrates)
• Useful food additives (Aspartame; artificial
sweetener)
- commercially synthesized dipeptide methyl
ester - aspartylphenylalanine methyl ester
- about 200 times sweeter than table sugar)
- Used in diet drinks
Aspartame
41
PEPTIDES
Peptides of Non-protein Origins
• Glutathione (γ-glutamylcysteinylglycine)
O O
HOOC – CH – CH2 – CH2 – C – NH – CH – C – NH – CH2 – COOH
NH2 CH2
SH
42
PROTEINS
Three-Dimensional Structure and Function
• E. coli has about 4000 different polypeptides (average size 300 amino acids, Mr
33,000)
• Fruit fly (Drosophila melanogaster) about 16,000, humans, other mammals about
40,000 different polypeptides
43
PROTEINS
Three-Dimensional Structure and Function
• Proteins from E. coli cells are separated by Escherichia coli proteins.
two-dimensional gel electrophoresis
44
PROTEINS
Three-Dimensional Structure and Function
• Native conformation - each protein folds into a single stable shape (physiological
conditions)
45
PROTEINS
Levels of Protein Structure
• Primary structure - linear sequence of amino acids in a polypeptide or protein
àdetermined by the number, kind and sequence of amino acids
àDevoid of function or any biological activity
• Secondary structure - regions of regularly repeating conformations of the peptide
chain, such as α-helices and β-sheets
• Tertiary structure - describes the shape of the fully folded polypeptide chain
• Quaternary structure - arrangement of two or more polypeptide chains into
multisubunit molecule
46
PROTEINS
Levels of Protein Structure
47
PROTEINS
Methods for Determining Protein Structure
• X-ray crystallography is used to determine the three-dimensional conformation of
proteins
48
PROTEINS
Methods for Determining Protein Structure
Bovine (Bos taurus) ribonuclease A
• Ribonuclease A is a secreted enzyme that hydrolyzes RNA during digestion
49
PROTEINS
Methods for Determining Protein Structure
• NMR (nuclear magnetic resonance) is used to analyze protein structure in solution
• The basis of NMR is the observation that an atomic nucleus such as a proton (a
hydrogen nucleus) resonates in an applied magnetic field in a way that is sensitive to
its electronic environment and its interactions with nearby nuclei
50
PROTEINS
Structural Bioinformatics
• Bioinformatics deals with information related to molecular sequences and structures
• Structural bioinformatics is a branch of bioinformatics concerned with how
macromolecular structures are displayed and compared (see tools on next slide)
• The atomic coordinates of nearly 50,000 macromolecular structures, including
proteins, nucleic acids and carbohydrates are archived in the Protein Data Bank
(PDB)
51
PROTEINS
Structural Bioinformatics
Structural Bioinformatics Internet Addresses (1/2)
Structural Databases
Protein Data Bank (PDB): http://www.rcsb.org/pdb/
Nucleic Acid Database: http://ndbserver.rutgers.edu
Molecular Modeling Database (MMDB): http://www.ncbi.nlm.nih.gov/Structure/index.shtml
Most Representative NMR Structure in an Ensemble: http://pqss.ebi.ac.uk/pqs-nmr.html
PQS Protein Quaternary Structure Query Form at the EBI: http://pqs.ebi.ac.uk
Molecular Genetics Programs
Cn3D: http://www.ncbi.nhlm.nih.gov/Structure/CN3D/cn3d.shtml
Jmol: http://jmol.sourceforge.net/
KiNG: http://kinemage.biochem.duke.edu
FirstGlance: http://molvis.sdsc.edu/fgij/index.htm
Swiss-PDB Viewer (Deep View): http://us.expasy.org/spdbv
52
PROTEINS
Structural Bioinformatics
Structural Bioinformatics Internet Addresses (2/2)
Structural Classification Algorithms
CATH (Class, Architecture, Topology, and Homologous superfamily): http://www.cathdb.info/latest/index/html
53
PROTEINS
The Conformation of the Peptide Group
• The peptide group consists of 6 atoms
• Peptide bonds have some double bond properties
so that their conformation is restricted to either
Trans
trans or cis
• Nearly all peptide groups in proteins are in the
trans conformation
• Cis conformation is less favorable than trans due
to steric interference of α-carbon side chains
• Cis-trans isomerases can catalyze the
interconversion of cis and trans conformations
54
PROTEINS
The Conformation of the Peptide Group
Resonance structure of the peptide bond
55
PROTEINS
The Conformation of the Peptide Group
• Rotation around the C—N bond is restricted due to the double-bond nature of the
resonance hybrid form
• Peptide groups (blue planes) are therefore planar
56
PROTEINS
The Conformation of the Peptide Group
• The backbone of a protein (main chain) can be drawn as a linked sequence of rigid
planar peptide groups (repeating N—Cα—C backbone)
57
PROTEINS
Rotation of Atoms in a Peptide Group
• The rotation of the backbone can be (a) Peptide groups in an extended
conformation
described by the rotation angles around the
Cα—N bond (f) and the Cα—C bond (Y) of
each residue
• Rotation is restricted by steric interference
between main-chain and side-chain atoms
(b) Peptide groups in an unstable
conformation
• Rotation of the N—Cα bond in proline is
restricted because of the pyrrolidine ring
structure
58
PROTEINS
Ramachandran plots
• The Ramachandran plot indicates allowed conformations of polypeptides
• Conformation of a polypeptide chain can be solely described by f and Y angles
• Ramachandran plots of f and Y show permissible angles for polypeptide chains
• Some f and Y angles are not allowed because of steric hindrance (f and Y values
that would bring atoms closer than the corresponding van der Waals distance - the
distance of closest contact between non-bonded atoms)
59
PROTEINS
Ramachandran plots
• Conformations of several types of secondary structures fall within permissible areas
Sterically
allowed f and Y
angles for all aa Orange circles
except Gly and represent
Pro conformational angles
of secondary structures
(α) α-helix
(αL) left handed α-helix
(⥣) parallel β- sheet
More crowded (⥮) antiparallel β sheet
(outer limit) f (C) Collagen helix
and Y angles
60
PROTEIN SECONDARY STRUCTURE
The α Helix
• The simplest arrangement the polypeptide chain can C-terminus
assume is a helical structure
• The α helix forms more readily than any other
conformation
• Helix is stabilized by many hydrogen bonds (which are
nearly parallel to long axis of the helix)
• Each C=O (residue n) forms a hydrogen bond with the
amide hydrogen of residue n+4
• All C=O groups point toward the C-terminus (entire
helix is a dipole with (+) N, (-) C-termini)
N-terminus
61
PROTEIN SECONDARY STRUCTURE
The α Helix
C-terminus The φ and ψ angles of each
residue are similar:
near -57 (φ) and near -47 (ψ)
N-terminus 62
PROTEIN SECONDARY STRUCTURE
The α Helix
• The length of a helix in a protein can range from 4 or 5 residues to more than 40 – the
average is about 12
• The helix is amphipathic – hydrophilic amino acids on the face of the helix cylinder
and hydrophobic residues on the opposite face
α helix in horse liver alcohol dehydrogenase
63
PROTEIN SECONDARY STRUCTURE
The α Helix
64
PROTEIN SECONDARY STRUCTURE
The α Helix
65
PROTEIN SECONDARY STRUCTURE
β Strands and β Sheets
• β Strands - polypeptide chains that are almost fully extended (0.32 – 0.34 Å)
• β Sheets - multiple β strands arranged side-by-side
• Each strand has an average of 6 amino acid residues and each sheet is made of 2 – 15
strands
• β Strands are stabilized by hydrogen bonds between C=O and -NH on adjacent
strands
• The adjacent polypeptide chains in a β sheet can be either parallel or antiparallel
(more stable)
66
PROTEIN SECONDARY STRUCTURE
β Strands and β Sheets
Parallel β sheet Antiparallel β sheet
68
PROTEIN SECONDARY STRUCTURE
Interactions of β Sheets
• β Sheet side chains project alternately above and below the plane of the β strands
• One surface of a β sheet may consist of hydrophobic side chains that can interact
with other hydrophobic residues in protein interior
• Amphipathic α helices have hydrophobic side chains projecting outward that can
interact with hydrophobic faces of β sheets or other helices
69
PROTEIN SECONDARY STRUCTURE
Interactions of β Sheets
Structure of PHL P2 from Timothy grass (Phleum pratense) pollen
Antiparallel β sheets within a protein Stereo view of the β sandwich
71
PROTEIN SECONDARY STRUCTURE
Loops and Turns
Type I β Turn Type II β Turn
72
PROTEIN TERTIARY STRUCTURE
• Tertiary structure results from the folding of a polypeptide chain into a closely-
packed three-dimensional structure
• Amino acids far apart in the primary structure may be brought together
• Stabilized primarily by noncovalent interactions (e.g. hydrophobic effects, van der
Waals and charge-charge interactions, and hydrogen bonding) between side chains
• Disulfide bridges also part of tertiary structure
73
NONCOVALENT INTERACTIONS
74
NONCOVALENT INTERACTIONS
75
NONCOVALENT INTERACTIONS
76
PROTEIN TERTIARY STRUCTURE
Supersecondary Structures
• Certain combinations of secondary structures form motifs
• Grouping of secondary structural elements, called supersecondary structures (motifs), occur
in many unrelated globular proteins
77
PROTEIN TERTIARY STRUCTURE
Supersecondary Structures
βαβ unit • two parallel β strands linked to an
intervening a helix by two loops
• most common form of motifs
78
PROTEIN TERTIARY STRUCTURE
Domains
• Independently folded, compact units in proteins (~1400 different protein domains)
• Domain size: ~25 to ~300 amino acid residues
• Each domain is a distinct compact unit consisting of various elements of secondary
structure
• Domains are connected to each other by loops, bound by weak interactions between
side chains
79
PROTEIN TERTIARY STRUCTURE
Domains
• A single domain may have a particular function (e.g. binding small molecules,
catalyzing a single reaction)
• Interfaces between two separate domains provide crevices, grooves, and pockets on
the surface of a protein for binding or catalytic sites
• In multifunctional enzymes, each catalytic activity can be on one of several domains
80
PROTEIN TERTIARY STRUCTURE
Domains
Four categories of protein domains
82
PROTEIN TERTIARY STRUCTURE
Domains
• Within each of the four main structural categories, domains can be classified by
characteristic folds
• A fold is a combination of secondary structures that form the core of a domain
• Some domains have simple folds, others have more complex folds (~200 different
folding patterns)
Parallel twisted
sheet β barrel α/β barrel β helix
83
PROTEIN TERTIARY STRUCTURE
Examples
84
PROTEIN QUATERNARY STRUCTURE
85
PROTEIN QUATERNARY STRUCTURE
• The fact that a large proportion of proteins consist of multiple subunits is probably
related to several factors:
Oligomers are more stable
Active sites are formed by residues from adjacent polypeptide chains
Binding of ligands changes the 3D structure of many oligomeric proteins –
important in regulation of biological activity
Different proteins can share the same subunits – evolution
A multisubunit protein may bring together two sequential enzymatic steps
86
PROTEIN QUATERNARY STRUCTURE
Natural occurrence of oligomeric proteins in Escherichia coli
Oligomeric Number of Number of
state homooligomers heterooligomers Percent
Monomer 72 19.4
Dimer 115 27 38.2
Trimer 15 5 5.4
Tetramer 62 16 21.0
Pentamer 1 1 0.1
Hexamer 20 1 5.6
Heptamer 1 1 0.1
Octamer 3 6 2.4
Nonamer 0 0 0.0
Decamer 1 0 0.0
Undecamer 0 1 0.0
Dodecamer 4 2 1.6
Higher oligomers 8 2.2
Polymers 10 2.7 87
PROTEIN QUATERNARY STRUCTURE
Examples
88
PROTEIN FOLDING AND STABILITY
• Folded proteins occupy a low-energy well that makes the native structure most stable
• Many proteins can fold spontaneously to this low-energy conformation
• Proteins are thought to fold cooperatively … the first few interactions assist
subsequent alignment and folding
• Folding is extremely rapid, the native conformation is generally reached < 1 second
89
PROTEIN FOLDING AND STABILITY
• During folding the polypeptide collapses in upon itself due to the hydrophobic effect
• An intermediate “molten globule” forms with elements of secondary structure
• The backbone is rearranged to achieve a stable native conformation
90
PROTEIN FOLDING AND STABILITY
Energy well of protein folding
(a)
• The funnels represent the free-energy
potential of folding proteins
(a) A simplified funnel showing two
possible pathways to the low-energy native
protein. In path B the polypeptide enters a
local low-energy minimum as it folds
91
PROTEIN FOLDING AND STABILITY
92
PROTEIN FOLDING AND STABILITY
Chaperones
• Molecular chaperones increase rate of correct folding and prevent the formation of
incorrectly folded intermediates
• Chaperones can bind to unassembled protein subunits to prevent incorrect
aggregation before they are assembled into a multisubunit protein
• Most chaperones are heat shock proteins (synthesized as temperature increases)
93
PROTEIN FOLDING AND STABILITY
Chaperones
94
PROTEIN DENATURATION AND RENATURATION
95
PROTEIN DENATURATION AND RENATURATION
Thermal Denaturation
96
PROTEIN DENATURATION AND RENATURATION
Chemical Denaturation
97
PROTEIN DENATURATION AND RENATURATION
Denaturation and Renaturation of Ribonuclease A
98
PROTEIN PURIFICATION TECHNIQUES
Column Chromatography
• Preparation of a solution of proteins - Most techniques are performed at 0-4oC
Breaking open tissue cells and releasing their proteins into a solution (crude
extract)
Precipitation of proteins by addition of certain salts (ammonium sulfate)
Removal of small solutes and salts by dialysis
99
PROTEIN PURIFICATION TECHNIQUES
Column Chromatography
• A cylindrical column is filled with an
insoluble material (cellulose or synthetic
beads), protein mixture is applied to the
column and washed through the matrix of
insoluble material by the addition of
solvent
• As solvent flows through the column, the
eluate (liquid emerging for the bottom of
the column) is collected in many fractions.
• Absorbance is measured at 280nm
• May be performed under high pressure
(HPLC)
100
PROTEIN PURIFICATION TECHNIQUES
Ion-Exchange Chromatography
• Separation based upon the overall charge
of molecules
• The matrix carries positive charges
(anion-exchange resins) or negative
charges (cation-exchange resins)
• Anion-exchange matrices bind negatively
charged proteins, retaining them in the
matrix for subsequent elution (and
cation-exchange resins bind positively
charged proteins)
• Bound proteins can be serially eluted by
gradually increasing the salt
concentration in the solvent 101
PROTEIN PURIFICATION TECHNIQUES
Gel-Filtration Chromatography
• Also called size-exclusion chromatography
• Separation based upon molecular size
• The gel is a matrix a porous beads
• Proteins that are smaller than average pore
size penetrate much of the internal volume
of the beads and are therefore retarded by
the matrix as the buffer solution flows
through the column
• The smaller the protein, the later it elutes
from the column
102
PROTEIN PURIFICATION TECHNIQUES
Affinity Chromatography
• Relies on specific binding interactions
between the target protein and some other
molecule (an antibody that recognizes the
target protein or another protein) that is
covalently bound to the matrix of the column
• While all the proteins pass through the
column, only the target protein binds to the
matrix
• The target protein is eluted by washing the
column with a solvent containing a high
concentration of salt that disrupts the
interaction between the protein and the
matrix 103
PROTEIN PURIFICATION TECHNIQUES
Electrophoresis
104
PROTEIN PURIFICATION TECHNIQUES
Electrophoresis
105
AMINO ACID SEQUENCING OF PROTEINS
Importance
Clinical application – many inherited diseases are caused by mutations that result
in an amino acid change in a protein
- Sickle cell anemia: results from the replacement of glutamate residue (6) in the
β chains of normal adults with a valine residue
106
AMINO ACID SEQUENCING OF PROTEINS
Step 1 – Separating Subunits
• Chaotropic agents (8M urea and guanidinium chloride) and detergents (SDS) are used
• The subunits are separated by ion-exchange chromatography or gel filtration
• Disulfide bonds must be cleaved:
à To permit the separation of polypeptide chains
à To prevent the native protein conformation
107
AMINO ACID SEQUENCING OF PROTEINS
Step 1 – Separating Subunits
Breaking disulfide bonds by performic acid (1/3)
• Performic acid converts all cysteine residues whether linked by S-S bridges or not to
cysteic acid residues
• Cysteic acid is stable in both acidic and basic solutions so the total cysteine content of
a protein may be determined as cysteic acid
• Disadvantages of using performic acid:
• Oxidizes methionine residues to methionine sulfone (BrCN cannot be used in
sequencing)
• Partially destroys the indole side chain of tryptophan
108
AMINO ACID SEQUENCING OF PROTEINS
Step 1 – Separating Subunits
Breaking disulfide bonds by performic acid (2/3)
+
Performic acid
109
AMINO ACID SEQUENCING OF PROTEINS
Step 1 – Separating Subunits
Breaking disulfide bonds by performic acid (3/3)
110
AMINO ACID SEQUENCING OF PROTEINS
Step 1 – Separating Subunits
Breaking disulfide bonds by 2-Mercaptoethanol
2-Mercaptoethanol is used in
the reduction of a cystine
residue
112
AMINO ACID SEQUENCING OF PROTEINS
Step 2 - Determining Amino Acid Composition
• The amino acid composition is quantitatively determined using an automated amino
acid analyzer, which separates amino acids by ion-exchange chromatography
• Amino acids are identified as peak on a chromatogram – both kinds and amounts of
amino acids are detected
• The analysis of a protein digest can be done in <1h and can detect as little as 1 ρmol
of each amino acid
• Phenylisothiocyanate (PITC) used to derivatize the amino acids prior to HPLC analysis
113
AMINO ACID SEQUENCING OF PROTEINS
Step 2 - Determining Amino Acid Composition
Amino acid treated with Chromatogram obtained from HPLC
phenylisothiocyanate (PITC) separation of PTC–amino acids
Absorbance at 254 nm
B = totals of asparagine + aspartate
Z = glutamine + glutamate
114
AMINO ACID SEQUENCING OF PROTEINS
Step 3 - Edman Degradation
N-TERMINAL IDENTIFICATION
Treat peptide with phyenylisothiocyanate (PITC; also known as Edman’s reagent)
which reacts with the N-terminus to form a phenylthiocarbamyl (PTC) polypeptide
Treat with anhydrous trifluoroacetic acid (TFA) or hydrofluoric acid (HF) to selectively
cleave the N-terminal peptide bond as a thiazolinone derivative
Separate N-terminal derivative from peptide into an organic solvent
Convert derivative to the more stable phenylthiohydantoin (PTH) amino acid
derivative by treatment with aqueous acid
The PTH-amino acid may be identified by comparing it with known standards using
TLC, HPLC, GLC, or electrophoresis
115
AMINO ACID SEQUENCING OF PROTEINS
Step 3 - Edman Degradation
Edman degradation procedure (1/2)
116
AMINO ACID SEQUENCING OF PROTEINS
Step 3 - Edman Degradation
Edman degradation procedure (2/2)
118
AMINO ACID SEQUENCING OF PROTEINS
Step 4 - Cleaving the Polypeptide Chain
0.1 N HCl
or
70% formic acid
Peptidyl homoserine lactone
+
BrCN
Aminoacyl peptide
119
AMINO ACID SEQUENCING OF PROTEINS
Step 4 - Cleaving the Polypeptide Chain
Protease enzymes cleave specific peptide bonds
• Trypsin: C-side of basic residues (Lys, Arg), provided the next residue is not Pro
Trypsin
+
H2O
120
AMINO ACID SEQUENCING OF PROTEINS
Step 4 - Cleaving the Polypeptide Chain
Protease enzymes cleave specific peptide bonds
• Staphylococcus aureus V8 protease: C-side of negatively charged residues (Glu, Asp)
• Chymotrypsin: C-side of aromatic or bulky noncharged aliphatic residues (e.g. Phe, Tyr,
Trp, Leu)
121
AMINO ACID SEQUENCING OF PROTEINS
Step 4 - Cleaving the Polypeptide Chain
122
AMINO ACID SEQUENCING OF PROTEINS
Step 5 – Overlap Peptides
Cleavage and sequencing of an oligopeptide
123
AMINO ACID SEQUENCING OF PROTEINS
Sequences of DNA and protein
• Protein amino acid sequences can be deduced from the sequence of nucleotides in
the corresponding gene
• A sequence of three nucleotides specifies one amino acid (A,C,G,T are DNA residues )
124
125
TYPES OF PROTEINS
Classification based on their Conformation
Fibrous proteins
- Physically tough and insoluble in water
- Often assembled into large cables or threads
- Provide mechanical support and are basic structural elements in the connective
tissue of higher animals
- α-Keratins: major components of hair and nails
- Collagen: major component of tendons, skin, bones and teeth
126
TYPES OF PROTEINS
Classification based on their Conformation
Globular proteins
- Usually water soluble, compact, roughly spherical
- Hydrophobic interior, hydrophilic surface
- Globular proteins include enzymes, antibodies, regulatory and carrier proteins
(serum albumin and hemoglobin)
• Some proteins fall between fibrous and globular proteins (myosin and fibrinogen)
127
TYPES OF PROTEINS
Classification based on their Composition
Simple Proteins
- Those which on hydrolysis yield only amino acids and no other major organic or
inorganic products
Conjugated proteins
- Those which on hydrolysis yield not only amino acids but also a prosthetic group
made up of organic or inorganic components
128
TYPES OF PROTEINS
Classification based on their Composition
Conjugated Proteins
Class Prosthetic group(s) Example
Nucleoproteins DNA, RNA Chromosomes, ribosomes
Lipoproteins Lipids β1-Lipoprotein of blood
Glycoproteins Carbohydrates Immunoglobin G
Phosphoproteins Phosphate groups Casein of milk
Hemoproteins Heme (iron porphyrin) Hemoglobin
Flavoproteins Flavin nucleotides Succinate dehydrogenase
Metalloproteins Iron Ferritin
Zinc Alcohol dehydrogenase
Calcium Calmodulin
Molybdenum Nitrogenase
Copper Cytochrome oxidase
129
TYPES OF PROTEINS
Glycoproteins
• Proteins that contain covalently-bound oligosaccharides
• Oligosaccharide chains exhibit great variability in sugar sequence and composition. They
include 1 to over 30 sugar residues and may account for 80% of the mass of the protein
• 9 sugars predominate in eukaryotic glycoproteins
• Hexoses: L-fucose, D-galactose, D-glucose, D-mannose
• Pentoses: D-xylose, L-arabinose
• Amino sugars: N-acetyl-D-galactosamine, N-acetyl-D-glucosamine and sialic
acid (N-acetlyneuraminic acid)
• Glycoforms - proteins with identical amino acid sequences but different oligosaccharide
chain composition
• A single glycoprotein molecule might contain up to four branches of oligosaccharides,
which might be O-glycosidic or N-glycosidic or both
130
TYPES OF PROTEINS
Glycoproteins
• Two major types of carbohydrate-peptide linkages (O-glycosidic and N-glycosidic)
O-GLYCOSIDIC BOND
• From N-acetylglalactosamine to the –OH of serine or threonine
131
TYPES OF PROTEINS
Glycoproteins
Four subclasses of O-glycosidic linkages
GalNAc-Ser/Thr
(most common)
Gal-Gal-Xyl-Ser-core protein
(found in certain proteoglycans)
132
TYPES OF PROTEINS
Glycoproteins
N-GLYCOSIDIC BOND
• From N-acetylglucosamine to the amide nitrogen of asparagine
• Most N-linked oligosaccharides can be divided into three subclasses: high mannose,
complex, and hybrid
133
TYPES OF PROTEINS
Glycoproteins
Structures of N-linked oligosaccharides
High-mannose
chain
Complex
chain
Hybrid chain
134
TYPES OF PROTEINS
Glycoproteins
Some Functions of Glycoproteins
• Included in nearly all proteins of the blood serum:
• Enzymes (ribonuclease B, glucose oxidase)
• Hormones (human chorionic gonadotropin hCG)
• Blood group proteins of the erythrocyte membrane
• Interferon and ovalbumin
• Only a handful of sugar-free serum proteins are known: insulin and albumin
135
PROTEINS
Functional Diversity of Proteins
3. Transport proteins - Capable of binding and transporting specific Hemoglobin, myoglobin, serum
types of molecule via the blood albumin, lipoproteins
136
PROTEINS
Functional Diversity of Proteins
137
PROTEINS
Functional Diversity of Proteins
138