Professional Documents
Culture Documents
Principles of Biochemistry - Amino Acids and Proteins
Principles of Biochemistry - Amino Acids and Proteins
proteins
Amino acids are molecules containing an amine cell shape. Other proteins are important in cell signaling,
group(NH2 ), a carboxylic acid group(R-C=O-OH) and immune responses, cell adhesion, and the cell cycle. Pro-
a side-chain( usually denoted as R) that varies between teins are also necessary in animals diets, since animals
dierent amino acids. The key elements of an amino cannot synthesize all the amino acids they need and must
acid are carbon, hydrogen, oxygen, and nitrogen. They obtain essential amino acids from food. Through the pro-
are particularly important in biochemistry, where the cess of digestion, animals break down ingested protein
term usually refers to alpha-amino acids.Proteins are into free amino acids that are then used in metabolism.
biochemical compounds consisting of one or more Proteins were rst described by the Dutch chemist Ger-
polypeptides typically folded into a globular or brous
hardus Johannes Mulder and named by the Swedish
form in a biologically functional way. A polypeptide chemist Jns Jakob Berzelius in 1838. Early nutritional
is a single linear polymer chain of amino acids bonded scientists such as the German Carl von Voit believed that
together by peptide bonds between the carboxyl and protein was the most important nutrient for maintaining
amino groups of adjacent amino acid residues. The the structure of the body, because it was generally be-
sequence of amino acids in a protein is dened by the lieved that esh makes esh. The central role of proteins
sequence of a gene, which is encoded in the genetic code. as enzymes in living organisms was however not fully ap-
In general, the genetic code species 20 standard amino preciated until 1926, when James B. Sumner showed that
acids; however, in certain organisms the genetic code the enzyme urease was in fact a protein.The rst protein to
can include selenocysteineand in certain archaea be sequenced was insulin, by Frederick Sanger, who won
pyrrolysine. Shortly after or even during synthesis, the the Nobel Prize for this achievement in 1958. The rst
residues in a protein are often chemically modied by protein structures to be solved were hemoglobin and myo-
post-translational modication, which alters the physical globin, by Max Perutz and Sir John Cowdery Kendrew,
and chemical properties, folding, stability, activity, and respectively, in 1958. The three-dimensional structures
ultimately, the function of the proteins. Sometimes of both proteins were rst determined by X-ray dirac-
proteins have non-peptide groups attached, which can tion analysis; Perutz and Kendrew shared the 1962 Nobel
be called prosthetic groups or cofactors. Proteins can Prize in Chemistry for these discoveries. Proteins may
also work together to achieve a particular function, and be puried from other cellular components using a vari-
they often associate to form stable complexes. One of ety of techniques such as ultracentrifugation, precipita-
the most distinguishing features of polypeptides is their tion, electrophoresis, and chromatography; the advent of
ability to fold into a globular state, or structure. The genetic engineering has made possible a number of meth-
extent to which proteins fold into a dened structure ods to facilitate purication. Methods commonly used to
varies widely. Some proteins fold into a highly rigid study protein structure and function include immunohis-
structure with small uctuations and are therefore tochemistry, site-directed mutagenesis, nuclear magnetic
considered to be single structure. Other proteins undergo resonance and mass spectrometry. Distributed comput-
large rearrangements from one conformation to another.
ing is a relatively new tool researchers are using to exam-
This conformational change is often associated with a ine the infamously complex interactions that govern pro-
signaling event. Thus, the structure of a protein serves as
tein folding; the statistical analysis techniques employed
a medium through which to regulate either the function to calculate a proteins probable tertiary structure from
of a protein or activity of an enzyme. Not all proteins
its amino acid sequence (primary structure) are well-
requiring a folding process in order to function, as some suited for the distributed computing environment, which
function in an unfolded state[1] .
has made this otherwise prohibitively expensive and time
Like other biological macromolecules such as polysac- consuming problem signicantly more manageable[2] .
charides and nucleic acids, proteins are essential parts
of organisms and participate in virtually every process
within cells. Many proteins are enzymes that catalyze bio-
chemical reactions and are vital to metabolism. Proteins 1 Amino acids
also have structural or mechanical functions, such as actin
and myosin in muscle and the proteins in the cytoskele- There are 22 standard amino acids, but only 21 are
ton, which form a system of scaolding that maintains found in eukaryotes. Of the 22, 20 are directly encoded
1
2 1 AMINO ACIDS
present in all the amino acid types, and a side chain that
is unique to each type of residue. An exception from this
rule is proline, where the hydrogen atom is replaced by
a bond to the side chain. Because the carbon atom is
bound to four dierent groups it is chiral, however only
one of the isomers occur in biological proteins. Glycine
however, is not chiral since its side chain is a hydro-
gen atom. A simple mnemonic for correct L-form is
CORN": when the C atom is viewed with the H in
front, the residues read CO-R-N in a clockwise direc-
tion.
Isomerism
The standard -amino acids, all but glycine can exist in
A schematic visual model of oxygen-binding process, showing all either of two optical isomers, called L or D amino acids,
four monomers and hemes, and protein chains only as diagra- which are mirror images of each other . While L-amino
matic coils, to facilitate visualization into the molecule. Oxygen acids represent all of the amino acids found in proteins
is not shown in this model, but, for each of the iron atoms, it during translation in the ribosome, D-amino acids are
binds to the iron (red sphere) in the at heme. For example, in found in some proteins produced by enzyme posttrans-
the upper left of the four hemes shown, oxygen binds at the left lational modications after translation and translocation
of the iron atom shown in the upper left of diagram. This causes to the endoplasmic reticulum, as in exotic sea-dwelling
the iron atom to move backward into the heme which holds it (the organisms such as cone snails. They are also abundant
iron moves upward as it binds oxygen, in this illustration), tug-
components of the peptidoglycan cell walls of bacteria,
ging the histidine residue (modeled as a red pentagon on the right
and D-serine may act as a neurotransmitter in the
of the iron) closer, as it does. This, in turn, pulls on the protein
chain holding the histidine. brain. The L and D convention for amino acid con-
guration refers not to the optical activity of the amino
acid itself, but rather to the optical activity of the isomer
of glyceraldehyde from which that amino acid can the-
oretically be synthesized (D-glyceraldehyde is dextroro-
tary; L-glyceraldehyde is levorotary). Alternatively, the
(S) and (R) designators are used to indicate the absolute
stereochemistry. Almost all of the amino acids in pro-
teins are (S) at the carbon, with cysteine being (R) and
glycine non-chiral.Cysteine is unusual since it has a sul-
fur atom at the second position in its side-chain, which
has a larger atomic mass than the groups attached to the
rst carbon which is attached to the -carbon in the other
standard amino acids, thus the (R) instead of (S)[3] .
Zwitterions
The amine and carboxylic acid functional groups found in
amino acids allow it to have amphiprotic properties. At
a certain pH, known as the isoelectric point, an amino
acid has no overall charge since the number of protonated
ammonia groups (positive charges) and deprotonated car-
boxylate groups (negative charges) are equal. The amino
acids all have dierent isoelectric points. The ions pro-
CO-R-N rule
duced at the isoelectric point have both positive and neg-
ative charges and are known as a zwitterion, which comes
from the German word Zwitter meaning hermaphrodite
by the universal genetic code. Humans can synthesize
or hybrid. Amino acids can exist as zwitterions in
11 of these 20 from each other or from other molecules
solids and in polar solutions such as water, but not
of intermediary metabolism. The other 9 must be con-
in the gas phase. Zwitterions have minimal solubility at
sumed in the diet, and so are called essential amino acids;
their isolectric point and an amino acid can be isolated
those are histidine, isoleucine, leucine, lysine, methionine,
by precipitating it from water by adjusting the pH to its
phenylalanine, threonine, tryptophan, and valine. The re-
particular isoelectric point[4] .
maining two, selenocysteine and pyrrolysine, are incor-
porated into proteins by unique synthetic mechanisms. The 20 naturally occurring amino acids have dierent
physical and chemical properties, including their electro-
Each -amino acid consists of a backbone part that is
3
L-Glutamic acid
(Glu / E)
L-Alanine
(Ala / A)
L-Glutamine
(Gln / Q)
L-Arginine
(Arg / R) Glycine
(Gly / G)
L-Asparagine
(Asn / N) L-Histidine
(His / H)
L-Cysteine L-Leucine
(Cys / C) (Leu / L)
4 1 AMINO ACIDS
L-Lysine
(Lys / K)
L-Tryptophan
(Trp / W)
L-Methionine
(Met / M) L-Tyrosine
(Tyr / Y)
L-Valine
L-Phenylalanine (Val / V)
(Phe / F)
L-Selenocysteine
L-Proline (Sec / U)
(Pro / P)
L-Pyrrolysine
(Pyl / O)
2 Structure of protein
Psi-loop motif
Portion of Carboxypeptidase A.
2.3 Tertiary structure of protein merase, and ion channels. Other assemblies referred to
instead as multiprotein complexes also possess quaternary
Tertiary structure is considered to be largely determined structure. Examples include nucleosomes and micro-
by the proteins primary structure - the sequence of amino tubules.
acids of which it is composed. Eorts to predict tertiary Changes in quartary structure can occur through confor-
structure from the primary structure are known generally mational changes within individual subunits or through
as protein structure prediction. However, the environ- reorientation of the subunits relative to each other. It is
ment in which a protein is synthesized and allowed to fold through such changes, which underlie cooperativity and
are signicant determinants of its nal shape and are usu- allostery in multimeric enzymes, that many proteins un-
ally not directly taken into account by current prediction dergo regulation and perform their physiological func-
methods. In globular proteins, tertiary interactions are tion. The above denition follows a classical approach
frequently stabilized by the sequestration of hydrophobic to biochemistry, established at times when the distinc-
amino acid residues in the protein core, from which water tion between a protein and a functional, proteinaceous
is excluded, and by the consequent enrichment of charged unit was dicult to elucidate. More recently, people re-
or hydrophilic residues on the proteins water-exposed fer to protein-protein interaction when discussing quar-
surface. In secreted proteins that do not spend time in the tary structure of proteins and consider all assemblies of
cytoplasm, disulde bonds between cysteine residues help proteins as protein complexes.
to maintain the proteins tertiary structure. A variety of
common and stable tertiary structures appear in a large
number of proteins that are unrelated in both function
and evolution - for example, many proteins are shaped 3 Types of protein
like a TIM barrel, named for the enzyme triosephos-
phateisomerase. Another common structure is a highly 3.1 Conjugated protein
stable dimeric coiled coil structure composed of 2-7 al-
pha helices. The majority of protein structures known to
A conjugated protein is a protein that functions in inter-
date have been solved with the experimental technique
action with other chemical groups attached by covalent
of X-ray crystallography, which typically provides data
bonds or by weak interactions. Many proteins contain
of high resolution but provides no time-dependent in-
only amino acids and no other chemical groups, and they
formation on the proteins conformational exibility. A
are called simple proteins. However, other kind of pro-
second common way of solving protein structures uses
teins yield, on hydrolysis, some other chemical compo-
NMR, which provides somewhat lower-resolution data
nent in addition to amino acids and they are called conju-
in general and is limited to relatively small proteins, but
gated proteins. The nonamino part of a conjugated pro-
can provide time-dependent information about the mo-
tein is usually called its prosthetic group. Most prosthetic
tion of a protein in solution. Dual polarisation inter-
groups are formed from vitamins. Conjugated proteins
ferometry is a time resolved analytical method for de-
are classied on the basis of the chemical nature of their
termining the overall conformation and conformational
prosthetic groups. Some examples of conjugated proteins
changes in surface captured proteins providing comple-
are
mentary information to these high resolution methods.
More is known about the tertiary structural features of
soluble globular proteins than about membrane proteins 3.1.1 Lipoproteins
because the latter class is extremely dicult to study us-
ing these methodshttp://en.wikipedia.org/w/index.php?
A lipoprotein is a biochemical assembly that contains
title=Protein_tertiary_structure&oldid=422486540.
both proteins and lipids water-bound to the proteins.
Many enzymes, transporters, structural proteins, anti-
gens, adhesins and toxins are lipoproteins. Examples
2.4 Quartary structure of proteins include the high density (HDL) and low density (LDL)
lipoproteins which enable fats to be carried in the blood
Several proteins are actually assemblies of more than one stream, the transmembrane proteins of the mitochon-
polypeptide chain, which in the context of the larger drion and the chloroplast, and bacterial lipoproteins.
assemblage are known as protein subunits. In addition
to the tertiary structure of the subunits, multiple-subunit
proteins possess a quartary structure, which is the ar- 3.1.2 Glycoproteins
rangement into which the subunits assemble. Enzymes
composed of subunits with diverse functions are some- Glycoproteins are proteins that contain oligosaccharide
times called holoenzymes, in which some parts may be chains (glycans) covalently attached to polypeptide side-
known as regulatory subunits and the functional core is chains. The carbohydrate is attached to the protein in
known as the catalytic subunit. Examples of proteins a cotranslational or posttranslational modication. This
with quartary structure include hemoglobin, DNA poly- process is known as glycosylation. In proteins that have
3.1 Conjugated protein 11
Cytochromes are, in general, membrane-bound hemo- 3-dimensional structure of bovine rhodopsin. The seven
proteins that contain heme groups and carry out electron transmembrane domains are shown in varying colors. The
transport. They are found either as monomeric proteins chromophore is shown in red.
(e.g., cytochrome c) or as subunits of bigger enzymatic
complexes that catalyze redox reactions. They are found
synthesis, DNA repair, and apoptosis. The spectroscopic
in the mitochondrial inner membrane and endoplasmic
properties of the avin cofactor make it a natural reporter
reticulum of eukaryotes, in the chloroplasts of plants, in
for changes occurring within the active site; this makes
photosynthetic microorganisms, and in bacteria.
avoproteins one of the most-studied enzyme families.
3.1.6 Opsins
3.2 Simple proteins
Opsins are a group of light-sensitive 35-55 kDa
The proteins which upon hydrolysis yield only amino
membrane-bound G protein-coupled receptors of the
acids are known as simple proteins.
retinylidene protein family found in photoreceptor cells
of the retina. Five classical groups of opsins are involved
in vision, mediating the conversion of a photon of light 3.2.1 Albumin
into an electrochemical signal, the rst step in the vi-
sual transduction cascade. Another opsin found in the Albumin (Latin: albus, white) refers generally to any pro-
mammalian retina, melanopsin, is involved in circadian tein that is water soluble, which is moderately soluble in
rhythms and pupillary reex but not in image-forming. concentrated salt solutions, and experiences heat denatu-
ration. They are commonly found in blood plasma, and
are unique to other plasma proteins in that they are not
3.1.7 Flavoproteins glycosylated. Substances containing albumin, such as egg
white, are called albuminoids.
Flavoproteins are proteins that contain a nucleic acid
derivative of riboavin: the avin adenine dinucleotide
(FAD) or avin mononucleotide (FMN). Flavoproteins 3.2.2 Globulin
are involved in a wide array of biological processes, in-
cluding, but by no means limited to, bioluminescence, re- Globulin is one of the three types of serum proteins, the
moval of radicals contributing to oxidative stress, photo- others being albumin and brinogen. Some globulins are
4.2 Protein as cell signalling molecule 13
produced in the liver, while others are made by the im- known to be catalyzed by enzymes. The rate acceleration
mune system. The term globulin encompasses a hetero- conferred by enzymatic catalysis is often enormousas
geneous group of proteins with typical high molecular much as 1017-fold increase in rate over the uncatalyzed
weight, and both solubility and electrophoretic migration reaction in the case of orotate decarboxylase (78 million
rates lower than for albumin. years without the enzyme, 18 milliseconds with the en-
zyme). The molecules bound and acted upon by enzymes
are called substrates. Although enzymes can consist of
3.2.3 Histones hundreds of amino acids, it is usually only a small fraction
of the residues that come in contact with the substrate,
In biology, histones are highly alkaline proteins found in and an even smaller fractionthree to four residues on
all eukaryotic cell nuclei and some archaea, which pack- averagethat are directly involved in catalysis. The re-
age and order the DNA into structural units called nucle- gion of the enzyme that binds the substrate and contains
osomes. They are the chief protein components of chro- the catalytic residues is known as the active site[29] .
matin, acting as spools around which DNA winds, and
play a role in gene regulation.
proteins are specialized to select for only a particular ion; 3D density distribution of electrons in the protein (in
for example, potassium and sodium channels often dis- the crystallized state) and thereby infer the 3D coordi-
criminate for only one of the two ions[30] . nates of all the atoms to be determined to a certain res-
olution. Roughly 9% of the known protein structures
have been obtained by Nuclear Magnetic Resonance tech-
4.3 Other functions niques. The secondary structure composition can be de-
termined via circular dichroism or dual polarisation in-
Structural proteins confer stiness and rigidity to terferometry. Cryo-electron microscopy has recently be-
otherwise-uid biological components. Most structural come a means of determining protein structures to high
proteins are brous proteins; for example, actin and tubu- resolution (less than 5 angstroms or 0.5 nanometer) and
lin are globular and soluble as monomers, but polymerize is anticipated to increase in power as a tool for high res-
to form long, sti bers that comprise the cytoskeleton, olution work in the next decade. This technique is still a
which allows the cell to maintain its shape and size. Col- valuable resource for researchers working with very large
lagen and elastin are critical components of connective protein complexes such as virus coat proteins and amyloid
tissue such as cartilage, and keratin is found in hard or l- bers.
amentous structures such as hair, nails, feathers, hooves,
and some animal shells. Other proteins that serve struc-
tural functions are motor proteins such as myosin, ki-
nesin, and dynein, which are capable of generating me-
chanical forces. These proteins are crucial for cellu-
lar motility of single celled organisms and the sperm
5.1 X-ray crystallography
of many multicellular organisms which reproduce sexu-
ally. They also generate the forces exerted by contracting
muscles[31] . X-ray crystallography of biological molecules took o
with Dorothy Crowfoot Hodgkin, who solved the struc-
tures of cholesterol (1937), vitamin B12 (1945) and
5 Protein structure determination penicillin (1954), for which she was awarded the Nobel
Prize in Chemistry in 1964. In 1969, she succeeded in
solving the structure of insulin, on which she worked for
over thirty years.[32]
X-ray crystallography is a method of determining the ar-
rangement of atoms within a crystal, in which a beam
of X-rays strikes a crystal and diracts into many spe-
cic directions. Crystal structures of proteins (which
are irregular and hundreds of times larger than choles-
terol) began to be solved in the late 1950s, beginning
with the structure of sperm whale myoglobin by Max
Perutz and Sir John Cowdery Kendrew, for which they
were awarded the Nobel Prize in Chemistry in 1962.[33]
Since that success, over 61840 X-ray crystal structures
of proteins, nucleic acids and other biological molecules
have been determined.[34] For comparison, the nearest
competing method in terms of structures analyzed is
nuclear magnetic resonance (NMR) spectroscopy, which
has resolved 8759 chemical structures.[35] Moreover,
crystallography can solve structures of arbitrarily large
molecules, whereas solution-state NMR is restricted to
Ribbon diagram of the structure of myoglobin, showing colored relatively small ones (less than 70 kDa). X-ray crystal-
alpha helices. Such proteins are long, linear molecules with thou- lography is now used routinely by scientists to determine
sands of atoms; yet the relative position of each atom has been how a pharmaceutical drug interacts with its protein tar-
determined with sub-atomic resolution by X-ray crystallography. get and what changes might improve it.[36] However, in-
Since it is dicult to visualize all the atoms at once, the ribbon trinsic membrane proteins remain challenging to crystal-
shows the rough path of the protein polymer from its N-terminus lize because they require detergents or other means to sol-
(blue) to its C-terminus (red). ubilize them in isolation, and such detergents often inter-
fere with crystallization. Such membrane proteins are a
Around 90% of the protein structures available in the large component of the genome and include many pro-
Protein Data Bank have been determined by X-ray crys- teins of great physiological importance, such as ion chan-
tallography. This method allows one to measure the nels and receptors.[37][38]
15
5.2 Nuclear magnetic resonance spec- quences of amino acids. These sequences are linear, in
troscopy or NMR the manner of letters in a written sentence or beads on
a string. In all proteins, it is the variation in the type
Protein nuclear magnetic resonance spectroscopy (usually of amino acids in the protein sequence of amino acids,
abbreviated protein NMR) is a eld of structural biology which determine the proteins chemical properties and
in which NMR spectroscopy is used to obtain information function. This is true of hemoglobin, where the sequence
about the structure and dynamics of proteins. The eld of amino acids may aect crucial functions such as the
was pioneered by Richard R. Ernst and Kurt Wthrich[1], proteins anity for oxygen.
among others. Protein NMR techniques are continu- There is more than one hemoglobin gene. The amino
ally being used and improved in both academia and the acid sequences of the globin proteins in hemoglobins usu-
biotech industry. Structure determination by NMR spec- ally dier between species, although the dierences grow
troscopy usually consists of several following phases, each with the evolutionary distance between species. For ex-
using a separate set of highly specialized techniques. The ample, the most common hemoglobin sequences in hu-
sample is prepared, resonances are assigned, restraints mans and chimpanzees are nearly identical, diering by
are generated and a structure is calculated and validated only one amino acid in both the alpha and the beta globin
protein chains. These dierences grow larger between
less closely related species. Even within a species, dier-
6 Hemoglobin and its structure ent variants of hemoglobin always exist, although one se-
quence is usually a most common one in each species.
Mutations in the genes for the hemoglobin protein in a
The oxygen-carrying protein hemoglobin was discovered species result in hemoglobin variants. Many of these mu-
by Hnefeld in 1840. In 1851, Otto Funke published a se- tant forms of hemoglobin cause no disease. Some of these
ries of articles in which he described growing hemoglobin mutant forms of hemoglobin, however, cause a group
crystals by successively diluting red blood cells with a sol- of hereditary diseases termed the hemoglobinopathies.
vent such as pure water, alcohol or ether, followed by The best known hemoglobinopathy is sickle-cell disease,
slow evaporation of the solvent from the resulting pro- which was the rst human disease whose mechanism was
tein solution. Hemoglobins reversible oxygenation was understood at the molecular level. A (mostly) separate
described a few years later by Felix Hoppe-Seyler. In set of diseases called thalassemias involves underproduc-
1959 Max Perutz determined the molecular structure of tion of normal and sometimes abnormal hemoglobins,
hemoglobin by X-ray crystallography. This work resulted through problems and mutations in globin gene regula-
in his sharing with John Kendrew the 1962 Nobel Prize tion. All these diseases produce anemia[40] .
in Chemistry. The role of hemoglobin in the blood was
elucidated by physiologist Claude Bernard. The name Hemoglobin variants are a part of the normal embryonic
hemoglobin is derived from the words heme and globin, and fetal development, but may also be pathologic mutant
reecting the fact that each subunit of hemoglobin is forms of hemoglobin in a population, caused by varia-
a globular protein with an embedded heme (or haem) tions in genetics. Some well-known hemoglobin variants
group. Each heme group contains one iron atom, that such as sickle-cell anemia are responsible for diseases,
can bind one oxygen molecule through ion-induced dipole and are considered hemoglobinopathies. Other variants
forces. The most common type of hemoglobin in mam- cause no detectable pathology, and are thus considered
mals contains four such subunits. Hemoglobin (also non-pathological variants[41] .
spelled haemoglobin and abbreviated Hb or Hgb) is the In the embryo: Gower 1 (22) Gower 2 (22)
iron-containing oxygen-transport metalloprotein in the (PDB 1A9W) Hemoglobin Portland (22) In the fetus:
red blood cells of all vertebrates[1] (except the sh family Hemoglobin F (22) (PDB 1FDH)
Channichthyidae ) and the tissues of some invertebrates.
In adults: Hemoglobin A (22) (PDB 1BZ0) - The most
Hemoglobin in the blood is what transports oxygen from
common with a normal amount over 95% Hemoglobin
the lungs or gills to the rest of the body (i.e. the tissues)
A2 (22) - chain synthesis begins late in the third
where it releases the oxygen for cell use, and collects car-
trimester and in adults, it has a normal range of 1.5-3.5%
bon dioxide to bring it back to the lungs. In mammals
Hemoglobin F (22) - In adults Hemoglobin F is re-
the protein makes up about 97% of the red blood cells
stricted to a limited population of red cells called F-cells.
dry content, and around 35% of the total content (includ-
However, the level of Hb F can be elevated in persons
ing water). Hemoglobin has an oxygen binding capac-
with sickle-cell disease and beta-thalassemia.
ity of 1.34 ml O2 per gram of hemoglobin, which in-
creases the total blood oxygen capacity seventyfold com- Variant forms that cause disease: Hemoglobin H (4)
pared to dissolved oxygen in blood. The mammalian - A variant form of hemoglobin, formed by a tetramer
hemoglobin molecule can bind (carry) up to four oxygen of chains, which may be present in variants of tha-
molecules[39] . lassemia. Hemoglobin Barts (4) - A variant form of
hemoglobin, formed by a tetramer of chains, which may
Hemoglobin consists mostly of protein (the globin
be present in variants of thalassemia. Hemoglobin S
chains), and these proteins, in turn, are composed of se-
16 6 HEMOGLOBIN AND ITS STRUCTURE
Unaected Unaected
"Carrier" "Carrier"
Father Mother
Modern distribution of malaria
R r R r
anaemia (or anemia; SCA) or drepanocyto-
sisis an autosomal recessive genetic blood dis-
R R R r R r r r
order, with overdominance, characterized by
red blood cells that assume an abnormal, rigid,
sickle shape. Sickling decreases the cells ex-
ibility and results in a risk of various com-
plications. The sickling occurs because of a
mutation in the haemoglobin gene. Life ex-
pectancy is shortened, with studies reporting
an average life expectancy of 42 in males and
48 in females.Sickle-cell anaemia is caused
Unaected Unaected "Carrier" Aected
1 in 4 chance 2 in 4 chance 1 in 4 chance by a point mutation in the -globin chain of
haemoglobin, causing the hydrophilic amino
Sickle-cell disease is inherited in the autosomal recessive pattern. acid glutamic acid to be replaced with the hy-
drophobic amino acid valine at the sixth posi-
Sickle-cell disease (SCD) or sickle-cell tion. The -globin gene is found on the short
17
arm of chromosome 11. The association of (for example, while climbing a mountain) or
two wild-type -globin subunits with two mu- while severely dehydrated. Under normal cir-
tant -globin subunits forms haemoglobin S cumstances, these painful crises occur about
(HbS). Under low-oxygen conditions (being at 0.8 times per year per patient. The sickle-cell
high altitude, for example), the absence of a disease occurs when the seventh amino acid
polar amino acid at position six of the -globin (if the initial methionine is counted), glutamic
chain promotes the non-covalent polymerisa- acid, is replaced by valine to change its struc-
tion (aggregation) of haemoglobin, which dis- ture and function.
torts red blood cells into a sickle shape and de- The gene defect is a known mutation of a
creases their elasticity. single nucleotide ( single-nucleotide polymor-
The loss of red blood cell elasticity is cen- phism - SNP) (A to T) of the -globin gene,
tral to the pathophysiology of sickle-cell dis- which results in glutamic acid being substituted
ease. Normal red blood cells are quite elas- by valine at position 6. Haemoglobin S with
tic, which allows the cells to deform to pass this mutation are referred to as HbS, as op-
through capillaries. In sickle-cell disease, low- posed to the normal adult HbA. The genetic
oxygen tension promotes red blood cell sick- disorder is due to the mutation of a single nu-
ling and repeated episodes of sickling damage cleotide, from a GAG to GTG codon mutation,
the cell membrane and decrease the cells elas- becoming a GUG codon by transcription. This
ticity. These cells fail to return to normal shape is normally a benign mutation, causing no ap-
when normal oxygen tension is restored. As a parent eects on the secondary, tertiary, or
consequence, these rigid blood cells are unable quaternary structure of haemoglobin in condi-
to deform as they pass through narrow capillar- tions of normal oxygen concentration. What it
ies, leading to vessel occlusion and ischaemia. does allow for, under conditions of low oxygen
The actual anaemia of the illness is caused by concentration, is the polymerization of the HbS
haemolysis, the destruction of the red cells in- itself. The deoxy form of haemoglobin ex-
side the spleen, because of their misshape. Al- poses a hydrophobic patch on the protein be-
though the bone marrow attempts to compen- tween the E and F helices. The hydrophobic
sate by creating new red cells, it does not match residues of the valine at position 6 of the beta
the rate of destruction. Healthy red blood cells chain in haemoglobin are able to associate with
typically live 90120 days, but sickle cells only the hydrophobic patch, causing haemoglobin S
survive 1020 days. Normally, humans have molecules to aggregate and form brous pre-
Haemoglobin A, which consists of two alpha cipitates.
and two beta chains, Haemoglobin A2, which The allele responsible for sickle-cell
consists of two alpha and two delta chains and anaemia is autosomal recessive and can be
Haemoglobin F, consisting of two alpha and found on the short arm of chromosome 11. A
two gamma chains in their bodies. Of these, person that receives the defective gene from
Haemoglobin A makes up around 96-97% of both father and mother develops the disease;
the normal haemoglobin in humans. a person that receives one defective and one
Sickle-cell gene mutation probably arose healthy allele remains healthy, but can pass on
spontaneously in dierent geographic areas, as the disease and is known as a carrier. If two
suggested by restriction endonuclease analysis. parents who are carriers have a child, there
These variants are known as Cameroon, Sene- is a 1-in-4 chance of their child developing
gal, Benin, Bantu and Saudi-Asian. Their clin- the disease and a 1-in-2 chance of their
ical importance springs from the fact that some childs being just a carrier. Since the gene is
of them are associated with higher HbF lev- incompletely recessive, carriers can produce
els, e.g., Senegal and Saudi-Asian variants, and a few sickled red blood cells, not enough to
tend to have milder disease.[42] cause symptoms, but enough to give resistance
In people heterozygous for HgbS (carri- to malaria. Because of this, heterozygotes
ers of sickling haemoglobin), the polymeri- have a higher
sation problems are minor, because the nor- tness than either of the homozygotes.
mal allele is able to produce over 50% of This is known as heterozygote advantage. Due
the haemoglobin. In people homozygous for to the adaptive advantage of the heterozygote,
HgbS, the presence of long-chain polymers of the disease is still prevalent, especially among
HbS distort the shape of the red blood cell from people with recent ancestry in malaria-stricken
a smooth doughnut-like shape to ragged and areas, such as Africa, the Mediterranean, India
full of spikes, making it fragile and susceptible and the Middle East.[43] Malaria was histori-
to breaking within capillaries. Carriers have cally endemic to southern Europe, but it was
symptoms only if they are deprived of oxygen declared eradicated in the mid-20th century,
18 6 HEMOGLOBIN AND ITS STRUCTURE
8.2 Images
File:1GZX_Haemoglobin.png Source: https://upload.wikimedia.org/wikipedia/commons/3/3d/1GZX_Haemoglobin.png License: CC-
BY-SA-3.0 Contributors: Transferred from en.wikipedia to Commons. Original artist: Zephyris at English Wikipedia
File:5CPAgood.png Source: https://upload.wikimedia.org/wikipedia/commons/e/ed/5CPAgood.png License: Public domain Contribu-
tors:
Transferred from en.wikipedia to Commons by User:Giac83 using CommonsHelper. Original artist:
Uploaded by Xenonblast at en.wikipedia
File:A-amino-acid.png Source: https://upload.wikimedia.org/wikipedia/commons/f/f2/A-amino-acid.png License: Public domain Con-
tributors: Originally from en.wikipedia; description page is/was here Original artist: The original uploader was Password at English
Wikipedia
File:Amino-CORN.png Source: https://upload.wikimedia.org/wikipedia/commons/0/01/Amino-CORN.png License: Public domain
Contributors: Transferred from en.wikipedia to Commons. Original artist: Password at English Wikipedia
File:Anthrax_toxin_protein_key_motif.svg Source: https://upload.wikimedia.org/wikipedia/commons/4/4f/Anthrax_toxin_protein_
key_motif.svg License: CC-BY-SA-3.0 Contributors: File:Anthrax toxin protein key motif.jpg Original artist:
Original work: w:User:Natelewis
File:Autorecessive.svg Source: https://upload.wikimedia.org/wikipedia/commons/3/3e/Autorecessive.svg License: CC-BY-SA-3.0 Con-
tributors: Own work in Inkscape Original artist: en:User:Cburnett
File:Beta-meander1.png Source: https://upload.wikimedia.org/wikipedia/commons/b/b8/Beta-meander1.png License: Public domain
Contributors: Transferred from en.wikipedia to Commons. Original artist: Xenonblast at English Wikipedia
File:Beta_hairpin.png Source: https://upload.wikimedia.org/wikipedia/commons/b/ba/Beta_hairpin.png License: Public domain Con-
tributors: Graphical representation from the program MolMol, rendered with PovRay Original artist: Isabella Daidone
File:Cytochrome_c.png Source: https://upload.wikimedia.org/wikipedia/commons/0/07/Cytochrome_c.png License: Public domain
Contributors: Own work Original artist: Klaus Homeier
File:Glycine-skeletal.png Source: https://upload.wikimedia.org/wikipedia/commons/f/f1/Glycine-skeletal.png License: Public domain
Contributors: Own work Original artist: Benjah-bmm27
File:Heme_b.svg Source: https://upload.wikimedia.org/wikipedia/commons/b/be/Heme_b.svg License: Public domain Contributors: Own
work Original artist: Yikrazuul
File:Hemoglobin_t-r_state_ani.gif Source: https://upload.wikimedia.org/wikipedia/commons/b/ba/Hemoglobin_t-r_state_ani.gif Li-
cense: CC-BY-SA-3.0 Contributors: Uploaded by Habj Original artist: en:User:BerserkerBen
File:L-alanine-skeletal.png Source: https://upload.wikimedia.org/wikipedia/commons/0/05/L-alanine-skeletal.png License: Public do-
main Contributors: ? Original artist: ?
File:L-arginine-skeletal-(tall).png Source: https://upload.wikimedia.org/wikipedia/commons/c/c6/L-arginine-skeletal-%28tall%29.
png License: Public domain Contributors: No machine-readable source provided. Own work assumed (based on copyright claims). Original
artist: No machine-readable author provided. Benjah-bmm27 assumed (based on copyright claims).
File:L-asparagine-skeletal.png Source: https://upload.wikimedia.org/wikipedia/commons/c/c9/L-asparagine-skeletal.png License:
Public domain Contributors: No machine-readable source provided. Own work assumed (based on copyright claims). Original artist:
No machine-readable author provided. Benjah-bmm27 assumed (based on copyright claims).
File:L-aspartic-acid-skeletal.png Source: https://upload.wikimedia.org/wikipedia/commons/6/6e/L-aspartic-acid-skeletal.png License:
Public domain Contributors: No machine-readable source provided. Own work assumed (based on copyright claims). Original artist: No
machine-readable author provided. Benjah-bmm27 assumed (based on copyright claims).
File:L-cysteine-skeletal.png Source: https://upload.wikimedia.org/wikipedia/commons/a/a5/L-cysteine-skeletal.png License: Public
domain Contributors: ? Original artist: ?
File:L-glutamic-acid-skeletal.png Source: https://upload.wikimedia.org/wikipedia/commons/a/a8/L-glutamic-acid-skeletal.png Li-
cense: Public domain Contributors: ? Original artist: ?
File:L-glutamine-skeletal.png Source: https://upload.wikimedia.org/wikipedia/commons/9/91/L-glutamine-skeletal.png License: Pub-
lic domain Contributors: No machine-readable source provided. Own work assumed (based on copyright claims). Original artist: No
machine-readable author provided. Benjah-bmm27 assumed (based on copyright claims).
File:L-histidine-skeletal.png Source: https://upload.wikimedia.org/wikipedia/commons/6/64/L-histidine-skeletal.png License: Public
domain Contributors: ? Original artist: ?
File:L-isoleucine-skeletal.png Source: https://upload.wikimedia.org/wikipedia/commons/5/50/L-isoleucine-skeletal.png License: Pub-
lic domain Contributors: ? Original artist: ?
File:L-leucine-skeletal.png Source: https://upload.wikimedia.org/wikipedia/commons/2/29/L-leucine-skeletal.png License: Public do-
main Contributors: ? Original artist: ?
22 8 TEXT AND IMAGE SOURCES, CONTRIBUTORS, AND LICENSES