Professional Documents
Culture Documents
GLYCOPROTEINS: AN OVERVIEW
M SHYLAJA and H S SESHADRI*
D e p a r t m e n t of Nutrition and F o o d Safety
Central F o o d Technological R e s e a r c h Institute
* D e p a r t m e n t of Biochemistry
Manasagangothri
University of M y s o r e
M y s o r e 570 006, India
Introduction Glycoproteins are proteins having covalently bound carbohydrate. They are found in all
living organisms, in both soluble and insoluble forms with diverse functions and
properties. 1-3 Indeed, there are more proteins that contain covalently bound carbo-
hydrate in their molecule than are devoid of carbohydrate. The carbohydrate content of
glycoproteins ranges from less than 1% to over 80% of the molecule. These carbohydrate
units are involved in various biological activities. The history of glycoproteins goes back to
as early as 1805, when Bostock first characterized the mucus substance of the animal body
as chemically distinct from what we now recognize as protein, although biochemists have
shown interest only during the last two decades.
The glycoproteins can broadly be classified into three types depending on the nature
and function of their carbohydrate units:
'Typical glycoproteins', which includes several glycoproteins of varied carbohydrate
content as indicated in Table 1. The carbohydrate takes the form of oligosaccharide
chain(s) which are branched and irregular, consisting of neutral, basic and amphoteric
monosaccharides.
Glycosaminoglycans, a group of compounds often classified as glycoproteins but differing
from typical glycoproteins in that they contain very long polysaccharide chains which are
linear and fairly regular, possessing alternating monosaccharide sequences, which
generally involve acid and basic monosaccharides. Glycosaminoglycans usually contain
uronic acids and sulfate groups.
Collagens which have a unique type of glycosylation (Table 1) and represent one of the
major groups of protein found in the animal kingdom, occurring in multicellular organisms
ranging from sponges to mammals. The presence of carbohydrate in collagens from widely
differing sources suggest that it may play an essential biological role.
The recent literature abounds in reports of glycoproteins with unusual properties. Of
the many examples, human intestinal enzymes, specific for the hydrolysis of the
disaccharides, maltose and sucrose have been shown to be glycoproteins which are
resistant to proteolytic digestion. 4 Another very important material which is responsible
for the sexual agglutination of cells from one type of yeast with those of another type is
also a glycoprotein. 5 Myelin, the white substance forming a sheath around some nerve
fibres is found to be a glycoprotein. 6 This discovery was also of interest with regard to
speculation about the possible role of glycoproteins in memory and in nerve transmission.
B I O C H E M I C A L E D U C A T I O N 17(4) 1989
171
Toxins: I
Snake venom toxins poison gland 14-20%
Toad toxins Defence mechanism poison gland
Hormones:
Chorionic gonadotropin Hormonal control Human urine, serum, 30%
FSH, TSH and LH pituitary 20%
Lectins:
Potato Agglutination potato 50%
Hemagglutinin Agglutination Influenza virus 25%
Sexual agglutination sexual agglutination Hanesula wingei 85%
factor of cells
Membrane glycoproteins:
Glycophorin Receptor for M and N Human erythrocytes 60%
blood groups, and
various lectins;
antigenic receptor for
influenza virus
Carrier glycoprotein:
Avidin vitamin binding Hen's egg 9%
Fibronectin cell adhesion cell surface
Transport glycoproteins:
Ceruloplasmin
Transferrin Transport Human plasma 6-8%
The role of Protein glycosylation is a costly process for the organism in terms of energy and materials
carbohydrates in which presumably indicates that these units have important biological functions, and it
glycoproteins seems likely that carbohydrate in glycoproteins have diverse functions. Different
glycoproteins have different carbohydrates and these in turn may be responsible for varied
functions. The following are some functions of carbohydrates of glycoproteins. ~1
Viscosity and water binding capacity The sialic acid of the sialic acid rich glycoproteins of
saliva, intestinal, trachial or cervical mucus is responsible for the high viscosity and the
functioning of these mucins as lubricants.
The antifreeze activity of antifreeze glycoproteins of antarctic fish depends on the
integrity of the disaccharide units that form hydrogen bonds with water molecules, thus
preventing the growth of ice crystals.
Protein folding, conformation and stabilization of biological membranes Carbohydrates
influence protein folding and conformation. Cell surface glycoproteins possibly act as
structural components to stabilize cell membranes both in archebacteria and eukaryotes.
Carbohydrates are needed not only for folding and acquisition of the correct conformation
of certain proteins, but they also participate in subunit interactions. However, the
presence of carbohydrate is not always essential for the particular function of the
glycoprotein in which it occurs.
The carbohydrate of transmembrane glycoproteins may help to orient and anchor these
molecules in the lipid bilayer. Frequently carbohydrates protect the protein against
intracellular proteolysis during biosynthesis and transport, and are therefore also required
for membrane insertion and secretion. In some cases, however, secretion occurs in the
absence of glycosylation.
The carbohydrates/oligosaccharides of glycoproteins of different viruses or protozoa,
may mask the protein antigenic sites and help these infectious agents escape destruction by
the immunological system of the host.
Biological recognition Carbohydrates may serve as recognition determinants in glyco-
proteins in solution as well as on cell membranes, 12 where they also exist as glycolipids.
The classical example is the removal of asialoglycoproteins from the circulatory system
and their uptake by galactose-specific, GlcNAC-specific or fucose-specific receptors in the
membranes of liver cells.
The life-times of proteins in the circulatory system and their ultimate fates may be
controlled by attaching covalently linked carbohydrates to proteins or by modification of
the sugars in glycoproteins. Such techniques may be useful in enzyme replacement therapy
for treatment of genetic diseases and also for delivering drugs to target organs. In addition
to their ability to serve as recognition markers of various biologically active molecules, cell
surface sugars serve as recognition markers in cell-cell, cell-virus and cell-bacteria
interactions. Cell-surface glycoproteins are the immunological determinant structures of
blood group A, B, H and M/N specificities, act as acceptors for a number of lectins, are
involved in cell adhesion and presumably play a structural role in stabilizing the cell
membrane.
Structure of The carbohydrate of glycoproteins may be present as simple disaccharide units or as fairly
glycoproteins large heteropolysaccharides, which may contain as many as 15-20 monosaccharide
residues. The number of moieties may vary from several hundred as in case of
disaccharides to only a few or even a single one in heteropolysaccharides. 2A3
The carbohydrate units of glycoproteins contain mainly n-galactose, o-mannose, D-
glucose, L-fucose, N-acetyl-D-glucosamine, N-acetyl-D-galactosamine, xylose, rhamnose
and sialic acid but the content and proportion of these sugars varies considerably from one
glycoprotein to another. For example, collagen contains less than 1% of carbohydrate
(glucose/galactose) while rat sublingual glycoproteins contain as much as 81% carbo-
hydrate comprised of D-galactose, D-mannose, L-fucose, N-acetyl-3/-o-hexosamines and
sialic acid. Hence any distinction between different classes of glycoproteins based purely
on their carbohydrate content will tend to be arbitrary.
The carbohydrate peptide linkage in glycoproteins The most characteristic feature of
glycoproteins is the presence of a covalent linkage between carbohydrate and protein. The
glycosylation of amino acids within a polypeptide chain that is destined to be a
glycoprotein is apparently not a random event, because it is known that glycosylation
occurs at specific sites on the protein molecule. There are two major types of glycopeptide
bonds N-glycosidic and O-glycosidic, the difference being the attachment of given sugar to
a specific functional group of an amino acid (Table 2). Some representative structures of
glycoproteins are shown in Fig 1 (see refs 14-16).
Microheterogeneity Microheterogeneity of carbohydrate units of glycoproteins is a well
recognised phenomenon. As a result, it is often very difficult to establish the structure of
the carbohydrate moieties of glycoproteins. The microheterogeneity of the sialic acid
containing carbohydrate side chains of glycoproteins may be caused by variation in the
number of N-acetyl neuraminic acid units present in different chains. Microheterogeneity
might result from (1) the action of glycosyl transferases since there is no template as in the
case of protein biosynthesis, and/or by (2) subsequent degradation of glycoproteins by
glycosidases to different extents.
CH2OH
I N-glycosidic HI,oH H~H NH --C =0 Alkali Bovine and
/9- N- acetylg I uco- H I stable Ovi ne
CH2 luteinizing
saminyt asporagine H j hormones
(GLcNAc-Asn) H NH H 2 N - C - H
I
C=O COOH
I
CH3
Sub types CHzOH
(I) High mannose 1 H ALkali Human thyro-
H?HH~HZ -- =0 stable, globulin unit A
(2) Complex type t
o ~ i "2 ALkali Human Ig G,
Ha ] [ ", stable erythrocyte
H NH HzN-C-H membrane
I I gLycoprotein
C= 0 C0 OH
(3) Hybrid type I ALkali gaLactose-
stable containing
CH3 ovalbumin
CHz0H
rr O-Glycosiaic
Ho -%H
,;~__/~_ o_?.~
(I) N-acetylqaLacto- ALkali Bovine,ovine
saminyl serine/ i J H2N - C - H labile porcine and
threonine H NH
I
I
CO OH
human submaxillary
(Ga LNAC- ser/thr) c =o mucin s e c r e t i o n s
i
CH3
H
(2) XyLosyl-serine HVN
. , / j ~ - - O ~ o _ C H 2• Alkali Chondroitin
labile sulphate
(xyl-ser) HO ~ / / ] H H2N-;- O heparins
0 H d e r m a t a n sulfate
H OH
INHz
CHzOH CH z
(:5) GalactosyL- Hn/~ O Alkali Collagens anal
hydroxylysine Vl/'#l H/~- 0 - C--H stable Bovine lens
(Gat-Hyl) H~OH ~z i H2 capsule
H OH CH2
I
H2N--C--H
H COOH
(4) ArabinosyL- Ha O\ H H Alkali Plant celt wall
Hydroxy praline ?/H
~,OH .
H?-- O u ~{ , Labile gtycoprotein
N-GLycosidic
HIGH MANNOSE TYPE COMPLEX TYPE HYBRID TYPE
Man Man Man NANA NANA GaL
I BI,4
GLcNAc
Man Man
01,35x~, //~,01,6 GLcNAc
IB1,4
GLcNAc
,81,4
Ash Asn
GLcNAc
Thyrogtobutin unit A # OvaLbumin
(GaLactose containi ng)
Ash
Human lgG
GoL
~1,4
Biosynthesis In recent years, there have been considerable advances in our understanding of
glycoprotein biosynthesis. 17-19 It follows different pathways depending on the type of
linkage being produced. In both N-glycosidic and O-glycosidic types, the protein moieties
are initially synthesized in the rough endoplasmic reticulum (RER). However, the
addition of carbohydrates is a highly segregated process that occurs within an intracellular
membrane system composed of the endoplasmic reticulum, transfer vesicles, the Golgi
complex and secreting vesicles. During their translation, the glycoproteins are isolated
from the cytoplasm and after processing, travel to the cell surface (in the case of
membrane glycoproteins), or to the extracellular environment (in the case of secretory
glycoproteins) or to the lysosomes (in the case of lysosomal enzymes), as part of, or
within, these membrane compartments 11 (Fig 2). Sugar nucleotides such as CMP-sialic
acid, UDP-glucose, UDP-galactose, GDP-fucose, UDP-N-acetyl hexosamines and GDP-
mannose, or respectively, dolichol oligosaccharides, act as donors of the sugar resi-
due, 2°'2] the addition being catalysed by a group of specific enzymes called glycosyl trans-
ferases. 22,23
Sugar residues are added to a peptide moiety in two ways:
(1) Direct addition sugars from sugar nucleotides are added directly in the synthesis of O-
linked glycoproteins and in some stages of N-linked glycoprotein synthesis.
(2) Lipid mediated addition sugar nucleotides act as donors to form dolichol derivatives.
EndopLosmic "~
K•ey : ~ Gtycoprot.ein recticutum GoLgi \
Lysosomes J
Initially, sugars are added to dolichol phosphate (p-Dol) in a sequential manner to form an
oligosaccharide which is then transferred en bloc to the peptide moiety. Subsequently,
some sugars may be deleted.
There are two distinct pathways for biosynthesis of N-glycosidic and O-glycosidic
glycoproteins.
KEY
P-DoL DoLichot Phosphate
• . A^ ~ J~'-P-P-DoL
• Gtchl L, UDP.e~"- ~GDP-O
o Man / ~ "n'r ",~
• GLc =-P-P-Dot o. •
/ o . ~ d ~ ~P-P-o°t
0
~ : : / ' -=--- Polypept ide --~ tp"
Figure 3 Dolichol phosphate cycle. Synthesis of oligosaccharide precursor (adapted from Sharon
and Lis24)
obligatory for them to be added to the oligosaccharide-lipid, because they act as (1) a
terminating signal for the synthesis of lipid-linked precursor, and (2) ensures efficient
glycosylation.
Transfer of oligosaccharide from OSL to the peptide moiety Once the lipid linked
oligosaccharide precursor is synthesized on the RER, it is transferred to the nascent
polypeptide, by the action of membrane bound, dolichol diphosphoryl oligosaccharide:
polypeptide oligosaccharyl transferase. Transfer may only occur to asparagine when found
in the sequence ASn-X-Thr(Ser), where 'X' may be almost any other amino acid.
However, asparagine in the sequence is not always glycosylated. The efficient glycosyl-
ation of proteins is dependent on there being: (1) a sufficient pool of completely
assembled and glucosylated lipid-linked oligosaccharide donors, (2) adequate activity of
oligosaccharyl transferase, 25 and (3) a properly oriented and accessible Ash-X-Set (Thr)
sequence in the acceptor polypeptide.
Processing of glycoprotein oligosaccharides After the transfer of the lipid-linked
precursor to the polypeptide moiety, the three glucose residues are cleaved by
glucosidases 26'27 yielding a structure which is identical to high mannose type oligo-
saccharides, as found in calf thyroglobulin and myeloma IgM. As shown in Fig 4 in case of
the hybrid type, 2s'29 GlcNAc is added and is in fact the signal for the formation of hybrid
type, because, GlcNAc prevents the removal of some of the mannose residues. In the case
of the complex type, mannose residues are completely removed by mannosidases, and
GlcNAc, Fuc, Gal, and NANA residues are added to the chain yielding a structure of the
complex type of N-glycosidic glycoproteins. 3° It is not understood how the same precursor
protein is transformed to three different types of oligosaccharides in glycoproteins. The
site of processing 17 is indicated in Fig 5.
Biosynthesis of The synthesis of O-linked oligosaccharides occur almost entirely by sequential glycosyl-
O-glycosidic ation, where the product of one glycosyl transferase forms the acceptor for another
glycoproteins glycosyl transferase. It is a 'post translational' process, because, glycosylation is initiated
in the Golgi apparatus with the transfer of N-acetyl galactosamine from UDP-GaINAc to
the hydroxyl group of a threonine or serine in the polypeptide by the enzyme, UDP
GalNAc: polypeptide transferase, after protein translation in RER.
In case of branched O-glycosidic oligosaccharides such as blood group substances, there
is a definite sequence during the addition of sugar residues by glycosyl transferases in a
direct fashion, utilizing nucleotide sugars as donors.
r ' 2 3
p ~ ca
• GI.c
a GLcNAc /
• Fuc
•
• NeuAc ~ 7
Figure 4 Processing and glycosylation of N-linked glycoproteins (adapted from Kornfeld et al2s and
Reitman et a129with slight modification )
CYTO
"~--mRNA ( ~
Figure 5 Diagram outlining some of the steps in the assembly and processing of N and O-linked
oligosaccharide chains of the glycoproteins (adapted from Hanover and Lennarz17:
copyright Academic Press)
The Future Many fascinating aspects of glycoprotein structure and function remain to be studied. 33
Some which are receiving considerable attention 34 are:
(1) Is microheterogeneity an expression of the fidelity of the mechanism of glycoprotein
synthesis or does it have a biological significance? (2) Are all glycoproteins synthesized in
the same part of the cell? (3) Why are N-linked carbohydrate units and not O-linked ones,
synthesized via lipid intermediates? (4) What determines the site of attachment of protein
of N-linked and O-linked carbohydrate units, especially the polysaccharide chains of the
proteoglycans? (5) Does the polypeptide backbone control the pattern of N-glycosylation
and if so how? (6) Why is glycosylation sometimes, but not always, required for insertion
of proteins into the plasma membrane and for secretion? (7) How are glycoproteins
inserted into the plasma-membrane and how is the distinction made between membrane
bound and secreted proteins?