Professional Documents
Culture Documents
"
Abstract
Currently there is a lack of experimental systems for defining the functional domains of the
fibrillar collagens. Here we describe an experimental strategy that employs the polymerase
chain reaction (PCR) to create a series of cDNA cassettes coding for seven separate domains
of procollagen II. The system was used to prepare novel recombinant procollagens I! from
which one of the four repetitive D-periods of the triple helix was deleted. Four constructs,
each lacking a different D-period, were expressed in stably transfected mammalian cells
(HT-1080). Truncated procollagens of the predicted size were recovered from the medium.
All were triple-helical as assayed by circular dichroism. Therefore, deletion of a complete
D-period containing 234 amino acids does not destabilize the triple helix of homotrimeric
collagen II as much as some naturally occurring mutations in the heterotrimeric monomer of
collagen I that delete shorter sequences or that convert obligate glycine residues to residues
with bulkier side chains. Moreover, the results suggest that the strategy developed here can
be used to map in detail the binding sites on fibrillar collagens for other components of the
extracellular matrix and for the binding, spreading and signaling of cells.
Key words: cDNA cassettes, recombinant type II procollagen
Introduction
Collagens are a major family of extracellular proteins
that have a variety of functions and that are character1 Current address: Center for Gene Therapy, Allegheny University of the Health Sciences, 245 North 15 Street, M.S. 421,
Philadelphia, PA 19102
Matrix Biology Vol. 16/1997, pp. 105-116
1997 by Gustav FischerVerlag
106
blocks" (Privilov, 1982) of the triple helix begin to undergo micro-unfolding at about 30 to 37 C and well before the molecule fully unfolds at about 41 C. Evidence
for the micro-unfolding model includes the effects of
partially denaturing and then renaturing the protein
(Kiihn et al, 1966; Rhy~inen et al., 1983), comparisons
of the helix-forming properties of synthetic peptides
with repetitive -Gly-Xxx-Yyy- sequences (Sakakibara et
al., 1973; Prockop et al., 1976; Engel et al., 1977; Inouye et al., 1982; D61z and Heidemann, 1986; Roth and
Heidemann, 1986; Germann and Heidemann, 1988),
measurements of enthalpy changes of thermal denaturation by microcalorimetry (Privalov, 1982) and the effects
of temperature on the kinetics of fibril formation
(Kadler et al., 1988). The results demonstrated that sequences in which the Xxx positions are occupied by proline, and especially sequences in which the Yyy positions
are occupied by hydroxyproline, form the most stable
triple-helical regions. Regions lacking the two imino
Signal Peptide
N-Propeptide
Triple
N-telopeptide
Helix
C-telopeptide
C-Propeptide
N-Proteinase
Cleavage Site
Nt
C-Proteinase
Cleavage Site
D1
I
137 aa
D2
I
234 aa
D3
I
234 aa
D4
I
234 aa
D0.4
Ct
I I
234 aa
78 aa
I
273 aa
Figure 1. Schematic drawing of Procollagen II. The subdivisions of the protein indicated below the molecule were used in the design of the procollagen II DNA cassette system. Symbols: aa, amino acids.
5'
107
mutated before assembly. We have assembled four constructs that each lack a specific D-period and have
shown that we can express them as truncated variants of
procollagen II that are secreted as correctly folded recombinant proteins in a mammalian cell system. The resuits indicate that deletion of a complete D-period of
234 amino acids has surprisingly little effect on the thermal stability of collagen II.
666 bbb
3'
ccc
cc
PCR
S'
3'
LIGATE
cassette
insert
Figure 2. Creation of a DNA cassette. PCR primers containing engineered blunt-cutting restriction sites B and C were used to
PCR-subclone a region of a cDNA template. The restriction site A was a part of the multiple cloning region of the plasmid vector.
The A nucleotide shown at the 3'-ends of the PCR product were added by Taq polymerase and provided nucleotide overhangs used
in the TA cloning system (Invitrogen).
108
B ,
5
.Reaion 1
-
,C
B ,
5
cleave at sites /
A and C
/ cleave at sites
A and B
3, C
B,
leaion 2
Figure 3. DNA construct assembly using the DNA cassette system. The assembly of a construct containing two cassette inserts is
demonstrated. The A, B and C restriction sites are the same in the three plasmids shown. The nucleotide sequences of some sites
are capitalized so that they may be differentiated from the same sites in different plasmids.
of S.-W. Li). The PCR primers were engineered to introduce blunt-cutting restriction sites B and C (Fig. 2) at the
ends of the amplified region. These restriction sites (Fig.
2 and 3) were designed to meet the following criteria: (a)
cleavage at B or C maintained the wild-type amino acids
encoded by the c D N A ; (b) restriction site B did not
Synthesis of D N A cassettes
Seven DNA cassettes were created to code for seven
separate regions of the procxl(II) chain of type II procollagen (Fig. 1). Because the 5" end of the COL2A1 cDNA
was difficult to isolate (Baldwin et al., 1989), nucleotides 23 to 126 (nucleotide 23 is the first of the start
codon) in the construct were from exon 1 of human
C O L I A 1 cDNA (Tromp et al., 1988). The remainder of
the cDNA, nucleotides 127 to 4737, were derived from
exons 3 to 52 of human COL2A1 gene (Baldwin et al.,
1989; Ala-Kokko and Prockop, 1990; Fertala et al.,
1994). Therefore, the construct included the signal peptide and signal peptide cleavage site from the COL1A1
gene. However, it did not include either exon 2 from the
109
Table I. PCR Primers used for subcloning the pro~l(II) cDNA in the creation of the proctl(II) DNA cassette system.
Primer
Name a
Primer Sequenceb
Engineered
Restriction Sitec
N,+
N~DI+
D1D2+
D2D3+
D3D4+
D4-
GTCTACATGTCTAGGGTCTAGACATG'YI'CAG
CAG/CTGCATTACTCCCAACTGGGCGCCACCA
GCCC/GGGCCAATGGGCCCCATGGGACCTCG
GAG/CGGGAAGCCAGGAGCACCAGCAATGCC
GCCC/GGGCCACGGGGTCCTCCTGGCCCTCA
CAG/CGGAAGTCCCTGGAACCCAGATGGCCC
GCCC/GGGGCTCCTGGTCCCCCAGGTGAAGGT
CG/CGCAGCTCCAGGGAATCCAGTGGCTCCCG
GCCC/GGGCGTGTTGGACCCCCAGGCTCCAATGd
CG/CGTGAAGCCACGGTGTCCCTI'CAGGCCTCT
GCCC/GGGCTGCAGGGTCTGCCCGGCCCTCCTGGTC
GAG/CGGGGGACCTGGAGGACCAGGGGGCCCAGGAT
GCCC/GGGCCTGGCATCGACATGTCCGCCT
TCTGGCCTGGGCTGGGGGCAGTCACTCAG
NONE
PvulI (C)
Sr[I (B)
BsrBI (C)
Srfl (B)
BsrBI (C)
Srfl (B)
BstUI (C)
Srfl (B)
BstUI (C)
SrfI (B)
BsrBI (C)
SrfI (B)
NONE
D0.4+
D0.4-
Ct+
Ct-
a The primer name denotes the region of the procxl(II) cDNA that this primer was used to subclone by PCR. A "+" or "-" in the
primer name denotes whether the primer primes the polymerization of the sense or the antisense strand, respectively.
b All primers are written from the 5' to 3' end. Boldface nucleotides indicate where changes were made from the original pro0tl(II)
cDNA. These changes were engineered to introduce the indicated restriction sites, which are underlined. The slash in each primer
sequence indicates where the corresponding restriction enzyme cleaves the engineered restriction site.
' The (B) or (C) following the name of the engineered restriction site indicates whether that site is used as a B or a C restriction site
in the corresponding DNA cassette.
The additional change (C to T) introduced here was done to destroy a native BstUI site which occurs in this position in the original pro00(II) cDNA. Without this change the BstUI site engineered at the 3' end of the D 4 cassette would not be unique for this
coding region. Note that this C to T change does not alter the codon at this position.
110
(')
H jI
IL
Hindll,/~
~ Sphl
Spel
BsrBI
kan
~[
amp
I//
(b)
Sphl
//
II
I .mp
Hindlll
/I"
(c)
EcoRV
Pvul
I
Hindlll
neo
amp
eDNA constructs
Five DNA constructs (Table II) were assembled from
the cassettes in the order indicated in Table Ill, using the
bacterial strain DH5(x (Life Technologies, Inc.) as host.
In each assembly step, the fidelity of the DNA sequence
at the junctions was verified by DNA sequencing. Functional constructs were removed intact by digestion with
HindIII and BsrBI, and cloned into the corresponding
HindIII and EcoRV sites of the mammalian expression
vector pcDNA3 (Invitrogen) containing the CMV promoter and a neomycin resistance (Fig. 4c).
Cell transfections
HT-1080 cells (American Type Culture Collection
CCL 121) were cultured in DMEM supplemented with
10% (v/v) fetal calf serum. The cells were transfected
(Fertala et al., 1994) with one of the DNA constructs by
calcium phosphate precipitation using a commercial kit
(Profection Mammalian Transfection System Kit;
Promega), according to the manufacturer's instructions.
Briefly, cells were split 18 h prior to transfection, grown
to a density of approximately 106 cells in a 10 mm cell
111
Resutts
General features of the cassette system
In designing cassettes for the pro~l(II) chain, the 8rfl
(5'-GCCC/GGGC-3') site was used as the B site (Fig. 2)
for all the cassettes because Srlq sites are rare in plasmid
vectors, do not occur in protxl(II) cDNA, and cleavage
of the site leaves a complete codon for glycine (GGN) at
the 5" end of the cassette. Accordingly, a Srfl site can be
used to define the 5' end of a D-period or build any cassette beginning with a complete glycine codon found
anywhere within a D-period of the pro~l(II) chain. To
accommodate the variability in the -Yyy- positions that
end each D-period, the restriction sequences introduced
into the C site were BsrBI (GAG/CGG) for proline
(CCN in the 3' to 5' orientation), BstUI (CG/CG) for
alanine (GCN), and Eco47III (AGC/GCT) for serine
(AGN).
The Nt and Ct regions containing the propeptides and
the telopeptides cap the ends of the DNA constructs,
and therefore restriction sites were engineered into one
end only. The SpeI site was used as the A site (Fig. 2).
As designed, the pro00(II) DNA cassette system can
be used to assemble DNA constructs encoding novel
procollagens II in which individual D-periods of the
triple helix can be deleted, rearranged or duplicated,
since any D-period cassette from Plasmid I can be added
Description
Composition
FL
-D1
-D2
full-length
missing D1
missing D2
N,-D 1-D2-D3-D4-Do.4-Ct
Nt-D2-D3-D4-D0.4-C,
-D 3
missing D 3
NcD1-D2-D4-Do.4-Ct
-D4
missing D4
Nt-DI-D2-D3-Do.4-C
t
Nt-D1-D3-D4-Do.a-Ct
112
Table III. Assembly of functional proal(II) DNA constructs using the DNA cassette system.
Functional Construct
Nt-D1-D2-D3-D4-D0.4-C t
(full length)
N,-D2-D3-D4-Do.4-C t
(missing D,)
Nt-D1-D3-D4-Do.4-C t
( m i s s i n g D2)
Nt-D1-D2-D4-D0 4-C ~
(missing D3)
Nt-Dj-D2-D3-D0.4-C ~
( m i s s i n g D4)
The (p) or (r) following each cassette in the ligation reactions indicates whether that cassette was used as a p- or r-cassette, respectively, in the given cloning procedure. The proal(II) insert in each p-cassette, with one exception noted in b, was released intact by
digestion with SpeI and BsrBI (in the case of cassette inserts ending with D1, D2 or D0.4); or Spel and BstUI (in the case of cassette
inserts ending D 3 or D4}. The r-cassette was opened, with exception described in b, by digestion with SpeI and Srfl. The number in
parentheses preceding each ligation reaction describes the temporal order in which these reactions were performed. Note that
many reactions were performed concomitantly. The constructs appearing as a result of final ligation reactions were the functional
constructs which were transferred to a mammalian expression vector and used in subsequent transfection procedures.
in any desired order to the 5' end of a Ct cassette in Plasmid II (Fig. 3). To reduce the number of cloning steps,
combined cassettes of more than one D-period were
used as inserts (Table III). The final D N A construct was
capped by the addition of the Nt cassette to the 5' end of
an r-cassette.
113
A
+
Q-
type
m
r~
-D1 pro
a
I
a
I
LL
I
pro al (I)
pro c 2(I)
Figure 5. Screening of G418-resistant clones by Western blotting. Panel A: Western blot demonstrating a positive signal in
one of four clones transfected with a recombinant procollagen
II missing the Dl-period. The band below the recombinant
protein is a partially processed procd(II) chain lacking a Dl-period. Panel B: A corresponding phosphor storage image of the
Western blot demonstrating ~4C-radiolabeled proteins. Symbols: FN, fibronectin; Type IV, collagen IV; "+", lane containing a positive clone.
114
Discussion
The DNA cassette system described here together with
the previously described expression system for recombinant procollagens (Fertala et al., 1994) can be used to
prepare novel fibrillar procollagens. The fidelity of the
over-all system in preparing these novel proteins was
verified by partial DNA sequencing, by amino acid analysis of the proteins, and by conformational analysis by
CD. The D-period deleted procollagens II were folded
into the triple helical structure as shown by the CD spectrum. The recombinant molecules are therefore useful to
determine the contribution of different regions to thermal stability (Engel, 1987; D61z and Heidemann, 1988;
Kadler et al., 1988; Morris et al., 1990; Westerhausen et
al., 1990; B/ichinger and Davis, 1991; B/ichinger et al.,
1993; Fertala et al., 1993), folding, fibril formation
(Prockop and Hulmes, 1994) and specific interactions
with other molecules.
The flexibility of the DNA cassette system allows for
the creation of a number of informative procollagen II
DNA constructs. It should be noted that the D-period
division of the triple-helical region of proixl(II) was a
somewhat arbitrary division of the functional domains
of the protein (Hulmes et al., 1973; Chapman, 1984).
Smaller or larger cassettes can readily be synthesized.
The nucleotide sequence coding for the human pro0cl(II)
triple-helical region contains 151 separate locations
where a Srfl site may be engineered as the 5' end of a
cassette. The 3' end of a cassette can be terminated in
any codon for a proline, alanine or serine. Similar cassettes can also be made for the propeptides. Furthermore, complex constructs can now be assembled from
the intermediate cassette constructs already available
(Table III). Also, the DNA cassette system can easily be
adapted to other procollagen cDNAs and even to
cDNAs for other large proteins. The crux of the strategy
A c D N A Cassette System
data on specific binding sites are not available, primarily
because site-directed mutagenesis of the whole
m o n o m e r s is technically difficult, and most short peptides do not fold into the triple-helical conformation
that is probably essential for specific binding interactions. The strategy of D-period cassettes should make it
possible to overcome these problems.
AcknowLedgements
This work was supported in part by National Institutes of
Health Grant AR-39740, a grant from the Lucille P. Markey
Charitable Trust, and a grant from the Shriners Hospital.
References
Ala-Kokko, L. and Prockop, D.J.: Completion of the intronexon structure of the gene for human type II procollagen
(COL2A1). Variations in the nucleotide sequences of the alleles from three chromosomes. Genomics 8" 454-460, 1990.
Ala-Kokko, L., Hyland, J., Smith, C., Kivirikko, K.L, Jimenez,
S.A. and Prockop, D.J.: Expression of a human cartilage procollagen gene (COL2A1) in mouse 3T3 cells. J. Biol. Chem.
266: 14175-14178, 1991.
B/ichinger, H.P. and Davis, J.M.: Sequence specific thermal stability of the collagen triple helix. Int. J. Biol. Macromol. 13:
152-162, 1991.
B/ichinger, H.P., Morris, N.P. and Davis, J.M.: Thermal stability
and folding of the collagen triple helix and the effects of mutations in osteogenesis imperfecta on the triple helix of type I
collagen. Am. J. Med. Genet. 45: 152-162, 1993.
Baldwin, C.T., Reginato, A.M., Smith, C., Jimenez, S.A. and
Prockop, D.J.: Structure of cDNA clones coding for human
type II procollagen. The ~1(II) chain is more similar to the
cxl(I) chain than two other a chains of fibrillar collagens.
Biochem. J. 262: 521-528, 1989.
Chapman, J.A.: Molecular organization in the collagen fibril.
In: Connective Tissue Matrix ed. by Hukins, D.W.L., Verlag
Chemie, Deerfield Beach, FL, 1984, pp. 89-132.
D61z, R. and Heidemann, E.: Influence of different tripeptides
on the stability of the collagen triple helix. I. Analysis of the
collagen sequence and identification of typical tripeptides.
Biopolymers 25: 1069-1080, 1986.
Engel, J., Chen, H.-T., Prockop, D.J. and Klump, H.: The triple
helix to coil conversion of collagen-like polytripeptides in
aqueous and nonaqueous solvents. Comparison of the thermodynamic parameters and the binding of water to
(L-Pro-L-Pro-Gly), and (L-Pro-L-Hyp-Gly)n. Biopolymers 16:
601-622, 1977.
Engel, J.: Folding and unfolding of collagen triple helices. In:
Advances in Meat Research, ed. by Pearson, A.M., Dudson,
T.R. and Bailey, A.J., Van Nostrand Reinhold, 1987, pp.
145-161.
Fertala, A., Sieron, A.L., Ganguly, A., Li, S.-W., Ala-Kokko, L.,
Anumula, K.R. and Prockop, D.J.: Synthesis of recombinant
human procollagen II in a stably transfected tumor cell line
(HT-1080). Biochem. J. 298: 31-37, 1994.
Germann, H.-P. and Heidemann, E.: A synthetic model of collagen: an experimental investigation of the triple-helix stability.
Biopolymers 27: 157-163, 1988.
115
116