Z.S. Derewenda / Methods 34 (2004) 354–363
355
insulin [5], human leukocyte interferon A [6,7], murineinterferon-
[8], and eglin C [9],With the dawn of recombinant methods, modi
W
cationof protein and using protein as another variable in thecrystallization experiments became also possible. Priorto that point, the only way to use the protein as a vari-able was to screen homologues from various species. Theconcept was originally used by Kendrew [10] in his origi-nal selection of sperm whale myoglobin as a target forX-ray analysis. Later on, Campbell et al. [11] reiteratedthe idea and suggested screening of various species as aroutine tool in search for a crystallizable protein homo-logue. The approach has been successfully used in manystudies, and is still often exploited in membrane proteincrystallography, where
W
nding a suitable and crystalliz-able variant of the target molecule is even more of achallenge.In spite of its obvious advantages, homologue screen-ing is cumbersome, and not useful if a speci
W
c structureis needed. In contrast, recombinant methods o
V
er arange of possibilities for modifying the target protein tohelp in the expression, puri
W
cation, and crystallization.As a consequence, most protein crystallography groupshave incorporated molecular biology techniques intotheir standard arsenal of methodologies, and ofteninvest much more time at the bench producing the pro-tein and crystallizing it, than solving the structure oncecrystals are in hand.The aim of this review is to provide the reader with anoverview of the contemporary protein engineeringapproaches used in a protein crystallography laboratoryto facilitate the preparation of high-quality crystals. Thesubject is broad, and a truly comprehensive review is notpossible within the limitations of this chapter. Thus, wehighlight the selected important aspects and provide rep-resentative examples of case applications for globularproteins. Particular attention is given to the advances incrystallization enhancement via surface mutagenesis. Wedo not discuss protein engineering of membrane proteins,as the options there are very limited due to the much lessadvanced stage of heterologous overexpression.
2. Use of recombinant proteins in crystallization
The advantages of using recombinant proteins forcrystallization typically include, among others, a dra-matic increase in the yield of protein, ease of puri
W
cationusing a
Y
nity tags and fusion proteins, possibility of pro-tein modi
W
cation or use of isolated domains, and ease of labeling with SeMet for phasing purposes. A survey of crystallographic studies published last year in high-impact journals (excluding membrane proteins) indi-cates that over 90% of crystal structures are based onrecombinant material (Bielnicki, J., Janda, I., Derew-enda, Z.S., unpublished).Bacteria, and in particular
E. coli
, are preferred asprotein producers for biophysical studies [12]. It is thebest characterized host with a diversity of strains for spe-ci
W
c applications, such as expression of proteins withnon-optimal codon bias, protease-de
W
cient strains,strains containing chaperones for enhanced folding, etc[13]. Other advantages include ease of handling com-pared to eukaryotic cell cultures, multitude of vectors forexpression of numerous fusion proteins, and overall lowcost. However, in spite of these advantages, not all pro-teins of interest will be expressed with an adequate yield,in a properly folded soluble form, or at all.A simple method to increase the yield of protein pro-duction is to increase the culture density from the typicalOD of 0.6–2.0 or higher. Very high density can beachieved using disposable 2-L plastic bottles, which getwell aerated in shakers and which also minimize theoverall e
V
ort by disposing with sterilization [14]. Themedia composition may also be altered and both mini-mal media and rich media have been shown to be advan-tageous in speci
W
c cases. To further increase the yield forthose proteins for which expression levels are particu-larly low, it is necessary to resort to fermentors, althoughthe optimization of large-scale expression is often a sig-ni
W
cant e
V
ort in itself [15].Protein aggregation and inclusion body formation areoften a problem [16]. To monitor the solubility of theexpressed protein, one can use a fusion with a down-stream located GFP (green
X
uorescent protein). Underthose circumstances, increase in
X
uorescence indicatesproper folding of the target protein and can be used as adiagnostic [17]. If the target protein expresses well, butgoes into inclusion bodies, the di
Y
culties can often beresolved using a number of alternative strategies. Thesimplest remedy is to lower the IPTG inducer concentra-tion to
»
0.2mM and reduce the post-induction tempera-ture [18], so that the translation machinery slows downallowing the polypeptide chain enough time to fold. Forexample, reduction of the temperature from the typical30–37°C to 25°C resulted in a 2-fold increase in the sol-uble fraction of apoglobin in
E. coli
[19]. A similar resultwas obtained for the phage K11 RNA polymerase,which required a temperature of 25°C for proper foldingand solubility [20]. In the case of the hepatitis C virusNS3 serine protease, maximum solubility was achievedwhen the temperature was dropped to 15°C [21]. In ourlaboratory, we routinely bring the temperature to 10°Cwith very good results (Derewenda, U., unpublished).If the temperature does not help, it may be necessaryto use an
E. coli
strain co-expressing molecular chaper-ones belonging to the DnaK–DnaJ–GrpE and GroEL– GroES systems [22]. The 50kDa
E. coli
trigger factor, apeptidyl-prolyl
cis/trans
isomerase, can also signi
W
cantlyhelp to prevent protein aggregation, with or without theassistance of chaperones [23]. Finally, it should not beforgotten that some carrier proteins used in fusion