
Proteins, Chemistry and Chemical Reactivity of

S. David Tilley, Neel S. Joshi, and Matthew B. Francis

Department of Chemistry, University of California, Berkeley, Berkeley, California
doi: 10.1002/9780470048672.wecb493

Advanced Article
Article Contents
General Reactivity and Stability Considerations
Chemical Reactivity of Amino Acid Residues in Proteins
Modification of the N- and C-Termini
Bioorthogonal Methods for Protein Labeling
Conclusion

Protein bioconjugation is a critically important tool for the elucidation of enzyme mechanisms, the tracking of biomolecules in living systems, the improvement of pharmacokinetic properties, and the construction of new materials. All these applications rely on a continually expanding set of chemical reactions that can modify native protein functionality in aqueous solution under mild pH and temperature conditions. To survey these techniques, this article provides an introduction to the chemical reactivity of the amino acid side chains, with an emphasis on the selectivity that can be achieved using a particular reactive strategy. Site-selective techniques that target the unique reactive properties of N-terminal residues are also reviewed, as are native chemical ligation methods for the modification of the C-terminus. Whenever possible, the mechanistic aspects of the reactions are discussed, as these considerations provide the foundation for future reaction development. The article concludes with a brief description of labeling reactions that selectively target unnatural functional groups in the presence of native protein functionality.

Our understanding of the chemical reactivity of proteins lies at the very heart of chemical biology. In concert, the chemical behavior of the amino acid functional groups provides the basis for enzyme catalysis, and the differences between their reactive properties allow chemoselective protein labeling techniques to be developed. The ability to attach new functionality to a protein of interest allows the rich diversity of biomolecular function to be expanded to the limits of one's imagination, which affords hybrid structures that can enhance our understanding of and interactions with living systems. For example, the biological role of many proteins has been clarified by tracking their cellular localization with attached fluorescent probes (1, 2). In many cases, chemical cross-linking strategies have then facilitated the identification of their binding partners. Radiolabels are commonly introduced to assess protein biodistribution in vivo, and polymers have been attached to improve circulation behavior (3, 4). Surface immobilization offers exceptional promise for the generation of microarrays for proteomic studies, and chemically modified proteins have been used to build new materials that use biological structure to template inorganic crystal growth (5, 6). At the core of these studies is the expanding set of chemical reactions that can modify biomolecules in buffered aqueous solution. These techniques push organic reactivity to the limits of efficiency and chemoselectivity, often with the ability to modify a single functional site among hundreds of spectator groups. This article focuses on the reactive principles that have led to the development of these strategies, with a particular emphasis on the site selectivity that can be achieved for each. It is hoped that this survey can serve as a useful guide for those individuals wishing to modify a protein of interest and that it can facilitate the design of new strategies for modifying proteins with ever-increasing levels of precision and yield.

General Reactivity and Stability Considerations

Proteins have evolved to catalyze a remarkably diverse set of chemical transformations, largely by employing general acid/base chemistry and radical pathways (7). Although this range of reactivity provides the very reason for our interest in the chemical behavior of proteins, it also limits our ability to make general predictions about how specific reagents and conditions will affect a biomolecular target. As long as we remember that each protein has a unique reactivity "personality," however, we can formulate a few guidelines that can serve as logical starting points for protein analysis, purification, and modification protocols.



The amide linkages of proteins are generally stable under both acidic and basic conditions, as are the native amino acid functional groups. However, changes in pH will alter the protonation state of many side chains (Fig. 1), which will result in changes in their reactivity and the overall folding state of the protein. To maintain higher-order structure, most modification techniques are conducted at a pH between 6 and 8.5, although many proteins will remain folded outside this range. The three-dimensional structure of most proteins is stable in aqueous solution between room temperature and 37 °C, and many proteins (such as those isolated from thermophilic organisms) can be heated to significantly higher temperatures before unfolding occurs. Large fractions of organic cosolvents (such as methanol, DMF, and DMSO) are generally disruptive to protein structure, whereas small amounts are often used to solubilize the organic reagents that are used for protein modification. Although it is sometimes possible to refold proteins into their native conformation after denaturation occurs, this refolding can be difficult to achieve in practice because of unwanted aggregation or the absence of chaperones that facilitate the correct folding pathway. Thus, unless a refolding protocol is available or anticipated, it is recommended that proteins be processed under the mildest conditions possible if further biochemical studies are to be pursued.

The side chain functional groups contain a wide range of nucleophiles, including amines, thiolates, carboxylates, phenolates, and electron-rich aromatic rings, but disulfide bonds represent the only appreciably electrophilic functional groups. Therefore, the bulk of protein modification reagents are themselves electrophilic in nature. Amino acid side chains in the active sites of enzymes often show distinct nucleophilicity because of the alteration of their pKa values or other forms of activation.
This nucleophilicity allows these positions to be targeted by less-reactive electrophiles that are ineffective for the modification of ordinary residues. An excellent review of this technique for the activity profiling of enzymes has recently appeared (8).

Many side chain groups are prone to oxidation, including thiolates, thioethers, and indole rings, but proteins are relatively nonreactive toward reducing agents (disulfide bonds again being the one exception). Although several proteins are known to catalyze reactions through radical pathways, proteins themselves are often unreactive toward radical species in solution [such as polymerization initiators (9)]. The lack of alkene functional groups also precludes the direct participation of proteins in most electrocyclization reactions.

A critical consideration for any protein labeling experiment is the level of site specificity that can be achieved. Largely, the inherent selectivity is dictated by the abundance of the targeted amino acid in the protein sequence. As listed in Fig. 1, the commonly modified amino acids occur with average frequencies ranging from 1.4% (tryptophan) to 6.3% (glutamic acid) of the amino acids overall (10). However, significant variations are commonly observed, as tabulated for the soluble cellulase domain in Fig. 1 (11). The hydrophobicity of the amino acids also influences their relative surface accessibility and, thus, their ability to participate in a particular reaction. The best strategy for the modification of a particular protein, therefore, varies on a case-by-case basis, which makes it important to have a range of modification techniques that can target several different amino acids.

These considerations form the basis for the numerous techniques that are now available for the chemical modification of proteins. The sections that follow will examine these techniques and the reactive principles by which they function. A section describing reactions that display orthogonal reactivity to native protein functional groups has also been included because of the growing importance of these reactions as tools to label proteins in complex mixtures. Because it is not practical to summarize all protein bioconjugation methods here, this information instead is intended to serve as an introduction to the concepts that drive the development of these reactions. Several additional reviews and books on protein modification have been listed in the Further Reading section.
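Because the inherent selectivity of a labeling strategy depends on how often the targeted residue occurs, a quick composition check of the sequence is a reasonable first step. The following Python sketch is only an illustration: the function name `residue_summary` and the example sequence are hypothetical, and the reference values are the average abundances listed in Fig. 1. It counts residues in the primary sequence only and says nothing about surface accessibility.

```python
from collections import Counter

# Average abundances (percent of all residues) as listed in Fig. 1.
AVERAGE_ABUNDANCE_PERCENT = {
    "K": 5.9, "C": 1.9, "D": 5.3, "E": 6.3, "Y": 3.2,
    "W": 1.4, "H": 2.3, "M": 2.3, "R": 5.1,
}


def residue_summary(sequence):
    """Return {residue: (count, observed percent, average percent)}.

    Hypothetical helper: it only counts one-letter codes in the sequence
    and does not account for burial or local environment.
    """
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("sequence is empty")
    counts = Counter(seq)
    summary = {}
    for residue, average in AVERAGE_ABUNDANCE_PERCENT.items():
        n = counts.get(residue, 0)
        summary[residue] = (n, 100.0 * n / len(seq), average)
    return summary


if __name__ == "__main__":
    example = "MKCAYWKDEHRKG"  # made-up sequence, for illustration only
    for residue, (n, observed, average) in residue_summary(example).items():
        print(f"{residue}: {n} residues, {observed:.1f}% observed "
              f"vs {average:.1f}% average")
```

A residue that appears only once or twice in such a summary is a candidate for site-selective labeling, provided that it is also solvent accessible.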

Chemical Reactivity of Amino Acid Residues in Proteins

It should be stressed that proteins are more than just the sum of the amino acids that comprise them. Surprises abound when predicting the chemical modification behavior of a particular protein, largely because of the complex interplay between the individual functional groups and the local environments in which they reside. Nevertheless, the first step to understanding the chemical behavior of proteins is to characterize the reactive properties of the individual side chain groups. The following sections describe these aspects, with an emphasis on the chemical techniques that are commonly used to modify these residues for applications in chemical biology.

Chemical reactions of lysine residues

The amino groups of lysine residues are highly nucleophilic in aqueous solution, which makes them the most commonly targeted sites for covalent protein modification. As they must be deprotonated to react, these reactions are typically run at slightly elevated pH (often 8–9). However, many lysine modification reagents undergo competing hydrolysis under alkaline conditions, and thus a balance must be struck to obtain optimal modification yield. Although the average pKa of lysine ammonium groups is 10.5, this value can be altered significantly by interactions with other charged groups on the protein surface. A recent review has described an analysis technique to evaluate these effects (12).

The most common reagents for lysine modification are NHS-esters, which react with amines at rates that are significantly greater than background hydrolysis, Fig. 2a (13). These reagents are most commonly prepared from carboxylic acids and isolated before exposure to the protein substrate. Because of the popularity of this method, dozens of premade NHS-esters are now commercially available. In cases where aqueous solubility is problematic, sulfonated NHS-esters can be used. Because virtually all proteins have many lysine residues on their surface, this reaction affords the most reliable and general method for protein modification, although initial screens are still recommended to determine the required amount of reagent and optimal buffer conditions. It should be noted that the abundance of lysine residues also renders this technique inherently non-site selective, which leads to mixtures of products that vary in both the number and location of the modifications. Although lysine acylation is the principal reaction that occurs with these reagents, the N-terminus can also be acylated, as can a variety of other nucleophilic side chains. In the latter cases, the transient species that result are often cleaved by exposing the protein to hydroxylamine before isolating the product by gel filtration.

Several related reagents are also available for lysine modification, including isocyanates, which are often prepared from acyl azides using the Curtius rearrangement (Fig. 2b); and isothiocyanates (Fig. 2c), which are more stable for long-term storage. Amine arylation can also be accomplished using 2,4-dinitrofluorobenzene (Sanger's reagent, Fig. 2d). The products of this reaction are fluorescent, which assists their detection in sequencing applications. All of these reactions proceed under similar conditions and are often selected based on the convenience of reagent preparation from the compounds on hand.

As an alternative strategy, lysine residues can be modified through reductive alkylation, Fig. 2e. This method is most frequently carried out by exposing the protein to aldehydes in the presence of hydride-containing agents that reduce the transiently formed imines. NaBH3CN and NaBH(OAc)3 are commonly used for this purpose. As an alternative, transfer hydrogenation can be carried out in the presence of an Ir(III)[Cp*]2 (bipyridyl) catalyst, which allows imine reduction to occur under mild conditions using buffered formate as the hydride source (14).

Relative to acylation strategies, modification via reductive alkylation preserves the overall charge state (and thus the solubility) of the protein. In general, this technique also suffers from poor site selectivity.
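The reason lysine reactions are run at pH 8–9 can be made concrete with the Henderson–Hasselbalch relationship, which gives the fraction of amino groups present in the reactive, deprotonated form. The Python sketch below is an illustration only: the function name `fraction_deprotonated` is hypothetical, and it uses the average pKa of 10.5 quoted above, which individual residues may not share.

```python
def fraction_deprotonated(pH, pKa):
    """Fraction of an acidic group (here, the lysine ammonium ion) that is
    deprotonated at a given pH, from the Henderson-Hasselbalch relationship."""
    return 1.0 / (1.0 + 10.0 ** (pKa - pH))


if __name__ == "__main__":
    LYSINE_PKA = 10.5  # average value from the text; local environment can shift it
    for pH in (7.0, 8.0, 9.0, 10.0):
        percent = 100.0 * fraction_deprotonated(pH, LYSINE_PKA)
        print(f"pH {pH:4.1f}: {percent:6.3f}% of lysine amines deprotonated")
```

Under these assumptions only a small percentage of the amino groups is deprotonated even at pH 9, which is consistent with the need to balance amine reactivity against reagent hydrolysis at higher pH.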

Chemical reactions of cysteine residues and disulfides

By far the most widely used methods for site-selective protein modification target cysteine residues. In contrast to lysine, cysteine is one of the rarest amino acids (10), and it is unusual to find it in the reduced form on protein surfaces. Thus, it is often possible to introduce a uniquely reactive cysteine residue using genetic methods. Similar to lysine-modifying reagents, many cysteine-reactive small molecules are commercially available because of the success of this overall strategy. In the reduced state, the sulfhydryl group of cysteine can be deprotonated (pKa ≈ 8) to generate a potent thiolate nucleophile. This species then can be intercepted by soft electrophiles, such as iodoacetamides (Fig. 3a), maleimides (Fig. 3b), acrylamides, and vinyl sulfones (Fig. 3c), to result in the formation of a new carbon–sulfur bond. Although lysine residues can also be modified using many of these reagents, cross-reactivity can often be minimized by running the reactions at relatively low pH to encourage protonation of the amino groups. Varying amounts of cysteine-alkylating reagents (from 1 to 1000 equivalents) are used to reach the desired level of modification. Although possible, the acylation of cysteine residues is not commonly used as a modification strategy because the resulting thioesters hydrolyze over time in solution.
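The effect of pH on cysteine versus lysine cross-reactivity can be estimated in a rough way by comparing the fractions of thiolate and free amine at each pH. The Python sketch below is a simplified illustration: the function names are hypothetical, the pKa values are the averages listed in Fig. 1, and the model considers protonation state only, ignoring the intrinsic nucleophilicity of each group and its local environment.

```python
def fraction_deprotonated(pH, pKa):
    """Henderson-Hasselbalch fraction of the deprotonated (nucleophilic) form."""
    return 1.0 / (1.0 + 10.0 ** (pKa - pH))


def thiolate_to_amine_ratio(pH, cys_pKa=8.0, lys_pKa=10.5):
    """Ratio of the thiolate fraction to the free amine fraction.

    A crude selectivity indicator only: it assumes the average pKa values
    from Fig. 1 and says nothing about reaction rates.
    """
    return fraction_deprotonated(pH, cys_pKa) / fraction_deprotonated(pH, lys_pKa)


if __name__ == "__main__":
    for pH in (6.5, 7.0, 8.0, 9.0):
        print(f"pH {pH:3.1f}: thiolate/amine ratio = "
              f"{thiolate_to_amine_ratio(pH):.0f}")
```

In this simplified picture the ratio grows as the pH is lowered, in line with the practice of running cysteine alkylations at relatively low pH to suppress lysine modification.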

Amino Acid      Abbreviation  Side Chain pKa  Average Abundance (10)  Number in Cellulase Sequence (11)  Modify With
Lysine          K             10.5            5.9%                    25 (5.8%)                          NHS Esters
Cysteine        C             8               1.9%                    6 (1.4%)                           Maleimides
Aspartic Acid   D             3.5             5.3%                    33 (7.7%)                          EDC + Amines
Glutamic Acid   E             4               6.3%                    13 (3.0%)                          EDC + Amines
Tyrosine        Y             10              3.2%                    30 (7.0%)                          Diazonium Salts
Tryptophan      W             n/a             1.4%                    11 (2.6%)                          Rhodium Carbenoids
Histidine       H             6               2.3%                    10 (2.3%)                          Diethylpyrocarbonate
Methionine      M             n/a             2.3%                    10 (2.3%)                          Cyanogen Bromide
Arginine        R             12.5            5.1%                    5 (1.2%)                           Glyoxal Compounds

Figure 1 Tabulated data for amino acid side chains commonly targeted in chemical modification reactions. As a specific example, a space-filling model of a soluble cellulase domain from C. cellulolyticum (PDB ID 1IA7), shown in front and back views, indicates the relative abundance and surface accessibility of these residues. Examples of commonly used modification reagents are also listed.

WILEY ENCYCLOPEDIA OF CHEMICAL BIOLOGY 2008, John Wiley & Sons, Inc.


Cysteine-alkylating groups have been coupled to small molecules that bind to specific sites on a protein to gain an additional level of labeling selectivity. As one example, a weakly reactive fluoromethyl ketone was fused to pyrrolopyrimidines known to inhibit a series of kinases. Upon binding to two kinases (RSK1 and RSK2), selective alkylation was observed for a cysteine residue adjacent to the binding site (15). Ligand–receptor interactions have also been exploited in the modification of ion channels. In these studies, maleimides bearing channel-blocking groups were used to alkylate genetically introduced cysteine residues. After modification, the activity of the ion channels could be controlled by the photoisomerization of an attached azo moiety (16).

A second method for the modification of cysteine residues involves the formation of disulfide bonds (Fig. 3d). In this method, reduced cysteine residues exchange with exogenous disulfides, which leads to the formation of new disulfides on the protein surface. This reaction can be driven to completion by mass action using a large excess of disulfide, or its efficiency can be improved by using asymmetric disulfides that possess good leaving groups. Disulfides of 2-thiopyridone (17) and 3-carboxy-4-nitrothiophenol (Ellman's reagent) are commonly used for this purpose. To assist with the thermodynamic considerations of these reactions, a helpful table of disulfide reduction potentials has been published (18). A unique aspect of this class of cysteine-modification reactions is that they are reversible, which is a feature that could be exploited for drug delivery applications (19).

Disulfides themselves are often modified during protein analysis. Most commonly, this modification is accomplished by reducing them with dithiothreitol, mercaptoethanol, or tris(2-carboxyethyl)phosphine (20) and following this reduction by a subsequent alkylation step using any of the reagents listed above, Fig. 3e (21). An interesting version of this procedure has been developed to link both of the cysteines that result from the reduction step through two consecutive alkylation reactions (22). The ability of this technique to replace a disulfide bond with a more robust link could be used to improve the stability of proteins, hormones, or antibodies for in vivo applications. To distinguish reduced cysteines from disulfides in proteolytic digests, a biotin-based affinity capture strategy has been developed (23).

Figure 2 Chemical modification strategies for lysine residues. Panels: (a) Acylation with NHS-Esters; (b) Reaction with Isocyanates; (c) Reaction with Isothiocyanates; (d) Arylation with Sanger's Reagent; (e) Reductive Alkylation.

Chemical reactions of aspartic and glutamic acid residues

The surface of most proteins displays many aspartic and glutamic acid residues, which provides an additional set of locations for protein modification. The carboxylate groups are predominantly deprotonated at neutral pH (pKa = 3.5–4), which makes it generally difficult to distinguish between the many copies that are present. In cases where site selectivity is not a concern, the carboxylate functional groups can be activated through the use of water-soluble carbodiimides, such as N-ethyl-3-N,N-dimethylaminopropyl carbodiimide (EDC), Fig. 4. This reagent forms an O-acylisourea intermediate that can react with amines to form amides. This intermediate can also hydrolyze to regenerate the carboxylate anion, and in some cases it will undergo an acyl shift to yield an N-acyl urea. This latter pathway results in a permanent modification of the protein and in a deactivation of the carboxylate group toward further modification. Nucleophilic catalysts (such as NHS and HOBT) have been shown to suppress this pathway (24). When lysine side chains supply the amino groups, this reaction results in protein cross-links that can be used for topology mapping purposes. As one example, this technique has been used to determine the number of individual protein subunits in an ion channel, which affords information about its multimeric state (25).

Chemical reactions of tyrosine residues

Although the chemical modification of tyrosine residues has enjoyed a long history, this residue remains an underused target for bioconjugation reactions. It is typically modified through electrophilic aromatic substitutions (EAS), which makes its reactivity distinct from other amino acid side chains. This reaction complementarity is particularly useful in cases when cysteine chemistry cannot be used or in combination with cysteine chemistry when multiple labels need to be introduced in specific locations. Although tyrosine is a relatively common amino acid, it is often buried in the interior of the protein. This placement effectively reduces the number of phenolic side chains that can participate in reactions, which often leads to higher site selectivity than can be expected for lysine modification strategies. In general, EAS reactions are selective for the positions adjacent to the phenolic hydroxyl group, and in some cases (such as iodination) two additions can be observed for a single ring. When increased reactivity is needed, reactions are often run at elevated pH (pH 8–10) to access an appreciable population of the more nucleophilic phenolate anion.

Figure 3 Chemical modification strategies for cysteine residues. Panels: (a) Alkylation with Iodoacetamides; (b) Alkylation with Maleimides; (c) Alkylation with Vinylsulfones; (d) Formation of Disulfides; (e) Reduction of Disulfides.

Figure 4 Chemical modification of aspartic and glutamic acid residues with EDC, including formation of the N-acyl urea byproduct.

The oldest method for the modification of tyrosine residues occurs through the use of diazonium salts, as reported in 1904 (26). This reaction results in the formation of azo compounds that are brightly colored, Fig. 5a. Both tyrosine and histidine participate in these reactions, and it has been reported that aliphatic amino groups can react to form unstable triazenes (27, 28). Thus, it should not be assumed that tyrosine is the only modified amino acid without additional confirmation. In the case of tyrosine, these reactions typically require the participation of the phenolate anion and thus are carried out at pH 8–10 at 4 °C (29). In most cases, the diazonium salts are generated through the reaction of anilines with sodium nitrite under acidic conditions and are used immediately. This reaction has been used to produce antigens (30) and to elucidate the requirements of tyrosine and histidine in enzyme active sites (31). The azo products have also served as probes of amino acid surface accessibility (29), thus providing an indirect method to determine protein conformation. One recent example has used this reaction to immobilize pig liver esterase on silica to prepare a reusable catalyst for the hydrolytic kinetic resolution of racemic esters (32). Diazonium coupling reactions have also been applied to the modification of protein assemblies, reaching nearly quantitative conversion for the modification of tyrosine residues that are displayed on MS2 (33) and TMV (34) viral capsids. These proteins are particularly good targets for this reaction, as each capsid monomer possesses a single solvent-exposed tyrosine residue and contains no histidines. In these studies, it was found that electron-withdrawing substituents in the 4-position of the diazonium salt significantly enhanced the efficiency of the reaction.
These modification reactions have been used to introduce hundreds to thousands of ketone and aldehyde groups for the subsequent attachment of MRI contrast enhancement agents (35) and water-solubilized carbon nanotubes (36).

Phenol nitration with tetranitromethane (TNM) (37) is typically run at pH 8 at room temperature and is selective for tyrosine residues under these conditions, although some oxidation of cysteine residues has been reported (38). The product of this reaction could be thought to develop through an electrophilic aromatic substitution reaction, but the mechanism instead has been shown to proceed through a radical coupling pathway (39). The degree of nitration is easily quantified by measuring the absorbance of nitrotyrosine at 428 nm. If the reaction is performed under more rigorous conditions, such as with increased equivalents of TNM, then histidine, tryptophan, and methionine can also react. The in vivo nitration of tyrosine by peroxynitrite and other reactive nitrogen species is implicated in many disease states and is an area of active research (40). It has recently been proposed that tyrosine nitration, mediated by nitric oxide and superoxide, is a regulated cell signaling pathway that provides a quick response to the microenvironment of the cell (41–43).

The iodination of proteins has proven to be invaluable in the area of biomedical research. These techniques allow the biodistribution of biomolecules to be tracked after they have been labeled with radioisotopes, such as iodine-131. Although early studies used molecular iodine and bromine for this purpose (37), more efficient reagents have since been developed to make better use of small quantities of radioisotopes. An improved iodination method that uses iodine monochloride (ICl) has been exceptionally useful for protein radiolabeling, with efficiencies as high as 60–80% being reported, Fig. 5b (44). Since its original disclosure, several modifications of this reaction have been reported to improve efficiency and reduce handling hazards (45–47). An alternative Chloramine-T based method has also been developed to achieve both low and high levels of specific radioactivity, with iodine use as high as 60–75% (48).

Formaldehyde-induced cross-linking is commonly used for many biochemical applications, from studying protein–protein interactions to immobilizing whole cells on surfaces. Although a detailed early review of the effect of formaldehyde on proteins (49) implicated tyrosine as one residue responsible for cross-linking, few reports have addressed the rigorous chemical characterization of this mechanism with modern analytical techniques. A recent investigation into the nature of these cross-links used extensive enzymatic degradation and MS/MS analysis of insulin that had been exposed to formaldehyde to determine the relative level of participation of different amino acids in cross-linking reactions (50, 51). The researchers concluded that tyrosine was among the most reactive amino acids in this procedure, reacting in an electrophilic aromatic substitution with Schiff bases formed between formaldehyde and the amines of nearby lysine residues, Fig. 5c. It is important to note that all methods for protein cross-linking that use this approach require high (>200 mM) concentrations of formaldehyde and often elevated temperatures (>37 °C) that would not be suitable for preserving the native function of most proteins.

Figure 5 Chemical modification strategies for tyrosine residues. Panels: (a) Azo Formation with Diazonium Salts; (b) Iodination; (c) Formaldehyde Crosslinking; (d) Mannich-Type Alkylation; (e) Oxidative Crosslinking; (f) Phenol Alkylation with Allyl Palladium Complexes.

A more recent advance in tyrosine modification chemistry has been the development of a Mannich-type reaction that involves the condensation of an aniline derivative with aldehydes, followed by the nucleophilic addition of tyrosine to generate a new carbon–carbon bond, Fig. 5d (52). This modification strategy is operationally straightforward and occurs under mild reaction conditions. In most cases, the protein target is incubated with formaldehyde and an aniline of choice at concentrations that range from 5 to 25 mM in pH 6.5 aqueous buffer. Importantly, aliphatic amines do not participate in the reaction at these concentrations, which prevents undesired protein cross-linking. The reaction is typically quenched by the addition of hydroxylamine to cleave any imines that have been formed with lysine residues. Proteolytic digests and NMR experiments have confirmed the selectivity of the reaction for tyrosine, although side reactions with tryptophan have also been noted in some cases. Although cross-reactivity is likely to be observed in some cases, this reaction has been reported for the coupling of synthetic peptides to tyrosine residues on intact proteins (53).

Much interest has appeared in the development of techniques that can identify adjacent proteins in large multimeric assemblies.
As one example, a tyrosine-based cross-linking strategy has been developed through the use of Ni(II) ions that bind to (His)6 tags on proteins of interest (54, 55). During oxidation to Ni(III) in the presence of persulfate ion or a peracid, the metal ion extracts an electron from a nearby tyrosine residue to create a radical intermediate. This species then couples to additional tyrosine residues and ultimately forms a dityrosine link after abstraction of a hydrogen atom by sulfate radical and re-aromatization, Fig. 5e. In the absence of a nearby tyrosine, a mechanism that involves the addition of a cysteine or lysine is proposed. In a subsequent report, an additional strategy coined photo-induced cross-linking of unmodified proteins (PICUP) was described as a technique that did not require a (His)6 tag (56). In this system, ruthenium(II) tris(bipyridine), in combination with ammonium persulfate as an electron acceptor, effectively cross-links tyrosine-containing proteins with irradiation times of less than one second. In this reaction, the excited ruthenium complex donates an electron to ammonium persulfate, which generates Ru(III) as the one-electron oxidant. In follow-up studies, it was demonstrated that nickel-catalyzed proximity biotinylation and Ru(II)(bpy)3-mediated oxidative cross-linking can be used effectively to measure the equilibrium dissociation constant and stoichiometry of protein complexes (57). This methodology has also been used to map the large multiprotein complex 20 S proteasome core particle (58) and has provided a valuable tool for the study of pro-amyloid filaments (59). A modified version of this strategy has been used to cross-link tyrosine residues on a viral capsid and to modify them with alkyl halides that were added to the reaction solution (60).

A palladium-based method has been developed for the alkylation of the phenolic oxygen of tyrosine residues, Fig. 5f (61). In this reaction, allylic carbonates, esters, and carbamates are activated by palladium(0) complexes in aqueous solution to form electrophilic π-allyl complexes. These species react at pH 8–10 with the phenolate anions of tyrosine residues, which results in the formation of an aryl ether and the regeneration of the Pd(0) catalyst. The reaction requires P(m-C6H4SO3)3 as a water-soluble phosphine ligand. Activated π-allyl complexes that do not react with tyrosine residues undergo β-hydride elimination under the basic conditions to yield diene by-products. A particularly attractive feature of this method is its ability to use substrates with charged groups in the allylic positions. This ability allows hydrophobic substrates, such as lipids, to be solubilized to facilitate protein modification.

Chemical reactions of tryptophan residues

Several features of tryptophan residues render them strategic targets for chemical modification. Tryptophan is the most fluorescent of the native amino acids, and the emission wavelength and intensity of this residue are often used to assess changes in the folding state of the protein or to detect ligand binding. Through the chemical alteration of this residue, these properties could be enhanced, shifted, or abrogated to determine the environment of a single tryptophan residue in the presence of others. As tryptophan residues are often hydrophobic contributors in binding sites, the modification of this residue could offer a way to modulate substrate binding. Studies have also revealed the importance of tryptophan side chains as mediators of electron transfer (62), which suggests opportunities to tune the electrical potential of tryptophan side chains through derivatization. As it is the rarest of the amino acids (particularly on protein surfaces), its selective modification could provide a strategic handle for bioconjugation when cysteine chemistry cannot be used. Because of its importance, a growing number of techniques are becoming available for the modification of this residue.

The electron-rich indole ring of tryptophan is susceptible to attack by electrophilic reagents. Although the 3-position is typically the most nucleophilic site of the aromatic ring, the alkyl substituent present in tryptophan residues can direct reactions to occur at the 2-position instead. Electrophilic halogen species, such as N-bromosuccinimide (NBS) (63) and in situ-generated dimethylchlorosulfonium ions (64), have been used for some time to effect the oxidation of tryptophan residues through the intermediacy of halohydrins, Fig. 6a. These transient species undergo elimination, followed by tautomerization to afford 2-oxindole products. The halogenation conditions can also involve the intramolecular attack of the amide carbonyl on the bromonium ion intermediate, Fig. 6b (65).
The resulting iminolactone hydrolyzes readily, which results in peptide bond cleavage. This procedure thus provides a useful alternative to cyanogen bromide cleavage for use in proteomics applications. With NBS, cleavage at tyrosine and histidine residues can also occur through similar pathways. As a more tryptophan-selective alternative, 2-(2-nitrophenylsulfenyl)-3-methyl-3-bromoindolenine (BNPS-skatole) is often used (66).

Figure 6 Chemical modification strategies for tryptophan residues: (a) oxidation with NBS; (b) peptide cleavage with NBS or BNPS-skatole; (c) alkylation with HNB; (d) alkylation with malonyl dialdehyde; (e) alkylation with rhodium carbenoids.

It was recently reported that the tryptophan residues of proteins could be nitrated by the action of peroxynitrite (67). This reactive nitrogen species (RNS) is generated from the reaction of nitric oxide with superoxide at a rate that is ten times greater than the destruction of superoxide by dismutases. The authors propose that the nitration of tryptophan, although less common than tyrosine nitration, could serve to modulate the function of some proteins. However, in vivo evidence for tryptophan nitration by RNS has yet to be reported.

One common way to modify tryptophan residues is through the addition of 2-hydroxy-5-nitrobenzyl bromide (HNB). Also known as Koshland's reagent (68), this compound first alkylates the 3-position of the ring and eventually leads to the formation of a product mixture through subsequent intramolecular cyclization pathways (Fig. 6c). This reagent exhibits good tryptophan selectivity, although some cysteine cross-reactivity (about fivefold less) is sometimes observed. Multiple additions to a single tryptophan residue have also been reported (69). Nonetheless, this compound has long served as the reagent of choice to estimate the tryptophan content of proteins (70, 71), and it has been used extensively to probe the role of tryptophan in the active sites of enzymes (72–74).

As an alternative reaction pathway, a recent report has shown that the nitrogen atom of the indole ring can be modified using malonyl dialdehyde derivatives with quantitative conversion (Fig. 6d) (75). When the reaction is carried out under strongly acidic conditions, such as 50% aqueous trifluoroacetic acid, competing Schiff base formation and arginine modification can be avoided. Although the strongly acidic conditions required for this reaction will undoubtedly denature protein substrates, this reaction offers a convenient method for the identification of tryptophan-containing peptides for proteomics applications. The aldehyde group remaining in the product can be derivatized through hydrazone formation, which allows enrichment of tryptophan-containing peptides through solid-phase capture (76).

A transition metal-catalyzed reaction has been developed to modify tryptophan residues on proteins with high chemoselectivity (Fig. 6e) (77). This reaction involves metallocarbenoid intermediates generated in situ through the degradation of vinyl diazo compounds by rhodium carboxylate salts. The resulting species react with indole rings to form a mixture of N- and 2-substituted products, likely proceeding through direct N–H insertion and cyclopropanation/ring opening pathways, respectively. Although no organic cosolvent is required for the reaction to occur, small amounts of ethylene glycol can be added to solubilize the diazo compound. Hydroxylamine hydrochloride was found to promote this reaction significantly, presumably by binding the catalyst and attenuating the reactivity of the metallocarbenoid species. However, the use of this additive requires that the reaction be carried out under low pH conditions (pH 3.5) that would be expected to denature most proteins. A subsequent study has reported that 4-diazo-1,6-heptadiene-3,5-dione derivatives can be used at neutral pH without the use of this additive (78). This report demonstrated the use of this approach for the attachment of fluorescent probes to a tryptophan residue on beta-lactoglobulin.

Figure 7 Chemical modification strategies of other amino acid residues: (a) histidine acylation with diethylpyrocarbonate; (b) methionine cleavage with cyanogen bromide; (c) arginine modification with dicarbonyl compounds.

Chemical reactions of other amino acid residues

Several other amino acids are known to react selectively with chemical reagents. Although they are less commonly targeted for bioconjugation, the modification of these residues is often used for the purposes of analysis or to abolish catalytic activity. As one example, histidine residues can be acylated using diethylpyrocarbonate (Fig. 7a), a commonly used strategy to deactivate RNase when intact RNA molecules are required. Methionine residues react selectively with cyanogen bromide (CNBr) (79), which results in the formation of cyclic imidates that can be cleaved during acidification (Fig. 7b). This method has long been used to degrade full-sized proteins into smaller peptides for sequencing analysis. Methionine residues are also reactive toward oxidants, such as periodate, yielding sulfoxides. Arginine residues can be modified selectively using dicarbonyl compounds (Fig. 7c) (80). Both single- and double-addition products are obtained in these reactions. Serine and threonine residues are generally difficult to modify in aqueous solution unless they are present in the active sites of proteases (8) or at the protein N-terminus, as described below.
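The residue-specific cleavage reactions described above (CNBr at methionine, and NBS or BNPS-skatole at tryptophan) can be sketched as a simple in silico digest. The following is a minimal illustration rather than a validated proteomics tool: the function name and the example sequence are hypothetical, and the sketch assumes complete cleavage on the C-terminal side of every target residue with no side reactions.

```python
def cleave_after(sequence: str, residue: str) -> list[str]:
    """Split a one-letter protein sequence on the C-terminal side of `residue`.

    Assumes idealized, complete cleavage. The target residue is kept at the
    end of each fragment; its chemical conversion during cleavage is ignored
    (for CNBr, methionine ends up as a homoserine lactone).
    """
    fragments = []
    start = 0
    for i, aa in enumerate(sequence):
        # A target residue at the very C-terminus produces no new fragment.
        if aa == residue and i < len(sequence) - 1:
            fragments.append(sequence[start:i + 1])
            start = i + 1
    fragments.append(sequence[start:])
    return fragments


# Hypothetical ten-residue peptide used only for illustration.
peptide = "GAMKWLSMTR"
cnbr_fragments = cleave_after(peptide, "M")  # methionine-directed cleavage
trp_fragments = cleave_after(peptide, "W")   # tryptophan-directed cleavage
```

Because tryptophan and methionine are both rare, either digest typically yields a small number of large fragments, which is why these reagents are useful complements to one another.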

Modification of the N- and C-Termini

As multiple copies of even the rarest amino acids are often present, cases exist in which none of the chemical modification strategies described above can be used to functionalize a single site. To address this challenge, many groups have developed reactions that take advantage of the unique chemical reactivities of the polypeptide termini. In addition to providing improvements in site selectivity, these methods have been used to form native-like peptide links in some instances.

Modification of the N-terminus

Historically, the most important technique to target the N-terminus has been the Edman degradation (Fig. 8a) (81). In this sequencing method, the N-terminal amino group is first reacted with phenylisothiocyanate to form a thiourea. During subsequent acidification, the sulfur atom attacks the proximal amide bond, which results in the removal of the first amino acid from the polypeptide chain. The initially formed thiazolinone quickly rearranges under the acidic conditions to form the more stable thiohydantoin. HPLC analysis is then used to identify the liberated residue. The remaining polypeptide chain bears a new N-terminal amino acid that can be removed in a subsequent sequencing round. Peptides up to 50 amino acids in length can be analyzed in this fashion.

The N-terminal amino group is less basic [pKa = 6–8 for the protonated form (82)] than lysine amino groups, which makes it possible to achieve a degree of selective acylation by using NHS-esters at relatively low pH. However, this strategy usually does not yield absolute site selectivity because of the large number of competing lysines that are present.

More selective N-terminal modification reactions also involve the side chain functionality of the first amino acid. A common example is the oxidation of the beta-amino alcohols of N-terminal serine and threonine residues to yield aldehyde groups for subsequent modification (Fig. 8b) (83, 84). N-terminal serine and cysteine residues can also be condensed with aldehydes to form oxazolidines and thiazolidines, respectively (Fig. 8c) (85). Both of these links are stable at high pH, but display varying hydrolysis rates in acidic media. Aldehydes are known to react readily and irreversibly with N-terminal tryptophan residues, giving high yields of Pictet–Spengler products (Fig. 8d) (86). Finally, N-terminal cysteine residues can participate in native chemical ligations with thioesters, as detailed in the next section (see Fig. 9).

Figure 8 Reactive strategies targeting the N-terminus: (a) Edman degradation with phenylisothiocyanate; (b) oxidation with periodate; (c) oxazolidine and thiazolidine formation; (d) Pictet–Spengler reaction; (e) transamination.

As an alternative to these techniques, reactive functionality can also be introduced at the N-terminus using biomimetic strategies. Early reports by Dixon indicated that an N-terminal transamination can be accomplished by exposing proteins to aldehydes (such as glyoxylic acid) and Ni2+ or Cu2+ ions (87, 88). More recently, a metal-free version of this reaction has been reported using pyridoxal phosphate (PLP) to effect the transamination reaction (89). This method proceeds in buffered aqueous solution at pH 6.5 and 22–50 °C. Several N-terminal amino acids, such as glycine and aspartic acid, have been shown to be compatible with this method, whereas tryptophan can react with the aldehyde of PLP through other pathways, as described above. This technique provides a convenient method for the site-specific modification of several protein targets and has been demonstrated for the attachment of polymerization initiators (9) and surfaces to proteins through well-defined links (90). This method has also been applied to the modification of monoclonal antibodies in sites that are adjacent to the antigen binding regions (91).
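The pH argument made earlier in this section for selective N-terminal acylation can be made concrete with the Henderson–Hasselbalch relationship, since only the unprotonated amine is nucleophilic. The sketch below is illustrative only: the N-terminal pKa of 7.0 is simply a point inside the range quoted above, the lysine pKa of 10.5 is a typical textbook value that is not taken from this article, and differences in intrinsic nucleophilicity, local environment, and the number of lysines are ignored.

```python
def fraction_unprotonated(pka: float, ph: float) -> float:
    """Fraction of an amine present as the free base at a given pH
    (Henderson-Hasselbalch: [B]/([B] + [BH+]) = 1 / (1 + 10**(pKa - pH)))."""
    return 1.0 / (1.0 + 10.0 ** (pka - ph))


def selectivity_ratio(ph: float, pka_nterm: float = 7.0, pka_lys: float = 10.5) -> float:
    """Ratio of free-base N-terminal amine to free-base lysine amine, per site.

    Both default pKa values are assumptions made for this sketch.
    """
    return fraction_unprotonated(pka_nterm, ph) / fraction_unprotonated(pka_lys, ph)


# Compare a mildly acidic and a mildly basic labeling pH.
ratio_low_ph = selectivity_ratio(6.5)
ratio_high_ph = selectivity_ratio(8.5)
```

Under these assumptions the per-site preference for the N-terminus falls as the pH is raised, which is consistent with the use of relatively low pH for NHS-ester labeling. The ratio is per amine, so a protein with many surface lysines can still give substantial lysine labeling.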

Modification of the C-terminus

Native chemical ligation (NCL) has enjoyed tremendous success as one of the few methods for C-terminal protein modification. In its most general form, this approach is a ligation between a peptide that contains a C-terminal thioester and a peptide that contains an N-terminal cysteine, which results in a native amide bond (Fig. 9) (92). The full power of this protein synthesis technique is realized in expressed protein ligation (EPL), a variant of NCL in which the thioester-containing peptide is produced recombinantly (93). In EPL, the N-terminal portion of the desired peptide is expressed contiguously with an intein domain. The intein catalyzes an N-to-S acyl shift to form a thioester between the peptide target and a cysteine thiol at the end of the intein sequence. During the addition of a functionalized N-terminal cysteine, a transthioesterification occurs to join the two coupling partners and release the intein. A final S-to-N acyl shift forms the native peptide bond to yield the desired product. Additional small-molecule thiols are often added to accelerate this reaction.

Figure 9 Modification of the C-terminus through native chemical ligation (NCL).

A wide variety of biologically relevant molecules have been linked to the C-terminus of proteins using the EPL approach, ranging from naturally occurring biomolecules, such as lipids and polysaccharides, to non-natural probes like fluorophores, affinity tags, and metal chelating agents (94). In combination with solid-phase peptide synthesis (SPPS), EPL has made valuable contributions to proteomics by enabling the synthesis of proteins containing posttranslational modifications that are difficult to introduce (95). This approach has been used to attach fluorophores to proteins for the detection of protein–protein interactions by emission wavelength shifts (96) or through FRET (97). The site-specific incorporation of stable isotopes has also been used to facilitate NMR structural studies. In one example, two intein-splicing events were used to 15N-label an internal region of the 370-residue maltose binding protein (98). In conjunction with the traceless Staudinger reaction described below, the NCL technique has also been used to synthesize a functional RNase enzyme from several precursor fragments (99).

In the last few years, several interesting methods have been developed to couple C-terminal thioesters to the N-terminus of synthetic peptides without leaving cysteine residues at the junction site. These techniques instead supply the requisite sulfhydryl group through the use of auxiliaries that are cleaved after the ligation takes place. One notable example of this technique is the total synthesis of the multiply glycosylated erythropoietin alpha (EPO 1), a protein that has found widespread use in the treatment of anemia (100). Applications such as these showcase the versatility of EPL and its considerable ability to address difficult bioconjugation challenges.

Bioorthogonal Methods for Protein Labeling

Although this review has focused on the reactivity modes that can be used to modify the amino acids directly, it is equally important to be able to predict what will not react with the native functional groups of proteins. Reactions that can proceed in aqueous solution while ignoring the natural amino acids are exceptionally useful for the development of secondary labeling strategies. In these approaches, one reaction described above is used to attach a chemically distinct functional group to a particular amino acid. A second reaction is then used to install a desired label that might have interfered with the chemistry of the first step. The targeting of unique chemical functional groups is also important for the labeling of a single biomolecule that is present among many others, as is the case in cell lysates, living cells, or entire organisms (101).

In particular, the development of bioorthogonal labeling reactions has been propelled by the availability of new tools for the direct incorporation of artificial functional groups into biomolecules. For proteins, powerful techniques have been developed for the introduction of new amino acids on the translational level, both through stop codon suppression (102) and amino acid codon reassignment (103). Metabolic engineering techniques have also been developed for the incorporation of artificial groups into carbohydrates (104, 105), and sequence-selective enzymatic labeling techniques have been used to introduce lipids (106) and cofactors (107, 108) that are substituted with new functionality. The purpose of this section is to provide a brief list of the new chemical strategies that have been developed to target these artificial functional groups because the importance of these techniques will undoubtedly grow in the coming years.

Methods targeting carbonyl groups

The first methods that were used to label non-native functional groups on proteins targeted aldehydes and ketones. These groups can be introduced directly through the periodate oxidation of N-terminal serine residues (83), transamination (89), or carbohydrate oxidation with periodate (109). They can also be installed using metabolic engineering (104) or translational techniques (110). Aldehydes and ketones condense selectively with hydrazine and alkoxyamine derivatives to form hydrazone- and oxime-type derivatives, respectively (Fig. 10a) (110, 104). Both reactions are carried out in aqueous solution using an excess of the ketone-reactive reagents. Early mechanistic studies determined that the rate-limiting step in the reaction is the dehydration of the tetrahedral intermediate that is formed after nucleophilic attack on the carbonyl group (111). Therefore, mildly acidic conditions (e.g., pH 6.5) are typically used to accelerate the dehydration step without fully protonating the nucleophilic reagents. This reaction exhibits excellent chemoselectivity and works well for protein labeling with a diverse array of functional groups. However, the presence of competing ketone and aldehyde metabolites limits its use for protein labeling in crude cell lysates.

Methods targeting azides

More recently, azide groups have emerged as popular targets for chemoselective protein modification. They seem to be ignored completely in a variety of biological settings (112), and yet they possess a favorable thermodynamic reaction potential through the loss of nitrogen gas (113). They are also small in size and thus can be incorporated using a variety of enzymatic, metabolic, or translational techniques. The azide group also provides a unique IR chromophore for convenient spectroscopic characterization. The first technique that was used to modify azide groups in a biological setting was a modified Staudinger ligation (105), which used a pendant ester group to capture the iminophosphorane intermediate. The resulting amide bond served to link additional functionality on the phosphine to the protein permanently (Fig. 10b). Although the original version of this reaction incorporated the triarylphosphine oxide into the coupling product, more recent traceless versions have appeared (114, 115). When used in conjunction with native chemical ligation techniques (see above), this reaction can serve as a means to synthesize full-sized proteins (99). A version of this technique has also been developed for the fluorescent detection of azido groups (116). Perhaps the ultimate testament to the bioorthogonality of this method is its reported use in living animals (112), which renders it a uniquely useful approach for the study of glycosylation using in vivo imaging techniques (117). A detailed mechanistic investigation of this reaction has appeared (118).

Figure 10 Chemoselective modification strategies targeting non-native functional groups: (a) modification of ketones and aldehydes; (b) Staudinger ligation with azides; (c) copper-catalyzed Huisgen cyclization of azides; (d) strain-promoted Huisgen cyclization of azides (X = H, F); (e) oxidative coupling of anilines. In (a), reagents of the form H2N–X–R give the following products: alkoxyamine (X = O), oxime; hydrazine (X = NH), hydrazone; hydrazide (X = NHCO), acylhydrazone; semicarbazide (X = NHCONH), semicarbazone.

A second mode of reactivity for azide groups occurs through a [3 + 2] cyclization reaction with alkynes, as originally described by Huisgen (119). Although this reaction occurs thermally in the absence of additional reagents, a critical advance for the targeting of azide groups on biomolecules was the observation that Cu(I) salts catalyzed the formation of the triazole products in aqueous solution at ambient temperature (120, 121) (Fig. 10c). This technique has been used for the labeling of viral capsids (122, 123) and the surface of bacterial cells (124), and it has been used to attach proteins to surfaces (125). Both azide and alkyne groups have been introduced on the translational level to provide the appropriate coupling partners (124, 126). This reaction has been used with particular success as a detection method for the identification of new proteases, which were labeled with azides at active site residues (127). The monumental success of this reaction for the facile construction of diverse molecules in organic synthesis suggests that it has a very bright future in biomolecule labeling. The mechanism of the copper-catalyzed reaction has been described (128).

For situations in which copper ions are observed to interfere with protein function or to exhibit toxicity, a metal-free version of this reaction has been developed. This strategy uses a strained cyclooctyne as the alkyne component, which readily undergoes the Huisgen cyclization with azides at room temperature (Fig. 10d) (129). Although unsubstituted cyclic alkynes participate in the reaction with relatively slow rates, a subsequent report has shown that the presence of electron-withdrawing fluorine substituents can accelerate the cycloaddition significantly (130). Taken together, this set of reactive pathways renders the azide group one of the most promising targets for bioconjugation reactions in which complete functional group tolerance is required.
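Because the carbonyl condensations and azide–alkyne cycloadditions described above differ in whether a small molecule is released, the expected mass of a conjugate (for example, when checking a labeling reaction by mass spectrometry) follows from simple bookkeeping. The sketch below is a hedged illustration: the function name, the linkage labels, and the example masses are hypothetical, and it covers only oxime or hydrazone formation (loss of one water per linkage) and triazole formation (no atoms lost). The Staudinger ligation is deliberately left out because its mass balance depends on the phosphine reagent used.

```python
# Monoisotopic mass of water in daltons (2 x 1.007825 + 15.994915).
# For average-mass work on intact proteins, substitute about 18.015.
WATER = 18.010565

# Mass released per linkage formed; the keys are labels chosen for this sketch.
MASS_LOST = {
    "oxime": WATER,      # carbonyl + alkoxyamine condensation
    "hydrazone": WATER,  # carbonyl + hydrazine or hydrazide condensation
    "triazole": 0.0,     # azide + alkyne cycloaddition retains every atom
}


def conjugate_mass(protein_mass: float, reagent_mass: float,
                   linkage: str, copies: int = 1) -> float:
    """Expected mass after attaching `copies` reagent molecules to a protein.

    All masses must be on the same basis (monoisotopic or average).
    """
    if linkage not in MASS_LOST:
        raise ValueError(f"unknown linkage type: {linkage!r}")
    return protein_mass + copies * (reagent_mass - MASS_LOST[linkage])


# Hypothetical masses used only for illustration.
oxime_product = conjugate_mass(10000.0, 500.0, "oxime")
triazole_product = conjugate_mass(10000.0, 500.0, "triazole")
```

A singly modified product therefore appears one water lighter than the sum of the two partners for an oxime or hydrazone, and at exactly the sum for a triazole.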



Modification through oxidative coupling reactions

An alternative method has been developed for the modification of proteins through the oxidative coupling of aniline groups (131) (Fig. 10e). This method targets aminophenylalanine, which is one of the most successful amino acids to be introduced into protein sequences using the stop codon suppression technique (132). In the presence of oxidants, such as periodate or cerium(IV), this group reacts rapidly with phenylene diamine derivatives to afford adducts that are highly stable toward hydrolysis, reduction, or oxidation. The reaction displays very high chemoselectivity, although the reliance on periodate leads to varying levels of methionine oxidation in addition to the desired products.






Conclusion

Although protein bioconjugation has enjoyed a long and successful history, recent years have witnessed a dramatic increase in the number of available techniques for chemical modification. This increase has developed largely because of the availability of high-resolution mass spectrometry and NMR characterization techniques, which have accelerated the rate at which detailed chemical information can be obtained for the modification products. With the addition of each new reaction, previously inaccessible protein bioconjugates of ever-increasing complexity can be realized. The chemical underpinnings of these reactions also form the basis for future modification strategies and expand our understanding of protein reactivity in living systems.






References

1. Straight AF, Cheung A, Limouze J, Chen I, Westwood NJ, Sellers JR, Mitchison TJ. Dissecting temporal and spatial control of cytokinesis with a myosin II inhibitor. Science 2003;299:1743–1747.
2. Griffin BA, Adams SR, Tsien RY. Specific covalent labeling of recombinant protein molecules inside live cells. Science 1998;281:269–272.
3. Zalipsky S. Chemistry of polyethylene-glycol conjugates with biologically-active molecules. Adv. Drug Deliv. Rev. 1995;16:157–182.
4. Zalipsky S, Harris JM. Introduction to chemistry and biological applications of poly(ethylene glycol). Poly(Ethylene Glycol). 1997;680:1–13.
5. Niemeyer CM. Nanoparticles, proteins, and nucleic acids: biotechnology meets materials science. Angew. Chem. Int. Ed. 2001;40:4128–4158.
6. Seeman NC, Belcher AM. Emulating biology: building nanostructures from the bottom up. Proc. Natl. Acad. Sci. U.S.A. 2002;99:6451–6455.
7. McMurry J, Begley TP. The Organic Chemistry of Biological Pathways. 2005. Roberts and Company, New York.
8. Evans MJ, Cravatt BF. Mechanism-based profiling of enzyme families. Chem. Rev. 2006;106:3279–3301.
9. Heredia KL, Maynard HD. Synthesis of protein-polymer conjugates. Org. Biomol. Chem. 2007;5:45–53.












10. Doolittle RF. Redundancies in protein sequences. In: Prediction of Protein Structure and the Principles of Protein Conformation. Fasman GD, ed. 1989. Plenum Press, New York.
11. Parsiegla G, Belaich A, Belaich JP, Haser R. Crystal structure of the cellulase Cel9M enlightens structure/function relationships of the variable catalytic modules in glycoside hydrolases. Biochemistry 2002;41:11134.
12. Gitlin I, Carbeck JD, Whitesides GM. Why are proteins charged? Networks of charge-charge interactions in proteins measured by charge ladders and capillary electrophoresis. Angew. Chem. Int. Ed. 2006;45:3022–3060.
13. Hermanson GT. Bioconjugate Techniques. 1996. Academic Press, San Diego.
14. McFarland JM, Francis MB. Reductive alkylation of proteins using iridium catalyzed transfer hydrogenation. J. Am. Chem. Soc. 2005;127:13490–13491.
15. Cohen MS, Zhang C, Shokat KM, Taunton J. Structural bioinformatics-based design of selective, irreversible kinase inhibitors. Science 2005;308:1318–1321.
16. Szobota S, Gorostiza P, Del Bene F, Wyart C, Fortin DL, Kolstad KD, Tulyathan O, Volgraf M, Numano R, Aaron HL, Scott EK, Kramer RH, Flannery J, Baier H, Trauner D, Isacoff EY. Remote control of neuronal activity with a light-gated glutamate receptor. Neuron 2007;54:535–545.
17. King TP, Li Y, Kochoumian L. Preparation of protein conjugates via intermolecular disulfide bond formation. Biochemistry 1978;17:1499–1506.
18. Houk J, Whitesides GM. Structure-reactivity relations for thiol-disulfide interchange. J. Am. Chem. Soc. 1987;109:6825–6836.
19. Zalipsky S, Qazen M, Walker JA, Mullah N, Quinn YP, Huang SK. New detachable poly(ethylene glycol) conjugates: Cysteine-cleavable lipopolymers regenerating natural phospholipid, diacyl phosphatidylethanolamine. Bioconjug. Chem. 1999;10:703–707.
20. Burns JA, Butler JC, Moran J, Whitesides GM. Selective reduction of disulfides by tris(2-carboxyethyl)phosphine. J. Org. Chem. 1991;56:2648–2650.
21. Sun MMC, Beam KS, Cerveny CG, Hamblett KJ, Blackmore RS, Torgov MY, Handley FGM, Ihle NC, Senter PD, Alley SC. Reduction-alkylation strategies for the modification of specific monoclonal antibody disulfides. Bioconjug. Chem. 2005;16:1282–1290.
22. Shaunak S, Godwin A, Choi JW, Balan S, Pedone E, Vijayarangam D, Heidelberger S, Teo I, Zloh M, Brocchini S. Site-specific PEGylation of native disulfide bonds in therapeutic proteins. Nat. Chem. Biol. 2006;2:312–313.
23. Yen TY, Joshi RK, Yan H, Seto NOL, Palcic MM, Macher BA. Characterization of cysteine residues and disulfide bonds in proteins by liquid chromatography/electrospray ionization tandem mass spectrometry. J. Mass Spectrom. 2000;35:990–1002.
24. Schlick TL, Ding ZB, Kovacs EW, Francis MB. Dual-surface modification of the tobacco mosaic virus. J. Am. Chem. Soc. 2005;127:3718–3723.
25. Maurer JA, Elmore DE, Lester HA, Dougherty DA. Comparing and contrasting Escherichia coli and Mycobacterium tuberculosis mechanosensitive channels (MscL): new gain of function mutations in the loop region. J. Biol. Chem. 2000;275:22238–22244.
26. Pauly H. On the constitution of histidine I: Announcement. Hoppe-Seyler's Zeitschrift für Physiologische Chemie 1904;42:508–518.




27. Busch M, Patrascanu N, Weber W. The coupling of D-amino acids with diazonium salts. J. für Praktische Chemie-Leipzig 1934;140:117–128.
28. Howard AN, Wild F. Reactions of diazonium compounds with amino acids and proteins. Biochem. J. 1957;65:651–659.
29. Riordan JF, Vallee BL. Diazonium salts as specific reagents and probes for protein conformation. Methods Enzymol. 1972;25:521–531.
30. Landsteiner K. The Specificity of Serological Reactions. 1945. Harvard University Press, Cambridge, MA.
31. Fraenkel-Conrat H, Bean RS, Lineweaver H. Essential groups for the interaction of ovomucoid (egg white trypsin inhibitor) and trypsin, and for tryptic activity. J. Biol. Chem. 1949;177:385–403.
32. Herdan JM, Balulescu M, Cira O. Enantioselective hydrolysis of racemic esters using pig liver esterase. J. Mol. Catal. A: Chem. 1996;107:409–414.
33. Hooker JM, Kovacs EW, Francis MB. Interior surface modification of bacteriophage MS2. J. Am. Chem. Soc. 2004;126:3718–3719.
34. Schlick TL, Ding ZB, Kovacs EW, Francis MB. Dual-surface modification of the tobacco mosaic virus. J. Am. Chem. Soc. 2005;127:3718–3723.
35. Hooker JM, Datta A, Botta M, Raymond KN, Francis MB. Magnetic resonance contrast agents from viral capsid shells: a comparison of exterior and interior cargo strategies. Nano Lett. 2007;7:2207–2210.
36. Holder PG, Francis MB. Integration of a self-assembling protein scaffold with water-soluble single-walled carbon nanotubes. Angew. Chem. Int. Ed. 2007;46:4370–4373.
37. Wormall A. The immunological specificity of chemically altered proteins: halogenated and nitrated proteins. J. Exp. Med. 1930;51:295–317.
38. Riordan JF, Vallee BL. Nitration with tetranitromethane. Methods Enzymol. 1972;25:521–531.
39. Bruice TC, Gregory JJ, Walters SL. Reactions of tetranitromethane. 1. Kinetics and mechanism of nitration of phenols by tetranitromethane. J. Am. Chem. Soc. 1968;90:1612–1614.
40. Wong PSY, van der Vliet A. Diazonium salts as specific reagents and probes of protein conformation. Methods Enzymol. 2002;359:399–410.
41. Monteiro HP. Signal transduction by protein tyrosine nitration: competition or cooperation with tyrosine phosphorylation-dependent signaling events? Free Radic. Biol. Med. 2002;33:765–773.
42. Koeck T, Stuehr DJ, Aulak KS. Mitochondria and regulated tyrosine nitration. Biochem. Soc. Trans. 2005;33:1399–1403.
43. Ischiropoulos H, Gow A. Pathophysiological functions of nitric oxide-mediated protein modifications. Toxicology 2005;208:299–303.
44. McFarlane AS. Efficient trace-labelling of proteins with iodine. Nature 1958;182:53.
45. Helmkamp RW, Goodland RL, Bale WF, Spar IL, Mutschler LE. High specific activity iodination of gamma-globulin with iodine-131 monochloride. Cancer Res. 1960;20:1495–1500.
46. Samols E, Williams HS. Trace-labelling of insulin with iodine. Nature 1961;190:1211–1212.
47. Helmkamp RW, Contreras MA, Izzo MJ. I131-labeling of proteins at high activity level with I131Cl produced by oxidation of total iodine in NaI131 preparations. Int. J. Appl. Radiat. Isot. 1967;18:747–754.
48. Hunter WM, Greenwood FC. Preparation of iodine-131 labelled human growth hormone of high specific activity. Nature 1962;194:495–496.



















49. Fraenkel-Conrat H, Olcott HS. Reaction of formaldehyde with proteins. 6. Cross-linking of amino groups with phenol, imidazole, or indole groups. J. Biol. Chem. 1948;174:827–843.
50. Metz B, Kersten GFA, Baart GJE, de Jong A, Meiring H, ten Hove J, van Steenbergen MJ, Hennink WE, Crommelin DJA, Jiskoot W. Identification of formaldehyde-induced modifications in proteins: Reactions with insulin. Bioconjug. Chem. 2006;17:815–822.
51. Metz B, Kersten GFA, Hoogerhout P, Brugghe HF, Timmermans HAM, de Jong A, Meiring H, ten Hove J, Hennink WE, Crommelin DJA, et al. Identification of formaldehyde-induced modifications in proteins: Reactions with model peptides. J. Biol. Chem. 2004;279:6235–6243.
52. Joshi NS, Whitaker LR, Francis MB. A three-component Mannich-type reaction for selective tyrosine bioconjugation. J. Am. Chem. Soc. 2004;126:15942–15943.
53. Romanini DW, Francis MB. Attachment of peptide building blocks to proteins through tyrosine bioconjugation. Bioconjug. Chem. In Press.
54. Kodadek T, Duroux-Richard I, Bonnafous JC. Techniques: oxidative cross-linking as an emergent tool for the analysis of receptor-mediated signaling events. Trends Pharmacol. Sci. 2005;26:210–217.
55. Fancy DA. Elucidation of protein-protein interactions using chemical cross-linking or label transfer techniques. Curr. Opin. Chem. Biol. 2000;4:28–33.
56. Fancy DA, Kodadek T. Chemistry for the analysis of protein–protein interactions: rapid and efficient cross-linking triggered by long wavelength light. Proc. Natl. Acad. Sci. U.S.A. 1999;96:6020–6024.
57. Amini F, Kodadek T, Brown KC. Protein affinity labeling mediated by genetically encoded peptide tags. Angew. Chem. Int. Ed. 2001;41:356–359.
58. Denison C, Kodadek T. Toward a general chemical method for rapidly mapping multi-protein complexes. J. Proteome Res. 2004;3:417–425.
59. Bitan G, Teplow DB. Rapid photochemical cross-linking-a new tool for studies of metastable, amyloidogenic protein assemblies. Acc. Chem. Res. 2004;37:357–364.
60. Meunier S, Strable E, Finn MG. Crosslinking of and coupling to viral capsid proteins by tyrosine oxidation. Chem. Biol. 2004;11:319–326.
61. Tilley SD, Francis MB. Tyrosine-selective protein alkylation using pi-allylpalladium complexes. J. Am. Chem. Soc. 2006;128:1080–1081.
62. Stubbe J, Nocera DG, Yee CS, Chang MCY. Radical initiation in the class I ribonucleotide reductase: long-range proton-coupled electron transfer? Chem. Rev. 2003;103:2167–2201.
63. Patchornik A, Lawson WB, Witkop B. Selective cleavage of peptide bonds. 2. The tryptophyl peptide bond and the cleavage of glucagon. J. Am. Chem. Soc. 1958;80:4747–4748.
64. Savige WE, Fontana A. Modification of tryptophan to oxindolylalanine by dimethyl sulfoxide-hydrochloric acid. Methods Enzymol. 1977;47:442–453.
65. Ramachandran LK, Witkop B. N-bromosuccinimide cleavage of peptides. Methods Enzymol. 1967;11:283–299.
66. Fontana A. Modification of tryptophan with BNPS-skatole (2-(2-nitrophenylsulfenyl)-3-methyl-3-bromoindolenine). Methods Enzymol. 1972;25:419–423.
67. Yamakura F, Ikeda K. Modification of tryptophan and tryptophan residues in proteins by reactive nitrogen species. Nitric Oxide 2006;14:152–161.







68. Koshland DE, Karkhanis YD, Latham HG. Environmentally-sensitive reagent with selectivity for tryptophan residue in proteins. J. Am. Chem. Soc. 1964;86:1448-1450.
69. Strohalm M, Kodicek M, Pechar M. Tryptophan modification by 2-hydroxy-5-nitrobenzyl bromide studied by MALDI-TOF mass spectrometry. Biochem. Biophys. Res. Commun. 2003;312:811-816.
70. Barman TE, Koshland DE. A colorimetric procedure for quantitative determination of tryptophan residues in proteins. J. Biol. Chem. 1967;242:5771-5776.
71. Dasgupta BR, Rothstein E, Boroff DA. Method for quantitative determination of free and peptide-linked tryptophan after reaction with 2-hydroxy-5-nitrobenzyl bromide. Anal. Biochem. 1965;11:555-565.
72. Amutha B, Khire JM, Khan MI. Active site characterization of the exo-N-acetyl-beta-D-glucosaminidase from thermotolerant Bacillus sp NCIM 5120: involvement of tryptophan, histidine and carboxylate residues in catalytic activity. Biochim. Biophys. Acta 1999;1427:121-132.
73. Ghosh AK, Naskar AK, Sengupta S. Characterisation of a xylanolytic amyloglucosidase of Termitomyces clypeatus. Biochim. Biophys. Acta-Protein Struct. Mol. Enzymol. 1997;1339:289-296.
74. Wink MR, Buffon A, Bonan CD, Valenzuela MA, Sarkis JJF, Battastini AMO. Effect of protein-modifying reagents on ecto-apyrase from rat brain. Int. J. Biochem. Cell Biol. 2000;32:105-113.
75. Foettinger A, Melmer M, Leitner A, Lindner W. Reaction of the indole group with malondialdehyde: application for the derivatization of tryptophan residues in peptides. Bioconjug. Chem. 2007;18:1678-1683.
76. Foettinger A, Leitner A, Lindner W. Selective enrichment of tryptophan-containing peptides from protein digests employing a reversible derivatization with malondialdehyde and solid-phase capture on hydrazide beads. J. Proteome Res. 2007;6:3827-3834.
77. Antos JM, Francis MB. Selective tryptophan modification with rhodium carbenoids in aqueous solution. J. Am. Chem. Soc. 2004;126:10256-10257.
78. Bao ZJ, Wang SJ, Shi W, Dong SY, Ma HM. Selective modification of Trp19 in beta-lactoglobulin by a new diazo fluorescence probe. J. Proteome Res. 2007;6:3835-3841.
79. Gross E. The cyanogen bromide reaction. Methods Enzymol. 1967;11:238-255.
80. Yankeelov JA. Modification of arginine by diketones. Methods Enzymol. 1972;25:566-579.
81. Edman P. Method for determination of the amino acid sequence in peptides. Acta Chem. Scand. 1950;4:283-293.
82. Sereda TJ, Mant CT, Quinn AM, Hodges RS. Effect of alpha-amino group on peptide retention behavior in reversed-phase chromatography: determination of the pKa values of the alpha-amino group of 19 different N-terminal amino-acid-residues. J. Chromatogr. 1993;646:17-30.
83. Geoghegan KF, Stroh JG. Site-directed conjugation of nonpeptide groups to peptides and proteins via periodate-oxidation of a 2-amino alcohol: application to modification at N-terminal serine. Bioconjug. Chem. 1992;3:138-146.
84. Chen JK, Lane WS, Brauer AW, Tanaka A, Schreiber SL. Biased combinatorial libraries: novel ligands for the Sh3 domain of phosphatidylinositol 3-kinase. J. Am. Chem. Soc. 1993;115:12591-12592.
85. Tam JP, Yu QT, Miao ZW. Orthogonal ligation strategies for peptide and protein. Biopolymers 1999;51:311-332.

86. Li XF, Zhang LS, Hall SE, Tam JP. A new ligation method for N-terminal tryptophan-containing peptides using the Pictet-Spengler reaction. Tetrahedron Lett. 2000;41:4069-4073.
87. Dixon HBF. N-terminal modification of proteins: a review. J. Protein Chem. 1984;3:99-108.
88. Wu P, Brand L. N-terminal modification of proteins for fluorescence measurements. Methods Enzymol. 1997;278:321-330.
89. Gilmore JM, Scheck RA, Esser-Kahn AP, Joshi NS, Francis MB. N-terminal protein modification through a biomimetic transamination reaction. Angew. Chem. Int. Ed. 2006;45:5307-5311.
90. Christman KL, Broyer RM, Tolstyka ZP, Maynard HD. Site-specific protein immobilization through N-terminal oxime linkages. J. Mater. Chem. 2007;17:2021-2027.
91. Scheck RA, Francis MB. Regioselective labeling of antibodies through N-terminal transamination. ACS Chem. Biol. 2007;2:247-251.
92. Dawson PE, Muir TW, Clark-Lewis I, Kent SBH. Synthesis of proteins by native chemical ligation. Science 1994;266:776-779.
93. Hofmann RM, Muir TW. Recent advances in the application of expressed protein ligation to protein engineering. Curr. Opin. Biotechnol. 2002;13:297-303.
94. Tolbert TJ, Wong CH. Intein-mediated synthesis of proteins containing carbohydrates and other molecular probes. J. Am. Chem. Soc. 2000;122:5421-5428.
95. Ottesen JJ, Huse M, Sekedat MD, Muir TW. Semisynthesis of phosphovariants of Smad2 reveals substrate preference of activated T-beta-RI kinase. Biochemistry 2004;43:5698-5706.
96. Cotton GJ, Ayers B, Xu R, Muir TW. Insertion of a synthetic peptide into a recombinant protein framework: a protein biosensor. J. Am. Chem. Soc. 1999;121:1100-1101.
97. Cotton GJ, Muir TW. Generation of a dual-labeled fluorescence biosensor for Crk-II phosphorylation using solid-phase expressed protein ligation. Chem. Biol. 2000;7:253-261.
98. Otomo T, Ito N, Kyogoku Y, Yamazaki T. NMR observation of selected segments in a larger protein: central-segment isotope labeling through intein-mediated ligation. Biochemistry 1999;38:16040-16044.
99. Nilsson BL, Hondal RJ, Soellner MB, Raines RT. Protein assembly by orthogonal chemical ligation methods. J. Am. Chem. Soc. 2003;125:5268-5269.
100. Wu B, Chen JH, Warren JD, Chen G, Hua ZH, Danishefsky SJ. Building complex glycopeptides: development of a cysteine-free native chemical ligation protocol. Angew. Chem. Int. Ed. 2006;45:4116-4125.
101. Prescher JA, Bertozzi CR. Chemistry in living systems. Nat. Chem. Biol. 2005;1:13-21.
102. Wang L, Schultz PG. Expanding the genetic code. Chem. Commun. 2002;1-11.
103. Kiick KL, Tirrell DA. Protein engineering by in vivo incorporation of non-natural amino acids: control of incorporation of methionine analogues by methionyl-tRNA synthetase. Tetrahedron 2000;56:9487-9493.
104. Mahal LK, Yarema KJ, Bertozzi CR. Engineering chemical reactivity on cell surfaces through oligosaccharide biosynthesis. Science 1997;276:1125-1128.
105. Saxon E, Bertozzi CR. Cell surface engineering by a modified Staudinger reaction. Science 2000;287:2007-2010.
106. Kho Y, Kim SC, Jiang C, Barma D, Kwon SW, Cheng JK, Jaunbergs J, Weinbaum C, Tamanoi F, Falck J, Zhao YM. A tagging-via-substrate technology for detection and proteomics of farnesylated proteins. Proc. Natl. Acad. Sci. U.S.A. 2004;101:12479-12484.











107. Yin J, Liu F, Li XH, Walsh CT. Labeling proteins with small molecules by site-specific posttranslational modification. J. Am. Chem. Soc. 2004;126:7754-7755.
108. Chen I, Howarth M, Lin W, Ting AY. Site-specific labeling of cell surface proteins with biophysical probes using biotin ligase. Nat. Methods 2005;2:99-104.
109. Hage DS, Wolfe CAC, Oates MR. Development of a kinetic model to describe the effective rate of antibody oxidation by periodate. Bioconjug. Chem. 1997;8:914-920.
110. Cornish VW, Hahn KM, Schultz PG. Site-specific protein modification using a ketone handle. J. Am. Chem. Soc. 1996;118:8150-8151.
111. Jencks WP. Studies on the mechanism of oxime and semicarbazone formation. J. Am. Chem. Soc. 1959;81:475-481.
112. Prescher JA, Dube DH, Bertozzi CR. Chemical remodelling of cell surfaces in living animals. Nature 2004;430:873-877.
113. Kolb HC, Finn MG, Sharpless KB. Click chemistry: diverse chemical function from a few good reactions. Angew. Chem. Int. Ed. 2001;40:2004-2021.
114. Nilsson BL, Kiessling LL, Raines RT. Staudinger ligation: a peptide from a thioester and azide. Org. Lett. 2000;2:1939-1941.
115. Saxon E, Armstrong JI, Bertozzi CR. A traceless Staudinger ligation for the chemoselective synthesis of amide bonds. Org. Lett. 2000;2:2141-2143.
116. Lemieux GA, de Graffenried CL, Bertozzi CR. A fluorogenic dye activated by the Staudinger ligation. J. Am. Chem. Soc. 2003;125:4708-4709.
117. Prescher JA, Dube DH, Lo A, Bertozzi CR. Noninvasive imaging of glycosylation in vivo. Glycobiology 2005;15:1187-1187.
118. Lin FL, Hoyt HM, van Halbeek H, Bergman RG, Bertozzi CR. Mechanistic investigation of the Staudinger ligation. J. Am. Chem. Soc. 2005;127:2686-2695.
119. Huisgen R. 1,3-Dipolar Cycloaddition Chemistry, Volume I. Padwa A, ed. 1984. Wiley, New York.
120. Rostovtsev VV, Green LG, Fokin VV, Sharpless KB. A stepwise Huisgen cycloaddition process: copper(I)-catalyzed regioselective ligation of azides and terminal alkynes. Angew. Chem. Int. Ed. 2002;41:2596-2599.
121. Torne CW, Christensen C, Meldal M. Peptidotriazoles on solid phase: [1,2,3]-triazoles by regiospecific copper(I)-catalyzed 1,3-dipolar cycloadditions of terminal alkynes to azides. J. Org. Chem. 2002;67:3057-3062.
122. Wang Q, Chan TR, Hilgraf R, Fokin VV, Sharpless KB, Finn MG. Bioconjugation by copper(I)-catalyzed azide-alkyne [3 + 2] cycloaddition. J. Am. Chem. Soc. 2003;125:3192-3193.
123. Sen Gupta S, Kuzelka J, Singh P, Lewis WG, Manchester M, Finn MG. Accelerated bioorthogonal conjugation: a practical method for the ligation of diverse functional molecules to a polyvalent virus scaffold. Bioconjug. Chem. 2005;16:1572-1579.
124. Link AJ, Tirrell DA. Cell surface labeling of Escherichia coli via copper(I)-catalyzed [3 + 2] cycloaddition. J. Am. Chem. Soc. 2003;125:11164-11165.
125. Gauchet C, Labadie GR, Poulter CD. Regio- and chemoselective covalent immobilization of proteins through unnatural amino acids. J. Am. Chem. Soc. 2006;128:9274-9275.
126. Deiters A, Cropp TA, Mukherji M, Chin JW, Anderson JC, Schultz PG. Adding amino acids with novel reactivity to the genetic code of Saccharomyces cerevisiae. J. Am. Chem. Soc. 2003;125:11782-11783.
127. Speers AE, Adam GC, Cravatt BF. Activity-based protein profiling in vivo using a copper(I)-catalyzed azide-alkyne [3 + 2] cycloaddition. J. Am. Chem. Soc. 2003;125:4686-4687.


128. Rodionov VO, Fokin VV, Finn MG. Mechanism of the ligand-free Cu-I-catalyzed azide-alkyne cycloaddition reaction. Angew. Chem. Int. Ed. 2005;44:2210-2215.
129. Agard NJ, Prescher JA, Bertozzi CR. A strain-promoted [3 + 2] azide-alkyne cycloaddition for covalent modification of biomolecules in living systems. J. Am. Chem. Soc. 2004;126:15046-15047.
130. Agard NJ, Baskin JM, Prescher JA, Lo A, Bertozzi CR. A comparative study of bioorthogonal reactions with azides. ACS Chem. Biol. 2006;1:644-648.
131. Hooker JM, Esser-Kahn AP, Francis MB. Modification of aniline containing proteins using an oxidative coupling strategy. J. Am. Chem. Soc. 2006;128:15558-15559.
132. Mehl RA, Anderson JC, Santoro SW, Wang L, Martin AB, King DS, Horn DM, Schultz PG. Generation of a bacterium with a 21 amino acid genetic code. J. Am. Chem. Soc. 2003;125:935-939.

Further Reading
Evans MJ, Cravatt BF. Mechanism-based profiling of enzyme families. Chem. Rev. 2006;106:3279-3301.
Francis MB. New chemical tools for protein modification. In: Chemical Biology. Schreiber SL, Kapoor T, Wess G, eds. 2007. Wiley-VCH, Weinheim, Germany. pp. 593-635.
Hermanson GT. Bioconjugate Techniques. 1996. Academic Press, San Diego, CA.
McMurry J, Begley TP. The Organic Chemistry of Biological Pathways. 2005. Roberts and Company, New York.
Niemeyer CM, ed. Bioconjugation Protocols: Strategies and Methods (Methods in Molecular Biology). 2004. Humana Press, Totowa, NJ.

See Also
Chemical Modification of Proteins
Amino Acids, Chemical Properties of
Enzyme Catalysis, Chemical Strategies for
Proteins, In Vivo Chemical Modifications of
Peptides, Chemistry of