Professional Documents
Culture Documents
com
Nonribosomal peptide synthetases (NRPSs) are large posed of an array of distinct modular sections (Figure 1b),
multimodular biocatalysts that utilize complex regiospecific each of which is responsible for the incorporation of one
and stereospecific reactions to assemble structurally and defined monomer into the final peptide product. The
functionally diverse peptides that have important medicinal identity and order of a module in an assembly line
applications. During this ribosome-independent peptide specifies: first, the sequence of monomer units activated
synthesis, catalytic domains of NRPS select, activate or modify and incorporated; second, the chemistry that occurs at
the covalently tethered reaction intermediates to control the each way station in the assembly line; and third, the
iterative chain elongation process and product release. Recent length and functionality of the product released from
advances in structural elucidation of domains, didomains, and the distal end of the assembly line (Figure 1b) [4].
an entire termination module revealed valuable insights into the The modules can be further divided into catalytic
mechanism of nonribosomal synthesis and are highlighted domains. Three domains are ubiquitous in NRP synthesis
herein. and essential for peptide elongation. The domains are
responsible for the activation of the amino acid (adenyla-
Address tion (A) domains), the propagation of the growing peptide
Philipps-University Marburg, Department of Chemistry/Biochemistry, chain (thiolation or peptidyl carrier protein (PCP)
Hans-Meerwein-Strasse, D-35032 Marburg, Germany domains), and the condensation of the amino acids (con-
Corresponding author: Marahiel, Mohamed A densation (C) domains). A fourth essential NRPS cataly-
(marahiel@staff.uni-marburg.de) tic unit associated with product release is the thioesterase
(TE) domain. The TE domain is located in the termin-
ation module and catalyzes peptide release by either
Current Opinion in Structural Biology 2010, 20:234–240 hydrolysis or macrocyclization [5].
This review comes from a themed issue on
Macromolecular assemblages During the last decade each domain structure was deter-
Edited by Edward H Egelman and Nenad Ban mined by either crystal or NMR structure elucidation
(Figure 1c); however, the dissected domain structures
Available online 10th February 2010
only gave little knowledge of the mechanism underlying
0959-440X/$ – see front matter the domain interaction during NRP synthesis [6].
# 2010 Elsevier Ltd. All rights reserved. Recently, different conformations and structural states
DOI 10.1016/j.sbi.2010.01.009
of PCP and A domains were observed. The implications
regarding the reaction cycles of PCP and A domains are
discussed herein. In addition, during the last three years,
it was possible to obtain high-resolution NMR and crystal
Introduction structures of didomains and even an X-ray structure of an
Nonribosomal peptides (NRPs) built a large pool of entire NRPS termination module, by trapping the most
biologically active natural compounds. The spectrum of flexible domain, the PCP domain in a certain state. This
the clinical applications of NRPs is broad, for example review features the structural insights into modular NRP
they are used as last resort antibiotics (daptomycin), synthesis by detailing the recent results in multidomain
antitumor (bleomycin), or antifungal drugs or as immu- structures. The possibility how these findings might assist
nosuppressants (cyclosporin) (Figure 1a) [1]. This diverse future NRPS re-engineering experiments as well as what
bioactivity can be explained by the way how nature experiments remain to be done to fully understand the
synthesizes these molecules. NRPs are produced in the mechanisms underlying NRP synthesis are discussed in
secondary metabolism of bacteria and fungi by the con- the last section of this article.
secutive condensation of amino acids, which is achieved
by large multimodular enzymes, nonribosomal peptide Reaction cycles of peptidyl carrier and
synthetases (NRPSs) [1,2]. Notably, this process is not adenylation domains
limited to the 20 proteinogenic amino acids. Some 500 Extensive NMR and X-ray structure elucidation exper-
different monomers, including nonproteinogenic amino iments provided insights into the PCP and A domain
acids, fatty acids, and a-hydroxy acids, have been ident- reaction cycles and are depicted in Figure 2. In the first
ified as building blocks for NRPs [3]. The nonproteino- step of the PCP cycle, the carrier (80 aa’s), which is
genic building blocks contribute to structural versatility of responsible for the transportation, propagation, and pres-
NRPs and are likely to contribute substantially to the entation of the aminoacyl or peptidyl substrates of the
observed biological activity. In brief, NRPSs are com- growing NRP chain, is primed post-translationally with its
Figure 1
(a) Examples of nonribosomally assembled, clinically approved peptides. Next to the compound the names (first line), the trade names (second line),
and the biological activity are given (third line). (b) Simplified mechanism of nonribosomal peptide (NRP) synthesis. (1) The amino acid is activated as
aminoacyl-AMP by the adenylation domain. (2) Transfer of the amino acid onto the PCP domain. (3) Condensation of PCP-bound amino acids. (4)
Possibility of amino acid modifications, for example by epimerization domains. (5) Transesterification of the peptide chain from the terminal PCP onto
the TE domain. (6) TE catalyzed product release by either hydrolysis or macrocyclization. The number of modification domains and modules is very
variable. (c) Examples of single domain structures [20,26,45,46]. PDB accession codes are given in parenthesis.
40 -phospopantetheine (ppant) cofactor [7]. The priming mately 80% of coenzyme A (CoA), the precursor of the
reaction, in which the ppant-arm is attached covalently to ppant cofactor, is acetylated in bacteria [10], thus mis-
a highly conserved serine residue (GGXS-motif), is car- priming events during the Sfp-mediated PCP phospan-
ried out by ppant-transferases such as the promiscuous tetheinylation occur quite regularly, which interrupt the
Bacillus subtilis ppant-transferase Sfp [8]. The PCP NRP assembly (Figure 2). These dead end misprimed
domain exists in three different and converging confor- PCP species are repaired by the action of so-called type II
mations: the apo (A), the A/H, and the holo (H) state. For thioesterases (TEII) [11]. The NMR structure of the
the apo-PCP, the A and A/H conformations coexist and in TEII of the surfactin biosynthesis cluster has been elu-
the case of holo-PCP the presence of H and A/H states cidated recently and gave valuable insights into the
were identified by NMR studies of an excised PCP mechanism of the crucial NRPS repair enzyme [12].
domain (TycC3–PCP) [9]. Thus, the common state for These studies revealed that the TEII interacts specifi-
apo and holo is the A/H state. It was shown that during the cally with the misprimed PCP domain in its H state. In
priming reaction Sfp exclusively interacts with the A-state addition, it was shown that only small molecules (e.g.
to generate the ppant-primed PCP domain. Approxi- acetyl groups) are cleaved of in order to regenerate the
Figure 2
PCP and adenylation domain cycle. The apo-PCP domain (left half, green) is converted to the holo-form by a ppant-transferase (e.g. Sfp, left half, cyan)
by ppant attachment to the active site serine. Mispriming, for example with acetyl-ppant is repaired by the thioesterase type II (left half, orange). The
correctly primed holo-PCP domain interacts with the A domain cycle where it gets loaded with an amino acid. After translocation of the amino acid by
condensation, the holo-PCP can be loaded again. In the A domain cycle (right half), the small C-terminal subdomain (brown) of the A domain (red)
traverses several states. In an open conformation, the A domain is able to bind the amino acid and ATP. The adenylation intermediate then generates
the aminoacyl-AMP and pyrophosphate is cleaved off. The aminoacyl-AMP intermediate is protected from bulk solvent in a closed formation, which
likely facilitates transfer onto the PCP domain (thiolation).
thiol group of the ppant-arm of the holo-PCP. After holo- aa’s, Figure 2, illustrated in red) and a small C-terminal
PCP generation either by direct ppant-transfer or by subdomain (Asub, 100 aa’s, Figure 2, illustrated in
regeneration of misprimed PCPs, the cross-section be- brown), which are connected by a small 5–10 residues
tween the PCP and the A domain cycle is reached and the hinge. The core and subdomain organization of A
PCP is ready for amino acid loading (Figure 2). domains is highly conserved even in other adenylate
generating enzymes, such as acetyl-CoA synthase or fire-
In brief, before addressing the A domain cycle, some fly luciferase (for a comprehensive review on adenylating
background information on A domains: adenylation enzymes, see Gulick [19]). In a recent structure report of
domains are highly selective gatekeeper enzymes for DltA [20], a D-alanine activating adenylation enzyme,
incoming monomeric NRP building blocks. Pioneering extensive manual modeling of subdomain rearrangement
work from the late 1990s on a large number of adenylation was performed to generate a potential cycle of the DltA
domains, in which biochemical, structural, and phylo- catalyzed reaction. This model cycle is also likely to be
genic studies were combined, revealed the so-called applicable to NRPS A domains and hence presented in
specificity-conferring code of A domains [13]. This code Figure 2. The cycle starts with an open conformation of
is composed of 10 amino acids, which, by sequence the A domain (as observed for firefly luciferase [21]), in
alignment with the first A domain structure PheA [14], which Asub is bend away from the active site. It is
are responsible for substrate binding within the active site proposed that this open structure is able to bind the
of A domains. Subsequent bioinformatic studies allowed amino acid and ATP. Upon substrate docking (aa,
the prediction of the substrate amino acids directly from Mg2+, and ATP), an adenylating intermediate is formed,
the primary sequence. Different online tools for A domain as suggested from the crystal structure of DhbE in the
specificity prediction are available [15,16] and might presence of 2,3-dihydroxybenzoate and AMP [22]. This
facilitate the isolation of predicted NRPS products in rearrangement of the subdomain is also supported by
genome mining approaches [17,18]. Each A domain steady-state and presteady-state kinetic experiments on
comprises a large N-terminal core domain (Acore, 450 PheA in the presence of substrate and ATP [23]. Cleavage
of the phosphoester bond with concomitant aminoacyl [27], are connected only by a small hinge region at the
adenylate formation and pyrophosphate release would bottom of the V and by a bridging loop in the middle,
lead to a closed conformation, as observed in the DltA which results in a canyon-like active site structure with
structure [20]. In this state the aminoacyl-AMP is pro- the catalytically active H224 (Figure 3a, shown in yellow)
tected from bulk solvent. Then the activated amino acid in the middle. The V-shaped fold allows the two
is transferred onto the PCP by a yet unknown mechanism, upstream and downstream PCP domains to position
representing the thioester-forming step. Release of the the condensation substrates at each opening (acceptor
loaded PCP domain, the thiolation product of the ade- and donor sites). In the didomain the PCP’s active site
nylation domain complete reaction, would accompany a serine (site of ppant attachment) is rotated away from the
rotation of Asub back to the open conformation (1408). C domain’s active site histidine by approximately 47 Å
After restoring the PCP domain by translocation of the (Figure 3a) [24]. The length of the ppant-arm (not pre-
amino acid by condensation, the combined thiolation– sent is the structure) is 20 Å, that is in this orientation
adenylation cycle could start again (Figure 2). the PCP–C interaction is unproductive. One could
speculate for possible explanation for this orientation:
PCP–C and PCP–TE didomain structures first, the orientation is caused by crystal packing forces
In 2007 the X-ray structure of a PCP–C didomain [24] leading to a nonphysiological architecture; second, the X-
from the third subunit of the tyrocidine synthetase [25] ray snapshot could be an intermediate state between
(TycC5–PCP–TycC6–C) was determined at 1.8 Å resol- catalytic cycles; third, it reflects a state in which the
ution (Figure 3a). During tyrocidine biosynthesis the PCP is oriented backwards to the upstream C domain
ultimate C domain (TycC6–C) catalyzes the last peptide (not present in the structure). In the latter case, the PCP
bond formation between the TycC6–PCP-bound Leu domain must undergo enormous, yet not impossible,
and a nonapeptide bound to the penultimate carrier rearrangements during one round of catalysis to serve
domain (TycC5–PCP). It is essential that the involved all essential domains. Until the structure of a C–A–PCP–
PCP domain is able to reach all domains necessary for C multidomain has not been solved, these possibilities
peptide elongation, that is upstream and downstream remain speculative. A second didomain structure [28]
condensation domains and the upstream A domain. Crys- was released in 2008, which features the solution NMR
tals of the didomain enzyme were grown from hom- structure of the apo-peptidyl carrier protein–thioesterase
ogenous apo-PCP containing protein solution. In the (PCP–TE) fragment of the enterobactin synthetase
crystal structure, the PCP domain (Figure 3a, green) EntF [29] NRPS subunit. To assure a homogeneous
was found in a fold similar to the A/H state described apo-PCP–TE didomain the active site Ser 48 was
in the TycC3–PCP NMR studies [9]. The C domain has replaced by Ala (S48A, Figure 3b). This apo species
V-shaped architecture as previously observed in the free- represents the physiological structure before CoA
standing C domain VibH [26] (Figure 1c). Within this V- addition. The S48A PCP active site is positioned
shaped structure the two equally big subdomains, which 18 Å from the TE active site Ser 180, that is in reachable
belong to the chloramphenicol acetyltransferase fold distance of the 20 Å ppant-arm. The PCP and TE
Figure 3
Didomain structures. (a) X-ray structure of the PCP–C didomain [24] of the fifth PCP and sixth C domain of the tyrocidine NRPS subunit TycC. The
active site serine 43 (shown as orange spheres) of the PCP domain (green) is located 47 Å away from the active site histidine 224 (shown as yellow/blue
spheres) of the C domain (gray). The sites where the peptidyl substrate (from the PCP domain in the structure) is donated and where the growing chain
is accepted by the downstream PCP domain are labeled accordingly. (b) NMR structure of the PCP–TE didomain [28] of the termination module of
the enterobactin NRPS subunit EntF. The PCP active site serine was mutated to alanine (S48A). The distance between S48A and the active serine 180
of the TE domain is 18 Å. The catalytic triad of the TE domain is highlighted in spheres (H313–D207–S180). Interdomain linkers are colored in blue.
domains have a well-defined, but also flexible, relative domains have to be shuffled together in NRPS re-engin-
orientation with a primarily hydrophobic interface of eering approaches in order to create a functional assem-
approximately 1300 Å2 [28]. This strong interaction bly line [39]. In this structure, the adenylation domain is
might explain the need to shuffle PCP–TE fragments positioned in a conformation similar to the adenylate-
together in previous NRPS re-engineering efforts forming conformation (Figure 2). The C-terminal sub-
[30,31]. domain is rotated away from the N-terminal domain by
approximately 408 compared to the PheA structure [14],
Structure of a NRPS termination module resulting in a more open active site [32]. The PCP
Recently the crystal structure of the four domains con- domain and the Asub domain have little interaction with
taining termination module of the surfactin NRPS the platform and thus can move to adopt the required
(SrfA–C) was determined (Figure 4) [32]. Within the conformation during the catalytic cycle, as the active
SrfA–C structure, the individual domains show the same sites of the A and the C domains are separated by
folds as observed previously for the dissected domains approximately 60 Å. In the observed state for the PCP
(Figure 1c) [33,34]. It was necessary to mutate the ppant domain, it is impossible to cover this distance, as the
binding serine to alanine within the PCP domain to ppant-arm is only 20 Å in length. The regions, which
obtain a uniform apo population of protein and to reduce could be reached in the SrfA–C structure by the ppant
the mobility of domains during crystallization. The cofactor are covered by a gray sphere in Figure 4. The
structural core of the SrfA–C module is a compact PCP domain of the SrfA–C protein is positioned to
rectangular catalytic platform, built by the C domain interact productively with the upstream condensation
and the major part of the A domain (Acore) with both domain. The domain is likely to be positioned in the
active sites arrayed on the same site of the platform acceptor site of the C domain, as the catalytic center of
leading to an interaction surface of more than 1600 Å2 the PCP domain is only 16 Å (Figure 4) from the active
[32]. The C–Acore interface can be seen as a stable site histidine of the C domain. The PCP active site
workbench similar to the KS-(M)AT-platform in the serine is not positioned, where it could donate the
condensing wing of the fatty acid synthase [35,36] or pantetheine arm to either the adenylation domain or
the KS-AT-platform in type I polyketide synthases the thioesterase domain [32]. Therefore large struc-
[37,38]. In addition, the strong interaction between tural rearrangements are mandatory during a full cata-
the Acore and the C domain explains why these two lytic cycle.
Figure 4
Back view or acceptor site view of the termination module of the surfactin NRPS subunit SrfA–C. The 20 Å radius, which is the reachable region of the
carrier-arm, around the ppant-attachment site of the PCP domain (S1003A) is depicted as a gray sphere. The PCP domain is well positioned to interact
with the H144 active site of the C domain. The active sites of the TE (S1117) and the A domain substrate leucine cannot be reached. The active site of
each domain is shown in spheres. Linker regions are shown in blue, the domains are arranged and colored corresponding to the schematic illustrations
in upper left corner.
20. Yonus H, Neumann P, Zimmermann S, May JJ, Marahiel MA, By extensive crystallization efforts, the authors succeeded to crystallize
Stubbs MT: Crystal structure of DltA. Implications for the an entire termination module of an NRPS assembly line. The structure of
reaction mechanism of non-ribosomal peptide synthetase the 144 kDa big and 1274 amino acid long Bacillus subtilis termination
adenylation domains. J Biol Chem 2008, 283:32484-32491. module SrfA–C was solved at 2.6 Å resolution. It was found that the
In this article featuring the structure of the D-alanine activating domain adenylation and condensation domains associate closely to form a
DltA, the authors manually modeled different conformational states of catalytic platform, with their active sites on the same side of the platform.
adenylation domains. In this work the adenylation domain cycle (see
Figure 2) was proposed. 33. Marahiel MA, Essen LO: Chapter 13. Nonribosomal peptide
synthetases mechanistic and structural aspects of essential
21. Conti E, Franks NP, Brick P: Crystal structure of firefly luciferase domains. Methods Enzymol 2009, 458:337-351.
throws light on a superfamily of adenylate-forming enzymes.
Structure 1996, 4:287-298. 34. Koglin A, Walsh CT: Structural insights into nonribosomal
peptide enzymatic assembly lines. Nat Prod Rep 2009, 26:987-
22. May JJ, Kessler N, Marahiel MA, Stubbs MT: Crystal structure of 1000.
DhbE, an archetype for aryl acid activating domains of
modular nonribosomal peptide synthetases. Proc Natl Acad Sci 35. Maier T, Leibundgut M, Ban N: The crystal structure of a
U S A 2002, 99:12120-12125. mammalian fatty acid synthase. Science 2008, 321:1315-1322.
23. Stevens BW, Lilien RH, Georgiev I, Donald BR, Anderson AC: 36. Jenni S, Leibundgut M, Boehringer D, Frick C, Mikolasek B, Ban N:
Redesigning the PheA domain of gramicidin synthetase leads Structure of fungal fatty acid synthase and implications for
to a new understanding of the enzyme’s mechanism and iterative substrate shuttling. Science 2007, 316:254-261.
selectivity. Biochemistry 2006, 45:15495-15504.
37. Tang Y, Kim CY, Mathews II, Cane DE, Khosla C: The 2.7-
24. Samel SA, Schoenafinger G, Knappe TA, Marahiel MA, Essen LO: Angstrom crystal structure of a 194-kDa homodimeric
Structural and functional insights into a peptide bond-forming fragment of the 6-deoxyerythronolide B synthase. Proc Natl
bidomain from a nonribosomal peptide synthetase. Structure Acad Sci U S A 2006, 103:11124-11129.
2007, 15:781-792.
38. Crawford JM, Korman TP, Labonte JW, Vagstad AL, Hill EA,
25. Mootz HD, Marahiel MA: The tyrocidine biosynthesis operon of Kamari-Bidkorpeh O, Tsai SC, Townsend CA: Structural basis for
Bacillus brevis: complete nucleotide sequence and biosynthetic programming of fungal aromatic polyketide
biochemical characterization of functional internal cyclization. Nature 2009, 461:1139-1143.
adenylation domains. J Bacteriol 1997, 179:6843-6850.
39. Doekel S, Gal M, Gu JQ, Chu M, Baltz RH, Brian P: Non-ribosomal
26. Keating TA, Marshall CG, Walsh CT, Keating AE: The structure of
peptide synthetase module fusions to produce derivatives of
VibH represents nonribosomal peptide synthetase
daptomycin in Streptomyces roseosporus. Microbiology 2008,
condensation, cyclization and epimerization domains. Nat
154:2872-2880.
Struct Biol 2002, 9:522-526.
27. Leslie AG: Refined crystal structure of type III chloramphenicol 40. Liu Y, Bruner SD: Rational manipulation of carrier-domain
acetyltransferase at 1.75 A resolution. J Mol Biol 1990, 213:167- geometry in nonribosomal peptide synthetases. Chembiochem
186. 2007, 8:617-621.
28. Frueh DP, Arthanari H, Koglin A, Vosburg DA, Bennett AE, 41. Strieker M, Nolan EM, Walsh CT, Marahiel MA: Stereospecific
Walsh CT, Wagner G: Dynamic thiolation-thioesterase synthesis of threo- and erythro-beta-hydroxyglutamic acid
structure of a non-ribosomal peptide synthetase. Nature 2008, during kutzneride biosynthesis. J Am Chem Soc 2009,
454:903-906. 131:13523-13530.
This work, which accompanies Ref. [12], completes the current view of
the mechanisms of NRPS thioesterases. The PCP–TE didomain, which 42. Hur GH, Meier JL, Baskin J, Codelli JA, Bertozzi CR, Marahiel MA,
was examined here, forms a compact but dynamic structure with a well- Burkart MD: Crosslinking studies of protein–protein
defined domain interface and gives insights into the termination steps of interactions in nonribosomal peptide biosynthesis. Chem Biol
NRP synthesis. 2009, 16:372-381.
29. Rusnak F, Sakaitani M, Drueckhammer D, Reichert J, Walsh CT: 43. Kainosho M, Torizawa T, Iwashita Y, Terauchi T, Mei Ono A,
Biosynthesis of the Escherichia coli siderophore enterobactin: Guntert P: Optimal isotope labelling for NMR protein structure
sequence of the entF gene, expression and purification of determinations. Nature 2006, 440:52-57.
EntF, and analysis of covalent phosphopantetheine.
Biochemistry 1991, 30:2916-2927. 44. Neylon C: Small angle neutron and X-ray scattering in
structural biology: recent examples from the literature. Eur
30. Baltz RH: Biosynthesis and genetic engineering of lipopeptide Biophys J 2008, 37:531-541.
antibiotics related to daptomycin. Curr Top Med Chem 2008,
8:618-638. 45. Weber T, Baumgartner R, Renner C, Marahiel MA, Holak TA:
Solution structure of PCP, a prototype for the peptidyl carrier
31. Baltz RH: Molecular engineering approaches to peptide, domains of modular peptide synthetases. Structure 2000,
polyketide and other antibiotics. Nat Biotechnol 2006, 24:1533- 8:407-418.
1540.
46. Bruner SD, Weber T, Kohli RM, Schwarzer D, Marahiel MA,
32. Tanovic A, Samel SA, Essen LO, Marahiel MA: Crystal structure Walsh CT, Stubbs MT: Structural basis for the cyclization of the
of the termination module of a nonribosomal peptide lipopeptide antibiotic surfactin by the thioesterase domain
synthetase. Science 2008, 321:659-663. SrfTE. Structure 2002, 10:301-310.