Professional Documents
Culture Documents
I. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
...................................................................... 6
rase Inhibitors ................................................. 6
gs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
C. HIV-1 Protease Inhibitors .................................................... 12
1. Design of Nonpeptide Cyclic Ureas that are Potent, Bioavailable Inhibitors
of HIV Protease . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
2. Optimization of a Coumarin Lead from Random Screening I .............. 15
3. Optimization of a Coumarin Lead from Random Screening I1 . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
4. Optimization of a Lead Obtained by Screening Renin Inhibitors . . . . . . . . . . . . . . . . . . . . . . . . 18
5. Optimization of a Penicillin Lead from Random Screening . . . . . . . . . . . . . . . . . . . . . . . 20
6. Optimization of the C-Terminus of a Known Inhibitor . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
7. Design of a Macrocyclic Inhibitor Based on a Known Protease Inhibitor 25
D. NEP Inhibitors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
E. Dual Inhibitors of ACE and NEP .............................................. 29
F. PNP Inhibitors . . . . ................................ 32
G. Sialidase Inhibitors ......................................................... 34
H. Thymidylate Synth ...................... 35
1. Optimization of ors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
2. De novo Design of Thymidylate Synthase Inhibitors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
3. Discovery of Thymidylate Synthase Inhibitors Aided by 3-D Database Searching . . . . . . . . . 38
111. Discussion . . . . . . . . . . . . . . . . . . . . ............................ 39
A. Optimization of Lead Compounds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
1. Improvement of Complementarity to the Binding 41
2. Improvement of Conformational Properties . . . . . 42
3. Improvement of Bioavailability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
B. Finding New Leads . . . . .................................................. 42
1. High Throughput Scr 42
2. Computer-Based Screening of Structural Databases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
3. Design of New Leads . . . . . . . . . . . . . . . . .................... 44
4. Potency Prediction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
IV. Conclusions . . . . . . . . . . . . . . . . . . . . . . ........................... 46
References .................................................................... 47
I. INTRODUCTION
The conceptual basis for structure-based drug design was formulated 100 years ago by
Emil Fischer.1 His "lock and key" hypothesis is a constantly recurring leitmotif in mod-
ern drug design, and yet, it is only recently that we have been in possession of detailed
descriptions of the "locks", i.e., the biochemical targets to which potent and selective
*To whom all correspondence should be addressed.
We dedicate this review to the memory of Dr. George deStevens who encouraged us to write it and who
was extremely supportive during its preparation.
Medicinal Research Reviews, Vol. 16, No. 1, 3-50 (1996)
0 1996 John Wiley & Sons, Inc. CCC 0198-6325/96/010003-48
4 BOHACEK, MCMARTIN, AND GUIDA
drugs (the "keys") must fit. Success has been achieved in the past by exploring the shape
of the "keys," i.e., by structure/activity analysis of a series of experimental test ligands.
However, this approach can yield only limited information about the "lock," and a large
amount of work is required to achieve useful results by such experimental trial and error.
Three-dimensional structures obtained by X-ray crystallography and NMR spectroscopy
contain a quantity of information that is orders of magnitude greater than that which can be
obtained by building pharmacophores, even when a well studied set of ligands is available.
This increased information introduces the possibility of directly designing drugs based on a
detailed model of the target binding site, a possibility that raises a number of questions:
In spite of initial skepticism, the design of novel, potent therapeutic agents based
upon 3-dimensional structural information is now a reality. With increasing frequency,
the structure of the target binding site or that of a closely related analog is available prior
to embarking upon a drug design project. Thus, structure-based drug design (i.e., the
design of drugs based on the 3-dimensional structure of a biomacromolecule) has
emerged as a powerful tool for drug discovery. Recent reviews*-4 have presented numer-
ous examples in which the utility, not simply the potential, of structure-based drug
design has been amply demonstrated. An account of this activity has even appeared in
the popular literature.5
Of course, one must ask, how does one measure the success of a project involving
structure-based drug design? One measure might be a reduction in the number of
compounds synthesized and tested in order to produce a clinical candidate. Often,
however, the mere existence of a candidate drug in human clinical trials is taken as an
end-point for structure-based drug design, which some believe is proof of the validity of
Regine S. Bohacek received her Ph. D. in Physical Chemistry from Rutgers University in 1987. Her thesis
dealt with a configurational analysis of a synthetic polymer using the statistical mechanical methods developed by
Flory. Shehasapplied thesemethods in acomputerprograinforde novostructure-baseddrugdesign. Dr. Bohacek
is currently engaged in computer-aided moleculnr design at Ciba in Summit, N . 1.Her interests include: methods
forestimatingenzyme inhibitor bindingenergies, thedesign of new drugs based on the3-dimensional structureof
binding sites or inhibitor templates, and zinc metallo-proteases and their inhibitors.
Colin McMartin is a senior scientist in the Biophysical Chemistry group at Ciba in Summit, N . 1. where
he is engaged in the development and application of computer methods for drug design. Previously, he led a
group of biologists and biochemists investigating pharmacokinetics, metabolism, and bioavailability of pep-
tides at Ciba in Horsham, Sussex, U . K. He received his B. A. in Chemistry with honors from Cambridge
University in the U . K. His interests include: development of new structure prediction methods, methods for
estimating enzyme inhibitor binding energies, and the design of new drugs based on the three-dimensional
structure of binding sites andlor inhibitor templates.
Wayne C. Guida is Executive Director of Core Drug Discovery Technologies at Ciba in Summit, N . J.
Previously, he had been an Associate Professor of Chemistry at Eckerd College in St. Petersburg, Florida. In
1985-1986, while on sabbatical leave, he was a Senior Research Fellow at Columbia University. He is one of
the co-authors of the MacroModel molecular modeling program, which was developed in Professor Clark Still's
laboratory at Columbia. Prior to joining the faculty at Eckerd College, he received his Ph. D. degree in Organic
Chemistryfrom the University of South Florida and was a post-doctoral fellow at Duke University. His recent
research has focused on structure-based drug design and computational methods for conformational analysis.
STRUCTURE-BASED DRUG DESIGN 5
Generate Structure
Purified Protein
3-D Database
Libraries Compound
Searching
1 I
Combinatorial
Drug Design
Libraries
in viva in vivo
Potency
Combinatorial
Drug Design
Libraries
1
[Drug)
Figure 1. Drug discovery and design flow chart.
this novel approach. Although we would argue whether this is the appropriate end-
point, it is a fact that the number of new chemical entities in the clinic for which
structure-based drug design has played a key role continues to grow.5 The list includes
medications for T-cell lymphoma, psoriasis, tumor suppression, viral infection (includ-
ing HIV), and glycoma.
It is clear that structure-based drug design plays an important role in drug discovery.
In fact, a new paradigm for drug discovery and design is beginning to emerge in which
contributions from rational drug design (which includes structure-based methods), tra-
ditional screening and synthesis, and the newer combinatorial approaches are coupled.
In Fig. 1, we illustrate the synergy among these various techniques. This review will
demonstrate that structure-based drug design has been successfully used at every step,
from lead finding to final optimization, in the discovery of novel drugs. For example,
searching of 3-dimensional databases for molecules that can dock to a target binding site
has already contributed to lead finding. The optimization of these and other leads has
been accomplished using structure-based methodology. Moreover, structure-based drug
design techniques have contributed to in vivo potency optimization by serving as a guide
for those modifications to a molecule that might enhance bioavailability without destroy
ing in vitro potency. In addition, new methods are now emerging that will make even
6 BOHACEK, MCMARTIN, AND GUIDA
11. EXAMPLES
A. Carbonic Anhydrase Inhibitors (Merck)
Recently reported work on inhibitors of carbonic anhydrase illustrates the iterative use
of experimental and computational techniques.3 The results indicate the importance of
conformational strain energy for determining free energies of binding.
The conversion of carbon dioxide to bicarbonate by carbonic anhydrase is a critical
step in the secretion of aqueous humor. Thus, inhibitors of this enzyme can lower ocular
pressure and may be of benefit in glaucoma. The starting point for the further develop-
ment of compounds was MK-927. This compound has high water solubility and an
appropriate partition coefficient to enable rapid penetration into the ocular tissues after
topical application.
X-ray structures for complexes of carbonic anhydrase I1 with the two enantiomers of
STRUCTURE-BASED DRUG DESIGN 7
MK-927
MK-927 have been determined, and this was the starting point for compound optimiza-
tion. The S-enantiomer (MK-417) has a Ki of 0.61 nM and is about 100 times more potent
than the R-enantiomer (Ki = 71 nM). Since the two isomers will have the same solvation
energies, it is clear that this difference must result from differences in conformational
energy of the structure docked in the enzyme binding site. The docking geometries of
the two isomers are shown in Fig. 2. This figure shows how the shape of the binding site
strongly determines the docking geometries. The sulfonamide is rigidly locked in a very
small pocket at the back of the site. The ring system and side chain lie across a larger
cavity, but the positions of the sulfone sulfur and oxygen atoms and the iso-butyl methyl
groups are strongly determined by way they fit with precision into cusp-like niches in
the cavity as indicated by the accessible surface (see Fig. 2).
Before designing compounds, the experimental data was subjected to computational
analysis and an hypothesis was formed that would explain observed differences in
potency. While the contacts of the two inhibitors with the enzyme are rather similar, the
internal geometries have small but energetically significant differences. Energy differ-
ences due to internal geometry were computed using ab initio molecular orbital quantum
R-enantiomer
S-enantiomer(MK-417)
Ki = 71 nM
Ki = 0.61 nM
Figure 2. Stereo images illustrating the docking of two isomers of MK-927; (A) S-enantiomer in green; (8)
R-enantiomer (pink) and S-enantiomer (green). The figure was prepared for illustration purposes by docking
the compounds into a structure of a carbonic anhydrase complex obtained from the Brookhaven Protein Data
Bank.12.13 The mesh in this and other figures shows an accessible surface and indicates where ligand atoms in
good van der Waal's contact with the binding site should lie.I4 Mesh colors show the ideal properties of ligand
atoms: yellow: hydrophobic; red: hydrogen-bond accepting; blue: hydrogen-bond donating. For accurate pic-
tures of the X-ray results for the MK-927 compound, the reader is referred to the original study.3
STRUCTURE-BASED DRUG DESIGN 9
TABLE I
Computed
Ki Dihedral Angle Energy Difference
Compound nM T (degrees) (Kcal/mol)
mechanics calculations (3-21 G*, Gaussian 88). The dihedral angles of the bond connec-
ting the sulfonamide group to the thiophene ring ( N - M - S ) were 150" for the
S-enantiomer and 170" for the R-enantiomer, the difference resulting in an energy for the
S isomer that was more favorable by about 1 kcal/mol. A similar energy difference was
found comparing the anti geometry for the N-C bond of the iso-butyl side chain of the
S isomer to the gauche geometry of the R enantiomer. The combined energy difference of
approximately 2 Kcal/mol accounts for an appreciable part of the observed potency ratio
of 1OO:l.
This conclusion is important in a general sense, since there has been some controversy
over whether strain energies should be considered as significant determinants of poten-
cy. This study is unusual, since other factors that might influence potency are relatively
10 BOHACEK, MCMARTIN, AND GUIDA
similar for the two molecules. Examples where it is possible to cleanly isolate one of the
multiple factors that determine potency and study it with reliable experimental and
computational methods are fairly rare but essential for the further development of de-
pendable structure-based design methodology (see Discussion).
Both isomers are docked in a pseudo-axial conformation. Ab initio quantum mechanics
calculations show that this conformation has an energy of about 1 Kcal/mol higher than
the pseudo-equatorial conformation. Ab initio quantum mechanics calculations also
showed that if a methyl group is introduced in the 6 position, the conformational
preference is removed. This modification was, therefore, made and, at the same time, to
conserve the physicochemical properties of the molecule, two methyl substituents were
removed from the iso-butyl side-chain.
The methyl group in the 6 position (Table I) introduces a second chiral center resulting
in four possible stereoisomers, all of which were synthesized and assayed. The com-
pounds differed in potency, the most potent offering a modest improvement over the
starting structure. X-ray analysis showed that each compound had a slightly different
docked conformation, the major differences being in the N-M--S dihedral angle.
The data is summarized in Table I.
Once again, evaluation of conformational strain energy by ab initio quantum mechan-
ics calculations provided an explanation for the difference in potencies. The two trans
isomers had a calculated energy difference of 1.5 kcal/mol due to changes in the NSCS
dihedral, which could account for most of the 20-fold difference in potency. The cis
isomers had a smaller difference in potency and the calculated energy difference was
1.0 kcal/mol.
Further series of compounds were synthesized to explore the effects of varying chain
length at the 6 position and the nitrogen. The results have been reviewed previously.3
Although the gain in potency reported in the carbonic anhydrase work is modest, the
results show the value of repeated experimental structure determinations at each cycle of
drug optimization for confirming hypotheses and further extending our knowledge of
the factors that determine potency. The use of optical antipodes is very elegant, because
it eliminates factors between the molecules such as solvation energy and, thus, focuses
attention on differences in docking mode and internal conformational energy.
1 CyclosporinA
been shown to be similar to the bound conformation of FK506.25 Apparently, the bound
conformation of FK506 and its close analogs must preexist to some extent in aqueous
solution. Based on the X-ray crystal structure of a CsA/Fab complex26 and kinetic evi-
dence,27 it has been suggested that a similar situation occurs for cyclosporin.
The conformationally restricted cyclosporin analog described here was carefully de-
signed so that any modifications would not interfere with either the binding to cyclo-
philin or calcineurin. The residues in contact with cyclophilin were known from the
X-ray structure of CsA bound to cyclophilin. Those residues that do not affect binding to
calcineurin had previously been identified by the synthesis of cyclosporin analogs. The
modifications of cyclosporin that do not affect immunosupressive activity28,29 are in-
ferred to be in positions not interfering with the binding to calcineurin. Therefore, the
segments of CsA chosen for alteration were those not in contact with cyclophilin and
those that can be modified without loss of immunosupressive activity and, thus, pre-
sumed not to be in contact with calcineurin. Two residues of CsA, L-Ala7-D-Alas, met
these criteria and were selected for modification. The conformation of the backbone of
these residues is significantly different in the bound and solution conformations. The
program CAVEAT30 from Cambridge Structural Database31 was used to identify a rigid
core structure that would fix the positions of the amine substituent and a carboxyl
moiety of L-Ala7-D-Ala8 in the same relative orientations as found in the cyclophilin
bound structure. CAVEAT suggested a bicyclic heterocycle (2), and this heterocycle was
incorporated into cyclosporin A resulting in TCsA (3).1
The binding affinity of the conformationally restrained analog (3), TCsA, to cyclophilin
(Ki = 2 nM) was three times better than that of cyclosporin (1,K i = 6 nM). Since this
modified portion of the molecule is not expected to bind to cyclophilin, it is hypothe-
sized that the increase in binding energy is entirely due to entropic effects. The affinity
12 BOHACEK, MCMARTIN, AND GUIDA
Y
H
3 TCsA
exceed 200. Molecular modeling has been used extensively in an attempt to discover
drugs effective in halting this dreaded viral infection.
In this review, we will discuss only a few of the most recent examples (up to November
1994) illustrating some of the different techniques that have been successfully imple-
mented in this area.
TABLE I1
HIV Protease Inhibitors - DuPont-Merck
6 4.7
7 0.31
0.27
14 BOHACEK, MCMARTIN, AND GUIDA
cyclic inhibitors, it would be possible to design smaller inhibitors that would retain
potency but display improved oral biooavailability. From the extensive HIV inhibitor
SAR, it was known that C2-symmetric diols imparted significant potency. Therefore,
scientists at DuPont-Merck also wanted to include this feature in their design. Because
there was no crystallographic information about how diols bind to the protease when the
project was initiated, they used the X-ray structure of an hydroxyethylamine inhibitor
bound to HIV-1 protease33 and used the distance geometry program DGEOMN to devel-
op several pharmacophores. From this model, they constructed a simple 3-dimensional
database query specifying the positions of hydrophobic groups that bind to the S1 and
S1' subsites of the enzyme and either one or two hydrogen bond donors or acceptors to
bind to the catalytic aspartates. This query was used to search a subset of the Cambridge
Structural Database,31 which had been converted for use with MACCS-3D,35 yielding
structure 4 (Table 11). MACCS3D allows the user to search a 3-dimensional database to
locate structures that meet user-defined geometrical constraints. Not only did structure 4
show that a tetra-substituted benzene ring could properly position substituent groups to
interact with aspartates 25 and 25', it also possessed an additional oxygen that matched
the position of the key structural water molecule. However, because a benzene ring
might not be the ideal scaffold to properly position its substituents to reach the enzyme's
hydrophobic pockets, molecular modeling was used to optimize the type of ring in a
stepwise fashion from a benzene ring, to a substituted cyclohexanone, to the larger
cycloheptanone, and finally to a 7-membered cyclic urea, 5. Additional modeling was
performed to determine the ideal stereochemistry and conformation for substituents
attached to the cyclic urea with and without substituents on the nitrogens. The confor-
mational preferences were later confirmed from small molecule X-ray crystallography.
The actual potencies of cyclic ureas with various stereochemistry correlated well with
those predicted by the model. In the final structure, the N-substituted cyclic urea has
substituents with stereochemistry predicted to be optimal by the model: 4R, 5S, 6S, and
7R. This structure appears to be an ideal, rigid scaffold to position groups into the S1,
Sl', S2, and S2' pockets as well as to place the hydroxyl groups to interact with the
catalytic aspartates. In addition, the model showed the urea carbonyl oxygen occupying
the position of the key structural water molecule. As predicted by the model, compound
6 binds to the enzyme with high affinity ( K ; = 4.7 nM).
However, compound 6 exhibited only modest antiviral activity. Molecular modeling
indicated that there was room for a larger group in the S2/S2' subsite and, therefore,
naphthalene substituents were introduced. This compound, 7, showed a 10-fold im-
provement in potency ( K ; = 0.31 nM) probably due to the increased hydrophobic con-
tacts. Unfortunately, this enhanced enzyme inhibitory activity did not translate into
improved antiviral activity, presumably because of the lipophilicity of the naphthalene
groups. Additional modeling suggested the incorporation of a hydrogen bonding group.
The naphthalene rings were replaced by p-hydroxymethylphenyl substituents resulting
in compound 8. Compound 8 combines excellent potency against HIV protease ( K , =
0.27 nM), and against viral replication, with good oral bioavailability. This compound is a
specific HIV protease inhibitor showing no inhibition of renin, pepsin, or cathepsin D.
This specificity is hypothesized to stem in part from the observation that these enzymes
do not have a structural water analogous to that found in HIV protease and mimicked by
cyclic urea 8. The X-ray crystal structures of ten cyclic ureas complexed to HIV protease
have been determined. The crystal structure of compound 7 complexed to HIV protease
was refined to the highest resolution (1.8 A, final R factor = 19.5%), and is depicted in
STRUCTURE-BASED DRUG DESIGN 15
TABLE I11
HIV Protease Inhibitors - Upjohn
~~
Compound Potency
10 K, = 1 pM
11 Ki = 0.5 p M
OH \
12
K, = 0.038 p M
OH \
the article.32 The X-ray structure shows that 7 forms interactions with the enzyme pre-
dicted by molecular modeling.
Analysis of this enzyme inhibitor complex showed that the position of the benzene
ring of the coumarin moiety does not readily allow for the addition of substituents that
will extend into the S2' subsite. Therefore, the fused benzene ring was expunged and
side chains were added to the remaining 4-hydroxy-2-pyrone ring yielding compound
11. The X-ray crystal structure of 11 complexed to HIV protease was determined to 2.3 A
resolution and showed that, despite replacement of the coumarin benzene ring, the
hydrogen bonding pattern of 11 remained the same as that of compound 10. Also, as
anticipated, the phenethyl group at C-6 was located near the S2' subsite. Next, an ethyl
group was added at the position alpha to C-6 (compound 12). The ethyl group was
designed to occupy the S1' pocket and to move the phenyl ring even closer into the S2'
subsite. This compound can be viewed as having pseudosymmetric substitutions with
the four substituents occupying the S2, S1, Sl', and S2' pockets. These modifications led
to substantial improvements in the binding affinity. Compound 12 was found to inhibit
HIV-1 protease with a K , of 38 nM. This compound is a mixture of four stereoisomers, all
of which were isolated. All were found to inhibit HIV, the best with a K j of 14 nM.
Compound 12 also inhibits HIV-2 ( K j = 32 nM), has excellent properties such as selec-
tivity for HIV protease, antiviral activity in HIV-lIIIBinfected MT4 and H9 cells, good oral
bioavailability, and is synthesized in three chemical steps from readily available starting
materials. Compound 12, U-96988, has entered phase I clinical trials.
TABLE IV
HIV Protease Inhibitors - Parke-Davis/NCI
13 &.XI 2.3
14 1.7
15 0.52
STRUCTURE-BASED DRUG DESIGN 17
TABLE V
HIV Protease Inhibitors - Parke-Davis/NCI
Compound Ki (CLM)
on
16 1.1
17 0.7
?"
0.051
hydroxyl group and polar enzyme atoms. Using GRID, the position of a water molecule
to mediate a hydrogen bond between the hydroxyl and the carbonyl of Gly148 could be
identified. In conclusion, the authors indicated that the particular binding mode of this
lead made it difficult to extend the structure to form additional favorable contacts with
the enzyme.
More recently, another study from Parke-Davis and NC140 described the optimization
of structure 16 (Table V), another compound identified by random screening. To predict
the binding mode of compound 16, a number of docking experiments were conducted
using molecular dynamics as implemented in the program SYBYL (Version 5.5, available
from Tripos Associates, St. Louis, MO) and the Monte Carlo-based docking program,
Autodock.38 In order to optimize the position of the P1’ phenyl group to better fit into
the S1‘ enzyme subsite, analogs with 1-3 carbons between the sulfur and the benzene
ring were synthesized. A 2-3 fold enhancement in binding affinity was observed. Com-
pound 17, with a one carbon spacer, has a Ki of 0.7 pM. The binding mode of 17 was also
studied using the molecular modeling techniques mentioned above. Subsequently, the
X-ray structure of 17 complexed to the enzyme was determined and confirmed the
modeling results. The lactone moiety in the pyrone structure was found to displace
water molecule-301 (the critical water molecule usually found in the crystal structure of
HIV protease/inhibitor complexes). The lactone formed hydrogen bonds with Ile50 and
Ile150. The enol hydroxyl interacted with the two catalytic aspartic acid residues, Asp25
and Asp125.
Subsequent molecular modeling studies indicated that a tether could be added to the
6-aryl ring to reach the S1 subsite. Therefore, an analog with a OCH,COOH substituent at
the para position of the benzene ring located at the 6 position of the pyrone ring, which
would position a negatively charged group to interact with Arg 8, was designed and
synthesized. This compound, 18, was found to inhibit HIV protease with a Kiof 0.051 KM.
TABLE VI
HIV Protease Inhibitors - Merck
H
19 I 1.0
N
L364,505 Boc-PhePhr'
7 0
20 0.03
d
21 0.3
L685,4324
22
Ro-31-8959
23 7.8
24 0.56
L-735,524
20 BOHACEK, MCMARTIN, AND GUIDA
tion was found, and compound 23 was synthesized. Compound 23 was found to inhibit
HIV with an IC,, of 7.8 nM, but was weak in inhibiting the spread of viral infections in
the cellular assay. However, because 23 possessed a favorable pharmacokinetic profile,
further optimization was carried out to modify the physical properties while maintaining
potency. The result of this optimization was L-735,524 (24), a compound undergoing
extensive human clinical trials (Phase 11). The crystal structure of a closely related analog
of 24 complexed with HIV protease was determined to 2.15 A resolution and confirmed
the modeling studies.
TABLE VII
HIV Protease Inhibitors - Glaxo
Compound Potency
26
(continued)
STRUCTURE-BASED DRUG DESIGN 21
TABLE VII
(Continued)
Compound Potency
29 IC, = 0.082 FM
30 IC, = 1.9 pM
31 IC, = 4.6 p M
K, = 0.25 nM
32 IC, = 3.8 nM
22 BOHACEK, MCMARTIN, AND GUIDA
weight and, hence, an improved in uivo profile. Therefore, monomeric analogs of com-
pound 25 were designed.
The monomer of 25, compound 26, was found to be inactive against the enzyme, and
scientists at Glaxo speculated that at least one hydroxyl group to interact with the
catalytic aspartic acids and a hydrophobic group to bind to the S1 subsite would be
necessary for activity. A series of compounds that incorporated these groups was syn-
thesized resulting in compound 27 (1C5, = 0.15 pM). The significant difference in poten-
cy between 27 and 28, which differed only in the configuration of the P1' benzyl group,
was rationalized by inspection of the model. The "active" R enantiomer showed that the
benzyl group could fit into the S1' pocket, whereas for the "inactive" S isomer this was
not possible, These modeling results were later confirmed by the X-ray structure of 27
bound to the enzyme determined to 2.2 A r e s ~ l u t i o n . ~ ~
Inspection of the crystal structure of the analog of 25 complexed to HIV protease
suggested the importance of inhibitor carboxamide carbonyl oxygens in the center of the
molecule, which hydrogen bond with H,O 301, which in turn hydrogen bonds with the
NH of Ile 50 and Ile 50'. Compounds 27 and 28 cannot fully partake of these interactions,
since they have only one carbonyl group in this part of the binding site. From the crystal
structure, it was apparent that an additional carbonyl group separated by a 4-atom
spacer would be ideal. Therefore, a statine isostere was incorporated into the computer
model of 27. The carbonyl group was positioned in the model to form the desired
hydrogen bonds and the second amide group was adjusted to form potential hydrogen
bonds with the amide NH and the carbonyl of Gly27'. The structure was then subjected
to energy minimization in the static enzyme structure, which included the crystal-
lographic waters using the Insight/Discover molecular modeling software (available
from Biosym Technologies, Inc., San Diego, CA). Subsequently, compounds 29 and 30,
where synthesized and, 29, with an IC,, = 0.082 pM, did exhibit improved potency.
From the model, it was further postulated that a lipophilic amine could reach the S2'
subsite. A series of compounds were synthesized with a variety of hydrophobic groups
in the P2' position. The best of these were 31 and 32. Although both are potent HIV
inhibitors with an IC,, of 4.6 nM and 3.8 nM, respectively, only compound 31 exhibited
good antiviral activity in vim. Compound 31 had excellent selectivity, thus achieving the
goal of a potent, low-molecular-weight HIV protease inhibitor.
These compounds are interesting in that they differ from the statine analogs. Statine
analogs have the S stereochemistry at the asymmetric center from which the P1 side
chains are attached. In compounds 31 the attachment is in the R configuration and the
lipophilic group binds not to the S1 subsite but to the S1' subsite. The crystal structure of
STRUCTURE-BASED DRUG DESIGN 23
TABLE VIII
HIV Protease Inhibitors - Glaxo
Compound
0
33, 34 3.1 pM
1.5 pM
22
RO-31-8959
35 9.0 nM
JG-365
Ac-Ser-Leu-AsnHH
OH
0 -He-Val-OMe
36 23 nM
32 complexed with HIV protease was subsequently determined and "confirms the for-
mation of the major interactions designed to optimize potency."48
Although very potent, these compounds displayed a poor pharmacokinetic profile and,
therefore, further optimization was undertaken.49The first targets of this new study were
compounds 33 and 34 (Table VIII). The modeling of these compounds was based on the
X-ray structure of a closely related analog of the earlier penicillin derivative, 25, complexed
to HIV protease. In addition, the position of the proline amide was modeled using the P1'
proline of JG365(35)50fromthe structure of JG365bound to synthetic HIV-1 pr0tease.~3The
biological results for 33 and 34 were disappointing (IC50of 3.1 and 1.5 pM, respectively)
when compared to JG365 (35), which has an IC,, of 0.5 nM, indicating that the interactions
that 33 and 34 make with the enzyme must be far from optimal.
Increasing the ring size of the P1' group by substituting a pipecolic acid for the proline did
not result in the expected increase in potency. Modeling the pipecolic acid analogs in the
manner described above did reveal that the pipecolic acid did not form any major
hydrophobic interactions with the enzyme. The decahydroisoquinoline moiety in the P1'
site suggested by the Roche inhibitor, Ro-31-8959, (22), was incorporated resulting in
24 BOHACEK, MCMARTIN, AND GUIDA
compound 36. In the model, improved hydrophobic interactions were observed between
the decahydroisoquinoline group and the enzyme, which did result in a substantial
increase in potency. Compound 36 was found to inhibit HIV protease with an IC,, of 23 nM.
Compounds 33, 34 and 36 were also modeled using the binding mode of the Roche
inhibitor, 22, as described in the communication of Krohn and co-workers.51 As in the
case when the JG365 structure was used as a template, the corresponding groups of 36
and the Roche inhibitor formed similar interactions with the enzyme. Unfortunately, co-
crystallization studies with these types of inhibitors were unsuccessful and, therefore,
the modeling results could not be verified.
Although further optimization was carried out using molecule modeling, compound
36 remained the most potent of the series in the enzyme and cellular assays. Unfor-
tunately, the pharmacokinetic profile of 36 was still poor, and the further development of
the penicillin-derived HIV protease inhibitors was abandoned.
TABLE IX
HIV Protease Inhibitors - Agouron
Compound K,
0.0018 p M
Q
38 1.67 pM
39
40 0.033 pM
STRUCTURE-BASED DRUG DESIGN 25
for this work was compound 37 (Table IX), a known inhibitor, which was modeled in
HIV using the X-ray structure of HIV-1 protease complexed with the inhibitor, MVT-101.
Using this structure, the C-terminal Val-Val methyl ester, which occupies the S2' and S3'
subsites of the protein, was replaced by a diphenylhydramine amide derivative in which
the two phenyl groups fill the 52' and S3' side-chain pockets. The compound, 38, was
synthesized and found to have a Kiof 1.67 pM. The crystal structure of 38 complexed
with HIV protease was solved. This structure compared favorably with the modeled
structure. A more elaborate inhibitor was designed, 39, which replaces the P3' phenyl
group with an indole ring designed so that the NH moiety of the indole would form a
hydrogen bond with the carbonyl oxygen of Gly 48. This compound, synthesized as a
mixture of diastereomers, was found to be eight times more potent than compound 38
with a Ki of 0.20 pM. Again, the X-ray structure was determined and compared to the
model structure. The indole was found to bind to the S3' subsite; however, the indole
NH did not make the predicted hydrogen bond but instead formed a water-mediated
H-bond to the Gly 27 carbonyl oxygen. At this point, the N-terminal alanines were
replaced by a moiety reported by Roche, i.e., an asparagine-2-quinoline-carbonylsubsti-
tuent.
Further optimization was carried out to see if substituents on the P2' phenyl group
could be designed to use the extra space revealed by the X-ray structure. Placing a
methyl or trifluoromethyl group in the meta position resulted in compound 40, which
showed small enhancements in binding affinity.
Subsequently, in order to test the free-energy perturbation method for computing
binding energies, calculations were performed on these compounds. Good correlation
was found between the calculated results and those experimentally determined.53
TABLE X
HIV Protease Inhibitors - Marion Merrel Dow
41 0.6 p,M
0.
42 5 nM
MDL 73,669
43 20 nM
of a macrocycle, compound 43. This idea had been successfully applied to the design of
macrocyclic potent pepsin inhibitors.58 Compound 43 was modeled in the active site of
HIV protease using the protocol described above, subsequently synthesized, and it was
found to inhibit HIV with a K iof 20 nM. NMR studies were performed to determine if
the macrocyclic portion of a close analog of 43 would show a solution conformation
similar to the bound conformation predicted by the model. Parts of the molecule in the
model of the complex corresponded well to the NMR data, whereas other sections of the
molecule could not be explained by the information determined by NMR studies. In
addition, in uacuo molecular dynamics simulations were performed on 43 in order to
qualitatively define its dynamical behavior. Subsequently, the X-ray crystal structure of
43 complexed to HIV was determined, and it was reported that those parts of the
molecule that had been characterized by NMR and found to be stable in the dynamics
simulations correspond well to X-ray structure, while remaining parts of the molecule
appeared to be a time average of two conformations defined by the dynamics calcula-
tions.
The authors do not address reasons for the lack of significant increase in binding
affinity of the macrocycle vs. the lead compound. We note, however, that Professor
Daniel Rxh has suggested that if a linear inhibitor already spends considerable time in
solution in a conformation close to the binding conformation, then introducing confor-
mational constraints may not lead to large gains in binding energy.59
STRUCTURE-BASED DRUG DESIGN 27
TABLE XI
NEP Inhibitors - Ciba
Compound G o
45
HS.,. ..q? 0 COOH
NEP: 3.0 nM
TLN: 2.5 p M
46 NEP: 0.8 nM
CGS 25155
RCO0"
47 NEP: 1.0 nM
CGS 26670 TLN: 68 nM
HS'
28 BOHACEK, MCMARTIN, AND GUIDA
Figure 3. CGS 25155, 46, an orally active potent NEP inhibitor (IC, = 0.8 nM) docked using molecular
modeling techniques in the binding site of thermolysin.
STRUCTURE-BASED DRUG DESIGN 29
containing 10- to 15-membered rings were first modeled in thermolysin. The model
showed that an 11-membered macrocycle, 47, formed the most favorable interactions
with thermolysin. Compounds with 12- and 13-membered rings also appeared favorable
in the model. The 11-, 12-, and 13-membered macrocycles were subsequently synthe-
sized and found to be potent inhibitors of NEP and thermolysin. In fact, the 11-
membered macrocycle inhibited thermolysin with an IC,, of 68 nM, making it one of the
most potent thiol thermolysin inhibitors yet reported.
Clearly, the ideal situation is to have available the 3-dimensional structure of the actual
target enzyme. However, this example illustrates that the structure of an homologous
enzyme that is sufficiently similar to the actual target can be a useful model for structure-
based drug design.
TABLE XI1
ACEiNEP Inhibitors - Ciba
Compound IC50
48
Initial ACE Template HS q COOH
49
Thiorphan
HS f N -coon ACE: 0.8 pM
NEP: 6 nM
46 ACE: 29.7 pM
CGS 25155 NEP: 0.8 nM
50 ACE: 6.1 nM
Benazaprilat 0 NEP: inactive
0
51 ACE: <50 nM
NEP: 2.7 nM
HS
52 ACE: 40 nM
NEP: 48 nM
HS
COOH
STRUCTURE-BASED DRUG DESIGN 31
A single molecule capable of inhibiting these two enzymes must possess those func-
tional groups required for tight binding to each enzyme. The models predict that the
conformation that an inhibitor must adopt to bind to ACE differs from the one required
for binding to NEP. Thus, an inhibitor must be able to adopt a conformation so that it can
bind to the S1, Sl', and S2' subsites of ACE and to the Sl', S2', and S3' subsites of NEP.
In the investigation at Ciba, inhibitors were proposed possessing conformationally re-
stricted sections designed to bind within certain areas in the binding site of each en-
zyme. These structural units were connected by a flexible hinge, allowing the entire
molecule to adopt conformations complementary to the active site of each enzyme. The
3-dimensional computer models were used as an aid in this design process.
The design strategy is outlined in Table XII. Comparison of two potent NEP inhibitors,
thiorphan 49 and the macrocycle 46, suggests that NEP can tolerate two different posi-
tions for the C-terminal carboxylic acid. The situation is different in ACE; thiorphan is a
moderate inhibitor of ACE; but, the macrocycle has little binding affinity for ACE, indi-
cating that the position of the C-terminal carboxylic acid is much more critical in ACE.
However, there are potent ACE inhibitors such as benazeprilat, 50, in which the zinc
chelating group is further removed from the terminal C-terminal carboxylic acid than in
thiorphan. Therefore, combining thiorphan and benazeprilat led to compound 51. To
further improve the potency of 51, a fused proline residue was incorporated into the
C-terminal portion of the molecule as suggested by the original template (48). Since
P-thiols such as 51 usually display poor pharmacokinetic properties, and we also wanted
to improve the in vivo stability of the thiol, the thiol was moved from the p to the a
position. a-Thiols had been previously shown to inhibit ACE.87 Thus, these two mod-
ifications led to compound 52.When docked and optimized using energy minimization
in the thermolysin model, this molecule formed good interactions with the enzyme
atoms. Thiol 52 could also be readily superimposed onto the ACE inhibitor template.
When synthesized, 52 was found to be a potent inhibitor of ACE (IC5" = 40 nM) and NEP
Figure 4. Tricyclic a-thiol dual ACD/NEP inhibitor, 52, bound to thermolysin. In pink is the conformation
predicted by molecular modeling. In green is the conformation subsequently determined by X-ray crystallogra-
phy. Relevant atoms of the thermolysin binding site are shown in darker colors.
32 BOHACEK, MCMARTIN, AND GUIDA
(IC50= 48 nM) and, to a lesser extent, of thermolysin (IC50= 1.6 pM). Scientists at both
Marion Merrell Dow8* and Bristol Myers Squibb89-91 have also reported on potent dual
ACE/NEP inhibitors containing the a-thiol functionality.
Because no 3-dimensional structure of an a-thiol bound to a zinc metalloprotease had
been reported, there was some concern about the reliabilityof the modeling of compound 52
in thermolysin. Therefore, the X-ray crystal structure of thermolysin complexed with 52
was determined at a resolution of 1.9 A. The close agreement between the conformation
predicted by molecular modeling to that determined experimentally (see Fig. 4) validated
the model and supported the hypothesis for the binding mode of 52 in NEP.M
HOOH
HO O H
+ +
Using the X-ray crystal structure of the human PNP/ guanine complex determined to
2.8 A resolution, it was possible to obtain potent inhibitors for this enzyme. Neverthe-
less, one of the initial obstacles in the design of PNP inhibitors was the inability to
accurately predict the bound geometry of potential inhibitors prior to their synthesis.
Since all of the compounds synthesized in this investigation were guanine analogs, this
"docking" problem was reduced to the determination of the conformational preferences
of groups attached to the purine ring of these analogs in the PNP binding site. By using
Monte Carlo/energy minimization conformational search procedures,75,94 low-energy
conformers of potential inhibitors could be located in the PNP binding site. Synthesis of
the most promising compounds and X-ray crystal structure determination of the com-
plexes confirmed that, in all cases, one of the lowest energy conformers compared
STRUCTURE-BASED DRUG DESIGN 33
Figure 5. Low energy conformers of 9-cyclohexylmethyl-9-deazaguanine in the binding site of PNP. A phos-
phate ion is shown in purple. The pastel-colored molecules show several of the possible docking modes
discovered using a search procedure based on Monte Carlo torsion perturbation with energy minimization.
favorably with the X-ray derived structure. It was found that the Monte Carlo/ energy
minimization method as implemented in the MacroModel/ BatchMin software98 was
useful in the design of novel PNP inhibitors, since it provided a means for estimating
how potential (but unsynthesized) inhibitors might dock to the enzyme binding site.
This information provided a basis for estimating the relative binding affinities of these
inhibitors using the complementarity surface14 (see the Discussion section).
One might assume that a disadvantage of the Monte Carlo/ energy minimization
procedure is that it generates a collection of low-energy conformers rather than a single
conformer (i.e., the structure observed crystallographically).However, this ensemble of
conformers effectively maps the space available to molecules bound to the active site of
an enzyme and has proven to be useful for the design of PNP inhibitors. For example, by
comparing the low energy conformers of 9-cyclohexylmethyl-9-deazaguanine(a rela-
tively potent PNP inhibitor) in the binding site of PNP, the adamantyl analog 9-(2-
adamantylmethyl)-9-deazaguanine was suggested. This molecule was synthesized and
was observed to inhibit PNP with good binding affinity (see Fig. 5).
Recently, the Monte Carlo /energy minimization method has been extended and the
studies mentioned above have been repeated so that orientational as well as conforma-
tional sampling was performed.99 Thus, the molecule is allowed to translate and rotate
during the Monte Carlo step. The sampling is quite efficient, and the geometries located
(from poorly docked starting structures) compare well with crystallographically derived
models. This procedure can be used for the complete docking (translational, rotational,
plus conformational sampling) of ligands in binding sites.
34 BOHACEK, MCMARTIN, AND GUIDA
9-cyclohexylmethyl-9-deazaguanine 9-(2-adamantylmethyl)-9deazaguanine
The PNP design effort resulted in a number of 9-deazaguanine analogs that include
the ones shown below, which possess IC,,s in the 5-50 nM range. Using molecular
modeling techniques, molecules (see below) were designed to take advantage of the
major binding sites in PNP discovered using X-ray crystallography. The 9-deazaguanine
moiety forms key hydrogen bonds within the purine binding site, including one be-
tween Asn-243 and the N-7 hydrogen, which is only present in 9-deaza analogs. Substi-
tuents at R, were designed to fill the hydrophobic binding site and substituents at R,
were designed to fill the hydrophobic binding site and substituents at R, were designed
to interact with the phosphate binding site. By incorporating these design features, some
of the most potent PNP inhibitors yet discovered were produced.
H OH
lent binding affinities. GRID allows the user to explore the binding site of an enzyme
with various “functional groups” in order to locate energetically favorable binding re-
gions for these groups. A carefully parameterized molecular mechanics force field is
used to estimate the binding energy, displayed as contours, of the various probes. Thus,
with the aid of GRID, a protonated primary amine probe was used to suggest that
replacement of a hydroxyl group on the previously known inhibitor Neu5Ac2en (53)by
an amino substituent would yield a more potent analog. This molecule (54) was synthe-
sized and it was found to e an excellent sialidase inhibitor [Ki = 50 nM; A/Tokyo/3/67
(N2) sialidase]. Replacement of the hydroxyl group with a guanidino substituent pro-
vided an even more potent compound 155, Ki = 0.2 nM; A/Tokyo/3/67 (N2) sialidase].
It
H3C- C N -
”
R2
& \
COO‘
R,= H OH
CH20H
This example stresses the utility of the structure-based approach. The availability of
the X-ray crystal structure of sialidase allowed extremely potent inhibitors to be de-
signed in essentially one step using a previously known inhibitor as a lead.
2-03m-u OH
durn
0
-
TS
FH4 -
I
OH
dTMP
TABLE XI11
Thymidylate Synthase Inhibitors - Agouron
Compound Ki
56
COGlu
57 2.2 p M
58
59 0.39 uM
60 13 nM
I
SOzPh
TABLE XIV
Thymidylate Synthase Inhibitors - Agouron
~
Compound Ki
1.6 FM
62 34 nM
63 2 nM
tion with a disubstituted nitrogen (see Table XIV). A benzyl group was used as one of the
substituents and it was substituted at the 4-position with a piperazine sulfonamide in
order to impart water solubility to the molecule. Analysis of the binding site indicated
that the piperazine ring should be oriented toward solvent. Synthesis of compound 61
afforded a TS inhibitor that possessed a K j of 1.6 pM against human TS. X-ray analysis
indicated that it was bound to the enzyme in a fashion similar to the one predicted by the
modeling studies. Improvements in the binding affinity of this lead inhibitor were made
using iterative structure-based drug design techniques to afford compound 62 and 63
with K j s of 34 nM and 2 nM (human TS), respectively.
Phenolphthalein
Sulisobenmne
ICso = 15 PM
TS binding site. The X-ray crystal structure of L. casei TS determined to 2.3 8, resolution
was used for the docking experiments performed with DOCK. The MACCS3D program
was then used to search the Fine Chemicals Directory to locate structures that were
similar to sulisobenzone, one of the molecules that had been found by DOCK. DOCK
was then used to dock and score each of the molecules located by MACCS3D. Using this
methodology, phenolphthalein analogs were found that inhibit TS in the low micromo-
lar range. It was suggested that these compounds could serve as potential leads for the
synthesis of novel, more potent analogs.
Interestingly, when the X-ray structure of the sulisobenzone/TS complex was deter-
mined to 2.5 /! resolution in two different buffer solutions, two different binding modes
were observed, which depended upon the buffer. Both structures differed from the one
suggested by DOCK and revealed an alternate binding region within the active site of
TS. The composite binding region was used to dock and evaluate the compounds found
as a result of the MACCS3D search. When the X-ray structure of the phenolphthalein/TS
complex was determined, it was found that the binding mode of phenolphthalein was
similar to the one suggested by DOCK.
111. DISCUSSION
The examples covered in this and other reviewsz-4 show that structure-based design is
a fertile approach to drug discovery that has enormous potential. It should be pointed
out that the literature in this area is already very extensive, and, as a result, reviews have
not yet appeared that are fully representative of a rapidly growing field. The numerous
examples covered in the present review, drawn only from the most recent literature,
suggest a state of exponential growth for this approach to drug discovery.
It is clear that a large number of successful new approaches are currently being devel-
oped in which structural information is employed for the design of drugs. We have the
impression of an area of science in rapid transition, which offers considerable scope and
opportunity to scientists who are comfortable working at the interface between disci-
plines and have imagination and initiative. At present, there is no single well-beaten
path to success. Rather, there are a number of different ways in which experimental
data, or in some cases hypotheses, about the interactions between potential drugs and
their host binding site are being used to catalyze, inspire, and focus the search for
biologically active compounds. In this sense, the practice of structure-based drug design
is somewhat of an art and we have attempted in this review to provide a flavor of the
state of the art in this endeavor.
40 BOHACEK, MCMARTIN, AND GUIDA
The process of drug discovery can be conveniently divided into two stages: (1) the
discovery of novel lead compounds and (2) the optimization of these compounds. A
suitable lead will typically have a binding affinity in the micromolar range and should
also be chemically accessible so that modifications can readily be made. Formerly, in the
absence of the structure of the bound ligand, the easiest leads to convert to a bioavailable
drug were small and not too flexible. For example, it would have been very difficult to
optimize a linear hexapeptide lead compound. Where high-quality structural informa-
tion is available, this task becomes approachable. Once the structure of the complex of a
ligand with a binding site is known, smaller analogs with very different physicochemical
properties can be designed. Some examples are described in Section IIC of this review.
Lead optimization is usually necessary to obtain compounds that are sufficiently po-
tent and have appropriate bioavailability characteristics. Leads may also require optimi-
zation to improve selectivity, although generally, as more potent drugs with highly
optimized binding characteristics are developed, they can be expected to be more selec-
tive. Since lower doses will be required, potent compounds are less likely to have side-
effects arising from actions unrelated to the mechanism of action.
Although the distinction between lead discovery and lead optimization is somewhat
arbitrary, it will serve in this discussion to distinguish between two different aspects of
structure-based design:
1. The use of structural information to suggest and plan a series of small changes
intended to optimize the structure of a weakly active compound. The structural
information may help to eliminate steps with little chance of success and may
encourage or justify the spending of resources on more difficult chemical syn-
theses of compounds that may have improved properties.
2 . The design or discovery of radically different molecules to provide new leads.
Ideally, these should be capable of optimization to highly potent compounds.
Within the drug industry it is also important that they should have sufficient
novelty to be patentable.
Ideally, computational methods would provide the information currently gained by itera-
tive cycles of synthesis and structure determination. This would greatly accelerate the
optimization process by reducing the number of compounds that need to be synthesized
and the time-consuming process of further structure determinations. There are, in fact, a
considerable number of methods for docking.113 Reasonable results have been obtained
in some cases. However, in general, these methods cannot be relied upon to always
produce the correct result. Despite this drawback, it can be very helpful to perform
docking studies to select promising candidates from a variety of molecules that require
lengthy synthesis. A good example, from our laboratories, of the effective use of a
thorough docking study is work on benzo-fused macrocyclic NEP inhibitors (see Section
IID), where the study helped to decide which of a large series of cyclic structures to
synthesize. A similar example has also been described in Section IIF of this review on
docking of PNP inhibitors. We believe computer docking methods are most reliable
when the binding site is relatively rigid and the docking geometry is largely determined
by steric complementarity. These methods are less effective at predicting the correct
geometry when a number of alternative docking geometries of low energy is possible.
Although experimental docked geometries may differ from those obtained from a
computational study, it is probable that, whenever a computation shows a binding mode
with high steric and chemical complementarity, the molecule will indeed bind. A lower
energy binding mode may be possible, but the computational method can still be ex-
pected to act as an effective filter for active compounds. For example, as described in
Section IIH-3, a molecule (sulisobenzone) selected by DOCK as a candidate for inhibition
of thymidylate synthase was subsequently shown by X-ray crystallography to have a
binding mode different than the one predicted using DOCK. In fact, two binding modes
were found, depending on the buffer, and a composite of both of the complexes was
then used for further database searching resulting in inhibitors with IC,,s in the low
p.M range.
A number of considerations are relevant to the process of optimization. They are
discussed here in approximate order of ease and reliability. In general, the effects of
changes that lower conformational energy, or that avoid bad van der Waals contacts, can
be predicted with reasonable confidence. Those based on nonbonded interactions in-
volving electrostatic and solvation energies are difficult to predict and less accurate.
number of cases, is the application of the program GRID39 to identify and graphically
display positions in the binding site where a variety of functional and hydrophobic
probe groups can be expected to bind (see Sections IIG and IIH-2). Although it can be
argued that these methods are not essential for drug design, one should not underesti-
mate their ability to guide and stimulate the creative process by clearly displaying oppor-
tunities for improving the binding affinity of the ligand.
3 . Improvement of Bioavailability
It may be necessary to change physicochemical properties such as octanol/water parti-
tion coefficient and water solubility in order to improve bioavailability. Usually, it is
important to achieve this with minimal loss in potency. The structure of a docked com-
plex can be used: (a) by designing potency-neutral changes in parts of the molecule in
contact with solvent or (b) selecting parts of the molecule that, although in contact with
the site, do not seem to be important for binding. A number of examples of successful
optimization of bioavailability have been cited in this review (see Section IIC).
Figure 6. Schematic illustration of a search for lead compounds by screening. Molecules are assumed to be
arranged so that similar molecules are close to each other. A subset of molecules containing up to 30 C, N, 0,
and S atoms may have more than 1060 members=. Contours show regions where the molecules are potent
inhibitors for a specific (in this case hypothetical) binding site.
aAlthough the number of possible molecules is difficult to estimate accurately, simple considerations show
that it must be very large! Consider growing a linear molecule an atom at a time and choosing a carbon,
nitrogen, oxygen, or sulfur atom at each position. Some of these atoms can be doubly or triply bonded, but not
all combinations of atoms are chemically stable, and some multiple bonds will only be possible in nonlinear
structures, i.e., a C=O group. Assuming a very approximate average choice multiplicity of 6, then 630 or 2 X
1023 molecules could be grown containing 30 atoms. Now consider the ways of introducing branching or
cyclization into the resulting structure. Closure of rings with three or more atoms involves selecting two atoms
to form a bond and could be achieved in 30'28/2 ways. Making a branched molecule could be achieved by
choosing a point to cut the chain and a point in the first part of the chain to attach to the cut end of the second
part of the chain (i.e., 302 ways). Not all atoms can be joined in this way. However, this will be offset by the fact
that when stereochemical considerations are introduced, the number of possibilities will be expanded. Based
upon these considerations, approximately 1040 molecules with u p to four rings and 10 branch points could be
produced from each linear chain, resulting in a very approximate estimate of 1065 molecules in total. Although
this is a rough estimate, it seems likely that when all the different possible combinations of ring closure and
branching are taken into account, the true number will be well in excess of 1060 and will rise steeply with
increasing molecular weight.
44 BOHACEK, MCMARTIN, AND GUIDA
Pofenf molecules
complemenfary
Per
molecule Zone 01 growth -
molecules wifh high
sferic and chemical
complemenW i l y
to binding site
Figure 7. Algorithm for de nouo drug design. Atom-by-atom growth of molecules, complementary to a target
binding site, searches the universe of organic molecules for potential drugs.
STRUCTURE-BASED DRUG DESIGN 45
other programs produce a wide range of compounds. The latter approach provides the
synthetic chemist with a variety of choices, and enables compounds to be selected based
on synthetic accessibility and on subjective factors such as likelihood of achieving bio-
availability. Of course, rules might be devised that will further filter out the output
structures to avoid difficult syntheses or compounds unlikely to be bioavailable.
An alternative approach to structure-based design is to explore possible ways of con-
necting fragments selected and docked by the user. The NEWLEAD program, for exam-
ple, accomplishes this readily and is easy to use.114
4. Potency Prediction
A major requirement for the effective design and optimization of lead compounds is
the prediction of potency. With a reliable predictor, we will be able to scan the large
number of structures produced by de novo programs and select those having potencies
better than a threshold value. Potency prediction should also increase the probability of
forecasting accurate docking geometries for inhibitors and eliminate some of the itera-
tions in the process of optimization. Indeed, an ultimate goal would be to dispense with
the distinction between lead discovery and optimization, and design potent novel mole-
cules in a single step.
A large number of factors are important for determining the binding free energy,
including:
1. Interaction energies (van der Waals and electrostatic) with the binding site
2. Solvation free energy changes
3. Loss of conformational entropy
4. Internal (strain) energy of the bound molecule
For a given series of molecules in a specific binding site, some of these factors may be
much more important than others. For example, we found that for a series of thermo-
lysin inhibitors, a simple complementarity score, based on the number of hydrogen
bonding and hydrophobic contacts, gave a very high correlation with potency.6~14Al-
though the scoring method is simple, the parameters defining hydrophobic contacts and
hydrogen bonds were critical to the success of the method and were based on a careful
study of complementarity in proteins.
Very different results were obtained in the work, cited earlier on carbonic anhydrase
(Section IIA), which clearly shows that internal strain energy is the predominant factor
for the series of compounds reported.3
In another example reported recently,115 a high degree of correlation was achieved
between the calculated intermolecular interaction energy (van der Waals plus electrosta-
tic energies), for a series of HIV-1 protease inhibitor complexes and the observed i n vitro
potency. The inhibitors had been subjected to energy minimization in the HIV protease
binding site. This methodology was subsequently used to predict the potency of poten-
tial HIV protease inhibitors in advance of their synthesis and testing for biological
activity.
Each of these methods accounts for only a subset of the terms listed above, which
account for the total free energy of binding and can be used with confidence only when a
learning set of compounds is available. A method is needed that takes into account all
the factors listed above and can be applied without further parameterization to any
inhibitor in any binding site.
46 BOHACEK, MCMARTIN, AND GUIDA
IV. CONCLUSIONS
The art and practice of structure-based drug design is rapidly evolving into a legiti-
mate scientific discipline. We use the term "scientific discipline" with confidence since
the best work in this area currently involves:
The scientific foundations for this approach were established many years ago with the
development of X-ray and NMR techniques for the accurate determination of structures
of macromolecular complexes and with the development of computational methods for
molecular mechanics and quantum mechanics. However, the basic requirement for the
STRUCTURE-BASED DRUG DESIGN 47
REFERENCES
1. E. Fischer, Ber. Dtsch. Chem. Ges. 27, 2985 (1894).
2. J. W. Erickson and S. W. Fesik, in Annual Reports in Medicinal Chemistry M. C. Venuti, Ed., Academic Press,
Inc., New York, 1992, p. 271.
3. J. Greer, J. W. Erickson, J. J. Baldwin, and M. D. Varney, 1. Med. Chem. 37, 1035 (1994).
4. C. L. M. J. Verlinde and W. G. J. Hol, Structure 2, 577 (1994).
5. C. E. Bugg, W. M. Carson, and J. A. Montgomery, Scientific American 269, 60 (1993).
6. R. S. Bohacek and C. McMartin, 1. Am. Chem. SOC. 116, 5560 (1994).
7. V. J. Gillet, W. Newell, P. Mata, G. Myatt, S. Sike, Z. Zsoldos, and A. P. Johnson, 1. Chem. I f . Comput. Sci.
34, 207 (1994).
8. D. A. Pearlman and M. A. Murcko, 1. Comp. Chem. 14, 1184 (1993).
9. S. H. Rotstein and M. A. Murcko, 1. Camp.-Aided Mol. Design 7, 23 (1993).
10. S. H. Rotstein and M. A. Murcko, 1. Med. Chem. 36, 1700 (1993).
11. Y. Nishibata and A. ltai, 1. Med. Chem. 36, 2921 (1993).
12. E. E. Abola, F. C. Bernstein, S. H. Bryant, T. F. Koetzle, and J. Weng, in Data Commission offhe Int’l Union of
Crystallography, F. H. Allen, G. Bergerhoff, and R. Sievers, Eds., Int’l Union of Crystallography, Bonn,
Cambridge, Chester, 1987, p. 107.
13. F. C. Bernstein, T. F. Koetzle, G. J. 8 . Williams, E. F. Meyer, M. D. Brice, J. R. Rodgers, 0. Kennard,
T. Shimanouchi, and M. Tasumi, 1. Mol. B i d . 112, 535 (1977).
14. R. S. Bohacek and C. McMartin, 1. Med. Chem. 35, 1671 (1992).
15. D. G. Alberg and S. L. Schreiber, Science 262, 248 (1993).
16. C. Weber, G. Wider, B. von Freyberg, R. Traber, W. Braun, H. Widmer, and K. Wuthrich, Biochemistry 30,
6563 (1991).
17. S. W. Fesik, R. T. Gampe Jr., H. L. Eaton, G. Gemmecker, E. T. Olejniczak, P. Neri, T. F. Holzman, D. A.
Egan, R. Edalji, R. Simmer, R. Helfrich, J. Hochlowski, and M. Jackson, Biochemistry 30, 6574 (1991).
18. H.-R. Loosli, H. Kessler, H. Oschkinat, H.-P. Weber, T. J. Petcher, and A. Widmer, Helu. Chim. Acta. 68,
682 (1985).
19. H. Kessler, M. Kock, T. Wein, and M. Gehrke, Helu. Chim. Acta. 73, 1818 (1990).
20. W. L. Jorgensen, Science 254, 954 (1991).
21. K. Wuthrich, B. von Freyberg, C. Weber, G. Wider, R. Traber, H. Widmer, and W. Braun, Science 254,953
(1991).
22. G. D. Van Duyne, R. F. Standaert, P. A. Karplus, S. L. Schreiber, and J. Clardy, Science 252, 839 1991).
48 BOHACEK, MCMARTIN, AND GUIDA
23. T. Taga, H. Tanaka, T. Goto, and S. Tada, Acta. Cryst. C43, 751 (1987).
24. D. F. Mierke, P. Schmieder, P. Karuso, and H. Kessler, Helv. Chim. Acfa. 74, 1027 (1991).
25. A. M. Petros, J. R. Luly, H. Liang, and S. W. Fesik, 1. A m . Chem. Soc. 115, 9920 (1993).
26. D. Altschuh, 0. Vix, B. Rees, and J.-C. Thierry, Science 256, 92 (1992).
27. J. L. Kofon, P. Kuzmic, V. Kishore, G. Gemmecker, S. W. Fesik, and D. H. Rich, 1. Am. Chem. Soc. 114,
2670 (1992).
28. N. H. Sigal, F. Dumont, P. Durette, J. J. Siekierka, L. Peterson, D. H. Rich, 8. E. Dunlap, M. J. Staruch,
M. R. Melino, S. L. Koprak, D. Williams, B. Witzel, and J. M. Pisano, 1. Exp. Med. 173, 619 (1991).
29. A. A. Patchett, D. Taub, 0. D. Hensens, R. T. Goegelman, L. Yang, F. Dumont, L. Peterson, and N. H.
Sigal, 1. Antibiot. 45, 94 (1992).
30. G. Lauri and P. A. Bartlett, 1. Computer-Aided. Mol. Design 8, 51 (1994).
31. F. H. Allen, J. E. Davies, J. J. Galloy, 0.Johnson, 0. Kennard, C. F. Macrae, E. M. Mitchell, G . F. Mitchell,
J. M. Smith, and D. G. Watson, 1. Chem. In!. Comput. Sci. 31, 187 (1991).
32. P. Y. S. Lam, P. K. Jadhav, C. J. Eyerman, C. N. Hodge, Y. Ru, L. T. Bacheler, J. L. Meek, M. J. Otto, M. M.
Rayner, Y. N. Wong, C.-H. Chang, P. C. Weber, D. A. Jackson, T. R. Sharpe, and S. Erickson-Viitanen,
Science 263, 380 (1994).
33. A. L. Swain, M. M. Miller, J. Green, D. L. Rich, J. Schneider, S. 8. H. Kent, and A. Wiodawer, Proc. Natl.
Acud. Sci. U . S . A. 87, 8805 (1990).
34. J. M. Blaney, G. M. Crippen, A. Dearing, and J. S. Dixon, Indiana University-Quantum Chemistry
Program Exchange (QCPE # 590), Bloomington, IN.
35. 0. F. Guner, D. W. Hughes, and L. M. Dumont, 1. Chem. I f . Comput. Sci. 31, 408 (1991).
36. S. Thaisrivongs, P. K. Tomich, K. D. Watenpaugh, K.-T. Chong, W. J. Howe, C.-P. Yang, J. W. Strohbach,
S. R. Turner, J. P. McGrath, M. J. Bohanon, J. C. Lynn, A. M. Mulichak, P. A. Spinelli, R. R. Hinshaw, P. J.
Pagano, J. B. Moon, M. J. Ruwart, K. F. Wilkinson, B. D. Rush, G. L. Zipp, R. J. Dalga, F. J. Schwende,
G. M. Howard, G. E. Padbury, L. N. Toth, Z . Zhao, K. A. Koeplinger, T. J. Kakuk, S. L. Cole, R. M. Zaya,
R. C. Piper, and P. Jeffrey, 1. Med. Chem. 37, 3200 (1994).
37. E. A. Lunney, S. E. Hagen, J. M. Domagala, C. Humblet, J. Kosinski, B. D. Tait, J. S. Warmus, M. Wilson,
D. Ferguson, D. Hupe, P. J. Tummino, E. T.Baldwin, T. N. Bhat, B. Liu, and J. W. Erickson, 1. Med. Chem.
37, 2664 (1994).
38. D. S. Goodsell and A. J. Olson, Proteins: Struct., F u n d . , Genet. 8, 195 (1990).
39. P. J. Goodford, /. Med. Chem. 28, 849 (1985).
40. J. V. N. V. Prasad, K. S. Para, E. A. Lunney, D. F. Ortwine, J. 8 . Dunbar, D. Ferguson, P. J. Tummino, D.
Hupe, B. D. Tait, J. M. Domagala, C. Humblet, T. N. Bhat, 8. Liu, D. M. A. Guerin, E. T. Baldwin, J. W.
Erickson, and T. K. Sawyer, 1. A m . Chem. Soc. 116, 6989 (1994).
41. J. P. Vacca, J. P. Guare, S. J. deSolms, W. M. Sanders, E. A. Giuliani, S. D. Young, P. L. Darke, J. Zugay,
I. S. Sigal, W. A. Schleif, J. C. Quintero, E. A . Emini, P. S. Anderson, and J. R. Huff, 1. Med. Chem. 34,1225
(1991).
42. T. A. Lyle, C. M. Wiscount, J. P. Guare, W. J. Thompson, P. S. Anderson, P. L. Darke, J. A. Zugay, E. A.
Emini, W. A. Schleif, J. C. Quintero, R. A. F. Dixon, I. S. Sigal, and J. R. Huff, 1. Med. Chem. 34, 1228
(1991).
43. B. D. Dursey, R. B. Levin, S. L. McDaniel, J. P. Vacca, J. P. Guare, P. L. Darke, J. A. Zugay, E. A. Emini,
W. A. Schleif, J. C. Quinterno, J. H. Lin, I.-W. Chen, M. K. Holloway, P. M. D. Fitzgerald, M. G. Axel,
D. Ostovic, P. S. Anderson, and J. R. Huff, 1. Med. Chem. 37, 3443 (1994).
44. D. C. Humber, N. Cammack, J. A. V. Coates, K. N. Cobley, D. C. Orr, R. Storer, G. G. Weingarten, and
M. P. Weir, 1. Med. Chem. 35, 3080 (1992).
45. D. C. Humber, M. J. Bamford, R. C. Bethell, N. Cammack, K. Cobley, D. N. Evans, N. M. Gray, M. M.
Hann, D. C. Orr, J. Saunders, B. E. V. Shenoy, R. Storer, G. G. Weingarten, and P. G. Wyatt, 1. Med. Chem.
36, 3120 (1993).
46. D. S. Holmes, R. C. Bethell, N. Cammack, I. R. Clemens, J. Kitchin, P. McMeekin, C. L. Mo, D. C. Orr,
8. Patel, I. L. Paternoster, and R. Storer, 1. Med. Chem. 36, 3129 (1993).
47. A. Wonacott, R. Cooke, F. R. Hayes, M. M. Hann, H. Jhoti, P. McMeekin, A. Mistry, P. Murray-Rust,
0. M. P. Singh, and M. P. Weir, 1. Med. Chem. 36, 3113 (1993).
48. H. Jhoti, 0. M. P. Singh, M. P. Weir, R. Cooke, P. Murray-Rust, and A. Wonacott, Biochem. 33, 8417
(1994).
49. J. Kitchin, R. C. Bethell, N. Cammack, S. Dolan, D. N. Evans, S. Holman, D. S. Holmes, P. McMeekin,
C. L. Mo, N. Nieland, D. C. Orr, J. Saunders, B. E. V. Shenoy, 1. D. Starkey, and R. Storer, 1. Med. Chem.
37, 3707 (1994).
50. D. H. Rich, J. Green, M. V. Toth, G. R. Marshall, and S. B. H. Kent, 1. Med. Chem. 33, 1285 (1990).
STRUCTURE-BASED DRUG DESIGN 49
51. A. Krohn, S. Redshaw, J. C. Ritchie, B. J. Graves, and M. H. Hatada, J. Med. Chem. 34, 3340 (1991).
52. M. D. Varney, K. Appelt, V. Kalish, M. R. Reddy, J. Tatlock, C. L. Palmer, W. H. Romines, B.-W. Wu, and
L. Musick, J. Med. Chem. 37, 2274 (1994).
53. M. R. Reddy, M. D. Varney, V. Kalish, V. N. Viswanadhan, and K. Appelt, j . Med. Chem. 37, 1145 (1994).
54. D. Schirlin, S. Baltzer, V. Van Dorsselaer, F. Weber, C. Weill, J. M. Altenburger, B. Neises, G. Flynn, J. M.
Remy, and C. Tarnus, Biorg. Med. Chem. Lett. 3, 253 (1993).
55. D. Schirlin, V. Van Dorsselaer, C. Tamus, A. S. Tyms, S. Baltzer, F. Weber, J. M. Remy, T. Brennan, R. Farr,
and D. Janowick, Biorg. Med. Chem. Lett. 4, 241 (1994).
56. B. L. Pologar, R. A. Farr, D. Friedrich, C. Tamus, E. W. Huber, R. J. Cregge, and D. Schirlin, J. Med. Chem.
37, 3684 (1994).
57. M. Miller, J. Schneider, B. K. Sathyanarayana, M. V. Toth, G. R. Marshall, L. Clawson, L. Selk, S. B. H.
Kent, and A. Wlodawer, Science 246, 1149 (1989).
58. Z. Szewczuk, K. L. Rebholz, and D. H. Rich, Int. 1. Peptide Protein Res. 40, 233 (1992).
59. R. A. Wiley and D. H. Rich, Med. Res. Reviews 13, 327 (1993).
60. B. M. Brenner, B. J. Ballermann, M. E. Gunning, and M. L. Zeidel, Physiol. Rev. 70, 665 (1990).
61. P. Needleman, E. H. Blaine, J. E. Greenwald, M. L. Michener, and C. 8. Saper, Ann. Rev. Pharmacol.
Toxicol. 29, 23 (1989).
62. R. E. Lang, T. Unger, and D. Ganten, I. Hypertension 5, 255 (1987).
63. A. E. G. Raine, J. G. Firth, and J. G. G. Ledinham, Clin. Sci. 76, 1 (1989).
64. J. P. Bussien, J. Biollaz, B. Waeber, J. Nussberger, G. A. Turini, H. R. Brunner, F. Brunner-Feber, H. J.
Gomez, and E. S. Otterbein, 1. Curdiovasc. Pharmacol. 8, 216 (1986).
65. T. G. Yandle, A. M. Richards, M. G. Nicholls, R. Cuneo, E. A. Espiner, and J. H. Livesey, Life Sci. 38,1822 (1986).
66. J. L. Sonnenberg, Y. K. Sakane, A. Y. Jeng, J. A. Koehn, J. A. Ansell, L. P. Wennogle, and R. D. Ghai,
Peptides 9, 173 (1988).
67. A. J. Kenny and S. L. Stephenson, FEBS Lett. 232, 1 (1988).
68. M. Pozsgay, C. Michaud, M. Liebman, and M. Orlowski, Biochemistry 25, 1292 (1986).
69. L. 8 . Hersh and K. Morihara, 1. Bid. Chem. 261, 6433 (1986).
70. J. C. Danilewicz, P. L. Barclay, I. T. garnish, D. Brown, S. F. Campbell, K. James, G. M. R. Samuels, N. K.
Terrett, and M. J. Wythes, Biochem. Biophys. Res. Commun. 164, 58 (1989).
71. D. R. Holland, P. L. Barclay, J. C. Danilewicz, B. W. Matthews, and K. James, Biochemistry 33, 51 (1994).
72. L. J. MacPherson, E. K. Bayburt, M. P. Capparelli, R. S. Bohacek, F. H. Clarke, R. D. Ghai, Y. Sakane, C. J.
Berry, J. V. Peppard, and A. J. Trapani, j . Med. Chem. 36, 3821 (1993).
73. P. A. Bartlett and C. K. Marlowe, Science 235, 569 (1987).
74. H. M. Holden, D. E. Tronrud, A. F. Monzingo, L. H. Weaver, and B. W. Matthews, Biochemistry 26,8542
(1987).
75. W. C. Guida, R. S. Bohacek, and M. D. Erion, 1. Comp. Chem. 13, 214 (1992).
76. R. S. Bohacek, J. Priestle, and M. Gruetter, unpublished results.
77. L. H. Opie, Angiotensin Converting Enzyme Inhibitors, Wiley-Liss, New York, 1992.
78. A. M. Richards, I. G. Crozier, T. Kosoglou, M. Rallings, E. A. Espiner, M. G. Nicholls, T. G. Yandle,
H. Ikram, and C. Frampton, Hypertension 22, 119 (1993).
79. K. B. Margulies, M. A. Perrella, L. J. McKinley, and J. C. Burnett, 1. Clin. Invest. 88, 1636 (1991).
80. I. Pham, W. Gonzales, A.-I. K. El Amrani, M.-C. Fourni&Zaluski, M. Philippe, I. Laboulandine, B. P.
Roques, and J.-B. Michel, 1. Pharmacol. Exp. Ther. 265, 1339 (1993).
81. A. A. Seymour, M. M. Asaad, V. M. Lanoce, K. M. Langenbacher, S. A. Fennell, and W. L. Rogers, 1.
Phurmacol. Exp. Ther. 266, 872 (1993).
82. C. Krulan, R. D. Ghai, R. W. Lappe, and R. L. Webb, FASEB J. 7, A247 (1993).
83. A. A. Patchett, and E. H. Cordes, in Adv. Enzymol. A. Meister, Ed., John Wiley & Sons, New York, 1985.
84. R. S. Bohacek, S. DeLombaert, C. McMartin, J. Priestle, and M. Gruetter, unpublished results; manu-
script in preparation.
85. G. Ksander, R. S. Bohacek, R. de Jesus, A. Yuan, Y. Sakane, C. Berry, R. Ghai, and A. J. Trapani,
unpublished results, manuscript in preparation.
86. C. McMartin and R. Bohacek, unpublished results.
87. D. W. Cushman, H. S. Cheung, E. F. Sabo, and M. A. Ondetti, Biochemistry 16, 5484 (1977).
88. G. A. Flynn, D. W. Beight, S. Mehdi, J. R. Koelh, E. L. Giroux, J. F. French, P. W. Hake, and R. C. Dage, 1.
Med. Chem. 36, 2420 (1993).
89. N. G. Delaney, J. C. Banish, R. Neubeck, S. Natarajan, M. Cohen, G. C. Rovnyak, G. Huber,
N. Murugesan, R. Girotra, E. Sieber-McMaster, J. A. Robl, M. M. Asaad, H. S. Cheung, J. E. Bird,
T. Waldron, and E. W. Petrillo, Bioorg. Med. Chem. Lett. 4, 1783 (1994).
50 BOHACEK, MCMARTIN, AND GUIDA