Professional Documents
Culture Documents
is much faster
ne docking of flexible Iligands involves programs that use a single novel structure. Automated design
diverse structures in a
fragmentation approach followed by the docking of a rigid and can produce large numbers of
anchor and reconstruction of the ligand. Alternatively, sim- short time period.
ulated annealing or genetic/evolutionary algorithms can be novo drug design was carried out
The early work on de
Used to generate different conformations during the docking manually and an example of one of these studies is given
in Case study 5. From these studies, a number of general
process manual de novo
principles were identified regarding drug
Electronic screening or database mining involves the search
of structural databases to identify structures containing a design.
disadvantages. For example, the novelty of the structures drug design and some are particularly problematic. Fo
obtained is inevitably limited to the operator's own im- example, automated de novo drug design is pnssible
to gen
isslow andandit originality.
agination is really limited
More to the identification
seriously, of a
manual design erating
to synthesize.
structures
Consequently,
which are either
efforts ficult
have been
or impos
ma
nade to
De novo drug design
e the software packages involved, such that they 21.15.2 Automated de novo drug design
out problem structures, or prevent
identifiy.
and filter Several computer software programs have been written
can
generated in the place. first A second prob
being revolves around
which automatically design novel structures to fit known
with automated de novo programs
ons used to estimate binding affinities. binding sites. The following are some examples.
em
he
scoring functions
be useful to
rank the generated structures with
strengths, but the results ob-
wouk
It 21.15.2.1 LUDI
to their binding
been tound to be unreliable.
sined have oftennovo to point out
One of the best known de novo software programs is called
drug design are quick
Critics of de useful this LUDI, which works by fitting molecular fragments to dif
no clinically drug has been designed in ferent regions of the binding site, then linking the fragments
hat
it is hard to see how this could be
nanner. This is true, but1The number and variety of struc- together (Fig, 21.47). There are three stages to the process.
realistic expectation. identified
which could be through de novo drug 1: Identification of interaction sites
tures
chances of 'hit- Stage
virtually limitless and so the
design are
Moreover, there is far First of all, the atoms present in the binding site are an-
tao the ideal structure are poor. that binds alysed identify those that can take part in hydrogen
to
nare to drug design finding a structure
than
and those that can take part in van
novo drug design does not
iden- bonding interactions,
to its target. De
strongly favourable der Waals interactions. Oxygen atoms and tertiary ni-
structures identified will have
HHr whether the atoms are identified as hydrogen bond acceptors.
safety profiles. trogen
pharmacokinetic properties acceptableare
or
H-X H-N
H-N
H-N
Bridging
iGURE 21.47 Stages involved in automated denovo drug design using LUDI. H-X, hydrogen
bond donor interaction
g
AeYs
, hydrogen bond acceptor interaction site. The dots indicate aromatic interaction sites.
644 Chapter 21 Computers in medicinal chemistry
Identify
aliphatic
carbon Trim
Construct Define
interacticon
the sphere
surrounding
sphere sites Binding site Binding site
Binding site Binding site
interaction wites round a netliyl group (10DI),
FIGURE 21.48 ldentification of aliphatle
1BO" 16R0
X
H 10A
10AH.
19A 1.9A
Interaction sites can then be defined. These are positions There are, in fact, four interaction sites that are normall
in the binding site that define where a ligand atom could calculated in this situation. The other three can be visual
be placed to interact with ay of the above atoms. For ex- ived if we take a viewpoint along the line of atoms O-H
A and vary the relative position of Y as shown (Fig, 21.51
ample, suppose the binding site contains a methyl group
As with the der Waals interaction sites,
CFig. 21.48). The program would identify the carbon of that
group as an aliphatic carbon capable of taking part in van
van
bond interaction sites are checked to ensure that they are
hydrogen
der Waals interactions. This is a non-directional interaction, no closer than 1.5 A from any atorn present in the bind
so a sphere is constructed around the carbon atom with a ing site. If they are, they are rejected.
radius corresponding to the ideal distance for such an in-
teraction (4 A). A number of points (normally 14) are then Stage 2: Fitting molecular fragments
placed evenly over the surface of the sphere to define ali- Once interaction sites have been determined, the LUDI
phatic interaction sites. Regions of the sphere which overlap program accesses a library of several hundred molecular
or come too close to the atoms making up the binding site fragments, such as those shown in Fig 21.52. The mol
(ie. less than 3 Aseparation) are rejected, along with any ecules chosen are typically 5-30 atoms in size and are
of the 14 points that were on that part of the surface. The usually rigid in structure because the fitting procedure
remaining points are then used as the aliphatic interaction assumes rigid fragments. Some fragments are included
sites. A similar process is involved in identifying aromatic
interaction sites surrounding an aromatic carbon atom. 1.23 A
Identifying interaction sites for hydrogen bonds is car- A
ried out in a different fashion. As hydrogen bonds are di-
rectional, it is important to define not only the distance be- 180
tween the ligand and the binding region, but the relevant
Bindingsite Binding site
orientation of the atoms as well. This can be done by defin-
ing the hydrogen bond interaction site as a vector involv FIGURE 21.50 The interaction site for ahydrogen bond
ing two atoms. The position of these atoms is determined acceptor, represented by A-Y (LUDI). A denotes the
by the ideal bond lengths and bond angles for a hydrogen hydrogen bond acceptor.
bond. For example, if the binding site has a carbonyl group
present, then there are two possible hydrogen bond inter-
action sites (X-H) which can be defined (Fig. 21.49).
If the binding site has a hydrogen bond donor
sent, then interaction sites for a hydrogen bond accep-
tor would be defined in a similar fashion. For
pre-
example,
Fig. 21.50 shows how the interaction site for a hydrogen
P
Bindingsite Binding siteBindingsiteBindingse
bond acceptor is determined when the binding site has a
FIGURE 21.51 Four possible interaction sites for A-Y(A
hydroxyl group present. hidden).
645
De novo drug design
NH OH
H,C o
OH
NO
FIGORE 21.52
Examples of molecular fragments used by LUDI.
the fitting binding site, the final stage is to link them up. The pro-
in process. EaCh contormation is treated as a gram first identifies the molecular
entity during the htting fragments a are
The atoms which are going to beprocess.
separate
closest to each other in the binding site, then identifies
used in the
have to be fitting pro- the closest hydrogen atoms (Fig. 21.54). These now define
cess predetermined for each fragment. Simi- the link sites for the bridge. The program now tries out
larly, the interaction sites on to which each atom
eted have to be can be various molecular
predetermined. For example, the methyl bridges from a stored ibrary o
out which one fits best. Examples of the types of molecu-
carbons of an acetone ragment are defined as aliphatic lar
bridges that
are stored are shown in Fig. 21.55. Once
and can only be htted onto aliphatic interaction sites. The suitable a
arbonyl group is defined as a bridge has been found, a final molecule is createa.
d can only be fitted on to the hydrogen bond acceptor
corresponding interaction
(Fig. 21.53). The best fit will be the one that 21 15.2.2
site
matches SPROUT
uD the fragment with the maximum number of interac- Another
tion sites. The program can try out' the various early example of an automated de novo drug
in its library and identify those that can be
fragments design program is SPROUT. Like LUDI, the program
matched up or fits fragments to interaction sites, but there are
fitted to the available interaction sites in the inter
binding
site. esting differences in the way that the process is carried
Better
fit
O-H A -H OSe.
HC H:
Aliphatic
FIGURE 21.53 Fitting fragments. A-Y, hydrogen bond acceptor interaction site. The dots indicate aliphatic interaction s
ites.
646 Chapter 21 Computers in rnedicinal cherristry
H-N
A
N
(ILUDI),
FIGURE 21.55 Examples of molecular bridges
in LUDI. The atoms and bonds are specified in these template randomly and positions it into the binding site
fragments and there huge
are a frag-
number of possible by placing one of the vertices at the centre of a sphere. In
ments which could be considered. Templates, however, the early versions of SPROUT, further fragment templates
are designed to represent several different molecular were then added sequentially, and the skeleton was grown'
until it occupied all the other spheres. In the current ver-
fragments. Each template is defined by vertices and edg-
es rather than by atoms and bonds. A vertex represents
sion, fragment templates are placed at all the spheres and
while an are grown towards each other, until they are finally
linked.
a generalized sp, sp', or sp° hybridized atom,
double, triple bond, depend- One advantage of SPROUT is that the 'growth' of fragment
edge represents a single, or
Sp
p
sp2
Fragment template
OOOo0 etc.
Interaction sites
Fitting fragments
Aromatic
HBA
HBA H-N
O-H , H-N
0-H
Hydrophobic
H-N H-N
O-H
0-H
FIGURE 21.57
Generating structures using SPROU
identify an enol and modify it to a carboxylic acid which purpose since it is relatively slow, taking about a minute
for the analysis of the fi-
can still act as a hydrogen bond donor (Fig. 21.58). per structure. This is acceptable
nal structures that are generated, but would slow up the
The program also has the ability to modify structures
if it used to assess the thousands
process considerably
was
such that they are more readily synthesized. For example,
f intermediate structures that are generated during the
introducing a heteroatom into a two-carbon link between
648 Chapter 21 Computers in medicinal chemistry
nteraction point
for H-bond donor
Sp
Add atoms
Modify
sp
Enol
Molecular template Carboxylic
acid
Ketone
Modify
Easier to synthesize
KOO Nal
FIGURE 21.60 Possible synthesis allowing the linkage required in Fig. 21.59.
building
process. Therefore, a less accurate but quicker 21.15.2.3 LEGEND
method of analysis is carried out. The method involves
the identification of molecular features within each par- LEGEND is another long established automated de no
tial structure and identifying how frequently they occur drug design program. A grid is set up within the
binding
in known structures. The rationale is that if a particular site to identify steric and electrostatic interaction ener
feature commonly exists in known compounds, it is like- gies between each grid point and the binding site (section
ly that that same feature should be capable of synthesis in 21.7.5). These are tabulated for different types of atom and
the novel structures generated by de novo design. are used to estimate van der Waals interactions for the
The major structural features within a molecule can growing skeletons that are generated by the program, as
be defined as the various sized rings that are present, as well as for structure optimization of final structures. The
well as any connecting chains. The synthetic feasibility operator has the choice of starting from a single heter
of rings and chains is generally dependent on their
sub 0atom, placed in such a position that it can form a hydro-
stitution pattern. For example, the ten most frequently gen bond ith the binding site. Alternatively, a molecule
observed substitution patterns for a naphthalene ring or molecular fragment can be placed into the binding
amongst a database of known compounds are shown in site to act as a starting structure. This can be useful if one
Fig. 21.61. The analysis of partial structures can be car include the structure of an active
ried out such that structures with uncommon structural
wants to partial com
pound within the generated structures. Once the starting
features (such as a tetrasubstituted naphthalene ring) are atom or fragment has been positioned, the growth stage
penalized and rejected. A measure of the drug-like char- can commence to generate different skeletons
acter of the partial structures can also be gleaned if the Unlike LUDI and SPROUT, LEGEND does not use fra
database used in the analysis is restricted to active com- ske
ments or templates to generate skeletons. Instead, the
pounds from drug databases. etons are grown one atom at a time using random choCES
649
De novo drug design
3748 3288
1608 782 504
486 459
403 397 362
FIGURE 21.61 The ten most
frequent substitution patterns for naphthalene in database
Numbers refer to number of of drug-like compounds.
a
occurrences.
hlinked. The latter is termed the root atom. The typewillof rings are present, these are also completed. The structure
is finally optimized,
Lnd
bor used to connect the new atom to the root
atom is also0
taking into account both intramolec
ular and intermolecular interactions. The
chasen at random. Finaly a random torsion process is then
ta pOsition the new atom relative to
angle is chosen repeated to generate as many structures as desired.
the existing skeleton.
Particular features such as aromatic rings, carbonyl
and amide groups can be generated since some of groups, 21.15.2.4 GROW, ALLEGROW, and SYNOPSIS
the atom
wes that are used are dehned as
belonging to these features. GROW is
For example, if a new atom is defined as being an aromatic a
program that uses molecular fragments to
arbon, then the hnal structure must generate novel ligands for binding sites. The fragments
eventually contain an
aromatic ring containing that atom. The aromatic used represent amino acids, and so the
structures that are
ot be completely formed when the
not
ring may generated are limited to peptides.
growth
stage is over, ALLEGROW is a program that can be used
but the program will automatically to extend
complete the ring. a known ligand such that it can access vacant
This approach adding atoms one by one has the ad-
of regions ofa
vantage that it can
generate greater diversity of struc-
a
binding site (Box 21.5).
MeO. MeO
OAC
HO HO
Me E F3C F FC
HO. IOH HN HN
IMe
Me
N N. N
N N
Me
Structure| Structure lI
Cortivazol
New New
channel channel
HN
-Me
O
NH2