Professional Documents
Culture Documents
σk ε σi ε
= Ep.np p∈k 1- ext = Ep.np p∈i 1- ext
ε0 εint ε0 εint
R R
NH+ NH+
NH2 NH2
–
H2N O H2N
N
N N
O N
OH R N N
–N
O HN R
R R
Pharmacophore Patterns
• The pharmacophore pattern of a molecule
characterizes the relative arrangement of all its
pharmacophore types
– What pharmacophore types are represented?
– How are they arranged (spatially, topologically) with
respect to each other ?
– How can these aspects be captured numerically to yield
molecular descriptors of the pharmacophore pattern?
• Note: Pharmacophore patterns are essentially 3D.
Since geometry is determined by connectivity, 2D
“pharmacophore patterns” also make sense!
Exploiting pharmacophore patterns…
CGRP
MAPkin
IL-8
N
NEUPTh
N
HIVP
N
PK55fyn
N
EGF-TK
O
PKC
H
PDEIV
PDEII
Elast
CatB
O
Cl
Br
Cl
K-ATP
V1Ah
Sigma1
5HTUpt
5HT6h
5HT3h
5HT2ch
5HT1D
5HT1Ah
Muh
NPY
NK1h
M3h
M1h
ML1
H1c
Galan
N
ETAh
DaUpt
N
N
N
D2h
N
N
D1h
Cl
Cl
CCKAh
B2h
O
N
Bomb
BZDc
AT1h
I
Beta1h
Alpha2
I
Alpha1
A1h
100
100
90
80
70
60
50
40
30
20
10
100
0
90
80
70
60
50
40
30
20
10
90
80
70
60
50
40
30
20
10
0
0
Tricentric Pharmacophore Fingerprints:
monitoring feature arrangement
• Topological: the distance between two features equals the
(minimal) number of chemical bonds between them
Cl
O
9 4
N
11
N
Basis Triplets:
• all possible feature combinations
• at a given series of distances…
3 3 3 4 3 5 5
9
3 5
8? 4 3 7
3 4 4 5 5 6
Hp
…
Ar
Ar
…
Hp
Hp
…
…
…
Hp
…
Hp
Hp
4- H
4- H
4-H
3-H
3-H
7-A
3-H
3-H
p3
p3
r4
A5
A5
p3
p3
p3
-H
-H
-P
-H
-H
-H
-A
-A
C6
p4
p5
p3
p4
p5
r5
r5
0 0 0 … 0 0 … … 1 … … … 0 … … 0 …
Pickett, Mason & McLay, J. Chem. Inf. Comp. Sci. 36:1214-1223 (1996)
First key improvement: Fuzzy mapping of
atom triplets onto basis triplets in 2D-FPT
5 4
3 3 3 4 5 3 4 7
3 4 5 5 6
…
…
Ar
Ar
…
Hp
Hp
Hp
…
Hp
…
Hp
Hp
4- H
4- H
4-H
7-A
3-H
3-H
3-H
3-H
p3
p3
r4
A5
p3
A5
p3
p3
-P
-H
-H
-H
-H
-H
-A
-A
C6
p4
p5
p3
p4
p5
r5
r5
0 0 0 … 0 0 … +6 … … +3 … … … … 0 …
4 7
8
Ar4-Hp7-Hp6
Ar5-Hp6-Hp7 9
6
Triplet matching procedure
k =1
Horvath, D. ComPharm pp. 395-439; in "QSPR /QSAR Studies by Molecular Descriptors", Diudea, M.,
Editor, Nova Science Publishers, Inc., New York, 2001
Control parameters for triplet enumeration &
matching in two 2D-FPT versions.
Parameter Description FPT-1 FPT-2
Minimal Edge Length of basis triangles (number of bonds
Emin 2 4
between two pharmacophore types)
Emax Maximal Triangle Edge Length of basis triangles 12 15
Estep Edge length increment for enumeration of basis triangles 2 2
Edge length excess parameter: in a molecule, triplets with
e 0 2
edge length > Emax+e are ignored
Maximal edge length discrepancy tolerated when attempting
Δ 2 2
to overlay a molecular triplet atop of a basis triangle.
Gaussian fuzziness parameter for apolar (Hydrophobic and
ρHp = ρAr 0.6 0.9
Aromatic) types
Gaussian fuzziness parameter for charged (Positive and
ρPC = ρNC 0.6 0.8
Negative Charge) types
Gaussian fuzziness parameter for polar (Hydrogen bond
ρHA = ρHD 0.6 0.7
Donor and Acceptor) types
l Aromatic-Hydrophobic interchangeability level 0.6 0.5
Number of basis triplets at given setup 4494 7155
Second key improvement: Proteolytic
equilibrium dependence of 2D-FPT
8
C8
PC
-P
8-
C5
NC
5- N
8-
12%
Ar
Ar
88%
Some ‘activity cliffs’ in rule-based descriptor
space are smoothed out in 2D-FPT-space
•Neu n
•C a t
tral
io
•Neu Cation
•Neu Cation
• 50%
• 90%
tral
tral
t ral
u
•Ne ion
•An
t ral t ral
u u
•Ne ion •Ne utral
•An
n
•Ne
n
tio
tio
0% al
0% al
Ca
Ca
• 7 eutr
• 4 eutr
•N
•N
Pharmacophore Pattern-Based Similarity
Queries: Lead Hopping!
Pharmacophore Reference Nearest Neighbors Docking
?
Hypothesis Fingerprint
Automated
Fingerprint
Matching... Best Matching Candidates
Potential Pharmacophore
Fingerprint Library
Some examples of "hidden similarity"
CGRP
MAPkin
IL-8
N
NEUPTh
N
HIVP
N
PK55fyn
N
EGF-TK
O
PKC
H
PDEIV
PDEII
Elast
CatB
O
Cl
Br
Cl
K-ATP
V1Ah
Sigma1
5HTUpt
5HT6h
5HT3h
5HT2ch
5HT1D
5HT1Ah
Muh
NPY
NK1h
M3h
M1h
ML1
H1c
Galan
N
ETAh
DaUpt
N
N
N
D2h
N
N
D1h
Cl
Cl
CCKAh
B2h
O
N
Bomb
BZDc
AT1h
I
Beta1h
Alpha2
I
Alpha1
A1h
100
100
90
80
70
60
50
40
30
20
10
100
0
90
80
70
60
50
40
30
20
10
90
80
70
60
50
40
30
20
10
0
0
Successful Virtual Screening Simulations
Confirmed Actives (PF) Confirmed Inactives (PF) Confirmed Actives (PF) Confirmed Inactives (PF)
Confirmed Actives (OPT3)
(FPT-2) Confirmed Inactives (OPT3)
(FPT-2) Confirmed Actives (OPT3)
(FPT-2) Confirmed Inactives (FPT-2)
(OPT3)
90 7
% Retrieved Seed Compounds
40 3
30 2
20
1
10
0 0
8
50
7
40 6
35
5
30
4
25
3
20
15
TK 2
10
1
5
0
0
90
45
70
35
60
30
50
25
40
20
30
15
20
10
10
5
0
0
0 20 40 60 80 100 120 140 160 180 200
0 20 40 60 80 100 120 140 160 180 200
Selection Size
Selection Size
Successful QSAR model construction with 2D-
FPT: predicting c-Met TK activity
Learning Set Compounds Validation Set Compounds
9
.
8.5
7.5
7
Experimental pIC50
6.5
5.5
4.5
25 variables entering nonlinear model
4 153 molecules for training: RMSE=0.4 (log units), R2=0.82
4 4.5 540 molecules
5.5 6 validation:
for 6.5 7
RMSE=0.8 7.5(log units),
8 R28.5
=0.53 9
8 validation molecules outpIC50
Calculated of 40 mispredicted by more than 1 log
What more could be done?
- chosen conformer
of the reference
- chosen conformer
of the candidate
- pair of matching
atoms
- 3 Euler angles
- mirroring toggle
GA-controlled
overlay optimization
ComPharm Pharmacophoric Fields
Pharmacophoric Features
Alk. Aro. HBA HDB (+) (-)
1 X11 X12 X13 X14 X15 X16
Reference Atoms
2 X21 X22 X23 X24 X25 X26
3 X31 X32 X33 X34 X35 X36
4 X41 X42 X43 X44 X45 X46
5 X51 X52 X53 X54 X55 X56