You are on page 1of 12

Biophys Rev

DOI 10.1007/s12551-016-0247-1

REVIEW

Software for molecular docking: a review


Nataraj S. Pagadala 1 & Khajamohiddin Syed 2 & Jack Tuszynski 3,4

Received: 13 November 2016 / Accepted: 27 December 2016


# International Union for Pure and Applied Biophysics (IUPAB) and Springer-Verlag Berlin Heidelberg 2017

Abstract Molecular docking methodology explores the be- Keywords Rigid body docking . Flexible docking . Docking
havior of small molecules in the binding site of a target pro- accuracy
tein. As more protein structures are determined experimentally
using X-ray crystallography or nuclear magnetic resonance
(NMR) spectroscopy, molecular docking is increasingly used Introduction
as a tool in drug discovery. Docking against homology-
modeled targets also becomes possible for proteins whose In modern drug discovery, protein–ligand or protein–protein
structures are not known. With the docking strategies, the docking plays an important role in predicting the orientation
druggability of the compounds and their specificity against a of the ligand when it is bound to a protein receptor or enzyme
particular target can be calculated for further lead optimization using shape and electrostatic interactions to quantify it. The
processes. Molecular docking programs perform a search al- van der Waals interactions also play an important role, in
gorithm in which the conformation of the ligand is evaluated addition to Coulombic interactions and the formation of hy-
recursively until the convergence to the minimum energy is drogen bonds. The sum of all these interactions is approximat-
reached. Finally, an affinity scoring function, ΔG [U total in ed by a docking score, which represents potentiality of bind-
kcal/mol], is employed to rank the candidate poses as the sum ing. In the simplest rigid-body systems, the ligand is searched
of the electrostatic and van der Waals energies. The driving in a six-dimensional rotational or translational space to fit in
forces for these specific interactions in biological systems aim the binding site, which can serve as a lead compound for drug
toward complementarities between the shape and electrostat- design (Alberg and Schreiber 1993).
ics of the binding site surfaces and the ligand or substrate. The docking accuracy in a rigid-body approach is much
greater for bound complexes than uncomplexed molecules
(Shoichet and Kuntz 1991). Even though the observed
structural changes between the bound and free forms are
* Nataraj S. Pagadala small, the difference in accuracy implies that the assump-
nattu251@gmail.com tion of rigidity is not fully warranted (Totrov and Abagyan
1994). Also, the difference between the near native struc-
1
tures and others far from native cannot be distinguished,
Department of Medical Microbiology and Immunology, Li Ka Shing
Institute of Virology, 6-020 Katz Group Centre, University of
even with simple scoring functions such as measures of
Alberta, Edmonton, Alberta T6G 2E1, Canada surface complementarity (Katchalski-Katzir et al. 1992), solvent
2
Unit for Drug Discovery Research, Department of Health Sciences,
accessible surface area (SASA) burial, solvation free energy,
Faculty of Health and Environmental Sciences, Central University of electrostatic interaction energy, or the total molecular mechanics
Technology, Bloemfontein 9300, Free State, South Africa energy (Shoichet and Kuntz 1991). Hence, the docking proce-
3
Department of Experimental Oncology, Cross Cancer Institute, dures were improved by several groups by allowing for receptor
Edmonton, Alberta, Canada and ligand flexibility.
4
Department of Physics, University of Alberta, Edmonton, Alberta, The entropy loss of a flexible ligand in rigid six body de-
Canada grees of freedom in an anisotropic environment of the receptor
Biophys Rev

and the change in its internal energy upon binding can greatly to its biological target and identifying the known com-
affect the binding affinity. Introducing local minimization of a pounds with top scores in virtual screening trials. In order
molecular-mechanics energy function such as in the to assesses the docking accuracy and mode of binding,
CHARMM package yields only limited improvement initially, FlexX was evaluated on a set of 19 protein–ligand
(Brooks et al. 2009). Consequently, information regarding complexes, with a subsequent evaluation on a larger set of
the binding site location before the docking processes became 200 complexes (Rarey et al. 1996). The docking accuracy of
very important to increase the docking efficiency. There are Glide was assessed by redocking ligands from 282 co-
several cavity detection programs or online servers that can crystallized PDB complexes, while GOLD was validated on
detect putative active sites within proteins, e.g., GRID 100 and 305 complexes (Friesner et al. 2004; Jones et al.
(Goodford 1985), POCKET (Levitt and Banaszak 1992), 1997). Further, ligandFit was reported for 19 protein–ligand
SURFNET (Laskowski 1995), PASS (Putative Active Sites complexes (Venkatachalam et al. 2003), while DOCK has been
with Spheres) (Brady and Stouten 2000), and MMC (mapping verified on several targets over the years (Bodian et al. 1993;
macromolecular topography) (Mezei 2003). Debnath et al. 1999; Shoichet et al. 1993). Both AutoDock and
The earliest reported docking methods were based on the AutoDock Vina were calibrated using the same test set of 30
lock-and-key assumption proposed by Fischer, stating that structurally known protein–ligand complexes with experimen-
both the ligand and the receptor can be treated as rigid bodies tally determined binding constants (Österberg et al. 2002; Trott
and their affinity is directly proportional to a geometric fit and Olson 2010).
between their shapes (Mezei 2003). Later, the Binduced-fit^ Among these programs, AutoDock Vina, GOLD, and
theory proposed by Koshland suggested that the ligand and MOE-Dock predicted top ranking poses with best scores.
receptor should be treated as flexible during docking GOLD and LeDock were able to identify the correct ligand
(Hammes 2002; Koshland 1963). Each backbone movement binding poses. Both Glide (XP) and GOLD predict the poses
affects multiple side chains in contrast to relatively indepen- consistently with a 90.0% accuracy (Wang et al. 2016). It was
dent side chains. Thus, the sampling procedure in a fully flex- also shown that GOLD produced higher enrichment factors
ible receptor/ligand docking is of a higher order of magnitude than Glide in a virtual screening trial against Factor Xa,
in terms of the number of degrees of freedom than in flexible whereas Glide outperformed GOLD against the same target
docking with a rigid receptor. Consequently, these flexible in a similar virtual screening trial. Overall, it was reported
docking algorithms not only predict the binding mode of a recently that these docking programs are able to predict
molecule more accurately than rigid body algorithms, but also experimental poses with root-mean-squared deviations
its binding affinity relative to other compounds (Verkhivker (RMSDs) averaging from 1.5 to 2 Å (Bissantz et al. 2000;
et al. 2000). Dixon 1997). However, flexible receptor docking, especially
Over the last two decades, more than 60 different docking backbone flexibility in receptors, still presents a major chal-
tools and programs have been developed for both academic lenge for the available docking methods.
and commercial, use such as DOCK (Venkatachalam et al.
2003) AutoDock (Österberg et al. 2002), FlexX (Rarey et al.
1996), Surflex (Jain 2003), GOLD (Jones et al. 1997), ICM Rigid body docking
(Schapira et al. 2003), Glide (Friesner et al. 2004), Cdocker,
LigandFit (Venkatachalam et al. 2003), MCDock, FRED Rigid body docking produces a large number of docked con-
(McGann et al. 2003), MOE-Dock (Corbeil et al. 2012), formations with favorable surface complementarity, followed
LeDock (Zhao and Caflisch 2013), AutoDock Vina (Trott by the reranking of the conformations using the free energy of
and Olson 2010), rDock (Ruiz-Carmona et al. 2014), UCSF approximation. The fast Fourier transform (FFT) correlation
Dock (Allen et al. 2015), and many others. approach (Katchalski-Katzir et al. 1992) systematically ex-
Although strategies in the ligand placement differ one plores the space of docked conformations using electrostatic
from another, these programs are broadly categorized as interactions (Mandell et al. 2001) or both electrostatic and
ranging from incremental construction approaches, such solvation terms (Chen et al. 2003), but the potential is
as FlexX (Rarey et al. 1996) to shape-based algorithms restricted to a correlation function form. Later, polar
(i.e., DOCK) (Kuntz et al. 1982), genetic algorithms Fourier correlations were used to accelerate the search for
(GOLD) (Jones et al. 1997), systematic search techniques candidate low-energy conformations (Ritchie and Kemp
(Glide, Schrödinger, Portland, OR 97201), and Monte 2000). Additionally, other approaches such as computer
Carlo simulations (LigandFit) (Venkatachalam et al. 2003). vision concepts (Wolfson and Nussinov 2000), Boolean
With the exception of GOLD, almost all current flexible operations (Palma et al. 2000), and the genetic algorithms
ligand docking programs treat the receptor as rigid (Jones (Gardiner et al. 2001) were also used. In fact, the Fourier
et al. 1997). These programs were evaluated to test their transform algorithm can also use spherical harmonic decom-
abilities in producing the correct binding mode of a ligand position to accelerate the search over 3D rotational space, as
Biophys Rev

used in FRODOCK. To further improve the FFT docking, the sum of terms representing shape complementarity, electro-
atomic contact energy is added to estimate the desolvation static, and desolvation energies. The structures obtained in
energy in RDOCK and electrostatic correction in ZDOCK PIPER are very close to their native conformations due to
(Bissantz et al. 2000; Metropolis and Ulam 1949). the decomposition of eigenvalue–eigenvector, which is the
There are also other types of useful FFT based rigid-body key to the efficient use of this potential (Kozakov et al. 2006).
docking tools without a 3D grid-based searching system, such However, these algorithms are not well suited for unbound
as Hex (Ritchie and Kemp 2000; Ritchie and Venkatraman crystal structures and yield many false-positives far from the
2010). HEX uses spherical polar Fourier correlations for both native complex, though they have good surface complemen-
rotational and translational space. Furthermore, the efficiency tarity. To improve in silico prediction further, F2Dock was
of Fourier transform-based algorithms is further accelerated developed, which also uses shape complementarity and scores
computationally with the help of advanced software packages, based on Coulombic potentials. This program is also struc-
such as the 3D convolution library (Pierce et al. 2011), and tured to incorporate the Lennard-Jones potential and docking
new hardware technologies, such as the graphics processing solutions were reranked based on desolvation energy. These
unit (GPU) (Ritchie and Venkatraman 2010) and Cell BE contributions were shown to be effective in more than 70% of
processor (Pons et al. 2012). The docking program ‘DOT’ the cases in a bound–unbound complex. The lowest RMSD
performs a systematic rigid-body search of one molecule, car- was improved by at least 0.5 Å for 45 bound–unbound com-
rying out both translational and rotational orientation on a plexes and less than 1 Å was seen for 27 bound–bound com-
second molecule. Finally, the sum of intermolecular energies plexes (Bajaj et al. 2011). In fact, DOCK was one of the first
of electrostatic and atomic desolvation energies as a correla- programs that involved shape complementarity through a set
tion function for all the generated configurations are computed of spheres in the determination of ligand–protein interactions.
efficiently with FFTs (Roberts et al. 2013). The volume occupied by the ligand depends on the diameter
MEGADOCK is similar to ZDOCK in that it generates of the spheres inside the binding pocket of the protein (Kuntz
docking conformations in a grid-based 3D space using an et al. 1982). The initial orientation of the ligand inside the
FFT. But MEGADOCK calculations are 8.8 times faster than binding pocket is determined by a maximum clique detection
ZDOCK due to a much simpler score function in which only method based on distance compatibility. However, the data
shape complementarity and electrostatics are considered can be accessed rapidly though geometric hashing by
(Ohue et al. 2014a). Using these two programs, the core signal matching features in triplets. The features are represented
process in the bacterial chemotaxis pathway has been identi- in the form of spheres and are clustered as poses (Fischer
fied (Matsuzaki et al. 2014). Later, a soft docking approach in et al. 1993). SDOCK performs global searches by incor-
FFT was developed where the ligand and the receptor are porating the van der Waals attractive potential, geometric
considered as rigid bodies, and their conformational changes collision, screened electrostatic potential, and Lazaridis–
are calculated by allowing a certain degree of inter-protein Karplus desolvation energy into the scoring function.
penetration (Katchalski-Katzir et al. 1992). These domain– Structure flexibility was based on stepwise potentials that
domain poses were also scored by binding energy and a were generated from the corresponding continuous forms
pseudo-energy term based on restraints derived from linker (Zhang and Lai 2011).
and end-to-end distances in pyDockTET (tethered-docking). Cell-Dock also performs the global scan using the transla-
The other programs include SOFTDOCK (Jiang and Kim tional and rotational space of two molecules based on surface
1991), BiGGER (Palma et al. 2000), and SKE-DOCK complementarity and electrostatics. A paramount difference
(Terashi et al. 2007). For the sake of matching efficiency, each with FTDock is that the value of the grid size is fixed in a
grid point is given a value of ‘1’ when occupied by the protein number of cells that reflects grid cell resolution and total span
or ‘0’ otherwise. This grid-based system is similar to FFT- in Angstroms (Pons et al. 2012). Furthermore, to reduce the
based grid searches, except that it has simpler values on the size of molecules from large compound libraries, shape com-
grid. Although the affinity of a ligand–protein complex is plementarity was introduced between ligand and protein in
determined mainly by complementary physical-chemistry fea- MS-DOCK to perform efficient multiple conformation rigid-
tures, shape complementarity became an essential part in rigid body docking (Sauton et al. 2008). The contact surface be-
body docking programs (O’Sullivan et al. 1991). Apart from tween the ligand and the protein is further optimized by a
electrostatics, the hydrophobic complementarity based on ge- Gaussian shape fitting function in FLOG (Miller et al.
ometry was incorporated in the MolFit FFT program to cal- 1994), CLIX (Lawrence and Davis 1992), FRED (McGann
culate the interface of a protein–protein complex (Katchalski- et al. 2003), and PAS-Dock (Protein Alpha Shape-Dock)
Katzir et al. 1992). Recently, PIPER was developed to predict (Tøndel et al. 2006) to perform rigid body docking.
mutual orientation of the two proteins using pairwise interac- The TagDock toolkit produces macromolecular com-
tion potential between the atoms i and j. The contributions to plexes from rigid monomers by generating randomly posed
the scoring function are evaluated in discretized 6D space as docked pairs (decoys) that agree with inter-monomer distance
Biophys Rev

restraints determined experimentally by using a penalty for However, SymmDock only predicts structures with cyclic
each decoy (Smith et al. 2013). Examples of other docking symmetry. If the input monomers are with different symmetry
programs that use local shape featuring algorithms include in its native complex, then SymmDock is not suitable for such
LZerD (Venkatraman et al. 2009), PatchDock (Schneidman- a prediction (Schneidman-Duhovny et al. 2005).
Duhovny et al. 2005) and GAPDOCK (Gardiner et al. 2001). With simple unbound–bound target cases, 47% of the in-
Geometric hashing algorithms also perform a global protein– terface contacts were correctly predicted by ZDOCK, demon-
protein docking using local shape descriptors, such as surface strating its strength in binding site prediction (Wiehe et al.
patches in PatchDock (Harrison et al. 2002) or 3D Zernike 2005). INTELEF, an updated version of SOFTDOCK, pre-
descriptors in LZerD (Venkatraman et al. 2009), between pro- dicted 66 corrected solutions out of 83 with ranks in the top
teins. Recently, an integrated algorithm, MEMDOCK 2000 solutions (Li et al. 2007). Using the SE-Dock server, the
(Membrane Dock) was designed for docking within the mem- smallest RMSD between the model and experimental struc-
branes. The method models both side chain and backbone tures obtained were 3.307 and 3.324 Å, respectively. In the
flexibility and performs rigid body optimization of the ligand docking step, out of eight targets, the SKE-DOCK server gen-
orientation using modified Patchdock and Fiberdock (Hurwitz erated acceptable models with ligand RMSD of 10 Å or lower
et al. 2016). for five targets. For the results of three targets, SKE-DOCK
failed in the geometric docking because of improper confor-
mations obtained during the docking step (Terashi et al. 2007).
Accuracy of rigid body docking When considering only the cases that have at least one accept-
able solution generated by ZDOCK, the success rates of
Docking was considered successful if the binding of a ligand pyDockTET for predicting an acceptable conformation in
into its active site was closer than a given threshold from the the top 10 and 50 solutions are 69% and 77%, respectively,
X-ray solution. The DOCK program applied to aspartic pro- whereas the success rates of pyDock alone are 62% and 69%,
tease of HIV resulted in a candidate inhibitor with high poten- respectively (Cheng et al. 2008).
cy turned out to be several orders of magnitude too low for Except in five known difficult cases (1BGX, 1I4D, 1SBB,
clinical use. However, this molecule can be used as a lead 1HE8, 1IB1), several acceptable solutions have been found in
compound for the design of more potent inhibitors. almost all docking cases with RMSDL ≤10 Å or RMSDL ≤4
Ring and coworkers designed inhibitors against proteases Å within 10,000 default predictions yielded by FRODOCK
of schistosome and malaria parasites that are crucial to the (Garzon et al. 2009). For 64% of acetylcholinesterase com-
pathogenicity by using shape-complementarity function and plexes, the shape complementarity identified by HEX over-
a simplified molecular-mechanics potential approximating the laps with the native binding site (Wass et al. 2011). Further,
interaction energy between the protease and ligand (Ring et al. Cell-Dock was tested on the unbound structures of protein–
1993). The DOT program successfully predicted the electron protein docking benchmark version 2.0 formed by 84 cases. In
transfer complex of the positively charged cytochrome c to the 89% of the cases using CELL-256 and in 85% of the cases
negative region on the cytochrome c oxidase surface formed when using CELL-128, the docking poses are nearer to native
by subunit II (Roberts and Pique 1999). Out of 25 protein– conformations. These results were also assessed by pyDOCK
protein complexes tested using the BiGGER program, 22 based on electrostatics, desolvation, and van der Waals energy.
complexes were near to native docked geometries with The scoring by pyDOCK showed a slightly better success rate
C(alpha) RMS deviations ≤4.0 A from the experimental struc- with CELL-256 than with CELL-128. With CELL-256,
tures, of which 14 were found within the 20 top ranking solu- 19.7% of the cases obtained near native conformations within
tions (Palma et al. 2000). With the omission of water mole- the top 10 scoring solutions, whereas 18.3% of the cases
cules, the top-ranking solutions of the MolFit program using showed near native conformations within the top 10 scoring
geometric and geometric-electrostatic docking identify clus- solutions using CELL-128 (Pons et al. 2012). In both the
ters of nearly correct solutions with limited rotational freedom cases, the differences were minimal and the values in general
at the interface for disassembled and unbound structures were similar to those achieved by pyDOCK when scoring
(Heifetz et al. 2002). In round 1 of CAPRI (Critical FTDOCK models (Pons et al. 2010).
Assessment of PRediction of Interactions) experiments, According to the latest CAPRI experiments carried out in
GAPDOCK correctly predicted 17 of 52 interprotein contacts 2013, the ClusPro server was best in automated protein
with target 1 and 27 of 52 contacts with target 2 compared to docking equivalent to the best human predictor group.
those obtained by other methods (Gardiner et al. 2003). Using HADDOCK (de Vries et al. 2010), SwarmDock (Torchala
PatchDock, out of 35 examples, 31 examples were shown to et al. 2013), and PIE-Dock (Ravikant and Elber 2010) were
have the lowest RMSD below 2 Å. In 26 cases, the correct the next best. In the human predictor category, HADDOCK
poses were ranked first, whereas in the other nine cases, the (Dominguez et al. 2003) was given the first rank, followed by
correct solution is ranked among the first 30 conformations. SwarmDock (Venkatraman and Ritchie 2012). ICM
Biophys Rev

(Fernández-Recio et al. 2002) was ranked in the 2nd to 5th reductase) and its redox partners were modeled and their bind-
positions (Kozakov et al. 2013; Lensink and Wodak 2013). ing interfaces were predicted. The results obtained were high-
The predicted binding mode for the CCDC-Astex set of 85 ly similar to FNR:Fd complexes of Anabaena and maize,
diverse protein–ligand complexes is correct in approximately showing a good correlation computationally. QXP is a multi-
80% of cases with rDock (Ruiz-Carmona et al. 2014). By step docking program using a local Monte Carlo search with a
incorporating the electrostatic term, MEGADOCK 2.1 suc- restricted rotational angle (Pellegrini and Doniach 1993).
cessfully predicted at least one near-native decoy for 128 pro- Recently, a newly designed and implemented version of the
tein complexes in the bound set and 23 complexes in the AutoDock program called AutoDock Vina has been released.
unbound set in the top 100 scored decoys. When compared This version abandoned the former empirical scoring function
with ZDOCK 3.0, MEGADOCK 2.1 was less successful and GA-based optimizer, but adopted a new knowledge-based
(Ohue et al. 2014b). scoring function with a Monte Carlo sampling technique and
the Broyden–Fletcher–Goldfarb–Shanno (BFGS) method for
local optimization. Their simulation results showed a signifi-
Flexible docking cant improvement in both prediction accuracy and docking
time. PSOVina is the first PSO (particle swarm optimization)
In standard virtual docking studies, ligands are freely docked protein–ligand docking algorithm in the framework of
into a rigid receptor. However, it has become increasingly AutoDock Vina (Ng et al. 2015). Through careful integration
clear that side chain flexibility plays a crucial role in ligand– of Vina’s efficient local optimizer into the canonical PSO pro-
protein complexes. These changes allow the receptor to alter cedure and proper tuning of parameters, PSOVina achieved a
its binding site according to the orientation of the ligand. The remarkable execution time reduction of 51–60% without
ligand orients in a (6 + N)-dimensional space of translational, compromising the docking accuracies. In recent years, swarm
rotational, and conformational variables in the anisotropic en- intelligence algorithms have emerged as a fast and reasonably
vironment of the receptor (Jackson et al. 1998; Moon and accurate technique in solving complex search problems in
Howe 1991; Rotstein and Murcko 1993a, b; Nishibata and computer science. To date, there exists only a handful of
Itai 1993). Four different strategies are currently in use for swarm-based docking methods: SODOCK, a hybrid of PSO
docking flexible ligands, namely: (a) Monte Carlo or and Solis and Wets’ local search method (Chen et al. 2007);
molecular-dynamics docking of complete molecules; (b) in- PLANTS, an ant colony optimization method (Korb et al.
site combinatorial search, (c) ligand buildup; and (d) site map- 2009); pso@autodock, a velocity adaptive and regenerative
ping and fragment assembly. constricted PSO method (Namasivayam and Günther 2007);
Monte Carlo methods accept or reject the random changes ParaDockS, a parallel docking suite having PSO as the opti-
of the thermodynamic accessible states by using Metropolis mization algorithm (Banitt and Wolfson 2011); and
criteria (Metropolis and Ulam 1949). The configurations with FIPSDock, the fully informed PSO method (Liu et al. 2013).
increase in temperature T will be accepted by slow cooling Three of the programs were modifications of the popular
through so-called simulated annealing (Kirkpatrick et al. open-source docking program AutoDock, albeit different ver-
1983). The changes in conformations are quite large, allowing sions, and all of them showed better predictive performance
the ligand to cross the energy barriers on the potential energy when compared to the original AutoDock implementation.
surface. This technique of conformational searches combined Furthermore, a novel search method called QPSO-ls
with the potentials of molecular affinity gives an efficient (quantum-behaved particle swarm optimization) was intro-
method of substrate docking with known structures duced for solving a highly flexible docking problem, which
(Goodsell and Olson 1990). Along with affinity potentials, is a hybrid of quantum-behaved particle swarm optimization
distance constraints were added as soft potentials in simulated (QPSO) and a local search method of Solis and Wets (Fu et al.
annealing (Yue 1990). 2015). In another program called GalaxyDock, the receptor
Examples of applying the Monte Carlo methods include an side chains were preselected and globally optimized using an
earlier version of AutoDock (Novotny et al. 1989), ICM AutoDock-based algorithm for flexible side-chain docking
(International Computer Management) (O’Sullivan et al. (Shin and Seok 2012). FLIPDock uses the AutoDock force
1991), QXP (quick explore) (Pellegrini and Doniach 1993), field for generating multiple receptor conformations, termed
and Affinity (Ring et al. 1993). AutoDock 2.4 generates con- as the flexibility tree (FT) (Zhao and Sanner 2007).
formers in real space using Monte Carlo simulated annealing Furthermore, ‘RosettaLigand’ uses a low-resolution docking
with a rapid energy evaluation using molecular affinity grids in the initial step combined with translational and rotational
using common force fields (Leach 1994). ICM software gen- adjustments (DeLuca et al. 2015). GOLD explores the flexi-
erates the ligand in 3D grid space by Monte Carlo movements bility of the ligand through the process of evolution by using a
and minimization of interaction potentials. Using this soft- genetic algorithm and displaces loosely bound water on ligand
ware, the interactions between FNR (ferredoxin:NADP+ binding (Jones et al. 1995, 1997). Later, a wide range of
Biophys Rev

nuclear magnetic resonance (NMR) and available experimen- (maximal common substructure search) against HIV protease
tal as well as bioinformatics data was used to drive the to map a binding site and constructed peptide inhibitors by
docking process in HADDOCK (Dominguez et al. 2003). building bonds to connect the various minima they found
Previous modeling studies on protein–DNA and protein– (Caflisch et al. 1993). In HOOK, MCSS is also used in the
RNA complexes using NMR data have been shown to be mapping stage, but the minima are connected by a database of
successful (Gu et al. 2015; Bursulaya et al. 2003; Paul and molecular scaffolds for possible connectors (Eisen et al.
Rognan 2002; Berman et al. 2002). The various degrees of 1994). FlexX uses a tree-search technique for placing the li-
conformational flexibility of DNA were sampled by the semi- gand into the active site, incrementally starting with the base
flexibility of sugar-phosphate backbone and DNA base pairs fragment (Rarey et al. 1996).
for further docking calculations. In FTDOCK, the docking Unlike other docking programs, Glide performs a complete
score is measured by rotating and translating the protein along systematic search of the conformational, orientational, and
the DNA using shape and electrostatic complementarity by positional space of the docked ligand with the OPLS-AA
approximate flexibility (Bruccoleri and Karplus 1990). force field (Optimized Potentials for Liquid Simulations).
Further rotamer libraries can be used to reduce the side chain The best possible conformation is further refined using
placement problem to a combinatorial optimization problem Monte Carlo sampling (Friesner et al. 2004). SLIDE
with the minimum energy, i.e., the global minimum energy (‘Screening for Ligands by Induced-fit Docking,
conformation (GMEC) (Kohlbacher and Lenhof 2000; Efficiently’) optimization is based on the mean-field theory,
Canutescu et al. 2003). One of these methods is based on balancing flexibility between the ligand and the protein side
the dead-end elimination (DEE) theorem of Desmet et al. chains (Schnecke et al. 1998). Further, a surface-based
(1992). Later, GMEC was investigated as the convex hull of molecular similarity method was implemented in Surflex
all feasible solutions with some classes of facet-defining in- (Jain 2003) to rapidly generate suitable putative poses for
equalities in a branch-and-cut algorithm. The side chain con- molecular fragments using the Hammerhead docking sys-
formations generated by these techniques are then subjected to tem (Jain 2003). In addition, a multi-objective docking
a geometry optimization with a molecular mechanics force strategy, MoDock, has been proposed to further improve
field. Finally, the binding free energy of the optimized struc- the pose prediction with the available scoring functions
ture is estimated (Jackson and Sternberg 1995). divided into the following three types: force field-based,
Further, the algorithms were developed to build ligands empirical-based, and knowledge-based. The results obtained
directly in the binding site in flexible-docking and design indicate that the multi-objective strategy can enhance the pose
strategy. One of these was the de novo design of peptide in- prediction power of docking with the available scoring
hibitors using a library of low-energy conformations of isolat- functions (Gu et al. 2015).
ed amino acid residues as building blocks (Moon and Howe
1991). Subsequently, this method was extended to a non-
peptide ligand design using functional groups or single atoms Accuracy of flexible docking
using GroupBuild and LEGEND (Nishibata and Itai 1993;
Rotstein and Murcko 1993a, b). Goodford introduced the idea Initially, three different docking programs (Dock, FlexX, and
of using functional groups (water, methyl group, amine nitro- GOLD), with six different scoring functions (Chemscore,
gen, carboxy oxygen, and hydroxyl) as molecular probes to Dock, FlexX, Fresno, Gold, Pmf score) were evaluated
map the binding site of a macromolecule (Goodford 1985). against thymidine kinase (TK) and estrogen receptor to mea-
Thus, the energy contour surfaces for the various probes dif- sure the accuracy of virtual screening methods. Out of the
ferentiate regions of attraction between the probe and protein. three docking programs, GOLD showed 60% docking accu-
The procedure is well suited to multiple-copy techniques racy with less than 1.2 Å RMSD, including the worst docked
(Miranker and Karplus 1991). The goal of fragment- orientation, with an RMSD of 3.1 Å. Surprisingly, both Dock
assembly approaches, pioneered by Lewis and Dean (1989a, as well as FlexX were not able to produce a reasonable solu-
b), is to connect the individual molecular fragments into a tion for at least three TK ligands (IdU (5-iododeoxyuridine),
single viable molecule. The CLIX program attempts to make hmtt (6-[6-hydroxymethy-5-methyl-2,4-dioxo-hexahydro-
a pair of favorable interactions in the binding site of the pro- pyrimidin-5-yl-methyl]-5-methyl-1H-pyrimidin-2,4-dione),
tein with a pair of chemical substitutions (Lawrence and Davis and mct ((North)-methanocarba-thymidine) for Dock; hmtt,
1992). LUDI places molecular fragments to form hydrogen ganciclovir, and penciclovir for FlexX). Furthermore, the
bonds with the enzyme so that the hydrophobic pockets are best docking poses for ERα receptor with raloxifene, 4-
filled. These fragments are then linked together with suitable hydroxytamoxifen, were obtained using GOLD and, to
spacers (Böhm 1992). The linked-fragment approach of some extent, with FlexX. On the other hand, DOCK failed
Verlinde and coworkers are based on shape descriptors completely to predict a reliable pose for raloxifene.
(Verlinde et al. 1992). Caflisch and coworkers used MCSS However, no relationship was found between the docking
Biophys Rev

accuracy and ranking score with these programs (Bissantz lowest energy conformations by GOLD and DrugScore with
et al. 2000). a proper ZBG (zinc binding group) binding. However,
In 2003, five docking programs, DOCK 4.0, FlexX 1.8, DOCK, GOLD, and DrugScore produced RMSD values
AutoDock 3.0, GOLD 1.2, and ICM 2.8, were accessed with greater than 8 and improper ZBG binding, showing significant
a dataset of 37 protein–ligand complexes and screening the differences between the docked pose and crystal structures. In
compounds containing 10,037 entries against 11 different pro- contrast, AutoDock and FlexX gave better results, with
teins. The results revealed that ICM provided the highest RMSDs of 2.91 and 2.63 Å and a proper ZBG binding. If
docking accuracy against these receptors, with a value of the RMSD limit is increased to 2.5 Å, the percentage of the
0.93 compared to AutoDock, DOCK, FlexX, and GOLD, well-docked poses with good/fair ZBG binding increased to
with acceptable accuracies of 0.47, 0.31, 0.35, and 0.52, re- 90% for all five approaches (Hu et al. 2004). At the 2% level
spectively. In 17 cases, ICM predicted the original ligands of predicting top-scoring molecules, Glide identifies known
within the top 1% of the total library screened with 50% of active molecules for four of the five protein targets. However,
the potentially active compounds falling under ∼1.5% of top for the 10 and 20% levels, Glide was the only program which
scoring solutions, while DOCK and FlexX predicted only identified one or more of the known active molecules for
∼9% of potentially active compounds. It was also found that each of the five target proteins. The same 2% level of
∼46%, 30%, 35%, 46%, and 76% of the molecules were success was achieved when 5% of the top-scoring mole-
docked correctly within 2 Å RMSD by AutoDock, DOCK, cules were considered by DOCKVISION and Glide. All
FlexX, GOLD, and ICM, respectively (Bursulaya et al. 2003). the other programs achieved between 10 and 20% level of
Furthermore, in 2004, eight docking programs were evalu- success by identifying one or more active seeds for four
ated with 100 protein–ligand complexes (Paul and Rognan of the five targets (Cummings et al. 2005).
2002) from the Protein Data Bank (PDB) (Berman et al. Further results against 164 targets show that ICM and Glide
2002). At an RMSD cutoff of 2 Å, 50–55% of the ligands produce the lowest average RMSDs of 1.08 and 2.37 Å
were successfully docked using FlexX, Glide, GOLD, and matching with the native ligands, while GOLD and FlexX
Surflex, whereas the success rates of DOCK, FRED, SLIDE, fared worse, with RMSDs of 2.80 and 3.98 Å, respectively.
and QXP did not exceed 40%. Using the protein-bound X-ray At the RMSD cutoff of 2.0 Å, ICM and Glide showed success
conformation, OMEGA was able to predict at least one con- rates of 91 and 63%, respectively, by classifying 149 out of
formation closer than 2 Å for 99% of 100 ligands. With ran- 164 and 104 out of 164 compounds correctly within this
dom ligand conformations, the docking poses obtained with threshold. GOLD also performed reasonably well by classify-
FRED were satisfactory, with RMSDs between 1.76 and 2.14 ing 91 from 164 (55%), while FlexX performed less well with
Å of docked poses from X-ray conformation. All these 70 from 164, a percentage of 42%. ICM and Glide again
docking programs performed well with small hydrophobic performed well at the more stringent RMSD cutoff of 1.0 Å,
ligands, while the performance of GOLD and Surflex correctly docking 93 out of 164 and 81 out of 164, leading to
remained roughly unchanged. Moreover, Glide and FRED success rates of 57 and 49%, respectively. GOLD was suc-
are still efficient in ligand placement with a poor ranking abil- cessful in 64 out of 164 cases, for a success rate of 39%, and
ity (Kellenberger et al. 2004). FlexX was successful with a rate of 26% (42 out of 164)
In the same year of 2004, three highly regarded docking (Chen et al. 2006). With eight protein targets, 50% of the
programs, namely, Glide, GOLD, and ICM, were evaluated ligands were placed well for five targets by at least one pro-
on a vertex dataset of 150 diverse protein–ligand complexes to gram. Indeed, 90% of the ligands could be docked with the
predict their ability to reproduce crystallographic binding ori- correct orientation and 100% could be docked in the correct
entations. In 61% of the cases, Glide correctly identified the location for several protein targets (Warren et al. 2006). The
crystallographic pose within 2.0 Å, compared to GOLD with RMSD-based evaluations against 116 complexes of 13 types
48% and 45% for ICM (Perola et al. 2004). In regards to revealed that no docking program was significantly superior
ligand complexity, all these docking programs performed to GOLD. Thirteen complexes found solutions with an RMSD
well, with the ligands having ten or fewer rotatable bonds. of 2 Å or better only by GOLD, and no solution was
However, LigandFit identified 75% of its close conformations found by either AutoDock or DOCK alone. The sizes of
when the ligands have ten or fewer rotatable bonds, while the binding sites for the complexes that were successfully
FlexX identified 69% for less than ten rotatable bonds, which solved only by GOLD were widely distributed, from 2253
increases to 92% if the ligand has 15 or fewer degrees of to 7900 Å, and represented the various protein types
freedom. In contrast, the sensitivity of Glide is less, with a (Onodera et al. 2007).
78% success rate with smaller complexity less than 15, while Later, ten docking programs and 37 scoring functions were
GOLD is the least sensitive of all (Kontoyianni et al. 2004). analyzed against seven protein types to predict the binding
Evaluation of known crystal structures of 40 zinc- mode, lead identification using virtual screening, and lead
dependent metalloproteinase ligand complexes showed the optimization. Out of these ten programs, Glide, GOLD, and
Biophys Rev

QXP showed success for 61–63% of the cases with an RMSD less than 2 Å between the top scored pose and the native pose.
cutoff of 1 Å. At this cutoff, the docking was successful in On the basis of the results for the top scored poses, the perfor-
only 48% and 54% of the cases with FLEXX and Surflex, mance of the academic programs conform to the following
respectively. With an RMSD threshold of 2 Å, 80–90% of order: LeDock (57.4%) > rDock (50.3%) ∼ AutoDock Vina
the ligands using Glide, GOLD, Surflex, and QXP, while (49.0%) > AutoDock (PSO) (47.3%) > UCSF DOCK (44.0%)
66% and 62% of the cases in FLEXX and FRED, respectively, > AutoDock (LGA) (37.4%), and that of the commercial pro-
were place within 2.0 Å of the X-ray pose. Lastly, DOCK and grams confirm to the following order: GOLD (59.8%) > Glide
SLIDE placed only 50% of the ligands within the 2 Å RMSD (XP) (57.8%) > Glide (SP) (53.8%) > Surflex-Dock (53.2%) >
threshold. Studies also showed that GOLD performed well LigandFit (46.1%) > MOE-Dock (45.6%). The averaged suc-
with hydrophilic targets where there is some lipophilic char- cess rates of the commercial docking programs in predicting
acter in the active site (i.e., thermolysins and PPAR-γ). the top scored poses and best poses are 54.0% and 67.8%,
Contrary to GOLD, both LigandFit and Glide performed well which can be compared to academic programs, with success
with COX-2, a target with a mainly hydrophobic binding rates of 47.4% and 68.4%, respectively (Wang et al. 2016).
pocket (Kontoyianni et al. 2004). This shows that all these docking algorithms were able to
Furthermore, seven commonly used programs were evalu- explore the conformational space to generate correctly docked
ated on the PDBbind database with 1300 protein complexes poses in the binding pockets sufficiently well on a diverse set
(Plewczynski et al. 2011). The results showed that Surflex, of protein–ligand complexes.
FlexX, LigandFit, eHiTS, and GOLD were reasonable, with In general. Glide performs well with diversified binding
failed complexes amounting to not more than 30, while 60% sites and flexibility of the ligand, while ICM and GOLD per-
of their complexes in GOLD and eHiTS have their top score form significantly poorer when binding sites are mainly influ-
conformations below 2 Å. AutoDock failed to dock nearly 90 enced by hydrophobic contacts. These results also show that
pairs, while only 1170 (90% of the entire database) complexes the difference between the commercial and academic pro-
overcame the Glide ligand restraints on the number of rotat- grams was not obvious, even though the capability of
able bonds to 35 and ligand size of 200 atoms. Both pose predicting the ligand binding poses by the commercial pro-
prediction and scoring capabilities of Glide, AutoDock, and grams is slightly better than that of the academic programs
Surflex achieved results of around 50%. The docking accura- from a global perspective.
cy by LigandFit reached nearly 60% with a higher number of
rotatable bonds, whereas medium and weak ones achieved
only 50%. The level of correlation for hydrophobic molecules Conclusions
is 0.2, though the ligand–protein contacts were based on van
der Waals and polar interactions. Structure-based drug design is a powerful technique for the
Later, 19 docking protocols were used to predict bound rapid identification of small molecules against the 3D struc-
conformations for the 136 compounds of seven different tar- ture of the macromolecular targets available by either X-ray,
gets (kinase, protease, isomerase, polymerase, synthetase, NMR, or homology models. Because of abundant information
metalloprotease, and NHR) of the available protein/ligand regarding the sequences and structures of the proteins, the
crystal structures. For all targets except HCVP, at least one structural information of individual proteins and their in-
program was able to dock 40% of the ligands within 2 Å of teractions became very important for further drug therapy.
the crystal conformation. In 2010, four popular docking pro- Although many docking programs exist for conformational
grams were evaluated, Glide (version 4.5), GOLD (version searching and binding pose prediction, the scoring func-
3.2), LigandFit (version 2.3), and Surflex (version 2.0), on a tions are not accurate and need to be improved further.
test set of 195 protein–ligand complexes. Out of these four Nevertheless, despite the drawbacks of each docking strategy,
docking programs, GOLD and Surflex processed well with active research is taking place to address all the issues regarding
the dataset, while Glide and LigandFit failed to process 25 scoring, explicit protein flexibility, explicit water, etc.
and 8 complexes, respectively. Except for Surflex, the docking Even in the absence of knowledge regarding the binding
solutions produced in 40% of the cases by other programs site and limited backbone movements, a variety of search
were less than 1.0 Å of the RMSD, whereas Glide and algorithms have been developed for protein–protein docking
GOLD showed a 60% success rate on this highly diverse test. over the past two decades. As rigid body docking can system-
Based on these results, these docking programs were ranked atically explore the shape complementarity between proteins,
as Glide > GOLD + gold score > GOLD + Chemscore ∼ this may not work well for docking the proteins that are crys-
GOLD + ASP ∼ Ligandfit > Surflex (Li et al. 2010). tallized separately. Thus, a high-resolution protocol is very
Recently, ten docking programs were evaluated. The suc- much needed to understand the basic principles to detect the
cess rate for the top scored and best poses varied from 40% to underlying mechanism of protein–protein interactions and ac-
60% and 60% to 80%, respectively. The RMSD obtained is tual binding with other proteins. Rescoring using empirical
Biophys Rev

potentials may not even eliminate all the false-positives. References


Even fine tuning of individual protein–protein interactions
by redesigning the protein interface depends on the accu- Alberg DG, Schreiber SL (1993) Structure-based design of a cyclophilin–
rate structure of the protein complex generated by high- calcineurin bridging ligand. Science 262:248–250
Allen WJ, Balius TE, Mukherjee S et al (2015) DOCK 6: impact of new
resolution docking protocols.
features and current docking performance. J Comput Chem 36:
Although, ZDOCK, rDOCK, and HEX provided the 1132–1156. doi:10.1002/jcc.23905
results with high docking accuracy, the provided com- Bajaj CL, Chowdhury R, Siddahanavalli V (2011) F2Dock: fast Fourier
plexes are not highly useful to design the inhibitors for protein–protein docking. IEEE/ACM Trans Comput Biol Bioinform
8:45–58. doi:10.1109/TCBB.2009.57
the protein interfaces due to constraints in rigid body
Banitt I, Wolfson HJ (2011) ParaDock: a flexible non-specific DNA—
docking. Due to this, flexible approaches were developed rigid protein docking algorithm. Nucleic Acids Res 39, e135.
that generally examine very limited conformations com- doi:10.1093/nar/gkr620
pared to the rigid body methods. These docking methods Berman HM, Battistuz T, Bhat TN et al (2002) The protein data bank.
predict binding poses most likely to occur on the broad Acta Crystallogr D Biol Crystallogr 58:899–907
Bissantz C, Folkers G, Rognan D (2000) Protein-based virtual screening
surface regions and then define the sites into high-affinity of chemical databases. 1. Evaluation of different docking/scoring
complex structures. The best example is the HADDOCK combinations. J Med Chem 43:4759–4767
software, which has been quite successful in resolving a Bodian DL, Yamasaki RB, Buswell RL, Stearns JF, White JM, Kuntz ID
large number of accurate models for protein–protein com- (1993) Inhibition of the fusion-inducing conformational change of
influenza hemagglutinin by benzoquinones and hydroquinones.
plexes. One good example is the study of the complex Biochemistry 32:2967–2978
formed between plectasin, a member of the innate immune Böhm HJ (1992) The computer program LUDI: a new method for
system, and the bacterial wall precursor lipid II. The study the de novo design of enzyme inhibitors. J Comput Aided Mol
has clearly identified the residues involved at the binding Des 6:61–78
Brady GP Jr, Stouten PF (2000) Fast prediction and visualization of
site between the two proteins, providing valuable informa- protein binding pockets with PASS. J Comput Aided Mol Des 14:
tion for the design of novel antibiotics. 383–401
However, the absolute energies associated with the inter- Brooks BR, Brooks CL, MacKerell AD et al (2009) CHARMM: the
molecular interaction are not estimated with satisfactory accu- biomolecular simulation program. J Comput Chem 30:1545–1614.
doi:10.1002/jcc.21287
racy by the current algorithms. The major issues of solvent Bruccoleri RE, Karplus M (1990) Conformational sampling using
effects, entropic effects, and receptor flexibility still need high-temperature molecular dynamics. Biopolymers 29:1847–
to be handled with special attention. As of now, some 1862. doi:10.1002/bip.360291415
methods like MOE-Dock, GOLD, Glide, FlexX, Surflex, Bursulaya BD, Totrov M, Abagyan R, Brooks CL 3rd (2003)
Comparative study of several algorithms for flexible ligand docking.
etc. that deal with side chain flexibility have been proven
J Comput Aided Mol Des 17:755–763
effective and adequate in most of the cases. The realistic Caflisch A, Miranker A, Karplus M (1993) Multiple copy simultaneous
interactions between small molecules and receptors still search and construction of ligands in binding sites: application to
rely on experimental technology. Moreover, using the cur- inhibitors of HIV-1 aspartic proteinase. J Med Chem 36:2142–2167
Canutescu AA, Shelenkov AA, Dunbrack RL Jr (2003) A graph-theory
rent docking methods, although they discriminate between
algorithm for rapid protein side-chain prediction. Protein Sci 12:
different ligands based on binding affinity with high accu- 2001–2014. doi:10.1110/ps.03154503
racy, the mode of binding, solvent effects, entropic effects, Chen R, Li L, Weng Z (2003) ZDOCK: an initial-stage protein-docking
and effects of protonation states of the charged residues in algorithm. Proteins 52:80–87. doi:10.1002/prot.10389
the active site are still major problems. With the aid of Chen H, Lyne PD, Giordanetto F, Lovell T, Li J (2006) On evaluating
molecular-docking methods for pose prediction and enrichment fac-
community efforts such as CAPRI (Critical Assessment of tors. J Chem Inf Model 46:401–415. doi:10.1021/ci0503255
PRediction of Interactions), a large number of docking Chen HM, Liu BF, Huang HL, Hwang SF, Ho SY (2007) SODOCK:
algorithms and their limitations were overcome with swarm optimization for highly flexible protein–ligand docking. J
benchmark testing. But the problem of flexibility is still Comput Chem 28:612–623. doi:10.1002/jcc.20542
Cheng TM, Blundell TL, Fernandez-Recio J (2008) Structural assembly
under investigation and with the accelerated pace of research of two-domain proteins by rigid-body docking. BMC
in this area, it will be tackled soon in the near future. Bioinformatics 9:441. doi:10.1186/1471-2105-9-441
Corbeil CR, Williams CI, Labute P (2012) Variability in docking success
rates due to dataset preparation. J Comput Aided Mol Des 26:775–
Compliance with ethical standards 786. doi:10.1007/s10822-012-9570-1
Cummings MD, DesJarlais RL, Gibbs AC, Mohan V, Jaeger EP (2005)
Comparison of automated docking programs as virtual screening
Conflict of interest Nataraj S. Pagadala declares that he has no conflict
tools. J Med Chem 48:962–976. doi:10.1021/jm049798d
of interest. Khajamohiddin Syed declares that he has no conflict of inter-
de Vries SJ, van Dijk M, Bonvin AM (2010) The HADDOCK web server
est. Jack Tuszynski declares that he has no conflict of interest.
for data-driven biomolecular docking. Nat Protoc 5:883–897.
doi:10.1038/nprot.2010.32
Ethical approval This article does not contain any studies with human Debnath AK, Radigan L, Jiang S (1999) Structure-based identification of
participants or animals performed by any of the authors. small molecule antiviral compounds targeted to the gp41 core
Biophys Rev

structure of the human immunodeficiency virus type 1. J Med Chem Jain AN (2003) Surflex: fully automatic flexible molecular docking using
42:3203–3209. doi:10.1021/jm990154t a molecular similarity-based search engine. J Med Chem 46:499–
DeLuca S, Khar K, Meiler J (2015) Fully flexible docking of medium 511. doi:10.1021/jm020406h
sized ligand libraries with RosettaLigand. PLoS One 10:e0132508. Jiang F, Kim SH (1991) BSoft docking^: matching of molecular surface
doi:10.1371/journal.pone.0132508 cubes. J Mol Biol 219:79–102
Desmet J, De Maeyer M, Hazes B, Lasters I (1992) The dead-end elim- Jones G, Willett P, Glen RC (1995) Molecular recognition of receptor
ination theorem and its use in protein side-chain positioning. Nature sites using a genetic algorithm with a description of desolvation. J
356:539–542 Mol Biol 245:43–53
Dixon JS (1997) Evaluation of the CASP2 docking section. Proteins Jones G, Willett P, Glen RC, Leach AR, Taylor R (1997) Development
29(Suppl 1):198–204 and validation of a genetic algorithm for flexible docking. J Mol
Dominguez C, Boelens R, Bonvin AM (2003) HADDOCK: a protein– Biol 267:727–748. doi:10.1006/jmbi.1996.0897
protein docking approach based on biochemical or biophysical infor- Katchalski-Katzir E, Shariv I, Eisenstein M, Friesem AA, Aflalo C,
mation. J Am Chem Soc 125:1731–1737. doi:10.1021/ja026939x Vakser IA (1992) Molecular surface recognition: determination of
Eisen MB, Wiley DC, Karplus M, Hubbard RE (1994) HOOK: a program geometric fit between proteins and their ligands by correlation tech-
for finding novel molecular architectures that satisfy the chemical niques. Proc Natl Acad Sci U S A 89:2195–2199
and steric requirements of a macromolecule binding site. Proteins Kellenberger E, Rodrigo J, Muller P, Rognan D (2004) Comparative
19:199–221. doi:10.1002/prot.340190305 evaluation of eight docking tools for docking and virtual screening
Fernández-Recio J, Totrov M, Abagyan R (2002) Soft protein–protein accuracy. Proteins 57:225–242. doi:10.1002/prot.20149
docking in internal coordinates. Protein Sci 11:280–291 Kirkpatrick S, Gelatt CD Jr, Vecchi MP (1983) Optimization by simulated
Fischer D, Norel R, Wolfson H, Nussinov R (1993) Surface motifs by a annealing. Science 220:671–680. doi:10.1126/science.220.4598.671
computer vision technique: searches, detection, and implications for Kohlbacher O, Lenhof HP (2000) BALL—rapid software prototyping in
protein–ligand recognition. Proteins 16:278–292. doi:10.1002 computational molecular biology. Bioinform 16:815–824
/prot.340160306 Kontoyianni M, McClellan LM, Sokol GS (2004) Evaluation of docking
Friesner RA, Banks JL, Murphy RB et al (2004) Glide: a new approach performance: comparative data on docking algorithms. J Med Chem
for rapid, accurate docking and scoring. 1. Method and assessment 47:558–565. doi:10.1021/jm0302997
of docking accuracy. J Med Chem 47:1739–1749. doi:10.1021 Korb O, Stutzle T, Exner TE (2009) Empirical scoring functions for
/jm0306430 advanced protein–ligand docking with PLANTS. J Chem Inf
Fu Y, Wu XJ, Chen ZG, Sun J, Zhao J, Xu WB (2015) A new approach Model 49:84–96. doi:10.1021/ci800298z
for flexible molecular docking based on swarm intelligence. Math Koshland DE Jr (1963) Correlation of structure and function in enzyme
Probl Eng. doi:10.1155/2015/540186 action. Science 142:1533–1541
Gardiner EJ, Willett P, Artymiuk PJ (2001) Protein docking using a ge- Kozakov D, Brenke R, Comeau SR, Vajda S (2006) PIPER: an FFT-
netic algorithm. Proteins 44:44–56 based protein docking program with pairwise potentials. Proteins
Gardiner EJ, Willett P, Artymiuk PJ (2003) GAPDOCK: a genetic algo- 65:392–406. doi:10.1002/prot.21117
rithm approach to protein docking in CAPRI round 1. Proteins 52: Kozakov D, Beglov D, Bohnuud T et al (2013) How good is automated
10–14. doi:10.1002/prot.10386 protein docking? Proteins 81:2159–2166. doi:10.1002/prot.24403
Garzon JI, Lopéz-Blanco JR, Pons C et al (2009) FRODOCK: a new Kuntz ID, Blaney JM, Oatley SJ, Langridge R, Ferrin TE (1982) A
approach for fast rotational protein–protein docking. Bioinformatics geometric approach to macromolecule–ligand interactions. J Mol
25:2544–2551. doi:10.1093/bioinformatics/btp447 Biol 161:269–288
Goodford PJ (1985) A computational procedure for determining energet- Laskowski RA (1995) SURFNET: a program for visualizing molecular
ically favorable binding sites on biologically important macromole- surfaces, cavities, and intermolecular interactions. J Mol Graph 13:
cules. J Med Chem 28:849–857 323–330
Goodsell DS, Olson AJ (1990) Automated docking of substrates to pro- Lawrence MC, Davis PC (1992) CLIX: a search algorithm for finding novel
teins by simulated annealing. Proteins 8:195–202. doi:10.1002 ligands capable of binding proteins of known three-dimensional struc-
/prot.340080302 ture. Proteins 12:31–41. doi:10.1002/prot.340120105
Gu J, Yang X, Kang L, Wu J, Wang X (2015) MoDock: a multi-objective Leach AR (1994) Ligand docking to proteins with discrete side-chain
strategy improves the accuracy for molecular docking. Algorithms flexibility. J Mol Biol 235:345–356
Mol Biol 10:8. doi:10.1186/s13015-015-0034-8 Lensink MF, Wodak SJ (2013) Docking, scoring, and affinity prediction
Hammes GG (2002) Multiple conformational changes in enzyme cataly- in CAPRI. Proteins 81:2082–2095. doi:10.1002/prot.24428
sis. Biochemistry 41:8221–8228 Levitt DG, Banaszak LJ (1992) POCKET: a computer graphics method
Harrison SJ, Guidolin A, Faast R et al (2002) Efficient generation of for identifying and displaying protein cavities and their surrounding
alpha(1,3) galactosyltransferase knockout porcine fetal fibroblasts amino acids. J Mol Graph 10:229–234
for nuclear transfer. Transgenic Res 11:143–150 Lewis RA, Dean PM (1989a) Automated site-directed drug design: the
Heifetz A, Katchalski-Katzir E, Eisenstein M (2002) Electrostatics in concept of spacer skeletons for primary structure generation. Proc R
protein–protein docking. Protein Sci 11:571–587 Soc Lond Ser B Biol Sci 236:125–140
Hu X, Balaz S, Shelver WH (2004) A practical approach to docking of Lewis RA, Dean PM (1989b) Automated site-directed drug design: the
zinc metalloproteinase inhibitors. J Mol Graph Model 22:293–307. formation of molecular templates in primary structure generation.
doi:10.1016/j.jmgm.2003.11.002 Proc R Soc Lond Ser B Biol Sci 236:141–162
Hurwitz N, Schneidman-Duhovny D, Wolfson HJ (2016) Memdock: an Li N, Sun Z, Jiang F (2007) SOFTDOCK application to protein–
alpha-helical membrane protein docking algorithm. Bioinformatics protein interaction benchmark and CAPRI. Proteins 69:801–808.
32:2444–2450. doi:10.1093/bioinformatics/btw184 doi:10.1002/prot.21728
Jackson RM, Sternberg MJ (1995) A continuum model for protein–pro- Li X, Li Y, Cheng T, Liu Z, Wang R (2010) Evaluation of the performance
tein interactions: application to the docking problem. J Mol Biol of four molecular docking programs on a diverse set of protein–
250:258–275. doi:10.1006/jmbi.1995.0375 ligand complexes. J Comput Chem 31:2109–2125. doi:10.1002
Jackson RM, Gabb HA, Sternberg MJ (1998) Rapid refinement of protein /jcc.21498
interfaces incorporating solvation: application to the docking prob- Liu Y, Zhao L, Li W, Zhao D, Song M, Yang Y (2013) FIPSDock: a new
lem. J Mol Biol 276:265–285. doi:10.1006/jmbi.1997.1519 molecular docking technique driven by fully informed swarm
Biophys Rev

optimization algorithm. J Comput Chem 34:67–75. doi:10.1002 Pierce BG, Hourai Y, Weng Z (2011) Accelerating protein docking in
/jcc.23108 ZDOCK using an advanced 3D convolution library. PLoS One 6:
Mandell JG, Roberts VA, Pique ME et al (2001) Protein docking using e24657. doi:10.1371/journal.pone.0024657
continuum electrostatics and geometric fit. Protein Eng 14:105–113 Plewczynski D, Łaźniewski M, Augustyniak R, Ginalski K (2011) Can
Matsuzaki Y, Ohue M, Uchikoga N, Akiyama Y (2014) Protein–protein we trust docking results? Evaluation of seven commonly used pro-
interaction network prediction by using rigid-body docking tools: grams on PDBbind database. J Comput Chem 32:742–755.
application to bacterial chemotaxis. Protein Pept Lett 21:790–798 doi:10.1002/jcc.21643
McGann MR, Almond HR, Nicholls A, Grant JA, Brown FK (2003) Pons C, Grosdidier S, Solernou A, Pérez-Cano L, Fernández-Recio J
Gaussian docking functions. Biopolymers 68:76–90. doi:10.1002 (2010) Present and future challenges and limitations in protein–pro-
/bip.10207 tein docking. Proteins 78:95–108. doi:10.1002/prot.22564
Metropolis N, Ulam S (1949) The Monte Carlo method. J Am Stat Assoc Pons C, Jiménez-González D, González-Álvarez C et al (2012) Cell-
44:335–341 Dock: high-performance protein–protein docking. Bioinformatics
Mezei M (2003) A new method for mapping macromolecular topogra- 28:2394–2396. doi:10.1093/bioinformatics/bts454
phy. J Mol Graph Model 21:463–472 Rarey M, Kramer B, Lengauer T, Klebe G (1996) A fast flexible docking
Miller MD, Kearsley SK, Underwood DJ, Sheridan RP (1994) FLOG: a method using an incremental construction algorithm. J Mol Biol
system to select ‘quasi-flexible’ ligands complementary to a receptor 261:470–489. doi:10.1006/jmbi.1996.0477
of known three-dimensional structure. J Comput Aided Mol Des 8: Ravikant DV, Elber R (2010) PIE—efficient filters and coarse grained
153–174 potentials for unbound protein–protein docking. Proteins 78:400–
Miranker A, Karplus M (1991) Functionality maps of binding sites: a 419. doi:10.1002/prot.22550
multiple copy simultaneous search method. Proteins 11:29–34. Ring CS, Sun E, McKerrow JH et al (1993) Structure-based inhibitor
doi:10.1002/prot.340110104 design by using protein models for the development of antiparasitic
Moon JB, Howe WJ (1991) Computer design of bioactive molecules: a agents. Proc Natl Acad Sci U S A 90:3583–3587
method for receptor-based de novo ligand design. Proteins 11:314– Ritchie DW, Kemp GJ (2000) Protein docking using spherical polar
328. doi:10.1002/prot.340110409 Fourier correlations. Proteins 39:178–194
Namasivayam V, Günther R (2007) pso@autodock: a fast flexible mo- Ritchie DW, Venkatraman V (2010) Ultra-fast FFT protein docking on
lecular docking program based on swarm intelligence. Chem Biol graphics processors. Bioinformatics 26:2398–2405. doi:10.1093
Drug Des 70:475–484. doi:10.1111/j.1747-0285.2007.00588.x /bioinformatics/btq444
Ng MC, Fong S, Siu SW (2015) PSOVina: the hybrid particle swarm Roberts VA, Pique ME (1999) Definition of the interaction domain for
optimization algorithm for protein–ligand docking. J Bioinform cytochrome c on cytochrome c oxidase. III. Prediction of the docked
Comput Biol 13:1541007. doi:10.1142/S0219720015410073 complex by a complete, systematic search. J Biol Chem 274:38051–
Nishibata Y, Itai A (1993) Confirmation of usefulness of a structure con- 38060
struction program based on three-dimensional receptor structure for Roberts VA, Thompson EE, Pique ME, Perez MS, Ten Eyck LF (2013)
rational lead generation. J Med Chem 36:2921–2928 DOT2: macromolecular docking with improved biophysical
Novotny J, Bruccoleri RE, Saul FA (1989) On the attribution of binding models. J Comput Chem 34:1743–1758. doi:10.1002/jcc.23304
energy in antigen–antibody complexes McPC 603, D1.3, and Rotstein SH, Murcko MA (1993a) GenStar: a method for de novo drug
HyHEL-5. Biochemistry 28:4735–4749 design. J Comput Aided Mol Des 7:23–43
Ohue M, Matsuzaki Y, Uchikoga N, Ishida T, Akiyama Y (2014a) Rotstein SH, Murcko MA (1993b) GroupBuild: a fragment-based meth-
MEGADOCK: an all-to-all protein–protein interaction prediction od for de novo drug design. J Med Chem 36:1700–1710
system using tertiary structure data. Protein Pept Lett 21:766–778 Ruiz-Carmona S, Alvarez-Garcia D, Foloppe N et al (2014) rDock: a fast,
Ohue M, Shimoda T, Suzuki S, Matsuzaki Y, Ishida T, Akiyama Y versatile and open source program for docking ligands to proteins
(2014b) MEGADOCK 4.0: an ultra-high-performance protein–pro- and nucleic acids. PLoS Comput Biol 10, e1003571. doi:10.1371
tein docking software for heterogeneous supercomputers. /journal.pcbi.1003571
Bioinformatics 30:3281–3283. doi:10.1093/bioinformatics/btu532 Sauton N, Lagorce D, Villoutreix BO, Miteva MA (2008) MS-DOCK:
Onodera K, Satou K, Hirota H (2007) Evaluations of molecular docking accurate multiple conformation generator and rigid docking protocol
programs for virtual screening. J Chem Inf Model 47:1609–1618. for multi-step virtual ligand screening. BMC Bioinformatics 9:184.
doi:10.1021/ci7000378 doi:10.1186/1471-2105-9-184
Österberg F, Morris GM, Sanner MF, Olson AJ, Goodsell DS (2002) Schapira M, Abagyan R, Totrov M (2003) Nuclear hormone receptor
Automated docking to multiple target structures: incorporation of targeted virtual screening. J Med Chem 46:3045–3059.
protein mobility and structural water heterogeneity in AutoDock. doi:10.1021/jm0300173
Proteins 46:34–40 Schnecke V, Swanson CA, Getzoff ED, Tainer JA, Kuhn LA (1998)
O’Sullivan D, Arrhenius T, Sidney JO et al (1991) On the interaction Screening a peptidyl database for potential ligands to proteins with
of promiscuous antigenic peptides with different DR alleles. side-chain flexibility. Proteins 33:74–87
Identification of common structural motifs. J Immunol 147: Schneidman-Duhovny D, Inbar Y, Nussinov R, Wolfson HJ (2005)
2663–2669 PatchDock and SymmDock: servers for rigid and symmetric docking.
Palma PN, Krippahl L, Wampler JE, Moura JJ (2000) BiGGER: a new Nucleic Acids Res 33:W363–W367. doi:10.1093/nar/gki481
(soft) docking algorithm for predicting protein interactions. Proteins Shin WH, Seok C (2012) GalaxyDock: protein–ligand docking with flex-
39:372–384 ible protein side-chains. J Chem Inf Model 52:3225–3232.
Paul N, Rognan D (2002) ConsDock: a new program for the consensus doi:10.1021/ci300342z
analysis of protein–ligand interactions. Proteins 47:521–533. Shoichet BK, Kuntz ID (1991) Protein docking and complementarity. J
doi:10.1002/prot.10119 Mol Biol 221:327–346
Pellegrini M, Doniach S (1993) Computer simulation of antibody binding Shoichet BK, Stroud RM, Santi DV, Kuntz ID, Perry KM (1993)
specificity. Proteins 15:436–444. doi:10.1002/prot.340150410 Structure-based discovery of inhibitors of thymidylate synthase.
Perola E, Walters WP, Charifson PS (2004) A detailed comparison of Science 259:1445–1450
current docking and scoring methods on systems of pharmaceutical Smith JA, Edwards SJ, Moth CW, Lybrand TP (2013) TagDock: an effi-
relevance. Proteins 56:235–249. doi:10.1002/prot.20088 cient rigid body docking algorithm for oligomeric protein complex
Biophys Rev

model construction and experiment planning. Biochemistry 52: Verlinde CL, Rudenko G, Hol WG (1992) In search of new lead com-
5577–5584. doi:10.1021/bi400158k pounds for trypanosomiasis drug design: a protein structure-based
Terashi G, Takeda-Shitaka M, Kanou K, Iwadate M, Takaya D, linked-fragment approach. J Comput Aided Mol Des 6:131–147
Umeyama H (2007) The SKE-DOCK server and human teams Wang Z, Sun H, Yao X et al (2016) Comprehensive evaluation of ten
based on a combined method of shape complementarity and free docking programs on a diverse set of protein–ligand complexes: the
energy estimation. Proteins 69:866–872. doi:10.1002/prot.21772 prediction accuracy of sampling power and scoring power. Phys
Tøndel K, Anderssen E, Drabløs F (2006) Protein Alpha Shape (PAS) Chem Chem Phys 18:12964–12975. doi:10.1039/c6cp01555g
Dock: a new Gaussian-based score function suitable for docking in Warren GL, Andrews CW, Capelli AM et al (2006) A critical assessment
homology modelled protein structures. J Comput Aided Mol Des of docking programs and scoring functions. J Med Chem 49:5912–
20:131–144. doi:10.1007/s10822-006-9041-7 5931. doi:10.1021/jm050362n
Torchala M, Moal IH, Chaleil RA, Fernandez-Recio J, Bates PA (2013) Wass MN, Fuentes G, Pons C, Pazos F, Valencia A (2011) Towards the
SwarmDock: a server for flexible protein–protein docking. prediction of protein interaction partners using physical docking.
Bioinformatics 29:807–809. doi:10.1093/bioinformatics/btt038 Mol Syst Biol 7:469. doi:10.1038/msb.2011.3
Totrov M, Abagyan R (1994) Detailed ab initio prediction of lysozyme– Wiehe K, Pierce B, Mintseris J et al (2005) ZDOCK and RDOCK per-
antibody complex with 1.6 Å accuracy. Nat Struct Mol Biol 1:259–263 formance in CAPRI rounds 3, 4, and 5. Proteins 60:207–213.
Trott O, Olson AJ (2010) AutoDock Vina: improving the speed and accu- doi:10.1002/prot.20559
racy of docking with a new scoring function, efficient optimization,
Wolfson HJ, Nussinov R (2000) Geometrical docking algorithms: a prac-
and multithreading. J Comput Chem 31:455–461. doi:10.1002
tical approach. Methods Mol Biol 143:377–397. doi:10.1385/1-
/jcc.21334
59259-368-2:377
Venkatachalam CM, Jiang X, Oldfield T, Waldman M (2003) LigandFit:
a novel method for the shape-directed rapid docking of ligands to Yue SY (1990) Distance-constrained molecular docking by simulated
protein active sites. J Mol Graph Model 21:289–307 annealing. Protein Eng 4:177–184
Venkatraman V, Ritchie DW (2012) Flexible protein docking refinement Zhang C, Lai L (2011) SDOCK: a global protein–protein docking pro-
using pose-dependent normal mode analysis. Proteins 80:2262– gram using stepwise force-field potentials. J Comput Chem 32:
2274. doi:10.1002/prot.24115 2598–2612. doi:10.1002/jcc.21839
Venkatraman V, Yang YD, Sael L, Kihara D (2009) Protein–protein Zhao H, Caflisch A (2013) Discovery of ZAP70 inhibitors by high-
docking using region-based 3D Zernike descriptors. BMC throughput docking into a conformation of its kinase domain gen-
Bioinformatics 10:407. doi:10.1186/1471-2105-10-407 erated by molecular dynamics. Bioorg Med Chem Lett 23:5721–
Verkhivker GM, Bouzida D, Gehlhaar DK et al (2000) Deciphering com- 5726. doi:10.1016/j.bmcl.2013.08.009
mon failures in molecular docking of ligand–protein complexes. J Zhao Y, Sanner MF (2007) FLIPDock: docking flexible ligands into
Comput Aided Mol Des 14:731–751 flexible receptors. Proteins 68:726–737. doi:10.1002/prot.21423

You might also like