Perera
Abstract
Traditional computer-based simulators for manual molecular docking for rational drug discovery
have been very time consuming. In this research, a multi agent-based solution, named as
NanoAgent, has been developed to automate the drug discovery process with little human
intervention. In this solution, ligands and proteins are implemented as agents who pose the
knowledge of permitted connections with other agents to form new molecules. The system also
includes several other agents for surface determination, cavity finding and energy calculation.
These agents autonomously activate and communicate with each other to come up with a most
probable structure over the ligands and proteins, which are participating in deliberation. Domain
ontology is maintained to store the common knowledge of molecular bindings, whereas specific
rules pertaining to the behaviour of ligands and proteins are stored in their personal ontologies.
Existing, Protein Data Bank (PDB) has also been used to calculate the space required by ligand
to bond with the receptor. The drug discovery process of NanoAgent has exemplified exciting
features of multi agent technology, including communication, coordination, negotiation, butterfly
effect, self-organizing and emergent behaviour. Since agents consume fewer computing
resources, NanoAgent has recorded optimal performance during the drug discovery process.
NanoAgent has been tested for the discovery of the known drugs for the known protein targets. It
has 80% accuracy by considering the prediction of the correct actual existence of the docked
molecules using energy calculations. By comparing the time taken for the manual docking
process with the time taken for the molecular docking by NanoAgent, there has been 95%
efficiency.
1. INTRODUCTION
Molecular docking is a well-recognized branch of research in drug discovery under the field of
structural molecular biology. This area is primarily concerned with binding a smaller molecules
(ligand) with a target macromolecule (protein). Over many decades, the molecular binding
process has been manually modeled by scientist with reference to huge databases of protein and
ligand [1]. Nevertheless, the manual molecular docking has been a very time consuming and
tedious task. In 1990s, some computer-based solutions for molecular docking have begun to
emerge [2]. However, these solutions have appeared as semi-automated approaches to
molecular docking, and expected the scientists to use many additional software tools to complete
the docking process. As such, automated and integrated software solutions for molecular docking
International Journal of Artificial Intelligence and Expert Systems (IJAE), Volume (7) : Issue (3) : 2016 37
M.Harindra R. Fernando, Asoka S. Karunananda & Roshan P. Perera
have become a research challenge. This paper present our Multi-Agent Systems based software
solution, NanoAgent, for integrated and automated molecular docking. The NanoAgent provides
automated all-in-one molecular docking system with the facility to energy minimization, active site
prediction, cavity discovery on the target, and orientation calculations. According to experimental
results, NanoAgent shows 80% accuracy and 95% efficiency.
The rest of the research paper is organized as follows: Section II critically review the domain of
molecular docking by highlighting current solutions, practices, technologies, limitations for
defining the research problem. Section III and Section IV describes the essentials of Multi-Agent
Technology showing its relevance to solve the molecular docking domain with the aid of
Ontology. From the Section V to Section VII, this paper presents our novel approach to molecular
docking with Multi-Agent Technology. Section VIII reports on evaluation of the novel solution by
explaining evaluation strategy. Section IX concludes the outcome of the research with a note on
further work.
The field of molecular docking has appeared during the last decades, driven by the needs of
structural molecular biology and structure-based drug discovery [6] using the computer. In
computer aided drug design and structural molecular biology heavily uses the molecular docking
software tools. In manual molecular docking, scientist needs to get the assistance from the
various combinations of separate set of computer-aided tools to generate molecular data
including geometries, energies and properties. Manual molecular docking is tedious and time-
consuming process due to these reasons.
DOCK 1.0 was the first automated receptor-ligand docking program. It was purposed in 1982 by
Irwin Kuntz in The Department of Pharmacology at The University of California at San Francisco
[7]. It is used to achieve docking which is the process that two molecules fit together in 3D space.
It has the limitations in terms of docking accuracy and time consumption [8]. Molecular docking
program, AutoDock [9] was initially written in FORTRAN-77 in 1990 by David S. Goodsell [10] in
Arthur J. Olson's laboratory with the assistance of the concepts from DOCK 1.0. It was proposed
to achieve automated docking of ligands, which are the small molecules like a candidate drug to
their macromolecular targets such as proteins. AutoDock, which is the most famous commercial
molecular docking tool [11] in the current computer aided docking context. But it doesnt have
efficient search methods and accurate empirical free energy scoring functions. It also need the
assistance from other external tools such as MGLTools [12] and FastGrid [13] to perform some
calculations.
The college of Pharmacy in Rosland Franklin University of Medicine and Science in UK [14]
which is very famous for the advance chemistry related researches, is relied heavily on
computational molecular docking tools. Nevertheless, according to them it is not much easy to
use the current available molecular docking tools [15], because they have to access multiple
numbers of tools to perform various steps of the docking operations, to the experiment only one
International Journal of Artificial Intelligence and Expert Systems (IJAE), Volume (7) : Issue (3) : 2016 38
M.Harindra R. Fernando, Asoka S. Karunananda & Roshan P. Perera
single molecular docking process. Scientists from Rosland Franklin University, also mentioned
that the regularly, 3D structure data are available for the shape of a protein and a drug
independently, but not for the two together [16]. So they have to use-docking tools to visualize the
docked molecular structures but they are not efficient and accurate enough [17].
Pattern recognition, machine learning and other artificial intelligence approaches also play a
significant role for the screening and identification of candidate molecules [18]. Artificial Neural
Network, Fuzzy logic and Genetic Algorithms has been used by most of the drug design in-silico,
but it takes considerable amount of time to find the correct orientations of the molecules. And also
they need more powerful computer processors to the calculations. In the present most of the
molecular docking systems are rather semi-automated, and do not provide integrated solutions.
This paper presents our fully automated and integrated molecular docking solution, which exploits
the power of Multi Agent Systems and Ontological engineering in modern computing.
International Journal of Artificial Intelligence and Expert Systems (IJAE), Volume (7) : Issue (3) : 2016 39
M.Harindra R. Fernando, Asoka S. Karunananda & Roshan P. Perera
Collaboration - Agents communicates and works supportively with other agents to form
multi - agent systems working together on some task
Knowledgeable - An agent is capable of reasoning about its goals and knowledge
Mobility - The ability for an agent to move from one executing context to another
Agent systems are plugged with the domain ontologies. It can be used as the operating rule set
[32] of the agents. It is important to exchange and sharing knowledge in multi-agent systems [33]
and ontologies are the best way to achieve it.
International Journal of Artificial Intelligence and Expert Systems (IJAE), Volume (7) : Issue (3) : 2016 40
M.Harindra R. Fernando, Asoka S. Karunananda & Roshan P. Perera
interacting ions).
VI. Detached the minimized complex in protein (lock) and ligand (key).
VII. Prepare the docking appropriate files for protein and ligand (pdb files).
VIII. Evaluate the docking results
Equation (1) is used to calculate Electrostatic Potential Energy [38] . It is Electrostatic Potential
Energy a pair wise total of columbic interactions as described in equilibrium.
qi qk
Eelectrostatic = (1)
pairsnonbonded Drik
Equation (2) is to calculate Vander Waals Potential Energy. For general behavior of non-bonded
interactions is regularly modeled by the following equation.
Aik Cik
EVanderWaals = 12 6 (2)
pairsnonbonded rik rik
Vander Waals potential which expresses the interaction energy using the atom-type dependent
constants A and C. Values of A and C may be determined by a variety of methods, like non-
bonding distances in crystals and gas-phase scattering measurements.
International Journal of Artificial Intelligence and Expert Systems (IJAE), Volume (7) : Issue (3) : 2016 41
M.Harindra R. Fernando, Asoka S. Karunananda & Roshan P. Perera
Equation (4) is to find the bonded energy. The Ebonded term is a sum of three terms as follows:
4.4.1 Stretching
Equation (5) is to find the bond energy for stretching. This is the approximation of bond energy
as a function of b0 and Kb, determines the strength of the bond.
4.4.2 Bending
Bond energy for bending can be calculated using Equation (6).
0 - harmonic potential
K - Based on chemical type of atoms constituting the angle
K ( )
2
Estreching = 0
(6)
angles
4.4.3 Rotating
Equation (7) can be used to calculate bond energy for rotating.
International Journal of Artificial Intelligence and Expert Systems (IJAE), Volume (7) : Issue (3) : 2016 42
M.Harindra R. Fernando, Asoka S. Karunananda & Roshan P. Perera
An orally active drug has no more than one disruption of the following:
I. Should not exceed 5 hydrogen bond donors
a. The total number of nitrogenhydrogen and oxygenhydrogen bonds
II. Should not exceed 10 hydrogen bond acceptors
a. All nitrogen or oxygen atoms
III. A molecular mass should less than 500 Daltons
According to this rule had chosen the appropriate ligands for the docking.
(8)
5.1 Inputs
Multi-Agent system for Molecular Docking has been design to accept multiple inputs coming from
different entities of the Molecular Docking process. The following are the main inputs to the
system. TABLE 1 shows the inputs from corresponding entities.
International Journal of Artificial Intelligence and Expert Systems (IJAE), Volume (7) : Issue (3) : 2016 43
M.Harindra R. Fernando, Asoka S. Karunananda & Roshan P. Perera
5.2 Outputs
The output of the system will be the docked molecule. These outputs are coming as the rational
drug with its PDB file structure.
As the next step, add the missing hydrogen/side chain atoms and minimized the complex. Then
clean the minimized complex (delete all the water and the solvent molecules and all non-
interacting ions). Separate the minimized complex in macromolecule (lock) and ligand (key)
should be done. Then prepare the docking suitable files for LOCK and KEY (pdb files). Then
perform the docking and analyze the docking results. There are separate ontologies to store the
atom colors and charges for the atoms/elements. Agents get the help from these ontologies to
find the best ligand agent to bind with the protein agent.
International Journal of Artificial Intelligence and Expert Systems (IJAE), Volume (7) : Issue (3) : 2016 44
M.Harindra R. Fernando, Asoka S. Karunananda & Roshan P. Perera
Usually it starts with a problem description and ending with a solution. Each knowledge source
revises the blackboard with a partial solution when its internal restrictions match the blackboard
state and the specialists work together to solve the problem. The blackboard model was originally
designed as a way to handle complex, ill-defined problems, where the solution is the sum of its
parts. Molecular docking can be consider as a complex problem which the solution is the sum of
different operations, such as energy calculations, active site finding, surface matching and cavity
finding.
FIGURE 6: DOCK program algorithm: Match the spheres shown in gray (distances between them are used
for scoring).
International Journal of Artificial Intelligence and Expert Systems (IJAE), Volume (7) : Issue (3) : 2016 45
M.Harindra R. Fernando, Asoka S. Karunananda & Roshan P. Perera
8. EVALUATION
8.1. Evaluation Methods
For the evaluation of the tool, compared known ligand-protein pair docking time taken in the usual
manner with this software tool. Also analyzed the time difference for measuring the efficiency of
the tool. The accuracy of the tool had been measured by the energy calculations. It will help to
find the actual existence of the ligand-protein pairs in the environment found by the docking tool.
Some of the following ligand and pairs found from the reference [53] had been used to the
evaluation of the docking tool as the sample population.
Protein Ligand
1A52 OHT
1A52 EST
2AM9 B5R
1A52 GEN
1EWV KAI
4PVU BRL
4P6W DEX
TABLE 2: Known Protein-Ligand Pairs.
International Journal of Artificial Intelligence and Expert Systems (IJAE), Volume (7) : Issue (3) : 2016 46
M.Harindra R. Fernando, Asoka S. Karunananda & Roshan P. Perera
To compare the results, used that time taken for the manual molecular docking for each protein
and ligand pairs. The next step was tracking the time taken for each docking process for the
same protein-ligand pairs taken by the developed tool. To evaluate the accuracy of the software
tool, also noted that the prediction of the actual existence of each docked pair using the tools
energy calculation module.
Criteria 01:
The time taken for the molecular docking tool and the manual molecular docking
Criteria 02:
The prediction of the actual existence of the docked molecule
Probability of success (time taken by the software is lesser than the actual manual molecular
docking) = 19/20 = 0.95
Probability of failure (time taken by the software is higher than the actual manual molecular
docking) = 1/20 = 0.05
Probability of success (software predicts the actual existence of the docked molecule correctly) =
16/20 = 0.8
Probability of failure (software not predicts the actual existence of the docked molecule correctly)
= 4/20 = 0.2
Following FIGURE 8 shows the main functions of the molecular docking tool and the interfaces.
Considering the main features of the software it is an all-in-one solution for the molecular docking
comparing other popular tool AutoDock.
International Journal of Artificial Intelligence and Expert Systems (IJAE), Volume (7) : Issue (3) : 2016 47
M.Harindra R. Fernando, Asoka S. Karunananda & Roshan P. Perera
Following FIGURE 9 shows the ligand and the receptor on the same window.
I have executed the molecular simulation 20 times for the known ligand-protein pairs and
compared it with the actual time taken for the manual molecular docking. The experiment results
derived based on two scenarios.
Time for the docking: By comparing the time taken for the manual docking process with the time
taken for the automated molecular docking, it says the software tool can obtain 95% accuracy
Accuracy: It has 80% accuracy by considering the prediction of the correct actual existence of the
docked molecules according to the energy calculations.
So it is evident that the software can manage a good balance in between the efficiency and the
accuracy.
As the future work we can improve the tool using Artificial Neural Network by training the system.
It will again enhance the efficiency of the tool by reducing the time taken for the docking process.
Currently in this research, it was focused on the ligand-protein docking only, but it can be
extended to the protein-protein docking as well. This tool can be used by the researches in
Biological Chemistry domain. Hope to get their feed back to improve the accuracy of the software
further.
10. REFERENCES
[1] D. T.-H. Chang, C.-H. Ke, J.-H. Lin, and J.-H. Chiang, AutoBind: automatic extraction of
proteinligand-binding affinity data from biological literature, Bioinformatics, vol. 28, no. 16,
pp. 21622168, Aug. 2012.
[2] M. Munetomo, K. Akama, and H. Maeda, An automated ligand evolution system using
Bayesian optimization algorithm, WSEAS Trans. Inf. Sci. Appl., vol. 6, no. 5, pp. 788797,
2009.
International Journal of Artificial Intelligence and Expert Systems (IJAE), Volume (7) : Issue (3) : 2016 48
M.Harindra R. Fernando, Asoka S. Karunananda & Roshan P. Perera
[3] S. S. Azam and S. W. Abbasi, Molecular docking studies for the identification of novel
melatoninergic inhibitors for acetylserotonin-O-methyltransferase using different docking
routines, Theor. Biol. Med. Model., vol. 10, p. 63, Oct. 2013.
[5] M. A. D. JE, Virtual screening and lead optimisation to identify novel inhibitors for HDAC-8,
ArXiv Prepr. ArXiv12092793, 2012.
[11] M. Lape, C. Elam, and S. Paula, Comparison of current docking tools for the simulation of
inhibitor binding by the transmembrane domain of the sarco/endoplasmic reticulum calcium
ATPase, Biophys. Chem., vol. 150, no. 13, pp. 8897, Aug. 2010.
[13] M. Olk, J. Filipovi, and M. Prokop, FastGridThe Accelerated AutoGrid Potential Maps
Generation for Molecular Docking, Comput. Inform., vol. 29, no. 6+, pp. 13251336, 2012.
[14] Laboratory for Molecular Design and Drug Discovery. [Online]. Available:
http://www.rosalindfranklin.edu/cop/Home/LaboratoryforMolecularDesignandDrugDiscovery.
aspx. [Accessed: 25-Sep-2014].
[15] C. Ma, R. Kotaria, J. A. Mayor, S. Remani, D. E. Walters, and R. S. Kaplan, The Yeast
Mitochondrial Citrate Transport Protein CHARACTERIZATION OF TRANSMEMBRANE
DOMAIN III RESIDUE INVOLVEMENT IN SUBSTRATE TRANSLOCATION, J. Biol.
Chem., vol. 280, no. 3, pp. 23312340, Jan. 2005.
[16] C. A. Nicolaou and N. Brown, Multi-objective optimization methods in drug design, Drug
Discov. Today Technol., vol. 10, no. 3, pp. e427e435, Sep. 2013.
[17] P. H. Palestro, L. Gavernet, G. L. Estiu, and L. E. Bruno Blanch, Docking Applied to the
Prediction of the Affinity of Compounds to P-Glycoprotein, BioMed Res. Int., vol. 2014, p.
e358425, May 2014.
[18] W. Duch, K. Swaminathan, and J. Meller, Artificial intelligence approaches for rational drug
design and discovery, Curr. Pharm. Des., vol. 13, no. 14, pp. 14971508, 2007.
International Journal of Artificial Intelligence and Expert Systems (IJAE), Volume (7) : Issue (3) : 2016 49
M.Harindra R. Fernando, Asoka S. Karunananda & Roshan P. Perera
[19] E. Yu, Agent-oriented modelling: software versus the world, in Agent-Oriented Software
Engineering II, Springer, 2002, pp. 206225.
[21] B. Chandrasekaran, J. R. Josephson, and V. R. Benjamins, What are ontologies, and why
do we need them?, IEEE Intell. Syst., vol. 14, no. 1, pp. 2026, 1999.
[22] A. H. Andr Filipe de Moraes Batista, Emerson Aguiar Noronha, Fbio Arago da Silva,
Maria das Graas Bruno Marietto, Robson dos Santos Frana Terry Lima Ruas, Modeling
Artificial Life Through Multi-Agent Based Simulation. INTECH Open Access Publisher, 2011.
[25] D. J. Cook, Multi-agent smart environments, J. Ambient Intell. Smart Environ., vol. 1, no. 1,
pp. 5155, 2009.
[26] Q. Guo and M. Zhang, Research on intelligent manufacturing system based on multi-
agent, in Intelligent Robotics and Applications, Springer, 2008, pp. 829838.
[27] Z. Li, C. H. Sim, and M. Y. H. Low, A survey of emergent behavior and its impacts in agent-
based systems, in Industrial Informatics, 2006 IEEE International Conference on, 2006, pp.
12951300.
[31] M. N. Huhns and M. P. Singh, Ontologies for agents, Internet Comput. IEEE, vol. 1, no. 6,
pp. 8183, 1997.
[32] F. Zini and L. Sterling, (1) DISI-Universit a di Genova via Dodecaneso 35, 16146 Genova,
Italy zini@ disi. unige. it (2) Department of Computer Science & Software Engineering.
[33] P. Maret, J. Calmet, and others, Agent-based knowledge communities, Int. J. Comput. Sci.
Appl., vol. 6, no. 2, pp. 118, 2009.
[34] Molecular_Docking_Tutorial.pdf. .
[35] Machado_et_al_LNCS_4643_1-11_2007.pdf. .
International Journal of Artificial Intelligence and Expert Systems (IJAE), Volume (7) : Issue (3) : 2016 50
M.Harindra R. Fernando, Asoka S. Karunananda & Roshan P. Perera
[39] P. Leeson, Drug discovery: Chemical beauty contest, Nature, vol. 481, no. 7382, pp. 455
456, Jan. 2012.
[40] J. W. Ponder and D. A. Case, Force fields for protein simulations, Adv. Protein Chem., vol.
66, pp. 2785, 2003.
[41] Jade Site | Java Agent DEvelopment Framework. [Online]. Available: http://jade.tilab.com/.
[Accessed: 17-Sep-2014].
[45] Y. Yan, C. Wang, A. Zhou, W. Qian, L. Ma, and Y. Pan, Efficiently querying rdf data in triple
stores, in Proceedings of the 17th international conference on World Wide Web, 2008, pp.
10531054.
[48] Jmol: an open-source Java viewer for chemical structures in 3D. [Online]. Available:
http://jmol.sourceforge.net/. [Accessed: 17-Sep-2014].
[49] D. Rudenko and A. Borisov, An overview of blackboard architecture application for real
tasks, in Scientific Proceedings Of Riga Technical University, Ser, 2007, vol. 5, pp. 5057.
[50] T. J. Ewing and I. D. Kuntz, Critical evaluation of search algorithms for automated
molecular docking and database screening, J. Comput. Chem., vol. 18, no. 9, pp. 1175
1189, 1997.
International Journal of Artificial Intelligence and Expert Systems (IJAE), Volume (7) : Issue (3) : 2016 51