Professional Documents
Culture Documents
doi: 10.1093/bib/bbaa155
Advance Access Publication Date: 27 July 2020
Problem Solving Protocol
Abstract
Given the scale and rapid spread of the coronavirus disease 2019 (COVID-19) caused by severe acute respiratory syndrome
coronavirus 2 (SARS-CoV-2), there is an urgent need for medicines that can help before vaccines are available. In this study,
we present a viral-associated disease-specific chemogenomics knowledgebase (Virus-CKB) and apply our computational
systems pharmacology-target mapping to rapidly predict the FDA-approved drugs which can quickly progress into clinical
trials to meet the urgent demand of the COVID-19 outbreak. Virus-CKB reuses the underlying platform of our DAKB-GPCRs
but adds new features like multiple-compound support, multi-cavity protein support and customizable symbol display. Our
one-stop computing platform describes the chemical molecules, genes and proteins involved in viral-associated diseases
regulation. To date, Virus-CKB archived 65 antiviral drugs in the market, 107 viral-related targets with 189 available 3D
crystal or cryo-EM structures and 2698 chemical agents reported for these target proteins. Moreover, Virus-CKB is
implemented with web applications for the prediction of the relevant protein targets and analysis and visualization of the
outputs, including HTDocking, TargetHunter, BBB predictor, NGL Viewer, Spider Plot, etc. The Virus-CKB server is accessible
at https://www.cbligand.org/g/virus-ckb.
Key words: virus knowledgebase; COVID-19; systems pharmacology analysis; drug repurposing; drug combination
Zhiwei Feng got his PhD degree in Soochow University in 2013. He is currently an Assistant Professor in the School of Pharmacy, University
of Pittsburgh. His research interests include the development of algorithms/tools/apps for drug discovery, chemogenomics knowledgebase design
and novel tool development for drug design of small molecules and modulators, and big-data or clinical data analysis and pharmacometrics and
systems pharmacology.
Maozi Chen received his master’s degree in 2009 from South China Agricultural University, China. He is currently conducting research work in the School
of Pharmacy, University of Pittsburgh. His research interests are algorithm design and software development.
Tianjian Liang received his bachelor’s degree in 2019 from Jiangnan University, China. He is currently a second year’s master student in the School of
Pharmacy, University of Pittsburgh. His research interests are antibody drug design and systems pharmacology analysis.
Mingzhe Shen received his master’s degree in 2020 from the School of Pharmacy, University of Pittsburgh. His research interests are anti-pain drug design
and computer-aided drug design.
Hui Chen is currently a P2 PharmD student in the School of Pharmacy, University of Pittsburgh. Her research mainly focuses on the construction of viral
database.
Xiang-Qun Xie received his PhD degree in 1993 from the University of Connecticut. He is an Associate Dean of the School of Pharmacy and a Professor
of Pharmaceutical Sciences, and Director/PI of NIH Center of Excellence for Computational Drug Abuse Research and Computational Chemogenomics
Screening Center at the University of Pittsburgh. He serves as a Charter Member of the US FDA Science Advisory Board. He is Director of Pharmacometrics
& System Pharmacology (PSP) Graduate Program for preclinical and clinical education & research. Dr Xie is known for his pioneering research for
the development of renowned ‘Big-Data to Knowledge’ diseases-specific chemogenomics knowledgebases platform implemented with GPU-accelerated
machine-/deep-learning computational TargetHunter system pharmacology for translational medicinal chembiology drug discovery.
Submitted: 23 March 2020; Received (in revised form): 7 May 2020
882
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Virus-CKB 883
small molecule drugs, 1365 approved biologics, 131 nutraceuti- Blood−brain barrier predictor
cals and over 6350 experimental drugs) and retrieved 65 antivi-
Our established [39, 40] blood−brain barrier (BBB) predictor
ral drugs in the market and 2698 chemical agents to be com-
[20–22] was integrated into Virus-CKB, in which BBB predictor
piled into the Virus-CKB platform. Compounds with IC50 lower
will predict whether a query compound can move across the
than 1 μM towards a viral-related target were considered as the
BBB to the central nervous system (CNS).
active ligands, while those larger than 10 μM were regarded
as the inactive ones. The detailed protocol for data mining
and preparation of drugs and chemicals can be found in our
recent publication [24]. HTDocking
Virus-CKB was equipped with HTDocking [18–22], an online
high-throughput molecular docking technique, for the identi-
Database infrastructure
fication of possible interactions between target proteins and the
Users can submit their query compound(s) through JSME user-inputted compound(s). Up to three different conformations
Molecular Editor v2017-03-01 [26]. Virus-CKB was constructed with the highest resolutions for each viral-related protein were
based on our established molecular database prototype cleaned and collected in Virus-CKB, depending on the availability
DAKB-GPCRs [24] (https://www.cbligand.org/dakb-gpcrs/) using of PDB files. Then, HTDocking powered by idock [27] will
the same management systems and servers as described automatically dock each query compound into the previously
previously [24]. defined binding pockets of these different conformations
Virus-CKB 885
and generate docking scores respectively. More detail about other coronavirus into our Virus-CKB, with a total number of 107
HTDocking can be found in our previous publication [24]. related targets. Since eight targets from SARS-CoV-2 including
S-protein (PDB ID: 6M0J, 6LZG), ACE2 (PDB ID: 1R4I), 3CLPro (PDB
ID: 6 LU7), RdRp (PDB ID: 7BV2), NSP16/NSP10 (PDB ID: 6 W61),
TargetHunter NSP3 (PDB ID: 6W6Y), PLP (PDB ID: 6W9C) and NSP15 (PDB ID:
6 W01) are directly involved in the process of COVID-19 infection,
Another powerful web-interfaced chemoinformatics tool
targeting these key proteins or enzymes will allow us to develop
integrated in Virus-CKB is called TargetHunter [23], a target
drug therapies for COVID-19. For example, Gilead Science tracked
identification tool for predicting the potential therapeutic
the responses to Remdesivir intervention therapy for 53 patients
targets of the submitted compounds. Basically, TargetHunter
with COVID-19 and observed a 13% death rate [6], in which
calculates the similarity based on the molecular fingerprints
Remdesivir or GS-5724 that inhibits the RdRp of the Ebola virus
using the Tanimoto coefficients (from 0.0 to 1.0, totally
and Marburg virus has demonstrated in vitro activity against
different to 100% similar) against the prepared active compound
the viral pathogens 2019-nCoV [41]. Moreover, we also collected
dataset of each target protein that is collected from Drugs
other viral-related targets because different antiviral drug(s)
and Chemicals.
Software requirements
Input
The Virus-CKB website is compatible with modern web browsers
Figure 2 shows the user interfaces of Virus-CKB. Users can
(such as Chrome, Firefox, Microsoft Edge and Safari) provided
submit up to five query compounds (Add Molecule button)
JavaScript and cookies are enabled. We recommend the latest
either by drawing its 2D structure with JSME Molecular Editor
release version of these web browsers for better rendering.
or by uploading chemical file (SMILES/SDF) with the [ll1] button.
Meaningful names for the job and each molecule are also
Benchmarks required. Clicking on the Create Job button will initiate a new
task. Afterward a background job worker which periodically
Li and co-workers [27] already compared the scoring functions
monitors the task queue will start to allocate computation
of AutoDock Vina and idock 2.0 on PDBbind v2007 core set
resources and dispatch the computation task.
(N = 195). In terms of both Pearson’s correlation coefficient
(Rp ), Spearman’s correlation coefficient (Rs ) and standard
deviation (SD), AutoDock Vina (Rp = 0.554; Rs = 0.608; SD =
1.98) and idock (Rp = 0.546; Rs = 0.612; SD = 1.99) shared Web interface
almost the same results, which is supported by their same On the menu bar, four buttons (HOME, ALL JOBS and HELP on
scoring function. the left and the iconic user button on the right) are always there
for the user to visit with a single click. Clicking on the HOME
button will bring the user back to the home page (Figure 3) which
Results and discussion displays introductive and technical information about Virus-
Virus-CKB server CKB. Clicking on the ALL JOBS button will lead the user to the job
listing page (Figure 4) where one can see all the jobs submitted
The Virus-CKB server is built on ASP.NET Core 3.1 (a high- to the server, though the ligand information may be hidden from
performance MVC web framework) and is deployed on a Red the listing if the creator elected to set the submitting job as
Hat Enterprise Linux (RHEL) 7.4 server of an Intel® Xeon® E5– private. Clicking on the HELP button will navigate the user to the
2690 CPU and 8GB memory. Virus-CKB shares the exact same tutorial (Figure S1) where one can learn how to use the website
codebase and infrastructure with our pre-existing knowledge- step by step. The user button located on the right of the menu
base DAKB-GPCRs [24] (https://www.cbligand.org/dakb-gpcrs/). bar is for those who want to keep their submitted jobs invisible
In particular, the Apache server is used as the reverse proxy from the public. Clicking on that iconic button will ask the user
of the Kestrel server of ASP.NET Core; Entity Framework Core to sign in with a third-party open authentication provider if the
(an ORM framework) is used for data access; SignalR over Web- user has not done so.
Socket is used for real-time browser–server communication;
SQLite (an in-process DBMS) is used for data persistence; FFm-
peg is used for generating the images of the protein 3D struc-
tures; and Hangfire is used for background job dispatching
Job listing
and management. The web-based user interface of Virus-CKB can be accessed
In the current version of Virus-CKB, we integrated 6 HIV- at https://www.cbligand.org/g/virus-ckb of which the basic
related targets, 7 BCV/HCV-related targets, 49 influenza-related guidance can be found in the Supplementary Information. All
targets, and 39 coronavirus-related targets that included 6 tar- the jobs submitted to Virus-CKB are listed on the ALL JOBS page
gets from SARS, 8 targets from SARS-CoV-2 and 31 targets from (https://www.cbligand.org/g/virus-ckb/job/list). As shown in the
886 Feng et al.
tabular view of Figure 4, the user can identify a job with its page of the blood–brain barrier predictor (Figure 5A) which dis-
name which is given by the job owner at creation. Provided plays the BBB predictions of eight algorithm–fingerprint com-
necessary permission is met, clicking on the job name will binations for each submitted compound. Clicking on the OUT-
bring the user to the detailed information of the job (Figure S2). PUT button will lead the user to the output page of the com-
The Ligands column displays all compounds submitted by putation (Figure 5B) which during job running will automati-
the owner and involved in the computation against all the cally update with the most recent computed results in a real-
Virus-CKB targets. The remaining columns provide the user time manner. On the output page, each block includes docking
additional information including the date and time when the scores by HTDocking computed against the conformations of
job is created and last updated, the computation progress protein target and a similarity score by TargetHunter. Clicking
and the job status (Created/Initializing/Running/Finished). on the SPIDER PLOT button will show the user an interactive
A job appears on the listing immediately after successful vector-based graphic which visually demonstrates the predicted
submission. All jobs are listed from the newest to the earliest, interaction network between the submitted compounds and
while unfinished jobs are listed at the top. Even older jobs the Virus-CKB targets (see below). Both the text-based and the
will be listed by clicking on the paging buttons on the bottom graphic-based output pages have a switch for customizing the
right corner. display name of the targets. Simply click on the ‘Gene/Pro-
tein’ button group on the top right corner below the menu bar
to do so.
The output page has two view types, namely, the tiled view
Job output
(Figure 5B) and the tabular view (Figure S3). In the tiled view,
As shown in Figure 5, four buttons are inserted between the ALL the results are arranged in cards, one for each compound–
JOBS and the HELP buttons on the menu bar when a specific protein pair. Static 3D structures of each Virus-CKB target ren-
job is selected and accessed from the job listing page (Figure 4). dered in rainbow spectrum, docking scores using the HTDock-
Clicking on the BBB button will navigate the user to the output ing technique and the most similar active ChEMBL compound
Virus-CKB 887
Figure 3. The Virus-CKB home page. Short introduction to the goal of virus chemogenomics knowledgebase and its underlying technology and architecture. Two figures
and one movie in this page are used to illustrate the overall structure of SARS-CoV-2.
with its similarity score using the TargetHunter technique are and interacting with the 3D protein structures (Figure 6A) via
shown in one place for the users’ convenience. The tabular the button, downloading resulting docked conformations of the
view (Figure S3) has similar content, but no 3D structure pre- compounds via the button and inspecting detailed information
view is generated. Tighter layout allows more information to on the best matched ChEMBL compounds (Figure 6B) via the
display in a single row, and the tabular form aligns data in button. To toggle between two views, one can click on the iconic
the same field and enables multi-ordering on the columns. button group next to the ‘Gene/Protein’ button group on the top
Both views provide the functionality of inspecting the target right corner.
888 Feng et al.
Figure 5. The output results of BBB predictor, HTDocking and TargetHunter in Virus-CKB. (A) Results from the BBB predictor. (B) Results from HTDocking and
TargetHunter in Tiled view manner. The displayed 3D structures are the first conformations of the targets if more than one exists. Clicking on the static 3D structures
rendering will display the interactive one.
with green solid lines, which is consistent with the fact that and the docking scores on the interaction lines can be achieved
oseltamivir shows high activity at its known target NA_INFB. In with the help of keyboard shortcuts.
addition, clicking on any node will show the user a toolbox on Spider Plot supports any number of submitted compounds
the left of the canvas providing a quick way for customizing the (though we currently have a limitation of up to five submitted
styling of the graphic. Additional customization like rearranging compounds on job creation) and arranges the nodes optimally.
all the nodes and toggling the display of molecule depictions It first scans all compounds and groups them based on the
890 Feng et al.
Figure 6. Information of a selected target and its best match compound in the dataset. (A) Popup window presents additional resources for a protein target. (B) Popup
window showing the comparison between the query compound and the most similar compound in the dataset.
information of interactions with the disjoint-set data structure. Proposed drug repurposing for 3CLpro and drug
Compounds that share no common targets with other com- combination for COVID-19 by Virus-CKB
pounds are placed alone on a side of the canvas. The remaining
compounds are sorted so that the number of crossing interaction To further validate our Virus-CKB, we predicted potential FDA-
lines and overlapped nodes are minimized. To achieve the best approved antiviral drugs as well as the nonviral ones that may
bind to 3CLpro .
visual effect and readability, Spider Plot also tentatively applies
the layout strategy by simulating the layout of other parts in the First, our results show that several antiviral drugs that
graphic. included saquinavir (anti-HIV drug), bictegravir (anti-HIV drug)
Virus-CKB 891
and paritaprevir (anti-HCV drug) are predicted to bind to 3CLpro Besides repurposing the FDA-approved drugs, our research
(3C-like cysteine protease) of SARS-CoV-2, as shown in Figure 7. has shown that combinations of different medications may have
Second, we also conducted the predictions for several FDA- a synergic effect on treating COVID-19, as shown in Figure 9.
approved nonviral drugs, as shown in Figure 8. Our results Taking remdesivir and chloroquine as an example, we found
showed that Digoxin (anti-cardiovascular), eltrombopag (for that one green target node RDRP_SARS2 (RNA-dependent RNA
thrombocytopenia) and proscillaridin (anti-cardiovascular) were polymerase) connects to remdesivir with green solid lines
predicted to bind to 3CLpro (Node: PR_SARS2) of SARS-CoV-2, and another green target node ACE2 (angiotensin-converting
which our findings is supported by a recent study, in which enzyme 2) connects to chloroquine with green solid lines,
digoxin, eltrombopag and proscillaridin exhibited antiviral indicating the algorithms in our platform are reliable. Since ACE2
efficacy (0.1 μM < IC50 < 10 μM) against SARS-CoV-2 [42]. All involves in SARS-CoV-2 invasion and RdRp is responsible for the
these findings may lead to rapid identification of FDA-approved virus replication, we suggested the combination of remdesivir
drugs which can quickly progress into clinical trials to meet the and chloroquine may exert the synergistic effect for the
urgent demand of the 2019-nCoV outbreak. treatment of SARS-CoV-2 infection. Of course, in silico identified
892 Feng et al.
drug repurposing and drug combination will require the in vitro Mapping can facilitate rapid drug development to meet the
target validation experiments as well as human clinical studies, urgent demand of the COVID-19 outbreak. We will integrate the
which are currently underway via collaborations. information of gene signatures and signalling pathways of target
proteins as well as our newly developed algorithms or tools into
our Virus-CKB server in the future. Moreover, we will include
Conclusion
or develop more techniques (such as NLP) into our database
In this study, we provided a platform of national viral-associated to improve the accuracy and user interface. Recently, adverse
computing resources and tools to help researchers in a broad events were reported from the clinical data of remdesivir [43],
range of disciplines and to help advance our knowledge about which require the further improvement of drug properties. We
potential drug repurposing and drug combinations. This work will integrate tools or algorithms for further optimization of
demonstrates that the use of a knowledgebase and CSP-Target drug-like properties of the compound or lead in the future.
Virus-CKB 893
Supplementary data 11. Lee H, Ahn S, Ann J, et al. Discovery of dual-acting opioid
ligand and TRPV1 antagonists as novel therapeutic agents
Supplementary data are available online at https://academic.
for pain. Eur J Med Chem 2019;182:111634.
oup.com/bib.
12. Pang M-H, Kim Y, Jung KW, et al. A series of case
studies: practical methodology for identifying antinoci-
ceptive multi-target drugs. Drug Discov Today 2012;17:
Availability 425–34.
13. Lan J, Ge J, Yu J, et al. Structure of the SARS-CoV-2 spike
The Virus-CKB server is accessible at https://www.cbligand. receptor-binding domain bound to the ACE2 receptor. Nature
org/g/virus-ckb. 2020;581:215–20.
14. Jin Z, Du X, Xu Y, et al. Structure of M(pro) from COVID-19
virus and discovery of its inhibitors. Nature 2020;582:289–93.
15. Gao Y, Yan L, Huang Y, et al. Structure of the RNA-
Authors’ contributions dependent RNA polymerase from COVID-19 virus. Science
enabling research and education in fundamental biology, 37. Gaulton A, Bellis LJ, Bento AP, et al. ChEMBL: a large-scale
biomedicine, biotechnology and energy. Nucleic Acids Res bioactivity database for drug discovery. Nucleic Acids Res
2019;47:D464–74. 2012;40:D1100–7.
31. Li YH, Yu CY, Li XX, et al. Therapeutic target database 38. Wishart DS, Feunang YD, Guo AC, et al. DrugBank 5.0: a major
update 2018: enriched resource for facilitating bench-to- update to the DrugBank database for 2018. Nucleic Acids Res
clinic research of targeted therapeutics. Nucleic Acids Res 2018;46:D1074–82.
2018;46:D1121–7. 39. Ma C, Wang L, Xie X-Q. Ligand classifier of adaptively boost-
32. Wang Y, Zhang S, Li F, et al. Therapeutic target database ing ensemble decision stumps (LiCABEDS) and its applica-
2020: enriched resource for facilitating research and early tion on modeling ligand functionality for 5HT-subtype GPCR
development of targeted therapeutics. Nucleic Acids Res families. J Chem Inf Model 2011;51:521–31.
2020;48:D1031–d1041. 40. Ma C, Wang L, Yang P, et al. LiCABEDS II. Modeling of ligand
33. Zhu F, Shi Z, Qin C, et al. Therapeutic target database update selectivity for G-protein-coupled cannabinoid receptors. J
2012: a resource for facilitating target-oriented drug discov- Chem Inf Model 2013;53:11–26.
ery. Nucleic Acids Res 2012;40:D1128–36. 41. Wang M, Cao R, Zhang L, et al. Remdesivir and chloroquine