Professional Documents
Culture Documents
flippocastelli/KEGGutils
WHAT IS
KEGGUTILS ?
AND HOW CAN WE USE IT
2
LET’S SUPPOSE WE WANT TO
WORK WITH DATA COMING
FROM KEGG
3
KEGG’S API EXPOSES A FEW COMMANDS VIA GET REQUESTS
▸ RAW TEXT
▸ IMAGES (.GIF OR .PNG)
▸ XML FILES
▸ CONFIGURATION FILES
▸ JSON FILES
5
MANUALLY HANDLING HTTP REQUESTS AND RESPONSES
SIGNIFICANTLY SLOWS DOWN EVERY KIND OF WORKFLOW AS
ONE WOULD HAVE TO:
7
HOW DO WE INSTALL IT?
8
ALTERNATIVELY
9
IN THE REPO YOU WILL FIND
10
A QUICK
OVERVIEW OF PART 1:
KEGG REST API
SOME FEATURES INTERFACE
11
LET’S START SIMPLE: HOW TO GET INFOS ON A DATABASE
In[2]: kg.__version___
Out[2]: '0.3.2'
In [3]: kg.keggapi_info("hsa")
INFO:root:Infos on hsa from KEGG:
In[1]: kg.keggapi_get("hsa:10458")
Infos on hsa:10458 from KEGG:
In[2]: list(zip(sn,tn))[:5]
[('hsa:9344', 'ec:2.7.11.1'),
('hsa:5894', 'ec:2.7.11.1'),
('hsa:673', 'ec:2.7.11.1'),
('hsa:5607', 'ec:2.7.12.2'),
('hsa:5598', 'ec:2.7.11.24')]
15
KEGGUTILS PROVIDES CLASSES TO FOR KEGG DATA STORAGE
AND USAGE
In[2]: dis_gene.linked_nodes["ds:H00773"][:5]
['hsa:55777',
'hsa:81704',
'hsa:1013',
'hsa:84623',
'hsa:8831']
18
WE CAN CHAIN MULTIPLE KEGGLINKGRAPHS TO SEARCH
FOR NONTRIVIAL LINKS
In[1]: mychain.directed_propagation(["ds:H00773"])
19
YOU CAN DOWNLOAD, PARSE AND EXPLORE AN ENTIRE PATHWAY
JUST BY SPECIFIYNG IT'S KEGG IDENTIFIER
{'reference_hook': 'PMID:12878745',
'authors': 'Nelson WG, De Marzo AM, Isaacs WB.',
'title': 'Prostate cancer.',
'journal': 'DOI:10.1056/NEJMra021562'}
20
YOU CAN FIND ALL OF THIS AND MUCH MORE ON KEGGUTIL'S
TUTORIALS:
https://github.com/filippocastelli/KEGGutils/tree/master/tutorials
https://github.com/filippocastelli/KEGGutils/
21