
Application of IT in Healthcare: A Systematic Review

Rashmeet Toor, Thapar University, India, rashmeett@gmail.com
Inderveer Chana, Thapar University, India, inderveer@thapar.edu

ABSTRACT
The key mission of the healthcare industry is improving lives through better healthcare solutions. Technical innovations in the last decade have led to solutions that are safe, cost effective, high-quality and easily accessible. A wide variety of computational techniques, storage techniques, software and tools are already shaping the future of healthcare. In this paper we have systematically reviewed the emerging trends of Information Technology (IT) in healthcare. Further, this paper elaborates on the impact of healthcare data, technological transformations and tools which will eventually merge and culminate into user-centric healthcare in the near future. A total of 108 papers were analyzed, out of which 40 papers were identified to be relevant, and we further classified 19 papers into four broad categories according to the technologies used. This paper also reveals issues in the current approaches and suggests possible future outcomes which will help researchers to gain ideas for further research.

Categories and Subject Descriptors
J.3 [Computer Applications]: Life and Medical Sciences – health, medical information systems

General Terms
Management, Performance, Human factors, Algorithms

Keywords
Healthcare, Data Mining, Network Analysis, Cloud-based Services, Text Mining

1. INTRODUCTION
Healthcare is a vast domain that essentially comprises hospitals, clinical trials, telemedicine, pharmaceuticals and medical equipment. The main motive of the healthcare industry is to improve the lives of individuals and make the world a healthier place. In 1995, the Institute of Medicine (IOM) reported that about 98,000 people die in hospitals every year due to medical errors and about $29 billion is spent every year in fixing these [36]. These errors were attributed to the decentralized and fragmented nature of information related to patients, clinical pathways, drugs and medical procedures. IOM also reported that about three out of four errors could have been eliminated if better information systems had been used to make the required information readily available. Information technology has been a boon to the healthcare industry. The advent of the Internet and better techniques for storage of data revolutionized healthcare. Hence, an efficient IT backbone has helped remove errors in various kinds of applications, leading to better treatments for patients. Accurate, cost effective and timely solutions can now be provided with the use of technology. These innovations are an aid not only to the patients, but also to the members of their family, doctors, pharmacists, hospitals and biological or bioinformatics researchers.

2. BACKGROUND
In the early 19th century, medical information of patients was limited to papers and handwritten notes. Due to this, the medical history of a patient turned out to be fragmented and arduous to access. Since Information Technology had not intersected the healthcare domain, data from varied medical tasks required physical storage. The idea of Electronic Health Records (EHR) developed around the 1960s. These records serve as an electronic repository of valuable data related to a patient's medical history, drug information, prescriptions, diagnostic tests, images, bookings and billing information. This eliminates errors due to handwriting and promotes data transparency. Data of visits is centralized at one place and acts as a reference for doctors, pharmacists and patients. With the technological changes, the EHR also underwent parallel modifications. EHRs encouraged accurate and efficient information exchange across geographically dispersed organizations like laboratories, hospitals, pharmacies and specialists. By 2013, 78.4 percent of physicians in the United States had switched to EHR systems and more than 14 percent intended to switch in future [39]. Microsoft HealthVault, which is a personal EHR, enables people to gather, store, use and share health information online [36].

The EHRs laid the foundation for Clinical Decision Support Systems (CDSS) and expert systems which act as an aid to decision making for physicians. The program allows the user to enter symptoms and signs in medical terms and outputs a list of hypotheses generated using an extensive knowledge base, along with suggestions for additional parameters that might improve diagnosis. CDSS improves the quality of decisions and provides reliable and cost efficient consultation [34]. From a generic system, it eventually evolved into the form of an expert system where machine learning techniques assisted problem solving. EHRs and CDSS were the initial steps towards an IT-led reform, but there has been less success in their actual application due to many factors. First, the doctors found it difficult to learn these computer systems and believed they were more time consuming. Second, the patients demanded privacy and the right to give consent for using their information in research. Third, the ongoing technological evolution offered numerous other opportunities like integration and analysis of data, but these applications were inclined towards management of data. In CDSS, pre-defined rules were used to extract possible solutions and there was no scope of analysis of data to discover new rules. But now, analysis of even genetic data is possible. In the last decade, IT based healthcare networks have seen tremendous growth. Figure 1 depicts the upcoming revolution by comparing the current and emerging trends.

Current Trends | Future Trends
Patients visit doctor and share health information | Information sharing at home using an app
Clinical data used for disease or outcome predictions | Clinical and genetic data based predictions
No communication interface between doctors and pharmacists | Information exchange over the cloud
Same medicine to every patient of a particular disease | Customized medicine according to person's genomic profile
No provision of home-based patient monitoring | Patient monitoring at home using sensors and cloud
Manual extraction of medical knowledge | EHR and Social media mining using tools to extract knowledge

Figure 1 Comparison of Current and Future Trends
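The rule-based behaviour of early CDSS described in the Background (symptoms in, ranked hypotheses and suggested additional parameters out) can be illustrated with a minimal sketch. The rule names, the findings and the scoring by matched fraction are all assumptions made for illustration. They are placeholders rather than clinical knowledge and do not reproduce any system cited in this paper.

```python
# Toy knowledge base: each hypothetical rule maps a hypothesis to the findings
# it expects. These are illustrative placeholders, not clinical guidance.
RULES = {
    "hypothesis_A": {"fever", "cough", "fatigue"},
    "hypothesis_B": {"fever", "rash"},
    "hypothesis_C": {"thirst", "fatigue", "weight_loss"},
}


def rank_hypotheses(findings, rules=RULES):
    """Return (hypothesis, score, missing findings), best score first."""
    observed = set(findings)
    ranked = []
    for name, required in rules.items():
        matched = observed & required
        if not matched:
            continue  # rule shares nothing with the entered findings
        score = len(matched) / len(required)
        # Findings not yet entered are the "additional parameters" to ask for.
        ranked.append((name, score, sorted(required - observed)))
    ranked.sort(key=lambda item: (-item[1], item[0]))
    return ranked


if __name__ == "__main__":
    for name, score, missing in rank_hypotheses(["fever", "cough"]):
        print(name, round(score, 2), "ask about:", missing)
```

Because the rules are fixed in advance, such a system cannot discover new rules from data, which is the limitation noted above.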
This paper aims to provide an overview of the state of technologies in IT based healthcare in the last decade, with the intention of presenting researchers with open issues and future research directions. The rest of the paper is organized as follows: Section 3 describes the research technique used for this study. Section 4 elaborates the trends in healthcare by describing the types of healthcare data, technological advances, databases and tools that have emerged in the last decade. Finally, Section 5 describes the issues that need attention, followed by Section 6 which concludes the paper by suggesting some future directions.

3. RESEARCH TECHNIQUE
3.1 Research Questions and Motivation
This review aims at summarizing the numerous fields of IT providing medical solutions. Various research questions as described in Table 1 were proposed in order to plan the survey. The corresponding motivation is also discussed.

3.2 Sources of Knowledge
Because the medical literature is immense, thousands of options are available for search. In this study we have mainly used:

1) NCBI (<www.ncbi.nlm.nih.gov>)
2) Google Scholar (<www.scholar.google.co.in>)
3) IEEE Xplore (<www.ieeexplore.ieee.org>)
4) ACM Digital Library (<www.acm.org/dl>)
5) XRDS: Crossroads, The ACM Magazine for Students Volume 21 Issue 4 (2015)

3.3 Search Keywords
Search strings were defined based on the research questions. Keywords and similar words used in this study are as described in Table 2.

Table 2 Search Keywords and Synonyms

Search keywords | Synonyms
Genomics | Human Genome Data
Cloud Computing for E-health | Cloud based Medical Services
Data Mining in E-health | Big Data in Cloud for E-health
Medical Decision Support Systems | Prediction System for Health Services
Mining genomic data | Gene Data Mining

3.4 Study Selection
The selection procedure followed in this review is described in Figure 2. Initially, research questions were defined as discussed in section 3.1. Then, search keywords and synonyms were defined based on the research questions. The next step was to search for the keywords listed in Table 2 in the relevant sources of knowledge. Since the objective of this paper is to summarize the trends in healthcare, papers from the last decade are incorporated. After extracting the papers, irrelevant papers were purged by going through their title and abstract. Duplicate papers retrieved from different sources were also removed. Finally, major technologies from the papers were extracted and each paper was categorized according to the identified categories.

3.5 Results and discussion
From the survey, 19 papers were found to belong to four major areas, which are Data Mining, Network Analysis and Similarity based measures, Text Mining and Cloud-based Services. Figure 3 depicts the percentage of publications in each year. According to the figure, the majority of publications appeared in 2014, followed by 2011 and 2015.

Table 1 Research Questions and Motivation

a) Review Questions
   1. What kind of data is used for mining in healthcare research areas and how can such data be extracted?
   2. What are the possible sources of extraction of different kinds of data?
   3. How can gene expression data be used in clustering and what can we infer from the results?
   4. How can the data related to drugs be used for extracting relations?
   Motivation: It aims to identify various types of data used in healthcare and possible techniques to extract and use such data. Healthcare data is an important aspect for innovations and understanding its types, use and methods of extraction is indispensable.

b) Review Questions
   1. What are various kinds of prediction or decision support systems currently in use and how is the data used in them extracted?
   2. What are various kinds of technologies in use for cloud based healthcare applications?
   3. What are the various biological networks and how can they be analyzed?
   4. How can biomedical literature be mined and what are possible tools for it?
   5. What are the important databases and tools used in healthcare domain?
   Motivation: The health applications are a blend of different technologies used altogether. A vast variety of tools, databases and techniques are being used. These questions aid in recognizing the technologies and important tools utilized in various medical solutions.

[Figure 2: flow diagram] Define Research Questions -> Define Search Keywords -> Search Defined Sources -> Eliminate irrelevant and redundant papers -> Extract major technologies and Categorize each paper

Figure 2 Study Selection Procedure
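The elimination step of the procedure in Figure 2 can be sketched in code. This is only a minimal illustration under stated assumptions: the record fields (`title`, `abstract`, `year`), the title normalization used to detect duplicates, and the simple substring test for relevance are hypothetical choices. The review itself screened titles and abstracts manually.

```python
import re


def normalize(text):
    """Lowercase text and collapse punctuation/whitespace so variants match."""
    return re.sub(r"[^a-z0-9]+", " ", text.lower()).strip()


def select_studies(records, keywords, first_year, last_year):
    """Drop duplicate titles, out-of-range years and records matching no keyword."""
    seen = set()
    selected = []
    for record in records:
        key = normalize(record["title"])
        if key in seen:
            continue  # same paper returned by another source
        seen.add(key)
        if not first_year <= record["year"] <= last_year:
            continue
        text = normalize(record["title"] + " " + record.get("abstract", ""))
        if any(normalize(keyword) in text for keyword in keywords):
            selected.append(record)
    return selected


if __name__ == "__main__":
    # Hypothetical search results, for illustration only.
    records = [
        {"title": "Gene Data Mining for Cancer", "year": 2013},
        {"title": "gene data-mining for cancer.", "year": 2013},
        {"title": "Cloud based Medical Services", "year": 1999},
        {"title": "A Study of Hospital Architecture", "year": 2012},
    ]
    kept = select_studies(records, ["Gene Data Mining", "Cloud based Medical Services"], 2002, 2015)
    print([r["title"] for r in kept])
```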


[Figure 3: pie chart of the share of publications per year, 2002-2015]
Figure 3 Percentage of Publications in 2002-2015

Figure 4 compares the publications in different technological areas over the years. Being a recent research area, Cloud-based services emerged around 2012 and publications in Data Mining accelerated in 2014.

[Figure 4: chart of papers per year for Data Mining, Network Analysis, Text Mining and Cloud Based Services]
Figure 4 Comparison of papers published in Technological areas from 2002-2015

4. EMERGING TRENDS
With the accumulation of large volumes of text data, newer techniques for mining, analyzing and storing the data were devised. Recently, this data-driven revolution has touched some areas of the medical domain. In the past few years, machine learning techniques have been used extensively in healthcare applications. In an application of heart disease prediction [27], a new CoActive Neuro-Fuzzy Inference System (CANFIS)-Genetic Algorithm approach was proposed. Similarly, a natural walking monitor for pulmonary patients was developed using a simple smart phone with an underlying Support Vector Machine algorithm [16]. Artificial intelligence also enhanced decision support systems by providing various neural network techniques for decision making [22]. Similarly, the EHRs are now used to extract information using text mining approaches. Further, the convergence of cloud computing with diverse technologies such as wireless networks, sensors and mobile computing is leading to the creation of newer types of cloud services, which in turn is proving beneficial for health applications. The data which earlier required physical or desktop storage can now be easily accessed from cloud based storage repositories, making it more reliable, available and cost effective. Innovations and research in IT go hand in hand. Johns Hopkins University is one of the several academic initiatives contributing towards the research work in diverse areas including healthcare. The Individualized Health Initiative by the university is a step towards customization of medicine for each patient by analyzing big databases. The researchers at Johns Hopkins have devised a computer based method which predicts patients at risk of septic shock. The Johns Hopkins School of Medicine, along with Microsoft, has planned to provide an IT solution for the Intensive Care Unit where medical devices would interoperate and identify key trends for preventing injuries and complications, improving patients' safety.
In this section, we present the ongoing IT revolution in healthcare in different dimensions comprising the medical data, intersecting technologies, and diverse databases and tools, and focus on the future trends which might return novel outcomes in coming years.

4.1 Healthcare Data
Due to the data-centric nature of medical applications, it is essential to understand the varied types of data. From using only the clinical data in the past to mining genetic data in the current scenario, data and its computations have evolved enormously. A vast variety of data is accumulated in healthcare applications. This data can be mainly classified into four types, which are Genomic, Proteomic, Drugs related data and Clinical data. All the data mining tasks, machine learning algorithms, analysis or statistics are applied on these types of data. Tools and databases are designed according to this data. The relationship of the technologies with data is shown in Figure 5.

[Figure 5: diagram linking the data types (Genomic, Proteomic, Drugs data, Clinical data) to Data Mining, Network Analysis, Text Mining and Cloud based Services]
Figure 5 Relationship between Data and Technologies

4.1.1 Genomic
Deoxyribonucleic acid (DNA) is a linear series of chemical components. It stores the genetic information and template for synthesis of proteins. It basically contains sequences of nucleotides, each of which is one of G, A, C, T (in RNA, U takes the place of T). A gene is made up of a sequence of triplets of the nucleotides (exons). The gene undergoes the transcription process, which forms ribonucleic acid (RNA) by pairing each nucleotide with its complement (A-T and G-C in DNA, with U replacing T in RNA), and then the process of translation takes place, in which triplets of RNA nucleotides (codons) generate the corresponding amino acids or proteins [26]. These proteins are the basic building blocks for the development and function of a human being.
The Human Genome Project (HGP) was the international research program whose goal was the complete mapping and understanding of all the genes of humans. It was named Genome, as all the genes together are known as the "genome". The HGP has revealed that there are probably about 20,500 human genes. It gave the complete sequence of human genes in 2003, which is a resource of detailed information about the structure and function of genes (http://www.genome.gov/12011238). Due to the provision of such resources and continuous improvements in technology, biologists and researchers can now analyze the genomic data to provide customized healthcare solutions. The explosion of genetic data means researchers need not only to analyze such data accurately but also to find solutions with optimal performance [20].
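The transcription and translation steps outlined for genomic data can be made concrete with a short sketch. It assumes the input is the DNA coding strand, so the messenger RNA is obtained by replacing T with U (the RNA is complementary to the opposite, template strand). The codon table below is deliberately partial and lists only a handful of entries from the standard genetic code.

```python
# Partial codon table taken from the standard genetic code; only a few codons
# are listed, so unknown codons simply end translation in this sketch.
CODON_TABLE = {
    "AUG": "Met", "UUU": "Phe", "GGC": "Gly", "GCU": "Ala",
    "UAA": "Stop", "UAG": "Stop", "UGA": "Stop",
}


def transcribe(coding_strand):
    """Return the mRNA for a DNA coding strand (T is replaced by U)."""
    return coding_strand.upper().replace("T", "U")


def translate(mrna):
    """Read codons from the first AUG until a stop codon or an unlisted codon."""
    start = mrna.find("AUG")
    if start == -1:
        return []
    protein = []
    for i in range(start, len(mrna) - 2, 3):
        amino_acid = CODON_TABLE.get(mrna[i:i + 3])
        if amino_acid is None or amino_acid == "Stop":
            break
        protein.append(amino_acid)
    return protein


if __name__ == "__main__":
    mrna = transcribe("ATGTTTGGCGCTTAA")
    print(mrna, translate(mrna))
```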
4.1.2 Proteomic
Protein-protein interaction networks (PPIs) are network models which represent the pairwise protein interactions of an organism. Proteins that interact with one another can be clustered according to their biological function or because they participate in the same biological process [4]. Unknown functions and properties of proteins can be recognized this way. The set of proteins produced in an organism is known as its proteome. Proteomics is the large-scale study of proteomes. The proteome differs from cell to cell and changes over time. Proteomics can help find new drug targets and hence alternate therapies for patients can be suggested.

4.1.3 Drugs related data
P4 medicine is a term for what will be the ultimate goal of researchers in coming years. P4 refers to medicine which is not only personalized, but also predictive, preventive and participatory [14]. To fulfill this goal, the underlying data used will be the drugs related data in conjunction with proteomic or genomic data to infer unknown observations. The task of finding differentially expressed genes actually helps in drug target identification. Once this is done, drugs can be produced accordingly. This way, side effects of drugs can be reduced as the drug becomes more specific. The comparison of drug gene expression and disease gene expression can also be used to infer possible drugs for a disease [32]. It is possible to predict the drug response of cancer patients according to their genomic profile [31]. If the drug response is known, better selection of drugs is possible for individual patients, leading to the notion of personalized medicine.

4.1.4 Clinical data
Clinical data mainly comprises health parameters observed by physicians or sensor technologies. Extensive research has been carried out in mining such data. In [], parameters related to diabetes like BP and cholesterol have been monitored and analyzed to predict diabetic patients.

Clinical records are generic in nature as they help in predicting the outcomes according to rules generated by observing patterns in data. On the other hand, genomic, proteomic and drugs related data are generic as well as specific. Every living being will possess these data, which makes them generic, and each individual will have a unique set of genes, protein interactions and drug responses, which makes them specific. Due to this specificity, these data can be used for the concept of personalized medicine.

4.2 Technologies used
From this survey, the major healthcare technologies in which IT has progressed in the past few years have been identified. The technologies include:
• Data Mining
• Network Analysis
• Text Mining
• Cloud Based Services
In this section, we discuss some major tools and techniques used in the respective technologies.

4.2.2 Data Mining
Data mining refers to extracting new relationships or patterns from large amounts of data. In bioinformatics, data mining has been used extensively for gene finding, disease diagnosis and treatment optimization, identifying similar genes, protein and gene interaction network reconstruction and many more. Clustering techniques for gene expression data have helped in recognizing unknown gene function or discovering unknown subtypes of a disease. Genes with similar expression profiles are placed in the same cluster and changing levels of gene expression are observed, so that unknown genes with a similar profile can be identified [8]. Comparing differentially expressed genes in normal and diseased states leads to a gene expression profile or signature for that disease.
According to the papers analysed, progress has been made mainly in two areas: (I) disease related predictions and (II) gene selection methods. One of the recent works in disease related predictions is [7], where high-throughput gene expression data and clinical data were analysed to create a prediction model for the prognosis of lung cancer patients. The correlation between gene signatures and patient survival time was examined. An Artificial Neural Network (ANN) architecture was used with training data to build the model and five correlated genes were identified. Other techniques like hierarchical clustering, decision trees and risk scores for analysis of data were also compared. In [17], the response of cancer patients is predicted using Linear Discriminant Analysis (LDA) classification. Early detection of lung cancer is also possible as proposed in [2], in which K-means clustering is first used and then the AprioriTid algorithm and decision trees are applied to discover frequent patterns.
The other area entailed the notion of Single Nucleotide Polymorphism (SNP). SNPs are the variations of a single nucleotide (organic molecule) at the same locus in two individuals of the same species. The analysis of SNPs determines SNP/gene patterns for a particular disease or relationships between genotype and phenotype information. Such knowledge can help in better drug development or in personalized medicine. Due to high cost, such data should first be analysed to select the most informative genes/SNPs. For this, four main approaches are proposed [29] [11], namely Weighted Decision Tree based Gene Selection (WDTGS), Genetic Algorithm based Gene Selection (GAGS), feature set intersection and the Support Vector Machine (SVM) approach. In WDTGS, decision rules are formed, while in GAGS, a genetic algorithm is used to retrieve significant genes. Similarly, one approach uses the SVM algorithm, while in feature set intersection, any two datasets from the first two approaches are merged and important features are selected. Table 3 summarizes the findings in this domain in the past few years.

Table 3 Findings, Tools and Techniques used in Data Mining

Technology | Author | Year | Findings | Techniques/Tools used
Data Mining | LM. Fu and ES. Youn [11] | 2003 | Reliable Gene Selection | SVM algorithm
Data Mining | SC. Shah and A. Kusiak [29] | 2004 | Gene Selection Methods | WDTGS-GAGS, feature set intersection
Data Mining | K. Jung et al. [17] | 2010 | Predict therapy response of cancer patients | LDA classification
Data Mining | K. Ahmed et al. [2] | 2013 | Early detection of lung cancer | K-means, AprioriTid, Decision tree
Data Mining | Y. C. Chen, W. C. Ke and H. W. Chiu [7] | 2014 | Risk classification of cancer | ANN algorithm
Data Mining | J. Ma, C. Peng and Q. Chen [27] | 2014 | Heart disease prediction system | CANFIS-Genetic Algorithm
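The clustering of gene expression profiles described in Section 4.2.2, where an unknown gene is placed with genes of similar profile, can be sketched with a small K-means routine. The gene names and expression values are invented for illustration, and the initial centroids are fixed by hand to keep the result deterministic. None of this reproduces the cited studies.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two expression profiles."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean_profile(profiles):
    """Component-wise mean of a non-empty list of profiles."""
    return [sum(values) / len(profiles) for values in zip(*profiles)]


def kmeans(profiles, initial_centroids, iterations=10):
    """Cluster {gene: profile}; returns {gene: index of nearest centroid}."""
    centroids = [list(c) for c in initial_centroids]
    assignment = {}
    for _ in range(iterations):
        # Assignment step: each gene joins its nearest centroid.
        assignment = {
            gene: min(range(len(centroids)),
                      key=lambda k: squared_distance(profile, centroids[k]))
            for gene, profile in profiles.items()
        }
        # Update step: move each centroid to the mean of its members.
        for k in range(len(centroids)):
            members = [profiles[g] for g, c in assignment.items() if c == k]
            if members:  # keep the old centroid if a cluster is empty
                centroids[k] = mean_profile(members)
    return assignment


if __name__ == "__main__":
    # Hypothetical expression levels over three conditions.
    profiles = {
        "geneA": [1.0, 2.0, 3.0],
        "geneB": [1.1, 2.1, 2.9],
        "geneC": [3.0, 2.0, 1.0],
        "unknownX": [2.9, 2.2, 1.1],
    }
    print(kmeans(profiles, [profiles["geneA"], profiles["geneC"]]))
```

In this toy run the unknown gene ends up in the same cluster as geneC, which is the kind of evidence used to suggest a shared function.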
4.2.3 Network Analysis and Similarity based measures
In searching for candidate genes related to a disease, another popular methodology adopted by researchers is analyzing various networks such as gene-gene and gene-disease interaction networks. In [10], clustering of diseases is done based on similarity measures calculated using phenotypic attributes. Hence potential disease genes not yet known are identified. The difference in [12] is that it assumes that a gene will be related to a disease if its neighbours are associated with similar diseases. M.B. Carson and H. Lu [5] extracted proteins associated with disease using network analysis along with data mining. Interactions with other proteins and the disease relationships of neighbouring proteins determined whether a protein had a relationship to disease. Further, a machine learning algorithm, the ADTree algorithm, was proposed, which provided the order of the most important characteristics for classification of proteins. Table 4 summarizes the findings.

4.2.4 Text Mining
Most of the biological literature is in an unstructured form. Even a search for a particular entity like a gene returns thousands of articles which are difficult to read as such. One solution is manual curation of articles, but with such volumes of data it is not a feasible approach. Another solution is automatic extraction using computers. This poses problems of accuracy. So, CBioC was introduced [3], which is a prototype for collaborative annotation of interactions from biological literature. Protein-protein interactions or gene-disease relationships are extracted using natural language processing methods. The researchers can use the server for searching the database and annotate relevant facts. Similarly, a literature mined gene interaction network is used to identify gene-disease associations [25]. M. Huang et al. [15] devised an algorithm to extract PPIs from patterns. Knowledge can be extracted from raw medical records, social media or search engine logs using text mining and machine learning [37], which can help in identifying signals of adverse drug reactions [18]. J. Mork et al. [23] explained the MetaMap tool, which maps biomedical text to the Unified Medical Language System (UMLS). Table 5 summarizes the techniques and tools of text mining.

4.2.5 Cloud-based Services
Cloud computing is a fast growing trend that includes several services like designing computing systems, developing applications and leveraging existing services for building software, all offered on demand over the internet in a pay-as-you-go model. It lowers the costs and increases the speed of deployment of applications. Due to technology advances like the use of Body Area Networks, wireless networks and Bluetooth connections, healthcare data can now be easily communicated across remote areas. Many applications have therefore used the same concept along with the cloud to provide efficient storage, better availability and analysis of medical data over remote areas.
Table 4 Findings, Tools and Techniques used in Network Analysis and Similarity based measures

Technology | Author | Year | Findings | Techniques/Tools used
Network Analysis | J. Freudenberg and P. Propping [10] | 2002 | Predict disease relevant genes | Similarity based measures
Network Analysis | P. Shannon et al [30] | 2003 | New tool for network analysis | Cytoscape
Network Analysis | X. Guo, et al [12] | 2011 | Predict disease relevant genes | Iterative algorithm
Network Analysis | MB. Carson and H. Lu [5] | 2015 | Predict protein-disease relation | ADTree algorithm
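The neighbour-based reasoning used in network analysis, where a gene is suspected of being disease related when its network neighbours already are, can be sketched as a single scoring pass. This is a simplification under stated assumptions: the network and gene names are hypothetical, and the cited work [12] uses an iterative algorithm rather than the one-step fraction computed here.

```python
def neighbour_scores(network, known_disease_genes):
    """Rank candidate genes by the fraction of their neighbours linked to the disease.

    network maps each gene to the set of genes it interacts with.
    """
    scores = {}
    for gene, neighbours in network.items():
        if gene in known_disease_genes or not neighbours:
            continue  # skip genes already known and isolated genes
        linked = sum(1 for n in neighbours if n in known_disease_genes)
        scores[gene] = linked / len(neighbours)
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


if __name__ == "__main__":
    # Hypothetical gene-gene interaction network.
    network = {
        "G1": {"G2", "G3"},
        "G2": {"G1", "G4"},
        "G3": {"G1"},
        "G4": {"G2", "G5"},
        "G5": {"G4"},
    }
    print(neighbour_scores(network, {"G1"}))
```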

Table 5 Findings, Tools and Techniques used in Text Mining

Technology | Author | Year | Findings | Techniques/Tools used
Text Mining | M. Huang, et al [15] | 2004 | Discover patterns to extract PPIs from text | Pattern matching algorithm
Text Mining | C. Baral et al [3] | 2007 | Extract biomedical relations | CBioC prototype
Text Mining | A. Özgür et al [25] | 2008 | Using literature mined gene interaction network | Dependency parsing
Text Mining | S. Karimi et al [18] | 2015 | Adverse drug reaction detection | Different techniques
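Pattern-based extraction of protein-protein interactions from text, as listed in Table 5, can be illustrated with a small regular-expression sketch. The verb list and the assumption that protein names are capitalized tokens are hypothetical simplifications. The cited algorithm [15] discovers its patterns from data rather than using a hand-written list.

```python
import re

# Hypothetical surface patterns; real systems learn or curate many more.
INTERACTION_VERBS = r"(?:interacts with|binds to|binds|phosphorylates|activates|inhibits)"
PROTEIN = r"([A-Z][A-Za-z0-9-]+)"  # crude stand-in for a protein name recognizer
PATTERN = re.compile(rf"\b{PROTEIN} {INTERACTION_VERBS} {PROTEIN}\b")


def extract_interactions(text):
    """Return (protein, protein) pairs matched sentence by sentence."""
    pairs = []
    for sentence in re.split(r"(?<=[.!?])\s+", text):
        pairs.extend(PATTERN.findall(sentence))
    return pairs


if __name__ == "__main__":
    text = "BRCA1 interacts with BARD1. We found that MDM2 inhibits TP53 in these cells."
    print(extract_interactions(text))
```

A sketch like this trades recall for simplicity, which reflects the accuracy problem of automatic extraction mentioned in Section 4.2.4.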

Table 6 Findings, Tools and Techniques used in Cloud-based Services

Technology | Author | Year | Findings | Techniques/Tools used
Cloud-based Services | M. Fahim et al [9] | 2012 | Elderly person reminder & care giver assistant app | Cloud, Android platform
Cloud-based Services | V. Koufi et al [19] | 2012 | Cloud based Emergency Medical Service (EMS) | NefaliEMS
Cloud-based Services | B.E. Reddy, TVS. Kumar and G. Ramu [28] | 2012 | Cloud based Healthcare Monitoring System (CHMS) | CHMS using Aneka, Xen Server
Cloud-based Services | J. Ma, C. Peng and Q. Chen [21] | 2014 | Digital home based self-care | OpenStack, Amazon Web Services
Cloud-based Services | S. Mukherjee, K. Dolui and SK. Datta [24] | 2014 | Patient health management system for emergency cases | Cloud storage, ReTiHA
Some major applications or frameworks studied in this review are tabulated in Table 6. B.E. Reddy et al. [28] discussed a healthcare monitoring system that is improved by utilizing cloud capabilities. In this system, the data from various remote patients is stored on a cloud based medical repository using Aneka. The remote patients as well as the concerned doctors can communicate among themselves on this platform. In another study of a monitoring system [24], there is provision of a Real Time Health Advice and Action (ReTiHA) service, which is invoked when the emergency medical response system fails. Cloud provides an efficient, fast and secure platform for emergency cases as well, using the Electronic Medical Records in a cloud environment [19]. Another application was developed for smart homes on the Android platform for tracking the daily life activities of users (mainly elderly people) [9]. Applications have been made where the patients can manage their own medical data at home. J. Ma et al. [21] proposed a cloud based solution which enables patients of chronic diseases to share their health information for different purposes. A. Hans and S. Kalra performed a detailed comparative analysis of cloud-based medical services [13].

4.3 Databases and Tools
In this section, we give an overview of some important tools and databases which are an aid to healthcare professionals and researchers so that they can perform various computational as well as medical tasks. The medical literature is interrelated and complex, so the databases also take various forms. In Table 7, some of the common publicly available databases and the kind of data provided by them are tabulated. Table 8 illustrates the type, purpose and special features of some popular tools.

5. ISSUES NEED TO BE CONSIDERED
Although the healthcare industry is progressing at a fast pace, there are still issues in certain areas that need to be addressed. Due to privacy and confidentiality concerns, public cloud technologies are not much encouraged. One of the reasons behind the failure of Google Health and the popularity of Microsoft HealthVault is the privacy concern. Due to the same reason, gathering genomic data is also not very easy.
Using data mining or machine learning techniques for providing healthcare solutions requires a great deal of understanding of both domains, as it is an interdisciplinary subject. Moreover, biological literature is vast and integrating such data is complicated. Although hundreds of tools are available, in order to devise new solutions one should be well versed in the relevant tools and techniques.
Further, healthcare data needs to be updated from time to time. In therapeutic studies, data still needs to be gathered for various new drugs and diseases. Table 9 describes some common issues and future outcomes expected in the respective technologies.

6. CONCLUSION AND FUTURE WORK
This review paper describes the major technologies in the healthcare domain along with a discussion of popular techniques, tools and databases, so that interested researchers can comprehend the state of the work done in different technologies. Due to issues in these technologies, new ideas will be needed to overcome the reluctance to accept cloud and other technologies.
Two very new domains in this field are pharmaceutical and disease dietomics. For the former, work is in progress, which is evident from the research work done using drugs data as we have discussed earlier in section 4.1.3. For the latter, diet related data can be used to infer relations between our diet and diseases. Blood sugar response to changes in daily diet, exercise and other activities can be used to predict personalized nutrition. So in coming years, we can expect outcomes in these areas. We hope that this study will be beneficial for those who want to pursue work in any of the technologies discussed.

Table 8 Purpose and Features of tools

Tools | Type | Purpose | Special Features
GeneMerge [6] | Web-based standalone program | Returns descriptive info (functional and genomic data) for each study gene set; returns rank scores for over-representation of descriptors in genes | Takes gene association files, description file, study set and population gene file as input; outputs tab delimited txt file
Cytoscape [30][33] | Open-source software project | Integration of molecular interaction network data at large scale; analysis and visualization of networks | Core extensible through plug-in architecture; mapping of data attributes to visual display and n/w layout
ArrayTrack [35] | Bioinformatics review tool | Manage, analyse and interpret genomic, proteomic and metabolomic data | Development done with Voluntary eXploratory Data Submission (VXDS) and MicroArray Quality Control (MAQC) projects; particularly used in Food and Drug Administration
Galaxy [1] | Open, web-based platform | Users can interact with various bioinformatics tools and share data; perform analysis without worrying about resources | Can install different bioinformatics tools; Galaxy Main free and publicly available, plug-ins also available; available on various cloud computing resources
Table 9 Common issues and future outcomes for various technologies

Technology: Data Mining
- Issue in current state: Numerous studies for only cancer-related predictions
  Future outcome: Other diseases and subtypes of cancer should also be considered
- Issue in current state: A single data mining technique used in every study
  Future outcome: Mixture of these techniques can be used

Technology: Network Analysis
- Issue in current state: Reliable PPIs and disease description not known
  Future outcome: Composition of better annotated databases for PPIs and disease description
- Issue in current state: Analysis done only for a particular disease
  Future outcome: Identify protein sub-networks helpful for multi-factorial diseases

Technology: Cloud-based Services
- Issue in current state: Privacy and security issues
  Future outcome: Include features like data encryption and access control or use hybrid cloud
- Issue in current state: Emergency medical service not considered in some applications
  Future outcome: Merge the emergency medical service with other cloud applications
- Issue in current state: High cost of using cloud resources
  Future outcome: Use SLA management
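
The "mixture of these techniques" suggested for data mining in Table 9 could take the form of a simple ensemble. The sketch below assumes majority voting, which is only one possible way to combine techniques and is not prescribed by the studies reviewed. The three rule-based classifiers and the feature names (marker_a, marker_b, marker_c) are hypothetical stand-ins for trained models and carry no clinical meaning.

```python
from collections import Counter


def majority_vote(classifiers, sample):
    """Return the label predicted by most classifiers for one sample."""
    votes = [classify(sample) for classify in classifiers]
    label, _count = Counter(votes).most_common(1)[0]
    return label


# Hypothetical stand-ins for three different data mining techniques.
def rule_a(sample):
    return "high_risk" if sample["marker_a"] > 0.5 else "low_risk"


def rule_b(sample):
    return "high_risk" if sample["marker_b"] > 0.5 else "low_risk"


def rule_c(sample):
    return "high_risk" if sample["marker_c"] > 0.5 else "low_risk"


classifiers = [rule_a, rule_b, rule_c]
sample = {"marker_a": 0.7, "marker_b": 0.2, "marker_c": 0.9}  # toy values
prediction = majority_vote(classifiers, sample)
```

Using an odd number of classifiers avoids ties when there are only two labels; weighted voting or stacking would be alternative ways to combine the individual techniques.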

REFERENCES

[1] Afgan, E., Baker, D., Chilton, J., Coraor, N., Taylor, J., and Galaxy Team. 2014. Galaxy cluster to cloud-genomics at scale. In Proceedings of the 9th Gateway Computing Environments Workshop, IEEE Press. DOI= 10.1109/GCE.2014.13
[2] Ahmed, K., Abdullah-Al-Emran, A. A. E., Jesmin, T., Mukti, R. F., Rahman, M., and Ahmed, F. 2013. Early detection of lung cancer risk using data mining. Asian Pacific Journal of Cancer Prevention 14, 1, 595-598. DOI= http://dx.doi.org/10.7314/APJCP.2013.14.1.595
[3] Baral, C., Gonzalez, G., Gitter, A., Teegarden, C., Zeigler, A., and Joshi-Topé, G. 2007. CBioC: beyond a prototype for collaborative annotation of molecular interactions from the literature. Comput Syst Bioinformatics Conf. 6, 381-4.
[4] Brohee, S., and Van Helden, J. 2006. Evaluation of clustering algorithms for protein-protein interaction networks. BMC bioinformatics 7, 1, 1. DOI= 10.1186/1471-2105-7-488
[5] Carson, M.B., and Lu, H. 2015. Network-based prediction and knowledge mining of disease genes. BMC medical genomics 8, 2, S9. DOI= 10.1186/1755-8794-8-S2-S9
[6] Castillo-Davis, C.I., and Hartl, D.L. 2003. GeneMerge—post-genomic analysis, data mining, and hypothesis testing. Bioinformatics 19, 7, 891-892. DOI= 10.1093/bioinformatics/btg114
[7] Chen, Y. C., Ke, W. C., and Chiu, H. W. 2014. Risk classification of cancer survival using ANN with gene expression data from multiple laboratories. Computers in biology and medicine 48, 1-7. DOI= 10.1016/j.compbiomed.2014.02.006
[8] Deepika, T., and Porkodi, R. 2015. A Survey on Microarray Gene Expression Data sets in Clustering and Visualization Plots. International Journal of Emerging Research in Management and Technology 4, 3, 56-66.
[9] Fahim, M., Fatima, I., Lee, S., and Lee, Y.K. 2012. Daily life activity tracking application for smart homes using android smartphone. In Proceedings of 14th International Conference on Advanced Communication Technology. ICACT ’12, IEEE.
[10] Freudenberg, J., and Propping, P. 2002. A similarity-based method for genome-wide prediction of disease-relevant human genes. Bioinformatics 18, 2, S110-S115. DOI= 10.1093/bioinformatics/18.suppl_2.S110
[11] Fu, L.M., and Youn, E.S. 2003. Improving reliability of gene selection from microarray functional genomics data. IEEE Transactions on Information Technology in Biomedicine 7, 3, 191-196. DOI= 10.1109/TITB.2003.816558
[12] Guo, X., Gao, L., Wei, C., Yang, X., Zhao, Y., and Dong, A. 2011. A computational method based on the integration of heterogeneous networks for predicting disease-gene associations. PloS one 6, 9, e24171. DOI= http://dx.doi.org/10.1371/journal.pone.0024171
[13] Hans, A., and Kalra, S. 2014. Comparative analysis of various cloud based biomedical services. 2014 International Conference on Medical Imaging, m-Health and Emerging Communication Systems (MedCom). DOI= 10.1109/MedCom.2014.7006038
[14] Hood, L., and Auffray, C. 2013. Participatory medicine: a driving force for revolutionizing healthcare. Genome medicine 5, 12, 1-4. DOI= 10.1186/gm514
[15] Huang, M., Zhu, X., Hao, Y., Payan, D.G., Qu, K., and Li, M. 2004. Discovering patterns to extract protein–protein interactions from full texts. Bioinformatics 20, 18, 3604-3612. DOI= 10.1093/bioinformatics/bth451
[16] Juen, J., Cheng, Q., and Schatz, B. 2014. Towards a natural walking monitor for pulmonary patients using simple smart phones. In Proceedings of the 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics. DOI= http://dx.doi.org/10.1145/2506583.2512362
[17] Jung, K., Grade, M., Gaedcke, J., Jo, P., Opitz, L., and Becker, H. 2010. A new sensitivity-preferred strategy to build prediction rules for therapy response of cancer patients using gene expression data. Computer methods and programs in biomedicine 100, 2, 132-139. DOI= http://dx.doi.org/10.1016/j.cmpb.2010.03.016
[18] Karimi, S., Wang, C., Metke-Jimenez, A., Gaire, R., and Paris, C. 2015. Text and data mining techniques in adverse drug reaction detection. ACM Computing Surveys (CSUR) 47, 4, 56. DOI= http://dx.doi.org/10.1145/0000000.0000000
[19] Koufi, V., Malamateniou, F., and Vassilacopoulos, G. 2012. An android-enabled mobile framework for ubiquitous access to cloud emergency medical services. In IEEE Second Symposium on Network Cloud Computing and Applications (NCCA). DOI= 10.1109/NCCA.2012.30
[20] Kutlu, M., and Agrawal, G. 2014. Cluster-based SNP calling on large-scale genome sequencing data. In 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid). DOI= 10.1109/CCGrid.2014.111
[21] Ma, J., Peng, C., and Chen, Q. 2014. Health Information Exchange for Home-Based Chronic Disease Self-Management--A Hybrid Cloud Approach. In 5th IEEE International Conference Digital Home (ICDH). DOI= 10.1109/ICDH.2014.54
[22] Meireles, A., Figueiredo, L., Lopes, L.S., and Almeida, A. 2014. Portable decision support system for heart failure detection and medical diagnosis. In Proceedings of the 18th ACM International Database Engineering & Applications Symposium. DOI= 10.1145/2628194.2628204
[23] Mork, J., Peters, L., Jimeno-Yepes, A., Aronson, A.R., and Bodenreider, O. MetaMap in the CALBC Workshop II.
[24] Mukherjee, S., Dolui, K., and Datta, S.K. 2014. Patient health management system using e-health monitoring architecture. In IEEE International Advance Computing Conference (IACC). DOI= 10.1109/IAdCC.2014.6779357
[25] Özgür, A., Vu, T., Erkan, G., and Radev, D.R. 2008. Identifying gene-disease associations using centrality on a literature mined gene-interaction network. Bioinformatics 24, 13, i277-i285. DOI= 10.1093/bioinformatics/btn182
[26] Pal, S.K., Bandyopadhyay, S., and Ray, S.S. 2006. Evolutionary computation in bioinformatics: A review. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews 36, 5, 601-615. DOI= 10.1109/TSMCC.2005.855515
[27] Parthiban, L., and Subramanian, R. 2008. Intelligent heart disease prediction system using CANFIS and genetic algorithm. International Journal of Biological, Biomedical and Medical Sciences 3, 3. DOI= 10.1.1.451.9421
[28] Reddy, B.E., Kumar, T.V.S., and Ramu, G. 2012. An efficient cloud framework for health care monitoring system. In IEEE International Symposium on Cloud and Services Computing (ISCOS). DOI= 10.1109/ISCOS.2012.11
[29] Shah, S.C., and Kusiak, A. 2004. Data mining and genetic algorithm based gene/SNP selection. Artificial intelligence in medicine 31, 3, 183-196. DOI= 10.1016/j.artmed.2004.04.002
[30] Shannon, P., Markiel, A., Ozier, O., Baliga, N.S., and Wang, J.T. 2003. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome research 13, 11, 2498-2504. DOI= 10.1101/gr.1239303
[31] Sheng, J., Li, F., and Wong, S.T.C. 2015. Optimal drug prediction from personal genomics profiles. IEEE Journal of Biomedical and Health Informatics 19, 4, 1264-127. DOI= 10.1109/JBHI.2015.2412522
[32] Sirota, M., Dudley, J.T., Kim, J., Chiang, A.P., and Morgan, A.A. 2011. Discovery and preclinical validation of drug indications using compendia of public gene expression data. Science translational medicine 3, 96, 96ra77. DOI= 10.1126/scitranslmed.3001318
[33] Smoot, M.E., Ono, K., Ruscheinski, J., Wang, P.L., and Ideker, T. 2011. Cytoscape 2.8: new features for data integration and network visualization. Bioinformatics 27, 3, 431-432. DOI= 10.1093/bioinformatics/btq675
[34] Soni, S.R., Khunteta, A., and Gupta, M. 2011. A review on intelligent methods used in medicine and life science. In Proceedings of the ACM International Conference & Workshop on Emerging Trends in Technology. DOI= 10.1145/1980022.1980173
[35] Tong, W., Harris, S.C., Fang, H., Shi, L., and Perkins, R. 2007. An integrated bioinformatics infrastructure essential for advancing pharmacogenomics and personalized medicine in the context of the FDA's Critical Path Initiative. Drug Discovery Today: Technologies 4, 1, 3-8. DOI= 10.1016/j.ddtec.2007.10.008
[36] Venkatraman, S., Bala, H., Venkatesh, V., and Bates, J. 2008. Six strategies for electronic medical records systems. Communications of the ACM 51, 11, 140-144. DOI= 10.1145/1400214.1400243
[37] Yadav, V.P., and Kumari, M. 2014. Name Entity Conflict Detection in Biomedical Text Data Based on Probabilistic Topic Models. In Proceedings of the ACM International Conference on Information and Communication Technology for Competitive Strategies. DOI= 10.1145/2677855.2677916
[38] Bhattacharyya, M. 2015. Disease dietomics. XRDS: Crossroads, The ACM Magazine for Students 21, 4, 38-44. DOI= 10.1145/2788508
[39] Razavian, N. 2015. Advancing the frontier of data-driven healthcare. XRDS: Crossroads, The ACM Magazine for Students 21, 4, 34-37. DOI= 10.1145/2788506
[40] Kaur, P.D., and Chana, I. 2014. Cloud based intelligent system for delivering health care as a service. Computer methods and programs in biomedicine 113, 1, 346-359. DOI= 10.1016/j.cmpb.2013.09.013
