Professional Documents
Culture Documents
ICIIECS’15
Abstract- Data in the healthcare sector is growing beyond Digitized data in the health care sector is growing
dealing capacity of the health care organizations and is expected massively with data coming in from internal as well as external
to increase significantly in the coming years. Majority of the sources, from mobile devices, wearable sensor devices[1,3],
Healthcare data is often unstructured, exists in silos and resides Electronic Health Records (EHR), Radiology images, Videos,
in imaging systems, medical prescription notes, insurance claims
clinical notes, social media, blogs, remote health monitoring
data, EPR (Electronic Patient Records) etc. integrating these
heterogeneous data and factoring it in to advance analytics is devices etc. newer forms of big data such as imaging, sensor
critical to improve healthcare outcomes. Either because data are reading is also fueling to the need of Big Data solutions to
isolated in disparate or incompatible formats or due to the lack in manage these massive and silos of data available in the
processing capability to load and query large datasets in a timely healthcare industry. The health care industry need to work on
fashion the Healthcare organizations are not in a position to prediction, prevention and personalization to improve their
leverage the benefits of the vast data they have. With outcomes.
convergence of advanced computing and numerous Big Data Numerous amounts of data-structured, semi-structured and
technological options like commercial solutions, Open Source, unstructured data are a characteristic that makes the health care
Cloud etc. it is now possible to attain high performance,
scalability at a relatively low cost. Big data solutions often come
data most challenging. Most of the data in health care comes
with set of innovative data management solutions and analytical from various sources like X-Rays images, MRI Scan reports,
tools, when effectively implemented can transform the healthcare Blood test value, hand written prescriptions, real time data such
outcomes. as OT room monitors for anesthesia, heart monitors, blood
pressure readings [8] etc. The health care data has large amount
Index Terms: Analytics, Big Data, Cloud, Data Management, of data coming from internal and external sources these data
Healthcare, Open Source. majorly comes from:
1. Providers: medical data (EHRs, EPRs)
I. INTRODUCTION 2. Payers: claims and cost data
3. Researchers: academic, independent
Till the recent past the health care industry had been using
4. Consumers and Marketers: patient behavior and
the conservative approach for diagnosis and treatment, where
sentiment data
most doctors depended on their individual knowledge and
5. Government: population and public health data
skills in diagnosing diseases in patients resulting in a less
6. Developers: pharmacy and medical device R&D
precise and patient centric. Digitization, rising rates of chronic
The five key characteristics that define big data are:
diseases, increased population, advancement in technology,
1. Volume: Data is continuously generated in large volumes
need for evidence based medicine, inability to process and get
from real time health monitoring systems, EHRs, EPRs, Labs,
insight from ever increasing heterogeneous medical data are
sensor devices etc.
some of the drivers for adopting Big Data solutions.
2. Velocity: The need to process the data in real time
Adoption of Big Data solutions will play an important role
coming from streaming data like Remote Patient Monitoring,
in transforming the outcomes of the health care industry by
data from sensor devices, Telemedicine etc.
promoting evidence based reasoning in treatment, providing
patient centric treatment by enabling a 360-degree view of each 3. Variety: Data can be structured, semi-structured or
patient. Now most of the stakeholders are starting to embrace unstructured collected from different sources like
the concept of evidence based medicine a system where Patient/Member conversations, Health Community Blogs,
treatment decision are based also on the scientific evidence Social Media etc.
available rather than just the doctors skill and knowledge 4. Veracity: Deals with the quality of data being captured.
providing a measurable outcome towards treatment. 5. Value: It is the most important V of big data, deals with
extracting value from the data.