You are on page 1of 10

Big Data In Cloud Computing : Features & Challenges

Under the guidance of


Dr. Anuradha Kanade
School Of Computer Science
MIT WPU

Ritwik Sonal Parag Bhangale Ashish Pal


TY MCA TY MCA TY MCA
Faculty of Science Faculty of Science Faculty of Science
MIT WPU MIT WPU MIT WPU
Pune, India Pune , India Pune , India

Abstract information or addresses exaflop


computing abilities efficiently. This paper
Big data refers to the large amount of overviews both cloud and big data
information that is just too difficult to technologies describing the current issues
manage with traditional data-modeling with these technologies.
techniques. Big data technology is able to
store and process large volumes of Literature Review
different types of data, delivering
meaningful insights by exploring its Big Data is a major trend that is increasing
clients' or experiments' entire in size day by day, and new challenges and
activities/statistics. Cloud computing solutions are being published every day.
provides a way to manage big data in an
Cloud computing is awesome for
environment that is reliable,available and
performing massive and complex
scalable. In this paper, we will discuss an
calculations. It reduces the need to
overview of big data and cloud computing
maintain expensive computer hardware,
when integrating both together. Though
giving you more time to think up
big data is rather powerful and successful
innovative ways of working out some
so far, it still presents some gaps and
problems that are completely new to you.
issues that require thorough addressing.
Massive growth in the scale of data
Security risks are a pretty common
generated through cloud computing has
problem to face today as are privacy issues
been observed. Addressing the big data
of various natures, which also have their
problem efficiently with high performance
own sets of challenges to deal with in
is a challenging task that requires large
terms of scaling and recovery mechanisms,
computational infrastructure to ensure
as well as problems relating to data
successful data analysis and processing.
heterogeneity, especially of different
types. Threats such as disaster
Introduction
recovery.Other concerns relate to cloud
computing as that is enormous amounts of
Big data is BIG! The type of big data you processing power and storage capacity of
have is directly related to the kinds of Big Data. Storing information demands an
devices and tools that are being used these extensive amount of time, resources, and
days, such as the IoT for example (which space. The cloud takes over where the
monitors things like vast amounts of costs become too great to handle by an
internet-connected devices). Big data is individual user or other areas who need
defined as any information that has been access to complete-data backup-up
gathered from a host of sources related to services.
digital technology. Data types can be
visual or audio in nature, such as images In This paper we will discuss the
and videos, or text-based, like emails and relationship between them, as well as
social media. obstacles and challenges that these
relationships may encounter.
Cloud computing refers to on-demand
computing resources that can be used to Big Data
easily and quickly access a variety of
useful computer services. This often refers Big data is a large amount of data that has
to remote servers, software, data storage been collected from multiple sources no
and any other functions that can be matter big or small. This process takes
delivered through the network in order for quite a bit of processing power and high
users to access and experience the benefits capabilities for information analysis. It is
of cloud computing. By using common crucial to call for the term big data during
resources and having them readily this time because it connects directly to
available on demand, users are provided how well companies have become more
with unlimited scalability from multiple analytical in nature as they have come to
providers as opposed to only one source or understand the importance of big data in
server which would naturally result in generating an informed decision which can
limited capabilities and many limitations. then be used by those who are working in
these areas to provide better and faster
Data is growing in the cloud. It increases services to their clients.
rapidly due to the increased number of
Internet users around the world. With this Characteristics Of Big Data
rapid growth, businesses wonder how to
provide reliable storage for their The five different characteristics used to
customers' data and grow their cloud describe big data are Volume, Variety,
storage infrastructure cost effectively. Velocity, Value and Veracity.
They need a storage technology that can
handle this exponential increase in data
and yet stay affordable, reliable, high
capability and scalable.

Cloud computing and Big Data have


developed a strong relationship, one in
which cloud data storage supports the
Veracity : It refers to the quality of the
data and how accurate it is. Data’s
accuracy also determines how valuable it
is. Bad quality data can lead to inaccurate
analysis. Although there is wide agreement
on the potential value of big data,the data
it’s almost worthless if it isn't accurate.

Benefits of big data

Big data is helpful in making high choices


and taking purposeful action at the right
time. Hadoop as fast forward to it and
Volume : It represents the amount of data
technologies as it gives you size and
which represents the huge number of
flexibility to store data .Use a business
numbers from multiple sources as a sum of
intelligence tool to measure your finances
zeta bytes, which is the most evident
that can give you a clear picture of where
dimension in what concerns big data.
your business stands.Organizations may
fine-tune their business strategy using
Variety : It is associated with big data,
social data from search engines and sites
one of the main characteristics of this
like Facebook and Twitter.Traditional
field. Because it has to deal with data from
consumer feedback systems are being
various sources, big data frameworks
phased out in favour of new Big Data-
ought to be prepared for all types of
based methods. Big Data and natural
information - numbers, text, videos and
language processing technologies are
many more.
being utilized to read and assess user
reactions in these new platforms.
Velocity : It has been proven that data can
be retrieved from various sources at an
Cloud Computing
exponential rate. As more and more
sources, such as Twitter and Facebook for It’s a term that refers to on-demand
example, release volumes of information computing resources that can be used to
at an increased velocity, this means a easily and quickly access a variety of
change in the strategy needed to analyze useful computer services. This often refers
data in a timely manner. to remote servers, software, data storage
and any other functions that can be
Value : Data is full of information which delivered through the network in order for
helps us find the valuable information users to access and experience the benefits
from it. The value comes after processing, of cloud computing. By using common
modelling and visualization. The stage resources and having them readily
also includes refining and cleaning up data available on demand, users are provided
too. with unlimited scalability from multiple
providers as opposed to only one source or
server which would naturally result in
limited capabilities and many limitations . Measured service : Cloud systems track
Cloud computing is a shared resource resource use and Cloud systems also allow
system that can offer cost effective for monitoring that's complete visibility
opportunities for small businesses because into the system as well as control over
they could now run their whole business what users are able to see—an important
without having to purchase tons of aspect of both security and compliance.
hardware or take on massive debts!
Cloud computing models
Characteristics Of Cloud Computing
There are three cloud computing models.
Cloud computing is one of the most
advanced models. NIST (The National
Institute for Standards and Technology)
has identified five important aspects of
cloud systems, a model that simplifies the
usage of cloud computer.

On demand Service : Cloud services are


easy, convenient, and less expensive than
having your own servers. Because they
take care of technical considerations for
you, cloud services make it easier to run
Infrastructure as a Service (IAAS): Cloud
your business, so you can focus on the big
service providers provide virtualized
picture.
computing resources on the Internet. In an
Broad Network Access : cloud computing IaaS model, a third-party provider will
resources are accessible over the internet host hardware, software, and servers on
through wireless devices and mobile behalf of its user-- who only need to focus
technology. on a specific aspect of their business to
grow upon.
Resource Pooling : Cloud computing
users share a wide range of resources and, Platform as a Service (PAAS) : Cloud
as such, users can determine how they service providers offer platforms, tools and
want their resources and in which location other services to users. They manage,
they prefer but cannot dictate the actual including the operating system and
physical location of their computing unit. middleware with resources that enable you
to deliver everything from simple cloud-
Rapid Elasticity : systems can be scaled based applications to sophisticated
rapidly, to suit any and all needs. They can enterprise grade solutions.
be scaled up in size substantially by adding
extra hardware resources (e.g., storage Software as a Service (SAAS) : Cloud
devices, processing units etc.) and/or service providers provide a variety of
scaling down to run on less powerful or software to users who can use them
smaller machines. And the process will without installing the applications on their
usually be handled by automation. computers. The user is not responsible for
anything other than adjusting the settings As more data transfers to the cloud,
and customizing the service as appropriate security and privacy are two major things
to their needs. SAAS helps big data clients that need to be considered. Other topics
to perform data analysis. that arise include data recovery, data
uploading to the cloud, and Exaflop
Big Data In Cloud computing.

Storing and processing huge amounts of Security


data requires scalability and fault
tolerance. Cloud computing refers to using Cloud computing and big data security are
the power of internet together with vast both major concerns for businesses. Before
networks of computers that are connected a business plans on storing confidential
all the time. The advancements in cloud data in the cloud, they must answer many
computing has enabled more powerful questions such as who owns that data,
hardware to handle the large datasets. where it is stored, who has access to it and
what type of rights they have permission to
Cloud computing and big data are merging change it.
together to help better serve people who
need to collect or store large amounts of a) Who is the real owner of the data ?
data. Rapid growth in big data is a
A client may pay for a cloud
problem that needs solving because the
service, but does the data belong to
amounts of information being collected are
them? How much access does their
becoming too large for standard storage
provider have to information and
systems to handle, not to mention
for what reasons may it use it? Is
difficulties in getting it to integrate with
the data useful to the cloud service
other interconnected databases effectively.
provider?
That's why cloud computing has evolved;
it allows users more ways of looking at b)Where is the data?
their information and serving as a means
by which information can be shared Data is permission to share with
between multiple disconnected locations, other countries. when some time
making things such as backups easier. Big data to help to anywhere to access
data may be a problem but cloud but sometimes not allow to another
computing provides solutions! country so many software and
company policies register to data
Big Data Challenges allow to another way.

Big Data in cloud computing helps us Privacy


solve some of today's biggest problems
with large amounts of data. But it is a Data privacy is the most important part of
continually evolving field and still has the cloud with every company wanting to
many limitations. In this part, we will be fully in control of what kind of
discuss some difficulties which need to information they should have access to.
overcome. Big Data security and protecting privacy
have become extremely difficult as more
online services offer users online transfer floods, and fires, information losses got to
data systems for analytics, cloud storage be nominal. to meet this demand, just in
etc. In fact, hackers can now easily attach case of any incident, information should be
themselves to emails and send login details quickly accessible with nominal period
to people whom they wish to provide and loss. However, though this is often a
unauthorized access towards captured really vital issue because the loss of
sensitive data from various companies. information can probably lead to the loss
Traditionally organizations have relied on of cash, it's vital to be able to respond
using various methods like firewalls, expeditiously to unsafe incidents. with
certain passwords or even structures within success deploying huge information
their own business networks to protect the DBMSs within the cloud and keeping it
privacy rights of their customers. invariably accessible and fault-tolerant
might powerfully rely upon disaster
Data Governance recovery mechanisms.

The data organization allows some steps to Other Challenges


follow from storage to privacy. In some
organizations without following, policies Within this section we discuss data
follow to work in storage data. transference onto the cloud, Exaflop
computing, and scalability and elasticity
Data governance also plays a role in issues in cloud computing and big data
regulatory control of compliance and
better-improved data quality and a) Transferring information onto a cloud
management tools. They want better result may be a terribly slow method and firms
making and better business outcomes for usually like better to physically send
the benefit of people. arduous drives to the data centers so data
will be uploaded. However, this is often
Improving data security. One of the data neither the foremost sensible nor the safest
governance is secure data and to any other resolution to transfer information onto the
untheorized misused. cloud. Through the years there has been a
shot to enhance and make economical
Ensuring data is used properly. They help
information transferring algorithms to
many users create data to privacy but some
reduce upload times and supply a secure
years organization errors show to miss the
way to transferring data onto the cloud,
data then claim to make data to not share
however, this method still remains a
any clue of the data. The data are limited
serious bottleneck.
to save and no other misused and all are
followed to policies. b) Scaling in the cloud and with big data
systems (sometimes called elasticity) is
Disaster Recovery
still new technology. Today, there are only
Data could be a terribly valuable business rudimentary ways of scaling when your
and losing information will definitely lead cloud-based systems are involved because
to losing price. just in case of emergency of high load times, inconsistent load
or unsafe accidents like earthquakes, distribution and other related difficulties.
Most systems don't have very good
automated methods for detecting a generation of systems, on the other
problem before it gets out of control and hand, must ensure that data is
many algorithms that are used today are accessible fast and that encryption
reactive or proactive meaning they either does not significantly slow down
monitor the problem or work to prevent it. processing times.
Unfortunately, in most cases it's after an ● Data standardisation can be used to
overload has already occurred that they address the diversity of Big Data.
begin working on the issue so their ● New and safe data uploading
reaction time can be too slow to help techniques based on QoS (quality
businesses keep up with high performance of service) may be the answer to
demands. making data uploading to the cloud
easier.
c) Exaflop computing is one of today’s ● Scalability and elasticity
problems. Supercomputers and clouds deal approaches exist, and numerous
with petabyte data sets, though they may Big Data suppliers, such as
not be able to handle exabytes since high Amazon and Microsoft, employ
performance and high bandwidth are them extensively. The main focus
required to transfer and process such huge is the development of completely
volumes of data over the network. Some automatic reactive and proactive
believe cloud computing is not ideal systems capable of automatically
because it is slower than supercomputers responding with load needs.
in some cases, since it's restrained by
existing bandwidth and latency. ● Exaflop computing is one of the
grand challenges that require
Solutions of these Challenge copious amounts of research and
funding from certain governments
In terms of current problems, following around the globe. To date, the best
are some probable advancements in the expedient computer solutions use
coming years: HPCs and GPUs;
● Data encryption can help with
security and privacy. A new
Conclusion
With the amount of data being generated [3] Subashini, S. & Kavitha, V., 2011. A
on a daily basis, big data technologies, survey on security issues in service
particularly analytic tools, have emerged delivery models of cloud computing.
as a key source of innovation, allowing Journal of Network and Computer
users to store, analyse, and retrieve Applications
information from petabyte datasets. Big
data systems benefit greatly from cloud [4] Tallon, P.P., 2013. Corporate
settings, which provide fault-tolerant, governance of big data: perspectives on
scalable, and available environments. value, risk, and cost

Despite the fact that big data systems are [5] Tene, O. & Polonetsky, J., 2012.
strong tools that allow businesses and Privacy in the Age of big data.
scientists to gain insights from data, there [6] Popović , K. & Hocenski, Z., 2015.
are certain challenges that need to be cloud computing security issues and
investigated more. It will take more time challenges.
and effort to implement security
procedures and standardise data formats. [6] Wood, T. et al., 2010. Disaster
recovery as a cloud service: Economic
Another important aspect of Big Data is
benefits & deployment challenges.
scalability, which is usually manual in
commercial solutions rather than
[7] L. Chang, R. Ranjan, Z. Xuyun, Y.
automatic.
Chi, D. Georgakopoulos, C. Jinjun, Public
To address this issue, more study is Auditing for Big Data Storage in Cloud
required. In this case, we want to leverage Computing – a Survey, Computational
adaptive techniques to create a solution for Science and Engineering (CSE), 2013
providing elasticity across several IEEE 16th International Conference on,
dimensions of big data systems running in 2013, pp. 1128–1135
cloud settings. The purpose is to look at
[8]http://www.kciti.edu/wp-content/
the techniques that adaptable software can
uploads/2017/07/dbms_tutorial.pdf, 2015
employ to activate scalability at various
by Tutorials Point.
levels

REFERENCES [9] Zomaya, A. Y., & Sakr, S. (2017).


Handbook of Big Data Technologies.
[1] Majhi, S.K. & Shial, G., 2015. Springer.
Challenges in big data cloud Computing
And Future Research Prospects: A
Review. The Smart Computing Review.
[10]http://searchcio.techtarget.com/
[2] Zhang, L. et al., 2013. Moving big data definition/big-data-as-a-service-bdaas,
to the cloud. INFOCOM, 2013
[11]https://www.maximizer.com/blog/
Proceedings IEEE
entering-the-age-of-big-data-as-a-service/
[12]https://en.wikipedia.org/wiki/
Cloud_computing

[13] Fonseca, N., & Boutaba, R. (2015).


Cloud services, networking, and
management. John Wiley & Sons.

14] McAfee, Andrew, and Erik


Brynjolfsson. "Big data: the management
revolution." Harvard business review
90.10 (2012): 60-68.

You might also like