You are on page 1of 25

BIG DATA

BIG DATA STORAGE

Huge volume of data

PROCESS
What is BIG DATA?

Big Data is also data but with a huge


size. Big Data is a term used to describe
a collection of data that is huge in size
and yet growing exponentially with
time. In short such data is so large and
complex that none of the traditional data
management tools are able to store it or
process it efficiently.
HOW DO YOU CLASIFY DATA AS BIG
DATA

?
This is possible with the concept of
Value

5
Veracity

Variety

Velocity

V's Volume
CHARACTERISTIC OF BIG
DATA

Volume Velocity Variety Verocity Value

This refers to the sheer Speed in which data is Can use structured as well as Data reliability and trust. Extract useful data. Just
emanating and changes unstructured data. Different Verifying and validating the having big data is of no use
volume of data being
formats of data from various data unless we can turn it into
generated every second are occuring between the sources value
divirese data sets
There are three primary
sources of Big Data

SOCIAL TRANSAC-
MACHINE TIONAL
DATA DATA DATA
SO
CI
AL Social data comes from the Likes, Tweets &
DA Retweets, Comments, Video Uploads, and
TA general media that are uploaded and shared via
the world’s favorite social media platforms. This
kind of data provides invaluable insights into
consumer behavior and sentiment and can be
enormously influential in marketing analytics.
The public web is another good source of social
data, and tools like Google Trends can be used
to good effect to increase the volume of big
data.
DATA
Machine data is defined as information which is MAC
generated by industrial equipment, sensors that
are installed in machinery, and even web logs
HINE
which track user behavior. This type of data is
expected to grow exponentially as the internet of
things grows ever more pervasive and expands
around the world. Sensors such as medical
devices, smart meters, road cameras, satellites,
games and the rapidly growing Internet Of
Things will deliver high velocity, value, volume
and variety of data in the very near future.
DATA
TRAN
SACT
IONA
L Transactional data is generated from all the
daily transactions that take place both online and
offline. Invoices, payment orders, storage
records, delivery receipts – all are characterized
as transactional data yet data alone is almost
meaningless, and most organizations struggle to
make sense of the data that they are generating
and how it can be put to good use.
Types of Big Data
Un-
structured
Data

Stuctured
Data

Semi-
Structured
Data
Structured
Structured data are those type of data which Data
are stored already in an order. There are nearly
20% of the total exixting data are structured
data. All the data generated from Sensors,
Weblogs, these are all Machine Generted
Structured Dta . The human generated
structured data are those which are taken as
information from a human. Like their names,
addresses etc.

The example of Structured data is Database.


Structured
Data

Machines Humans

-Sensors Mainly includes all the data


-Weblogs that humans input into
-Financial Systems computers, such as names
and other persoonal details.
Unstructured
Unstructured data have no clear format in
Data
storage.We can store structured data in a row-
column database, but unstructured data cannot be
stored like that. At least 80% of data are
unstructured . All satelline--generated images,
scientific data or images are categorized as
machine generated unstructured data. There are
various types of human -generated unstructured
data. These are images, videos, social media data
etc.

The examples of Unstructured Data are text


document,PDFs, Images, videos etc.
Unstructured
Data

Machines Humans

-Satellite images
-scientific data
-Radar images
Semi-
Structured
Semi-structured data it is very difficult to Data
categorize this type of data. Sometimes they
look structured, or sometimes unstructured.
So that's why these data are known as semi
structured data. We cannot store these type
of data using traditional database format, but
it contains some organizational properties.

The examples of Semi-structured Data are


Spread Sheet files, XML or JSON
documents, NoSQL database data items etc.
Examples Of
Big Data

The New York Stock Exchange generates about one


terabyte of new trade data per day.
Examples Of
Social Media Big Data
The statistic shows that 500+terabytes
of new data get ingested into the
databases of social media site
Facebook, every day. This data is
mainly generated in terms of photo
and video uploads, message
exchanges, putting comments etc.
Examples Of
A single Jet engine can generate 10+terabytes of
Big Data
data in 30 minutes of flight time. With many
thousand flights per day, generation of data
reaches up to many Petabytes.
Big Data Tools
B e n e f i t s o f B i g D a t a P ro c e s s i n g

Businesses can utilize


outside intelligence while Improved customer
taking decisions service

Early identification of risk


Better operational
to the product/services, if
efficiency
any
Access to social data from search engines and sites like facebook, twitter
are enabling organizations to fine tune their business strategies.

0 Enter title
Traditional customer feedback systems are getting replaced by new
systems designed with Big Data technologies. In these new systems, Big
Conveniently architect value option. Seamlessly deliver.
Data and natural language processing technologies are being used to read
and evaluate consumer responses.

2
Big Data technologies can be used for creating a staging area or landing
zone for new data before identifying what data should be moved to the
data warehouse. In addition, such integration of Big Data technologies
and data warehouse helps an organization to offload infrequently accessed
data.
Thank you

You might also like