BIG DATA & HADOOP

Data – The Most Valuable Resource

“In its raw form oil has little value. Once processed
and refined, it helps power the world.”
-Ann Winblad

“Data is the new oil.” -Clive Humby, CNBC


Types of Data
• The following three types of data can be identified:

Structured Data:
Data that is represented in a tabular format or that follows
a defined schema.
E.g.: Databases

Semi-Structured Data:
Data that does not follow a rigid tabular schema but carries
tags or markers that describe its elements.
E.g.: XML files

Unstructured Data:
Data that does not have a predefined data model.
E.g.: Text files, PDFs, images, videos, social media
data, emails, etc.
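
The three types above can be illustrated with a minimal Python sketch. The sample records, field names, and variable names are hypothetical and exist only for illustration; the sketch assumes nothing beyond the Python standard library.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Structured: every row follows the same fixed columns (a schema).
structured = io.StringIO("id,name,city\n1,Asha,Pune\n2,Ravi,Delhi\n")
rows = list(csv.DictReader(structured))

# Semi-structured: no fixed table, but tags describe each value,
# and records may carry different fields (only the second has <phone>).
semi_structured = """
<users>
  <user id="1"><name>Asha</name></user>
  <user id="2"><name>Ravi</name><phone>555-0100</phone></user>
</users>
""".strip()
root = ET.fromstring(semi_structured)
users = [
    {"id": user.get("id"), **{child.tag: child.text for child in user}}
    for user in root
]

# Unstructured: free text with no data model; any structure
# (here, a simple word count) has to be derived by processing.
unstructured = "Asha moved to Pune last year. Ravi still lives in Delhi."
word_count = len(unstructured.split())

print(rows)
print(users)
print(word_count)
```

In this sketch the CSV rows can be queried by column name directly, the XML records have to be walked tag by tag, and the free text yields nothing until some processing step is applied to it.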
History of Big Data
Need for Big Data
Following are the reasons why Big Data is needed:

• 90% of the data in the world today has been created in the last two years alone.
• 80% of the data is unstructured or exists in widely varying structures, which are difficult to analyze.
• Structured formats have some limitations with respect to handling large quantities of data.
• It is difficult to integrate information distributed across multiple systems.
• Most business users do not know what should be analyzed.
• Potentially valuable data is dormant or discarded.
• It is too expensive to justify the integration of large volumes of unstructured data.
• A lot of information has a short, useful lifespan.
Big Data & Its Sources
Big Data Architecture
Three Characteristics of Big Data
Importance of Big data
Big Data Applications