0% found this document useful (0 votes)
23 views8 pages

Understanding Big Data Concepts and Tools

Big data refers to the methodology for acquiring, processing, and analyzing large volumes of heterogeneous data that traditional systems cannot handle. It is classified by size from megabytes to yottabytes, with big data reaching up to exabytes. Hadoop is an open-source framework that enables distributed storage and processing of massive datasets using clusters of commodity hardware.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
23 views8 pages

Understanding Big Data Concepts and Tools

Big data refers to the methodology for acquiring, processing, and analyzing large volumes of heterogeneous data that traditional systems cannot handle. It is classified by size from megabytes to yottabytes, with big data reaching up to exabytes. Hadoop is an open-source framework that enables distributed storage and processing of massive datasets using clusters of commodity hardware.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

BIG DATA

INTRODUCTION
Evaluation of Data

DBMS OOPS Model Big Data

1960 1980 1990

1970 1985 2000


Data
RDBMS warehousing
Traditional Files
BIG DATA?
Big data is phrase or methodology defines
the process of acquire, cure, store,
process, analyze, visualize of huge
volume of heterogeneous data which
flows in different frequency with-in
stipulated period of time where the
traditional system lacks
Size Classification
SIZE

MB – Megabyte

GB – Gigabyte

TB – Terabyte

PB – Petabyte

EB – Exabyte

ZB – Zetabyte

YB - Yotabyte

DW – Max 2 PB
Bigdata – Upto EB
Global Vol – 20 to 25 ZB
Hadoop?
Hadoop is a open source TERMINOLOGIES
Apache Software DATACENTER
Foundation framework that CLUSTER
provides distributed RACK
storage & processing on
NODE
huge dataset that runs on
cluster of commodity DISK

hardware BLOCK

DAEMON

You might also like