Hadoop Ecosystem
Apache Hadoop is a collection of open-source software for reliable, scalable, distributed computing.
Hadoop Common: The common utilities that support the other Hadoop subprojects.
HDFS: A distributed file system that provides high-throughput access to application data.
MapReduce: A software framework for distributed processing of large data sets on compute clusters.
Pig: A high-level data-flow language and execution framework for parallel computation.
HBase: A scalable, distributed database that supports structured data storage for large tables.
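
To make the MapReduce entry more concrete, here is a minimal single-process sketch of the map, shuffle, and reduce steps, using word counting as the example. It does not use the Hadoop API and does not run on a cluster. The function names (map_phase, shuffle, reduce_phase, word_count) are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for each whitespace-separated word in one line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group emitted values by key, mimicking the step between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Combine all counts emitted for one word into a single total."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over an iterable of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    # Hypothetical input; on a real cluster the lines would come from files in HDFS.
    print(word_count(["Hadoop stores data", "Hadoop processes data"]))
```

In a real deployment the framework would run many map and reduce tasks in parallel across machines and handle the grouping step itself. This sketch shows only the shape of the computation.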