Professional Documents
Culture Documents
management if you want to maintain efficient data pipelines. Learn Sqoop, Flume and
Oozie in this module.
2nd step phase 1: Big data processing-After learning how to collect and manage
data, learn how to process it with skills in MapReduce, Pig, Hive, HQL & HBase.
MapReduce phases
Data processing methods
Handling big data with Apache Pig
Architecture of Hive
Database creation and operation of tables
Hive query language (HQL) statements
HBase NoSQL database and integration with Hadoop