This document outlines the modules and contents of a Hadoop Developer training course. The course covers topics including Linux, Hadoop, HDFS, MapReduce, Pig, Hive, Sqoop, HBase, Flume, Oozie and MongoDB. It provides details on the hardware requirements, installation of software, concepts and programming languages used in Hadoop such as MapReduce, Pig and Hive. The course also includes hands-on projects and assistance with Hadoop certification.
Requirements:
- Core Java knowledge is preferable.
Hardware Requirements:
- Systems must have at least 2 GB of RAM.
Contents:
VirtualBox/VMware
a. Basics
b. Installations
c. Backups
d. Snapshots
Linux
a. Basics
b. Installations
c. Commands
Hadoop
a. Why Hadoop?
b. Scaling
c. Distributed Framework
d. Hadoop vs. RDBMS
e. Brief history of Hadoop
Setup Hadoop
a. Pseudo mode
b. Cluster mode
c. SSH
d. Installation of Java, Hadoop
e. Configurations of Hadoop
f. Common errors when running a Hadoop cluster, and their solutions
HDFS - Hadoop Distributed File System
a. HDFS Design and Architecture
b. HDFS Concepts
c. Interacting with HDFS using the command line
d. Interacting with HDFS using Java APIs
e. Dataflow
f. Blocks
g. Replica
Hadoop Processes
a. Name node
b. Secondary name node
c. Job tracker
d. Task tracker
e. Data node
Map Reduce
a. Developing Map Reduce Application
b. Phases in Map Reduce Framework
c. Map Reduce Input and Output Formats
d. Advanced Concepts
e. Sample Applications
f. Combiner
g. HAR (Hadoop Archives)
Joining datasets in Map Reduce jobs
a. Map-side join
b. Reduce-side join
Map Reduce customization
a. Custom Input format class
b. Hash Partitioner
c. Custom Partitioner
d. Sorting techniques
e. Custom Output format class
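
As a rough illustration of the hash and custom partitioner topics listed above, the following sketch shows how a key might be mapped to a reducer. It is plain Java and deliberately does not use the Hadoop API, so it runs without a cluster. The class name `PartitionerSketch` and the first-letter rule in `alphabetPartition` are hypothetical and chosen only for this example; `hashPartition` mirrors the arithmetic of Hadoop's default hash partitioner (clear the sign bit of the key's hash code, then take the remainder by the number of reducers).

```java
import java.util.List;

public class PartitionerSketch {

    // Hash partitioning: mask off the sign bit so the result is never
    // negative, then take the remainder by the number of reduce tasks.
    public static int hashPartition(String key, int numReduceTasks) {
        return (key.hashCode() & Integer.MAX_VALUE) % numReduceTasks;
    }

    // Hypothetical custom rule (an assumption for illustration only):
    // keys starting with a-m go to reducer 0, everything else to reducer 1.
    // With fewer than two reducers, all keys go to reducer 0.
    public static int alphabetPartition(String key, int numReduceTasks) {
        if (key.isEmpty() || numReduceTasks < 2) {
            return 0;
        }
        char first = Character.toLowerCase(key.charAt(0));
        return (first >= 'a' && first <= 'm') ? 0 : 1;
    }

    public static void main(String[] args) {
        List<String> keys = List.of("hadoop", "pig", "hive", "sqoop");
        for (String key : keys) {
            System.out.println(key
                    + " -> hash partition " + hashPartition(key, 3)
                    + ", alphabet partition " + alphabetPartition(key, 2));
        }
    }
}
```

In a real job the same logic would live in a class extending Hadoop's `Partitioner` and be registered on the job; the point here is only that a partitioner is a deterministic function from key to reducer index, so all records with the same key reach the same reducer.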
Hadoop Programming Languages: PIG
a. Introduction
b. Installation and Configuration
c. Interacting with HDFS using PIG
d. Map Reduce Programs through PIG
e. PIG Commands
f. Loading, Filtering, Grouping
g. Data types, Operators
h. Joins, Groups
i. Sample programs in PIG
HIVE
a. Basics
b. Installation and Configurations
c. Commands
d. Hive user-defined functions (UDFs)
e. Hive UDAFs
SQOOP
a. Installation
b. Usage of Sqoop
c. Sqoop Queries
d. Sqoop Jobs
e. Sqoop with Hive & HBase
HBASE
a. Introduction to HBase
b. HBase vs. RDBMS
c. Exploring HBase Master and Region Servers
d. Column Families and Regions
e. Basic HBase shell commands
f. HBase table
FLUME & OOZIE
a. Brief Explanation of Contents
b. Where will these be used?
c. Overview of Flume & Oozie with Examples
MONGODB
a. Why MongoDB?
b. Comparison with NoSQL
c. Complete Knowledge
PROJECT ON HADOOP, AS WELL AS A COMPLETE ARCHITECTURE EXPLANATION WITH PROJECT CODE.
MONGODB POC WILL BE PROVIDED.
CERTIFICATION ASSISTANCE WILL BE THERE.