Professional Documents
Culture Documents
Hive
BigData Hadoop o Comparison with RDBMS
Fundamentals o HQL
o Data Storage and Analysis o Data types
o Comparison with RDBMS o Importing and Exporting
Hadoop – A Brief History o Partitioning and Bucketing – Advanced.
o Joins and Join Optimization.
HDFS
o Functions- Built in & user defined
o Blocks
o Advanced Optimization of HQL
o NN & DN
o Storage File Formats – Advanced
o HDFS Federation & High Availability
o Loading and Storing Data
HDFS Clients o SerDes – Advanced
o HDFS Command Line
Sqoop
o HDFS CLI – File System Operations Lab
o Introduction
o HDFS Web UI
o Import – Deep dive
o HDFS Java Client
o Export – Deep dive
o HDFS Java Client – File System Operations
o Sqoop Optimization – Incremental Load
Lab
o Real time scenarios
o CRUD Operations using Java Client
YARN – Cluster Management (Hadoop Flume
o Configure Flume and Import data
2.x) o Architecture and LAB
o How Yarn Applications run?
o YARN vs Map Reduce
Oozie
o Different workflow jobs
o YARN Scheduling
o Ooze scheduler. LAB
Capacity, Fair Scheduler, FIFO
Map Reduce HBase
o MR Programming Model o NoSQL databases Introduction
o Input Formats o CAP theorem
o Output Formats o HBase Architecture
o Compression o HBase Clients – Java Client
o Serialization & Data Types o Loading Data
o File Based Data structures o Hive – HBase Integration
o Sequence file, Map File, ORC, Parquet Monitoring the Cluster
o Tuning Map Reduce Jobs o Horton Works Ambari
o Advanced Map Reduce o Cloudera Manager
Joins -Map-side, o MapR MCS
Reduce-side o HUE, RM UI
Distributed Cache Real Time Project
1/2 | P a g e
TechGeest Solutions
www.techgeest.com Opp Manyatha Tech Park,
+91-9620828049, 8095799993 Gate No:1 (IBM), 2nd Floor,
(By Real Time Expert) Siddhartha Learning Academy,
Above Kuttunad Restaurant
2/2 | P a g e
TechGeest Solutions
www.techgeest.com Opp Manyatha Tech Park,
+91-9620828049, 8095799993 Gate No:1 (IBM), 2nd Floor,
(By Real Time Expert) Siddhartha Learning Academy,
Above Kuttunad Restaurant
3/2 | P a g e