You are on page 1of 3

Duration of the course 3 days

Program Name Bigdata crash course Daywise

Hadoop Spark Nosql Module 1


Introduction to Big Data 1
Characteristics 1
Why, How and What s of Big data 1
Existing OLTP, ETL,DWH,OLAP 1

Module 2 1
Introduction to Hadoop Ecosystem 1
Architecture-HDFS
Sharding , Distributed and Replication factor (SDR) 1
Daemons 1
Hadoop Fs shell commands 1
Writing Data to HDFS 1
Reading Data from DFS 1
Map reduce and Yarn 1
Hands on 1

Module 3 1
Introduction to Hive Data warehouse 1
Hive QL Commands 1
Manipulation and anlytical function in hive 1
Managed table and external tables 2
Partitioning and Bucketing 2

Day 2

Module 4 2
Nosql Database 2
CAP theorem /BASE 2
The HBase Data Model
The HBase Shell 2
HBase Architecture 2
Schema Design 2
Module 5 2
Spark core and Components 2
Spark Shell 2
RDD 2
Dataframe /Dataset 2
Spark sql 2

Day 3
Module 6
Sqoop 3
Flume 3
Kafka 3
Oozie 3
HUE 3
Hands-on each module 3
Bigdata sources as Source and target 3
ETL integration with Hadoop (Informatica or SSIS ) 3
OLAP /Data visualization integration with Hadoop (Microstrategy o 3
Distribution : Cloudera /Horton works ( HDInsight ) /MapR or A 3
Daywise

You might also like