You are on page 1of 3

Apache Hadoop

Introduction
The Motivation For Hadoop
Problems with traditional large-scale systems
Requirements for a new approach
Hadoop: Basic Concepts
An Overview of Hadoop
The Hadoop Distributed File System
Hands-On Exercise
How MapReduce Works
Anatomy of a Hadoop Cluster
Other Hadoop Ecosystem Components

Writing a MapReduce Program


The MapReduce Flow
Examining a Sample MapReduce Program
Basic MapReduce API Concepts
The Driver Code
The Mapper
The Reducer
Hadoop’s Streaming API
Using Eclipse for Rapid Development
Hands-on exercise
The New MapReduce API
Integrating Hadoop Into The Workflow
Relational Database Management Systems
Storage Systems
Importing Data from RDBMSs With Sqoop
Hands-on exercise
Importing Real-Time Data with Flume
Accessing HDFS Using FuseDFS and Hoop
Delving Deeper Into The Hadoop API
More about ToolRunner
Testing with MRUnit
Reducing Intermediate Data With Combiners
The configure and close methods for Map/Reduce Setup and Teardown
Writing Partitioners for Better Load Balancing
Hands-On Exercise
Directly Accessing HDFS
Using the Distributed Cache
Common MapReduce Algorithms
Sorting and Searching
Indexing
Machine Learning With Mahout
Term Frequency – Inverse Document Frequency
Word Co-Occurrence
Hands-On Exercise
Using Hive and Pig
Hive Basics
Pig Basics
Hands-on exercise
Practical Development Tips and Techniques
Debugging MapReduce Code
Using LocalJobRunner Mode For Easier Debugging
Retrieving Job Information with Counters
Logging
Splittable File Formats
Determining the Optimal Number of Reducers
Map-Only MapReduce Jobs
Hands-On Exercise
More Advanced MapReduce Programming
Custom Writables and WritableComparables
Saving Binary Data using SequenceFiles and Avro Files
Creating InputFormats and OutputFormats
Hands-On Exercise
Joining Data Sets in MapReduce
Map-Side Joins
The Secondary Sort
Reduce-Side Joins
Graph Manipulation in Hadoop
Introduction to graph techniques
Representing graphs in Hadoop
Implementing a sample algorithm: Single Source Shortest Path
Creating Workflows With Oozie
The Motivation for Oozie
Oozie’s Workflow Definition Format
Hands-On Exercise

Partners :

NOIDA GREATER NOIDA GHAZIABAD FARIDABAD


A-43 & A-52, Sector-16, E - 35, SITE - 4, Near Swarna 1, Anand Industrial Estate, SCO-32, 1st Floor, Sec.-16,
Noida - 201301, (U.P.) INDIA Nagari, Adjacent J.P.
. Golf Near ITS College, Mohan Nagar, Faridabad (HARYANA)
Ph. : 0120-4646464 Course, Greater Noida (U. P.) Ghaziabad (U.P.) Ph. : 0129-4150605-09
M. : 09871055180 Ph. : 0120-4345190-91-92 to 97 Ph.: 0120-4835400...98-99 M : 09811612707
M. :09899909738, 09899913475 M : 09810831363 / 9818106660
: 08802288258 - 59-60
GURGAON JAIPUR GWALIOR
1808/2, 2nd floor old DLF, 38,Jai Jawan Colony 3rd, C-8, Ist floor, Opposite Aditya
Near Honda Showroom, Near Gaurav Tower,JLN College, Near Airtel Office,
Sec.-14, Gurgaon (Haryana) Marg, Jaipur (Rajsthan) City Centre, Gwalior (M.P.)
Ph. : 0124-4219095-96-97-98 Ph. : 0141-2550077, 2550202 Ph. : 0751-4078733-44
M. : 09873477222-333 M : 08824246937 M: 09754478733
www.facebook.com/ducateducation

You might also like