Professional Documents
Culture Documents
College, Ajmer
Execution of
UDF present In Map Reduce
LFS Function O/P File stored
in HDFS
To run pig in local mode, we need to access a
single machine; all files, jars, which are
going to process should be installed and run
in local environment.
This mode is considered, when there are
smaller set of data for testing the code.
Mapreduce is locally simulated with the local
JobRunner class of hadoop
Pig –x local
To run pig in Distributed mode, we need to
access Hadoop clusters and HDFS installation.
Map reduce mode is the default mode.
In this mode, pig translates the queries into
mapreduce job and runs the job on the
hadoop cluster. This cluster can be pseudo or
fully distributed cluster.
pig
pig –x mapreduce
SayingHello to Hive, Seeing How the Hive is
Put Together, Getting Started with Apache
Hive. Examining the Hive Clients. Working
with Hive Data Types. Creating and Managing
Databases and Tables, Seeing How the Hive
Data Manipulation Language Works, Querying
and Analyzing Data.