Professional Documents
Culture Documents
Some functions are built-in functions in pig like COUNT, SUM, MAX…..So I can use them directly in
my pig command
Now if I do not want to use register command there are two things:-
1. Either my piggybank.jar or the udfs jar will be in lib folder of pig, I do not need to use register
command
2. There is a property variable called “pig.additional.jars” in pig.properties file, here I can give
the path of my piggybank jar or udfs jar or I can use this property in the command line like
Pig –d pig.additional.jars ‘path of jar’ then this property will be set for this particular session.
Now if I do not want to give full path of function then again there are two things:-
Pig can eat anything means they can store and operate on any data whether it is structured,
unstructured or semi structured
Pig lives anywhere it mean pig is design to be a language for parallel processing but not tied to a
particular framework. It can live on mapreduce or spark or apache Tej framework.
Pig are domestic animal means it is designed to be easily controlled and modified by its users
Pig allows integration of user code wherever possible. It support UDFs written in java or in a scripting
language that can be compiled down to java like Jython.
Pigs fly means Pig processes data quickly. We want to consistently improve performance, and not
Implement features in ways that weigh Pig down so it can’t fly.
Q. Comments in pig?
Ans: they use both SQL type single line comments (--) and java like multiline comments
(/*………….*/)