/  30
 
 
Apache Pig
Pittsburgh Hadoop User Group
11/3/2009
Dmitriy RyaboyAshutosh Chauhan
 
 
In This Talk
What is Pig and why it’s needed
 –This part is going to be brief. –It’s a Hadoop User Group, after all.
Examples
 –As well as some extra motivation
Advanced FeaturesImprovements currently in developmentInteresting research problems
 –Want to get involved?
 
 
What is Pig
or, “duality of Pig”
Pig compiles dataanalysis tasks intoMap-Reduce jobs andruns them on Hadoop.Pig Latin is a languagefor expressing datatransformation flows.
Pig can be made to understand other languages, too.There is an SQL prototype. Just a question of compiling alanguage into internal operator tree.
See: Smith, Agent. The Matrix Trilogy, Warner Bros.

Share & Embed

More from this user

Add a Comment

Characters: ...