Professional Documents
Culture Documents
MapReduce is a programming model for data processing. Hadoop can run mapreduce programs
written in various languages like Java, Ruby, Python and C++. MapReduce programs run in
parallel fashion to analyze data of any size.
MapReduce Job
Map
Phase
Reduce
Phase
Explanation
baba
baba
black
1..
baba
black
[1,1]
baba
[1]
black
sheep [1]
sheep 1
Map Phase
Reduce Phase
Output