Professional Documents
Culture Documents
BIG DATA ANALYTICS Unit4 Part 2
BIG DATA ANALYTICS Unit4 Part 2
COURSE OUTCOMES
After successful completion of this course, the students should be able to ,
CO1: Identify the need for big data analytics for a domain.
CO2: Use Hadoop and Map Reduce framework.
CO3: Apply big data analytics for a given problem.
CO4: Suggest areas to apply big data to increase business outcome.
Shuffle and Sort
Shuffle Phase
Sort Phase
Sort Phase Cont’d
To increase disk i/o efficiency
TASK EXECUTION
Speculative Execution
Speculative Execution
Task JVM Reuse
Skipping Bad Records
Mapreduce Types and
Formats.
MAPREDUCE INPUT AND
OUTPUT
General form of Map/Reduce functions:
Partition function:
DESERIALIZATION :
- Receiver End .
- Process of converting the streams into series of structured object.