MapReduce BigData 09

Uploaded by

Seikh Sadi

0% found this document useful (0 votes)

12 views9 pages

What is MapReduce in big data

Original Title

MapReduce_BigData_09

Copyright

Available Formats

PPTX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

What is MapReduce in big data

Copyright:

Available Formats

Download as PPTX, PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

12 views9 pages

MapReduce BigData 09

Uploaded by

Seikh Sadi

What is MapReduce in big data

Copyright:

Available Formats

Download as PPTX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 9

Search inside document

MapReduce

Presented By:
Aashraful Islam Souraov
(ID 2025351009)
What is MapReduce? ①

• MapReduce is a processing technique and a program model for

distributed computing based on java.
• The MapReduce algorithm contains two important tasks, namely
Map and Reduce.
• Developer: Google ①
Why do we use MapReduce?

• MapReduce is a framework using which we can write applications

to process huge amounts of data, in parallel, on large clusters of
commodity hardware in a reliable manner.
What is the purpose of MapReduce?

• MapReduce serves two essential functions:

1. It filters and parcels out work to various nodes within the cluster or
map, a function sometimes referred to as the mapper,
2. It organizes and reduces the results from each node into a cohesive
answer to a query, referred to as the reducer.
The Algorithm of MapReduce
• Generally MapReduce paradigm is based on sending the computer to where
the data resides!
• MapReduce program executes in three stages, namely map stage, shuffle
stage, and reduce stage.
• Map stage − The map or mapper’s job is to process the input data. Generally the input
data is in the form of file or directory and is stored in the Hadoop file system (HDFS).
The input file is passed to the mapper function line by line. The mapper processes the
data and creates several small chunks of data.
• Reduce stage − This stage is the combination of the Shuffle stage and
the Reduce stage. The Reducer’s job is to process the data that comes from the
mapper. After processing, it produces a new set of output, which will be stored in the
HDFS.
The Algorithm of MapReduce

• During a MapReduce job, Hadoop sends the Map and Reduce tasks to the
appropriate servers in the cluster.
• The framework manages all the details of data-passing such as issuing tasks,
verifying task completion, and copying data around the cluster between the
nodes.
• Most of the computing takes place on nodes with data on local disks that
reduces the network traffic.
• After completion of the given tasks, the cluster collects and reduces the data to
form an appropriate result, and sends it back to the Hadoop server.
What are the Advantages & Disadvantages of
MapReduce?

Advantages:
• Scalable (due to simple design)
• Runs on cheap commodity hardware
• Procedural control i.e. we can control of the execution of every step
What are the Advantages & Disadvantages of
MapReduce?

Disadvantages:
• It is not flexible i.e. the MapReduce framework is rigid.
• A lot of manual coding is required, even for common operations such as
join, filter, projection, aggregates, sorting, distinct.
• Semantics are hidden inside the map and reduce functions, so it is difficult
to maintain, extend and optimize them

The Map Reduce Programming
Document15 pages
The Map Reduce Programming
manjunath
No ratings yet
Chapter 4 - Understanding Map Reduce Fundamentals
Document45 pages
Chapter 4 - Understanding Map Reduce Fundamentals
WEGENE ARGOW
No ratings yet
Sen-762 Advanced Big Data Analytics: Mapreduce
Document46 pages
Sen-762 Advanced Big Data Analytics: Mapreduce
بالیراجپوت
No ratings yet
Hadoop (Mapreduce)
Document43 pages
Hadoop (Mapreduce)
Nisrine Mofakir
No ratings yet
HadoopMapreduce Summerization
Document24 pages
HadoopMapreduce Summerization
Atharv Chaudhari
No ratings yet
Unit - III Advanced Analytics Technology and Tools
Document44 pages
Unit - III Advanced Analytics Technology and Tools
Diksha Chhabra
No ratings yet
Chapter 3MapReduce
Document30 pages
Chapter 3MapReduce
Komal
No ratings yet
DSBDA Manual Assignment 11
Document6 pages
DSBDA Manual Assignment 11
kartiknikumbh11
No ratings yet
777 1651400043 BD Module 4
Document21 pages
777 1651400043 BD Module 4
nimmy
No ratings yet
Unit 3 - Big Data Technologies
Document42 pages
Unit 3 - Big Data Technologies
prakash N
No ratings yet
Unit 3 Bda
Document59 pages
Unit 3 Bda
teja.ksp1801
No ratings yet
Unit-2 (MapReduce-II)
Document11 pages
Unit-2 (MapReduce-II)
tripathineeharika
No ratings yet
Parallel Data Processing in The Cloud
Document25 pages
Parallel Data Processing in The Cloud
Vinu Davis
No ratings yet
Big Data Unit 2 AKTU Notes
Document63 pages
Big Data Unit 2 AKTU Notes
abhijitraj229
No ratings yet
Lecturer 5
Document21 pages
Lecturer 5
Rebaz Mohsen
No ratings yet
Understand: The First Phase of Mapreduce Paradigm, What Is A Map/Mapper, What Is The Input To The
Document5 pages
Understand: The First Phase of Mapreduce Paradigm, What Is A Map/Mapper, What Is The Input To The
Anjali Adwani
No ratings yet
Term Paper Java
Document14 pages
Term Paper Java
Muskan Bharti
No ratings yet
Big Data and Analytics and MapReduce 29052023 054155pm
Document35 pages
Big Data and Analytics and MapReduce 29052023 054155pm
Talha Mughal
No ratings yet
05 Movies Data Analysis Using Mapreduce
Document20 pages
05 Movies Data Analysis Using Mapreduce
mohammadkhaja.shaik
No ratings yet
Big Data Engines: Binary Batch Processing
Document12 pages
Big Data Engines: Binary Batch Processing
Sonakshi Gupta
No ratings yet
Unit 3 Bba
Document11 pages
Unit 3 Bba
rajendrameena172003
No ratings yet
2 Hadoop Ecosystem
Document41 pages
2 Hadoop Ecosystem
tranngocbaooooo12062003
No ratings yet
BY K.Karthikeyan: Hadoop & Map Reduce
Document7 pages
BY K.Karthikeyan: Hadoop & Map Reduce
Karthik S
No ratings yet
Dynamicmr: A Dynamic Slot Allocation Optimization Framework For Map Reduce Clusters
Document18 pages
Dynamicmr: A Dynamic Slot Allocation Optimization Framework For Map Reduce Clusters
Harikrishnan Shunmugam
No ratings yet
226 Unit-6
Document32 pages
226 Unit-6
shivam saxena
No ratings yet
Data Science Presentation
Document20 pages
Data Science Presentation
yadvendra dhakad
No ratings yet
Matchmaking: A New Mapreduce Scheduling Technique: Digitalcommons@University of Nebraska - Lincoln
Document9 pages
Matchmaking: A New Mapreduce Scheduling Technique: Digitalcommons@University of Nebraska - Lincoln
Netra Jjoshi
No ratings yet
What Is MapReduce Process
Document3 pages
What Is MapReduce Process
Pam G.
No ratings yet
Mapreduce and Hadoop Ecosystem
Document64 pages
Mapreduce and Hadoop Ecosystem
Rin Rin Nurmalasari
No ratings yet
Chap 6 - MapReduce Programming
Document37 pages
Chap 6 - MapReduce Programming
Harshitha Raaj
No ratings yet
Map Reduce: Simplified Processing On Large Clusters
Document29 pages
Map Reduce: Simplified Processing On Large Clusters
Joy Bagdi
No ratings yet
1 Purpose: Single Node Setup Cluster Setup
Document1 page
1 Purpose: Single Node Setup Cluster Setup
p001
No ratings yet
BDL8 PDF
Document41 pages
BDL8 PDF
Mrs. Usha Naidu S
No ratings yet
BDA-MapReduce (1) 5rfgy656yhgvcft6
Document60 pages
BDA-MapReduce (1) 5rfgy656yhgvcft6
Ayush Jha
No ratings yet
By Christian Mechem and Geoff Crowley
Document11 pages
By Christian Mechem and Geoff Crowley
Christian Mechem
No ratings yet
Unit 2 Notes BDA
Document10 pages
Unit 2 Notes BDA
vasusrivastava138
No ratings yet
Unit - III
Document37 pages
Unit - III
praneelp2000
No ratings yet
PPT1 Module2 Hadoop Distribution
Document23 pages
PPT1 Module2 Hadoop Distribution
Hiran Suresh
No ratings yet
Bda - 3 Unit
Document18 pages
Bda - 3 Unit
ASMA UL HUSNA
No ratings yet
What Is MapReduce
Document6 pages
What Is MapReduce
Sundaram yadav
No ratings yet
Understanding MapReduce
Document15 pages
Understanding MapReduce
gopikrishna
No ratings yet
Q1. Discuss Hadoop and Map Reduce Algorithm.: Data Is Located
Document7 pages
Q1. Discuss Hadoop and Map Reduce Algorithm.: Data Is Located
Hîмanî Jayas
No ratings yet
Bda 03
Document10 pages
Bda 03
HARSH NAG
No ratings yet
Chapter 10
Document45 pages
Chapter 10
Sarita Samal
No ratings yet
Unit 3
Document27 pages
Unit 3
Radhamani V
No ratings yet
Intro To Apache Spark: Credits To CS 347-Stanford Course, 2015, Reynold Xin, Databricks (Spark Provider)
Document96 pages
Intro To Apache Spark: Credits To CS 347-Stanford Course, 2015, Reynold Xin, Databricks (Spark Provider)
Costi Stoian
No ratings yet
Unit 2 - From Hadoop Streaming PDF
Document20 pages
Unit 2 - From Hadoop Streaming PDF
Gopal Agarwal
No ratings yet
Hadoop
Document5 pages
Hadoop
Vaishnavi Chockalingam
No ratings yet
MapReduce Online
Document15 pages
MapReduce Online
Vyhx
No ratings yet
Map-Reduce Is A Programming Model That Is Mainly Divided Into Two
Document2 pages
Map-Reduce Is A Programming Model That Is Mainly Divided Into Two
Fahim Muntasir
No ratings yet
System Design and Implementation 5.1 System Design
Document14 pages
System Design and Implementation 5.1 System Design
sararajee
No ratings yet
Unit V FRAMEWORKS AND VISUALIZATION
Document71 pages
Unit V FRAMEWORKS AND VISUALIZATION
Yash Deep
No ratings yet
Unit Iv-1
Document84 pages
Unit Iv-1
keerthanavelmurugan02
No ratings yet
Module 3 - Mapreduce
Document40 pages
Module 3 - Mapreduce
Aditya Raj
No ratings yet
Unit 3 MapReduce Part 1
Document12 pages
Unit 3 MapReduce Part 1
Ruparel Education Pvt. Ltd.
No ratings yet
Map Reduce
Document36 pages
Map Reduce
Papai Rana
No ratings yet
Mapreduce in Cloud Computing: Mapreduce, Mapreduce Paradigm, Mapreduce Examples, Hadoop, Hdfs
Document10 pages
Mapreduce in Cloud Computing: Mapreduce, Mapreduce Paradigm, Mapreduce Examples, Hadoop, Hdfs
Muhammad umar
No ratings yet
Map Reduce
Document11 pages
Map Reduce
Indra Kishor Chaudhary Avaiduwai
No ratings yet
Shortnotes For Cloud
Document22 pages
Shortnotes For Cloud
Mahi Mahi
No ratings yet
SAS Programming Guidelines Interview Questions You'll Most Likely Be Asked: Job Interview Questions Series
From Everand
SAS Programming Guidelines Interview Questions You'll Most Likely Be Asked: Job Interview Questions Series
Vibrant Publishers
No ratings yet
North America Colour Trend Concepts A W 21 22
Document12 pages
North America Colour Trend Concepts A W 21 22
Raquel Mantovani
No ratings yet
SpaceClaim - Developers Guide-5-6
Document2 pages
SpaceClaim - Developers Guide-5-6
Alexgh1993
No ratings yet
A2Z Telugu Boothu Kathalu
Document26 pages
A2Z Telugu Boothu Kathalu
Bommalu
50% (32)
Machine Drawing Manual Using Autocad
Document103 pages
Machine Drawing Manual Using Autocad
AVINASH
No ratings yet
ABAP Proxy Generation - General Procedure
Document1 page
ABAP Proxy Generation - General Procedure
amit1381
No ratings yet
Recommender Systems With Generative Retrieval
Document11 pages
Recommender Systems With Generative Retrieval
manel
No ratings yet
Contact Tachometer: Model: DT-2235B
Document2 pages
Contact Tachometer: Model: DT-2235B
nisne0730
No ratings yet
Our Sexuality 13th Edition Crooks Test Bank
Document22 pages
Our Sexuality 13th Edition Crooks Test Bank
KaylaWalkerncas
100% (50)
Softube Console 1 Manual
Document83 pages
Softube Console 1 Manual
Adam Smith
No ratings yet
Method 2022
Document17 pages
Method 2022
we need you
No ratings yet
Road Runner
Document1 page
Road Runner
campbelljohntaylor
No ratings yet
SRS On Online Passport Registration
Document15 pages
SRS On Online Passport Registration
CLIFTON DANISH
No ratings yet
AI For Business Leaders Executive Program Syllabus PDF
Document12 pages
AI For Business Leaders Executive Program Syllabus PDF
Lab x
No ratings yet
CCTV Installation Checklist
Document6 pages
CCTV Installation Checklist
David Beech
No ratings yet
WEG CFW11 Anybus CC Manual 0899.5750 en
Document76 pages
WEG CFW11 Anybus CC Manual 0899.5750 en
Luis Salvatierra
No ratings yet
Coa Lab 4
Document6 pages
Coa Lab 4
05raiuttu
No ratings yet
Endoscopic Flushing Pump and Accessories
Document5 pages
Endoscopic Flushing Pump and Accessories
ЭнхЗаяа
No ratings yet
Index of - Scimag - Repository - Torrent PDF
Document20 pages
Index of - Scimag - Repository - Torrent PDF
rishit
No ratings yet
Cyber Security Internship
Document10 pages
Cyber Security Internship
R Bharath Kumar
100% (1)
Lom Log
Document2 pages
Lom Log
Vilma Cucul
No ratings yet
Propiñan de Melyor: Javier Martos Carretero
Document4 pages
Propiñan de Melyor: Javier Martos Carretero
Alejandro Parino
No ratings yet
Mad Mini Project
Document15 pages
Mad Mini Project
Nalla
No ratings yet
Moto G10 Service and Repair Manual V1.0
Document129 pages
Moto G10 Service and Repair Manual V1.0
Fenix On
No ratings yet
Reading 1 Draft
Document1 page
Reading 1 Draft
Nathaniel Dilag
No ratings yet
A Meta-Analysis of Use of Serious Games in Education Over A Decade
Document9 pages
A Meta-Analysis of Use of Serious Games in Education Over A Decade
Galdor Tasartir
No ratings yet
End Term Part I Examination December - 2018 COURSE: B. Tech. (CSE) Semester: 5 PAPER NAME (Paper Code) : JAVA PROGRAMMING (0500536)
Document3 pages
End Term Part I Examination December - 2018 COURSE: B. Tech. (CSE) Semester: 5 PAPER NAME (Paper Code) : JAVA PROGRAMMING (0500536)
Jyoti Totla
No ratings yet
Assignment 3. Summary of Bahasa Indonesia Bahasa Indonesia Indonesian Language
Document6 pages
Assignment 3. Summary of Bahasa Indonesia Bahasa Indonesia Indonesian Language
imas
No ratings yet
365C and 385C Excavators and Medium Pressure Electrical Schematic (Attachment) 385C MHPU
Document2 pages
365C and 385C Excavators and Medium Pressure Electrical Schematic (Attachment) 385C MHPU
Ibrahim Awad
No ratings yet
Allplan 2014 SBS SysadmNetwork
Document157 pages
Allplan 2014 SBS SysadmNetwork
can can
No ratings yet
Arduino Based Hand Gesture Control of Computer
Document7 pages
Arduino Based Hand Gesture Control of Computer
Vedant Dhote
No ratings yet