• MapReduce is a parallel data processing technique for processing and analyzing large-scale, data-intensive problems. (The framework takes care of scheduling tasks, monitoring them, and re-executing any failed tasks.)
• The MapReduce model includes two important phases: the Map and Reduce functions.
• The Map function takes a set of data and converts it into another set of data, where individual elements are broken down into tuples (key/value pairs). The Reduce function then takes the output of a map as its input and combines those data tuples into a smaller set of tuples.

MAPREDUCE PROGRAMMING MODEL: Example taken from the net for reference

• A Word Count Example of MapReduce
• Let us understand how MapReduce works by taking an example where there is a text file called example.txt whose contents are as follows:
• Dear, Bear, River, Car, Car, River, Deer, Car and Bear
• Now suppose we have to perform a word count on example.txt using MapReduce to find the unique words and the number of occurrences of each of those unique words.
• First, we divide the input into three splits, as shown in the figure. This distributes the work among all the map nodes. (The file is divided into 3 subfiles, and each subfile is given to a Map node.)
• Then, the Map function within each Map node tokenizes the words in its file and gives a hardcoded value (1) to each of the tokens or words.
• Now, a list of key/value pairs is created, where the key is the individual word and the value is one. So, for the first line (Dear Bear River) we have 3 key/value pairs: (Dear, 1); (Bear, 1); (River, 1). The mapping process is the same on all the nodes.
• After the mapper phase, a partition process takes place, in which sorting and shuffling happen so that all the tuples with the same key are sent to the corresponding reducer.
• So, after the sorting and shuffling phase, each reducer has a unique key and a list of values corresponding to that key, e.g., Bear, [1,1]; Car, [1,1,1]; etc.
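The map and sort/shuffle steps above can be sketched in Python. This is a minimal single-process illustration of the idea, not a distributed implementation; the function names (`map_phase`, `shuffle`) are chosen here for clarity and are not part of any framework.

```python
from collections import defaultdict

def map_phase(line):
    """Map function: emit a (word, 1) pair for every token in one input split."""
    return [(word.strip(","), 1) for word in line.split()]

def shuffle(mapped_pairs):
    """Sort/shuffle step: group all values belonging to the same key."""
    groups = defaultdict(list)
    for key, value in mapped_pairs:
        groups[key].append(value)
    return dict(groups)

# The three input splits from the example.txt walkthrough.
splits = ["Dear Bear River", "Car Car River", "Deer Car Bear"]

# Each split would be mapped on a separate node; here we map them in turn.
mapped = [pair for split in splits for pair in map_phase(split)]
grouped = shuffle(mapped)
# After shuffling: grouped["Bear"] == [1, 1] and grouped["Car"] == [1, 1, 1],
# matching the "Bear, [1,1]; Car, [1,1,1]" lists sent to the reducers.
```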
(Input to Reduce Node)
• Now, each Reducer counts the values present in its list of values. As shown in the figure, the reducer gets the list of values [1,1] for the key Bear. It then counts the number of ones in the list and gives the final output: Bear, 2.
• Finally, all the output key/value pairs are collected and written to the output file.

The overall MapReduce word count process

Among the cluster of nodes, one acts as the "master" and the rest are "workers."
• The master is responsible for scheduling (it assigns the map and reduce tasks to the workers) and monitoring (it monitors task progress and worker health). (The master instructs which nodes should act as mappers and which should act as reducers, and it is also responsible for monitoring the health of each worker node.)
• When map tasks arise, the master assigns each task to an idle worker, taking data locality into account.
• A worker reads the content of the corresponding input split and emits key/value pairs. (The operation being performed and the corresponding analyzed value.)
• The intermediate key/value pairs produced by the Map function are first buffered in memory and then periodically written to local disk; the intermediate keys are sorted so that all occurrences of the same key are grouped together, and the partitioning function divides them into R sets.
• The master passes the locations of these stored pairs to the reduce workers, which read the buffered data using remote procedure calls (RPC).
• For each key, the worker passes the key and all of its corresponding intermediate values to the Reduce function. (The worker node gives the intermediate set to the Reduce function, which generates the output file.)
• Finally, the output is available in R output files (one per reduce task).