Welcome to Scribd!

Bda Expt7 - 60002190056

Uploaded by

0% found this document useful (0 votes)

4 views3 pages

The document discusses implementing a word count using MapReduce. It explains that the input data is divided and processed in parallel by map nodes, with mappers counting word frequencies as key-value pairs. The reducers then combine the values for each key from across the mappers to give a final count of word frequencies in the data. The experiment creates jar files, runs a word count on sample text, and displays the results, demonstrating how MapReduce can be used to efficiently count words in large datasets in a distributed manner.

Original Description:

Original Title

BDA EXPT7_60002190056

Copyright

Available Formats

DOCX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

4 views3 pages

Bda Expt7 - 60002190056

Uploaded by

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 3

Search inside document

Name: Krittika Roy SAP: 60002190056 BE E21

Department of Electronics and Telecommunication Engineering AcademicYear 2022-23

Aim: To implement word count using MapReduce

Theory:

Mapreduce programming model for word count can be explained using following example.

 The input data is divided into multiple segments, then processed in parallel to reduce
processing time. In this case, the input data will be divided into two input splits so that work
can be distributed over all the map nodes.
 The Mapper counts the number of times each word occurs from input splits in the form of key-
value pairs where the key is the word, and the value is the frequency.
 For the first input split, it generates 4 key-value pairs: This, 1; is, 1; an, 1; apple, 1; and for the
second, it generates 5 key-value pairs: Apple, 1; is, 1; red, 1; in, 1; color.
 It is followed by the shuffle phase, in which the values are grouped by keys in the form of key-
value pairs. Here we get a total of 6 groups of key-value pairs.
 The same reducer is used for all key-value pairs with the same key.
 All the words present in the data are combined into a single output in the reducer phase. The
output shows the frequency of each word.
 Here in the example, we get the final output of key-value pairs as This, 1; is, 2; an, 1; apple,
2; red, 1; in, 1; color, 1.
 The record writer writes the output key-value pairs from the reducer into the output files, and
the final output data is by default stored on HDFS.
Result:

Creating a new directory and creating the respective jar files :

Generated jar files are pasted in the desired directory.

New directory is also created and the contents are pasted in it.

Displaying the contents of the file:

Displaying the word counts of the file:

Name: Manav Mody SAP: 60002190058 BE E21

Conclusion:

In this experiment we learnt how to implement word count using MapReduce.

Assembly Programming:Simple, Short, And Straightforward Way Of Learning Assembly Language
From Everand
Assembly Programming:Simple, Short, And Straightforward Way Of Learning Assembly Language
Sherwyn Allibang
Rating: 5 out of 5 stars
5/5 (1)
R Data Science Quick Reference: A Pocket Guide to APIs, Libraries, and Packages
From Everand
R Data Science Quick Reference: A Pocket Guide to APIs, Libraries, and Packages
Thomas Mailund
No ratings yet
R Programming Interview
Document24 pages
R Programming Interview
Rahul Shrivastava
No ratings yet
Data Science in R Interview Questions and Answers
Document56 pages
Data Science in R Interview Questions and Answers
Arun
No ratings yet
Modern Introduction to Object Oriented Programming for Prospective Developers
From Everand
Modern Introduction to Object Oriented Programming for Prospective Developers
Fabian Gebert
No ratings yet
Map Reduce
Document18 pages
Map Reduce
Soni Nsit
No ratings yet
Practise Quiz Ccd-410 Exam (02-2014) - Cloudera Quiz Learning
Document50 pages
Practise Quiz Ccd-410 Exam (02-2014) - Cloudera Quiz Learning
ratneshkumarg
No ratings yet
BDA Experiment 3
Document7 pages
BDA Experiment 3
AYAAN Satkut
No ratings yet
Map-Reduce Is A Programming Model That Is Mainly Divided Into Two
Document2 pages
Map-Reduce Is A Programming Model That Is Mainly Divided Into Two
Fahim Muntasir
No ratings yet
Unit 2 - From Hadoop Streaming PDF
Document20 pages
Unit 2 - From Hadoop Streaming PDF
Gopal Agarwal
No ratings yet
Assignment 11 DSBDA
Document4 pages
Assignment 11 DSBDA
DARSHAN JADHAV
No ratings yet
Ecs765p W2
Document55 pages
Ecs765p W2
Yen-Kai Cheng
No ratings yet
UNIT 2-tt1
Document7 pages
UNIT 2-tt1
avm5439
No ratings yet
Hadoop Karunesh
Document14 pages
Hadoop Karunesh
Mukul Mishra
No ratings yet
MapReduce Tutorial
Document32 pages
MapReduce Tutorial
ShivanshuSingh
No ratings yet
Distributed and Cloud Computing
Document58 pages
Distributed and Cloud Computing
18JE0254 CHIRAG JAIN
No ratings yet
Bda Expt5 - 60002190056
Document5 pages
Bda Expt5 - 60002190056
kr
No ratings yet
Understanding MapReduce
Document15 pages
Understanding MapReduce
gopikrishna
No ratings yet
What Is MapReduce in Hadoop
Document5 pages
What Is MapReduce in Hadoop
Rakesh Shaw
No ratings yet
Map Reduce Tutorial-1
Document7 pages
Map Reduce Tutorial-1
jefferyleclerc
No ratings yet
DSBDA Manual Assignment 11
Document6 pages
DSBDA Manual Assignment 11
kartiknikumbh11
No ratings yet
Mapreduce Programming Model and Design Patterns: Andrea Lottarini January 17, 2012
Document23 pages
Mapreduce Programming Model and Design Patterns: Andrea Lottarini January 17, 2012
Andrea Lottarini
No ratings yet
CS-702 (D) BigData
Document61 pages
CS-702 (D) BigData
garima bh
No ratings yet
1c MR YARN Transcript
Document4 pages
1c MR YARN Transcript
NetF feree
No ratings yet
(BIG DATA) (MapReduce - Quick Guide, Tutorialspoint - Com)
Document36 pages
(BIG DATA) (MapReduce - Quick Guide, Tutorialspoint - Com)
Mony Simonetti
No ratings yet
Unit-2 (MapReduce-I)
Document28 pages
Unit-2 (MapReduce-I)
tripathineeharika
No ratings yet
Chapter 05 Google Re
Document69 pages
Chapter 05 Google Re
William
No ratings yet
Hadoop Wordcount Program
Document20 pages
Hadoop Wordcount Program
Arjun S
No ratings yet
Map Reduce Examples
Document7 pages
Map Reduce Examples
Singsg Singsg
No ratings yet
Itcertmaster: Safe, Simple and Fast. 100% Pass Guarantee!
Document6 pages
Itcertmaster: Safe, Simple and Fast. 100% Pass Guarantee!
Lokesh Makwane
No ratings yet
MapReduce: Simplified Data Processing On Large Clusters
Document13 pages
MapReduce: Simplified Data Processing On Large Clusters
zzztimbo
100% (1)
MapReduce Patterns, Algorithms, and Use Cases - Highly Scalable Blog
Document7 pages
MapReduce Patterns, Algorithms, and Use Cases - Highly Scalable Blog
jefferyleclerc
No ratings yet
Data Analytics in R
Document13 pages
Data Analytics in R
Meliodas Dmo
No ratings yet
BDA RepeatedImp Questions
Document30 pages
BDA RepeatedImp Questions
A32PRIT PATIL
No ratings yet
HKBK College of Engineering Department of Ise: Big Data Analytics (18Cs72) Seminar On The Topic Key-Value Pairs
Document15 pages
HKBK College of Engineering Department of Ise: Big Data Analytics (18Cs72) Seminar On The Topic Key-Value Pairs
Akhila R
100% (1)
Unit 3 MapReduce Part 1
Document12 pages
Unit 3 MapReduce Part 1
Ruparel Education Pvt. Ltd.
No ratings yet
8-1P Answers 2
Document6 pages
8-1P Answers 2
Najma Batool
No ratings yet
Bda Unit 1
Document13 pages
Bda Unit 1
CrazyYT Gaming channel
No ratings yet
Dean 08 Map Reduce
Document7 pages
Dean 08 Map Reduce
Lindsey Hoffman
No ratings yet
PPT1 Module2 Hadoop Distribution
Document23 pages
PPT1 Module2 Hadoop Distribution
Hiran Suresh
No ratings yet
Genetic Algorithm in Matlab (Exemple)
Document21 pages
Genetic Algorithm in Matlab (Exemple)
Astrid Lovasz
100% (1)
Welcome: Computer Programming-I
Document47 pages
Welcome: Computer Programming-I
Salim Isyaku
No ratings yet
Gating Mechanism Based Natural Language Generation For Spoken Dialogue Systems
Document29 pages
Gating Mechanism Based Natural Language Generation For Spoken Dialogue Systems
Frank Ayala
No ratings yet
Language Processing and Computational
Document10 pages
Language Processing and Computational
Carolyn Hardy
No ratings yet
Data Science Presentation
Document20 pages
Data Science Presentation
yadvendra dhakad
No ratings yet
Hadoop Interview Questions Author: Pappupass Learning Resource
Document16 pages
Hadoop Interview Questions Author: Pappupass Learning Resource
Dheeraj Reddy
No ratings yet
Unit-2 Bda Kalyan - Pagenumber
Document15 pages
Unit-2 Bda Kalyan - Pagenumber
rohit sai tech viewers
No ratings yet
CSEC IT Notes - Term 1 Week 9 - Problem Solving and Programming Lecture 2
Document47 pages
CSEC IT Notes - Term 1 Week 9 - Problem Solving and Programming Lecture 2
kxng ultimate
No ratings yet
Lecture Notes Map Reduce
Document24 pages
Lecture Notes Map Reduce
Yuvaraj V, Assistant Professor, BCA
No ratings yet
BigData Spark Sparklyr
Document80 pages
BigData Spark Sparklyr
michelle
No ratings yet
Unit III EBDP 2022
Document77 pages
Unit III EBDP 2022
Raju Jacob Raj
No ratings yet
Why MapReduce
Document8 pages
Why MapReduce
punekarmanish_347129
No ratings yet
Commnication Lab Manual 2021 Scheme
Document51 pages
Commnication Lab Manual 2021 Scheme
29 Naib rasool shaikh
No ratings yet
Lecture 10 MapReduce Hadoop
Document37 pages
Lecture 10 MapReduce Hadoop
DAVID HUMBERTO GUTIERREZ MARTINEZ
No ratings yet
Hadoop Interview Questions Faq
Document14 pages
Hadoop Interview Questions Faq
mihirhota
No ratings yet
Chapter 3MapReduce
Document30 pages
Chapter 3MapReduce
Komal
No ratings yet
Anatomy of A MapReduce Job
Document5 pages
Anatomy of A MapReduce Job
kumar
No ratings yet
Color Management System: Optimizing Visual Perception in Digital Environments
From Everand
Color Management System: Optimizing Visual Perception in Digital Environments
Fouad Sabry
No ratings yet
The Joys of Hashing: Hash Table Programming with C
From Everand
The Joys of Hashing: Hash Table Programming with C
Thomas Mailund
No ratings yet
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
From Everand
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
Sama Alshatali
No ratings yet
Experiment No: 2 Pig Latin Commands Aim
Document7 pages
Experiment No: 2 Pig Latin Commands Aim
kr
No ratings yet
Bda Expt6 - 60002190056
Document4 pages
Bda Expt6 - 60002190056
kr
No ratings yet
Bda Expt5 - 60002190056
Document5 pages
Bda Expt5 - 60002190056
kr
No ratings yet
BDA Exp4
Document7 pages
BDA Exp4
kr
No ratings yet
MWE Exp 4
Document6 pages
MWE Exp 4
kr
No ratings yet
Magic Tee As An Isolator
Document7 pages
Magic Tee As An Isolator
kr
No ratings yet
Mwe Expt3
Document5 pages
Mwe Expt3
kr
No ratings yet
Mode Pattern Analysis For RW
Document6 pages
Mode Pattern Analysis For RW
kr
No ratings yet
MWE Exp 1
Document7 pages
MWE Exp 1
kr
No ratings yet
Expt2 Mwe
Document4 pages
Expt2 Mwe
kr
No ratings yet