Bda Expt6 - 60002190056

This document describes implementing matrix multiplication using Hadoop MapReduce. It provides pseudocode for the map and reduce functions. The map function emits key-value pairs for each element, with the key being the row and column and value being the matrix element. The reduce function receives all values for a key, sorts them by the common index, extracts the elements and multiplies them, then sums the products to calculate the result matrix element. Logging confirms successful execution and storage of results. The conclusion states the aim was to learn how to implement matrix multiplication using MapReduce commands.

Name: Krittika Roy SAP: 60002190056 BE E21

Semester: VII
Subject: Big Data Analytics
Matrix Multiplication using Hadoop MapReduce Framework

Aim: To implement matrix multiplication using Map Reduce.

Theory:
Matrix Multiplication with One MapReduce Step:
There is often more than one way to use MapReduce to solve a problem. You may wish to
use only a single MapReduce pass to perform the matrix multiplication P = MN. It is possible
to do so if we put more work into the two functions. Start by using the Map function to create
the sets of matrix elements that are needed to compute each element of the answer P. Notice
that an element of M or N contributes to many elements of the result, so one input element
will be turned into many key-value pairs. The keys will be pairs (i, k), where i is a row of M
and k is a column of N. Here is a synopsis of the Map and Reduce functions.
The Map Function: For each element m_ij of M, produce all the key-value pairs
((i, k), (M, j, m_ij)) for k = 1, 2, ..., up to the number of columns of N. Similarly, for each
element n_jk of N, produce all the key-value pairs ((i, k), (N, j, n_jk)) for i = 1, 2, ..., up to
the number of rows of M. Here the M and N inside a value are not the matrices themselves but
tags (a single bit is enough) that tell which of the two matrices the value comes from.
The Reduce Function: Each key (i, k) will have an associated list with all the values
(M, j, m_ij) and (N, j, n_jk), for all possible values of j. The Reduce function needs to
connect the two values on the list that have the same value of j, for each j. An easy way to
do this step is to sort by j the values that begin with M and sort by j the values that begin
with N, in separate lists. The jth values on each list must have their third components, m_ij
and n_jk, extracted and multiplied. Then these products are summed, and the result is paired
with (i, k) in the output of the Reduce function.
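The Map and Reduce steps described above can be tried out in plain Python without a cluster. The following is only a minimal sketch under stated assumptions: the names map_element, reduce_cell and multiply are hypothetical, the shuffle phase is imitated with an in-memory dictionary, indices start at 0 instead of 1, and the matrices are dense, so the two sorted lists line up position by position.

```python
from collections import defaultdict

def map_element(tag, row, col, value, m_rows, n_cols):
    """Yield ((i, k), (tag, j, value)) pairs for a single matrix element."""
    if tag == "M":
        i, j = row, col
        for k in range(n_cols):        # one pair per column of N
            yield (i, k), ("M", j, value)
    else:
        j, k = row, col
        for i in range(m_rows):        # one pair per row of M
            yield (i, k), ("N", j, value)

def reduce_cell(key, values):
    """Sort the M and N values by j in separate lists, then sum the products."""
    from_m = sorted((v for v in values if v[0] == "M"), key=lambda v: v[1])
    from_n = sorted((v for v in values if v[0] == "N"), key=lambda v: v[1])
    # Dense assumption: the jth entry of each list refers to the same j.
    total = sum(a[2] * b[2] for a, b in zip(from_m, from_n))
    return key, total

def multiply(M, N):
    """Simulate one MapReduce pass computing P = MN for lists of lists."""
    m_rows, n_cols = len(M), len(N[0])
    groups = defaultdict(list)         # stands in for the shuffle phase
    for i, row in enumerate(M):
        for j, x in enumerate(row):
            for key, val in map_element("M", i, j, x, m_rows, n_cols):
                groups[key].append(val)
    for j, row in enumerate(N):
        for k, x in enumerate(row):
            for key, val in map_element("N", j, k, x, m_rows, n_cols):
                groups[key].append(val)
    P = [[0] * n_cols for _ in range(m_rows)]
    for key, values in groups.items():
        (i, k), total = reduce_cell(key, values)
        P[i][k] = total
    return P

print(multiply([[1, 2], [3, 4]], [[5, 6], [7, 8]]))
```

For the two 2 x 2 matrices in the last line this prints [[19, 22], [43, 50]]. On a real Hadoop job the grouping by key is done by the framework, not by user code.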
You may notice that if a row of the matrix M or a column of the matrix N is so large that it
will not fit in main memory, then the Reduce tasks will be forced to use an external sort to
order the values associated with a given key (i, k). However, in that case, the matrices
themselves are so large, perhaps 10^20 elements, that it is unlikely we would attempt this
calculation if the matrices were dense. If they are sparse, then we would expect many fewer
values to be associated with any one key, and it would be feasible to do the sum of products
in main memory.
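To get a feel for these sizes, the counts implied by the Map function can be worked out directly. The helper below is a rough sketch for the dense case only, with M of size m x n and N of size n x p; the name shuffle_stats is hypothetical.

```python
def shuffle_stats(m, n, p):
    """Key-value counts for dense M (m x n) times N (n x p) in the one-pass scheme."""
    pairs_from_m = m * n * p   # each of the m*n elements of M is emitted p times
    pairs_from_n = n * p * m   # each of the n*p elements of N is emitted m times
    return {
        "keys": m * p,                 # one key (i, k) per element of P
        "values_per_key": 2 * n,       # n values tagged M plus n values tagged N
        "total_pairs": pairs_from_m + pairs_from_n,
    }

print(shuffle_stats(2, 3, 4))
```

For m = 2, n = 3, p = 4 this gives 8 keys, 6 values per key and 48 pairs in total. The values_per_key figure is what each Reduce call has to hold or sort; with sparse matrices only the nonzero elements are emitted, so the figure is smaller.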
PseudoCode:
map(key, value):
    // A is m x n and B is n x p (A and B play the roles of M and N above)
    // value is ("A", i, j, a_ij) or ("B", j, k, b_jk)
    if value[0] == "A":
        i = value[1]
        j = value[2]
        a_ij = value[3]
        for k = 1 to p:
            emit((i, k), ("A", j, a_ij))
    else:
        j = value[1]
        k = value[2]
        b_jk = value[3]
        for i = 1 to m:
            emit((i, k), ("B", j, b_jk))

reduce(key, values):
    // key is (i, k)
    // values is a list of ("A", j, a_ij) and ("B", j, b_jk)
    hash_A = {j: a_ij for (x, j, a_ij) in values if x == "A"}
    hash_B = {j: b_jk for (x, j, b_jk) in values if x == "B"}
    result = 0
    for j = 1 to n:
        // assumes every j is present in both tables (dense matrices)
        result += hash_A[j] * hash_B[j]
    emit(key, result)
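The pseudocode can be turned into a small runnable Python simulation. This is a sketch under assumptions, not the code that was run on Hadoop: mapper, reducer and run_job are hypothetical names, the grouping by key is done with an in-memory dictionary, and the reducer skips any j missing from either table so that sparse input (zero entries left out) also works.

```python
from collections import defaultdict

def mapper(record, m, p):
    """record is ("A", i, j, a_ij) or ("B", j, k, b_jk); A is m x n, B is n x p."""
    if record[0] == "A":
        _, i, j, a_ij = record
        for k in range(1, p + 1):
            yield (i, k), ("A", j, a_ij)
    else:
        _, j, k, b_jk = record
        for i in range(1, m + 1):
            yield (i, k), ("B", j, b_jk)

def reducer(key, values):
    """Join the A and B values on j with two hash tables and sum the products."""
    hash_a = {j: v for tag, j, v in values if tag == "A"}
    hash_b = {j: v for tag, j, v in values if tag == "B"}
    # A j missing from either table corresponds to a zero entry, so it adds nothing.
    result = sum(a * hash_b[j] for j, a in hash_a.items() if j in hash_b)
    return key, result

def run_job(records, m, p):
    """Map every record, group the values by key, then reduce each group."""
    groups = defaultdict(list)
    for record in records:
        for key, value in mapper(record, m, p):
            groups[key].append(value)
    return dict(reducer(key, values) for key, values in groups.items())

# A = [[1, 0], [0, 2]] and B = [[0, 3], [4, 0]], nonzero entries only
records = [("A", 1, 1, 1), ("A", 2, 2, 2), ("B", 1, 2, 3), ("B", 2, 1, 4)]
print(run_job(records, m=2, p=2))
```

The printed dictionary maps each (i, k) to the corresponding element of the product, here 3 at (1, 2), 8 at (2, 1) and 0 elsewhere.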

Result:

Creating new directory and JAR files. Also checking the list of files.

Creating a Hadoop directory and putting the file in it. Also printing the content in it.

Storing and logging the sorted result.


Conclusion:

In this experiment we learnt how to implement matrix multiplication with MapReduce and how
to run it on Hadoop using commands.
