Lab 5

Uploaded by

ims

0% found this document useful (0 votes)

4 views1 page

Original Title

lab5

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

4 views1 page

Lab 5

Uploaded by

ims

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 1

Search inside document

Lab 5.

MPI – matrix operations and performance studies

Main goals of the assignment

• Understand the principles of collective communication.

• Learn about perfformance through experiments.

The problem to solve

General overview
A MPI-based parallel version of matrix to matrix multiplication is requested.

Background
See the description from Lab 2.

How to paralelize
Similar with the solution from Lab 2.

To do
1. Recall the sequential code and OpenMP code from Lab 2 for multiplying two matrices.
2. Write the parallel code that uses MPI (hint: see the code from the text book from page 316 - bottom; note that
bands of all three matrices are split and distributed to the active processes, not only two matrices as in the case of
Lab 2).
3. Introduce time records (hint: MPI Wtime) at the start and the end of the code (without including the final displaying
of checked values).
4. Record the times for 1 to maximum number of processors that are available for the dimension of the matrices of
1600, 2000, respectively 2400, compute the speedups and display them in a graphic (similar with lab 2).
5. Compare the speedups obtained with MPI and the ones obtained with OpenMP when running on one computer.
Which one provides a faster response? Explain! (but Remember that MPI is to be used for multiple servers, while
OpenMP is a single multi-core system.
6. Investigate if another communication schema is more efficient. Eg. allocate both matrices in one process and then
distribute the parts necessary to be used in computation to other processes. Is this schema providing a faster response
of the code?

Computer Architecture Question Bank
Document16 pages
Computer Architecture Question Bank
Greenkings
100% (1)
NumPy Recipes
From Everand
NumPy Recipes
Martin McBride
No ratings yet
Parallel Computing Lab Manual PDF
Document51 pages
Parallel Computing Lab Manual PDF
SAMINA ATTARI
No ratings yet
Parallel and Distributed Computing
Document10 pages
Parallel and Distributed Computing
Raunak Sharma
50% (2)
Fallsem2019-20 Cse4001 Eth Vl2019201001348 Reference Material Cse4001 Parallel and Distributed Computing May 2019 (003) 18
Document4 pages
Fallsem2019-20 Cse4001 Eth Vl2019201001348 Reference Material Cse4001 Parallel and Distributed Computing May 2019 (003) 18
pranav
No ratings yet
Master in High Performance Computing Advanced Parallel Programming LABS
Document2 pages
Master in High Performance Computing Advanced Parallel Programming LABS
nijota1
No ratings yet
Parallel Processing Message Passing Interface
Document7 pages
Parallel Processing Message Passing Interface
omar alani
No ratings yet
Lab 4
Document1 page
Lab 4
ims
No ratings yet
Lab 2: Brief Tutorial On Openmp Programming Model: Adrián Álvarez, Sergi Gil Par4207 2019/2020
Document11 pages
Lab 2: Brief Tutorial On Openmp Programming Model: Adrián Álvarez, Sergi Gil Par4207 2019/2020
Adrián Alvarez
No ratings yet
High Performance Computing-1 PDF
Document15 pages
High Performance Computing-1 PDF
Priyanka Jadhav
No ratings yet
Data Parallel Patterns
Document9 pages
Data Parallel Patterns
juanarcos_778612
No ratings yet
Parallel Programming
Document17 pages
Parallel Programming
Yang Yi
No ratings yet
CD Multicore
Document10 pages
CD Multicore
bhalchimtushar0
No ratings yet
22 R. A. Kendall Et Al.: 1.3.2 The Openmp Model
Document4 pages
22 R. A. Kendall Et Al.: 1.3.2 The Openmp Model
latinwolf
No ratings yet
Lectia 4. Aspecte Comparative Ale Modelelor de Programare Paralela Mpi Si Openmp
Document9 pages
Lectia 4. Aspecte Comparative Ale Modelelor de Programare Paralela Mpi Si Openmp
Slavic Bodiştean
No ratings yet
Fig. 1.22. Parallel Sparse Matrix Multiply Code Fragment, in Fortran, That Times The Operation
Document5 pages
Fig. 1.22. Parallel Sparse Matrix Multiply Code Fragment, in Fortran, That Times The Operation
latinwolf
No ratings yet
PDC - Lecture - No. 2
Document31 pages
PDC - Lecture - No. 2
nauman tariq
No ratings yet
Parallel Performance Study of Monte Carlo Photon Transport Code On Shared-, Distributed-, and Distributed-Shared-Memory Architectures
Document7 pages
Parallel Performance Study of Monte Carlo Photon Transport Code On Shared-, Distributed-, and Distributed-Shared-Memory Architectures
vermashanu89
No ratings yet
Unit3 RMD PDF
Document25 pages
Unit3 RMD PDF
Monika
No ratings yet
K Means Clustering Using Openmp: Subject: Operating Systems
Document12 pages
K Means Clustering Using Openmp: Subject: Operating Systems
Thìn Nguyễn
No ratings yet
Ab Initio Quantum Chemistry On The Ibm Pseries 690: Ibm Performance Technical Report
Document19 pages
Ab Initio Quantum Chemistry On The Ibm Pseries 690: Ibm Performance Technical Report
KOUROOSH
No ratings yet
Task Level Parallelization of All Pair Shortest Path Algorithm in Openmp 3.0
Document4 pages
Task Level Parallelization of All Pair Shortest Path Algorithm in Openmp 3.0
Hoàng Văn
No ratings yet
18 R. A. Kendall Et Al.: Mpi Put
Document4 pages
18 R. A. Kendall Et Al.: Mpi Put
latinwolf
No ratings yet
Master in High Performance Computing Advanced Parallel Programming MPI: Topologies and Neighborhood Collectives
Document1 page
Master in High Performance Computing Advanced Parallel Programming MPI: Topologies and Neighborhood Collectives
nijota1
No ratings yet
Master in High Performance Computing Advanced Parallel Programming MPI: Nonblocking Collective Communications
Document1 page
Master in High Performance Computing Advanced Parallel Programming MPI: Nonblocking Collective Communications
nijota1
No ratings yet
LAB 9 - The Performance of MIPS: Goals
Document6 pages
LAB 9 - The Performance of MIPS: Goals
Joey Wang
No ratings yet
Nscet E-Learning Presentation: Listen Learn Lead
Document67 pages
Nscet E-Learning Presentation: Listen Learn Lead
durai murugan
No ratings yet
3unit3 Mca Pecnotes
Document23 pages
3unit3 Mca Pecnotes
Monika
No ratings yet
CCP - Parallel Computing
Document10 pages
CCP - Parallel Computing
Naveen Setty
No ratings yet
Department of Computing: Lab 7: Programming Threads CLO4 (Develop Programs To Interact With OS Components Through Its API)
Document4 pages
Department of Computing: Lab 7: Programming Threads CLO4 (Develop Programs To Interact With OS Components Through Its API)
Waseem Abbas
No ratings yet
Computer Architecture 16 Marks
Document28 pages
Computer Architecture 16 Marks
Balachandar2000
100% (1)
Quick Sort
Document5 pages
Quick Sort
dmaheshk
No ratings yet
Practical-02 PC
Document12 pages
Practical-02 PC
SAMINA ATTARI
No ratings yet
Agenda 23 2 39 568 715-1
Document2 pages
Agenda 23 2 39 568 715-1
hamzaaligamer710
No ratings yet
Parallel Execution of A Parameter Sweep For Molecular Dynamics Simulations in A Hybrid GPU/CPU Environment
Document10 pages
Parallel Execution of A Parameter Sweep For Molecular Dynamics Simulations in A Hybrid GPU/CPU Environment
Syd Barrett
No ratings yet
Simultaneous Multithreading Processor
Document4 pages
Simultaneous Multithreading Processor
simon sylvester
No ratings yet
Parallel and Distributed Computing Lab Digital Assignment - 5
Document7 pages
Parallel and Distributed Computing Lab Digital Assignment - 5
ajay
No ratings yet
Performance Analysis of Parallel Algorithms On Mul
Document11 pages
Performance Analysis of Parallel Algorithms On Mul
RIYA GUPTA
No ratings yet
Parallel and Distributed Computing-CSE4001L Assessment-2
Document1 page
Parallel and Distributed Computing-CSE4001L Assessment-2
Nagul
No ratings yet
WINSEM2022-23 CSE4001 ETH VL2022230504003 Reference Material I 22-12-2022 M-2 1parallel Architectures
Document18 pages
WINSEM2022-23 CSE4001 ETH VL2022230504003 Reference Material I 22-12-2022 M-2 1parallel Architectures
neha
No ratings yet
Towards Transparent Parallel/Distributed Support For Real-Time Embedded Applications
Document4 pages
Towards Transparent Parallel/Distributed Support For Real-Time Embedded Applications
bhalchimtushar0
No ratings yet
1
Document11 pages
1
aiadaiad
No ratings yet
Introduction To Supercomputing: Tina Benhaim Mary Li
Document25 pages
Introduction To Supercomputing: Tina Benhaim Mary Li
gisselydsz
No ratings yet
Lecture15 PDF
Document32 pages
Lecture15 PDF
Daniel Mora
No ratings yet
LAB 9 - The Performance of MIPS: Goals
Document5 pages
LAB 9 - The Performance of MIPS: Goals
Chris Nutsukpui
No ratings yet
PDC
Document14 pages
PDC
zapperfatter
No ratings yet
Begin Parallel Programming With OpenMP - CodeProject
Document8 pages
Begin Parallel Programming With OpenMP - CodeProject
ManojSudarshan
No ratings yet
MPI The Best Possible Practise
Document5 pages
MPI The Best Possible Practise
kam_anw
No ratings yet
Openmp
Document127 pages
Openmp
ivofrompisa
No ratings yet
Understanding The Behavior and Performance of Non-Blocking Communications in MPI
Document10 pages
Understanding The Behavior and Performance of Non-Blocking Communications in MPI
whatever
No ratings yet
SPCC Ia1
Document6 pages
SPCC Ia1
vapog80368
No ratings yet
Comp Architecture Sample Questions
Document9 pages
Comp Architecture Sample Questions
Mohamaad Sihatth
No ratings yet
A Specimen of Parallel Programming: Parallel Merge Sort Implementation
Document6 pages
A Specimen of Parallel Programming: Parallel Merge Sort Implementation
razinelb
No ratings yet
Introduction To MPI Ranger Lonestar
Document67 pages
Introduction To MPI Ranger Lonestar
tareqkh1
No ratings yet
Code: First Method:: (1) Write A C Program Using Open MP To Estimate The Value of PI (Use Minimum Two Methods)
Document8 pages
Code: First Method:: (1) Write A C Program Using Open MP To Estimate The Value of PI (Use Minimum Two Methods)
Salina Ranabhat
No ratings yet
Answer:: Answer: The Two Models of Interprocess Communication Are Message-Passing Model and
Document7 pages
Answer:: Answer: The Two Models of Interprocess Communication Are Message-Passing Model and
Cao Hùng
No ratings yet
Linpack Benchmark MatLab PDF
Document3 pages
Linpack Benchmark MatLab PDF
Angeel Vaalle
No ratings yet
Assignment Unit-2
Document1 page
Assignment Unit-2
16FYCM-IDhakane Aditya Arun
No ratings yet
Hybrid Mpi-Openmp Parallelism in The Onetep Linear-Scaling Electronic Structure Code: Application To The Delamination of Cellulose Nano Fibrils
Document13 pages
Hybrid Mpi-Openmp Parallelism in The Onetep Linear-Scaling Electronic Structure Code: Application To The Delamination of Cellulose Nano Fibrils
Wassini Bens
No ratings yet
Parallel Quicksort Implementation Using Mpi and Pthreads: Puneet Kataria RUID - 117004233
Document14 pages
Parallel Quicksort Implementation Using Mpi and Pthreads: Puneet Kataria RUID - 117004233
fff
No ratings yet
23.00 FD An2 s1 Civil - Eng. Topography 23-24
Document5 pages
23.00 FD An2 s1 Civil - Eng. Topography 23-24
ims
No ratings yet
22.00 - FD - An2 - s1 - CCIA-eng - Numerical Methods - 23-24
Document4 pages
22.00 - FD - An2 - s1 - CCIA-eng - Numerical Methods - 23-24
ims
No ratings yet
20.0 - FD - An2 - s1 - CCIA-eng - Mechanics II - 23-24
Document4 pages
20.0 - FD - An2 - s1 - CCIA-eng - Mechanics II - 23-24
ims
No ratings yet
ComponentSpecification2014 Prez
Document64 pages
ComponentSpecification2014 Prez
ims
No ratings yet
5.00 FD An1 s1 CE Chemistry 23-24
Document4 pages
5.00 FD An1 s1 CE Chemistry 23-24
ims
No ratings yet
Maryam Tavakkoli Master Thesis TUDelft
Document90 pages
Maryam Tavakkoli Master Thesis TUDelft
ims
No ratings yet
8 Services and IPC Parts 14 15 and 16
Document89 pages
8 Services and IPC Parts 14 15 and 16
ims
No ratings yet
Graph Definition
Document18 pages
Graph Definition
ims
No ratings yet
Components Intro 2022
Document40 pages
Components Intro 2022
ims
No ratings yet
Method of Coupling Metrics For Object-Oriented Sof
Document20 pages
Method of Coupling Metrics For Object-Oriented Sof
ims
No ratings yet
8 Services and IPC Parts 22 23 24
Document76 pages
8 Services and IPC Parts 22 23 24
ims
No ratings yet
PC 1
Document53 pages
PC 1
ims
No ratings yet
8 Services and IPC Part 17
Document37 pages
8 Services and IPC Part 17
ims
No ratings yet
Lab 2
Document2 pages
Lab 2
ims
No ratings yet
Lab 6
Document2 pages
Lab 6
ims
No ratings yet
Review DS
Document7 pages
Review DS
ims
No ratings yet
Final 12sp Solution
Document12 pages
Final 12sp Solution
ims
No ratings yet
cs170 Fa2013 mt1 Rao Soln
Document10 pages
cs170 Fa2013 mt1 Rao Soln
ims
No ratings yet