Welcome to Scribd!

Strong Weak Scaling

Uploaded by

0% found this document useful (0 votes)

7 views5 pages

This document summarizes strong and weak scaling results from several applications. For strong scaling, it describes applications that achieved linear scaling up to 32 cores and 20x speedup, quadratic scaling up to 14,336 cores and 2.9x speedup, and logarithmic scaling up to 15 cores and 2.09x speedup. For weak scaling, it describes applications that maintained constant execution time as problem and core counts increased proportionally, including up to 16 and 256 cores.

Original Description:

Original Title

Strong_weak_scaling

Copyright

Available Formats

DOCX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

7 views5 pages

Strong Weak Scaling

Uploaded by

avinash kumar

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 5

Search inside document

ISS: INTRO TO SCALABLE SYSTEM

Strong Scaling and Weak Scaling Results

NAME: AVINASH KUMAR SR NO.21773 DATE:
NOV 2,2022

For strong scaling,

1)
a) Application: “HTS: A Threaded Multilevel Sparse Hybrid Solver”
b) Number of cores: 32
c) Scaling obtained: linear
d) speedup obtained: 20 times (5%)

HTS have over 20× speedup on 32 cores is incredible for sparse

linear solver packages

a) Application: “A scalable adaptive-matrix SPMV for heterogeneous architectures”

b) Number of cores up to which strong scaling results were shown: 14336
c) Scaling obtained: quadratic
d) Speedup obtained: 2.9x in setup, 1.4x than SpMV
3)
a) Application: “MICCO: An Enhanced Multi-GPU Scheduling Framework for Many-
Body Correlation Functions”
b) Number of cores: 256cores, 8 GPU
c) comment on the kind of scaling obtained (linear or any other),
d) speedup obtained: 1.96x

Scalability. Tensor size is 384. Vector size is 64.

4)
a) Application:” Virtual-Link: A Scalable Multi-Producer Multi-Consumer Message Queue
Architecture for Cross-Core Communication ”
b) Number of cores: 15
c) comment on the kind of scaling obtained: logarithmic
d) speedup obtained: 2.09x

5)
a) Application: “NVMe-CR: A Scalable Ephemeral Storage Runtime for
Checkpoint/Restart with NVMe-over-Fabrics”
b) Number of cores: 16
c) scaling obtained: linear

d) speedup obtained: 2x
For weak scaling,

1)
a) Application:” HTS: A Threaded Multilevel Sparse Hybrid Solver”
b) Weak scaling methodology: input size doubled and also the no of cores used
c) Number of cores up to which the results were shown: 16
d) comment on the weak scaling results: we do observe that as the problem size
increases HTS can continue to scale,

2)
a) Application: “A scalable adaptive-matrix SPMV for heterogeneous architectures”
b) Weak scaling methodology: 488 DoFs per MPI process
c) Number of cores up to which the results were shown:66
d) comment on the weak scaling results: 3x in setup time and 1.5x in SpMV than
PETSc-GPU
3)
a) Application: “PARallel Subgraph Enumeration in CUDA”
b) the weak scaling methodology used: we use synthetic random geometric graphs
(RGGs), A synthetic RGG at scale s has exactly 2s nodes
c) the number of cores: 224
d) comment on the weak scaling results: increasing in number of nodes proportional to
input size. Execution time shows linear result as expected.

4)
a) mention the application: “MICCO: An Enhanced Multi-GPU Scheduling Framework for
Many-Body Correlation Functions”
b) the weak scaling methodology used: Tensor size increases by 128 each time
c) the number of cores up to which the results were shown (8 GPU each having 32
cores=256 cores)
d) comment on the weak scaling results: GFLOPS proportional to increase in tensor size

Tensor size varies from 128 to 768. Vector size is 64.

5)
a) Application:” NVMe-CR: A Scalable Ephemeral Storage Runtime for
Checkpoint/Restart with NVMe-over-Fabrics”
b) Weak scaling methodology used: 32K atoms per process
c) Number of cores:16
d) comment on the weak scaling results: NVMe-CR achieves near perfect efficiency
(0.96 for checkpoint and 0.99 for recovery) at 448 processes

Architecture-Aware Optimization Strategies in Real-time Image Processing
From Everand
Architecture-Aware Optimization Strategies in Real-time Image Processing
Chao Li
No ratings yet
Evolutionary Algorithms for Mobile Ad Hoc Networks
From Everand
Evolutionary Algorithms for Mobile Ad Hoc Networks
Bernabé Dorronsoro
No ratings yet
Vlsi Lab Manual (18ecl77) - 2022-23
Document226 pages
Vlsi Lab Manual (18ecl77) - 2022-23
Prajwal Koppa
No ratings yet
3 Cuda
Document5 pages
3 Cuda
manvitha thottempudi
No ratings yet
Performance and Scalability Test of Code Saturne - HPC Wiki - Confluence PDF
Document4 pages
Performance and Scalability Test of Code Saturne - HPC Wiki - Confluence PDF
Domenico Mastropasqua
No ratings yet
sc14 HPCG
Document11 pages
sc14 HPCG
semabay
No ratings yet
Heroux App Perf On Multicores Mantevo Project SAND2008-1085P 020408
Document21 pages
Heroux App Perf On Multicores Mantevo Project SAND2008-1085P 020408
radaki-1
No ratings yet
Tnavigator Eng
Document12 pages
Tnavigator Eng
Alvaro Quinteros Cabrera
No ratings yet
VLSI LAB MANUAL (18ECL77) - Analog dt14-01-2022
Document148 pages
VLSI LAB MANUAL (18ECL77) - Analog dt14-01-2022
Aamish Priyam
No ratings yet
Multicore Quiz2
Document2 pages
Multicore Quiz2
Akhlad Najeem
No ratings yet
Tnavigator Reservoir Simulation
Document12 pages
Tnavigator Reservoir Simulation
MuhammadMulyawan
No ratings yet
2011 Advanced Computer Architecture: CS/B.TECH (CSE) /SEM-4/CS-403/2011
Document7 pages
2011 Advanced Computer Architecture: CS/B.TECH (CSE) /SEM-4/CS-403/2011
Avik Mitra
No ratings yet
Embedded System Design
Document15 pages
Embedded System Design
Kirthi Rk
No ratings yet
OSMid Exam SP 20
Document8 pages
OSMid Exam SP 20
Muhammad Akbar
No ratings yet
Christen 07
Document8 pages
Christen 07
bernasek
No ratings yet
Technical Questions: Calypso
Document17 pages
Technical Questions: Calypso
Pradeep Tiwari
No ratings yet
CUDA 2D Stencil Computations For The Jacobi Method: Jos e Mar Ia Cecilia, Jos e Manuel Garc Ia, and Manuel Ujald On
Document4 pages
CUDA 2D Stencil Computations For The Jacobi Method: Jos e Mar Ia Cecilia, Jos e Manuel Garc Ia, and Manuel Ujald On
openid_AePkLAJc
No ratings yet
ESD Bits
Document20 pages
ESD Bits
Kiran
No ratings yet
BuddyBland Titan SC12
Document12 pages
BuddyBland Titan SC12
bernasek
No ratings yet
Bits HD Pyq 2022
Document1 page
Bits HD Pyq 2022
Sudhakar Sharma
No ratings yet
Understanding The Efficiency of GPU Algorithms For Matrix-Matrix Multiplication
Document5 pages
Understanding The Efficiency of GPU Algorithms For Matrix-Matrix Multiplication
kere hore
No ratings yet
Model Qu
Document12 pages
Model Qu
Saurav Neupane
No ratings yet
MCQ's For LDCO Unit V
Document18 pages
MCQ's For LDCO Unit V
TEIT38 prasad Pansare
No ratings yet
2021-Final-Examination Update 05jan2022
Document5 pages
2021-Final-Examination Update 05jan2022
Tâm Trần
No ratings yet
Computer Architecture Questions
Document10 pages
Computer Architecture Questions
daniel
No ratings yet
K-Node Set Reliability Optimization of A Distributed Computing System Using Particle Swarm Algorithm
Document10 pages
K-Node Set Reliability Optimization of A Distributed Computing System Using Particle Swarm Algorithm
Oyeniyi Samuel Kehinde
No ratings yet
Advanced Computer Architecture Question 1: Mcqs
Document4 pages
Advanced Computer Architecture Question 1: Mcqs
Aliza Saddal
No ratings yet
Adv Map Des 8 Old
Document15 pages
Adv Map Des 8 Old
Anusha Ramanathan
No ratings yet
Be It Question
Document12 pages
Be It Question
sheham ihjam
No ratings yet
CN 3rd Unit MCQ 130
Document19 pages
CN 3rd Unit MCQ 130
Mohanaprakash Ece
100% (1)
Hardware-Software Codesign Lab Report
Document15 pages
Hardware-Software Codesign Lab Report
valjok
No ratings yet
A Performance Study of Applying CUDA-Enabled GPU in Polar Hough Transform For Lines
Document4 pages
A Performance Study of Applying CUDA-Enabled GPU in Polar Hough Transform For Lines
Journal of Computing
No ratings yet
Hpec12 Olofsson Publish
Document1 page
Hpec12 Olofsson Publish
echostorm
No ratings yet
Assignment 1 Computer Graphics
Document1 page
Assignment 1 Computer Graphics
kfrahman
No ratings yet
Computer Architecture: CS/B.TECH (CSE-NEW) /SEM-4/CS-403/2012
Document8 pages
Computer Architecture: CS/B.TECH (CSE-NEW) /SEM-4/CS-403/2012
Avik Mitra
No ratings yet
A High-Level Simulator For The H.264/AVC Decoding Process in Multi-Core Systems
Document23 pages
A High-Level Simulator For The H.264/AVC Decoding Process in Multi-Core Systems
StarLink1
No ratings yet
Facta Universitatis (Ni S) Ser. Math. Inform. Vol. 22, No. 2 (2007), Pp. 175-188
Document14 pages
Facta Universitatis (Ni S) Ser. Math. Inform. Vol. 22, No. 2 (2007), Pp. 175-188
coolguypj1953
No ratings yet
ABC: An Industrial-Strength Logic Synthesis and Verification Tool
Document29 pages
ABC: An Industrial-Strength Logic Synthesis and Verification Tool
M Chandan Shankar
No ratings yet
Embedded System MCQ
Document11 pages
Embedded System MCQ
ARPAN KUMAR BHANDARI
No ratings yet
QA Hardware Development - 310519
Document5 pages
QA Hardware Development - 310519
Joker Jr
No ratings yet
Yang 2018 Europa R
Document16 pages
Yang 2018 Europa R
Shrey Agarwal
No ratings yet
QA Hardware Development - 310519
Document5 pages
QA Hardware Development - 310519
Joker Jr
No ratings yet
PPSC Lecturer Computer Science Past Paper 2017
Document33 pages
PPSC Lecturer Computer Science Past Paper 2017
Faisal Khan
No ratings yet
Final Project Report Transient Stability of Power System (Programming Massively Parallel Graphics Multiprocessors Using CUDA)
Document5 pages
Final Project Report Transient Stability of Power System (Programming Massively Parallel Graphics Multiprocessors Using CUDA)
shotorbari
No ratings yet
CSE
Document6 pages
CSE
Koutheesh Sellamuthu
No ratings yet
Sequence Alignment Algorithm Overview
Document1 page
Sequence Alignment Algorithm Overview
robthomas1
No ratings yet
A LBM Solver 3D Fluid Simulation On GPU
Document9 pages
A LBM Solver 3D Fluid Simulation On GPU
Zhe Li
No ratings yet
PR301 MF Sample Questions - 1 SET Cics
Document12 pages
PR301 MF Sample Questions - 1 SET Cics
Pinank Parikh
No ratings yet
Parallel Project Section 3
Document2 pages
Parallel Project Section 3
Vawinda Vanichkhokool
No ratings yet
MCQ Esiot-2
Document35 pages
MCQ Esiot-2
Chaitanya Magar
50% (2)
Ecen 324 Practice Exam: Midterm #2: Int Unknown (A B && B C: B B A && A C: A 1: C )
Document5 pages
Ecen 324 Practice Exam: Midterm #2: Int Unknown (A B && B C: B B A && A C: A 1: C )
ThatOnePerson123
No ratings yet
Midterm: RISC, Etc.)
Document4 pages
Midterm: RISC, Etc.)
Easy
No ratings yet
LOW Power VLSI Design Paper PESCE
Document10 pages
LOW Power VLSI Design Paper PESCE
luckymanju
No ratings yet
UNIT V Scalable Multi-GPU Programming (T2 Chapter 6) - P P With CUDA
Document43 pages
UNIT V Scalable Multi-GPU Programming (T2 Chapter 6) - P P With CUDA
20BD1A519 KMIT
No ratings yet
64-Bit Versus 32-Bit Virtual Machines For Java: Kris Venstermans, Lieven Eeckhout and Koen de Bosschere
Document26 pages
64-Bit Versus 32-Bit Virtual Machines For Java: Kris Venstermans, Lieven Eeckhout and Koen de Bosschere
Abirami Senthilkumar
No ratings yet
Bigdata Bits PDF
Document2 pages
Bigdata Bits PDF
Shreyansh Diwan
No ratings yet
SLR-PK - 278: SLRPK278
Document4 pages
SLR-PK - 278: SLRPK278
Mayur Hanchate
No ratings yet
Cshardware Portfolio
Document3 pages
Cshardware Portfolio
api-243868301
No ratings yet
Generative Adversarial Networks For Data Generation and Damage Detection Using Resnet in Structural Health Monitoring
Document25 pages
Generative Adversarial Networks For Data Generation and Damage Detection Using Resnet in Structural Health Monitoring
Bhavana Bollarapu
No ratings yet
VLSI Projects
Document10 pages
VLSI Projects
B Naresh Kumar Reddy
No ratings yet
Sapient Paper On 11Th April, 2008
Document12 pages
Sapient Paper On 11Th April, 2008
rayoriz
No ratings yet
Karatina University: University Examinations 2017/2018 ACADEMIC YEAR
Document4 pages
Karatina University: University Examinations 2017/2018 ACADEMIC YEAR
Kimondo King
No ratings yet
Contemporary
Document2 pages
Contemporary
Albert Paggao
No ratings yet
Frequently Asked Questions For Revisiting Spacetrack Report #3
Document2 pages
Frequently Asked Questions For Revisiting Spacetrack Report #3
javiermonroyc
No ratings yet
Curriculum Vitae: Alex Gnanamani.K
Document4 pages
Curriculum Vitae: Alex Gnanamani.K
DIJU
No ratings yet
30 Contoh Soal Passive Voice Pilihan Ganda Dan Jawabannya
Document13 pages
30 Contoh Soal Passive Voice Pilihan Ganda Dan Jawabannya
futrika saragi
100% (1)
Research Proposal
Document3 pages
Research Proposal
jan ray aribuabo
No ratings yet
Quotes Ps
Document10 pages
Quotes Ps
Srinivasan Parthasarathy
No ratings yet
Dwnload Full Business Research Methods 9th Edition Zikmund Solutions Manual PDF
Document35 pages
Dwnload Full Business Research Methods 9th Edition Zikmund Solutions Manual PDF
erichuel33a
100% (14)
Government Polytechnic, Pune: ET2107 - NO
Document8 pages
Government Polytechnic, Pune: ET2107 - NO
G012 Bhise Aniket
No ratings yet
Contribucion de Las FE A La Autorregulacion
Document24 pages
Contribucion de Las FE A La Autorregulacion
Amanda Riedemann Carrillo
No ratings yet
Final Differential Equations (PDF)
Document88 pages
Final Differential Equations (PDF)
Tina Shah
No ratings yet
Claves Eset 5
Document3 pages
Claves Eset 5
Orquesta Sensacion Caribe
No ratings yet
Cambridge O Level: PHYSICS 5054/42
Document16 pages
Cambridge O Level: PHYSICS 5054/42
Lapu Lapu
No ratings yet
English 6 W10 DAYS 1-2
Document13 pages
English 6 W10 DAYS 1-2
Mary Jane Cuevas
No ratings yet
Toplotna Pumpa Hidria Clint - Eu - Cha K - 182 P 604 P - cls61.7 Eng
Document2 pages
Toplotna Pumpa Hidria Clint - Eu - Cha K - 182 P 604 P - cls61.7 Eng
Muhidin Kozica
No ratings yet
SPHE8281D
Document35 pages
SPHE8281D
diego-t
No ratings yet
BARAKA Modelling Proposal
Document9 pages
BARAKA Modelling Proposal
murali.5482
No ratings yet
Kinematics of A Novel Nine Degree of Freedom Configurable Gough-Stewart Platform
Document19 pages
Kinematics of A Novel Nine Degree of Freedom Configurable Gough-Stewart Platform
Neider Nadid
No ratings yet
Composite Insulators Profile Optimization Using Particle Swarm Algorithm and Finite Element Method
Document6 pages
Composite Insulators Profile Optimization Using Particle Swarm Algorithm and Finite Element Method
Fernando Santana
No ratings yet
Modul Customer Service
Document5 pages
Modul Customer Service
Fandy Bestario Harlan
No ratings yet
AWS Cloud Architect: Nanodegree Program Syllabus
Document14 pages
AWS Cloud Architect: Nanodegree Program Syllabus
tausif shaikh
No ratings yet
LGIT Catalogue
Document11 pages
LGIT Catalogue
Arjun Sharma
No ratings yet
11+ CEM English and Verbal Reasoning PRACTICE PAPER 1
Document15 pages
11+ CEM English and Verbal Reasoning PRACTICE PAPER 1
Olawale
No ratings yet
Reference Manual: Buildings and Infrastructure Protection Series
Document514 pages
Reference Manual: Buildings and Infrastructure Protection Series
mydearteacher
No ratings yet
Bulletin: ICC International Court of Arbitration
Document30 pages
Bulletin: ICC International Court of Arbitration
Priscila Machado Martins
No ratings yet
Rele A Gas Buchholts
Document18 pages
Rele A Gas Buchholts
Marco Giraldo
No ratings yet
Edi Wow
Document11 pages
Edi Wow
fantasigh
No ratings yet
A Study On Pushover Analysis Using Capacity Spectrum Method Based On Eurocode 8
Document13 pages
A Study On Pushover Analysis Using Capacity Spectrum Method Based On Eurocode 8
eph
No ratings yet
Conductivity Type of Extrinsic Semiconducting Materials: Standard Test Methods For
Document6 pages
Conductivity Type of Extrinsic Semiconducting Materials: Standard Test Methods For
Rob Gridley
No ratings yet