HW 1 Xsede

CS 267 HW 1

Ben Brock
Optimizing Matrix Multiply
- In HW 1, you’ll be optimizing matrix multiply

- C = C + AB, where A, B, and C are dense matrices

- For simplicity, we’ll consider the case of square matrices

Problem Pseudocode
for i = 1 to N:
    for j = 1 to N:
        for k = 1 to N:
            c[i, j] = c[i, j] + a[i, k] * b[k, j]

3 nested loops => O(n^3) complexity

Your Job: Implement This Interface

void square_dgemm(int n, double* A, double* B, double* C);

You write this function; we call it from a test harness.

Your job is to make it run as fast as possible.
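
As a starting point, the pseudocode above maps onto this interface roughly as follows. This is a minimal sketch, not the course's reference code. It assumes column-major storage (element (i, j) at index i + j*n), which the slide does not state; adjust the indexing if the harness uses a different layout.

```c
#include <assert.h>

/* Naive C = C + A*B for n x n matrices.
 * Assumption (not stated on the slide): column-major storage,
 * so element (i, j) lives at index i + j*n. */
void square_dgemm(int n, double* A, double* B, double* C)
{
    assert(n >= 0);
    for (int i = 0; i < n; ++i) {
        for (int j = 0; j < n; ++j) {
            double cij = C[i + j * n];      /* accumulate in a local variable */
            for (int k = 0; k < n; ++k)
                cij += A[i + k * n] * B[k + j * n];
            C[i + j * n] = cij;
        }
    }
}
```

Every optimized variant should produce the same result as this loop nest (up to floating-point rounding), so it is useful to keep around for correctness checks.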

Optimization Techniques
1) Blocking
    a) L1 blocking
    b) Register blocking
    c) L2 blocking
2) Copy optimization
    a) Copy to an aligned buffer
    b) Transpose?
3) Vectorization
    a) Write small, fixed-size (n=8-16) GEMM, examine assembly
    b) Intrinsics
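
For the vectorization item, a small fixed-size kernel might look like the sketch below. The name dgemm_fixed and the size KERNEL_N = 8 are hypothetical choices for illustration, and column-major storage is again an assumption. The restrict qualifiers promise the compiler that the three arrays do not overlap, which tends to make auto-vectorization of the inner loop easier.

```c
#include <assert.h>

enum { KERNEL_N = 8 };  /* hypothetical fixed size; the slide suggests n = 8-16 */

/* C += A * B for KERNEL_N x KERNEL_N matrices (assumed column-major).
 * The arrays must not overlap, as required by restrict. */
void dgemm_fixed(const double* restrict A,
                 const double* restrict B,
                 double* restrict C)
{
    assert(A && B && C);
    for (int j = 0; j < KERNEL_N; ++j) {
        for (int k = 0; k < KERNEL_N; ++k) {
            const double bkj = B[k + j * KERNEL_N];
            /* Unit stride in i: column j of C += bkj * column k of A. */
            for (int i = 0; i < KERNEL_N; ++i)
                C[i + j * KERNEL_N] += A[i + k * KERNEL_N] * bkj;
        }
    }
}
```

With GCC or Clang, compiling with something like -O3 -march=native -S writes the assembly to a file, where packed multiply/add instructions in the inner loop would indicate that the compiler vectorized it.
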
Blocking (or Tiling)
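
One level of blocking can be sketched as below: the matrices are split into BLOCK_SIZE x BLOCK_SIZE tiles so that the three tiles in use fit in cache together. BLOCK_SIZE = 32, min_int, and do_block are hypothetical names and values chosen for illustration, and column-major storage is assumed; the best tile size depends on the target machine.

```c
#include <assert.h>

#define BLOCK_SIZE 32  /* hypothetical tile size; tune for the target cache */

static int min_int(int a, int b) { return a < b ? a : b; }

/* C += A*B on an M x K by K x N sub-block.
 * lda is the leading dimension (the full matrix size n). */
static void do_block(int lda, int M, int N, int K,
                     const double* A, const double* B, double* C)
{
    for (int j = 0; j < N; ++j)
        for (int k = 0; k < K; ++k) {
            const double bkj = B[k + j * lda];
            for (int i = 0; i < M; ++i)
                C[i + j * lda] += A[i + k * lda] * bkj;
        }
}

/* Blocked C = C + A*B, assuming column-major storage. */
void square_dgemm(int n, double* A, double* B, double* C)
{
    assert(n >= 0);
    for (int j = 0; j < n; j += BLOCK_SIZE)
        for (int k = 0; k < n; k += BLOCK_SIZE)
            for (int i = 0; i < n; i += BLOCK_SIZE) {
                /* Edge tiles are smaller when n is not a multiple of BLOCK_SIZE. */
                int M = min_int(BLOCK_SIZE, n - i);
                int N = min_int(BLOCK_SIZE, n - j);
                int K = min_int(BLOCK_SIZE, n - k);
                do_block(n, M, N, K,
                         A + i + k * n, B + k + j * n, C + i + j * n);
            }
}
```

The same idea can be nested: an outer tile sized for L2, an inner tile sized for L1, and a small register-blocked kernel inside do_block.
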
Copy Optimization
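
A minimal sketch of the copy idea, combining both sub-items from the list: A is copied, transposed, into a 64-byte-aligned scratch buffer so that the inner loop reads both operands with unit stride. The buffer name AT and the 64-byte alignment are illustrative assumptions, as is column-major storage. A tuned version would more likely copy one tile at a time rather than the whole matrix.

```c
#include <assert.h>
#include <stdlib.h>

/* C = C + A*B with a transposed, aligned copy of A (column-major assumed). */
void square_dgemm(int n, double* A, double* B, double* C)
{
    assert(n >= 0);
    if (n == 0) return;

    /* aligned_alloc (C11) requires the size to be a multiple of the alignment. */
    size_t bytes = (size_t)n * (size_t)n * sizeof(double);
    bytes = (bytes + 63) / 64 * 64;
    double* AT = aligned_alloc(64, bytes);
    assert(AT != NULL);  /* sketch only: real code should handle failure */

    /* Copy step: AT[k + i*n] = A(i, k), so row i of A becomes contiguous. */
    for (int k = 0; k < n; ++k)
        for (int i = 0; i < n; ++i)
            AT[k + i * n] = A[i + k * n];

    for (int j = 0; j < n; ++j)
        for (int i = 0; i < n; ++i) {
            double cij = C[i + j * n];
            for (int k = 0; k < n; ++k)
                cij += AT[k + i * n] * B[k + j * n];  /* both unit stride in k */
            C[i + j * n] = cij;
        }

    free(AT);
}
```

The copy costs O(n^2) work against O(n^3) for the multiply, so it can pay for itself once n is reasonably large.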
