Assignment Two

Uploaded by

Eliza Caraman

Addis Ababa University

Addis Ababa Institute of Technology

School of Electrical and Computer Engineering

ECEG 6503 - Advanced Computer Architecture

Assignment Two

This assignment is adapted from the book by Hennessy and Patterson.

1. Memory Optimization
We use compiler optimization to improve the performance of the memory subsystem. In this
exercise we will use two experiments to appreciate this improvement.
a. Loop Interchange (a kind of)
i. Implement a simple matrix multiplication program. Make the matrix size
above 2048x2048. Measure the runtime.
ii. Transpose matrix B and change the matrix multiplication algorithm to
accommodate this change. Measure both the transposing time and
multiplication time together. Comment on the results!
b. Blocking
i. For the same matrix size stated above, use the blocking method and algorithm
described on pages 107 – 109 of the textbook and measure the run time for
block sizes of 2, 4, 8, 16, 32, 64, 128, 256, 512, and 1024.
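The three variants asked for in part 1 can be sketched as below. This is a minimal illustration, not the textbook's listing: the function names, the row-major `double` layout, and the `bs` block-size parameter are assumptions made for the example, and the blocked loop nest is one common tiling formulation that may differ in detail from the one on the cited pages.

```c
#include <stddef.h>

/* Naive product C = A * B for n x n row-major matrices.
   B is read down a column, so consecutive reads are n elements apart. */
void matmul_naive(const double *A, const double *B, double *C, size_t n)
{
    for (size_t i = 0; i < n; i++)
        for (size_t j = 0; j < n; j++) {
            double sum = 0.0;
            for (size_t k = 0; k < n; k++)
                sum += A[i * n + k] * B[k * n + j];
            C[i * n + j] = sum;
        }
}

/* Writes the transpose of B into Bt (Bt must not alias B). */
void transpose(const double *B, double *Bt, size_t n)
{
    for (size_t i = 0; i < n; i++)
        for (size_t j = 0; j < n; j++)
            Bt[j * n + i] = B[i * n + j];
}

/* Product using the transposed B: both operands are read along rows. */
void matmul_transposed(const double *A, const double *Bt, double *C, size_t n)
{
    for (size_t i = 0; i < n; i++)
        for (size_t j = 0; j < n; j++) {
            double sum = 0.0;
            for (size_t k = 0; k < n; k++)
                sum += A[i * n + k] * Bt[j * n + k];
            C[i * n + j] = sum;
        }
}

/* Blocked (tiled) product. bs is the block size and must be > 0;
   n does not have to be a multiple of bs. */
void matmul_blocked(const double *A, const double *B, double *C,
                    size_t n, size_t bs)
{
    for (size_t i = 0; i < n * n; i++)
        C[i] = 0.0;

    for (size_t jj = 0; jj < n; jj += bs) {
        size_t jmax = (jj + bs < n) ? jj + bs : n;
        for (size_t kk = 0; kk < n; kk += bs) {
            size_t kmax = (kk + bs < n) ? kk + bs : n;
            for (size_t i = 0; i < n; i++)
                for (size_t j = jj; j < jmax; j++) {
                    double sum = 0.0;
                    for (size_t k = kk; k < kmax; k++)
                        sum += A[i * n + k] * B[k * n + j];
                    C[i * n + j] += sum;   /* accumulate partial dot product */
                }
        }
    }
}
```

For the measurements, the matrices would be allocated on the heap (a 2048x2048 matrix of `double` is 32 MiB) and each function timed separately; for part a.ii the time for `transpose` is added to the time for `matmul_transposed`.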
2. Memory access behavior study
In this study you will follow the case study (case study 2) described on pages 150-152.
Answer question 2.4 from the plots of your experiment.

3. Vector Processing
Most modern processors come equipped with vector processing units to enhance the performance of
data-parallel applications. Do the following experiments and report your findings.
a. Simple vector addition: given the following vector addition function, measure the time
it takes to complete. Vary size over 1024, 2048, 4096, 8192, 16384, 32768, 65536,
131072, 262144, 524288, 1048576, 2097152, 4194304, 8388608, 16777216,
33554432, 67108864.
void vecAdd(int *A, int *B, int *C, int size)
{
    for (int i = 0; i < size; i++)
        C[i] = A[i] + B[i];
}
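One possible way to take the measurement is sketched below. The helper `time_vec_add` and its `repeats` parameter are hypothetical names introduced for this example; the sketch uses the standard `clock()` function, which reports CPU time at a fairly coarse resolution, so averaging over several repetitions is assumed to be necessary for the smaller sizes.

```c
#include <stdlib.h>
#include <time.h>

void vecAdd(int *A, int *B, int *C, int size)
{
    for (int i = 0; i < size; i++)
        C[i] = A[i] + B[i];
}

/* Hypothetical helper: average CPU seconds per vecAdd call over `repeats`
   runs on vectors of `size` ints. Returns -1.0 if allocation fails. */
double time_vec_add(int size, int repeats)
{
    int *A = malloc((size_t)size * sizeof *A);
    int *B = malloc((size_t)size * sizeof *B);
    int *C = malloc((size_t)size * sizeof *C);
    if (!A || !B || !C) {
        free(A); free(B); free(C);
        return -1.0;
    }

    /* Initialise the inputs so every page is touched before timing starts. */
    for (int i = 0; i < size; i++) {
        A[i] = i;
        B[i] = 2 * i;
        C[i] = 0;
    }

    clock_t start = clock();
    for (int r = 0; r < repeats; r++)
        vecAdd(A, B, C, size);
    clock_t end = clock();

    /* Read one result through a volatile so the work is observably used. */
    volatile int sink = C[size - 1];
    (void)sink;

    free(A); free(B); free(C);
    return (double)(end - start) / CLOCKS_PER_SEC / repeats;
}
```

A driver would call `time_vec_add(size, repeats)` for each size in the list, doubling from 1024 up to 67108864, and print one line per size. Many compilers vectorize or unroll this loop automatically at higher optimization levels, so the optimization flags used should be recorded alongside the timings.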

b. Follow the same process as in part a, but unroll the loop by a factor of four. How do
the results differ from those in part a?

for (int i = 0; i < size; i += 4)
{
    C[i] = A[i] + B[i];
    C[i+1] = A[i+1] + B[i+1];
    C[i+2] = A[i+2] + B[i+2];
    C[i+3] = A[i+3] + B[i+3];
}
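The unrolled loop reads past the end of the arrays if size is not a multiple of four. All the sizes listed in part a are powers of two, so this does not arise in the experiment, but a general version needs a cleanup loop. The sketch below shows one way to write it; the function name `vecAddUnrolled` is an assumption made for the example.

```c
/* Four-way unrolled vector addition with a scalar cleanup loop
   for the last (size % 4) elements. */
void vecAddUnrolled(const int *A, const int *B, int *C, int size)
{
    int i = 0;
    int limit = size - (size % 4);   /* largest multiple of 4 not above size */

    for (; i < limit; i += 4) {
        C[i]     = A[i]     + B[i];
        C[i + 1] = A[i + 1] + B[i + 1];
        C[i + 2] = A[i + 2] + B[i + 2];
        C[i + 3] = A[i + 3] + B[i + 3];
    }

    /* Remaining elements, at most three. */
    for (; i < size; i++)
        C[i] = A[i] + B[i];
}
```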

c. In this experiment we will implement the same vector addition task using the
vector processing capability of the processor. Most Intel CPUs should have this
capability. Compare and contrast the results with those you got in parts a and b. Make
sure to include the header file: #include <emmintrin.h>

void vecAddRealSSE(float *a, float *b, float *c, int N)
{
    /* Assumes N is a multiple of 4 and that a, b and c are 16-byte aligned,
       which _mm_load_ps and _mm_store_ps require. */
    for (int i = 0; i < N; i += 4)
    {
        __m128 sse_a = _mm_load_ps(&a[i]);       // load four floats into a vector register
        __m128 sse_b = _mm_load_ps(&b[i]);       // load four floats into a vector register
        __m128 sse_c = _mm_add_ps(sse_a, sse_b); // add the two vectors element-wise
        _mm_store_ps(&c[i], sse_c);              // store the four results back
    }
}
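The aligned load and store intrinsics fault if the arrays are not 16-byte aligned, which ordinary `malloc` does not guarantee on every platform. One way around this, sketched below under the assumption of an x86 target with SSE, is to use the unaligned variants `_mm_loadu_ps` and `_mm_storeu_ps` and to finish any leftover elements with a scalar loop. The function name `vecAddSSEUnaligned` is hypothetical and not part of the assignment.

```c
#include <emmintrin.h>

/* SSE vector addition that accepts any alignment and any length n.
   Processes four floats per iteration, then a scalar tail. */
void vecAddSSEUnaligned(const float *a, const float *b, float *c, int n)
{
    int i = 0;

    for (; i + 4 <= n; i += 4) {
        __m128 va = _mm_loadu_ps(&a[i]);          /* unaligned load of 4 floats */
        __m128 vb = _mm_loadu_ps(&b[i]);
        _mm_storeu_ps(&c[i], _mm_add_ps(va, vb)); /* unaligned store of 4 sums */
    }

    /* Remaining elements, at most three. */
    for (; i < n; i++)
        c[i] = a[i] + b[i];
}
```

Note that this function and the one in part c operate on float, whereas parts a and b use int, so a like-for-like comparison would use the same element type in all three experiments.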
