Zareen 14

The document discusses instruction-level parallelism and its exploitation through techniques like loop unrolling and scheduling. It provides examples of how unrolling and scheduling a sample loop can reduce the number of cycles needed per loop iteration. It also shows how unrolling a loop for a VLIW architecture that can issue multiple operations per cycle can eliminate stalls. However, overly unrolling loops can increase code size and unused functional units in the VLIW model can waste encoding bits.

Uploaded by Jehangir Vakil

Multiple Issue Architectures

Book 1 – Computer Architecture: A Quantitative Approach, Hennessy and Patterson, 5th Edition, Morgan Kaufmann, 2012
Chapter Three: Instruction-Level Parallelism and Its Exploitation
Example of loop unrolling
Show our loop unrolled so that there are four copies of the loop body, assuming
R1 – R2 (that is, the size of the array) is initially a multiple of 32, which means
that the number of loop iterations is a multiple of 4. Eliminate any obviously
redundant computations and do not reuse any of the registers.
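As a source-level illustration of what the exercise asks for, the sketch below unrolls the loop x[i] = x[i] + s four times. It is written in Python rather than MIPS assembly, and the function names and temporaries (`t0` to `t3`) are hypothetical stand-ins for the separate registers the exercise requires; it assumes the array length is a multiple of 4.

```python
def add_scalar_rolled(x, s):
    """Reference loop: x[i] = x[i] + s, one element per iteration."""
    for i in range(len(x) - 1, -1, -1):  # index counts down, one test per element
        x[i] = x[i] + s
    return x


def add_scalar_unrolled4(x, s):
    """Same loop with four copies of the body per iteration."""
    if len(x) % 4 != 0:
        raise ValueError("length must be a multiple of 4")
    for i in range(len(x) - 1, -1, -4):  # one index update and one test per four elements
        # Four distinct temporaries: no "register" is reused between copies.
        t0 = x[i] + s
        t1 = x[i - 1] + s
        t2 = x[i - 2] + s
        t3 = x[i - 3] + s
        x[i], x[i - 1], x[i - 2], x[i - 3] = t0, t1, t2, t3
    return x
```

Both functions produce the same array; the unrolled version simply performs the loop test and index update once for every four elements instead of once per element.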
Example of loop unrolling

• Eliminated three branches and three decrements of R1
• This loop will run in 27 cycles (14 instruction issue cycles plus 13 stall cycles), which is 27/4 = 6.75 cycles per element
• The performance can be improved further if the unrolled loop is also scheduled
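The cycle count above can be checked with a few lines of arithmetic. The sketch assumes the rolled loop body has five instructions (load, add, store, decrement, branch); the slides do not list the instructions, so that figure is an assumption used only to show where the 14 issue cycles could come from.

```python
UNROLL_FACTOR = 4
ROLLED_BODY_INSTRUCTIONS = 5   # assumption: load, add, store, decrement, branch
REMOVED_BRANCHES = 3           # from the slide
REMOVED_DECREMENTS = 3         # from the slide
STALL_CYCLES = 13              # from the slide

# Four copies of the body, minus the loop overhead that unrolling eliminates.
issue_cycles = (UNROLL_FACTOR * ROLLED_BODY_INSTRUCTIONS
                - REMOVED_BRANCHES - REMOVED_DECREMENTS)
total_cycles = issue_cycles + STALL_CYCLES
cycles_per_element = total_cycles / UNROLL_FACTOR

print(issue_cycles, total_cycles, cycles_per_element)  # prints: 14 27 6.75
```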
Example of loop unrolling and scheduling
Show the unrolled loop in the previous example after it has been scheduled.

For unrolled and scheduled:
• Total cycles = 14
• 14/4 = 3.5 cycles/element
For unrolled only:
• 27/4 = 6.75 cycles/element
For scheduled only:
• 7 cycles/element
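To put the three variants side by side, the following sketch tabulates cycles per element and the speedup relative to the scheduled-only loop. The choice of the scheduled-only loop as the baseline is an assumption made for illustration; the cycle counts are the ones given on the slides.

```python
cycles_per_element = {
    "scheduled only": 7.0,
    "unrolled only": 27 / 4,
    "unrolled and scheduled": 14 / 4,
}

baseline = cycles_per_element["scheduled only"]
speedup = {name: baseline / cpe for name, cpe in cycles_per_element.items()}

for name, cpe in cycles_per_element.items():
    print(f"{name:24s} {cpe:5.2f} cycles/element   speedup {speedup[name]:4.2f}x")
```

Unrolling alone barely improves on the scheduled loop because the stalls remain; combining unrolling with scheduling halves the cycles per element.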
Example of basic VLIW model
Suppose we have a VLIW that could issue two memory references,
two FP operations, and one integer operation or branch in every clock
cycle. Show an unrolled version of the loop x[i] = x[i] + s for such a
processor. Unroll as many times as necessary to eliminate any stalls.
Ignore delayed branches.
Total cycles: 9
Issue rate: 23 operations in 9 clock cycles (≈ 2.56 operations per cycle)
Efficiency (the percentage of available slots that contained an operation): 23 of 9 × 5 = 45 slots ≈ 51%
This VLIW code sequence requires at least eight FP registers, while the same code sequence for the base MIPS processor can use as few as two FP registers
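The issue rate and slot efficiency follow directly from the machine description: each VLIW instruction has room for two memory references, two FP operations, and one integer operation or branch. A minimal sketch of that calculation, using only the numbers given on the slides (the variable names are illustrative):

```python
SLOTS_PER_CYCLE = 2 + 2 + 1   # memory references + FP operations + integer/branch
CYCLES = 9
OPERATIONS = 23

total_slots = SLOTS_PER_CYCLE * CYCLES   # slots available over the whole sequence
empty_slots = total_slots - OPERATIONS   # slots that carry no operation
issue_rate = OPERATIONS / CYCLES         # operations per clock cycle
efficiency = OPERATIONS / total_slots    # fraction of slots that are filled

print(f"slots: {total_slots}, empty: {empty_slots}")
print(f"issue rate: {issue_rate:.2f} ops/cycle, efficiency: {efficiency:.1%}")
```

Roughly half of the available slots are empty, which is the wasted-encoding problem the VLIW model is criticized for.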
Two technical problems with the VLIW model:
1. Generating enough operations in a straight-line code fragment requires ambitiously unrolling loops, thereby increasing code size.
2. Whenever instructions are not full, the unused functional units translate to wasted bits in the instruction encoding.
THE END
