- Material View[1]
- Chapters
- Budynas SM Ch01
- Introduction to AI and the perceptron
- 23DemoPartitioning.pdf
- Prime Number
- Mopso Guide
- 304Various Image Segmentation Method for Underwater Acoustic Image a Survey PDF
- PQ
- Efficient Formation of Storage Classes For Warehouse Storage Location
- syntax wavelet.docx
- Corlett Method for Solving Cyphers
- SKEMA 1449_Mathematics SPM.doc
- Particle Swarm Optimization-Meander Line Polarizer
- Filter Fir
- Sample Reporting Posted
- limit1
- 05_table of Contents
- DynProg-1 (1)
- Deng et al
- mvg_exercisesheet_01
- derivatives - Solve the differential equation_ $(y^2-xy)dx+x^2dy=0$ - Mathematics Stack Exchange
- Final_Student Practice Sheet _AHP Example
- [Blair]_Convex Optimization and Lagrange Multipliers (1977)
- Z transform
- Fett(1998)
- aalg8
- Mathematics 2010
- ralhan2013.pdf
- DSP Lab 1-5.docx
- Sapiens: A Brief History of Humankind
- The Unwinding: An Inner History of the New America
- Yes Please
- Dispatches from Pluto: Lost and Found in the Mississippi Delta
- Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
- The Innovators: How a Group of Hackers, Geniuses, and Geeks Created the Digital Revolution
- John Adams
- Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
- A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
- Grand Pursuit: The Story of Economic Genius
- This Changes Everything: Capitalism vs. The Climate
- The Emperor of All Maladies: A Biography of Cancer
- The Prize: The Epic Quest for Oil, Money & Power
- Team of Rivals: The Political Genius of Abraham Lincoln
- The New Confessions of an Economic Hit Man
- The World Is Flat 3.0: A Brief History of the Twenty-first Century
- Rise of ISIS: A Threat We Can't Ignore
- Smart People Should Build Things: How to Restore Our Culture of Achievement, Build a Path for Entrepreneurs, and Create New Jobs in America
- The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
- How To Win Friends and Influence People
- Angela's Ashes: A Memoir
- Steve Jobs
- Bad Feminist: Essays
- The Light Between Oceans: A Novel
- Leaving Berlin: A Novel
- The Silver Linings Playbook: A Novel
- The Sympathizer: A Novel (Pulitzer Prize for Fiction)
- Extremely Loud and Incredibly Close: A Novel
- You Too Can Have a Body Like Mine: A Novel
- The Incarnations: A Novel
- Life of Pi
- The Love Affairs of Nathaniel P.: A Novel
- A Man Called Ove: A Novel
- Bel Canto
- The Master
- We Are Not Ourselves: A Novel
- The First Bad Man: A Novel
- The Rosie Project: A Novel
- The Blazing World: A Novel
- Brooklyn: A Novel
- The Flamethrowers: A Novel
- The Art of Racing in the Rain: A Novel
- Wolf Hall: A Novel
- The Wallcreeper
- Interpreter of Maladies
- Beautiful Ruins: A Novel
- The Kitchen House: A Novel
- The Perks of Being a Wallflower
- The Bonfire of the Vanities: A Novel
- Lovers at the Chameleon Club, Paris 1932: A Novel
- A Prayer for Owen Meany: A Novel
- The Cider House Rules
- Mislaid: A Novel

Parameter-Optimized Simulated Annealing for Application Mapping on Networks-on-Chip

Outline

Introduction Application Mapping Implementation of SA Nelder-Mead Simplex Method Experiment and Analysis Conclusion

ZDNet.. • Intel: Why a 1.000-core chip is feasible..Introduction Moore’s Law is still valid (ITRS’s perspective) What could we do with billions of transistors? • Tens to hundreds of cores on a single chip • 80-core Intel Terascale Chip • Tilera TILE-Gx Family with 16 to 100 processing cores • . 2010 Manycore architecture has become the mainstream for parallel commputing .

bottleneck Networks-on-Chip (NoC) • Better scalability • Higher reliability • More reusability . point-to-point connections.Introduction Major Concern • Communication. instead of computation • Great impact on perfomance and energy consumption • Conventional bus.

Application Mapping Application • a set of concurrent tasks • modeled by the communication weighted graph (CWG) Many-core NoC • a set of tiles and links • Modeled by the computation and communication resource graph (CCRG) .

Objective: mapping solution to minimize the communication energy consumption .Application Mapping The role is to determine how to place each task on a tile of the NoC so that the specific design interests and costraints are fulfilled.

Application Mapping Energy model of NoC [Jingcao2005] • Energy consumped by one communication • where • : data volume transferred from task i to j : distance of communication channel from node i to j on the NoC • and : energy consumed by switch and link for transferring one bit of data on the NoC .

.Application Mapping Objective Formulization • Communication energy consumption of an application • Given constants the product of and and . Weghted Communication of an Application (WCA) Smaller WCA Better solution The objective of the application mapping is to find the optimal solution with minimal WCA. Eapp is linearly proportional to of all communications .

. Greedy Incremental (GI).55e25 ) • Heuristic search including Simulated Annealing (SA). n=m=25. 25! 1.Application Mapping NP-hard problem • to map m tasks on n cores (m • n) (n n! m 1 )! possible solutions • search space increases exponentionally with problem size m and n • Exhaustive search is impossible.. Tabu Search (TS).g. etc. (e.

. T) • etc. T) • Termination function Terminate(i. R) • Move function Move(S.Simulated Annealing Pro and Con • Be able to find global optima • Numerous computions and evaluations-long runtime Parameters and Functions • Initial temperature T0 • Final temperature Tf • Cost fucntion Cost(S) • Temperature function Temp(i) • Acceptance function Accept( C.

q: cooling ratio • L: # of iterations at each temperature .Simulated Annealing Cost function Cost(S) • Cost(S) = WCA of solution S Temperature function Temp(i) Temp(i ) T 0 q i L • i: # of iterations.

C : cost difference • K: normalized ratio Termination function Terminate(i. T) • Single random swapping • A task in current solution is randomly selected and swapped to a randomly selected tiles to generate a new solution .Simulated Annealing Acceptance function Accept( C. T) random() prob 1 1 exp( KCCT ) 0 • C0: initial cost. R) Temp(i ) Tf R R max N C 0 Z Move function Move(S.

Simulated Annealing Initial temperature T0 and final temperature Tf • Solve the acceptance function for T T KC 0 ln( C 1 1) prob T0 and Tf can be derived by: T0 C max 1 1) KC0 ln( prob0 Tf C min 1 KC0 ln( 1) prob f • prob0:probability of accepting Cmax at temperature T0 • probf:probability of accepting Cmin at temperature Tf .

we need to determine parameters: • q: colling ratio • K: normalizing ratio • prob0: probability of accepting Cmax at temperature T0 • probf: probability of accepting Cmin at temperature Tf • Cmax. instead of being set mannually and independently. .Simulated Annealing To summary. Cmin : coputed using a finite number of trial moves Considerations on parameter selection • problem-specific • jointly afftect the performance of SA The set of parameters should be selected in a system way.

the expansion point or the contraction point. …. xkn) • Sort the n+1 function values so that f(p0) f(p1) … f(pn) To get the minimum of f(p). x2. xk2. or by updating all points when the preceding replacements failed.Nelder-Mead Simplex Method Method for minimization of a function f(p) • Proposed by Nelder et. xn • n+1 points form the initial simplex.al in 1965 • f(p) : function with n variables x1. each point pk is a ntuple (xk1. in each iteration: • a new simplex is formed:either by replacing the point pn with the refelection point. …. • Sort the n+1 function values of points in the new simplex and continue the process .

refer J. f(p1). f(pn) converge to one value which is the approximation of the mimimum value of function f(p) For more detail.Nelder-Mead Simplex Method The process terminates until f(p0). A simplex method for function minimization. …. .Mead.Nelder and R.A.

prob0 and probf • Initial simplex:5 initial points consisting of selected values of 4 variables • SA algorihm applies each set of parameters of one point and finds one mapping solution • The WCA of the mapping solution found by the SA algorithm is defined as the value of function f(p) and compared with others • The Nelder-Mead method terminates when all 5 points converge to one point which represents the set of optimized parameters we try to find • This set of optimized parameters is then applied to the SA algorithm to find the best mapping solution .Nelder-Mead Simplex Method Parameter-Optimized SA (POSA) algorithm • Variables: q. K.

16 tasks) • MPEG4 12 tasks • multimedia systems application (MMS.Expereiment and Result Setup Four applications • video object plane decoder (VOPD.264 decoder (H264. Tf:unbounded • Exponential form of acceptance function • Random swapping move function Simulator in NoCMap is used to obtain the communication energy consumption for POSA and NoCMap . T0:100. 16 tasks) Reference work: NoCMap ([Jingcao2005]) • Parameters are set manually • q:0. 25 tasks) • H.9.

05 0. for different problems.05 0.36 0. different sets of parmaters should be applied to the SA algorithm .05 0.42 probf 0.34 0.36 0.94 0.49 Parameters are problem-specific Instead of using identical set of parameters.44 0.Expereiment and Result Optimized Parameters Application VOPD MPEG4 MMS H264 q 0.89 prob0 0.72 0.95 0.05 K 0.91 0.62 0.

77e4 1.06% 1.64% 1.14e7 1.Expereiment and Result Number of Iterations Application VOPD MPEG4 MMS H264 Avg.18e5 1.94% • POSA uses significantly less number of iterations • On average less than 1% of that in NoCMap .61e6 1.74e4 2.30e6 2.02% 0.61e6 POSA 2.04% 1. NoCMap 4.94e4 POSA/NoCMap 0.

75 11.17 0.Expereiment and Result Runtime of SA (seconds) App VOPD MPEG4 MMS H264 Avg.41 POSA’ 0.059 1.04 1.63 0. a 237 times of speedup is achieved .087 0.072 NoCMap /POSA’ 364 267 147 171 237 POSA includes the runtime of the Nelder-Mead method POSA’ is the runtime of SA applying the optimized parameters On average.94 1.74 171. NoCMap 31.34 POSA 15.50 9.67 181.04 1.69 15.74 12.90 NoCMap /POSA 2.

Expereiment and Result Weighted Communication (WCA) The mapping solution of POSA yields comparable WCA with that of NoCMap .

Expereiment and Result Energy Consumption(EC) Be consistent with the result of WCA The mapping solution of POSA yields comparable communication energy consumption with that of NoCMap .

With the set of optimized parameters. For the set of benchmarks.Conclusion A method to systematically select the parameters of the SA algorithm for the application mapping problem is proposed. . significantly less number of evaluations are processed in the POSA and the SA algorithm is accelerated. The accelerated POSA algorithm achieves comparable energy consumption with NoCMap. the POSA obtains the same quality mapping solutions while using less than 1% of iterations of NoCMap and achieving an average of 237 times of speedup.

Thank for your attention! Comments and Questions .

- Material View[1]Uploaded byImagine Imm
- ChaptersUploaded byKrushnasamy Suramaniyan
- Budynas SM Ch01Uploaded byHarold Aguirre
- Introduction to AI and the perceptronUploaded byRaghav Gupta
- 23DemoPartitioning.pdfUploaded bybui ton
- Prime NumberUploaded bytriptiag
- Mopso GuideUploaded byHarshyt GOel
- 304Various Image Segmentation Method for Underwater Acoustic Image a Survey PDFUploaded byprabhabathi devi
- PQUploaded byPranjal Verma
- Efficient Formation of Storage Classes For Warehouse Storage LocationUploaded byhmtnsl
- syntax wavelet.docxUploaded byAzhar Muttaqin
- Corlett Method for Solving CyphersUploaded bytixiercat
- SKEMA 1449_Mathematics SPM.docUploaded bymizah1306
- Particle Swarm Optimization-Meander Line PolarizerUploaded bynstl0101
- Filter FirUploaded byKhurram Sami
- Sample Reporting PostedUploaded bySammy Marquez
- limit1Uploaded byrreee
- 05_table of ContentsUploaded bysankarsada
- DynProg-1 (1)Uploaded byDaniel Wilson
- Deng et alUploaded bybatlounis4932
- mvg_exercisesheet_01Uploaded bylukkio
- derivatives - Solve the differential equation_ $(y^2-xy)dx+x^2dy=0$ - Mathematics Stack ExchangeUploaded byAbhishek Patra
- Final_Student Practice Sheet _AHP ExampleUploaded bykpmits
- [Blair]_Convex Optimization and Lagrange Multipliers (1977)Uploaded byKopaka Lewa
- Z transformUploaded byPrakash Chandrasekaran
- Fett(1998)Uploaded byElizabeth Santiago
- aalg8Uploaded byKfor Kritika
- Mathematics 2010Uploaded byAmirtaraj
- ralhan2013.pdfUploaded bytigerman man
- DSP Lab 1-5.docxUploaded bySmit Majithia