Welcome to Scribd!

Machine Learned Machines: Reinforcement Learning Exploration For Architecture Co-Optimization

Uploaded by

0% found this document useful (0 votes)

17 views2 pages

This document discusses using reinforcement learning techniques for co-optimizing system resources like cache partitioning and voltage/frequency scaling to improve performance metrics. It proposes two approaches: 1) Machine Learned Machines (MLM) which uses reinforcement learning to optimize energy-delay-product by dynamically partitioning cache and scaling cores/uncore voltage/frequency. 2) Machine Learned Caches (MLC) which optimizes system throughput by co-partitioning multiple cache levels using coordinated multi-agent reinforcement learning. Evaluation shows MLM improves energy-delay-product by up to 21.7% with less than 5% throughput degradation, while MLC improves throughput by 9.35% with 13.5% energy-delay-product improvement

Original Description:

Machine Learned Machine, IIT Delhi

Original Title

MLM Abstract

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

17 views2 pages

Machine Learned Machines: Reinforcement Learning Exploration For Architecture Co-Optimization

Uploaded by

Rahul

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 2

Search inside document

Abstract

Machine Learned Machines: Reinforcement Learning Exploration for

Architecture Co-optimization
Rahul Jain
Indian Institute of Technology Delhi
rahuljain9@gmail.com

Modern multi-core systems provide huge computational capabilities which can be used to
run multiple processes concurrently. The integration of multiple cores on a single chip has
resulted in very large performance gap between the processor and the on-chip memory. This
has resulted in multiple levels of on-chip caches which are often shared among multiple
applications executing simultaneously on various cores. The increasing core count results in
larger shared resources like the last level cache, interconnect network, memory controllers,
etc. To achieve the best possible performance within limited power budget the various
system resources need to be allocated effectively. Any mismatch between runtime resource
requirement and allocation results in a sub-optimal performance.

Different optimization techniques exist for addressing the problem of mismatch between the
dynamic requirement and runtime allocation of the system resources. Choosing between
multiple optimizations at runtime is complex due to the non-additive effects, making the
scenario suitable for the application of machine learning techniques. We have proposed and
evaluated different reinforcement learning (RL) based techniques targeting multiple resource
allocation at runtime. We study two different co-optimization problems and show that the
co-optimization techniques result in better system metrics than any of the techniques applied
individually. We study the application of reinforcement learning (RL) techniques to the
various architecture optimization problems. The RL technique is capable of learning the
utility function of the various resource allocations at runtime by interacting with the
architecture. We have explored multiple agent RL models based on independent,
cooperative and coordinated learners.

The first co-optimization problem is targeted towards optimization of the system

energy-delay-product (EDP). We present a novel method, Machine Learned Machines
(MLM), by using Online Reinforcement Learning (RL) to perform dynamic partitioning of the
last level cache (LLC), along with dynamic voltage and frequency scaling (DVFS) of the
cores and uncore (interconnection network and LLC). The DVFS+DCP co-optimization
problem is modeled using independent, cooperative and coordinated multi-agent learners
and compared on different system metrics.

The second co-optimization problem is targeted towards optimization of the system

throughput (STP).We propose a novel problem of dynamic cache co-partitioning (DCCP) of
the multiple cache levels and present a coordinated multi-agent RL technique, Machine
Learned Caches (MLC).MLC results in better system metrics compared to the independent
application of the dynamic cache partitioning (DCP) techniques at multiple cache levels.
The explored machine learning techniques are found to be effective in handling the different
co-optimization problems resulting in better system metrics compared to state-of-the-art
research work. We study the various co-optimization techniques targeting different system
metrics and study their effect on system EDP, system throughput (STP), and Fairness. The
proposed MLM co-optimization techniques are effective in improving the system EDP by up
to 21.7% with less than 5% STP degradation on a 4-core system. The proposed MLC
co-partitioning exhibited 9.35% STP improvement with 13.5% system EDP improvement on
an 8-SMT core (4 physical core) system.

ARINC Standards Document List
Document45 pages
ARINC Standards Document List
indra_wirasendjaja
No ratings yet
2009, Multi-Core Programming For Medical Imaging PDF
Document163 pages
2009, Multi-Core Programming For Medical Imaging PDF
Anatoli Krasilnikov
No ratings yet
2-INTRODUCTION TO PDC - MOTIVATION - KEY CONCEPTS-03-Dec-2019Material - I - 03-Dec-2019 - Module - 1 PDF
Document63 pages
2-INTRODUCTION TO PDC - MOTIVATION - KEY CONCEPTS-03-Dec-2019Material - I - 03-Dec-2019 - Module - 1 PDF
ANTHONY NIKHIL REDDY
No ratings yet
JavaScript Drag and Drop
Document38 pages
JavaScript Drag and Drop
RAVI Kumar
No ratings yet
High Performance Computing
Document164 pages
High Performance Computing
tpabbas
100% (2)
Multicore Software Development Techniques: Applications, Tips, and Tricks
From Everand
Multicore Software Development Techniques: Applications, Tips, and Tricks
Robert Oshana
Rating: 2.5 out of 5 stars
2.5/5 (2)
FA1 - Laboratory Exercise
Document28 pages
FA1 - Laboratory Exercise
Nuriel Aguilar
No ratings yet
Materi Sertifikasi TI UB
Document138 pages
Materi Sertifikasi TI UB
Maika
100% (2)
1 PB
Document9 pages
1 PB
Okwuogu Chisom
No ratings yet
HPC Notes
Document24 pages
HPC Notes
ShadowOP
No ratings yet
A Systematic Approach To Composing and Optimizing Application Workflows
Document9 pages
A Systematic Approach To Composing and Optimizing Application Workflows
Leo Kwee Wah
No ratings yet
Optimal Computing Performance With Symmetric Multiprocessors
Document8 pages
Optimal Computing Performance With Symmetric Multiprocessors
Death Angel
No ratings yet
Dynamic Data Management Among
Document11 pages
Dynamic Data Management Among
CS & IT
No ratings yet
A Survey of Clustering Algorithms Based On Parallel Mechanism
Document4 pages
A Survey of Clustering Algorithms Based On Parallel Mechanism
Divyashree B
No ratings yet
EMBA: Efficient Memory Bandwidth Allocation To Improve Performance On Intel Commodity Processor
Document12 pages
EMBA: Efficient Memory Bandwidth Allocation To Improve Performance On Intel Commodity Processor
SHREYA
No ratings yet
Week 5 - The Impact of Multi-Core Computing On Computational Optimization
Document11 pages
Week 5 - The Impact of Multi-Core Computing On Computational Optimization
Game Account
No ratings yet
Parallel Processing: Types of Parallelism
Document7 pages
Parallel Processing: Types of Parallelism
Rupesh Mishra
No ratings yet
Analysis and Simulation of Mixed-Technology VLSI Systems
Document26 pages
Analysis and Simulation of Mixed-Technology VLSI Systems
coralonso
No ratings yet
Computer Cluster: Definition - What Does Mean?
Document1 page
Computer Cluster: Definition - What Does Mean?
Kristel Marasigan
No ratings yet
CSE 423 Virtualization and Cloud Computinglecture0
Document16 pages
CSE 423 Virtualization and Cloud Computinglecture0
Shahad Shahul
No ratings yet
1
Document11 pages
1
aiadaiad
No ratings yet
A Light-Weight Approach To Dynamical Runtime Linking Supporting Heterogenous, Parallel, and Reconfigurable Architectures
Document12 pages
A Light-Weight Approach To Dynamical Runtime Linking Supporting Heterogenous, Parallel, and Reconfigurable Architectures
Abhishek Chaturvedi
No ratings yet
Scalable Hierarchical Aggregation Protocol SHArP A Hardware Architecture For Efficient Data Reduction
Document10 pages
Scalable Hierarchical Aggregation Protocol SHArP A Hardware Architecture For Efficient Data Reduction
lizarrd
No ratings yet
Watercolor Organic Shapes SlidesMania
Document23 pages
Watercolor Organic Shapes SlidesMania
Ayush kumar
No ratings yet
Hardware-Based Job Queue Management For Manycore Architectures and Openmp Environments
Document12 pages
Hardware-Based Job Queue Management For Manycore Architectures and Openmp Environments
Manali Bhutiyani
No ratings yet
Node Node Node Node: 1 Parallel Programming Models 5
Document5 pages
Node Node Node Node: 1 Parallel Programming Models 5
latinwolf
No ratings yet
Cluster Computing: A Paper Presentation On
Document16 pages
Cluster Computing: A Paper Presentation On
Vishal Jaiswal
No ratings yet
C4-IEEE ParallelRT
Document8 pages
C4-IEEE ParallelRT
Lordu Gamer
No ratings yet
Research Paper PDF
Document6 pages
Research Paper PDF
Nemra
No ratings yet
A Case For Courseware: Trixie Mendeer and Debra Rohr
Document4 pages
A Case For Courseware: Trixie Mendeer and Debra Rohr
frida
No ratings yet
Chapter 1
Document25 pages
Chapter 1
chinmay
No ratings yet
Assignment: Parallel and Distributed Computing Submitted To: Sir Shoaib Date: 25-03-2019
Document5 pages
Assignment: Parallel and Distributed Computing Submitted To: Sir Shoaib Date: 25-03-2019
Umer
No ratings yet
Multicore Resource Managment
Document6 pages
Multicore Resource Managment
Haris Shamim
No ratings yet
A Review On Use of MPI in Parallel Algorithms: IPASJ International Journal of Computer Science (IIJCS)
Document8 pages
A Review On Use of MPI in Parallel Algorithms: IPASJ International Journal of Computer Science (IIJCS)
International Journal of Application or Innovation in Engineering & Management
No ratings yet
A Survey of Parallel Programming Models and Tools in The Multi and Many-Core Era
Document18 pages
A Survey of Parallel Programming Models and Tools in The Multi and Many-Core Era
ramon91
No ratings yet
Qthreads PDF
Document8 pages
Qthreads PDF
Stefano Nefasto
No ratings yet
Tensorflow: A System For Large-Scale Machine Learning
Document21 pages
Tensorflow: A System For Large-Scale Machine Learning
patilrushal824
No ratings yet
Performance Analysis of Parallel Algorithms On Mul
Document11 pages
Performance Analysis of Parallel Algorithms On Mul
RIYA GUPTA
No ratings yet
Module 5
Document40 pages
Module 5
1HK16CS104 Muntazir Hussain Bhat
No ratings yet
1 of 1 PDF
Document7 pages
1 of 1 PDF
patasutosh
No ratings yet
Introduction To Parallel Computing: John Von Neumann Institute For Computing
Document18 pages
Introduction To Parallel Computing: John Von Neumann Institute For Computing
sandeep_nagar29
No ratings yet
FPGA Based
Document7 pages
FPGA Based
Pushpa Latha
No ratings yet
Unit 5
Document14 pages
Unit 5
aa
No ratings yet
Research Paper On Distributed Operating System
Document4 pages
Research Paper On Distributed Operating System
aflbtcxfc
100% (1)
Module 1 - New
Document59 pages
Module 1 - New
Bantu Aadhf
No ratings yet
Lyu 2019
Document8 pages
Lyu 2019
vchicav
No ratings yet
2022 (001) - Low-Power Algorithm On Generic
Document7 pages
2022 (001) - Low-Power Algorithm On Generic
fsmondiolatiro
No ratings yet
Rameshwaraiah - 2021 - IOP - Conf. - Ser. - Mater. - Sci. - Eng. - 1074 - 012015
Document6 pages
Rameshwaraiah - 2021 - IOP - Conf. - Ser. - Mater. - Sci. - Eng. - 1074 - 012015
Elmustafa Sayed Ali Ahmed
No ratings yet
Autosys: The Design and Operation of Learning-Augmented Systems
Document15 pages
Autosys: The Design and Operation of Learning-Augmented Systems
Pramod
No ratings yet
Qualitative Study On The Efficiency of Load Balancing Algorithms in Cloud Environment
Document4 pages
Qualitative Study On The Efficiency of Load Balancing Algorithms in Cloud Environment
International Organization of Scientific Research (IOSR)
No ratings yet
Article 1 R-E-W Heuristic
Document11 pages
Article 1 R-E-W Heuristic
helin
No ratings yet
Rubio Campillo2018
Document3 pages
Rubio Campillo2018
Aiman
No ratings yet
ICAC08
Document10 pages
ICAC08
Kevin Mondragon
No ratings yet
NTSD-ass3
Document3 pages
NTSD-ass3
Md. Ziaul Haque Shipon
No ratings yet
HPC2
Document12 pages
HPC2
Sid-ahmed KHOBZAOUI
No ratings yet
PROJECT TOPIC: Performance Analysis of Load Balancing Algorithms in Cloud Heterogeneous Environment
Document2 pages
PROJECT TOPIC: Performance Analysis of Load Balancing Algorithms in Cloud Heterogeneous Environment
Lipi Seth
No ratings yet
A Data Mining Approach For Unification of Association Rules in Distributed and Parallel Databases
Document5 pages
A Data Mining Approach For Unification of Association Rules in Distributed and Parallel Databases
International Journal of Application or Innovation in Engineering & Management
No ratings yet
Lecture 2 (Parallelism)
Document7 pages
Lecture 2 (Parallelism)
hussiandavid26
No ratings yet
Serial and Parallel First 3 Lecture
Document17 pages
Serial and Parallel First 3 Lecture
Asif Khan
No ratings yet
(HPC) Pratik
Document8 pages
(HPC) Pratik
60Abhishek Kalpavruksha
No ratings yet
Understanding The Impact of Multi-Core Architecture in Cluster Computing: A Case Study With Intel Dual-Core System
Document8 pages
Understanding The Impact of Multi-Core Architecture in Cluster Computing: A Case Study With Intel Dual-Core System
hfarrukhn
No ratings yet
Parallel and Distributed Computing Systems
Document57 pages
Parallel and Distributed Computing Systems
srishtityagi2020
No ratings yet
HTAM
Document30 pages
HTAM
niteshtripathi_jobs
100% (1)
Massively Scalable Density Based Clustering (DBSCAN) On The HPCC Systems Big Data Platform
Document8 pages
Massively Scalable Density Based Clustering (DBSCAN) On The HPCC Systems Big Data Platform
IAES IJAI
No ratings yet
405 LS Operator's Manual
Document240 pages
405 LS Operator's Manual
Matovu Herbert
No ratings yet
99 157055 B Sailor 6120 Ssa System Type Approval Certificate DNV
Document3 pages
99 157055 B Sailor 6120 Ssa System Type Approval Certificate DNV
Anıl Kahya
No ratings yet
Lec 5 Use Case Modelling
Document37 pages
Lec 5 Use Case Modelling
Moeen Khan
No ratings yet
80290354UK
Document292 pages
80290354UK
Weslei Cunha
No ratings yet
Ig2 Mock2 Ict Paper3 MS
Document8 pages
Ig2 Mock2 Ict Paper3 MS
Sakthipreya V
No ratings yet
Mycoursefree - Click - 001 Microsoft-Word-Password-Cracking-with-John
Document22 pages
Mycoursefree - Click - 001 Microsoft-Word-Password-Cracking-with-John
alrames262
No ratings yet
Information Systems Analysis: Topic 7: Process-Oriented IS Methodologies
Document24 pages
Information Systems Analysis: Topic 7: Process-Oriented IS Methodologies
Akuzike Nguku
No ratings yet
List of E-Learning - B
Document3 pages
List of E-Learning - B
MostafaElrakhawy
No ratings yet
USB Focus Motor Controller User Manual: Model FCUSB
Document7 pages
USB Focus Motor Controller User Manual: Model FCUSB
falmven
No ratings yet
BOM 11i Training Presentation
Document47 pages
BOM 11i Training Presentation
appsloader
No ratings yet
Kalpana's Resume
Document6 pages
Kalpana's Resume
Manjunath
No ratings yet
Python Notes-1
Document178 pages
Python Notes-1
harini thiyagarajan
No ratings yet
Homework 2 Solution
Document7 pages
Homework 2 Solution
Edward Lee
No ratings yet
Comptia Linux Xk0 005 Exam Cram 1St Edition William Rothwell Full Chapter
Document67 pages
Comptia Linux Xk0 005 Exam Cram 1St Edition William Rothwell Full Chapter
sherman.prichard680
100% (4)
TSPSC Aee-General Studies Key
Document41 pages
TSPSC Aee-General Studies Key
darimadugu
No ratings yet
Service
Document3,393 pages
Service
Maria Fedalto
No ratings yet
DIGSI5 Product Information V09.30 Manual C001-T
Document46 pages
DIGSI5 Product Information V09.30 Manual C001-T
enriq7
No ratings yet
Hackathon Statements V1
Document10 pages
Hackathon Statements V1
Aayush
No ratings yet
Dolby CineAsset Player - ProductSheet
Document2 pages
Dolby CineAsset Player - ProductSheet
Darius White
No ratings yet
MD-100.exam.24q: Website: VCE To PDF Converter: Facebook: Twitter
Document25 pages
MD-100.exam.24q: Website: VCE To PDF Converter: Facebook: Twitter
Willson Isaac Barrueta Barrueta
No ratings yet
Inter IIT Cult Meet 5.0 Creative Arts Rulebook
Document9 pages
Inter IIT Cult Meet 5.0 Creative Arts Rulebook
Adwit Barun
No ratings yet
Webspace Session Issue
Document2 pages
Webspace Session Issue
Gopalakrishnan Gokul
No ratings yet
Struct Int Struct Struct Struct Void Int Struct Struct Sizeof Struct Struct Struct
Document23 pages
Struct Int Struct Struct Struct Void Int Struct Struct Sizeof Struct Struct Struct
Marshall
No ratings yet
UriSed 3 PRO Sw4.0 Service Manual v1
Document248 pages
UriSed 3 PRO Sw4.0 Service Manual v1
Orlando
No ratings yet
Abap 7.40 PDF
Document24 pages
Abap 7.40 PDF
anil
No ratings yet
Purvi
Document56 pages
Purvi
Purvi Sharma
No ratings yet