Simultaneous and Heterogeneous Multithreading

Simultaneous and heterogeneous multithreading (SHMT) is a framework that optimizes heterogeneous computing systems by utilizing various processors like CPUs, GPUs, and TPUs to improve parallelism and resource utilization. It dynamically maps virtual processors to physical ones, allowing independent subtasks to run on appropriate processors, resulting in a 1.95X performance increase and 51% energy reduction compared to conventional systems. The framework was benchmarked using a modified smartphone configuration resembling a data center server, demonstrating significant efficiency gains across multiple tasks.

Simultaneous and heterogeneous multithreading (SHMT) is a software framework that takes
advantage of heterogeneous computing systems that contain a mixture of central processing units (CPUs),
graphics processing units (GPUs), and special-purpose machine learning hardware, such as Tensor
Processing Units (TPUs).[1][2]

Each component processes information differently. Often data has to move among processors, which can
create bottlenecks, with one processor starving while waiting on another to finish.[1]

Architecture
The system defines virtual processors and virtual operations (VOPs). Each VOP decomposes into one or more
high-level operations (HLOPs). The system then distributes the HLOPs across the processors. The runtime
system dynamically maps virtual processors to physical processors, assessing resource availability in
order to keep all the processors busy. The scheduler employs a lightweight, quality-aware work-stealing
(QAWS) policy.[1]
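
The decomposition and stealing steps described above can be illustrated with a toy model. The following is a minimal sketch, not the SHMT implementation: the class names, the integer "quality" levels, and the rule that a device may only steal an HLOP whose minimum quality it meets are all assumptions made for illustration. The source states only that the policy is quality-aware and based on work stealing.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class HLOP:
    name: str
    min_quality: int  # hypothetical: lowest precision level this HLOP tolerates


@dataclass
class Device:
    name: str
    quality: int  # hypothetical: precision level this device delivers
    queue: deque = field(default_factory=deque)


def split_vop(vop_name, n_parts, min_quality):
    """Decompose one VOP into n_parts HLOPs (here: equal, independent slices)."""
    return [HLOP(f"{vop_name}[{i}]", min_quality) for i in range(n_parts)]


def next_hlop(device, devices):
    """Take local work first; otherwise steal an HLOP this device may run."""
    if device.queue:
        return device.queue.popleft()
    # Prefer the most heavily loaded victim.
    victims = sorted(
        (d for d in devices if d is not device),
        key=lambda d: len(d.queue),
        reverse=True,
    )
    for victim in victims:
        # Scan from the tail, the end the owner will reach last.
        for i in range(len(victim.queue) - 1, -1, -1):
            if device.quality >= victim.queue[i].min_quality:
                stolen = victim.queue[i]
                del victim.queue[i]
                return stolen
    return None


def run(devices):
    """Round-robin simulation: each device runs at most one HLOP per step."""
    log = []
    busy = True
    while busy:
        busy = False
        for device in devices:
            hlop = next_hlop(device, devices)
            if hlop is not None:
                log.append((device.name, hlop.name))
                busy = True
    return log


# Illustrative workload: the VOP names and quality levels are made up.
cpu = Device("cpu", quality=2, queue=deque(split_vop("fft", 2, min_quality=2)))
gpu = Device("gpu", quality=2, queue=deque(split_vop("sobel", 4, min_quality=1)))
tpu = Device("tpu", quality=1)  # starts idle, so it must steal
log = run([cpu, gpu, tpu])
for device_name, hlop_name in log:
    print(device_name, hlop_name)
```

In this toy run the idle TPU picks up slices of the quality-tolerant VOP from the GPU's queue and never touches the slices that demand higher quality, which is the intuition behind keeping every processor busy without degrading results.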

Conventional runtimes assign one processor (or set of processors) to each subtask, leaving other types of processors
idle. In other words, the CPU(s) run the first subtask (possibly in parallel); when that subtask completes, the next
subtask is handed to the GPU(s). When they finish, the following subtask is handed to the TPU(s).[2]

Adding software pipelining allows the second subtask to run using partial results from the first subtask,
which improves resource utilization.[2]
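
The difference between the conventional hand-off and software pipelining can be made concrete with a simple timing model. This is a sketch under stated assumptions: each subtask (stage) runs on its own processor type, the input is cut into equal chunks, and the per-chunk stage times are invented numbers, not measurements from the paper.

```python
def serial_makespan(stage_times, n_chunks):
    """Conventional hand-off: a stage processes every chunk before the next starts."""
    return sum(t * n_chunks for t in stage_times)


def pipelined_makespan(stage_times, n_chunks):
    """Software pipelining: a stage starts on a chunk as soon as the previous
    stage has produced it and the stage's own processor is free."""
    stage_free_at = [0.0] * len(stage_times)
    for _ in range(n_chunks):
        chunk_ready_at = 0.0
        for stage, t in enumerate(stage_times):
            start = max(chunk_ready_at, stage_free_at[stage])
            stage_free_at[stage] = start + t
            chunk_ready_at = stage_free_at[stage]
    return stage_free_at[-1]


# Hypothetical per-chunk times for a CPU stage, a GPU stage, and a TPU stage.
stage_times = [2.0, 3.0, 1.0]
print(serial_makespan(stage_times, n_chunks=4))     # 24.0
print(pipelined_makespan(stage_times, n_chunks=4))  # 15.0
```

Even with pipelining, the slowest stage bounds throughput, and the other processors still wait on it for part of the run.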

SHMT takes things a step further, identifying subtasks that can run independently of others and assigning them to the
appropriate processor type, allowing even better parallelism. Some subtasks can be performed on multiple
processor types, and SHMT can divide a single such subtask across those processor types. Thus the fundamental
breakthrough is to keep more processors working simultaneously, reducing time and energy costs.[2]
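
Dividing one subtask across several processor types can be sketched as a proportional split. The example below is illustrative only: it assumes the subtask consists of independent, equally sized work items and that each device has a fixed, known throughput. The throughput figures are invented, and the actual SHMT runtime balances load dynamically rather than with a static split.

```python
def split_work(total_items, throughput):
    """Give each device a share of the items proportional to its throughput."""
    total_rate = sum(throughput.values())
    shares = {
        dev: (total_items * rate) // total_rate
        for dev, rate in throughput.items()
    }
    # Integer division may leave a few items over; hand them to the fastest device.
    leftover = total_items - sum(shares.values())
    fastest = max(throughput, key=throughput.get)
    shares[fastest] += leftover
    return shares


def finish_time(shares, throughput):
    """The subtask is done when the slowest-finishing device is done."""
    return max(shares[dev] / throughput[dev] for dev in shares)


# Hypothetical throughputs in work items per time unit.
throughput = {"cpu": 1, "gpu": 3, "tpu": 4}
shares = split_work(1000, throughput)
print(shares)                           # {'cpu': 125, 'gpu': 375, 'tpu': 500}
print(finish_time(shares, throughput))  # 125.0
print(1000 / throughput["tpu"])         # 250.0 if the fastest device worked alone
```

With these made-up rates, all three devices finish together, and the subtask completes in half the time the fastest device would need alone.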

Benchmark
Researchers tested the concept using a typical smartphone configuration tweaked so that it resembled a
data center server.[1]

The hardware was Nvidia's Jetson Nano module containing a quad-core ARM Cortex-A57 processor
(CPU) and 128 Maxwell architecture GPU cores. A Google Edge TPU was connected via its M.2 Key E
slot. The processors communicated via an onboard PCI Express (PCIe) interface. Shared data was hosted
in 4 GB of 64-bit LPDDR4 memory. The Edge TPU adds 8 MB of device memory. Ubuntu Linux 18.04 was the
operating system.[1]

Compared to a conventional system, performance increased by 1.95X, while energy consumption
was reduced by 51%, on a range of benchmarks, including Black–Scholes, DCT8X8, DWT, FFT,
Histogram, Hotspot, Laplacian, MF, Sobel, and SRAD, along with their geometric mean (GMEAN).[1]
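
The two headline figures can be related with a short calculation. This is a back-of-the-envelope sketch that assumes both numbers describe the same workloads against the same baseline and that energy equals average power multiplied by run time. It is not an analysis taken from the cited sources.

```python
speedup = 1.95           # reported performance increase
energy_reduction = 0.51  # reported reduction in energy consumption

time_ratio = 1 / speedup               # run time relative to the baseline
energy_ratio = 1 - energy_reduction    # energy relative to the baseline
power_ratio = energy_ratio / time_ratio  # implied average power relative to baseline

print(f"run time:      {time_ratio:.3f} of baseline")
print(f"energy:        {energy_ratio:.3f} of baseline")
print(f"average power: {power_ratio:.3f} of baseline")
```

Under these assumptions the implied average power draw is close to that of the baseline (about 0.96 of it), which suggests that most of the energy saving comes from finishing the work sooner.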

See also
Asymmetric multiprocessing
Instruction-level parallelism (ILP)
Parallel computing
Simultaneous multithreading
Superscalar processor
Symmetric multiprocessing (SMP)
Variable SMP
Thread (computing)

References
1. McClure, Paul (February 22, 2024). "Software tweak doubles computer processing speed,
halves energy use" (https://newatlas.com/computers/smht-parallel-processing/). New Atlas.
Retrieved 2024-02-25.
2. Hsu, Kuan-Chieh; Tseng, Hung-Wei (2023-12-08). "Simultaneous and Heterogenous
Multithreading". 56th Annual IEEE/ACM International Symposium on Microarchitecture.
MICRO '23. New York, NY, USA: Association for Computing Machinery. pp. 137–152.
doi:10.1145/3613424.3614285 (https://doi.org/10.1145%2F3613424.3614285).
ISBN 979-8-4007-0329-4.
