K computer
Supercomputer Fugaku
© RIKEN
Approaches

Develop
1. High-performance Arm CPU A64FX in HPC and AI areas
2. Cutting-edge hardware design
3. System software stack

Achieve
- High performance in real applications
- High efficiency in key features for AI applications
[Figure: A64FX chip layout showing cores, L2 caches, a ring bus, and HBM2 interfaces]

Peak performance (chip level), in TOPS, by element size
(the chart marks HPC and AI regions along the element-size axis)

Element size    A64FX (Fugaku)    SPARC64 VIIIfx (K computer)
64 bits         2.7+              0.128
32 bits         5.4+              0.128
16 bits         10.8+             N/A
8 bits          21.6+             N/A
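
The A64FX values in the chart double each time the element size halves. The following is a minimal sketch of that pattern, assuming (the slide does not state this) that peak throughput scales inversely with element size from the 64-bit figure. The names `BASE_TOPS_64BIT` and `peak_tops` are illustrative, and the "+" on each chart value marks it as a lower bound.

```python
# Chip-level peak at 64-bit elements, read from the chart ("2.7+", a lower bound).
BASE_TOPS_64BIT = 2.7


def peak_tops(element_bits: int) -> float:
    """Estimate peak TOPS, assuming throughput doubles when element size halves."""
    if element_bits not in (64, 32, 16, 8):
        raise ValueError("element size must be 64, 32, 16, or 8 bits")
    # 64-bit is the baseline; narrower elements fit more lanes per operation.
    return BASE_TOPS_64BIT * (64 // element_bits)


for bits in (64, 32, 16, 8):
    print(f"{bits:>2} bits: {peak_tops(bits):.1f}+ TOPS")
```

Under this assumption the estimates reproduce the four A64FX values shown in the chart.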
Configuration    1x rack including SSDs     80x compute racks & 20x disk racks
Nodes            384                        8,160
Footprint        1.1 m² (0.8 m x 1.4 m)     128 m² (4 m x 32 m)
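
The footprints above follow from the stated dimensions, and dividing the node counts by them gives a rough node density for each configuration. This is a small sketch of that arithmetic. The keys `one_rack` and `multi_rack` and the function names are illustrative labels, not terms from the source.

```python
# Node counts and floor dimensions copied from the comparison above.
# "one_rack" and "multi_rack" are hypothetical labels for its two columns.
configs = {
    "one_rack": {"nodes": 384, "width_m": 0.8, "depth_m": 1.4},
    "multi_rack": {"nodes": 8160, "width_m": 4.0, "depth_m": 32.0},
}


def footprint_m2(cfg: dict) -> float:
    """Floor area as width times depth."""
    return cfg["width_m"] * cfg["depth_m"]


def nodes_per_m2(cfg: dict) -> float:
    """Node count divided by floor area."""
    return cfg["nodes"] / footprint_m2(cfg)


for name, cfg in configs.items():
    print(f"{name}: {footprint_m2(cfg):.2f} m2, {nodes_per_m2(cfg):.1f} nodes/m2")
```

The one-rack area computes to 1.12 m², which the slide rounds to 1.1 m².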
[Figure labels: QSFP28 (X), QSFP28 (Y), and QSFP28 (Z) ports for AOC TofuD cables; mate connectors for electrical signals and water]

Nodes                  1         2         16       48        384     150k+
Performance [Flops]    2.7 T+    5.4 T+    43 T+    129 T+    1 P+    400 P+
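
The performance row is consistent with multiplying the single-node peak by the node count. This is a minimal sketch of that scaling, assuming purely linear aggregation of the per-node figure and taking 150,000 as a stand-in for "150k+". The names `PER_NODE_TFLOPS`, `aggregate_tflops`, and `format_flops` are illustrative.

```python
# Single-node peak from the source ("2.7 T+", a lower bound), in TFlops.
PER_NODE_TFLOPS = 2.7


def aggregate_tflops(nodes: int) -> float:
    """Aggregate peak, assuming it grows linearly with node count."""
    return PER_NODE_TFLOPS * nodes


def format_flops(tflops: float) -> str:
    """Render TFlops as 'T' below 1000 and as 'P' (PFlops) from 1000 up."""
    if tflops >= 1000:
        return f"{tflops / 1000:.2f} P"
    return f"{tflops:.1f} T"


# 150_000 stands in for the "150k+" entry; the real count is only bounded below.
for nodes in (1, 2, 16, 48, 384, 150_000):
    print(f"{nodes:>7} nodes: {format_flops(aggregate_tflops(nodes))}+")
```

The computed values (43.2 T, 129.6 T, about 1.04 P, and 405 P) sit at or above the rounded lower bounds on the slide.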
Fugaku applications
Fujitsu Technical Computing Suite / RIKEN developing system software
[Chart, higher is better] Relative performance*
  Skylake (Xeon Platinum 8168), 2 CPUs      1.00
  A64FX, 1 CPU with source tuning           1.56
* Normalized by the average elapsed time per timestep of Skylake

[Chart, higher is better] Performance in GFlops
  Skylake (Xeon Platinum 8168), 2 CPUs      85
  FX100, 1 CPU                              103
  A64FX, 1 CPU                              346
  SX-Aurora TSUBASA, 1 VE†                  286
  Tesla V100, 1 GPU†                        305
† Performance evaluation of a vector supercomputer SX-aurora TSUBASA, https://dl.acm.org/citation.cfm?id=3291728
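
A "higher is better" score normalized by elapsed time is commonly computed as the baseline's time divided by the candidate's time. This is a minimal sketch of that convention, assuming it is what the chart uses. The function name and the timing values are hypothetical, chosen only so that the ratio comes out at the 1.56 shown for A64FX.

```python
def relative_performance(baseline_time_s: float, candidate_time_s: float) -> float:
    """Speed relative to a baseline: above 1.0 means the candidate is faster."""
    if baseline_time_s <= 0 or candidate_time_s <= 0:
        raise ValueError("elapsed times must be positive")
    return baseline_time_s / candidate_time_s


# Hypothetical per-timestep timings; only their ratio is meaningful here.
skylake_time_s = 1.56
a64fx_time_s = 1.00

print(f"Skylake: {relative_performance(skylake_time_s, skylake_time_s):.2f}")
print(f"A64FX:   {relative_performance(skylake_time_s, a64fx_time_s):.2f}")
```

Read this way, a score of 1.56 corresponds to the A64FX run taking roughly 64% of the Skylake time per timestep.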