Professional Documents
Culture Documents
February 2023
Overview
CPU
Memory
Storage
Interconnect
Graphic Card
GPU for Computing
Example SPECS
Hardware & HPC Partners
Overview
• Focuses on getting started with hardware selections for Ansys applications.
• X86-64 processor from Intel and AMD based hardware only
• Ansys is hardware vendor neutral and partners with all major hardware vendors
L40 Ada Lovelace 22Q3 AD102 8.9 18,176 142 GDDR6 48 864 90.5 1.4 300 PCIe Gen4 x16 Double TSMC 4nm
H100 SXM Hopper 22Q1 GH100 9.0 16,896 132 HBM3 80 3,430 67 34 700 SXM5 TSMC 4nm
H100 PCIe Hopper 22Q1 GH100 9.0 14,592 114 HBM2e 80 2,048 51 26 350 PCIe Gen5 x16 Double TSMC 4nm
A100 80GB SXM Ampere 20Q2 GA100 8.0 6,912 108 HBM2e 80 2,039 19.5 9.7 400 SXM4 TSMC 7nm
A100 80GB PCIe Ampere 20Q2 GA100 8.0 6,912 108 HBM2e 80 1,935 19.5 9.7 300 PCIe Gen4 x16 Double TSMC 7nm
A100 40GB SXM Ampere 20Q2 GA100 8.0 6,912 108 HBM2 40 1,555 19.5 9.7 400 SXM4 TSMC 7nm
A100 40GB PCIe Ampere 20Q2 GA100 8.0 6,912 108 HBM2 40 1,555 19.5 9.7 250 PCIe Gen4 x16 Double TSMC 7nm
A40 Ampere 20Q4 GA102 8.6 10,752 84 GDDR6 48 696 37.4 0.6 300 PCIe Gen4 x16 Double Samsung 8nm
A30 Ampere 21Q2 GA100 8.0 3,584 56 HBM2 24 933 10.3 5.2 165 PCIe Gen4 x16 Double TSMC 7nm
A16 Ampere 21Q2 4X GA107 8.6 4X 1,280 4X 10 GDDR6 4X 16 4X 200 4X 4.5 4X 0.14 250 PCIe Gen4 x16 Double Samsung 8nm
A10 Ampere 21Q2 GA102 8.6 9,216 72 GDDR6 24 600 31.2 0.98 150 PCIe Gen4 x16 Single Samsung 8nm
A2 Ampere 21Q4 GA107 8.6 1028 10 GDDR6 16 200 4.5 0.07 60 PCIe Gen4 x8 Single Samsung 8nm
T4 Turing 18Q3 TU104 7.5 2,560 40 GDDR6 16 300 8.1 0.25 70 PCIe Gen3 x16 Single TSMC 12nm
Quadro RTX 8000 Turing 18Q3 TU102 7.5 4608 72 GDDR6 48 624 14.9 0.47 250 PCIe Gen3 x16 Double TSMC 12nm
Quadro RTX 6000 Turning 18Q3 TU102 7.5 4608 72 GDDR6 24 624 14.9 0.45 250 PCIe Gen3 x16 Double TSMC 12nm
V100 SXM Volta 17Q2 GV100 7.0 5,120 80 HBM2 32 / 16 900 15.7 7.8 300 SXM3 TSMC 12nm
V100 PCIe Volta 17Q2 GV100 7.0 5,120 80 HBM2 32 /16 900 14 7 250 PCIe Gen3 x16 Double TSMC 12nm
V100S Volta 19Q4 GV100 7.0 5120 80 HBM2 32 1,134 16.4 8.2 250 PCIe Gen3 x16 Double TSMC 12nm
P100 SXM Pascal 16Q2 GP100 6.0 3584 56 HBM2 16 732 10.6 5.3 300 SXM2 TSMC 16nm
P100 PCIe Pascal 16Q2 GP100 6.0 3584 56 HBM2 16 / 12 732 / 549 9.3 4.7 250 PCIe Gen3 x16 Double TSMC 16nm
M60 Maxwell 15Q3 2X GM204 5.2 4096 2X 16 GDDR5 2X 8 2X 160 2X 4.8 2X 0.15 300 PCIe Gen3 x16 Double TSMC 28nm
K80 Kepler 14Q4 2X GK210 3.7 4992 2X 13 GDDR5 2X 12 2X 240 5.6 1.87 300 PCIe Gen3 x16 Double TSMC 28nm
CUDA Total
Memory Memory FP64 Max.
Micro Compute # # Memory FP32 Foundry
Model Name Date Chips Size BW TFLOP TDP Interface
Arch. Capability CUDA SMs Type TFLOPS Process Size
(GB) (GB/s) S (W)
Version Cores
RTX A5500 Ampere 22Q1 GA103S 8.6 7424 58 GDDR6 16 448 24.7 0.73 165 PCIe Gen4 x16 Samsung 8nm
RTX A5000 Ampere 21Q2 GA104 8.6 6144 48 GDDR6 16 448 21.7 0.54 165 PCIe Gen4 x16 Samsung 8nm
RTX A4500 Ampere 22Q1 GA104 8.6 5888 46 GDDR6 16 448 18.5 0.45 140 PCIe Gen4 x16 Samsung 8nm
RTX A4000 Ampere 21Q2 GA104 8.6 5120 40 GDDR6 8 384 17.8 0.54 140 PCIe Gen4 x16 Samsung 8nm
RTX A3000 12GB Ampere 22Q1 GA104 8.6 4096 32 GDDR6 12 336 14.1 0.20 130 PCIe Gen4 x16 Samsung 8nm
RTX A2000 8GB Ampere 22Q1 GA107 8.6 2560 20 GDDR6 8 224 9.3 0.74 95 PCIe Gen4 x16 Samsung 8nm
Quadro RTX 6000 Turing 20Qx TU102 7.5 4608 72 GDDR6 24 672 14.9 0.42 250 PCIe Gen3 x16 TSMC 12nm
Quadro RTX 5000 Turing 19Q2 TU104 7.5 3072 48 GDDR6 16 448 9.4 0.34 110 PCIe Gen3 x16 TSMC 12nm
Quadro RTX 4000 Turing 19Q2 TU104 7.5 2560 40 GDDR6 8 448 8.0 0.25 110 PCIe Gen3 x16 TSMC 12nm
Memory 2X16GB DDR4-3200 non-ECC 2X32GB DDR5-4800 non-ECC 128GB DDR5-4800/3600 non-ECC CAMM module
2X16GB DDR5-4800 non-ECC 2X32GB LPDDR5-5200 non-ECC 4X32GB DDR5-4800 ECC SODIMM
2X16GB LPDDR5-6400 non-ECC (higher speed in 13th Gen Intel Core Mobile) (higher speed in 13th Gen Intel Core Mobile)
Storage 1TB mix-use PCIe NVMe SSD 1TB or 2TB mix-use PCIe NVMe SSD 2TB+ mix-use PCIe NVMe SSD
Graphic NVIDIA T550 (4GB) NVIDIA T600 (4GB) NVIDIA RTX A3000 (12GB)
NVIDIA RTX A500 (4GB) NVIDIA T1200 (4GB) NVIDIA RTX A4500 (16GB)
NVIDIA RTX A1000(4GB) NVIDIA RTX A5500 (16GB)
NVIDIA RTX A2000 (8GB)
Brands and Dell Precision 3470 (14”) Dell Precision 5470 (14”) Boxx GoBoXX SLM17
models Dell Precision 3570 (15”) Dell Precision 3571 (15”) Dell Precision 7670
HP ZBook Firefly 14-inch G9 Dell Precision 5770 (17”) Dell Precision 7770
HP ZBook Firefly 16-inch G9 HP ZBook Studio 16-inch G9 HP ZBook Fury 16 G9
Lenovo ThinkPad P14s Gen 3 (Intel) HP ZBook Power 15.6-inch G9 Lenovo ThinkPad P16 Gen 1
Lenovo ThinkPad P16s Gen 1 (Intel) Lenovo ThinkPad P1 Gen 5
Lenovo ThinkPad P15v Gen 3 (Intel)
Compute CPU: Dual Intel Xeon Platinum 8358 (2X32C, 2.6Ghz) or CPU: Dual AMD EPYC 7573X (2X32C, 2.8Ghz, 768MB L3 Cache) or
Dual Intel Xeon Gold 6338 (2X32C, 2Ghz) or Dual AMD EPYC 75F3 (2X32C, 2.95Ghz, 256MB L3 Cache) or
Nodes Dual Intel Xeon Gold 6354 (2X18C, 3.0Ghz) Dual AMD EPYC 7543 (2X32C, 2.8Ghz,256MB L3 Cache)
Memory: 16x 32GB DDR4-3200 ECC (CFD or Explicit FEA) or Memory: 16x 32GB DDR4-3200 ECC (CFD or Explicit FEA) or
16x 64GB DDR4-3200 ECC (MAPDL) 16x 64GB DDR4-3200 ECC (MAPDL)
OS: 2x SSDs RAID 1 OS: 2x SSDs RAID 1
Local SSD (MAPDL): mix-use or write intensive NVMe SSD ~2TB Local SSD (MAPDL): mix-use or write intensive NVMe SSD ~2TB
Interconnect: HDR Infiniband Interconnect: HDR Infiniband
Brands • Dell EMC PowerEdge R750 (head node, 2U), C6520 within C6400 • Dell EMC PowerEdge R7515 (head node, 2U), C6525 within C6400
chassis (4 servers in 2U) or R650 (1U) chassis (4 servers in 2U) or R6525 (1U)
and • HPE ProLiant DL380 Gen10 Plus (head node, 2U), XL220n within • HPE ProLiant DL345 Gen10 Plus (head node, 2U), XL225n within
models Apollo 2000 Gen10 Plus chassis (4 servers in 2U) or DL360 Apollo 2000 Gen10 Plus chassis (4 servers in 2U), or DL325 Gen
Gen10 Plus (1U) 10 plus(1U)
• Lenovo ThinkSystem SR650 V2 (head node, 2U), SD630 V2 within • Lenovo ThinkSystem SR655 (head node, 2U), SR635 (1U)
DA240 enclosure (4 servers in 2U) or SR630 V2 (1U)
Parallel For large HPC cluster with 10s or more users/compute nodes.
IBM Spectrum Scale (GPFS) parallel file system based storage solutions.
Storage Lustre parallel file system based storage solutions.
Compute CPU: Dual AMD EPYC 9554 (2x64C, 3.10Ghz, 256MB L3 Cache, 360W) or
Dual AMD EPYC 9534 (2x64C, 2.45Ghz, 256MB L3 Cache, 280W) or
Nodes Dual AMD EPYC 9474F (2X48C, 3.60Ghz, 256MB L3 Cache, 360W) or
Dual AMD EPYC 9454 (2X48C, 2.75Ghz, 256MB L3 Cache, 290W) or
Dual AMD EPYC 9374F (2X32C, 3.85Ghz, 256MB L3 Cache, 320W) or
Dual AMD EPYC 9354 (2X32C, 3.25Ghz, 256MB L3 Cache, 280W)
Memory: 768GB (24X32GB) DDR5-4800 ECC (CFD or Explicit FEA) or
1.5TB (24X64GB) DDR5-4800 ECC (MAPDL)
OS: 2x 512GB SATA/SAS/NVMe SSD RAID 1
Local SSD (MAPDL): mix-use or write intensive NVMe SSD ~2TB
Interconnect: HDR Infiniband
Brands and • Dell EMC PowerEdge R7625 (head node, 2U), R6625 (compute nodes, 1U)
• HPE ProLiant DL345 Gen11 (head node, 2U), DL365 Gen 11(compute nodes, 1U)
models • Lenovo ThinkSystem SR665 V3 (head node, 2U), SR645 V3(compute nodes, 1U) or SD665 V3 (12 dual-socket nodes in 6U
DW612S enclosure)
Parallel For large HPC cluster with 10s or more users/compute nodes.
IBM Spectrum Scale (GPFS) parallel file system based storage solutions.
Storage Lustre parallel file system based storage solutions.
Compute CPU: Dual Intel Xeon Max 9460 (2X40C, 2.2Ghz, 97.5MB Cache, 350W, 2X64GB HBM2e Memory) or
Dual Intel Xeon Max 9462 (2X32C, 2.7Ghz, 75.0MB Cache, 350W, 2X64GB HBM2e Memory) or
Nodes Dual Intel Xeon Gold 6458Q (2X32C, 3.1Ghz, 60MB Cache, 350W, Liquid Cooled) or
Dual Intel Xeon Platinum 8460Y+ (2x40C, 2.0Ghz, 105MB Cache, 300W) or
Dual Intel Xeon Platinum 8462Y+ (2x32C, 2.8Ghz, 60MB Cache, 300W) or
Dual Intel Xeon Gold 6448Y (2X32C, 2.1Ghz, 60MB Cache, 225W) or
Dual Intel Xeon Gold 6438Y (2X32C, 2.0Ghz, 60MB Cache, 205W) or
Dual Intel Xeon Gold 6442Y (2X24C, 2.6GHz, 60MB Cache, 225W)
Memory: 512GB (16X32GB) DDR5-4800 ECC (CFD or Explicit FEA) or
1.0TB (16X64GB) DDR5-4800 ECC (MAPDL)
OS: 2x 512GB SATA/SAS/NVMe SSD RAID 1
Local SSD (MAPDL): mix-use or write intensive NVMe SSD ~2TB
Interconnect: HDR Infiniband
Brands and • Dell EMC PowerEdge R760 (head node, 2U), R660 (compute nodes, 1U) or C6620 (4 servers in 2U)
• HPE ProLiant DL380 Gen11 (head node, 2U), DL360 Gen 11 (compute nodes, 1U)
models • Lenovo ThinkSystem SR650 V3 (head node, 2U), SR630 V3 (compute nodes, 1U) or SD650 V3 (12 dual-socket nodes in a
6U DW612S enclosure)
Parallel For large HPC cluster with 10s or more users/compute nodes.
IBM Spectrum Scale (GPFS) parallel file system based storage solutions.
Storage Lustre parallel file system based storage solutions.
Electronics c5n, c6i, m6i Recommended instance types: c5n.18xlarge, c5n.9xlarge, c6i.32xlarge, m6i.32xlarge
Fluids c5n.18xlarge EFA is strongly recommended for all Fluids solvers. These instance types support EFA.
c6i.32xlarge
hpc6a.48xlarge
LS-DYNA hpc6id.32xlarge
hpc6a.48xlarge
c6i.32xlarge
These instance types support EFA (recommended)
c6a.48xlarge
hpc6id.32xlarge and hpc6a.48xlarge are recommended
m6a.48xlarge
m6a.48xlarge
m6i.32xlarge
RedHawk-SC Instances from the M or R families
Speos c5a, c5d, c5n, c6i, m6i Instance types that support EFA: c5n.18xlarge, c5n.9xlarge, c6i.32xlarge, m6i.32xlarge
Workbench/Mechanical c6i, m6i, r6i, hpc6id c to m to r provide increased memory (RAM)
Need good balance between cores and memory per core
Instance types that support EFA: c6i.32xlarge, m6i.32xlarge, r6i.32xlarge, hpc6id.32xlarge
Note:
• The same availability zone should be selected for all resources in the HPC workflow setup (virtual machine used for job submission, file storage server, cluster).
• Ansys recommends that you disable multithreading/Hyper-Threading when selecting an instance type.
• Some instances are available in selected regions only.
• For information about EFA, see Elastic Fabric Adapter (EFA) in this guide.
www.ansys.com/ws-roi-estimator www.ansys.com/hpc-trial
www.ansys.com/free-hpc-benchmark www.ansys.com/cloud-trial
www.ansys.com/it-solutions/hpc/on- www.ansys.com/cloud
premises-hardware
www.ansys.com/events/it-webinar- www.ansys.com/events/cloud-
series webinar-series