...
The GPU partition of Setonix is made up of 192 nodes, 38 of which are high memory nodes (512 GB RAM instead of 256GB). Each GPU node features 4 AMD MI250X GPUs, as depicted in Figure 1. Each MI250X comprises 2 Graphics Complex Die (GCD), with each effectively seen as a standalone GPU by the system. A 64-core AMD Trento CPU is connected to the four MI250X with the AMD InfinityFabric interconnect, the same interconnection between the GPU cards, with a peak bandwidth of 200Gb/s. For more information refer to the Setonix General Information. Each GCD can access 64GB of GPU memory. This totals to 128GB per MI250X, and 256GB per standard GPU node.
Figure 1. A GPU node of Setonix
...
Several scientific applications are already able to offload computations to the MI250X, many others are in the process of being ported to AMD GPUs. Here is a list of the main ones and their current status.
Name | AMD GPU Acceleration | Module on Setonix |
---|---|---|
Amber | Yes | Yes |
Gromacs | Yes | Yes |
LAMMPS | Yes | Yes |
NAMD | Yes | |
NekRS | Yes | |
PyTorch | Yes | Yes* |
ROMS | No | |
Tensorflow | Yes | Yes* |
Table 1. List of popular applications applications. * indicates module is a container as module.
Module names of AMD GPU applications end with the postfix amd-gfx90a
. The most accurate list is given by the module
command:
...
Popular numerical routines and functions have been implemented by AMD to run on their GPU hardware. All of the following are available when loading the rocm/5.0.2
module modules.
Name | Description |
---|---|
rocFFT | Fast Fourier Transform. Documentation pages (external site). |
rocBLAS | rocBLAS is the AMD library for Basic Linear Algebra Subprograms (BLAS) on the ROCm platform. Documentation pages (external site). |
rocSOLVER | rocSOLVER is a work-in-progress implementation of a subset of LAPACK functionality on the ROCm platform. Documentation pages (external site). |
...
The default ROCm installation is rocm/5.2.3
provided by HPE Cray. In addition, Pawsey staff have installed the more recent versions up to rocm/5.47.3
from source using ROCm-from-source. It is an experimental installation and users might encounter compilation or linking errors. You are encouraged to explore it during development and to report any issues. For production jobs, however, we currently recommend using rocm/5.2.3
. We recommend the use of the latest available version unless it creates troubles in your code. Available versions can be checked with the command:
module avail rocm
.
Submitting Jobs
You can submit GPU jobs to the gpu
, gpu-dev
and gpu-highmem
Slurm partitions using your GPU allocation.
...
Column | |||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| |||||||||||||||||||||||||||||||||||||||||||||||
|
Full guides
...