Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Column


The content of this section is currently being updated to include material relevant for the upcoming changes of Slurm in September 5, 2023
Panel
titleOn this page

Table of Contents

Note
titleWork in Progress for Phase-2 Documentation


Node architecture

The GPU node architecture is different from that on the CPU-only nodes. The following diagram shows the connections between the CPU and GPUs on the node, which will assist with understanding recommendations for Slurm job scripts later on this page. Note that the numbering of the cores of the CPU has a slightly different order to that of the GPUs.

...

Column
width900px


Code Block
languagebash
themeEmacs
titleListing N. selectGPU_X.sh wrapper script for "manually" selecting 1 GPU per task
linenumberstrue
#!/bin/bash

export ROCR_VISIBLE_DEVICES=$SLURM_LOCALID
exec $*


(Note that the wrapper need to have execution permissions. The command: "chmod 755 selectGPU_X.sh", or similar will do the job for that.)

The wrapper script defines the value of the ROCm environment variable variable ROCR_VISIBLE_DEVICES with the value of the Slurm environment variable SLURM_LOCALID. It then executes the rest of the parameters given to the script which are the usual execution instructions for the program intended to be executed. The SLURM_LOCALID variable has the identification number of the task within each of the nodes (not a global identification, but an identification number local to the node). Further details about the variable are available in the Slurm documentation.

...