Skip to main content

AMD GPU nodes

AMD GPU Nodes

We currently host only few AMD GPUs that are available for benchmarking.

TypeSpecNodeRemarks
MI100Milan 7703 processor with 1x MI100 GPUga004Integrated into COSMA. Partition cosma8-shm2. Access via SLURM only.
MI200AMD EPYC 7713 with
2x MI200
ga005, ga006Integrated into COSMA. Partition cosma8-shm2. Access via SLURM only.
MI50AMD EPYC 7282 with
6x AMD MI50
ga003Integrated into COSMA. Interactive access only.

Cosma – gn003

These nodes are available for interactive access only. Log into a COSMA login node and ssh straightforwardly into the GPU nodes that you want to experiment with.

Cosma – ga004-ga006

Please log into COSMA8 login nodes and submit batch jobs from there via sbatch or salloc.

Batch jobs

The nodes are available within the cosma8-shm2 partition and have to be selected specifically within your SLURM script:

#SBATCH -p cosma8-shm2
#SBATCH -w ga004

Alternatively, you can use the –include or –exclude settings to pick the exact node.

Environment

ROCm toolchain

To be written.

Intel toolchain

The Intel toolchain on the AMD nodes works and yields good performance. However, the compiler has to be told about the architecture explicitly:

-O3 -fomit-frame-pointer -fstrict-aliasing -ffast-math -funroll-loops -axCOMMON-AVX512 -march=x86-64-v4 -mavx512vbmi

Funding and acknowledgements

The AMD test nodes have been installed in collaboration and as addendum to DiRAC@Durham facility managed by the Institute for Computational Cosmology on behalf of the STFC DiRAC HPC Facility (www.dirac.ac.uk). DiRAC equipment was funded by BEIS capital funding via STFC capital grants ST/P002293/1, ST/R002371/1 and ST/S002502/1, Durham University and STFC operations grant ST/R000832/1. DiRAC is part of the National e-Infrastructure.