Why does requesting GPUs as a generic resource on a cluster running SLURM with the built-in plugin fail?
Solution 1:
Slurm in the installed version (14.11.5) seems to have problems with types assigned to the GPUs: removing Type=... from gres.conf and changing the node configuration lines accordingly (to Gres=gpu:N,ram:...) results in successful execution of jobs that request GPUs via --gres=gpu:N.
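
For illustration, an untyped configuration along these lines might look like the sketch below. The node name (node01), the device paths, the GPU count, and the job script name are assumptions and not taken from the original setup; the other node parameters and the additional Gres entries from the original configuration are left out.

```
# gres.conf: one line per GPU, no Type= field
# (device paths are assumed; adjust to the actual hardware)
Name=gpu File=/dev/nvidia0
Name=gpu File=/dev/nvidia1

# slurm.conf: untyped GPU count on the node line
# (node01 is a hypothetical node name; remaining node parameters omitted)
GresTypes=gpu
NodeName=node01 Gres=gpu:2

# Job submission requesting one GPU (job.sh is a placeholder script name)
sbatch --gres=gpu:1 job.sh
```

Changes to gres.conf and to the Gres= entries in slurm.conf generally only take effect after the Slurm daemons have been restarted.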