User Tools

Site Tools


doku:slurm_job_reason_codes

SLURM: JOB REASON CODES

These codes identify the reason that a job is waiting for execution. A job may be waiting for more than one reason, in which case only one of those reasons is displayed.

Code Meaning
InvalidQOS The job's QOS is invalid (does not exist).
QOSNotAllowed The job is not allowed to run on that QOS.
PartitionNodeLimit The number of nodes required by this job is outside of it's partitions current limits. Can also indicate that required nodes are DOWN or DRAINED.
PartitionTimeLimit The job's time limit exceeds it's partition's current time limit.
Priority One or more higher priority jobs exist for this partition or advanced reservation.
QOSJobLimit The job's QOS has reached its maximum job count.
QOSResourceLimit The job's QOS has reached some resource limit.
QOSTimeLimit The job's QOS has reached its time limit.
QOSGrpNodeLimit The job's QOS has reached the maximum number of nodes. No free nodes are available within this QOS.
QOSGrpCpuLimit The job's QOS has reached the maximum number of available CPUs. No free CPUs are available within this QOS.
QOSMaxNodePerUserLimit The maximum number of nodes allowed per user has been reached in that QOS. No free nodes are available for the user.
Resources The job is waiting for resources to become available
None There are so many jobs waiting that the requested job cannot even get a priority.

There are some codes caused intentionally by the user:

Code Meaning
Dependency The job is waiting for a dependency from another job.
JobHeldUser The job has been suspended by the user.
doku/slurm_job_reason_codes.txt · Last modified: 2022/05/09 09:22 by goldenberg