UMass Boston

Gibbs Scheduler

Slurm

Gibbs is now using the Slurm scheduler. 

Slurm is a free and open-source job scheduler for the Linux kernel used by many of the world's supercomputers and computer clusters. There is significant documentation available via the man pages. Try 'man sbatch' for submission related commands and 'man sstat' for status queries.

Partitions 

(Can use sinfo to see partition. Jobs submitted will choose nodes in billing weight order, unless specified otherwise)

Partition Name #Nodes           Billing weights
cpu mem gpu
GTX670 gibbs[01-08] 0.125 0.031G 0.25
TITAN gibbs[11-12] 0.207 0.052G 0.415
GTX780 gibbs[13-15] 0.261 0.065G 0.521
K20 gibbs[09-10] 0.386 0.096G 0.772
GTX980 gibbs[18-36] 0.207 0.026G 0.828
K80 gibbs[16-17] 0.588 0.037G 1.176

All the partitions have 4 GPU's except K80 with 8 GPU's.

Jobs can be submitted using the 'sbatch' command, generally followed by the name of the submission script.  Common command line options are included in the sample scripts. A full listing can be found in the man pages ('man sbatch'). Status of queued or running jobs can be obtained via the 'sstat' command. 

A generic sample script is available, as well as select submission scripts for specific applications, which can be found in their Application Pages.

Torque vs Slurm Commands

Description Torque Slurm
Submit a job qsub [file] sbatch [file]
Job hold qhold jobid scontrol hold jobid
Cancel a job qdel scancel
Release a job qrls [job_id] scontrol release
Job status qstat [job_id] sstat OR scontrol show jobs
Queue list qstat squeue
Node list pbsnodes  -l sinfo  --Node OR scontrol show nodes
Cluster status qstat sinfo

Job Specification

Script directive #PBS #SBATCH
Queue -q [queue]    none - see QoS
QoS           none  --qos=
Node Count - l  nodes=[count] -N [min[-max]]
CPU Count -l ppn=[count] OR -l
mppwidth=[PE_count]
-n [count]
Wall Clock Limit -l walltime=[hh:mm:ss] -t [min] OR -t [days-hh:mm:ss]
Standard Output File -o [file_name] -o [file_name]
Standard error File -e [file_name] -e [file_name]
Event Notification -m abe --mail-type=[events]
Email Address -M [address] --mail-user=[address]
Job Name -N [name] --job-name=[name]
Memory Size -l mem=[MB] --mem=[mem][M|G|T] OR --mem-per-cpu=
[mem][M|G|T]
Account        none --account=[account]
IT Research Computing
Healey Library, Lower Level
UMass Boston
100 Morrissey Blvd.
Boston, MA 02125
Book a Consultation
 617.287.5399
 It-rc@umb.edu