-
Notifications
You must be signed in to change notification settings - Fork 4
Switching to Slurm
This page describes how to transition to the newer generation of Hyak. Most of this information is taken from:
https://wiki.cac.washington.edu/display/hyakusers/Mox_scheduler
https://wiki.cac.washington.edu/display/hyakusers/Hyak+HOWTO
https://wiki.cac.washington.edu/display/hyakusers/Hyak+mox+Overview
http://www.arc.ox.ac.uk/content/running-r
https://wiki.cac.washington.edu/display/hyakusers/Hyak+R+programming
https://cran.r-project.org/web/packages/rslurm/vignettes/rslurm.html
https://hpc.nih.gov/docs/pbs2slurm.html
https://www.glue.umd.edu/hpcc/help/slurm-vs-moab.html
Here are 6 main differences from ikt:
- Mox is an entirely separate cluster. They share nothing with one another.
- You only get what you ask for, regardless of the resources available on the node. If you ask for 1 CPU, you'll only get one. If you ask for 1GB of RAM, you'll only get 1GB.
- An allocation won't get the same set of nodes all the time, just access to the particular number of nodes to which they're entitled.
- No occasional preemption in ckpt (formerly bf queue) for the moment.
- Preempted jobs get 10s to do something smart before being killed and requeued.
- Please report any problems to [email protected] with Hyak as the first word in the subject. Please also let us know you're using mox not ikt.
Old: ssh [email protected]
New: ssh [email protected]
https://slurm.schedmd.com/rosetta.pdf
Old: showq
New: squeue
squeue -p csde
squeue -u kweiss2
squeue -p ckpt
hyakalloc
hyakalloc xyz
Old: logout
New: exit
Single job (1234): scancel 1234
All jobs: scancel -u kweiss2
You can copy files at high speed without a password between the Hyak systems using commands like the ones below. Here ikt is hyak classic and mox is hyak nextgen. Below xyz (csde) is your group name and abc (kweiss2) is your userid. (If you are using a non-default PATH environment variable then you can find hyakbbcp at this location /sw/local/bin/hyakbbcp .)
File: ikt1$ hyakbbcp myfile mox1.hyak.uw.edu:/gscratch/xyz/abc/mydir
Directory: ikt1$ hyakbbcp -r mydirectory mox1.hyak.uw.edu:/gscratch/xyz/abc/mydir
For me, this would be:
ikt1$ hyakbbcp myfile mox1.hyak.uw.edu:/gscratch/csde/kweiss2/sti
ikt1$ hyakbbcp -r sti mox1.hyak.uw.edu:/gscratch/csde/kweiss2/sti
Interactive build node: srun -p build --time=2:00:00 --mem=100G --pty /bin/bash
Interactive build node in own group: srun -p xyz -A xyz --time=2:00:00 --mem=100G --pty /bin/bash
Multiple nodes: srun -N 2 -p xyz -A xyz --time=2:00:00 --mem=100G --pty /bin/bash
Find names of allocated nodes: scontrol show hostnames
sbatch -p xyz -A xyz myscript.slurm
Open up an interactive build node:
srun -p build --time=2:00:00 --mem=100G --pty /bin/bash
Find available modules and load:
module avail
module load r_3.3.3
Access R: R
Update packages: update.packages()
, choose a CRAN mirror, and then say yes to all of the options