...
Code Block |
---|
#PBS -l nodes=1:ppn=4:group0 OR #PBS -l nodes=1:ppn=8:group0 OR #PBS -l nodes=n024:ppn=8 OR #PBS -l nodes=1:ppn=8+nodes=1:ppn=4 OR #PBS -l nodes=n024:ppn=8+nodes=1:ppn=8 OR #PBS -l nodes=1:ppn=8:mem16 |
Note that the ":group0" addition searches for a node with the property "group0". On cyrus1, this property will be present on all nodes designated for the queuing system. Nodes n017 through n021 will be reserved for software unable to use the queue system. On quantum2, this group0 designation should not be included. (If this proves to be difficult for users to manage, we will simply remove those non-designated queues from the configuration manager.)Multiple job submission through job arrays is available on Cyrus1, but not Quantum2 (which has an earlier version of Torque. This feature can be especially helpful to users who need to submit a large number of similar jobs. For more details, see section 2.1.1 of the Torque user manual.
Some or all of these arguments can also be given at the command line. Command-line settings override any settings in the script.
...
command | purpose |
---|---|
qsub jobscript | Submit job in script jobscript. Can accept other arguments as discussed above. |
qsub -I -l nodes=1:ppn=4 | Request interactive job with indicated resources. |
qdel jobID | Delete job number jobID. Seems to kill processes on compute nodes cleanly. |
showq | list jobs by state, with scheduling details. |
showbf | list "backfill," or number of processors available at the moment. |
qstat | List active jobs. Use "-i" for a different format. |
qstat -q | List queue attributes for all queues. |
qstat -f jobID | List detailed information for job number jobID. |
qnodes | List all nodes and their state and properties. |
qnodes -l down | List those nodes currently down. |
qnodes -l active or qnodes -l active | List nodes currently used for jobs. |
qnodes -l free | List nodes currently free. |
qmgr -c "print server" | Print queue configuration details |
administrative command | purpose |
pbsnodes -o nodename | Take node offline after allowing current job to finish. |
pbsnodes -r nodename | Check node and then return to free status if possible. |
checkjob jobid | Check status of job. |
releasehold -a jobid | Release holds on job. |
qrun jobid | Force job to execute now. |
showstats | Show usage statistics. |
showgrid statistic_name | Print a text table of statistic_name by time. |
The PBS queue system allocates a set of nodes and processors to an individual job, either for the walltime specified in the job or the maximum walltime in the queue. It then provides a set of environmental variables to the shell in which the script runs, such as PBS_NODEFILE, the temporary node file describing allocated CPUs. When running with OpenMPI's mpiexec, the submitted script seems to just launch processes without needing that nodefile specified as an argument to mpiexec, although it's not clear whether that behavior is a feature or a bug.
Additional information for queue administrators on PBS and Maui can be found on a helpful page maintained by the University of Cambridge's Theoretical Chemistry department.
Anchor | ||||
---|---|---|---|---|
|
Table of queues attributes
The following tables are available in printer-friendly form in an attached file. Note that these settings can be adjusted to meet users' needs as those needs become clearwere updated on June 11th to reflect changes made on that date. Note that a dash ("--") indicates no limit for the given queue.
Queue attributes on Cyrus1 and Quantum2
| debug | short | long |
---|---|---|---|
max walltime | 20 min | 24 hr | 6 5 days |
max nodes per job | 1 | 2 5 | 1 |
max nodes per queue | – | – | 10 |
max jobs running per user | – | – | 5 |
max jobs queuable per user | – | – | - |
priority | 100 | 80 | 60 |
Queue attributes on DariusDarius1, Darius2, Xerxes and Artaxerxes
| debug | short | long | verylong |
---|---|---|---|---|
max walltime | 20 min | 24 hr | 12 5 days | 14 days |
max nodes per job | 1 | 32 | 4 | 4 |
max nodes per queue | – | – | 15 | 5 |
max jobs running per user | – | – | 15 | 5 8 |
priority | 100 | 80 | 60 | 40 |