qq respawn
The qq respawn command is used to "respawn" jobs, i.e. put failed or killed jobs back into the queue to be retried. It has no direct equivalent in Infinity.
Imagine you find that your job has failed because it unexpectedly reached a walltime limit (e.g., because it was running on a slow node). You want to just put the job back into the queue to be retried. Normally, you would run something like the following sequence of commands:
# go to the directory with the crashed job qq cd <crashed-job-id> # remove the working directory (optional) qq wipe # clear the runtime files from the crashed job qq clear # submit the job to the queue with the same parameters qq submit -q <queue> --ncpus 8 --ngpus 1 --walltime 1d <script-name>With
qq respawn, you can just run:# remove the working directory, clear runtime files, # and submit a new job with the original parameters qq respawn <crashed-job-id>
Description
Respawns the specified qq job(s), or all qq jobs submitted from the current directory.
qq respawn [OPTIONS] JOB_ID
JOB_ID — One or more IDs of jobs to respawn. Optional.
If no JOB_ID is provided, qq respawn searches for qq jobs in the current directory. If multiple suitable jobs are found, qq respawn respawns each one in turn.
Examples
qq respawn 123456
Respawns the job with ID 123456 located on the default batch server. If the job is located on a different batch server, you need to use the full ID including the sever address.
Only failed and killed jobs can be respawned. If you try to respawn a job in any other state, you will get an error.
qq respawn 123456 144844 156432
Respawns jobs 123456, 144844, and 156432, if they are suitable.
qq respawn
Respawns all suitable qq jobs whose info files are present in the current directory (typically one, since qq requires one job per directory).