Slurm enters provisioning loop even when resources are unavailable

Open cbutakoff opened this issue 1 year ago • 1 comment

Not sure if this can be resolved, but would it be possible to check whether the nodes are available before provisioning the cluster network, rather than provisioning and then waiting for an error?

cbutakoff, Apr 27 '23 14:04

When provisioning the Cluster Network, the first step is a reservation of the nodes, which fails almost right away if the nodes are not available.

arnaudfroidmont, Jan 08 '24 16:01
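
For illustration, the reserve-then-provision behaviour discussed in this thread could be sketched roughly as below. This is only a minimal sketch and not the oci-hpc implementation: `CapacityUnavailable`, `reserve_nodes`, `provision_cluster`, and `max_attempts` are hypothetical names introduced here, and the real reservation step may report failure differently. The idea is that a failed reservation is treated as a stop signal after a bounded number of attempts, instead of being retried indefinitely.

```python
from typing import Callable


class CapacityUnavailable(Exception):
    """Hypothetical error raised when the node reservation is rejected."""


def provision_with_fail_fast(
    reserve_nodes: Callable[[], None],
    provision_cluster: Callable[[], None],
    max_attempts: int = 1,
) -> bool:
    """Reserve first and provision only if the reservation succeeds.

    Returns True when the cluster was provisioned, and False when the
    reservation failed on every attempt, so the caller can stop looping.
    """
    for _ in range(max_attempts):
        try:
            reserve_nodes()      # hypothetical: the quick reservation step
        except CapacityUnavailable:
            continue             # no capacity; retry up to max_attempts
        provision_cluster()      # hypothetical: the slow provisioning step
        return True
    return False
```

With `max_attempts=1`, a caller gets an immediate `False` when capacity is missing and can mark the nodes as unavailable rather than re-entering the provisioning loop.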