flux-core icon indicating copy to clipboard operation
flux-core copied to clipboard

housekeeping only drains nodes if systemd unit can be run

Open grondo opened this issue 7 months ago • 1 comments

The housekeeping service relies on the systemd unit to drain ranks that fail housekeeping. However, if the housekeeping systemd service isn't configured or fails to start, then the node is not drained. Instead the node is put back into service without housekeeping being run, which could cause any number of failures.

grondo avatar Jul 18 '24 23:07 grondo