pacemaker icon indicating copy to clipboard operation
pacemaker copied to clipboard

Fix: libpe_status: Use pcmk_monitor_timeout for recurring monitors

Open nrwahl2 opened this issue 1 year ago • 0 comments

The executor uses pcmk_monitor_timeout, but the controller considers a recurring monitor to have timed out after its op timeout expires. If pcmk_monitor_timeout is very long (for example, 240 seconds), a stonith stop action can fail. In this situation, the monitor is declared as timed out before the pcmk_monitor_timeout expires, the stop action is requested, and its timer begins counting down. However, the stop action can't begin until after the monitor finishes or pcmk_monitor_timeout expires.

This also makes special handling in controld_execd.c unnecessary. pcmk__unpack_action_meta() has already replaced the meta timeout with the pcmk_monitor_timeout.

Closes RHEL-14826 (JIRA).

nrwahl2 avatar Oct 27 '23 04:10 nrwahl2