cluster-api-provider-tinkerbell icon indicating copy to clipboard operation
cluster-api-provider-tinkerbell copied to clipboard

Implement retries for BMC interactions

Open chrisdoherty4 opened this issue 1 year ago • 0 comments

BMCs are known to fail/act oddly. CAPT uses Rufio when BMC data is referenced by the Hardware resource to power machines off/on and configure netboot. The Rufio Tasks/Jobs indicate whether they failed or succeeded. For increased resiliancy we should consider implementing retries in CAPT for the Rufio interactions.

chrisdoherty4 avatar Jan 10 '24 20:01 chrisdoherty4