neon icon indicating copy to clipboard operation
neon copied to clipboard

storage controller: anti-flapping

Open jcsp opened this issue 1 year ago • 0 comments

Currently our reconciliation loop has the minimum required behavior: it will try to reconcile, and if a reconciliation fails, it will eventually try again (via the background reconciliation task).

For a good operational experience, we would like two more properties:

  • a reconciliation that has failed too many times should give up/back off rather than trying again every single time the background reconciliation loop checks it.
  • if a reconciliation loop claims to have succeeded, but maybe_reconcile still thinks the tenant is in need of reconciliation, then that's a probable bug and we should activate a similar backoff to prevent flapping.

jcsp avatar Feb 22 '24 12:02 jcsp