Damien Grisonnet
/triage accepted /assign
> This metric is suitable for an alert if it is above 0 for a long time.

I wouldn't agree with that statement. It is pretty normal for workqueue items...
> I'm not sure I got your point. What you said is exactly the reason we are not satisfied with the existing retries metrics. We have many errors which are...
/triage accepted /assign
/triage accepted
cc @rexagod
/triage accepted /assign
The changes look good to me, thank you @rarruda for pushing that improvement :) I'll update our perf-tests and see how much we improve with this.
Please refrain from merging until we have the results.
I need to fix the tests: KMS doesn't seem to be deploying properly in their CI cluster (https://github.com/kubernetes/perf-tests/pull/2920), but I don't have much time on my hands to look at...