kubebench "step-run" fails during long benchmark run

Open xyhuang opened this issue 6 years ago • 2 comments

This is caused by using "successCondition" in the Argo step to track the status of the created Kubeflow resources (tfjob). The step times out after a few minutes if the "successCondition" is not met, but the Kubeflow resources running the benchmarks may take longer than the step can wait. We need a more robust way to track the status of the created Kubeflow resources.
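
One possible direction, sketched in Python purely for illustration: poll the resource status in a loop with an explicit, configurable deadline instead of relying on the step's built-in wait. Everything here is an assumption rather than kubebench or Argo code: `wait_for_job`, the `get_state` callback (which would wrap whatever call reads the tfjob status), and the state names "Succeeded" and "Failed" are all hypothetical.

```python
import time
from typing import Callable


def wait_for_job(
    get_state: Callable[[], str],
    timeout_s: float,
    poll_interval_s: float = 10.0,
    success_states: frozenset = frozenset({"Succeeded"}),  # assumed state name
    failure_states: frozenset = frozenset({"Failed"}),     # assumed state name
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> str:
    """Poll get_state() until a terminal state is seen or timeout_s elapses."""
    deadline = clock() + timeout_s
    while True:
        state = get_state()
        if state in success_states:
            return state
        if state in failure_states:
            raise RuntimeError(f"job ended in state {state!r}")
        # The deadline is chosen by the caller, so a long benchmark can be
        # given hours rather than the few minutes the step currently allows.
        if clock() >= deadline:
            raise TimeoutError(f"job still {state!r} after {timeout_s} seconds")
        sleep(poll_interval_s)
```

With a helper like this, the timeout becomes a per-benchmark parameter, and a failed job is reported as a failure right away instead of being indistinguishable from a slow one.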

May 02 '18 14:05 xyhuang

/priority p2

Jun 05 '18 21:06 xyhuang

Is this still an open issue, and is it being handled?

Jun 17 '19 10:06 Feelas
