et-operator icon indicating copy to clipboard operation
et-operator copied to clipboard

Job level restartpolicy and backofflimit support

Open xychu opened this issue 4 years ago • 1 comments

When testing horovod elastic worker preemption, the whole job may fail in some cases with uncertain reason. It'd be better if operator support job level restart(restartpolicy + backofflimit).

xychu avatar Dec 30 '20 03:12 xychu

Good suggestion. We will support it this mouth.

xiaozhouX avatar Jan 06 '21 02:01 xiaozhouX