et-operator
et-operator copied to clipboard
Job level restartpolicy and backofflimit support
When testing horovod elastic worker preemption, the whole job may fail in some cases with uncertain reason. It'd be better if operator support job level restart(restartpolicy + backofflimit).
Good suggestion. We will support it this mouth.