workflow-controller
workflow-controller copied to clipboard
Workflow with FAILED status
Currently, Workflow can have one of two possible statuses:
WorkflowComplete- when all its steps are completedWorkflowFailed- when "Deadline" is exceeded
IMHO, it should be failed also when one of workflow steps fails (Job fails).
Yeah, in general, I concur. Problem is what are we going to do when a Job failed?
- Should we implement a retry policy? I think we should.
- When a job failed should we stop all the workflow? I think we should
- Remove the all the workflows? I think we should not.
Thoughts? @clamoriniere1A: ideas?
@sdminonne when Job fail, after backoffLimit retries, we can stop the Workflow (if a user wants to, by specifying some tag).
And I think we should not remove the failed workflow, user can do it later if wants to. Keeping workflow generated jobs/pods will allow inspecting "Failure" and maybe "fixing" workflow for next run.
@alexei-led agree. Need to put together a proposal for the tags.