verl
verl copied to clipboard
[Bug] workflow `e2e_prime` is sometimes stuck
Motivation
The workflow e2e_prime is sometimes stuck and reaches timeout, while sometimes finishes normally. See https://github.com/volcengine/verl/actions/workflows/e2e_prime.yml for details.
Plan
- [ ] Reproduce the issue locally with the container and commands used in the workflow.
- [ ] Check where the root cause is.
I believe this is deprecated now. https://github.com/volcengine/verl/blob/2c6c65cb0f69dd3761e71682fb01d939aa437710/.github/workflows/.deprecate/e2e_prime.yml#L4