kuberay
feat(kubectl-plugin): Add clusterTimeout flag support for job submit
Why are these changes needed?
kubectl ray job submit currently hardcodes clusterTimeout to 120s, so the command fails whenever a RayJob triggers node pool scaling that takes longer than that (e.g., cloud environments where scale-up takes ~10 min). This PR makes clusterTimeout configurable so that submissionMode: InteractiveMode works in these slow-scaling scenarios.
Related issue number
Closes issue #3900
Checks
- [x] I've made sure the tests are passing.
- Testing Strategy
  - [x] Unit tests
  - [x] Manual tests
  - [ ] This PR is not tested :(
Hey @kevin85421 and @win5923 👋 Any follow-up thoughts on this change?
Any word on this PR by chance?
Left a few comments, good to merge after that