jobset
jobset copied to clipboard
JobSet: An API for managing a group of Jobs as a unit
**What would you like to be added**: I would like to support features to create a single PVC and mount the PV to some replicatedJobs like this: ```yaml apiVersion: jobset.x-k8s.io/v1alpha2...
**What would you like to be added**: A JobSetTemplate API similar to [PodTemplate](https://github.com/kubernetes/kubernetes/blob/466b10fcabd92cf1c1f250522c6e2c5d1045747c/pkg/apis/core/types.go#L4034). **Why is this needed**: APIs building on top of JobSet requires referencing a JobSet spec. The common...
**What would you like to be added**: I would like to support the new `JobSetConditionType` and `Running` so that researchers can easily find out if the JobSet is still running....
**What would you like to be added**: A comprehensive example showing how to run a training workload on GPUs using JobSet. We could have one example per major cloud provider....
- Fix https://github.com/kubernetes-sigs/jobset/issues/548 I would like to add a usage example of the network field in simple/example and demonstrate how to access each other between two pods. This will allow...
Graduate JobSet API to v1. We need to keep v1apha1 for a few more releases to make it easier to customers to migrate. Ref: https://book.kubebuilder.io/multiversion-tutorial/api-changes
This pull request is to implement configurable failure policy. There is one difference to note from the KEP. I added a new field to the JobSetStatus that tracks the number...
**What would you like to be added**: I would like to support [scale subresource](https://kubernetes.io/docs/tasks/extend-kubernetes/custom-resources/custom-resource-definitions/#scale-subresource) and the metrics corresponding to HPA resource like this: ```yaml apiVersion: jobset.x-k8s.io/v1alpha2 kind: JobSet metadata: name:...
**What would you like to be added**: Integration tests for changes in #562 **Why is this needed**: Improving test coverage
**What would you like to be added**: API reference for the JobSet API. Similarly as for Kueue: https://kueue.sigs.k8s.io/docs/reference/kueue.v1beta1/ **Why is this needed**: So that we can easily reference JobSet API...