codeflare-sdk
codeflare-sdk copied to clipboard
An intuitive, easy-to-use python interface for batch resource requesting, access, job submission, and observation. Simplifying the developer's life while enabling access to high-performance compute re...
# Issue link # What changes have been made Use lookup table for status # Verification steps ## Checks - [ ] I've made sure the tests are passing. -...
# Issue link # What changes have been made adding support for custom accelerators # Verification steps ## Checks - [ ] I've made sure the tests are passing. -...
### Name of Feature or Improvement I'd like to change from a hardcoding of `nvidia.com/gpu` to instead having a dict or something of resources. There are other accelerators and it'd...
### Name of Feature or Improvement Create an integration test case to validate DSP, CodeFlare and KubeRay implementation. ### Describe the Solution You Would Like to See #### Test environment...
# Issue link Resolves #408 # What changes have been made - Created a function that checks for env variables in the DDPJobDefinition, and looks for the `PIP_INDEX_URL` and/or `PIP_TRUSTED_HOST`...
Login method must return the error to the caller codeflare-sdk 0.12.1 https://github.com/project-codeflare/codeflare-sdk/blob/073b17114e91443977f802758f099c8dc2d78d46/src/codeflare_sdk/cluster/auth.py#L122
### Name of Feature or Improvement Auto generated documentation is built from source code function descriptions, we should include arg and return descriptions there so that anyone using the function...
### ClusterConfiguration should support tolerations Run Ray clusters (especially the worker pods) on tainted nodes. ### Description of Problem the Feature Should Solve You cannot create a Ray cluster with...
### Describe the Bug I want to submit Ray job with environment variables specified, however provided environment variables aren't passed into the Ray. [SDK doc](https://project-codeflare.github.io/codeflare-sdk/job/jobs.html#codeflare_sdk.job.jobs.DDPJobDefinition) specifies that `DDPJobDefinition` contains property...
### Name of Feature or Improvement The documentation around the cluster configuration options is not easy for users to find and grok the features that are exposed. ### Description of...