codeflare-sdk

An intuitive, easy-to-use Python interface for batch resource requesting, access, job submission, and observation. Simplifying the developer's life while enabling access to high-performance compute re...

Results 163 codeflare-sdk issues

Bumps [nest-asyncio](https://github.com/erdewit/nest_asyncio) from 1.5.5 to 1.5.7. Commits f8245a9 v1.5.7 7868510 Simplify run() method 9d46de9 Patch event loop policy to always return a patched loop 6baf3be Add manual workflow trigger 1ddc550...

dependencies

Bumps [numpy](https://github.com/numpy/numpy) from 1.23.3 to 1.25.2. Release notes Sourced from numpy's releases. v1.25.2 NumPy 1.25.2 Release Notes NumPy 1.25.2 is a maintenance release that fixes bugs and regressions discovered after...

dependencies

As part of my automated Codeflare testing, I'm hitting [this exception](https://gcsweb-ci.apps.ci.l2s4.p1.openshiftapps.com/gcs/origin-ci-test/pr-logs/pull/openshift-psap_ci-artifacts/866/pull-ci-openshift-psap-ci-artifacts-main-codeflare-e2e/1684093966636552192/artifacts/e2e/test/artifacts/000__sdk_user_run_many/000__local_ci__run_multi/ci-pods_artifacts/ci-pod-1/run.log):
```
Traceback (most recent call last):
  File "/opt/ci-artifacts/src/testing/codeflare/test.py", line 180, in <module>
    sys.exit(main())
  File "/opt/ci-artifacts/src/testing/codeflare/test.py", line 175, in main
    fire.Fire(Entrypoint())
...
```

As part of my automated Codeflare testing, I'm hitting [this exception](https://gcsweb-ci.apps.ci.l2s4.p1.openshiftapps.com/gcs/origin-ci-test/pr-logs/pull/openshift-psap_ci-artifacts/866/pull-ci-openshift-psap-ci-artifacts-main-codeflare-e2e/1684093966636552192/artifacts/e2e/test/artifacts/000__sdk_user_run_many/000__local_ci__run_multi/ci-pods_artifacts/ci-pod-0/run.log):
```
ERROR:root:Caught exception HTTPError: 503 Server Error: Service Unavailable for url: http://ray-dashboard-mnisttest-user0-codeflare-sdk-user-test-user-0.apps.kpouget-sutest-20230726-07h01.psap.aws.rhperfscale.org/api/version
Traceback (most recent call last):
  File "/opt/ci-artifacts/src/testing/codeflare/test.py",...
```
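A 503 like the one reported above often means the endpoint was queried before it was ready to serve. The sketch below shows one generic way a test harness could poll until the status is no longer 503. It is not how codeflare-sdk or the reporter's `test.py` behaves; `wait_until_available`, `fetch_status`, and the default values are hypothetical names chosen for illustration, and the status check is injected so the example runs without a network.

```python
import time
from typing import Callable


def wait_until_available(
    fetch_status: Callable[[], int],
    attempts: int = 5,
    delay_seconds: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Return True as soon as fetch_status() returns anything other than 503.

    Hypothetical helper: fetch_status stands in for a real HTTP request
    (for example, a GET against a dashboard's /api/version endpoint).
    """
    for attempt in range(attempts):
        if fetch_status() != 503:
            return True
        # Pause between attempts, but not after the final one.
        if attempt < attempts - 1:
            sleep(delay_seconds)
    return False


# Simulated endpoint: unavailable twice, then healthy.
responses = iter([503, 503, 200])
ready = wait_until_available(lambda: next(responses), sleep=lambda _: None)

# Simulated endpoint that never recovers within the allowed attempts.
never_ready = wait_until_available(lambda: 503, attempts=3, sleep=lambda _: None)
```

Injecting `fetch_status` and `sleep` keeps the retry logic testable on its own; a real harness would pass a function that performs the HTTP call and returns the response status code.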

Bumps [certifi](https://github.com/certifi/python-certifi) from 2022.9.24 to 2023.7.22. Commits 8fb96ed 2023.07.22 afe7722 Bump actions/setup-python from 4.6.1 to 4.7.0 (#230) 2038739 Bump dessant/lock-threads from 3.0.0 to 4.0.1 (#229) 44df761 Hash pin Actions and...

dependencies

I'm running multi-node training of ResNet50 with TorchX, codeflare-sdk, and MCAD on OCP 4.12. I have a 3-node OCP 4.12 cluster, and each node has one NVIDIA GPU. Each of the...

Provide some documentation on how a developer can set up an environment to actually use/test/develop the CodeFlare components (like working on the SDK) without an existing available OpenShift cluster. Current...

Bumps [aiohttp](https://github.com/aio-libs/aiohttp) from 3.8.3 to 3.8.5. Release notes Sourced from aiohttp's releases. 3.8.5 Security bugfixes Upgraded the vendored copy of llhttp_ to v8.1.1 -- by :user:webknjaz and :user:Dreamsorcerer. Thanks to...

dependencies

Bumps [pygments](https://github.com/pygments/pygments) from 2.13.0 to 2.15.0. Release notes Sourced from pygments's releases. 2.15.0 Added lexers: Carbon (#2362, #2365, #2366, #2367, #2368, #2369, #2370) Dax (#2335, #2345) MediaWiki Wikitext (#2373, #827)...

dependencies

The following unit tests fail when the user is connected to a Kubernetes cluster that is not accessible:
- tests/unit_test.py::test_ray_details
- tests/unit_test.py::test_cluster_status
- tests/unit_test.py::test_wait_ready

Unit tests pass successfully when user is not...

kind/bug
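One common way to keep unit tests independent of whichever cluster the developer happens to be logged in to is to replace the cluster call with a stub for the duration of the test. The sketch below illustrates that idea with `unittest.mock`. It does not reflect the actual codeflare-sdk test suite: `list_ray_clusters` and `cluster_status` are hypothetical stand-ins defined only for this example.

```python
from unittest import mock


def list_ray_clusters():
    """Hypothetical helper that would normally query the Kubernetes API.

    Here it always fails, simulating a configured but unreachable cluster.
    """
    raise ConnectionError("cluster is not reachable")


def cluster_status() -> str:
    """Hypothetical function under test; it depends on the cluster call."""
    clusters = list_ray_clusters()
    return "ready" if clusters else "unknown"


def test_cluster_status_without_cluster():
    # Temporarily swap the module-level helper for a stub, so the test
    # never touches the network or the local kubeconfig.
    stub = mock.Mock(return_value=["demo-cluster"])
    with mock.patch.dict(globals(), {"list_ray_clusters": stub}):
        assert cluster_status() == "ready"
    stub.assert_called_once()


test_cluster_status_without_cluster()
```

Outside the `with` block the original helper is restored, so other tests still see the unpatched function.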