cuda-python
CUDA Python: Performance meets Productivity
An `ObjectCode` instance can encapsulate PTX, LTO-IR, or CUBIN, all of which can be serialized. This would help us, as well as any downstream project, implement a simple persistent...
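A minimal sketch of the persistent cache this would enable. Everything here is illustrative: a plain `bytes` payload stands in for a serialized `ObjectCode`, and `compile_source` is a hypothetical stand-in for an actual `Program(...).compile()` call, so the sketch runs without a GPU.

```python
import hashlib
import pathlib
import tempfile

# Hypothetical on-disk cache location for serialized object code.
CACHE_DIR = pathlib.Path(tempfile.gettempdir()) / "kernel_cache"


def compile_source(source: str) -> bytes:
    # Stand-in for real compilation returning serialized PTX/LTO-IR/CUBIN.
    return source.encode()


def get_object_code(source: str) -> bytes:
    """Return serialized object code, compiling only on a cache miss."""
    CACHE_DIR.mkdir(parents=True, exist_ok=True)
    key = hashlib.sha256(source.encode()).hexdigest()
    path = CACHE_DIR / f"{key}.bin"
    if path.exists():
        return path.read_bytes()  # cache hit: skip recompilation entirely
    data = compile_source(source)
    path.write_bytes(data)  # cache miss: compile once, persist for next run
    return data
```

The cache key is a hash of the source, so unchanged kernels are never recompiled across processes.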
We want to automate this so that we don't need to revisit all past PRs by release time. During a recent offline discussion, https://sphinx-github-changelog.readthedocs.io/en/latest/ was suggested, but upon a closer...
Today, `LaunchConfig` only supports the `cuLaunchKernel` driver API to launch kernels on a single GPU. When extending to broader use cases that require inter-SM synchronization or multi-GPU synchronization,...
Getting the current device using `cuda.core` is quite a bit slower than CuPy:

```python
In [1]: import cupy as cp

In [2]: %timeit cp.cuda.Device()
69 ns ± 0.496 ns per...
```
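A toy illustration (not `cuda.core` or CuPy internals) of one common way this kind of per-call overhead is reduced: returning a cached device object instead of constructing a fresh one on every call. The `Device` class and `_cached` singleton below are made up for the comparison.

```python
import timeit

class Device:
    """Stand-in for a device handle; the constructor mimics per-call work."""
    def __init__(self):
        self.device_id = 0  # stand-in for a driver query

_cached = Device()

def fresh():
    # Constructs a new object every call, paying allocation + init cost.
    return Device()

def cached():
    # Returns the memoized singleton, avoiding construction entirely.
    return _cached

t_fresh = timeit.timeit(fresh, number=100_000)
t_cached = timeit.timeit(cached, number=100_000)
```

Whether caching is the actual source of the gap here would need profiling; the sketch only shows the pattern.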
For example, attestation seems like a fancy thing we can easily do https://scientific-python.org/specs/spec-0008/#example-workflow
Would it be possible to make numpy an optional dependency for `cuda.core`? For example, if you just want to use `cuda.core` to query system device properties, installing a BLAS implementation...
https://github.com/dmlc/dlpack/releases/tag/v1.1
Currently the program name when compiling `c++` code with `Program` always [defaults](https://github.com/NVIDIA/cuda-python/blob/main/cuda_core/cuda/core/experimental/_program.py#L399) to `'default_program'`. This makes errors a little less descriptive than what is currently supported by `numba-cuda`. For instance...
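A sketch of what a user-supplied program name could buy in diagnostics. This is a hypothetical API, not the current `Program` signature: the toy class below just shows the name being threaded into the error message instead of `'default_program'`.

```python
class Program:
    """Toy model: accept a user-supplied name for better error messages."""

    def __init__(self, code, name="default_program"):
        self.code = code
        self.name = name

    def compile(self):
        if "syntax error" in self.code:
            # The name lets the diagnostic point at the originating source.
            raise RuntimeError(f"{self.name}(1): compilation failed")
        return b"cubin"  # stand-in for compiled object code
```

With `Program(src, name="my_kernel.cu")`, a failure reads `my_kernel.cu(1): ...` rather than `default_program(1): ...`, matching what `numba-cuda` already offers.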