cuda-python icon indicating copy to clipboard operation
cuda-python copied to clipboard

CUDA Python: Performance meets Productivity

Results 167 cuda-python issues
Sort by recently updated
recently updated
newest added

Currently the attributes of `StridedMemoryView` are as follows: https://github.com/NVIDIA/cuda-python/blob/8c841cdb24f64e65138cd2658d30fdeabd18769b/cuda_core/cuda/core/experimental/_memoryview.pyx#L24-L32 There is a todo in the code noting that this is worth converting to Cython types. Would also support this recommendation...

enhancement
P1
cuda.core

See the discussion starting https://github.com/NVIDIA/cuda-python/pull/470#issuecomment-2690826121.

documentation
P1
cuda.core

triage
P2
feature
cuda.bindings

tl;dr: For the Python 3.13 free-threading build (`cp313t`), the per-thread default stream is enabled and used by default. Users need to set `CUDA_PYTHON_CUDA_PER_THREAD_DEFAULT_STREAM=0` to explicitly opt out and restore the...

triage
P1
RFC

It seems hatchling (according to @rwgk from https://github.com/NVIDIA/cccl/pull/3201#issue-2751245974) and pip (#476) are confused by the conflict between our intention of `cuda` being a namespace package and `cuda/__init__.py` still exists today.

bug
P0
cuda.bindings

On Linux Ubuntu 22.08, I've recently updated the cuda-python package from 12.6 to 12.8.0 in my Python project and when I'm now trying to do: `from cuda import cuda, cudart,...

triage

During investigation of #454 and related doc rendering issues, it has come to my attention that the type checking detection logic does not work well with Sphinx (ex: https://github.com/sphinx-doc/sphinx/issues/13137, https://github.com/sphinx-doc/sphinx/issues/11225)....

documentation
enhancement
P1
cuda.core

I think Sphinx is able to figure out the types and keep them in the document by inspecting the typing info, which going forward should be our source of truth...

documentation
P1
cuda.core

Prior art: https://docs.cupy.dev/en/stable/reference/cuda.html#texture-and-surface-memory

triage
feature
cuda.core