cuda-python
cuda-python copied to clipboard
CUDA Python: Performance meets Productivity
Currently the attributes of `StridedMemoryView` are as follows: https://github.com/NVIDIA/cuda-python/blob/8c841cdb24f64e65138cd2658d30fdeabd18769b/cuda_core/cuda/core/experimental/_memoryview.pyx#L24-L32 There is a todo in the code noting that this is worth converting to Cython types. Would also support this recommendation...
See the discussion starting https://github.com/NVIDIA/cuda-python/pull/470#issuecomment-2690826121.
tl;dr: For the Python 3.13 free-threading build (`cp313t`), the per-thread default stream is enabled and used by default. Users need to set `CUDA_PYTHON_CUDA_PER_THREAD_DEFAULT_STREAM=0` to explicitly opt out and restore the...
It seems hatchling (according to @rwgk from https://github.com/NVIDIA/cccl/pull/3201#issue-2751245974) and pip (#476) are confused by the conflict between our intention of `cuda` being a namespace package and `cuda/__init__.py` still exists today.
On Linux Ubuntu 22.08, I've recently updated the cuda-python package from 12.6 to 12.8.0 in my Python project and when I'm now trying to do: `from cuda import cuda, cudart,...
During investigation of #454 and related doc rendering issues, it has come to my attention that the type checking detection logic does not work well with Sphinx (ex: https://github.com/sphinx-doc/sphinx/issues/13137, https://github.com/sphinx-doc/sphinx/issues/11225)....
I think Sphinx is able to figure out the types and keep them in the document by inspecting the typing info, which going forward should be our source of truth...
Prior art: https://docs.cupy.dev/en/stable/reference/cuda.html#texture-and-surface-memory