Leo Fang

Results 1175 comments of Leo Fang

Summarizing offline discussion with @xiakun-lu below: 1. It was a mistake that I made to not expose `IsStreamT`, `DevicePointer`, etc., to the public namespace - The purpose was for cuda.core...

Discussed offline with Phillip, we'd like to push this out to cuda-core v0.6.0 if it's OK @xiakun-lu? > Is there a list of things that need to be exposed? I...

I have in mind: host: ```python arr = cp.empty(...) ... view = StridedMemotyView(arr, s=-1) launch(stream, config, kernel, view.as_mdspan()) ``` device: ```python template __global__ void kernel(cuda::std::mdspan a) { ... } ```

In an offline discussion @cpcloud pointed out that with Python `memoryview` assignment is allowed: ```python >>> m = memoryview(bytearray(10)) >>> m[1] 0 >>> m[1]= 3 >>> m[1] 3 ``` This...

(Update: I updated the issue description to cover CTK libraries such as NVRTC and cuFILE, I missed that when summarizing the meeting notes I took)

cc @shwina @tpn @ZzEeKkAa @isvoid for vis

@zzeekkaa does Sebastian's prototype work for you out of box, or does it require additional tweaks?

@rwgk could you check if https://github.com/NVIDIA/cuda-python/pull/1468 would fix it?

Current thought after discussing with @brycelelbach: For packages that depend on the compiler (NVCC, numba-cuda, or cutile-python in the future), they should agree on a lower bound of the compiler...

@Andy-Jost should prove me wrong, but I believe we already kept a door for this. - In the exporting process: - create an `Event` with the [`support_ipc` flag](https://nvidia.github.io/cuda-python/cuda-core/latest/generated/cuda.core.experimental.EventOptions.html) set -...