cuda-python
cuda-python copied to clipboard
Provide a fast path for constructing a `StridedMemoryView` from a `cupy.ndarray`
I have in mind:
host:
arr = cp.empty(...)
...
view = StridedMemotyView(arr, s=-1)
launch(stream, config, kernel, view.as_mdspan())
device:
template <typename T>
__global__ void kernel(cuda::std::mdspan<T, ...> a) {
...
}