cccl
[FEA]: A `cuda::uninitialized_async_buffer<T, Properties...>` that handles allocation / deallocation and other low-level details for the user
We want this as a wrapper around a plain memory allocation that takes care of alignment, proper size calculations, and deallocation of the memory.
Importantly, we also need it to bind the stream to the allocation, so that `cuda::launch` has both available.
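To make the request concrete, here is a rough host-only sketch of the shape such a wrapper could take. It is not the proposed implementation: `stream_ref`, `host_mock_resource`, `allocate_async` / `deallocate_async`, and `uninitialized_async_buffer_sketch` are all hypothetical names invented for illustration, the `Properties...` parameter pack is omitted, and the mock resource allocates host memory where a real one would perform a stream-ordered device allocation.

```cpp
#include <cstddef>
#include <limits>
#include <new>
#include <utility>

// Hypothetical stand-in for a CUDA stream handle.
struct stream_ref { int id; };

// Hypothetical resource with a stream-ordered interface. This mock uses
// aligned host allocation so the sketch compiles without the CUDA toolkit.
struct host_mock_resource {
    int live_allocations = 0;

    void* allocate_async(std::size_t bytes, std::size_t alignment, stream_ref) {
        ++live_allocations;
        return ::operator new(bytes, std::align_val_t{alignment});
    }

    void deallocate_async(void* p, std::size_t /*bytes*/, std::size_t alignment,
                          stream_ref) noexcept {
        --live_allocations;
        ::operator delete(p, std::align_val_t{alignment});
    }
};

// Owns raw, uninitialized storage for `count` objects of type T and remembers
// the stream the allocation was made on. No T is ever constructed here.
template <class T, class Resource>
class uninitialized_async_buffer_sketch {
    Resource*   resource_;
    stream_ref  stream_;
    std::size_t count_;
    T*          data_;

    // Size calculation with an overflow check.
    static std::size_t bytes_for(std::size_t count) {
        if (count > std::numeric_limits<std::size_t>::max() / sizeof(T)) {
            throw std::bad_array_new_length{};
        }
        return count * sizeof(T);
    }

public:
    uninitialized_async_buffer_sketch(Resource& resource, stream_ref stream,
                                      std::size_t count)
        : resource_(&resource), stream_(stream), count_(count),
          data_(static_cast<T*>(
              resource.allocate_async(bytes_for(count), alignof(T), stream))) {}

    // Move-only: exactly one owner is responsible for the deallocation.
    uninitialized_async_buffer_sketch(const uninitialized_async_buffer_sketch&) = delete;
    uninitialized_async_buffer_sketch& operator=(const uninitialized_async_buffer_sketch&) = delete;
    uninitialized_async_buffer_sketch& operator=(uninitialized_async_buffer_sketch&&) = delete;

    uninitialized_async_buffer_sketch(uninitialized_async_buffer_sketch&& other) noexcept
        : resource_(other.resource_), stream_(other.stream_),
          count_(std::exchange(other.count_, 0)),
          data_(std::exchange(other.data_, nullptr)) {}

    // Deallocation is ordered on the same stream the allocation was bound to.
    ~uninitialized_async_buffer_sketch() {
        if (data_ != nullptr) {
            resource_->deallocate_async(data_, count_ * sizeof(T), alignof(T), stream_);
        }
    }

    T*          data() const noexcept       { return data_; }
    std::size_t size() const noexcept       { return count_; }
    std::size_t size_bytes() const noexcept { return count_ * sizeof(T); }
    stream_ref  stream() const noexcept     { return stream_; }
};
```

Because the buffer carries both `data()` and `stream()`, a launch facility can take the buffer alone and recover the pointer and the stream it has to order against, which is the binding asked for above.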
### Tasks