cccl
cccl copied to clipboard
[FEA]: Introduce combinational ranges into cuda.parallel
Is this a duplicate?
- [x] I confirmed there appear to be no duplicate issues for this request and that I agree to the Code of Conduct
Area
General CCCL
Is your feature request related to a problem? Please describe.
We are lacking combinational ranges, such as thrust::transform_{input,output}_iterator, thrust::zip_iterator, and cub::CacheModified{Input,Output}Iterator in cuda.parallel.
Describe the solution you'd like
Design and implement combinational ranges for cuda.parallel.experimental.itertools. We might consider map(python_function, iterable) for thrust::transform_{input,output}_iterator, zip(rng1, rng2, ...) for thrust::zip_iterator, and cache(modifier, iterable) where modifier is from https://github.com/NVIDIA/cccl/blob/0e09815f7f59a369d63fc8fc3c12dcc697561fbc/cub/cub/thread/thread_load.cuh#L60-L69
Describe alternatives you've considered
No response
Additional context
No response