[FEA]: Introduce combinational ranges into cuda.parallel

Open gevtushenko opened this issue 1 year ago • 0 comments

Is this a duplicate?

[x] I confirmed there appear to be no duplicate issues for this request and that I agree to the Code of Conduct

Area

General CCCL

Is your feature request related to a problem? Please describe.

We are lacking combinational ranges, such as thrust::transform_{input,output}_iterator, thrust::zip_iterator, and cub::CacheModified{Input,Output}Iterator in cuda.parallel.

Describe the solution you'd like

Design and implement combinational ranges for cuda.parallel.experimental.itertools. We might consider map(python_function, iterable) for thrust::transform_{input,output}_iterator, zip(rng1, rng2, ...) for thrust::zip_iterator, and cache(modifier, iterable) where modifier is from https://github.com/NVIDIA/cccl/blob/0e09815f7f59a369d63fc8fc3c12dcc697561fbc/cub/cub/thread/thread_load.cuh#L60-L69

Describe alternatives you've considered

No response

Additional context

No response

Sep 29 '24 20:09 gevtushenko