cudaKDTree icon indicating copy to clipboard operation
cudaKDTree copied to clipboard

cub::DeviceRadixSort or thrust::sort

Open mcmingchang opened this issue 5 months ago • 3 comments

In my previous experience, it has been shown that if sort is called multiple times in a short period of time, the latency of the cube library is much shorter than that of the thrust library. You can choose between cube:: DeviceRadixSort or cube:: DeviceSegmentedRadixSort. I wonder if you are interested in trying them out, and I look forward to your reply.

mcmingchang avatar Sep 24 '24 07:09 mcmingchang