MithunR

156 comments by MithunR

I'll explore the data and post more info here. I'm still looking at the call stack, etc. There are certainly wins to be had by switching the query from using...

@jlowe mentioned this: @revans2 already has a fix/workaround in place in Spark-RAPIDS to detect queries of the form `instr(...,...) > 0` and convert them into a call to `strings::contains()`. It...
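The equivalence behind that rewrite can be sketched on the CPU with `std::string` (the function names below are illustrative stand-ins, not cudf's device APIs): Spark's `instr` returns the 1-based position of the first match, or 0 when absent, so `instr(s, key) > 0` reduces to a plain containment test.

```cpp
#include <string>

// Spark-style instr: 1-based position of the first occurrence of key
// in s, or 0 if key does not occur. (CPU sketch, not the cudf kernel.)
int instr(const std::string& s, const std::string& key) {
  auto pos = s.find(key);
  return pos == std::string::npos ? 0 : static_cast<int>(pos) + 1;
}

// Containment test, analogous to what strings::contains computes.
bool contains(const std::string& s, const std::string& key) {
  return s.find(key) != std::string::npos;
}
```

Since the query only consumes the comparison `> 0`, the cheaper `contains` form produces the same boolean result without materializing positions.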

I have done some exploration of the data in question, and of the query:
1. The query has about 25 invocations of `instr`, which amounts to 25 calls to `strings::contains()` per...

I'm working on a block-parallel version of `contains()` that looks a lot like the warp-parallel one. Testing it out now.

As an aside, I should mention that the data distributions I mentioned above can be ignored, for the moment. The sample is not representative of the user's data.

I have a naive block-parallel implementation [here](https://github.com/mythrocks/cudf/blob/block-parallel-contains/cpp/src/strings/search/find.cu#L378). This change switches to block-parallel if the average string length reaches 256 or 512. (I've tried both.) Here are some results from running...

From exploring the customer's data, it appears that the majority of the search input strings are under 256 bytes long, although there are outliers (some being 64K long). The average...

I've generated a local dataset whose search-key distributions closely match those of the reported slow case. (This includes the order of if-else clauses, with...
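A generator along these lines can be sketched as follows. The log-normal length distribution, the median of 144 bytes, and the 64K cap are assumptions standing in for the customer's actual distribution, which is not reproduced here.

```cpp
#include <algorithm>
#include <cmath>
#include <random>
#include <string>
#include <vector>

// Generate n synthetic strings with a skewed length distribution:
// most strings short (median ~144 bytes), with long-tail outliers
// clamped at 64K. All parameters here are illustrative assumptions.
std::vector<std::string> make_strings(std::size_t n, unsigned seed = 42) {
  std::mt19937 gen(seed);
  std::lognormal_distribution<double> len_dist(std::log(144.0), 0.75);
  std::uniform_int_distribution<int> ch('a', 'z');
  std::vector<std::string> out;
  out.reserve(n);
  for (std::size_t i = 0; i < n; ++i) {
    double raw = len_dist(gen);
    auto len = static_cast<std::size_t>(std::clamp(raw, 1.0, 65536.0));
    std::string s(len, ' ');
    for (auto& c : s) c = static_cast<char>(ch(gen));
    out.push_back(std::move(s));
  }
  return out;
}
```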

P.S. I think I left the impression that the `strings::contains` kernel is "slow", in absolute terms. That isn't accurate. More correctly, the profiles of the Spark tasks indicate that the...

The first approach (processing 1 string/threadblock instead of 1 string/warp) was a bust. At the user's average string size of 144 characters, it appeared that too many threads in the...
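The arithmetic behind the idle-thread problem is simple to check. Assuming a 256-thread block (an assumption; the actual block size was not stated), a string of length L and a key of length k offer at most L - k + 1 candidate start positions, so at ~144 characters most of the block never gets any work:

```cpp
#include <algorithm>

// Fraction of a block's threads that receive no work when one string
// is assigned per threadblock and each thread tests one start position.
double idle_fraction(int block_size, int str_len, int key_len) {
  int starts = std::max(0, str_len - key_len + 1);  // candidate positions
  int active = std::min(block_size, starts);        // threads with work
  return 1.0 - static_cast<double>(active) / block_size;
}
```

For `idle_fraction(256, 144, 1)` this gives 1 - 144/256 = 0.4375, i.e. over 40% of the block sits idle before accounting for the early-exit divergence among the threads that do run.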