
[FEA] Add support for `REGEXP_SUBSTR`

Open andygrove opened this issue 3 years ago • 2 comments

Is your feature request related to a problem? Please describe.

Spark recently added support for REGEXP_SUBSTR in https://github.com/apache/spark/commit/5adcddb87a052ce8e3b3c917c61f019bea5532ae

Describe the solution you'd like

Support this on the GPU.

Describe alternatives you've considered

None

Additional context

None

andygrove • Jul 08 '22 17:07

This is a runtime-replaceable operator, so if we support RegExpExtract there is nothing more we need to do. We might want to add a test for RegExpSubString just to be sure that the replacement happens as we expect, but it should just work out of the box.

revans2 • Jul 08 '22 19:07
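
For readers unfamiliar with runtime-replaceable expressions, the comment above can be illustrated with a small plain-Python model. It is only a sketch and is not Spark or spark-rapids code. It assumes that Spark rewrites `regexp_substr(str, regexp)` to `NULLIF(regexp_extract(str, regexp, 0), '')`, which should be checked against the linked Spark commit. The helper names are hypothetical, and Python's `re` module stands in for Java regular expressions, which differ in some syntax.

```python
import re
from typing import Optional


def regexp_extract(s: Optional[str], pattern: Optional[str], idx: int) -> Optional[str]:
    """Model of regexp_extract: group `idx` of the first match, '' when nothing matches."""
    if s is None or pattern is None:
        return None  # NULL in, NULL out
    m = re.search(pattern, s)
    if m is None:
        return ""
    group = m.group(idx)
    return group if group is not None else ""


def nullif(a: Optional[str], b: Optional[str]) -> Optional[str]:
    """Model of SQL NULLIF: NULL when both arguments are equal, otherwise the first."""
    return None if a == b else a


def regexp_substr(s: Optional[str], pattern: Optional[str]) -> Optional[str]:
    """Assumed rewrite: regexp_substr is expressed purely through regexp_extract."""
    return nullif(regexp_extract(s, pattern, 0), "")


if __name__ == "__main__":
    print(regexp_substr("Steven Jones and Stephen Smith", "Ste(v|ph)en"))
    print(regexp_substr("no digits here", r"\d+"))
```

Under that assumption, a test for the new function mainly needs to confirm that a non-matching input yields NULL rather than an empty string, since everything else is already covered by the behaviour of `regexp_extract`.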

Blocked until the build compiles again, which requires https://github.com/NVIDIA/spark-rapids/issues/5806, https://github.com/NVIDIA/spark-rapids/issues/5807, and https://github.com/NVIDIA/spark-rapids/issues/5827 to be resolved.

anthony-chang • Jul 15 '22 22:07