Support `regexp_match` for `LargeUtf8`

Open chebbyChefNEQ opened this issue 1 year ago • 0 comments

#899 enabled `regexp_match`, but calling `regexp_match(col, pattern)` where `col` is of type `LargeUtf8` makes the code panic.

A few notes:

  1. `regexp_match` on `LargeUtf8` is probably highly inefficient, as the data size practically doubles in memory (compared to `Utf8`).
  2. I'm not sure how to dispatch to the `arrow_cast` function in DataFusion expressions.
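
To make the `Utf8` / `LargeUtf8` difference concrete, here is a minimal std-only sketch of what narrowing a `LargeUtf8` column to `Utf8` involves. It is not the Arrow or DataFusion API: `StringColumn`, `large_from_strs`, `narrow_to_utf8`, and `value_at` are hypothetical names made up for illustration. The only fact assumed from Arrow is that `Utf8` stores 32-bit offsets and `LargeUtf8` stores 64-bit offsets into a shared byte buffer.

```rust
/// Hypothetical stand-in for an Arrow string array: one contiguous byte
/// buffer plus offsets marking where each value starts and ends.
/// `O = i32` models `Utf8`; `O = i64` models `LargeUtf8`.
pub struct StringColumn<O> {
    pub offsets: Vec<O>,
    pub values: Vec<u8>,
}

/// Build a LargeUtf8-style column (i64 offsets) from string slices.
pub fn large_from_strs(items: &[&str]) -> StringColumn<i64> {
    let mut offsets = vec![0i64];
    let mut values = Vec::new();
    for s in items {
        values.extend_from_slice(s.as_bytes());
        offsets.push(values.len() as i64);
    }
    StringColumn { offsets, values }
}

/// Narrow to a Utf8-style column (i32 offsets). Returns an error instead
/// of panicking when an offset does not fit in 32 bits.
pub fn narrow_to_utf8(col: &StringColumn<i64>) -> Result<StringColumn<i32>, String> {
    let offsets = col
        .offsets
        .iter()
        .map(|&o| i32::try_from(o).map_err(|_| format!("offset {} overflows i32", o)))
        .collect::<Result<Vec<i32>, String>>()?;
    // The value bytes are copied as-is; only the offset width changes.
    Ok(StringColumn { offsets, values: col.values.clone() })
}

/// Read value `i` from a Utf8-style column.
pub fn value_at(col: &StringColumn<i32>, i: usize) -> &str {
    let start = col.offsets[i] as usize;
    let end = col.offsets[i + 1] as usize;
    std::str::from_utf8(&col.values[start..end]).expect("valid UTF-8")
}
```

In this model, narrowing `["foo", "barbaz"]` yields offsets `[0, 3, 9]` in both widths, and a column whose last offset exceeds `i32::MAX` is rejected with an error. A real fix would presumably either rely on a cast provided by the Arrow crates or dispatch on the offset type directly; which of these fits DataFusion expressions is the open question in note 2.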

chebbyChefNEQ, May 25 '23 21:05