Add support for `RegExpExtract` and `RegExpExtractAll`

Open andygrove opened this issue 1 month ago • 2 comments

What is the problem the feature request solves?

Spark docs:

  • https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.regexp_extract.html
  • https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.regexp_extract_all.html
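For reference, here is a rough sketch of the behaviour the two functions are documented to have, written as plain Python with the standard `re` module. This is only an illustration and not Comet or Spark code: the helper names simply mirror the Spark function names, the default group index of 1 is an assumption of this sketch, and Python's regex dialect differs from the Java regex engine Spark uses, so edge cases may not line up.

```python
import re
from typing import List, Optional


def regexp_extract(s: Optional[str], pattern: str, idx: int = 1) -> Optional[str]:
    """Return group `idx` of the first match, or "" when nothing matched."""
    if s is None:
        # Assumption: a null input yields a null result.
        return None
    m = re.search(pattern, s)
    if m is None:
        # No match at all: the documented result is an empty string.
        return ""
    group = m.group(idx)
    # A group that did not participate in the match also yields "".
    return group if group is not None else ""


def regexp_extract_all(s: Optional[str], pattern: str, idx: int = 1) -> Optional[List[str]]:
    """Return group `idx` of every match, in order of appearance."""
    if s is None:
        return None
    return [m.group(idx) or "" for m in re.finditer(pattern, s)]
```

With these helpers, `regexp_extract("100-200", r"(\d+)-(\d+)", 1)` gives `"100"`, and `regexp_extract_all("100-200, 300-400", r"(\d+)-(\d+)", 1)` gives `["100", "300"]`.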

Describe the potential solution

No response

Additional context

No response

andygrove, Nov 06 '25 14:11

@rishabhjain1712

rishabhjain1712, Nov 08 '25 17:11

Hi @andygrove, I’d like to work on this issue and add support for `RegExpExtract` and `RegExpExtractAll`.

Could you please point me to the parts of the codebase where similar functions (like other string or regex expressions) are implemented, so I can make the additions in the right place?

Thanks!

rishabhjain1712, Nov 11 '25 01:11