Implementation for regex_instr
Which issue does this PR close?
- Closes #13009
Rationale for this change
Implements a regex SQL standard function in datafusion
What changes are included in this PR?
Implementation, tests, benches and docs for the regexp_instr function
Are these changes tested?
Yes
Are there any user-facing changes?
Yes
No
Thank you. I'm wondering what's the reference system for this function's behavior (like postgres or others)
Thank you. I'm wondering what's the reference system for this function's behavior (like postgres or others)
The reference system for this function's behaviour is postgres.
Thank you for this PR @nirnayroy
Can you please resolve the CI error: https://github.com/apache/datafusion/actions/runs/14820525339/job/41754009017?pr=15928
If you encounter an error, run './dev/update_function_docs.sh' and commit
Thank you for this PR @nirnayroy
Can you please resolve the CI error: https://github.com/apache/datafusion/actions/runs/14820525339/job/41754009017?pr=15928
If you encounter an error, run './dev/update_function_docs.sh' and commit
I ran the bash script, but Iām not sure if the workflow succeeded.
fixed the cippy errors showing up in the workflow
fixed formatting error in workflow
@Omega359 I wonder if you might have time to review this PR?
Of course @alamb, not sure how I missed this one. It may be a day or two though
Hi @blaginin, thanks for the review and regret the delay in reply. I think I have rectified a majority of the concerns raised. Please have a look again.
Hi @blaginin , thanks for the help and suggestions for improvement. I have addressed the requested changes. Please have another look.
Tests are failing. If that helps, you can run the CI command locally to debug:
I have tried running it and the tests are passing on my local.
I'll pull this branch later this week and run the tests but in general this PR is looking pretty good! I left a few comments/suggestions for a few things I found from a quick review.
Run extended tests
Run extended tests
It works!
Clippy failures related to rand update (I think https://github.com/apache/datafusion/pull/16062)
Edit: looks like the usages of rand for the benchmark was updated in the above commit ... I'm thinking the additions in this PR do not reflect that change.
Run extended tests
Run extended tests
LGTM :) @blaginin
@nirnayroy thank you so much for your work, i think that's a very useful new function š welcome to the project!!!
š