[SPARK-48280][SQL] Improve collation testing surface area using expression walking

Open mihailom-db opened this issue 1 year ago • 1 comments

What changes were proposed in this pull request?

This PR introduces an Expression Walker in several forms to improve the collation testing surface area. The added tests include:

  1. Expression Walker for expression evaluation
  2. Expression Walker for SQL query examples
  3. Expression Walker for code generation (codegen)
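
To illustrate the general idea of expression walking, here is a minimal, self-contained sketch in Python. It is not the code added by this PR and does not use any Spark API: the registry, the function names (`toy_contains`, `toy_startswith`, `toy_find_in_set`), the sample inputs, and the invariant being checked are all hypothetical, and the bug in `toy_find_in_set` is invented purely for demonstration. The walker visits every registered function and flags those whose result changes when only the case of the inputs changes under a case-insensitive collation.

```python
from typing import Callable, Dict, List, Tuple

Args = Tuple[str, ...]
Fn = Callable[[Args, str], object]


def _key(s: str, collation: str) -> str:
    """Toy collation key: lowercase for UTF8_LCASE, unchanged otherwise."""
    return s.lower() if collation == "UTF8_LCASE" else s


def toy_contains(args: Args, collation: str) -> bool:
    left, right = args
    return _key(right, collation) in _key(left, collation)


def toy_startswith(args: Args, collation: str) -> bool:
    left, prefix = args
    return _key(left, collation).startswith(_key(prefix, collation))


def toy_find_in_set(args: Args, collation: str) -> int:
    # Deliberately ignores the collation (an invented bug for this sketch).
    word, csv = args
    parts = csv.split(",")
    return parts.index(word) + 1 if word in parts else 0


# Hypothetical registry: function name -> implementation.
REGISTRY: Dict[str, Fn] = {
    "toy_contains": toy_contains,
    "toy_startswith": toy_startswith,
    "toy_find_in_set": toy_find_in_set,
}

# Hypothetical sample inputs for each registered function.
SAMPLES: Dict[str, List[Args]] = {
    "toy_contains": [("Hello World", "WORLD")],
    "toy_startswith": [("Spark", "sp")],
    "toy_find_in_set": [("abc", "x,ABC,y")],
}


def walk(registry: Dict[str, Fn], samples: Dict[str, List[Args]]) -> List[str]:
    """Return the names of functions that are not case-insensitive under UTF8_LCASE."""
    failures = []
    for name, fn in registry.items():
        for args in samples[name]:
            baseline = fn(args, "UTF8_LCASE")
            upper_args = tuple(a.upper() for a in args)
            # Under a case-insensitive collation, changing only the case
            # of the inputs should not change the result.
            if fn(upper_args, "UTF8_LCASE") != baseline:
                failures.append(name)
                break
    return failures


if __name__ == "__main__":
    print(walk(REGISTRY, SAMPLES))
```

Because the walker iterates over the whole registry rather than a hand-picked list, a newly registered function without collation handling is checked automatically as soon as it has sample inputs.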

Why are the changes needed?

Collations introduced changes to many functions and parts of the code. These tests aim to catch existing errors and to prevent new functions from being added without proper collation support. To emphasise the importance of these tests, here are some of the relevant tickets that were opened as a byproduct of this testing:

  • https://issues.apache.org/jira/browse/SPARK-48472
  • https://issues.apache.org/jira/browse/SPARK-48572
  • https://issues.apache.org/jira/browse/SPARK-48574
  • https://issues.apache.org/jira/browse/SPARK-48600
  • https://issues.apache.org/jira/browse/SPARK-48662

Does this PR introduce any user-facing change?

No.

How was this patch tested?

This PR is only related to testing.

Was this patch authored or co-authored using generative AI tooling?

No.

mihailom-db avatar May 30 '24 07:05 mihailom-db

Can you update the PR title to reflect the changes?

dbatomic avatar Jun 07 '24 10:06 dbatomic

This is now ready to merge: all tests have passed now that find_in_set was fixed this morning.

mihailom-db avatar Jul 03 '24 12:07 mihailom-db

thanks, merging to master!

cloud-fan avatar Jul 04 '24 08:07 cloud-fan