Robert (Bobby) Evans comments

Results: 204 comments of Robert (Bobby) Evans

[FEA] Finish LIKE support

This is likely to be a really low priority, because I don't know of any queries where the LIKE pattern is non-scalar.
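To make the scalar versus non-scalar distinction concrete, here is a minimal Python sketch. It is not the plugin's implementation: the helper names (`like_to_regex`, `like_scalar`, `like_per_row`) are hypothetical, and it assumes only the `%` and `_` wildcards, with no escape-character handling.

```python
import re


def like_to_regex(pattern):
    """Translate a simplified LIKE pattern into a compiled regex.

    Assumption: only '%' (any run of characters) and '_' (exactly one
    character) are special; escape characters are ignored in this sketch.
    """
    parts = []
    for ch in pattern:
        if ch == "%":
            parts.append(".*")
        elif ch == "_":
            parts.append(".")
        else:
            parts.append(re.escape(ch))
    return re.compile("".join(parts), re.DOTALL)


def like_scalar(values, pattern):
    """Scalar pattern: compile once, apply to every row."""
    regex = like_to_regex(pattern)
    return [regex.fullmatch(v) is not None for v in values]


def like_per_row(values, patterns):
    """Non-scalar pattern: each row carries its own pattern."""
    return [
        like_to_regex(p).fullmatch(v) is not None
        for v, p in zip(values, patterns)
    ]
```

In the scalar case the pattern is translated once for the whole column, while in the per-row case each row needs its own translation, which is the less common shape the comment refers to.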

[BUG] Failed test_hash_reduction_sum [DATAGEN_SEED=1700579573, INJECT_OOM, IGNORE_ORDER, INCOMPAT, APPROXIMATE_FLOAT] on CI

I think this case might even change from run to run. Our aggregations do not guarantee the order in which the sum is computed, and floating-point addition is not truly associative. My...
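The order dependence can be reproduced on the CPU with plain Python floats. This is only an illustrative sketch, not the failing test: the values and the tolerance are chosen for the example.

```python
import math

# The same three values, summed with two different groupings.
left_to_right = (0.1 + 0.2) + 0.3
right_to_left = 0.1 + (0.2 + 0.3)

# The two results differ in the last bit, so an exact comparison fails...
exactly_equal = left_to_right == right_to_left

# ...while an approximate comparison (tolerance chosen for illustration)
# treats them as the same answer.
approximately_equal = math.isclose(left_to_right, right_to_left, rel_tol=1e-9)

print(left_to_right, right_to_left, exactly_equal, approximately_equal)
```

When the summation order is not fixed, the last bits of the result can vary between runs, which is why a float sum is usually compared approximately rather than exactly.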

[BUG] get_json_object cannot handle ints or boolean values

So it looks like most of this has been fixed in 24.06 after the upmerge went in. I will retest things.

[FEA] Support function array_distinct

I think CUDF already supports this through dropListDuplicates https://github.com/rapidsai/cudf/blob/ac27757092e9ba2bc0656b6a7dfbc79ce8b5e76a/java/src/main/java/ai/rapids/cudf/ColumnView.java#L2375-L2386 We should be able to implement this without any issues, so long as dropListDuplicates supports the types.
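For reference, the per-row behavior being targeted can be sketched in plain Python. This is a hypothetical CPU-side model, not the CUDF or Spark code: it assumes duplicates are dropped keeping the first occurrence, that a null row stays null, and it does not model any special-case equality for floating-point values.

```python
def array_distinct(rows):
    """Drop duplicate elements within each list row.

    Assumptions for this sketch: first occurrence wins, a None row is
    passed through unchanged, and elements compare with plain ==.
    """
    result = []
    for row in rows:
        if row is None:
            result.append(None)
            continue
        seen = []
        for item in row:
            if item not in seen:
                seen.append(item)
        result.append(seen)
    return result


rows = [[1, 2, 3, None, 3], None, []]
distinct = array_distinct(rows)
```

Whether dropListDuplicates preserves element order in the same way is something to verify against the CUDF documentation rather than assume from this sketch.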

[FEA] Support function array_distinct

@phish3y happy to have you start to work on this. https://github.com/apache/spark/blob/0d7c07047a628bd42eb53eb49935f5e3f81ea1a1/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/collectionOperations.scala#L4036 is the CPU implementation that we want to try and target. It looks like they have special case equality...

[BUG] hash_aggregate_test.py::test_exact_percentile_reduction failed with DATAGEN_SEED=1705866905

[BUG] hash_aggregate_test.py::test_exact_percentile_reduction failed with DATAGEN_SEED=1705866905

[FEA] JSON input support

[FEA] JSON input support

[FEA] JSON input support