Andy Grove
Andy Grove
**Report needed documentation** We will likely need to implement more optimization rules, either in DataFusion, or in Dask SQL. There is no documentation currently on how to do this. **Describe...
Following on from https://github.com/dask-contrib/dask-sql/pull/803, this PR demonstrates how we can use a macro to make some of the boilerplate code more concise. More generally, this PR allows us to discuss...
**Is your feature request related to a problem? Please describe.** There are a number of improvements that we can make to DataFusion's optimizer that should benefit Dask SQL. **Describe the...
**Is your feature request related to a problem? Please describe.** DataFusion 13.0.0 will have new optimizations for TopK queries as described in https://godatadriven.com/blog/optimizing-topk-queries-in-datafusion/ **Describe the solution you'd like** Leverage this...
**Is your feature request related to a problem? Please describe.** I would like the pre-commit hook to run `cargo test` and `cargo +nightly fmt` so that we can format imports....
**Is your feature request related to a problem? Please describe.** The `eliminate_agg_filter` rule fails with a duplicate field error when running `test_agg_count`. This is the query: ```sql SELECT a, b,...
**Is your feature request related to a problem? Please describe.** See the following tests: - `test_multi_agg_count_no_group_by_dupe_distinct` - `test_agg_count` **Describe the solution you'd like** Enable these tests **Describe alternatives you've considered**...
**Is your feature request related to a problem? Please describe.** PR https://github.com/NVIDIA/spark-rapids/pull/4938 adds support for reading CSV and JSON strings as timestamps and it supports valid timestamps formatted in a...
**Is your feature request related to a problem? Please describe.** Once https://github.com/rapidsai/cudf/issues/11643 is merged, we will need to make corresponding changes in the regexp transpiler, particularly in the following method:...
Closes https://github.com/NVIDIA/spark-rapids/issues/6518 This is a follow on from https://github.com/NVIDIA/spark-rapids/pull/6548 to confirm that we should still be rejecting certain regexp patterns even though cuDF no longer rejects them. We reject them...