dask-sql
Distributed SQL Engine in Python using Dask
**Is your feature request related to a problem? Please describe.** There are a number of improvements that we can make to DataFusion's optimizer that should benefit Dask SQL. **Describe the...
Enabling more queries in test suite
**Is your feature request related to a problem? Please describe.** Currently, many of the custom plugins (such as `CREATE MODEL`) are not supported for non-default schemas, operating directly on `context.schema_name`....
Re-opening #758 here.
I'd like to be able to convert data representing time since the UNIX epoch to an explicit timestamp format with `to_timestamp`, as I can in Spark SQL and PostgreSQL. ```python from pyspark.sql...
When a column name contains non-lowercase characters, the parser fails to recognize it. It looks like the parser converts the names to lowercase when evaluating the schema. ```python...
When I try to create a table from a file using double quotes rather than single quotes, I get a parsing error. ```python from dask.distributed import Client from dask_sql import...
**Is your feature request related to a problem? Please describe.** DataFusion 13.0.0 will have new optimizations for TopK queries as described in https://godatadriven.com/blog/optimizing-topk-queries-in-datafusion/ **Describe the solution you'd like** Leverage this...
**Is your feature request related to a problem? Please describe.** I would like the pre-commit hook to run `cargo test` and `cargo +nightly fmt` so that we can format imports....
With https://github.com/rapidsai/cudf/pull/11755 merged in and reflected in our gpuCI images, we should no longer need #802 merged for gpuCI to pass.
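For the `to_timestamp` request listed above, the intended semantics can be sketched in plain Python. This is only an illustration of converting seconds since the UNIX epoch to explicit timestamps; it is not dask-sql's implementation, and `epoch_to_timestamp` is a hypothetical helper name. It assumes the input is in seconds and that the result should be expressed in UTC.

```python
from datetime import datetime, timezone


def epoch_to_timestamp(seconds):
    """Hypothetical helper: treat `seconds` as seconds since the UNIX epoch (UTC)."""
    return datetime.fromtimestamp(seconds, tz=timezone.utc)


# Example epoch values (assumed to be in seconds, not milliseconds).
epoch_values = [0, 1_000_000_000]

# Render each value as an explicit "YYYY-MM-DD HH:MM:SS" timestamp string.
timestamps = [
    epoch_to_timestamp(value).strftime("%Y-%m-%d %H:%M:%S")
    for value in epoch_values
]
print(timestamps)
```

Whether a SQL-level `to_timestamp` should assume seconds or accept a format argument is a design choice left to the issue discussion; the sketch only fixes one plausible interpretation.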