dask-sql icon indicating copy to clipboard operation
dask-sql copied to clipboard

Distributed SQL Engine in Python using Dask

Results 258 dask-sql issues
Sort by recently updated
recently updated
newest added

**What happened**: "SELECT (((((NOT t1.c0))AND(('A' LIKE 'B' ESCAPE '/'))))=(t2.c0)) FROM t1, t2" brings crash,when using CPU. However it is able to output result, when using GPU. **What you expected to...

bug
needs triage

https://github.com/dask-contrib/dask-sql/pull/1102 adds dynamic partition pruning functionality. While working on this, I noticed several features that could be used to enhance this optimization rule that are outside of the original intended...

enhancement
needs triage

Remove local `expression.rs` file and use the one in Arrow DataFusion Python instead.

Should unblock a few queries, still needs some work to get CPU tests passing

https://github.com/dask-contrib/dask-sql/pull/1027 added basic join reordering functionality. However, there are several things that need to be fixed and improved upon in the future: - [ ] Probably the biggest improvement we...

enhancement
datafusion
benchmark

**Is your feature request related to a problem? Please describe.** While working on #1184, I noticed that we were getting a misleading test pass on `test_filter_cast_timestamp`, and that currently we...

enhancement
needs triage

**What happened**: When performing a left semi or leftanti join, `getFieldList` and `getFieldNames` return an extra field that we need to filter out: https://github.com/dask-contrib/dask-sql/blob/5421bbf9b363ab15c657432e0c9d367c6f236df7/dask_sql/context.py#L849 **Minimal Complete Verifiable Example**: For example:...

bug
needs triage

With #1182 dash-sql supports left semi joins for GPU tables, however for CPU we're still just performing inner joins. I would like it if I could also perform true left...

enhancement
needs triage

Currently, something like ``` SELECT * FROM d_table WHERE d_date BETWEEN '2000-02-01' AND (CAST('2000-02-01' AS DATE) + 90 days ``` fails with a `ParserError("Expected ), found: days")`. Fixing this will...

bug
needs triage

https://github.com/dask-contrib/dask-sql/pull/1025 enhanced our logic in dealing with scalar timestamps. However, there are still a couple of bugs as marked in `test_scalar_timestamps`: - [ ] `SELECT TIMESTAMPDIFF(DAY, to_timestamp(d1), to_timestamp(d2)) AS dt...

bug
needs triage