dask-sql
dask-sql copied to clipboard
Distributed SQL Engine in Python using Dask
Closes #925
**Is your feature request related to a problem? Please describe.** Once https://github.com/dask-contrib/dask-sql/pull/854 is finalized and merged, there will only be one remaining use of `context.fqn` in the codebase, used to...
**Is your feature request related to a problem? Please describe.** Performing an intersect operation adds a `Distinct` operation to the query plan, however in situations where Distinct has already been...
**Is your feature request related to a problem? Please describe.** Currently, the automated version bumps done by Dependabot can get quite noisy since they require manual review and merging. **Describe...
I'm exploring different queries in ipython and wanting to inspect results. When I'm using jit unspilling in my LocalCUDACluster, Dask-SQL's sql magic returns a [proxy object](https://docs.rapids.ai/api/dask-cuda/stable/spilling.html#limitations): ``` In [6]: %%sql...
**Is your feature request related to a problem? Please describe.** A common filter applied to many sql queries is filtering out nulls for certain tables that usually get's pushed down...
**Is your feature request related to a problem? Please describe.** **Describe the solution you'd like** **Describe alternatives you've considered** **Additional context**
**Is your feature request related to a problem? Please describe.** In a variety of sql applications the dataset is already pre-partitioned by a given key (esp for parquet and orc)...
**What happened**: When running queries containing subqueries, we sometimes run into an issue where a subquery's tabular result will get passed directly into a plugin expecting a column or scalar...
**What happened**: When attempting to project a column that has been casted to a different dtype, unexpected behavior can occur due to the fact that DataFusion seems to map `cast`...