dask-sql
dask-sql copied to clipboard
Distributed SQL Engine in Python using Dask
Refactors our `DataContainer.assign` logic to avoid creating a dataframe with multiple columns on GPU, which silently fails and causes errors later down the road. Closes #1133
**What happened**: When attempting to select the same column with multiple aliases on a GPU table, we get a `ValueError`: ```python-traceback File ~/dev/dask-sql/main/dask_sql/context.py:503, in Context.sql(self, sql, return_futures, dataframes, gpu, config_options)...
In #886, we removed all dependencies on Dask-ML in favor of scikit-learn, cuML, and our own classes (ParallelPostFit and Incremental). Previously, when creating an experiment, `experiment_class` was expected to be...
**Is your feature request related to a problem? Please describe.** Currently dask-sql has integration with Fugue. The problem is some Fugue change will break this integration. Also Fugue-dask by default...
**What happened**: When setting `sql.identifier.case_sensitive=True` dask-sql still ends up converting identifiers to lowercase during the planning stage. **What you expected to happen**: Case sensitivity being honored when set to `True`...
this worked in dask 2022.8, but after the switch to dataFusion, I get this error when running queries. We believe this is because dataFusion doesn't support schemas - is it...
**What happened**: When attempting to query a column with a name containing `.` (ex: `Utf8("2.0")`) we get a runtime error: ```python-traceback RuntimeError Traceback (most recent call last) Cell In[1], line...
**Is your feature request related to a problem? Please describe.** As discussed in https://github.com/dask-contrib/dask-sql/pull/1033#issuecomment-1420855385 setting up the conda environment can take upto 6-8 minutes on certain test runs especially on...
In trying to use Apache SuperSet's DataSets tab to explore existing tables over the Dask-SQL server, I hit what appear to be unsupported metadata queries:  For both options, `root`...
**Is your feature request related to a problem? Please describe.** I'm working on plugging the server into Metabase with https://github.com/dask-contrib/dask-sql/pull/1063 and one of the queries received to poll the metadata...