dask-sql
dask-sql copied to clipboard
Distributed SQL Engine in Python using Dask
The ML implementation is still a bit experimental - we can improve on this: - [x] `SHOW MODELS` and `DESCRIBE MODEL` - [x] Hyperparameter optimizations, AutoML-like behaviour - [x] @romainr...
**Is your feature request related to a problem? Please describe.** As we are working becoming more backend agnostic (GPU/CPU) , we should look into a way of supporting multiple ML...
Google Colab depends on an older version of prompt-toolkit, breaking dask-sql imports. ``` !pip install dask-sql from dask_sql import Context ``` Trace: ``` --------------------------------------------------------------------------- ImportError Traceback (most recent call last)...
Hey, I was working on a Dask-SQL portion of a webpage that also had a separate Dask portion and logo. In lieu of an official logo, I created these 3...
The rules currently implemented in the `RelationalAlgebraGenerator` java class are a mixture of the standard rules and the ones from the blazingSQL project. So far, they have worked quite well....
**Is your feature request related to a problem? Please describe.** It would be great if filter operations in a sql clause could be pushed down to the io layer for...
I'd like to be able to execute correlated subqueries. It looks like the following example is successfully parsed by Calcite, but there's a logic error in mapping to Dask DataFrame...
Hi pytests are failing on my Mac using pytest tests e.g. ERROR tests/integration/test_postgres.py::test_select - docker.errors.APIError: 400 Client Error for http+docker://localhost/v1.41/containers/eb41afb2350c9a5f82a7f872db25539aed6115bf5cf2ae3e813b68d9e2e0736b/... I have docker installed Any ideas? Thanks Peter
**What happened**: It is relatively trivial to connect dask-sql's Presto server to Apache Superset for some basic visualization. Things seem to be working fine for Pandas-backed tables, but when attempting...
**What is your question?** Dask-SQL uses many workarounds in Dask/Pandas to return results that generally mirror PostgreSQL; these workarounds often have significant performance impacts. Some that I've worked on: -...