dask-sql
dask-sql copied to clipboard
Distributed SQL Engine in Python using Dask
Bumps [mimalloc](https://github.com/purpleprotocol/mimalloc_rust) from 0.1.29 to 0.1.30. Release notes Sourced from mimalloc's releases. Version 0.1.30 Changes Fix mimalloc extended Fix CI Commits 37235f0 v0.1.30 release 07321df Merge pull request #81 from...
Generalizes plugins to accept and use a schema name whenever possible, instead of the currently set schema. Closes #841
Several of our pytests in `tests/integration/test_model.py` have this note: ``` # TODO - many ML tests fail on clusters without sklearn - can we avoid this? @skip_if_external_scheduler ``` We should...
With our CI matrix significantly slimmed with the switch to DataFusion, now might be a good time to begin doing dedicated tests on OSX, since we are publishing packages for...
- Extract microseconds/millennium/millisecond - Last_day - Timestampadd - Ceil - Floor
When I try: ``` from dask_sql import Context import pandas as pd import dask.dataframe as dd from datetime import datetime c = Context() date = datetime(2021, 10, 3, 15, 53,...
As part of our initiative to move away from Dask-ML, I've migrated some code from Dask-ML into Dask-SQL to support ParallelPostFit.
This is currently being tracked by #832 and #839 to replace the `ParallelPostFit` and `Incremental` classes, respectively. In addition to these, we should look into removing all Dask-ML references in...
DataFusion's parser cannot parse `timestampadd(year, 2, d)` because `year` is not a keyword, so it assumes that this is an identifier and looks for a column with this name. This...