Sarah Yurick

Results 62 issues of Sarah Yurick

Re-opening #758 here.

Hi, I'd like to be able to parse a statement like `CREATE TABLE films2 AS TABLE films`, as in the second Postgres example found at [this source](https://www.postgresql.org/docs/current/sql-createtableas.html). I currently get...

enhancement
needs triage

**Is your feature request related to a problem? Please describe.** In #746 I am resolving the `test_stats_aggregation` integration test for our DataFusion SQL planner branch. However, I'm having trouble with...

enhancement
needs triage

Hi, I'd like to be able to handle nullable types and empty partitions before making a prediction in Dask-ML. With the former, Dask DataFrame columns with nullable types can be...

Hi, I'd like to be able to parse a statement like `CREATE TABLE films2 AS TABLE films`, as in the second Postgres example found at [this source](https://www.postgresql.org/docs/current/sql-createtableas.html). I currently tried...

https://github.com/dask-contrib/dask-sql/pull/1102 adds dynamic partition pruning functionality. While working on this, I noticed several features that could be used to enhance this optimization rule that are outside of the original intended...

enhancement
needs triage

https://github.com/dask-contrib/dask-sql/pull/1027 added basic join reordering functionality. However, there are several things that need to be fixed and improved upon in the future:
- [ ] Probably the biggest improvement we...

enhancement
datafusion
benchmark

Currently, something like
```
SELECT * FROM d_table WHERE d_date BETWEEN '2000-02-01' AND (CAST('2000-02-01' AS DATE) + 90 days
```
fails with a `ParserError("Expected ), found: days")`. Fixing this will...

bug
needs triage

https://github.com/dask-contrib/dask-sql/pull/1025 enhanced our logic in dealing with scalar timestamps. However, there are still a couple of bugs as marked in `test_scalar_timestamps`:
- [ ] `SELECT TIMESTAMPDIFF(DAY, to_timestamp(d1), to_timestamp(d2)) AS dt...

bug
needs triage

In #886, we removed all dependencies on Dask-ML in favor of scikit-learn, cuML, and our own classes (ParallelPostFit and Incremental). Previously, when creating an experiment, `experiment_class` was expected to be...

bug
needs triage