ibis issues

refactor: simplify relational operators

3

Many logically distinct operations like filter, project, and aggregate are fused together into a single operation. While this was useful initially for generating clean SQL, it has a number of...

cpcloud

developer-api

refactor

breaking change

feat(pyspark): add option to allow treating nans as nulls in the pyspark backend

9

Backends like the Pandas backend treat `np.nan`s as nulls when computing aggregations. The PySpark backend was not treating `np.nan`s as nulls, leading to results for aggregations that are inconsistent with...

timothydijamco

bug

backends - pyspark

breaking change
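
To make the NaN-versus-null distinction in the PySpark issue above concrete, here is a minimal pure-Python sketch. It does not use ibis, pandas, or PySpark; the `mean` helper and its `nan_as_null` flag are hypothetical names chosen only for illustration, not the option proposed in the issue.

```python
import math


def mean(values, nan_as_null):
    """Average the values, always skipping None (null).

    `nan_as_null` is a hypothetical flag: when True, NaN is skipped like a
    null; when False, NaN stays in the input and propagates into the result.
    """
    kept = [
        v for v in values
        if v is not None and not (nan_as_null and math.isnan(v))
    ]
    return sum(kept) / len(kept) if kept else None


data = [1.0, float("nan"), 3.0, None]

strict = mean(data, nan_as_null=False)   # NaN is kept, so the mean is NaN
lenient = mean(data, nan_as_null=True)   # NaN is dropped, mean of 1.0 and 3.0
```

The two calls give different answers for the same column, which is the kind of cross-backend inconsistency the issue describes.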

bug: pandas backend error on `case()` + grouped aggregation

4

### `case()` + ungrouped aggregation

This works OK.

```
In [1]: import pandas as pd
In [2]: import ibis
In [3]: backend = ibis.backends.pandas.Backend()
In [4]: conn = backend.connect({})
In...
```

timothydijamco

bug

backends - pandas

community

bug: different treatment of int columns in Pandas vs. Pyspark backend

1

If I create an ibis TableExpr that I mutate with an `IntegerColumn` `(dtype=int8)`, the dtype of the resulting materialized pandas DataFrame is different between the Pandas backend and the PySpark...

emilyreff7

bug

backends - pandas

backends - pyspark

needs test

bug(api): windows do not accept expressions for bounds

Right now the following lines of code do not work:

```python
import ibis

ibis.range_window(following=(ibis.interval(seconds=1), None))
ibis.window(following=(ibis.literal(1), None))
```

with this traceback:

```
Traceback (most recent call last):
  File "/home/cloud/src/ibis/timeseries.py", line...
```

cpcloud

bug

docs(perf): figure out how to stop re-rendering all jupyter notebooks during mkdocs serve

mkdocs renders all jupyter notebooks during `mkdocs serve`, slowing documentation update/checking to a snail's pace. It's tedious. Is there some way to speed it up, have them render concurrently, or...

p-a-a-a-trick

docs

ux

performance

docs: mismatch between backend capabilities and docs

3

Speaking mainly about SQLite (but I'd assume this would be true for other backends), there are quite a few methods listed as supported by a specific backend (with an `Inherited`...

drabastomek

docs

ux

docs: move to Diataxis framework for documentation

2

Based on the discussion at https://github.com/ibis-project/ibis/discussions/3579, we should move our docs to the Diataxis setup.

cpcloud

docs

docs: api documentation for rules

1

I think we could add auto-generated docs for the standard rules.

jreback

docs

expressions

feat(sql): make filter with window operation more convenient for SQL-based backends

3

For example, the following would fail in SQL, mysql, sqlite, postgres, impala, clickhouse, spark:

```
window = ibis.window(group_by=table.id)
table = table.filter(lambda t: t['id'].mean().over(window) > 3).sort_by(
    'id'
)
```

with the...

emilyreff7

feature

ux

backends - sql
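
As a rough illustration of the window-filter issue above, the sketch below uses Python's standard `sqlite3` module rather than ibis. It assumes the failure comes from the usual SQL restriction that window functions cannot appear in a `WHERE` clause (the issue text is truncated, so that is an assumption), and it assumes an SQLite build with window-function support (3.25 or later). The table `t` and its columns are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (id INTEGER, v REAL)")
conn.executemany(
    "INSERT INTO t VALUES (?, ?)",
    [(1, 10.0), (1, 20.0), (5, 30.0), (5, 40.0)],
)

# Filtering directly on a window expression: expected to be rejected.
direct = (
    "SELECT id, v FROM t "
    "WHERE AVG(id) OVER (PARTITION BY id) > 3 ORDER BY id, v"
)
try:
    conn.execute(direct).fetchall()
    direct_failed = False
except sqlite3.Error:
    direct_failed = True

# Workaround: compute the window value in a subquery, then filter on it.
wrapped = """
SELECT id, v FROM (
    SELECT id, v, AVG(id) OVER (PARTITION BY id) AS mean_id FROM t
)
WHERE mean_id > 3
ORDER BY id, v
"""
rows = conn.execute(wrapped).fetchall()
```

Making the filter "more convenient" would presumably mean generating something like the wrapped form automatically, though the issue preview does not say which approach is intended.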
