ibis
ibis copied to clipboard
the portable Python dataframe library
Let's make a follow-up issue to give datatype some Rust `enum`-like conveniences like `is_boolean`, `is_integer`, `is_int64` etc. _Originally posted by @cpcloud in https://github.com/ibis-project/ibis/pull/4378#discussion_r953733703_
So we can directly locate backend on an operation node instead of using `Expr._find_backend()` method. _Originally posted by @cpcloud in https://github.com/ibis-project/ibis/pull/4378#discussion_r953729137_
Since it contains routines for compiling geospatial values.
_Originally posted by @cpcloud in https://github.com/ibis-project/ibis/pull/4378#discussion_r953728200_ I have marked the occurences using todo notes similar to: `TODO(kszucs): avoid expression roundtrip`
We really need to get rid of `ir.TopK` if that still exists (I realize you're using the `ops` variant here). _Originally posted by @cpcloud in https://github.com/ibis-project/ibis/pull/4378#discussion_r953774606_
@kszucs The first approach I took was to try to integrate it into the `value` rule, I couldn't figure out a way to do it. Rules are "eager" in their...
We should consider to extend the UDF operations to mark them idempotent, [see how spark handles deterministic VS non-deterministic UDFs](https://spark.apache.org/docs/3.1.3/api/python/reference/api/pyspark.sql.functions.udf.html). DataFusion uses the [volatility enum](https://github.com/apache/arrow-datafusion/blob/41b4e491663029f653e491b110d0b5e74d08a0b6/datafusion/expr/src/signature.rs#L25-L37) for this purpose. _Originally posted...
Right now we enforce the output of `expr.execute()` matches `expr.schema()` through `Schema.apply_to`. This is nice since it ensures the result types are the same across backends. However, right now `apply_to`...
The `4.x.x` branch contains breaking changes, so we should make a migration guide for both the downstream backend implementors and users. Though the latter may not be required since the...