Nicholas Gates
Nicholas Gates
Filter expressions in the scan convert nulls to false in `Mask::try_from(&dyn Array)`, the earlier we can remove nulls, the more performant these expressions are to compute instead of pointlessly respecting...
Invoke a callback with `n of m` splits completed, can be used by DuckDB, Python, etc.
- [ ] #2653 - [ ] VarBinView builder to have option to de-duplicate (replaces StringDictBuilder) - [ ] DictLayout to share a dictionary across chunks (probably requires an is_in...
This isn't a priority at the moment, but filing a tracking issue so users can upvote if it's important to them, as well as for us to remain aware of...
We should make it such that the stats table stores a mirror of the DType underneath each stats column. For example, `min: i32` for an `i32` array; or `min: {a:...
We still use flex buffers heavily as a hangover from the prototype. This is like a binary form of JSON, vs a schema'd format like flat buffer. Therefore our files...
We should write a test harness for each compute function. It should ask the encoding for arrays of various dtype, length, nullability etc. This code can be feature flagged behind...