Jay Chia
Jay Chia
**Is your feature request related to a problem? Please describe.** When retrieving nested columns in structs, we currently rely on the `Expression.struct.get(...)` accessor. However, for deeply nested structs this may...
Our DataType inference from arrow types happen in separate places right now for the Python path and Rust path We should consolidate this for more predictable behavior
**Is your feature request related to a problem? Please describe.** Windows functions: functions that are applied over **windows** of data. Here is a great illustration from DuckDB:  Valid expressions...
**Is your feature request related to a problem? Please describe.** **Pivot**: Converts rows into columns **Unpivot** : Converts columns into rows **Tasks** - [ ] Pivot with explicit values passed...
# Summary It is useful for convenience to have a `.fillna` function to fill all null/NaN values in a column ## Proposal ``` df["x"].float.fillnan(0.5) df["x"].fillnull(0.5) df["x"].fillna(0.5) ``` The above expressions...
This Epic tracks issues relating to better code modularization within Daft. - [x] #1131 - [x] #1132 - [x] #1173
This long-running issue records any bugs found with a roundtrip write + read from formats such as Parquet and CSV. Tests were added here: #1616 ### Parquet - [ ]...
Write a guide to enumerate key concepts around partitioning: ``` Increasing the number of partitions in your DataFrame has the following effects: 1. Increase the amount of parallelism available to...