Daft
Distributed DataFrame for Python designed for the cloud, powered by Rust
## Features

- [x] Basic reads
- [x] Partitioned reads
- [x] Basic writes
- [ ] Merge-on-read position delete reads
- [ ] Merge-on-read equality delete reads - blocked...
Adds parallelism in sinks + a few refactors:

- No more box clones
- Spawns tasks on a joinset so they are awaitable and can propagate errors.
**Is your feature request related to a problem? Please describe.** For large-scale anti-joins, we can speed it up by not performing an expensive repartition on both sides. Since the LHS...
Allow users to run their own two-stage reduce. Something like:

```
df.reduce(per_partition_reduce_fn, global_reduce_fn)
```

or, if they are the same:

```
df.reduce(both_reduce_fn)
```
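To make the request concrete, here is a minimal sketch of the two-stage semantics in plain Python. It does not use Daft: `two_stage_reduce` is a hypothetical helper, partitions are modelled as ordinary lists, and the argument names simply mirror the ones in the request above. The real `df.reduce` signature, if it is ever added, may look quite different.

```python
from functools import reduce
from typing import Callable, List, Optional, Sequence, TypeVar

T = TypeVar("T")


def two_stage_reduce(
    partitions: Sequence[Sequence[T]],
    per_partition_reduce_fn: Callable[[T, T], T],
    global_reduce_fn: Optional[Callable[[T, T], T]] = None,
) -> T:
    """Hypothetical helper (not a Daft API): reduce each partition, then reduce the partial results."""
    # Single-function form: reuse the per-partition function for the global stage.
    if global_reduce_fn is None:
        global_reduce_fn = per_partition_reduce_fn

    # Stage 1: reduce every non-empty partition independently.
    # In a distributed engine this stage could run in parallel, one task per partition.
    partials: List[T] = [
        reduce(per_partition_reduce_fn, part) for part in partitions if len(part) > 0
    ]
    if not partials:
        raise ValueError("cannot reduce an empty dataset")

    # Stage 2: combine the per-partition results into one value.
    return reduce(global_reduce_fn, partials)


# Example data: three partitions of integers (illustrative only).
partitions = [[1, 2, 3], [4, 5], [6]]

# Same function for both stages: a global sum.
total = two_stage_reduce(partitions, lambda a, b: a + b)

# Different functions per stage: max within each partition, then sum of those maxima.
sum_of_maxima = two_stage_reduce(partitions, max, lambda a, b: a + b)
```

The single-function form only gives the same answer as a flat reduce when the function is associative, such as sum or max. That may be one reason to let users pass the two stages separately.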
**Is your feature request related to a problem? Please describe.** ``` df = df.from_pydict({"foo": [1, 2, 3, 3, 3], "bar": ["a", "a", "b", "b", "b"]}) # should return a new...
**Is your feature request related to a problem? Please describe.** Sometimes I have some columns or other text that is in one casing, such as `camelCase` and I want to...
Bumps [adlfs](https://github.com/fsspec/adlfs) from 2023.10.0 to 2024.7.0. Release notes Sourced from adlfs's releases. 2024.7.0 What's Changed Fix account host by @dorbaker in fsspec/adlfs#480 Allow blobs and file systems to pickle by...
Bumps [docker/build-push-action](https://github.com/docker/build-push-action) from 5 to 6. Release notes Sourced from docker/build-push-action's releases. v6.0.0 Export build record and generate build summary by @crazy-max in docker/build-push-action#1120 Bump @docker/actions-toolkit from 0.24.0 to 0.26.0...