Daft icon indicating copy to clipboard operation
Daft copied to clipboard

Distributed DataFrame for Python designed for the cloud, powered by Rust

Results 272 Daft issues
Sort by recently updated
recently updated
newest added

## Features - [x] Basic reads - [x] Partitioned reads - [x] Basic writes - [ ] Merge-on-read position delete reads - [ ] Merge-on-read equality delete reads - blocked...

data-catalogs
iceberg

Adds parallelism in sinks + a few refactors: - No more box clones - Spawns tasks on a joinset so they are awaitable and can propagate errors.

chore

**Is your feature request related to a problem? Please describe.** For large-scale anti-joins, we can speed it up by not performing an expensive repartition on both sides. Since the LHS...

p0

Allow users to run their own two stage reduce. something like: ``` df.reduce(per_partition_reduce_fn, global_reduce_fn) ``` or if they are the same ``` df.reduce(both_reduce_fn) ```

**Is your feature request related to a problem? Please describe.** ``` df = df.from_pydict({"foo": [1, 2, 3, 3, 3], "bar": ["a", "a", "b", "b", "b"]}) # should return a new...

**Is your feature request related to a problem? Please describe.** Sometimes I have some columns or other text that is in one casing, such as `camelCase` and I want to...

Bumps [adlfs](https://github.com/fsspec/adlfs) from 2023.10.0 to 2024.7.0. Release notes Sourced from adlfs's releases. 2024.7.0 What's Changed Fix account host by @​dorbaker in fsspec/adlfs#480 Allow blobs and file systems to pickle by...

dependencies
python

Bumps [docker/build-push-action](https://github.com/docker/build-push-action) from 5 to 6. Release notes Sourced from docker/build-push-action's releases. v6.0.0 Export build record and generate build summary by @​crazy-max in docker/build-push-action#1120 Bump @​docker/actions-toolkit from 0.24.0 to 0.26.0...

dependencies
github_actions