Daft issues

[FEAT] Allow overwrite / overwrite partitions for write operations

1

Closes #1768 This is a POC for adding overwrite / overwrite partitions mode for our write methods. The idea is to collect all the file paths that were written across...

colin-ho

enhancement

Write a guide on partitioning

4

Write a guide to enumerate key concepts around partitioning: ``` Increasing the number of partitions in your DataFrame has the following effects: 1. Increase the amount of parallelism available to...

jaychia

documentation

data-catalogs

Account for task crashes during writes on append mode

**Describe the bug** If a task crashes during a write on append mode, it will restart and write all the files again, leaving behind dirty files. **To Reproduce** Steps to...

colin-ho

Fix df.count() behavior to perform count_rows instead

**Is your feature request related to a problem? Please describe.** When users run `df.count()`, they often expect `df.count_rows()` behavior. Instead, `df.count()` will perform a count aggregation on every column, which...

jaychia

p1

performance

tech-debt

p3

Daft
Daft copied to clipboard

Metadata

[FEAT] Allow overwrite / overwrite partitions for write operations

Write a guide on partitioning

Account for task crashes during writes on append mode

Fix df.count() behavior to perform count_rows instead

[FEAT] User-defined global expressions

[FEAT] Additional global expressions

[FEAT] Group by list columns

Column Level Static Typing

Support selector expressions

[PERF] [Ray] Make reduce task inputs top-level task arguments

← Metadata

Owner

Metadata

Daft Daft copied to clipboard

Metadata

← Metadata

Owner

Metadata

Daft
Daft copied to clipboard