Daft
Distributed DataFrame for Python designed for the cloud, powered by Rust
`.str.ilike(patterns : str | list[str])` Match patterns against self, case-insensitive. This function is modeled after SQL’s ILIKE directive. Use % as a multiple-character wildcard or _ as a single-character wildcard....
`.str.like(patterns : str | list[str])` Match patterns against self, case-sensitive. This function is modeled after the SQL LIKE directive. Use % as a multiple-character wildcard or _ as a single-character...
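The `%` and `_` wildcard rules shared by `.str.like` and `.str.ilike` can be illustrated with a small pure-Python sketch. This is not Daft's implementation; the helper names `like_to_regex` and `sql_like` are hypothetical, and the sketch assumes no escape character is supported. It translates a LIKE pattern into a regular expression and requires the whole string to match, as SQL does.

```python
import re


def like_to_regex(pattern: str) -> str:
    """Translate a SQL LIKE pattern into an equivalent regex string (hypothetical helper)."""
    parts = []
    for ch in pattern:
        if ch == "%":
            parts.append(".*")  # % matches any run of characters, including none
        elif ch == "_":
            parts.append(".")  # _ matches exactly one character
        else:
            parts.append(re.escape(ch))  # everything else is matched literally
    return "".join(parts)


def sql_like(value: str, pattern: str, case_insensitive: bool = False) -> bool:
    """Return True if the whole of `value` matches `pattern` (LIKE, or ILIKE when case_insensitive)."""
    flags = re.DOTALL | (re.IGNORECASE if case_insensitive else 0)
    return re.fullmatch(like_to_regex(pattern), value, flags) is not None


if __name__ == "__main__":
    print(sql_like("Daft", "d%"))                         # case-sensitive, LIKE-style
    print(sql_like("Daft", "d%", case_insensitive=True))  # case-insensitive, ILIKE-style
```

Note that the pattern must cover the entire string: `d_t` does not match `daft`, because `_` consumes only one character.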
**Is your feature request related to a problem? Please describe.** When writing data, Daft currently performs an append by default. We should additionally provide options to: 1. Overwrite the entire...
The current `TreeDisplay` + `multiline_display` visualization protocol for logical/physical plans does not provide enough display flexibility for logical/physical ops with nested structures and/or long descriptions. We should refactor this to...
Adding a builder abstraction for `PhysicalPlan`, similar to our `LogicalPlanBuilder` for `LogicalPlan`, has a few advantages: - logic for adding a particular op to the `PhysicalPlan` can be isolated to...
We've recently added a good bit more logic to our logical -> physical plan translation, since that's where we currently house our "pick the best implementation" optimization rules (such as...
Our sort-merge join currently only supports join keys with primitive dtypes - the `Boundaries` abstraction that we use to align joinable partitions converts upper/lower bounds to Python and does bound-comparisons...
`log(base=None)` Computes the log of the expression with an optional base, defaulting to `e`.
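The "optional base, default `e`" behavior can be sketched on plain Python floats with the standard `math` module. This is only an illustration of the semantics, not Daft's expression API; `log_with_base` is a hypothetical name.

```python
import math
from typing import Optional


def log_with_base(x: float, base: Optional[float] = None) -> float:
    """Logarithm of x in the given base; natural log when base is None (hypothetical helper)."""
    if base is None:
        return math.log(x)  # natural logarithm, base e
    return math.log(x, base)  # computed as ln(x) / ln(base)


if __name__ == "__main__":
    print(log_with_base(math.e))  # natural log
    print(log_with_base(8, 2))    # log base 2
    print(log_with_base(100, 10)) # log base 10
```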
Currently, when we read a Parquet file, we perform 2 redundant HEAD requests to grab the size of the file before grabbing the metadata. (We may also be grabbing the...
**Is your feature request related to a problem? Please describe.** No, this feature allows us to write data using Daft to our internal catalog. This feature is similar to what...