Cory Grinstead
Cory Grinstead
**Is your feature request related to a problem? Please describe.** conditional logic in daft is hard to write and read ```py # %% import daft # %% data = {...
i want to read directly from huggingface datasets. ```py df = daft.read_parquet('hf://datasets/ibm/duorc/ParaphraseRC/*.parquet'); ```
from the documentation, it is a little unclear as the preferred way to set up a distributed daft cluster. There is brief mention [here](https://www.getdaft.io/projects/docs/en/latest/user_guide/poweruser/scaling-up.html), but a full end to end...
**Is your feature request related to a problem? Please describe.** Sometimes I have some columns or other text that is in one casing, such as `camelCase` and I want to...
**Is your feature request related to a problem? Please describe.** I want to write to a single parquet file **Describe the solution you'd like** ```py daft.read_parquet("./my_file.parquet").write_parquet('my_file_new.parquet') ``` Currently this writes...
**Is your feature request related to a problem? Please describe.** Currently there is an optimization inlined inside [translate_single_logical_node](https://github.com/universalmind303/Daft/blob/158291c66e03be9b2252a428f826b6e78c2fb30a/src/daft-plan/src/physical_planner/translate.rs#L52). I think it'd make the code a bit easier to reason about...
**Is your feature request related to a problem? Please describe.** A lot of sources support reading a range instead of just a limit. **Describe the solution you'd like** Modify the...
**Is your feature request related to a problem? Please describe.** If a plan contains nested unions/concats, we can instead flatten those to a single operation Example: ```py df.concat(df.concat(df.concat(df)).explain(True) ``` which...
**Is your feature request related to a problem? Please describe.** Most other rule based execution engines have some form of expression simplification. Some common optimizations - inline constant expressions such...