Daft
Daft copied to clipboard
Distributed DataFrame for Python designed for the cloud, powered by Rust
### Describe the bug The column ordering of the output after a groupby and selection on keys seems to be not consistent with what the user specifies ### To Reproduce...
### Describe the bug ```py import daft df = daft.from_pydict({"a": [0, 2, 3, 11, 12]}) daft.sql('select * from df where (a % 2 == 0) and a < 10').explain() ==...
**Is your feature request related to a problem? Please describe.** most sql dialects support selecting expressions without a `from` ```sql > select 1; > select 'hello'; > select (1,2,3); ```
### Describe the bug ### To Reproduce ``` import daft df = daft.from_pydict({"list": [[1,2,3], [4,5], [6]], "x": [1, 2, 3]}) daft.sql("SELECT x, explode(list) FROM df").collect() ``` ### Expected behavior _No...
Solves #2769 - Added `length` function to Expression.list - Added deprecation warning to `Expression.list.lengths`
**Is your feature request related to a problem? Please describe.** [Apache Beam](https://beam.apache.org/about/) is an open-source, unified programming model for batch and streaming data processing pipelines that simplifies large-scale data processing...
Given the embedding udf below, the model re-loads and is reinitializing after each write and completed partition / parquet file: ```python import daft import torch import numpy as np from...
I am trying to read a table stored in Unity Catalog (external data access enabled) in Databricks and I am getting "OSError: Generic S3 error: Received redirect without LOCATION, this...
Depends on https://github.com/Eventual-Inc/Daft/pull/2860