Daft icon indicating copy to clipboard operation
Daft copied to clipboard

[FEAT] Include file paths as column from read_parquet/csv/json

Open colin-ho opened this issue 4 months ago • 4 comments

Addresses: https://github.com/Eventual-Inc/Daft/issues/2808

This PR enables adding file path as a column from file reads via the file_path_column: str | None parameter. This works by appending a column of the file path to the Table post read + pushdowns.

Having it as a string makes it easy to have unique field name guarantees, i.e. if the user specifies a column name that already exists then an error is thrown.

colin-ho avatar Sep 26 '24 22:09 colin-ho