explorer icon indicating copy to clipboard operation
explorer copied to clipboard

Add delta lake file support

Open the-destro opened this issue 2 years ago • 4 comments

Python Polars has support for delta lake files already though I can't find the function exposed in the rust package.

the-destro avatar Dec 01 '23 18:12 the-destro

Can you please link to the Python version of the function?

josevalim avatar Dec 01 '23 20:12 josevalim

@the-destro https://pola-rs.github.io/polars/py-polars/html/reference/io.html#delta-lake Is scan_delta, read_delta and DataFrame.write_delta() these 3 functions ?

https://github.com/pola-rs/polars/blob/40d3e0818408d836abf6c31146a3f69fd628f0fb/py-polars/polars/io/delta.py#L295

Make sure to install deltalake>=0.8.0. Read the documentation here <https://delta-io.github.io/delta-rs/python/installation.html>_.

The rust package repository is https://github.com/delta-io/delta-rs

watsy0007 avatar Dec 24 '23 09:12 watsy0007

Btw, isn't delta-lake storage pretty much Parquet files? Could you access them directly instead? Writing them would be a bit more complicated though.

josevalim avatar Apr 16 '24 07:04 josevalim

Btw, isn't delta-lake storage pretty much Parquet files? Could you access them directly instead? Writing them would be a bit more complicated though.

Yes, that's correct. In our company, we integrate DuckDB, dbt, and Delta Lake with Python for business operations. I'm currently considering replacing some of these components with Elixir.

watsy0007 avatar Apr 16 '24 10:04 watsy0007