iceberg-rust icon indicating copy to clipboard operation
iceberg-rust copied to clipboard

Table Maintenance functions

Open ForeverAngry opened this issue 1 year ago • 5 comments

Being able to use datafusion for Snapshot Expiration, Manifest Re-Writes, and Compaction. I'd be happy to lend a hand!

ForeverAngry avatar Dec 20 '24 14:12 ForeverAngry

Hey, Sorry for the late reply. These are definitely on the agenda and it would be cool to have the capabilities. I'm currently working on some other stuff so that I don't have the time to implement them in the near future. But I would gladly try to help. Which topic would you be most interested in?

JanKaul avatar Jan 10 '25 11:01 JanKaul

So Expire Snapshot and Compaction are the big ones on my mind. I started an ExpireSnapshot module based on this repo, i think it works - however, i wasn't completely sure if i followed the iceberg concurrency model or where you would want this module to be located in the project. If I submit a PR, would you help me get it all sorted out? I know your busy, so i dont want to bog you down with trivial things :). let me know what you think!

ForeverAngry avatar Jan 11 '25 03:01 ForeverAngry

That's awesome! Sure I'll gladly help to sort it out. I have to read up a bit about how this is supposed to work.

I'm currently a bit busy with implementing and preparing my talk for the Datafusion Meetup on the 23rd of January.

JanKaul avatar Jan 11 '25 07:01 JanKaul

That's awesome! Sure I'll gladly help to sort it out. I have to read up a bit about how this is supposed to work.

I'm currently a bit busy with implementing and preparing my talk for the Datafusion Meetup on the 23rd of January.

I was going to ask, if your going to stream that event - i cant seem to find a link!

ForeverAngry avatar Jan 12 '25 01:01 ForeverAngry

I'm not sure. There will definitely be recordings afterwards.

JanKaul avatar Jan 12 '25 05:01 JanKaul