lakeFS
lakeFS copied to clipboard
Research: use Parquet format as the input for GC
trafficstars
Garbage collection in lakeFS is essentially an anti-join between the lists of "expired" and "active" addresses. Check whether performing this operation on Parquet files improves performance, and map the risk and considerations.