lakeFS
Use bulk-delete API for recursive delete in lakeFSFS
Measurements indicate that this can speed up the cleanupJob operation on FileOutputCommitter (which uses recursive delete) by 7x.
Implementation
Bulk-delete makes it really hard to report errors accurately. Instead, we shall report the errors from the first batch that fails and stop. S3AFileSystem appears to do something similar.
Additionally, we should add a config flag to turn off bulk deletion, for when it really starts confusing the user! 👻
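A rough sketch of the proposed behaviour, to make the error handling concrete. It is not the actual lakeFSFS code: `BulkDeleter`, `BulkDeleteException`, `deleteAll`, and the `bulkEnabled` parameter (standing in for the config flag) are all hypothetical names, and the batch size is supplied by the caller rather than taken from any real API limit.

```java
import java.util.ArrayList;
import java.util.List;

public class BulkDeleteSketch {
    /** Hypothetical stand-in for a bulk-delete client call: returns one error message per path that failed. */
    public interface BulkDeleter {
        List<String> deleteBatch(List<String> paths);
    }

    /** Carries the errors of the first failing batch only. */
    public static class BulkDeleteException extends RuntimeException {
        private final List<String> errors;

        public BulkDeleteException(List<String> errors) {
            super("bulk delete failed for " + errors.size() + " object(s)");
            this.errors = new ArrayList<>(errors);
        }

        public List<String> getErrors() {
            return errors;
        }
    }

    /**
     * Deletes paths in batches and stops at the first batch that reports errors.
     * When bulkEnabled is false (the assumed config flag), paths are sent one at a time.
     * Returns the number of paths deleted.
     */
    public static int deleteAll(List<String> paths, BulkDeleter deleter, int batchSize, boolean bulkEnabled) {
        int effectiveSize = bulkEnabled ? batchSize : 1;
        int deleted = 0;
        for (int start = 0; start < paths.size(); start += effectiveSize) {
            int end = Math.min(start + effectiveSize, paths.size());
            List<String> batch = paths.subList(start, end);
            List<String> errors = deleter.deleteBatch(batch);
            if (!errors.isEmpty()) {
                // Report only this batch's errors; later batches are never attempted.
                throw new BulkDeleteException(errors);
            }
            deleted += batch.size();
        }
        return deleted;
    }
}
```

Stopping at the first failing batch keeps the report small and unambiguous, at the cost of leaving the remaining objects undeleted. Turning the flag off falls back to one request per object, which is slower but makes each error attributable to a single path.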