alluxio icon indicating copy to clipboard operation
alluxio copied to clipboard

Speed ​​up rocksdb checkpoint

Open adol001 opened this issue 2 years ago • 3 comments

Is your feature request related to a problem? Please describe. We now store 10 billion alluxio inode metadata using rocksdb. The current alluxio checkpoint will tar.gz rocksdb data file by single thread, alluxio restart or re-election leader will untar.gz by single thread. This speed is not fast.

Describe the solution you'd like To divide the rocksdb data into parts and tar.gz each part separately, so that rocksdb write to checkpoint or restore from checkpoint can speed up the speed

adol001 avatar Jun 06 '22 12:06 adol001

In addition to checkpoint, backup is also single-threaded gz. It should also be optimized too.

adol001 avatar Jun 13 '22 11:06 adol001

@jenoudet FYI as well.

yuzhu avatar Jun 13 '22 16:06 yuzhu

Refine checkpoint by Parallel zip compression and decompression https://github.com/Alluxio/alluxio/pull/16151

The current implementation of backup, instead of executing backup, it is better to reload it

adol001 avatar Sep 06 '22 06:09 adol001

This issue has been automatically marked as stale because it has not had recent activity. It will be closed in two weeks if no further activity occurs. Thank you for your contributions.

github-actions[bot] avatar Jan 31 '23 15:01 github-actions[bot]