
OAK-9790 - Implement parallel indexing for speeding up oak run indexing command

Open Ewocker opened this issue 2 years ago • 1 comments


Indexing is currently single-threaded, which is slow for large repositories. To improve indexing performance, we need to implement parallel indexing.

The work covers both Lucene and Elastic indexing. To support parallel indexing, the large flat file store file has to be split ahead of time. The split adds significant overhead, but it makes parallel indexing possible and much faster.
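As a rough illustration of the split-then-index idea (not the code in the PR), the sketch below cuts a list of entries into contiguous chunks and processes each chunk on its own thread. The class `ParallelIndexSketch`, its methods, and the `path|json` entry format are all hypothetical, and `indexChunk` is only a stand-in for real Lucene or Elastic indexing work.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical sketch: none of these names come from Oak itself.
public class ParallelIndexSketch {

    // Split the entries into at most `parts` contiguous chunks of near-equal size.
    // This plays the role of splitting the flat file store ahead of indexing.
    static List<List<String>> split(List<String> entries, int parts) {
        List<List<String>> chunks = new ArrayList<>();
        int chunkSize = Math.max(1, (entries.size() + parts - 1) / parts);
        for (int start = 0; start < entries.size(); start += chunkSize) {
            int end = Math.min(entries.size(), start + chunkSize);
            chunks.add(new ArrayList<>(entries.subList(start, end)));
        }
        return chunks;
    }

    // Placeholder for the real indexing work: counts the non-empty entries.
    static int indexChunk(List<String> chunk) {
        int indexed = 0;
        for (String entry : chunk) {
            if (!entry.isEmpty()) {
                indexed++;
            }
        }
        return indexed;
    }

    // Index every chunk on its own worker thread and sum the per-chunk counts.
    static int indexInParallel(List<String> entries, int threads) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<Future<Integer>> results = new ArrayList<>();
            for (List<String> chunk : split(entries, threads)) {
                results.add(pool.submit(() -> indexChunk(chunk)));
            }
            int total = 0;
            for (Future<Integer> result : results) {
                total += result.get(); // blocks until that chunk is done
            }
            return total;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        // Assumed entry format for illustration only.
        List<String> entries = List.of("/a|{}", "/a/b|{}", "/c|{}", "/c/d|{}", "/e|{}");
        System.out.println(indexInParallel(entries, 2));
    }
}
```

The split is done up front so that each worker reads only its own chunk and the workers do not contend for a single file; that up-front pass is where the extra overhead mentioned above comes from.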

This change also adds support for LZ4 compression, which is much faster than gzip.

Ewocker avatar Jun 03 '22 18:06 Ewocker

New PR which incorporates the review comments https://github.com/apache/jackrabbit-oak/pull/715

amit-jain avatar Sep 21 '22 10:09 amit-jain

superseded by PR https://github.com/apache/jackrabbit-oak/pull/715

amit-jain avatar Oct 27 '22 04:10 amit-jain