qlever-control icon indicating copy to clipboard operation
qlever-control copied to clipboard

Optimization of QLever File Configuration for High-Speed Index Creation

Open arcangelo7 opened this issue 9 months ago • 2 comments

Hello QLever Team,

I am currently working on configuring QLever for a project that involves indexing a substantial dataset with nearly 5 billion triples. The objective is to maximize the speed of the indexing process on a high-performance server with ~1 TB RAM.

Given the high RAM capacity, I am seeking advice on the optimal combination of parameters to significantly speed up the index creation process. Specifically, I would like guidance on the following:

  • Number of Triples per Batch: What would be the ideal setting considering the server's high RAM capacity?
  • STXXL Memory: How can we best utilize the available 1 TB of RAM?
  • Any Additional Parameters: Are there other settings or parameters that can be adjusted to further enhance the indexing speed?

The speed of the indexing process is a crucial factor for us. Any insights or recommendations on how to best leverage our server's capabilities would be greatly appreciated.

Thank you in advance for your support!

arcangelo7 avatar May 21 '24 14:05 arcangelo7