root icon indicating copy to clipboard operation
root copied to clipboard

[ntuple] Improve RDF MT scheduling

Open jblomer opened this issue 7 months ago • 1 comments

Implement two improvements to the RNTuple RDF scheduler for the multi-threaded case:

  1. Files of a chain are opened in batches the background: while a batch of files (batch size == number of slots) is processed, the next batch is opened.
  2. For the tail scheduling (multiple slots sharing the same files), the clones of the page source pointing to the same file do not need to re-read meta-data (anchor, header, footer) again.

The patch results in a speed improvement of about 10% for AGC, EOS cached, 32 cores.

jblomer avatar Jun 28 '24 09:06 jblomer