BlackLab icon indicating copy to clipboard operation
BlackLab copied to clipboard

Make it possible to index a large file with many documents using multiple threads

Open jan-niestadt opened this issue 2 years ago • 0 comments

Right now, multithreading works per-file. This means that a single large file that contains many documents cannot use more than one CPU core.

We could try reading a few input documents into memory in a single thread, then process those documents in multiple threads. This would allow better CPU usage in this case.

jan-niestadt avatar Aug 19 '22 10:08 jan-niestadt