fairseq2 icon indicating copy to clipboard operation
fairseq2 copied to clipboard

Perf drop with OMP_NUM_THREAD/MKL_NUM_THREADS

Open cbalioglu opened this issue 8 months ago • 0 comments

Thanks to @mavlyutovr's investigation we found out that the throughput of our TBB-based data pipeline API can drop significantly if OMP_NUM_THREADS and/or MKL_NUM_THREADS are not bound to a low thread count. We should investigate if there is anyway for us to mitigate this issue on our end. If not, we should document this behavior and advise users to be cautious about it.

cbalioglu avatar Oct 12 '23 13:10 cbalioglu