OpenBLAS icon indicating copy to clipboard operation
OpenBLAS copied to clipboard

Add Thread Throttling Optimization for Power10 in GEMV

Open pratiklp00 opened this issue 2 months ago • 2 comments

This PR adds the thread thresholding for Power10 by introducing get_gemv_optimal_nthreads_power10 function.

pratiklp00 avatar Oct 16 '25 03:10 pratiklp00

Hi @pratiklp00

Can you show performance improvement values /graphs for [s/d]gemv?

abhishek-iitmadras avatar Oct 16 '25 09:10 abhishek-iitmadras

Hi @abhishek-iitmadras I will share it.

pratiklp00 avatar Oct 21 '25 05:10 pratiklp00