Pierre K.

4 comments by Pierre K.

PLUQ with `double` looks depressingly slow on AVX512 machines compared to AVX2 (`ftrsm` may be to blame, and @ClementPernet suggested it might be an issue with the backend BLAS?) -...

A small comment about how things go with PR #250 on hpac: Before: ``` karpman@hpac>./benchmark-fgemm -p 0 -i 7 Time: 0.895271 Gfops: 17.8717 -q 131071 -m 2000 -k 2000 -n...

After a quick check, the slowness on AVX-512 indeed seems to be caused by the backend BLAS. Switching to OpenBLAS, I get: ``` benchmark-pluq Time: 0.222758 Gfops: 23.9423 -s N...
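For readers comparing the figures quoted in these comments, here is a minimal sketch of how a `Gfops` value relates to the reported `Time`. It is an illustration, not the benchmarks' actual code. It assumes the usual operation counts of 2·m·k·n for a matrix product and (2/3)·n³ for an LU-type factorization, and it assumes n = 2000 for the dimensions that are cut off in the excerpts above. The helper names `gemm_ops`, `lu_ops`, and `gfops` are hypothetical.

```python
# Sketch: relate the "Time" and "Gfops" fields of the benchmark output.
# Assumptions (not confirmed by the excerpts): operation counts of
# 2*m*k*n for a matrix product and (2/3)*n^3 for an LU-type factorization,
# and n = 2000 for the dimensions truncated in the quoted output.


def gemm_ops(m: int, k: int, n: int) -> float:
    """Assumed operation count of an m x k by k x n matrix product."""
    return 2.0 * m * k * n


def lu_ops(n: int) -> float:
    """Assumed operation count of an n x n LU-type factorization."""
    return 2.0 * n**3 / 3.0


def gfops(op_count: float, seconds: float) -> float:
    """Rate in units of 10^9 field operations per second."""
    return op_count / seconds / 1e9


# Times taken from the quoted output; dimensions partly assumed (see above).
fgemm_rate = gfops(gemm_ops(2000, 2000, 2000), 0.895271)
pluq_rate = gfops(lu_ops(2000), 0.222758)

print(f"fgemm: {fgemm_rate:.4f} Gfops")
print(f"pluq:  {pluq_rate:.4f} Gfops")
```

Under these assumptions both rates come out within rounding of the `Gfops` values quoted above, which suggests the benchmarks report operation count divided by wall time.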

This seems to have been fixed, although I cannot find where.