filikat

Results 4 comments of filikat

I know that there are redundant copies that could be avoided. I expect this to have a small impact on most problems, so I put it quite low on my...

These issues appear to be related to multi-threaded OpenBLAS. HiPO uses OpenBLAS only in single-threaded mode. I would expect OpenBLAS to outperform the BLAS reference implementation by orders of magnitude...

Yes, we call openblas_set_num_threads(1). We parallelise some of the large matrix-matrix productst by hand and there is other parallel stuff happening in HiPO. Using more than one thread for OpenBLAS...