filikat
I know that there are redundant copies that could be avoided. I expect this to have a small impact on most problems, so I put it quite low on my...
Closed by #2706
These issues appear to be related to multi-threaded OpenBLAS. HiPO uses OpenBLAS only in single-threaded mode. I would expect OpenBLAS to outperform the BLAS reference implementation by orders of magnitude...
Yes, we call openblas_set_num_threads(1). We parallelise some of the large matrix-matrix products by hand and there is other parallel stuff happening in HiPO. Using more than one thread for OpenBLAS...
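For readers unfamiliar with the pattern, here is a rough sketch of what "parallelising a matrix-matrix product by hand" on top of a single-threaded kernel can look like. It is not HiPO's actual code: the `Matrix` type, `multiplyColumnBlock`, `parallelMultiply` and the use of `std::thread` are all assumptions made for illustration, and the naive inner kernel stands in for a single-threaded BLAS `dgemm` call.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <thread>
#include <vector>

// Hypothetical column-major dense matrix (the layout BLAS expects).
struct Matrix {
  std::size_t rows, cols;
  std::vector<double> data;
  Matrix(std::size_t r, std::size_t c) : rows(r), cols(c), data(r * c, 0.0) {}
  double& operator()(std::size_t i, std::size_t j) { return data[i + j * rows]; }
  double operator()(std::size_t i, std::size_t j) const { return data[i + j * rows]; }
};

// Single-threaded kernel: C(:, j0:j1) += A * B(:, j0:j1).
// Assumption: in a real solver this would be a dgemm call, with the BLAS
// library restricted to one thread (e.g. openblas_set_num_threads(1)).
void multiplyColumnBlock(const Matrix& A, const Matrix& B, Matrix& C,
                         std::size_t j0, std::size_t j1) {
  for (std::size_t j = j0; j < j1; ++j)
    for (std::size_t k = 0; k < A.cols; ++k) {
      const double b = B(k, j);
      for (std::size_t i = 0; i < A.rows; ++i) C(i, j) += A(i, k) * b;
    }
}

// Hand-rolled parallel product C = A * B: each thread owns a disjoint block
// of columns of C, so no locking is needed.
void parallelMultiply(const Matrix& A, const Matrix& B, Matrix& C,
                      unsigned num_threads) {
  assert(A.cols == B.rows && C.rows == A.rows && C.cols == B.cols);
  std::fill(C.data.begin(), C.data.end(), 0.0);
  num_threads = std::max(1u, num_threads);
  const std::size_t block = (C.cols + num_threads - 1) / num_threads;
  std::vector<std::thread> workers;
  for (std::size_t j0 = 0; j0 < C.cols; j0 += block) {
    const std::size_t j1 = std::min(C.cols, j0 + block);
    workers.emplace_back(
        [&A, &B, &C, j0, j1] { multiplyColumnBlock(A, B, C, j0, j1); });
  }
  for (auto& w : workers) w.join();
}
```

If the BLAS library also spawned its own threads inside each block, the two levels of parallelism would compete for the same cores, which is one plausible reason to keep the library single-threaded in a setup like this.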