OpenBLAS
OpenBLAS copied to clipboard
how to use parallel potrf
I am currently using the develop version of OpenBLAS which was built with simpe make , and I'm trying to utilize the parallel version of the POTRF function. However, I am encountering issues with parallel execution.
Here is the core code.
#include "cblas.h"
#include "lapacke.h"
// ...
info = LAPACKE_spotrf(LAPACK_ROW_MAJOR, 'L', n, A, n);
matrix sizes ranging from 36000 x 36000 to 60000 x 60000 and runs on a 256 core tsv110 architecture machine. I have set OPENBLAS_NUM_THREADS=20, but when I monitor the process with the top command, I observe that only one core is being utilized.