OpenBLAS
OpenBLAS is an optimized BLAS library based on the GotoBLAS2 1.13 BSD version.
The documentation of the multithreading and thread-safety capabilities is currently incomplete. Case in point: I want to build a Fortran library that makes some BLAS/LAPACK calls, then use this library in a...
In my testing on a 4-core, two-way hyperthreaded Xeon-W Skylake machine, I've found that the following environment variable settings produce consistently high performance: `OMP_NUM_THREADS=4`, `OMP_PLACES="{0,1,2,3}"`, `OMP_PROC_BIND=spread`. This...
It seems that OpenBLAS just dies when pthread_create fails. It should instead continue execution with the threads already created, or at least fall back to single-threaded mode....
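The fallback requested in this report can be sketched in a few lines of C. This is only an illustration of the idea, not OpenBLAS's actual initialization code: `start_workers`, `try_create`, `worker_main`, `MAX_WORKERS`, and the `fail_from` fault-injection knob are all hypothetical names introduced here, and the workers exit immediately rather than waiting for jobs.

```c
#include <errno.h>
#include <pthread.h>
#include <stdio.h>
#include <string.h>

#define MAX_WORKERS 64 /* hypothetical cap, chosen only for this sketch */

/* Hypothetical fault-injection knob: creation attempts with an index at or
 * beyond this value fail with EAGAIN. A value of -1 disables the injection. */
static int fail_from = -1;

/* Placeholder worker body; a real worker would wait for jobs. */
static void *worker_main(void *arg) {
    (void)arg;
    return NULL;
}

/* Wraps pthread_create so that a failure can be simulated in a test. */
static int try_create(pthread_t *thread, int index) {
    if (fail_from >= 0 && index >= fail_from) {
        return EAGAIN;
    }
    return pthread_create(thread, NULL, worker_main, NULL);
}

/* Tries to start requested - 1 workers next to the calling thread.
 * On the first failure it stops creating threads and keeps the ones it
 * already has. Returns the usable thread count including the caller,
 * so the result is always at least 1 (single-threaded mode). */
static int start_workers(int requested) {
    pthread_t threads[MAX_WORKERS];
    int created = 0;

    if (requested < 1) {
        requested = 1;
    }
    if (requested > MAX_WORKERS + 1) {
        requested = MAX_WORKERS + 1;
    }

    for (int i = 0; i < requested - 1; i++) {
        int rc = try_create(&threads[created], i);
        if (rc != 0) {
            fprintf(stderr, "thread %d not created (%s); continuing with %d\n",
                    i + 1, strerror(rc), created + 1);
            break;
        }
        created++;
    }

    /* The sketch's workers return at once, so join them before leaving. */
    for (int i = 0; i < created; i++) {
        pthread_join(threads[i], NULL);
    }
    return created + 1;
}
```

Calling `start_workers(4)` with `fail_from` set to 2 would report three usable threads instead of aborting, and with `fail_from` set to 0 it would degrade to one. Depending on the toolchain, the sketch may need to be compiled with `-pthread`.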
I'm working on optimizing the inner GEMM kernels for RISC-V. I'm confused about the way the arrays are arranged once S/DGEMMKERNEL is called. The array ba[] and bb[] arguments seem...
Hello, when I compile with `make BINARY=64 TARGET=HASWELL DYNAMIC_ARCH=1 HOSTCC=g++ NUM_THREADS=64 CC=gcc FC=x86_64-w64-mingw32-gfortran USE_THREAD=1 USE_OPENMP=1 OPENBLAS_COMPLEX_STRUCT=1 -j8` everything is fine, but when I compile without `DYNAMIC_ARCH=1` the library compiles but during...
Hi all, I have a small test case where I call `cblas_dgemm()` using OpenBLAS, and when I use a matrix of size N x N with N >= 70, I have...
For the development releases of OpenBLAS 0.2.20, the designation `0.2.20.dev` was chosen. The problem with this numbering is that pkg-config recognizes this version as being at least as recent as...
OpenBLAS-0.2.18 coredump: Program terminated with signal 11, Segmentation fault #0 0x0000003161a06fb6 in pthread_create@@GLIBC_2.2.5 () from /lib64/libpthread.so.0 #1 0x00007eff2bb6825f in blas_thread_init () from ../lib/libopenblas.so.0 #2 0x00007eff2bb675d7 in gotoblas_init () from ../lib/libopenblas.so.0...
I built 0.2.19 and 0.2.20 with `make USE_THREAD=0 USE_OPENMP=0` on mingw64 under MSYS2, and the result does not work. Compiling with a plain `make` works fine on Windows.
Maybe somebody would like to improve the performance of omatcopy(), so here's some info. I forgot to mention this for more than a year.... One may write a code...