OpenBLAS
OpenBLAS copied to clipboard
OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.
When we run lapack-test on Power7, we found some Eigen test failed. This PR tries to turn back to generic kernel for GEMV and GEMM, etc. Then all test passed...
I am attempting to use a simple python script: ```python3 #!/usr/bin/env python3 import numpy ``` This fails due to: ``` OpenBLAS blas_thread_init: pthread_create failed for thread 21 of 128: Resource...
After upgrading Scilab OpenBLAS build, we detected a poor precision result impacting `dgemm` when using AVX2 kernels. With OpenBLAS 0.3.27: ``` $ OPENBLAS_CORETYPE=NEHALEM scilab/test_dgemm [1, 0.666667] * [-6 0 ;...
Hello developers! I found that functions `dsyev` and `dsyevx` seems not fully paralleled, when - compiled by gcc (11.4/12.3) - AMD (16 cores @ Ryzen 7945HX of laptop / 2...
fixes #4580 and fixes #528
For the last 4 days I've been trying to compile the OpenBLAS library on WIndows on Arm (for Windows on Arm) using [Build OpenBLAS static library with BLAS and LAPACK...
Hello. In the previous pull request #4381, the P and Q parameter of [SD]GEMM were increased to make better use of the L2 cache of Neoverse V1, but the complex...
Dear OpenBLAS team, Just curious how OpenBLAS on ARM will look like when compared to the ARM official performance library here, on ARM CPUs: https://developer.arm.com/Tools%20and%20Software/Arm%20Performance%20Libraries. It is only for ARM...
No need for unofficial Apple AMX intruction set on M4.. 2tflops possible..