OpenBLAS icon indicating copy to clipboard operation
OpenBLAS copied to clipboard

OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.

Results 282 OpenBLAS issues
Sort by recently updated
recently updated
newest added

Step to reproduce build error: Host: modern x86_64 with AVX2 + FMA3. Build: CMake, DYNAMIC_ARCH=OFF (e.g. vcpkg). Results in GCC errors in sdot_k / _mm256_fmadd_ps due to missing -mfma. My...

Support
Waiting for answer

Hi @martin-frbg , I wanted to know if there's any tentative date for release / branchcut for 0.31.0 ? I was referring to the milestone here: [Milestone-0.31.0](https://github.com/OpenMathLib/OpenBLAS/milestone/50)/

Significant performance improvements are gained by the proposed changes on both c7g (NEOVERSEV1) and c8g (NEOVERSEV2) instances. To reproduce these values you need to run with OMP_ADAPTIVE=1. The plots below...

Resolves #5553. The parameters `[SD]GEMM_DEFAULT_[PQR]` have been tuned to obtain the performance improvement in `[SD]GEMM` under the multi-process evaluation using all cores of `A64FX`. This change improves the performance of...

Hi, I’d like clarification on using OpenBLAS on a 2×192-core AmpereOne (384 cores total), AArch64 Linux. Planned build: ```bash make TARGET=ARMV8 USE_OPENMP=1 NUM_THREADS=384 BIGNUMA=1 ``` Docs mention a default 256-core...

while building OpenBLAS-0.3.30 on OpenIndiana SPARC the following error happens in directory: test OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 ./dblat1 Real BLAS Test Program Results ... Test of subprogram number 7 DNRM2 Program received...

The parameters `[SD]GEMM_DEFAULT_[PQR]` are already set in `param.h` for several CPUs, but those values do not appear to be fine-tuned for `A64FX`. I am willing to resolve this issue.

Add a CI run for clang on ppc64le in order to prevent build/tests regressions. related to #5545

0.3.30 version It seems that cblas_sgemm() always output the zero result. ``` #include #include #include #include #include void generate_random_matrix(float *matrix, int rows, int cols) { for (int i = 0;...