Jérémie du Boisberranger
Jérémie du Boisberranger
It seems that svd calls ` xGESVD` or `xGESDD` with x = s or d
Thanks. I guess #678 is addressing the same concern, but it's still at the thinking process step :( About the hyperthreading, I'd strongly argue against using pseudocores. On every machine...
@brada4 Thanks for looking into it ! the cpu is an Intel(R) Xeon(R) CPU E5-2699 v4 @ 2.20GHz There are 2 sockets with 22 physical cores on each socket, and...
I edited my previous comment because at first I limited the cores the code could use, but I didn't limit the number of threads for openblas (I only used taskset)....
numactl is not installed and I don't have root access. However, lscpu is enough here and shows 2 numa nodes (with their assigned core ids). When setting OPENBLAS_NUM_THREADS=44, the OS...
> Do you have any authoritative documentation that hyperthreads are waste? Absolutely not. As I said, it's just the result of experiences. It should not be taken for granted, but...
``omp_get_max_threads()`` returns 4 (I have 2 physical cores + hyperthreading). Do you think OpenBLAS is using too many threads ? It only occurs when I use an OpenMP loop, so...
For what it's wotrh the number of running threads showed by htop only increases by 4 when I run this
a profiling with linux perf shows that most of the time comes from ``do_wait`` from libgomp
My main concern is that the issue happens with OpenBLAS built with **pthreads** (not when built with OpenMP) ! When I execute my snippet above, libopenblas is loaded because numpy...