Sayed Adel
I was thinking about the same thing. Sleef needs a flexible mechanism to manage the relationship between the baseline and the dispatchable CPU features, and to simplify runtime dispatching. Similar...
@mattip, > is there a known problem with gcc 6? Regarding the current build error, here are some: - limited support for AVX512 mask operations * synonym intrinsics `_k[and,...
Sorry for the delay, but I need a full build log to determine what we are facing here. The current error is related to MMX (x86) intrinsics while the host is...
@HowJMay, first of all, I would like to thank you for your interest in improving the performance of NumPy. We really appreciate your efforts, but again, there's no need to...
I have one question before I go further in reviewing this PR. Does the SVML implementation use any of the `AVX512FP16` instruction set, or does it rely only on single-precision operations? If...
@r-devulap, would you please respond to https://github.com/numpy/numpy/pull/21955#discussion_r949668043? If you disagree, then there are a few changes that will need to be made.
To be clear, the environment variable `NPY_DISABLE_CPU_FEATURES` only disables the dispatched features at runtime, without the need to rebuild. This is inexpensive because the process of detecting and...
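As a rough illustration of the runtime-only behaviour described above, here is a minimal sketch of setting the variable for a child interpreter. The helper name `env_with_disabled_features` is hypothetical, the feature names `AVX2` and `FMA3` are picked only for illustration, and the comma-separated value format is assumed from the NumPy documentation. The variable has to be in the environment before NumPy is imported, which is why the sketch launches a separate process.

```python
import importlib.util
import os
import subprocess
import sys


def env_with_disabled_features(features, base_env=None):
    """Return a copy of an environment with NPY_DISABLE_CPU_FEATURES set.

    Hypothetical helper; `features` is a list of feature names, joined
    with commas (assumed format).
    """
    env = dict(os.environ if base_env is None else base_env)
    env["NPY_DISABLE_CPU_FEATURES"] = ",".join(features)
    return env


# Illustrative feature names; adjust to the features dispatched on your build.
child_env = env_with_disabled_features(["AVX2", "FMA3"])

# Only launch the child interpreter if NumPy is installed. The child may
# warn or fail if a listed feature cannot be disabled, so the return code
# is not checked here.
if importlib.util.find_spec("numpy") is not None:
    subprocess.run(
        [sys.executable, "-c", "import numpy; print(numpy.__version__)"],
        env=child_env,
        check=False,
    )
```

No rebuild is involved: the same installed NumPy is imported, only with the listed dispatched features switched off for that process.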
This feature is generally accepted, but I fear that it may be misused. For example, if the `AVX` features are enabled without the non-baseline `SSE` features, e.g. `sse41`, it will...
@charris, yes, unless @howjmay decides to re-implement it similar to #19780
@r-devulap, just one thing left: would you please execute [source/reference/simd/gen_features.py](https://github.com/numpy/numpy/blob/main/doc/source/reference/simd/gen_features.py) locally to update the tables of supported features for the SIMD doc?