Raghuveer Devulapalli

Results 122 comments of Raghuveer Devulapalli

@ixgbe thanks for your contribution. @seiko2plus had a few comments about using a wrapper around Highway tags to make it easier to develop template functions. I have tagged him to...

From https://github.com/numpy/numpy/pull/21057: If I understood correctly, it was along the lines of https://github.com/numpy/numpy/blob/b70cda03ab4591806f3206827999144c33cd1bc3/numpy/_core/src/umath/loops_comparison.dispatch.cpp#L12-L82

> During the last optimization meeting, I proposed a thin wrapper over Google's Highway > SIMD library to simplify its interface. The wrapper would eliminate the need for > class...

Hello @Rohanjames1997. Thank you for your interest! PR #21955 leveraged Intel SVML library which is currently platform specific (x86, AVX-512). We ideally want to convert these from asm to highway...

If you want to get started on how to use highway, then converting any of dispatch C sources to highway might be a great start. See example: https://github.com/numpy/numpy/pull/27402

~@jankoslavic I am little confused. Your NumPy output says `{'architecture': 'Haswell',` but you mention that you have a MeteorLake processor? Is NumPy reporting architecture incorrectly?~ EDIT: nevermind, I see the...

BTW, this doesn't seem to be related to E-core/P-core or anything to do with Windows. I see this behavior on my linux SkylakeX too which has all uniform cores. The...

> The Meteor Lake hardware only aggravates this by advertising full-featured thread capabilities on the two low-power E-cores while they're apparently barely able to cope with BLAS workloads. [EDIT]: these...

hah, @martin-frbg you are right. Looks like that behavior was with turbo disabled which makes all the cores run at the same low frequency. Once I enable turbo, the P...

121 tests fail https://dev.azure.com/numpy/numpy/_build/results?buildId=36327&view=logs&j=bb985aa7-6f2e-5862-34d1-fe760a3f4424&t=fedaa2b4-fa4d-5ee0-669f-9fb1714eeeb2 Have you looked into these yet?