b-sumner

Results 144 comments of b-sumner

Are any opposed to making this change?

I added the agenda label hoping to get final approval of this change.

@JNYXR your situation is not clear to me. But if you are writing HIP source code, then I would recommend using `__builtin_nontemporal_{load, store}`. This will work on all targets.

Is your shared library linked to the hip runtime, i.e. are all of its dependencies explicitly linked in?

But is it NEEDED? "readelf -d your_shared.so | grep NEEDED"

Thanks! I guess we need to dig deeper. Can you provide a minimal reproducer?

I wasn't sure if I should start a new issue or add to this one. It would be nice if herbie would accept sinpi(x) and cospi(x) in expressions since these...

@zjin-lcf that code has every single thread hammering on a small set of locations in very few cache lines. Code doing that should be expected to be "slow". However, we...