Andrey Alekseenko

Results 92 comments of Andrey Alekseenko

> This is not currently possible. What do you do with those values once you have obtained them? > I'm optimizing register comsumption of SYCL code for higher performance running...

CMake's [`GNUInstallDirs`](https://cmake.org/cmake/help/latest/module/GNUInstallDirs.html) module seems to be a good solution here, but the `acpp` wrapper would also need to be modified to use the correct path for libraries (and, optionally, includes)....

> Could you please explain the reason of change from get_pointer() to get_multi_ptr in SYCL 2020? I was not involved in the decision, but my understanding is that `multi_ptr` interface...

> Just taking a shot in the dark, I wonder if export AMD_SERIALIZE_COPY=3 would prevent this. Thanks for the suggestion! We don't do any copies here, so `AMD_SERIALIZE_COPY=3` does not...

> Have you built AdaptiveCpp itself (i.e. the runtime library) with amdclang too With amdclang itself; on MI250X with libstdc++ from GCC 7, and on MI50 with libstd++ from GCC...

> This might not be easily possible. On CUDA or HIP, we can just assume that all discovered devices are NVIDIA or AMD respectively. Is `HIP_PLATFORM=nvidia` not supported? :thinking:

Cannot reproduce anymore with 19cd6144aaed64719475c4f4a0ee626bdbc428dc / Compute Runtime 23.35.

Nice progress for SSCP! As I understand, no work is planned for the multipass flow? > The conversions are still there (to remove those I think we'd have to take...

> Yes, I guess we could always add some macro that, if set, potentially adds llvm.assume calls, or changes datatypes inside nd_item. The `assume` calls, I think, can be added...

> The assume calls, I think, can be added unconditionally, no? Unlike changing the return type of API, they don't enforce anything, but, per "as-if" rule, the LLVM optimizer should...