llvm
Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.
Signed-off-by: jinge90
**Bug Description** For a specific loop that could be unrolled, the compiler generates an inefficient native binary. The loop looks like this (kernlels.cc:350 in the reproducer): ```cpp #pragma unroll for (int k = 0;...
Signed-off-by: Konstantin S Bobrovsky
Two changes were made: 1. Driver passes an absolute path to an original source file through `-main-file-name` if integration footer is enabled. 2. Clang CodeGen uses the file specified through...
**Describe the bug** I am trying to run a program on the NVIDIA backend. The code is similar in the CUDA and dpcpp versions, but the dpcpp code takes considerably longer (x10) compared to...
**Describe the bug** When I try to compile a kernel with a recursive constexpr function for the CUDA backend I get ``` error: SYCL kernel cannot call a recursive function...
This patch implements `aspect::atomic64` by querying the information related to the 64-bit integer atomic operations. At the moment, the floating-point atomics are not supported at all (single and double precision)...
This document is still a work in progress; it is published as a draft to get early feedback.
The optimization is turned on for the non-ESIMD path even with -O0. It also works around the segfault on host when specialization constants are set on host but not used on device....