llvm issues

[WIP][SYCL-MLIR]: Changes to fix codegen

1. Added all remaining functions needed for the `single_task` test case in supported function list. Only declaration remains in the generated LLVMIR: ``` declare i8* @malloc(i64) declare void @free(i8*) declare...

whitneywhtsang

sycl-mlir

[SYCL-MLIR] Use GPUModuleOp to host SYCL device code

1

SYCL kernels are GPUFuncOps residing in the new GPUModuleOp, keeping host code in the regular module. Functions to be used by kernels must be cloned to the GPU module. When...

victor-eds

sycl-mlir

ptxas fatality Unresolved extern function '__muldc3'

1

Greetings, I'm getting the below compilation error when I target my gpu. I don't get the same issue when i target the intel fpga simulator or intel cpu. code is...

e404044

bug

cuda

compiler

[SYCL] Fix PI_KERNEL_MAX_SUB_GROUP_SIZE in OpenCL backend

2

Currently PI_KERNEL_MAX_SUB_GROUP_SIZE in the PI OpenCL backend uses the max work item sizes as the input to the corresponding OpenCL query to avoid truncation. However, using the max work item...

steffenlarsen

[SYCL][PI/CL] check device version/extensions rather than platform version/extensions

For OpenCL backends currently piProgramCreate() queries the platform version (CL_PLATFORM_VERSION) and platform extensions (CL_PLATFORM_EXTENSIONS) to check whether we're capable of running on top of a particular OpenCL backend. However, there...

dakr

[SYCL] Fix debug info generation when integration footer is present

4

This patch is to fix two known issues with debugging caused by integration footer presence, without redesigning the integration footer approach. One issue is the missing checksum for the main...

zahiraam

[SYCL] Implement missing accessor functions

5

llvm-test-suite patch: https://github.com/intel/llvm-test-suite/pull/1265

KornevNikita

[WIP][SYCL] Test opaque pointers support status.

For test purpose only! Do not merge.

bader

ignore-lint

[Driver][SYCL] Introduce gpu specific device targets for AOT

Expands spir64_gen target capabilities with -fsycl by introducing a number of GPU specific targets that can be specified via -fsycl-targets. These targets (intel_gpu_* in format) are a set of reserved...

mdtoguchi

[ROCm OpenCL] device::get_info<device::sub_group_sizes> throws Native API failed

12

**Describe the bug** With ROCm 4.5.2, trying to call `device.get_info()` on an AMD device throws `cl::sycl::runtime_error`. **To Reproduce** ```cpp #include #include int main() { std::vector devices = sycl::device::get_devices(); for (const...

al42and

bug

confirmed

llvm
llvm copied to clipboard

Metadata

[WIP][SYCL-MLIR]: Changes to fix codegen

[SYCL-MLIR] Use GPUModuleOp to host SYCL device code

ptxas fatality Unresolved extern function '__muldc3'

[SYCL] Fix PI_KERNEL_MAX_SUB_GROUP_SIZE in OpenCL backend

[SYCL][PI/CL] check device version/extensions rather than platform version/extensions

[SYCL] Fix debug info generation when integration footer is present

[SYCL] Implement missing accessor functions

[WIP][SYCL] Test opaque pointers support status.

[Driver][SYCL] Introduce gpu specific device targets for AOT

[ROCm OpenCL] device::get_info<device::sub_group_sizes> throws Native API failed

← Metadata

Owner

Metadata

llvm llvm copied to clipboard

Metadata

← Metadata

Owner

Metadata

llvm
llvm copied to clipboard