alpaka issues

`alpaka::getWarpSizes` incurs a noticeable overhead

5

While porting the CMS pixel reconstruction from native CUDA to Alpaka, it was noticed that the use of the `alpaka::getWarpSizes(device)` function incurs a noticeable overhead. See https://github.com/cms-sw/cmssw/pull/43064#issuecomment-1817590926 for the discussion....

fwyzard

Type:Enhancement

Backend:CUDA

Backend:SYCL

Backend:HIP
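A caller-side workaround for the overhead described in this issue could be to query the warp sizes once per device and reuse the result. The following is only a minimal sketch under stated assumptions: `WarpSizeCache` and the integer `deviceIndex` key are hypothetical names introduced for illustration, the injected `query` callable stands in for a call such as `alpaka::getWarpSizes(device)`, and nothing here is taken from alpaka itself or from the linked discussion.

```cpp
#include <cstddef>
#include <functional>
#include <map>
#include <mutex>
#include <utility>
#include <vector>

// Hypothetical helper (not part of alpaka): memoizes an expensive
// per-device query so that it runs at most once per device index.
class WarpSizeCache
{
public:
    // The query is injected; in real code it would wrap the actual
    // device query. Here it is an assumption used for illustration.
    using Query = std::function<std::vector<std::size_t>(int)>;

    explicit WarpSizeCache(Query query) : m_query(std::move(query))
    {
    }

    // Returns the cached warp sizes, running the query on first use only.
    std::vector<std::size_t> const& get(int deviceIndex)
    {
        std::lock_guard<std::mutex> lock(m_mutex);
        auto it = m_cache.find(deviceIndex);
        if(it == m_cache.end())
        {
            it = m_cache.emplace(deviceIndex, m_query(deviceIndex)).first;
        }
        // References into std::map stay valid after later insertions.
        return it->second;
    }

private:
    Query m_query;
    std::mutex m_mutex;
    std::map<int, std::vector<std::size_t>> m_cache;
};
```

With such a cache, a hot path would call `cache.get(deviceIndex)` instead of repeating the device query, so the cost is paid once per device rather than on every call.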

Enable MSVC+CUDA jobs

10

#1958 disabled our Windows+CUDA CI because of a bug in the Windows `nvcc`. Once this is fixed we should reenable the CUDA-on-Windows jobs.

j-stephan

Type:Testing

Backend:CUDA

OS:Windows

atomic_ref based atomics are too strong

2

The CPU atomic implementation using `std::atomic_ref` uses a sequentially consistent memory ordering, which is a stronger guarantee than that of its CUDA counterparts, which are weakly ordered and always require explicit fences....

bernhardmgruber

Type:Enhancement

device global variables in SYCL

11

That is a tough topic. First, a quick look into the usage of device global variables in SYCL (since [oneAPI 2023.2](https://github.com/intel/llvm/blob/sycl/sycl/doc/extensions/experimental/sycl_ext_oneapi_device_global.asciidoc)):

```cpp
// declaration of the variable
sycl::ext::oneapi::experimental::device_global myGlobVar;
int...
```

AuroraPerego

Type:Enhancement

Backend:SYCL

Add CI jobs with `alpaka_DEBUG=2`

10

Currently, all Debug CI jobs (except one analysis job) run with `alpaka_DEBUG=0`. This means that extra debugging code is never tested by the CI. We should add at least a few...

bernhardmgruber

Type:Testing

[RFC] Clarify authorship and copyright

10

Preface: This issue is not about relicensing alpaka or stripping people / institutions of their (copy)rights. Its purpose is simply to be a clarification of the current legal state. While...

j-stephan

Type:Question

Type:Documentation

Gitlab CI: enable test for the sycl CPU backend

6

I didn't expect that it would work now, but it is interesting.

SimeonEhrig

alpaka's implementation of shared memory is slower than the native SYCL one

_Originally posted by @AuroraPerego in https://github.com/alpaka-group/alpaka/pull/2140#discussion_r1316475428_

j-stephan

Type:Bug

Backend:SYCL

Gitlab CI: add ARM Custom job

alpaka works on ARM and we have an ARM CI runner. Unfortunately, we cannot use the pre-built containers for the ARM job, because they are built for the x86...

SimeonEhrig

Type:Enhancement

GitLab CI: GCC CPU and Sycl CPU do not run ctest

3

At the moment, the GitLab CI does not run `ctest` on the CPU runner when CPU backends are used. This is a leftover from the beginning of...

SimeonEhrig

Type:Bug

Type:Testing

Backend:TBB

Backend:std::thread

Backend:SYCL

Backend:Serial
