alpaka issues

refactor template order `allocMappedBuf`

3

Move template TPlatform as the last template. There is no need to provide the platform template signature if we pass the platform as an instance. follow up of #2162

SimeonEhrig

Type:Enhancement

Type:Refactoring

Support CUDA 12.4 and 12.5

8

SimeonEhrig

Type:Enhancement

Backend:CUDA

add/fix automatic build test of the sphinx doc

If saw, that the table for the section `Memory Management` in the CUDA Runtime API is missing: https://alpaka.readthedocs.io/en/latest/dev/backends.html#cuda-runtime-api At a local build, I saw the table has a format issue....

SimeonEhrig

Type:Bug

Type:Testing

Type:Documentation

Compilation errors with single-source branch and clang

2

Compiling with clang and the single source file included gives me these errors: ``` In file included from :4: /app/raw.githubusercontent.com/alpaka-group/alpaka/single-header/include/alpaka/alpaka.hpp:26621:67: error: template template argument has different template parameters than its...

StewMH

document cmake flag to build alpaka benchmarks

1

missing cmake documentation for PR #2237

SimeonEhrig

Type:Documentation

Type:CMake

alignment of CUDA/HIP shared memory ?

`GetDynSharedMem::getMem(acc)` is defined as: https://github.com/alpaka-group/alpaka/blob/9b15e664d103c581020aa5285171b67483eb5c59/include/alpaka/block/shared/dyn/BlockSharedMemDynUniformCudaHipBuiltIn.hpp#L38-L46 1. if the concern is that the memory may not be aligned enough for `T`, why not declare it as ```c++ extern __shared__ T shMem[];...

fwyzard

Type:Question

Backend:CUDA

Backend:HIP

Fix getValidWorkDiv for CUDA and ROCm [I] #2222

3

`getValidWorkDiv` has a bug. It only considers device _hard_ properties ( `TApi::getDeviceProperties()`); does not consider the kernel function. Actually kernel function properties can limit number of threads per block. In...

mehmetyusufoglu

Type:Bug

Document different alpaka terms and it relationships

11

During my work on PR #2180 I had some trouble to add the memory visibility on the correct concepts. Therefore I had a offline discussion with @psychocoderHPC and started to...

SimeonEhrig

State:Help Wanted

alpaka_RELOCATABLE_DEVICE_CODE is not tested with alpaka_add_library

PR #2273 fixes the broken `alpaka_RELOCATABLE_DEVICE_CODE` feature. The [separableCompilationTest](https://github.com/alpaka-group/alpaka/blob/develop/test/integ/separableCompilation/CMakeLists.txt) test tests only `alpaka_add_executeable`.

SimeonEhrig

Type:Bug

Type:Testing

Type:CMake

implement alpaka::meta::isList, alpaka::meta::ToList and alpaka::meta::toTuple

1

- `alpaka::meta::isTuple`: checks if a given type is a `std::tuple` or not - `alpaka::meta::toTuple`: pack a arbitrary number of types in a `std::tuple`. If the given type is a `std::tuple`...

SimeonEhrig

Type:Enhancement

alpaka
alpaka copied to clipboard

Metadata

refactor template order `allocMappedBuf`

Support CUDA 12.4 and 12.5

add/fix automatic build test of the sphinx doc

Compilation errors with single-source branch and clang

document cmake flag to build alpaka benchmarks

alignment of CUDA/HIP shared memory ?

Fix getValidWorkDiv for CUDA and ROCm [I] #2222

Document different alpaka terms and it relationships

alpaka_RELOCATABLE_DEVICE_CODE is not tested with alpaka_add_library

implement alpaka::meta::isList, alpaka::meta::ToList and alpaka::meta::toTuple

← Metadata

Owner

Metadata

alpaka alpaka copied to clipboard

Metadata

← Metadata

Owner

Metadata

alpaka
alpaka copied to clipboard