alpaka
Abstraction Library for Parallel Kernel Acceleration :llama:
While working on #1713 I discovered that the Boost.fiber back-end is broken when enabling C++20. This was fixed in their repository a few days ago but all stable versions including...
The example is not ready yet, but here is the current version. I feel for now the memory part is awkward and I have to somehow reformulate it. The goals...
AccCpuThreads is currently a bad showcase of C++11 threads, as it uses the sub-optimal strategy of spawning CPU threads at the thread level instead of the block level, just like the equally useless...
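The distinction above can be sketched as follows. This is illustrative code, not alpaka's implementation: one `std::thread` is spawned per *block*, and each thread then loops over the block's elements, keeping the thread count small instead of one OS thread per element.

```cpp
#include <cstddef>
#include <thread>
#include <vector>

// Hypothetical sketch of block-level threading: each std::thread handles a
// whole block of elements, so only numBlocks threads exist at once.
void runBlocks(std::size_t numBlocks, std::size_t blockSize, std::vector<int>& out)
{
    std::vector<std::thread> threads;
    threads.reserve(numBlocks);
    for(std::size_t b = 0; b < numBlocks; ++b)
        threads.emplace_back(
            [b, blockSize, &out]
            {
                // the thread iterates all elements of its block sequentially
                for(std::size_t i = 0; i < blockSize; ++i)
                    out[b * blockSize + i] = static_cast<int>(b);
            });
    for(auto& t : threads)
        t.join();
}
```

Spawning at thread (element) level instead would create `numBlocks * blockSize` OS threads, which oversubscribes the CPU badly.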
CUDA 11.7 is released: https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html
Currently `AtomicStdLibLock` has a static mutex hash table which allows it to synchronize between all grids executed within a process. However, this is not documented and does not conform to...
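The hashed-mutex idea can be sketched like this (names and table size are illustrative, not alpaka's actual code): a static table of mutexes is indexed by hashing the target address, so atomics on the same address serialize. Because the table is `static`, it is shared by every grid in the process, which is the undocumented cross-grid synchronization the issue describes.

```cpp
#include <array>
#include <functional>
#include <mutex>

// Illustrative sketch of a hashed-mutex atomic add. The static mutex table is
// process-wide, so concurrent grids happen to synchronize through it too.
template<typename T>
T atomicAddLocked(T* addr, T value)
{
    static std::array<std::mutex, 256> mutexes; // shared by the whole process
    auto const idx = std::hash<T*>{}(addr) % mutexes.size();
    std::lock_guard<std::mutex> lock(mutexes[idx]);
    T const old = *addr;
    *addr += value;
    return old; // return the previous value, like CUDA's atomicAdd
}
```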
This is the follow-up to an offline discussion; it is not yet a real issue. Alpaka enforces that kernel arguments are taken either by value or by `const &`....
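The rule can be expressed as a compile-time check; this trait name is mine, not alpaka's: a parameter type is accepted when it is passed by value or by const reference, and rejected when it is a mutable reference.

```cpp
#include <type_traits>

// Hypothetical trait capturing the rule: by value or const& is fine,
// a non-const reference is not. (C++17)
template<typename T>
inline constexpr bool isValidKernelArg
    = !std::is_reference_v<T> || std::is_const_v<std::remove_reference_t<T>>;

static_assert(isValidKernelArg<int>);        // by value: ok
static_assert(isValidKernelArg<int const&>); // const&: ok
static_assert(!isValidKernelArg<int&>);      // mutable reference: rejected
```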
Executing the fibers in random order prevents memory prefetching. Iterating X first, Y second, Z third (native C memory order) would assist the prefetcher by using the expected default...
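A minimal illustration of the prefetch-friendly order, assuming the usual linearization `idx = (z * ny + y) * nx + x`: with X innermost, consecutive iterations touch consecutive addresses (unit stride), which is exactly the pattern hardware prefetchers expect.

```cpp
#include <cstddef>
#include <vector>

// Linear index for native C memory order: X varies fastest.
std::size_t linearize(std::size_t x, std::size_t y, std::size_t z,
                      std::size_t nx, std::size_t ny)
{
    return (z * ny + y) * nx + x;
}

// Traverse Z outermost, Y next, X innermost: the writes below walk memory
// contiguously, so buf[i] receives exactly the value i.
void fillSequential(std::vector<int>& buf,
                    std::size_t nx, std::size_t ny, std::size_t nz)
{
    int v = 0;
    for(std::size_t z = 0; z < nz; ++z)
        for(std::size_t y = 0; y < ny; ++y)
            for(std::size_t x = 0; x < nx; ++x) // unit-stride inner loop
                buf[linearize(x, y, z, nx, ny)] = v++;
}
```

Swapping the loop nest (Z innermost) would jump `nx * ny` elements between consecutive writes and defeat the prefetcher, which is the effect random fiber execution has.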
Enhance the fibers implementation by parallelizing the execution of the blocks.
I realized in https://github.com/alpaka-group/alpaka/pull/1707#discussion_r867812402 that when the alpaka device is destroyed, the device is not correctly freed. The class https://github.com/alpaka-group/alpaka/blob/b074b0df68a96321dc73261ab2b9d3d41180f18c/include/alpaka/dev/DevUniformCudaHipRt.hpp#L62 should call `reset()`, which calls `cudaDeviceReset()/hipDeviceReset()` and guarantees that...
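One way the proposed fix could look, as a hedged RAII sketch (identifiers are illustrative, and `deviceReset` below is a stand-in for the real `cudaDeviceReset()/hipDeviceReset()` call, stubbed so the example runs without a GPU): the device handle owns its native state through a `shared_ptr` whose deleter performs the reset, so the last owner going out of scope frees the device.

```cpp
#include <memory>

// Stand-in for the native device state.
struct NativeDevice
{
    int id;
};

// Stand-in for cudaDeviceReset()/hipDeviceReset(); here it just releases
// the host-side state so the sketch is runnable anywhere.
void deviceReset(NativeDevice* dev)
{
    delete dev;
}

// RAII device handle: copies share ownership, and the reset runs exactly
// once, when the last copy is destroyed.
class Dev
{
public:
    explicit Dev(int id) : m_impl(new NativeDevice{id}, &deviceReset)
    {
    }
    int id() const
    {
        return m_impl->id;
    }

private:
    std::shared_ptr<NativeDevice> m_impl;
};
```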
#1686 introduced the macro `ALPAKA_DEFAULT_HOST_MEMORY_ALIGNMENT`. We should document it properly and consider making it available via CMake. _Originally posted by @bernhardmgruber in https://github.com/alpaka-group/alpaka/issues/1686#issuecomment-1096531865_