Mikael Simberg issues

Results: 232 issues of Mikael Simberg

Use `counting_semaphore`/`sliding_semaphore` for throttling in MPI polling implementation

effort: 3

priority: low

type: refactoring

category: senders/receivers

Use `fmt` in MPI/CUDA modules for debugging output

effort: 2

effort: 3

priority: low

type: cleanup

Possible integration with OpenMP runtime

There are many levels of potential integration with the OpenMP runtime to avoid contention between pika's and OpenMP's runtimes. The simplest would be to replace the loop spawning `std::thread`s...

effort: 3

effort: 4

priority: low

type: feature

type: refactoring

Add support for scheduling on same worker thread in default scheduler

The shared priority queue scheduler supports choosing how to schedule new tasks: round-robin or current thread. The default scheduler should support this as well.

effort: 3

priority: low

type: feature
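The two placement modes named in this issue can be illustrated with a minimal sketch. It is not pika's scheduler code: `placement`, `queue_selector`, and `select` are hypothetical names, and the sketch only assumes that each worker thread owns one queue identified by an index.

```cpp
#include <cstddef>

// Hypothetical placement modes, mirroring the two options in the issue text.
enum class placement { round_robin, current_thread };

// Hypothetical helper that picks the queue a new task should be pushed to.
struct queue_selector {
    std::size_t num_queues;   // number of worker queues, must be > 0
    std::size_t next = 0;     // next queue to use in round-robin mode

    // `current` is the index of the worker thread creating the task.
    std::size_t select(placement mode, std::size_t current) {
        if (mode == placement::current_thread) {
            // Keep the task on the queue of the thread that spawned it.
            return current;
        }
        // Round-robin: hand out queues in order, wrapping around.
        std::size_t chosen = next;
        next = (next + 1) % num_queues;
        return chosen;
    }
};
```

With three queues, repeated round-robin calls return 0, 1, 2, 0, and so on, while `placement::current_thread` always returns the caller's own index without advancing the round-robin counter.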

Attempt to use `condition_variable` to signal completion instead of callback in CUDA and MPI integrations

As pointed out by @biddisco an alternative way to signal completion in the CUDA and MPI integrations would be to have the receivers wait on a condition variable and have...

effort: 3

priority: low

type: refactoring

category: CUDA

category: senders/receivers
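The general pattern of waiting on a condition variable instead of being invoked through a callback can be sketched with the standard library alone. This is only an illustration under assumptions: `completion_signal` and `run_and_wait` are hypothetical names, a plain `std::thread` stands in for whatever detects completion, and nothing here reflects the actual CUDA or MPI integration code.

```cpp
#include <condition_variable>
#include <mutex>
#include <thread>

// Hypothetical completion state shared between the waiting side and the
// side that observes that the operation has finished.
struct completion_signal {
    std::mutex mtx;
    std::condition_variable cv;
    bool done = false;

    // Called by whoever detects completion (assumed: a polling loop).
    void notify() {
        {
            std::lock_guard<std::mutex> lock(mtx);
            done = true;
        }
        cv.notify_one();
    }

    // Called by the waiting side; blocks until notify() has run.
    // The predicate guards against spurious wakeups.
    void wait() {
        std::unique_lock<std::mutex> lock(mtx);
        cv.wait(lock, [this] { return done; });
    }
};

// Stand-in for an asynchronous operation: a helper thread writes a result
// and signals completion, while the caller blocks on the condition variable.
inline int run_and_wait() {
    completion_signal signal;
    int result = 0;
    std::thread poller([&] {
        result = 42;        // the "operation" produces its value
        signal.notify();    // then completion is signalled
    });
    signal.wait();
    poller.join();
    return result;
}
```

The write to `result` happens before `notify()` takes the mutex, so the waiter observes the final value once `wait()` returns.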

Attempt to replace `pika::invoke` and `pika::function` with `std` equivalents

Assuming we don't require our own version for e.g. CUDA and the replacements actually work, we need to take care that the following don't get worse: - compile time -...

effort: 3

effort: 4

priority: low

type: refactoring

Attempt to add a faster path for getting the next thread from unstaged queue

Currently, if the queue of pending threads is empty on a worker thread, `get_next_thread` returns, the scheduler goes on to add new threads from the staged queue to the pending queue, and then...

effort: 3

priority: low

type: refactoring

Revisit background thread support in scheduling loop

effort: 2

effort: 3

priority: low

type: cleanup

Revisit CPO structure

The sender/receiver CPOs currently use a helper base class to define fallback implementations with `tag_fallback_invoke`. The need for `tag_fallback_invoke` should be revisited and the CPO types should potentially be in...

effort: 3

priority: medium

type: refactoring
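For readers unfamiliar with the pattern, the idea of a customization point object (CPO) with a fallback implementation can be sketched as follows. This is a simplified C++17 illustration, not pika's implementation: it folds the fallback directly into the CPO rather than using a helper base class or `tag_fallback_invoke`, and `describe`, `describe_t`, and `has_custom` are hypothetical names.

```cpp
#include <string>
#include <type_traits>
#include <utility>

namespace sketch {
    struct describe_t;

    // Detects whether a tag_invoke(describe_t, T) overload is found by ADL.
    template <typename T, typename = void>
    struct has_custom : std::false_type {};

    template <typename T>
    struct has_custom<T,
        std::void_t<decltype(tag_invoke(std::declval<describe_t const&>(),
            std::declval<T const&>()))>> : std::true_type {};

    // Hypothetical CPO: dispatches to a user customization if one exists,
    // otherwise uses the built-in fallback.
    struct describe_t {
        template <typename T>
        std::string operator()(T const& t) const {
            if constexpr (has_custom<T>::value) {
                return tag_invoke(*this, t);   // user-provided customization
            } else {
                (void) t;
                return "default";              // fallback implementation
            }
        }
    };

    inline constexpr describe_t describe{};
}

namespace user {
    // A type that customizes the CPO via an ADL-visible tag_invoke overload.
    struct custom {};
    inline std::string tag_invoke(sketch::describe_t, custom const&) {
        return "custom";
    }

    // A type without a customization; the fallback is used for it.
    struct plain {};
}
```

Calling `sketch::describe(user::custom{})` selects the user's overload, while `sketch::describe(user::plain{})` falls through to the default.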

Test if CUDA callbacks would again be a viable replacement for polling

Event polling has been successful and has turned out to perform significantly better than using CUDA callbacks. However, that was tested when the CUDA callbacks still required runtime registration on...

effort: 3

effort: 4

priority: low

type: refactoring