Fabian Knorr
In addition to execution commands, scheduling a task will generate push / await-push / reduction cascades on participating nodes. So far, all of these implicit commands are `MPI_Sent` to workers...
- As of C++11, integral and floating-point literals can be followed by a user-defined literal
- Class and Struct definitions can contain the final, private, protected, public keywords
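For reference, a minimal C++ sketch of the two language features named above. The suffixes `_kib` and `_percent` and the types `shape` and `square` are hypothetical names chosen only for illustration; they do not come from the original report.

```cpp
#include <cassert>

// Hypothetical user-defined literal suffixes (illustrative names only).
// An integral literal followed by a suffix: 4_kib
constexpr unsigned long long operator""_kib(unsigned long long n) { return n * 1024; }
// A floating-point literal followed by a suffix: 50.0_percent
constexpr long double operator""_percent(long double p) { return p / 100.0L; }

struct shape {
    virtual ~shape() = default;
    virtual int sides() const = 0;
};

// `final` after the class name forbids further derivation;
// `final` after a member function forbids overriding it.
class square final : public shape {
  public:
    int sides() const final { return 4; }
    int id() const { return id_; }

  protected:
    int cached_area_ = 0; // reachable from derived classes (none possible here)

  private:
    int id_ = 0; // reachable only from within square
};

// Small self-check exercising both features.
inline void check_features() {
    static_assert(4_kib == 4096, "integral literal with suffix");
    assert(square{}.sides() == 4);
}
```

A highlighter that handles these cases has to accept `4_kib` and `50.0_percent` as single literal tokens, and `final` both after a class name and after a member function declarator.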
In this issue I want to make the case for an API that allows submitting SYCL kernels directly onto existing CUDA streams (or equivalent native backend queue types). This would...
Open SYCL currently [prints warnings and errors to stdout](https://github.com/OpenSYCL/OpenSYCL/blob/67e972bee890eafa3e29243ecac9cb987226a395/include/hipSYCL/common/debug.hpp#L64). These lines mix with any output the user decides to print as part of normal program execution, and any script that...
This is the second PR in the Instruction Graph series, and the first one to touch the execution side of the new runtime. It adds the necessary infrastructure to perform...
RFC, based on #182. A common parallelization pattern, e.g. in dense matrix-vector products, is to map 1D thread-ids to the rows of a 2D matrix while iterating over all columns...
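The pattern described above can be sketched in plain C++ as follows. This is a serial stand-in under stated assumptions, not the API proposed in the RFC: the helpers `dot_row` and `matvec` are hypothetical, the matrix is assumed to be stored row-major, and the outer loop takes the place of a 1D parallel launch in which each iteration corresponds to one thread id.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper: the work of the "thread" with 1D id `row`.
// It iterates over all columns of its row and produces one output element.
float dot_row(const std::vector<float>& matrix, const std::vector<float>& vec,
              std::size_t cols, std::size_t row) {
    float sum = 0.0f;
    for (std::size_t col = 0; col < cols; ++col) {
        sum += matrix[row * cols + col] * vec[col]; // row-major indexing (assumed)
    }
    return sum;
}

// Serial stand-in for a 1D launch over `rows` thread ids.
std::vector<float> matvec(const std::vector<float>& matrix, const std::vector<float>& vec,
                          std::size_t rows, std::size_t cols) {
    assert(matrix.size() == rows * cols);
    assert(vec.size() == cols);
    std::vector<float> result(rows);
    for (std::size_t row = 0; row < rows; ++row) { // one iteration per thread id
        result[row] = dot_row(matrix, vec, cols, row);
    }
    return result;
}
```

Note the mismatch this illustrates: the iteration space is one-dimensional (rows only), while each work item reads an entire row of a two-dimensional buffer plus the whole input vector.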
### Describe the bug
To split work in a multi-GPU setting, we need to find sets of equal / compatible GPUs on a system. On a system with 4x Nvidia...
### Is your feature request related to a problem? Please describe
SYCL allows submitting native backend operations to (in-order) queues without paying the synchronization overhead cost of a `host_task` through...
For some setups it might be desirable to have window decorations appear at the bottom of a window or container:
- For mouse-heavy applications, the top of the window is...