Georgi Mirazchiyski issues

Results: 17 issues of Georgi Mirazchiyski

[SYCL][Ext] Query kernel maximum active work-groups based on occupancy

The currently proposed and implemented query is `max_num_work_group_occupancy_per_cu`, which retrieves the maximum number of actively executing work-groups at compute-unit occupancy granularity. This commit also overloads the `max_num_work_group_sync` query to take...

[CUDA] Implement urKernelSuggestMaxCooperativeGroupCountExp for Cuda

This commit implements the experimental `urKernelSuggestMaxCooperativeGroupCountExp` entry point for the CUDA adapter, which retrieves the maximum number of cooperative groups that can be launched on the device. The changes also cache...

experimental

cuda
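
The summary above does not show how the adapter derives the count. A minimal sketch of one common approach on CUDA, assuming the limit is the number of resident blocks per multiprocessor times the number of multiprocessors, could look like the following. The helper name `maxCooperativeGroupCount` and its parameters are hypothetical and are not part of Unified Runtime or CUDA; in a real adapter the two inputs would typically come from the CUDA driver calls named in the comments.

```cpp
#include <cstdint>

// Hypothetical helper for illustration only.
// activeBlocksPerSM:   blocks of the kernel that can be resident on one
//                      multiprocessor, e.g. as reported by the CUDA driver's
//                      cuOccupancyMaxActiveBlocksPerMultiprocessor.
// multiprocessorCount: e.g. the CU_DEVICE_ATTRIBUTE_MULTIPROCESSOR_COUNT
//                      device attribute.
inline std::uint32_t maxCooperativeGroupCount(int activeBlocksPerSM,
                                              int multiprocessorCount) {
  // Invalid or zero inputs mean no group can be co-resident.
  if (activeBlocksPerSM <= 0 || multiprocessorCount <= 0) {
    return 0;
  }
  // All groups of a cooperative launch must be resident at the same time,
  // so the upper bound is per-multiprocessor occupancy times device width.
  return static_cast<std::uint32_t>(activeBlocksPerSM) *
         static_cast<std::uint32_t>(multiprocessorCount);
}
```

Caching device-wide values such as the multiprocessor count avoids repeated driver queries, which may be what the truncated sentence about caching refers to; that reading is an assumption.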

[Cuda] Reintroduce catching and reporting of bad_alloc for event object creation

Reintroduces the changes from commit https://github.com/oneapi-src/unified-runtime/commit/c4ae460f779021aa6840ea47a373f6ac336bc589, which were reverted in the related merged commit https://github.com/oneapi-src/unified-runtime/commit/1b4a8b852c6b86545b42a71927f88c4fed107217 after being mistakenly deleted and omitted during a rebase. This happened because the former changes got...

cuda

ready to merge
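
The pattern named in the title above, catching `std::bad_alloc` when an event object is created and reporting it as an error code, can be sketched as follows. This is an illustration under stated assumptions, not the adapter's code: `Result`, `Event`, `createEvent`, and the `simulateFailure` flag are hypothetical stand-ins for the real types and result codes, which the summary does not show.

```cpp
#include <memory>
#include <new>

// Hypothetical stand-ins for the adapter's result codes and event type.
enum class Result { Success, OutOfHostMemory };
struct Event {
  int id;
};

// Creates an event and converts an allocation failure into an error code
// instead of letting std::bad_alloc escape across the API boundary.
// simulateFailure exists only so the failure path can be exercised here.
inline Result createEvent(int id, std::unique_ptr<Event> &out,
                          bool simulateFailure = false) {
  try {
    if (simulateFailure) {
      throw std::bad_alloc();
    }
    out = std::make_unique<Event>(Event{id});
    return Result::Success;
  } catch (const std::bad_alloc &) {
    out.reset(); // leave the output in a well-defined empty state
    return Result::OutOfHostMemory;
  }
}
```

Returning a code rather than propagating the exception matters for a C-style interface, where callers cannot be expected to handle C++ exceptions.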

[SYCL] Limit allocation size in atomic_memory_order_seq_cst test to a safe maximum size

[Cuda] Simplify the 'getMaxRegistersJitOptionValue' utility and its use

[Cuda] Save the Cuda native error code on adapter-specific errors

[SYCL] Add missing supported AMDGPU architectures to SYCL