Aurora Perego comments

Results 25 comments of


                                            Aurora Perego

[SYCL] Adopt the experimental free function extension

The performance does not change: | nThreads | base[@c9dbac0](https://github.com/AuroraPerego/alpaka/tree/c9dbac0e8f347a00c6bb93861d0c38bf191fcc8c) | base + PR | |-------|-------|-------| | 1 | 148.44 +/- 3.48) | 148.32 +/- 0.35) | | 2 | 172.73...

Add CI jobs with `alpaka_DEBUG=2`

FYI in debug mode with the CUDA back-end I see: ```bash 93% tests passed, 2 tests failed out of 30 Total Test time (real) = 316.42 sec The following tests...

Gitlab CI: enable test for the sycl CPU backend

Good! The tests that fail are due to the OpenCL CPU runtime :(

Gitlab CI: enable test for the sycl CPU backend

Something is broken in the runtime. I do not think is a bug in alpaka because all the tests pass with an older runtime (see #2140).

Gitlab CI: enable test for the sycl CPU backend

As I wrote in the README, the only problem is that there is a bug in `all_of_group`/`any_of_group` methods (that does not affect the results of the tests). For that we...

[SYCL] remove dependencies from `memcpy`, `memset` and kernel launch

Yes, i've built and run the tests for CPU and GPU with the usual result (all pass on GPU, 4 out of resources on CPU)

[SYCL] remove dependencies from `memcpy`, `memset` and kernel launch

(I'll check also the impact on the performance)

[SYCL] remove dependencies from `memcpy`, `memset` and kernel launch

The situation is worse than expected, apparently there are many more synchronization introduced with these changes. I compared the baseline with this PR and with another version that has only...

[SYCL] remove dependencies from `memcpy`, `memset` and kernel launch

Yes, sorry. The tests have been done on pixeltrack-standalone[@dcf2898](https://github.com/AuroraPerego/pixeltrack-standalone/tree/dcf2898da7d66d89fdff2607d016e03c1671f1c3). These numbers are the throughput, i.e. the number of concurrent events processed per second, computed processing 10000 events and `nThreads` is...

[SYCL] remove dependencies from `memcpy`, `memset` and kernel launch

Not sure the handling it's correct, it was done mostly to remove the `ext_oneapi_submit_barrier()` and `ext_oneapi_empty()` calls to check the impact. At this point I'm not sure if it's worth...