dbollweg
Results: 2 issues of dbollweg
I finally got access to Intel GPUs and could test the SYCL version of sliceSum on them. This PR fixes a bug where sliceSum gave wrong results for SpinColourMatrix objects.
Adds a new AutoTimer class that attaches roctx/nvtx markers and timing to every kernel launch issued through our runFunctor interface. Additionally, arbitrary scopes inside the code can be marked and...
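
To illustrate the general idea of a scope-based timer with profiler markers, here is a minimal sketch. It is not the PR's AutoTimer. The names `ScopedTimer`, `timedLaunch`, `markerPush`, `markerPop`, `markerDepth`, and `timerRegistry` are hypothetical, and the marker functions are plain stand-ins for where calls such as `nvtxRangePushA`/`nvtxRangePop` or their rocTX counterparts would presumably go. The sketch only measures host-side wall time.

```cpp
#include <cassert>
#include <chrono>
#include <map>
#include <string>
#include <utility>

// Hypothetical stand-ins for profiler range markers. A real build would
// presumably forward these to NVTX or rocTX range push/pop calls.
inline int& markerDepth() { static int depth = 0; return depth; }
inline void markerPush(const char* /*name*/) { ++markerDepth(); }
inline void markerPop() { assert(markerDepth() > 0); --markerDepth(); }

// Accumulated statistics per label (assumed layout, for illustration only).
struct TimerStats {
    long calls = 0;
    double seconds = 0.0;
};

inline std::map<std::string, TimerStats>& timerRegistry() {
    static std::map<std::string, TimerStats> registry;
    return registry;
}

// RAII timer: opens a marker range and starts the clock on construction,
// closes the range and records the elapsed time on destruction.
class ScopedTimer {
public:
    explicit ScopedTimer(std::string name)
        : name_(std::move(name)), start_(std::chrono::steady_clock::now()) {
        markerPush(name_.c_str());
    }

    ~ScopedTimer() {
        markerPop();
        const std::chrono::duration<double> elapsed =
            std::chrono::steady_clock::now() - start_;
        TimerStats& stats = timerRegistry()[name_];
        stats.calls += 1;
        stats.seconds += elapsed.count();
    }

    ScopedTimer(const ScopedTimer&) = delete;
    ScopedTimer& operator=(const ScopedTimer&) = delete;

private:
    std::string name_;
    std::chrono::steady_clock::time_point start_;
};

// Hypothetical launch wrapper: every callable passed through it is timed
// under the given label. For asynchronous GPU kernels, the device would
// have to be synchronized before the timer goes out of scope for the
// host-side measurement to cover the kernel's execution.
template <typename Functor>
void timedLaunch(const std::string& label, Functor&& functor) {
    ScopedTimer timer(label);
    std::forward<Functor>(functor)();
}
```

Under these assumptions, an arbitrary scope can be marked by declaring `ScopedTimer t("setup");` at the top of a block, and a launch can be wrapped as `timedLaunch("axpy", [&] { /* launch kernel */ });`. Tying the marker range to object lifetime means the range is closed on every exit path, including early returns and exceptions.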