dbollweg

Results: 2 issues by dbollweg

I finally got access to Intel GPUs and was able to test the SYCL version of sliceSum on them. This PR fixes a bug where sliceSum gave wrong results for SpinColourMatrix objects.

This PR adds a new AutoTimer class that attaches roctx/nvtx markers and timing to every kernel launch issued through our runFunctor interface. Additionally, arbitrary scopes inside the code can be marked and...
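To illustrate the general idea of marking and timing a scope, here is a minimal RAII sketch. It is not the AutoTimer from this PR: `ScopedTimer`, `timedRun`, `markerPush` and `markerPop` are hypothetical names, and the marker functions are stand-ins that only count nesting depth. In a real build they would presumably forward to the NVTX or ROCTX range push/pop calls.

```cpp
#include <cassert>
#include <chrono>
#include <cstdio>
#include <string>
#include <utility>

// Hypothetical stand-ins for profiler range markers. A real build would
// forward these to the NVTX or ROCTX range push/pop functions; here they
// only track nesting depth so the sketch runs without a GPU toolchain.
static int g_marker_depth = 0;
inline void markerPush(const char* /*name*/) { ++g_marker_depth; }
inline void markerPop() {
    assert(g_marker_depth > 0);  // catch an unbalanced pop
    --g_marker_depth;
}

// Hypothetical RAII guard: opens a marker range on construction, closes it
// on destruction, and reports the elapsed host wall-clock time.
class ScopedTimer {
public:
    explicit ScopedTimer(std::string name, double* elapsed_ms = nullptr)
        : name_(std::move(name)), out_(elapsed_ms),
          start_(std::chrono::steady_clock::now()) {
        markerPush(name_.c_str());
    }

    ~ScopedTimer() {
        markerPop();
        const auto end = std::chrono::steady_clock::now();
        const double ms =
            std::chrono::duration<double, std::milli>(end - start_).count();
        if (out_) *out_ = ms;  // optionally hand the result back to the caller
        std::printf("[timer] %s: %.3f ms\n", name_.c_str(), ms);
    }

    ScopedTimer(const ScopedTimer&) = delete;
    ScopedTimer& operator=(const ScopedTimer&) = delete;

private:
    std::string name_;
    double* out_;
    std::chrono::steady_clock::time_point start_;
};

// Hypothetical launcher wrapper: runs any callable inside a timed, marked
// scope and returns the elapsed time in milliseconds.
template <typename F>
double timedRun(const char* label, F&& f) {
    double ms = -1.0;
    {
        ScopedTimer t(label, &ms);
        std::forward<F>(f)();
    }
    return ms;
}
```

An arbitrary scope can be marked by declaring `ScopedTimer t("my scope");` at its top. The range closes automatically when `t` goes out of scope. Note that GPU kernel launches are asynchronous, so a host-side timer like this one only measures the full kernel duration if the device is synchronized before the guard is destroyed.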