Tom Lin
Tom Lin
Good thing to have when comparing with other parallel programming models, mostly for syntax. This also makes us consistent with CloverLeaf, TeaLeaf, and miniBUDE. Closes #190 .
Clang++ supports CUDA, we'll need to tweak the CMake models to accommodate this.
Probably just need to take the OpenMP version and remove the pragmas.
The current Julia version in the v2 branch doesn't implement all the auto-tuning features of the v2 C++ driver.
If we try to launch the benchmark with an non existent kernel WGSIZE, the program actually gives you an invalid result instead of reporting this and terminating early: ``` miniBUDE:...
We should consolidate all driver code to use the C++ version, this brings name-based matching and unified argument parsing across all C/C++ implementations.
At least for the C++ implementations, if the numpose parameter is set to something other than 65536, the benchmark terminates saying `bad poses: N`.
We should eventually consolidate all the different terms. Maybe keep only two, so something like: * `UNROLL_ITER` for the static vector/loop unroll thing. Should be available in all impl. *...
Trying the current(https://github.com/UoB-HPC/bude-portability-benchmark/commit/37a6bd8f4b23da39973cfb2be3b7bd6798792782) OpenCL version on Intel UHD630 with the [NEO CL driver from Intel](https://github.com/intel/compute-runtime/blob/master/opencl/doc/DISTRIBUTIONS.md) produces a segfault: ``` Running OpenCL [New Thread 0x7fffef5db700 (LWP 219939)] Using device: Intel(R) Gen9...
Would be great if we can setup [clang-tidy](https://clang.llvm.org/extra/clang-tidy/) to perform checks in the CI. Note that clang-tidy is directly supported in CMake, we can simply do: ```cmake set(CMAKE_CXX_CLANG_TIDY clang-tidy; -format-style='my-style;...