YAKL
Yet Another Kernel Launcher: A simple C++ framework for performance portability and Fortran code porting
I am refactoring some code to be 2D/3D agnostic, which involves reshaping large arrays of shape [Na, Nx, Ny, Nz] into [Na, Nx\*Ny\*Nz] (Na is typically ~10). These could easily...
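As a rough illustration of the reshaping described above, the sketch below collapses (ix, iy, iz) into a single cell index so the same code can serve 2D (nz = 1) and 3D grids. It assumes row-major (C-style) storage with the last index varying fastest. `GridDims`, `flatten`, `offset_flat`, and `offset_4d` are hypothetical helpers for illustration only, not YAKL functions.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical helper (not part of YAKL): spatial extents of the grid.
// For a 2D problem, set nz = 1.
struct GridDims {
  std::size_t nx, ny, nz;
  std::size_t size() const { return nx * ny * nz; }
};

// Collapse (ix, iy, iz) into one cell index, assuming row-major order (iz fastest).
inline std::size_t flatten(const GridDims &g, std::size_t ix, std::size_t iy, std::size_t iz) {
  assert(ix < g.nx && iy < g.ny && iz < g.nz);
  return (ix * g.ny + iy) * g.nz + iz;
}

// Linear offset of (ia, cell) in a row-major [Na, Nx*Ny*Nz] array.
inline std::size_t offset_flat(const GridDims &g, std::size_t ia, std::size_t cell) {
  assert(cell < g.size());
  return ia * g.size() + cell;
}

// Linear offset of (ia, ix, iy, iz) in the original row-major [Na, Nx, Ny, Nz] array.
inline std::size_t offset_4d(const GridDims &g, std::size_t ia,
                             std::size_t ix, std::size_t iy, std::size_t iz) {
  return ((ia * g.nx + ix) * g.ny + iy) * g.nz + iz;
}
```

Under these assumptions both layouts address the same memory location, so the reshape changes only the indexing, not the data.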
Feature request to add ability to attach attributes to netcdf objects (files, dimensions, variables) for metadata. I can do this and submit a PR... This issue serves as a reminder...
Add Kokkos backend similar to CUDA, HIP, and SYCL using Kokkos C-style allocators and other functionality for parallel_for.
On current main branch, hash: d29e739f446cb9bcf3a12899cbabe754f471f58b

```bash
qsub -I -t 30 -n 1 -q florentia_debug
```

```bash
source jlse_gpu_O3.sh
make -j
make test
```

```bash
[ac.normanmr@florentia02:~/YAKL/unit/build/machines/jlse] >:O ./Streams/Streams
Running on...
```
The following code can fail on Sunspot with `(max_a, max_b) = (2, 2)`. It doesn't happen with `GATOR_DISABLE=1`. It also doesn't fail when `SYCL_PI_LEVEL_ZERO_USE_IMMEDIATE_COMMANDLISTS=1`. I am using the experimental runtime...
Using current main, hash: d706e2f44fb8f5651cb2f6f3748e6fd3261a70ae

```bash
qsub -I -t 120 -n 1 -q arcticus
```

```bash
source jlse_gpu_O3.sh
make -j
make test
```

```bash
[ac.normanmr@arcticus09:~/YAKL/unit/build/machines/jlse] >:O ./CArray/CArray
terminate called...
```
WIP. CTest didn't like " -Xsycl-target-backend \" -device 12.60.7 \" " even after escaping quotes.
Create inner level parallel reductions using CUB and hipCUB inside the `YAKL_reductions.h` header, and create simple `intrinsics` functions for `sum`, `maxval`, `minval`, and `product`. Whenever inner-level loop size exceeds the...
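For orientation, the four reductions named above can be written as serial reference versions, which may be handy for checking a parallel implementation against. This is only a host-side sketch using the standard library. The `serial_ref` namespace and the `std::vector`-based signatures are assumptions for illustration and do not reflect the CUB/hipCUB code or YAKL's actual interfaces.

```cpp
#include <algorithm>
#include <cassert>
#include <functional>
#include <numeric>
#include <vector>

// Serial reference reductions. Names mirror the Fortran-style intrinsics,
// but the namespace and signatures here are hypothetical.
namespace serial_ref {

inline double sum(const std::vector<double> &v) {
  return std::accumulate(v.begin(), v.end(), 0.0);
}

inline double product(const std::vector<double> &v) {
  return std::accumulate(v.begin(), v.end(), 1.0, std::multiplies<double>());
}

inline double maxval(const std::vector<double> &v) {
  assert(!v.empty());  // max of an empty range is undefined in this sketch
  return *std::max_element(v.begin(), v.end());
}

inline double minval(const std::vector<double> &v) {
  assert(!v.empty());  // min of an empty range is undefined in this sketch
  return *std::min_element(v.begin(), v.end());
}

}  // namespace serial_ref
```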
Should close #162. I've modified when the YAKL Array takes the mutex to be only when the reference count pointer isn't null. This makes temporary unowned views much cheaper on...
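The idea of locking only when a reference counter exists can be sketched roughly as follows. This is not YAKL's Array implementation: `SketchArray`, `refcount_mutex`, `lock_count`, and the two `locks_for_*` helpers are hypothetical names, and the lock counter exists only to make the behavior observable in the example.

```cpp
#include <cassert>
#include <cstddef>
#include <mutex>

// Hypothetical names throughout; this is not YAKL's actual Array class.
inline std::mutex &refcount_mutex() { static std::mutex m; return m; }
inline int &lock_count() { static int n = 0; return n; }  // instrumentation only

class SketchArray {
public:
  // Owning constructor: allocates zero-initialized data and a reference counter.
  explicit SketchArray(std::size_t n) : data_(new double[n]()), refcount_(new int(1)) {}
  // Non-owning constructor: wraps existing memory and keeps no counter.
  explicit SketchArray(double *ptr) : data_(ptr), refcount_(nullptr) {}

  SketchArray(const SketchArray &rhs) : data_(rhs.data_), refcount_(rhs.refcount_) {
    if (refcount_ != nullptr) {  // unowned views skip the mutex entirely
      std::lock_guard<std::mutex> guard(refcount_mutex());
      ++lock_count();
      ++(*refcount_);
    }
  }
  SketchArray &operator=(const SketchArray &) = delete;

  ~SketchArray() {
    if (refcount_ != nullptr) {  // nothing to release for unowned views
      std::lock_guard<std::mutex> guard(refcount_mutex());
      ++lock_count();
      if (--(*refcount_) == 0) { delete[] data_; delete refcount_; }
    }
  }

private:
  double *data_;
  int *refcount_;
};

// Lock acquisitions caused by copying and destroying an unowned view.
inline int locks_for_unowned_copy() {
  double buf[4] = {0.0, 0.0, 0.0, 0.0};
  const int before = lock_count();
  { SketchArray view(buf); SketchArray copy(view); }
  return lock_count() - before;
}

// Lock acquisitions caused by copying and destroying an owned array.
inline int locks_for_owned_copy() {
  const int before = lock_count();
  { SketchArray owned(4); SketchArray copy(owned); }
  return lock_count() - before;
}
```

In this sketch, copying and destroying an unowned view never touches the mutex, while the owned case locks once for the copy and once for each destructor.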