Eyal Rozenberg
Eyal Rozenberg
Now that we have (half-)decent unit test coverage (see #24), we should introduce code coverage checks to see how much remains uncovered. This requires: * Getting a coverage-related CMake module...
Beginning with CUDA 10 (or maybe 9?) we have three kinds of atomics: * `atomicFoo()` - atomic w.r.t. other memory access from within the same GPU. * `atomicFoo_system()` - atomic...
An index [is](https://www.merriam-webster.com/dictionary/index) either a "list of items" arranged in order, or "a number... used as an indicator or measure", or "a number ... associated with another to indicate... position...
Shuffles are warp collaboration primitives. They should be in namespace `kat::collaboration::warp` - and declared in the warp collaboration primitives header - if only perhaps through an inclusion of another file.
We've adapted a tuple implementation; however, that tuple doesn't know that there's "another tuple" it needs to be compatible with... we _do_ know. So, let's try and make `kat::tuple` usable...
The programming guide [says](https://docs.nvidia.com/cuda/archive/8.0/cuda-c-programming-guide/index.html): > **E.3.14.3. Rvalue references** > > By default, the CUDA compiler will implicitly consider `std::move` and `std::forward` function templates to have `__host__ __device__` execution space qualifiers,...
We have many functions returning lane ids or numbers-of-lanes. Mostly those use `unsigned`. But for better readability/clarity, I'm thinking of introducing something like: ``` using lane_id_t = unsigned; ``` within...
In a 2D or 3D block, the CUDA "thread index" - according to official documentation - is a 3D or 3D entity, while the "thread ID" is its linearization (where...
The test fixtures have improved, and become a bit more flexible and requiring less boilerplate, from one test suite to another. We should use the later ones - currently on...
We have grid-scope action in two forms - at grid stride and at block stride. The block stride action means each block acts on consecutive data. At block-scope - we...