nvidia-cuda-tutorial issues

4 results for nvidia-cuda-tutorial issues

Session 4 (Extending Numba) should mention `__hash__` and `__eq__` methods of types

Failing to implement these results in weird behaviour for parameterised types:

- `__hash__` is required for correct interning.
- `__eq__` is required to determine if casts are required.
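To illustrate the two points above without depending on Numba, here is a minimal pure-Python sketch. The classes `NaiveType` and `KeyedType` and the helpers `intern` and `needs_cast` are hypothetical names invented for this illustration and are not Numba's API; they only model the general idea that interning looks instances up by hash and equality, and that a cast check compares types for equality.

```python
# Hypothetical illustration only: none of these names are Numba's API.


class NaiveType:
    """Parameterised type relying on Python's default identity-based hash/eq."""

    def __init__(self, dtype, ndim):
        self.dtype = dtype
        self.ndim = ndim


class KeyedType:
    """Parameterised type whose hash and equality derive from its parameters."""

    def __init__(self, dtype, ndim):
        self.dtype = dtype
        self.ndim = ndim

    @property
    def key(self):
        # The parameters that make two instances "the same type".
        return (self.dtype, self.ndim)

    def __hash__(self):
        return hash(self.key)

    def __eq__(self, other):
        return type(other) is type(self) and self.key == other.key


def intern(cache, instance):
    """Return the canonical instance equal to `instance`, registering it if new."""
    return cache.setdefault(instance, instance)


def needs_cast(from_type, to_type):
    """In this toy model, a cast is needed only when the types compare unequal."""
    return from_type != to_type


naive_cache, keyed_cache = {}, {}
naive_a = intern(naive_cache, NaiveType("float32", 2))
naive_b = intern(naive_cache, NaiveType("float32", 2))
keyed_a = intern(keyed_cache, KeyedType("float32", 2))
keyed_b = intern(keyed_cache, KeyedType("float32", 2))

# Without __hash__/__eq__, identical parameters yield two distinct cache entries.
naive_interned = naive_a is naive_b
# With both methods defined, the second lookup returns the first instance.
keyed_interned = keyed_a is keyed_b
```

In this sketch the naive type is never deduplicated and `needs_cast(naive_a, naive_b)` reports a cast between two types with identical parameters, which is the kind of surprising behaviour the issue describes.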

gmarkall

Replace use of `definition` and `definitions` with `overloads`

Some parts of the course use kernel definitions from the `definition` and `definitions` properties. These properties are deprecated, and should be replaced with the use of `overloads` instead.

gmarkall

Add section on preventing widening integer indices

The section on the widening of integer indices produced in a loop over a `range` seems to be accidentally missing - it should be just before the "Limiting register usage"...

gmarkall

Add section on Grid Groups and Grid sync

[Grid groups and grid sync](https://numba.readthedocs.io/en/latest/cuda/cooperative_groups.html) were added in Numba 0.53.1. A short section on using these to implement a global barrier would be good, perhaps based around the example kernel...

gmarkall
