dePaul Miller

Results: 2 issues by dePaul Miller

Programmatic Dependent Launch (PDL) enables kernels within the same CUDA stream to overlap while programmatically resolving inter-kernel dependencies. This allows consecutive kernels to overlap their ramp-down and ramp-up periods, efficiently...

Ports over some of the LaTeX printing functionality from C++ and adds an example.

inactive-30d