Daniel Galvez

Results 42 issues of Daniel Galvez

Now we conditionally xfail only when a cuda driver version less than 12.6 is installed. CUDA 12.6 fixes this issue. Before it, cooperative kernels could not be used within the...

core
stale
ASR
Run CICD

Previously, several small GPU->CPU copies were used, which caused excess latency linearly proportional to the batch size. For small copies, it is much more efficient to do a single memory...

stale
ASR
Run CICD