Bend
Bend copied to clipboard
Bitonic Sort example fails with GPU kernel error.
Description
When I run the compiled CUDA bitonic sorter example (linked in the README) I get this error:
Failed to launch kernels (error code an illegal memory access was encountered)!
To Reproduce
Steps to reproduce the behavior:
-
bend gen-cu sorter.bend > sorter.cu
-
nvcc sorter.cu -o sorter
-
prime-run ./sorter
(Launches it on the GPU for Arch Linux.) - Error recieved.
Expected behavior
The program runs on the GPU.
Desktop (please complete the following information):
- OS: Linux (Arch 6.9.1-arch1-1)
- CPU: Intel i7-11800H
- GPU: RTX 3050 Ti Mobile
- GPU Driver: Nvidia open kernel modules v550.78
- CUDA release 12.4, V12.4.131
Additional context
The program runs using the C codegen backend, but with the CUDA backend, it seems to fail regardless of what I do. If anyone is curious about the prime-run
command, it's really just a script that forces the dGPU to handle a task - nothing fancy.