RuntimeError: CUDA error: no kernel image is available for execution on the device
I get this error:
RuntimeError: CUDA error: no kernel image is available for execution on the device
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1
Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.
These are my installed package versions (from conda list):
cuda-toolkit    12.1.0    0         nvidia
torch           2.4.0     pypi_0    pypi
The error occurs when I run model.generate. I have tried several CUDA versions (12.1, 12.4, 12.6), but none of them works. Is it just that my GPU (Tesla T4) is not compatible with the AWQ library?
I get the same error on a Tesla V100, but it does not appear on an NVIDIA A100. It seems to be specific to older devices.
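This error generally means the compiled CUDA binary contains no kernel built for the GPU's compute capability (T4 is 7.5, V100 is 7.0, A100 is 8.0), which would fit the pattern above. A minimal sketch for checking this follows. The helper `has_kernel_for` is a hypothetical name written for this example, not part of any library. `torch.cuda.get_device_capability` and `torch.cuda.get_arch_list` are real PyTorch calls, but they only describe the PyTorch build itself. The AWQ kernels ship as a separate compiled extension, so whether that extension was built for the same architectures is an assumption you would need to verify separately.

```python
def has_kernel_for(capability, arch_list):
    """Return True if arch_list holds a kernel usable on the given device.

    capability: (major, minor) tuple, e.g. (7, 5) for a Tesla T4.
    arch_list:  strings such as "sm_75" (compiled binary) or
                "compute_80" (PTX that the driver can JIT-compile).
    """
    major, minor = capability
    for arch in arch_list:
        kind, _, num = arch.partition("_")
        # Skip entries this sketch does not understand (e.g. "sm_90a").
        if not num.isdigit() or len(num) < 2:
            continue
        a_major, a_minor = int(num[:-1]), int(num[-1])
        # A compiled binary runs on the same major version with an
        # equal or higher minor version.
        if kind == "sm" and a_major == major and a_minor <= minor:
            return True
        # PTX can be JIT-compiled for any equal or newer capability.
        if kind == "compute" and (a_major, a_minor) <= (major, minor):
            return True
    return False


if __name__ == "__main__":
    try:
        import torch
    except ImportError:
        torch = None

    if torch is not None and torch.cuda.is_available():
        cap = torch.cuda.get_device_capability(0)
        archs = torch.cuda.get_arch_list()
        print("device capability:", cap)
        print("PyTorch built for:", archs)
        print("PyTorch has a usable kernel:", has_kernel_for(cap, archs))
    else:
        print("PyTorch with a visible CUDA device is required for the live check.")
```

If PyTorch reports a usable kernel for your device but the error still appears inside model.generate, the missing kernel is more likely in the separately compiled extension than in PyTorch. In that case changing the CUDA toolkit version alone would not help, and the extension would need a build that targets your GPU's architecture.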