exo icon indicating copy to clipboard operation
exo copied to clipboard

Error processing prompt (see logs with DEBUG>=2): module load failed with status code 222: CUDA_ERROR_UNSUPPORTED_PTX_VERSION

Open Pentar0o opened this issue 1 year ago • 1 comments

I have this error Error processing prompt (see logs with DEBUG>=2): module load failed with status code 222: CUDA_ERROR_UNSUPPORTED_PTX_VERSION : trying Llama 3.1 70b on a server with Cuda 12.6 / Nvidia driver 535.183.06, with 3 Tesla T4 Gpu, any help appreciated.

Pentar0o avatar Oct 09 '24 16:10 Pentar0o

I had this issue but when I scrolled up in the logs it told me that ptxas executable was not found. I had to add my cuda bin to my shell path. Also I had to make sure that my nvidia driver version and cuda toolkit versions matched (as shown by running nvidia-smi and nvcc --version). Once I updated my path, updated my drivers, and restarted the machine, I got past the error

pickettd avatar Oct 18 '24 21:10 pickettd