llama-chat
llama-chat copied to clipboard
CUDA error: an illegal memory access was encountered
Seeing this error in the console:
{
"detail": "CUDA error: an illegal memory access was encountered\\nCUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.\\nFor debugging consider passing CUDA_LAUNCH_BLOCKING=1.\\nCompile with `TORCH_USE_CUDA_DSA` to enable device-side assertions."
}
Seems to always happen on the second message. First chat interaction works, then the second fails.
@zeke Is this still happening for you?