nerfren
RuntimeError: CUDA error: CUBLAS_STATUS_INVALID_VALUE when calling `cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)`
Hi, when I run the script 'nerfren/scripts/nerfren/train_mirror.sh' for the training phase, I get the error 'RuntimeError: CUDA error: CUBLAS_STATUS_INVALID_VALUE when calling `cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)`'.
What causes this error? Are the dimensions of torch.nn.Linear correct? Has anyone encountered this problem, and how can it be solved?
This is probably a torch/cuda version mismatch. Which PyTorch version are you using? How did you install PyTorch?
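A quick way to answer these questions is to print what the installed PyTorch build reports about itself. The sketch below is only an illustration: `torch_env_report` is a hypothetical helper name, not part of nerfren, and it assumes nothing beyond a standard PyTorch install (it returns empty fields if `torch` is missing).

```python
import importlib.util


def torch_env_report():
    """Collect the version details relevant to a PyTorch/CUDA mismatch.

    Hypothetical helper for illustration; not part of nerfren.
    """
    report = {
        "torch": None,               # e.g. the installed PyTorch version string
        "built_for_cuda": None,      # CUDA version the wheel was built against
        "cuda_available": False,     # whether PyTorch can see a usable GPU
        "gpu": None,                 # name of device 0, if any
        "compute_capability": None,  # (major, minor) of device 0, if any
    }
    if importlib.util.find_spec("torch") is None:
        return report  # PyTorch is not installed in this environment

    import torch

    report["torch"] = torch.__version__
    report["built_for_cuda"] = torch.version.cuda  # None for CPU-only builds
    report["cuda_available"] = torch.cuda.is_available()
    if report["cuda_available"]:
        report["gpu"] = torch.cuda.get_device_name(0)
        report["compute_capability"] = torch.cuda.get_device_capability(0)
    return report


for key, value in torch_env_report().items():
    print(f"{key}: {value}")
```

Note that `built_for_cuda` is the CUDA version bundled with the pip wheel, which can differ from the system-wide toolkit reported by `nvcc --version`.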
The installed CUDA version is 11.6 and the PyTorch version is 1.13.0. I installed PyTorch with the official command 'pip3 install torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/cu116'.
I haven't tested on newer PyTorch versions. Could you please try installing an earlier version? For example, v1.8:
pip3 install torch==1.8.2 torchvision==0.9.2 torchaudio==0.8.2 --extra-index-url https://download.pytorch.org/whl/lts/1.8/cu111
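After reinstalling, it may help to confirm that a plain `torch.nn.Linear` forward pass works on the GPU before rerunning the training script. The following is a minimal sketch under that assumption; `check_linear_on_gpu` and its default sizes are hypothetical and unrelated to the layers nerfren actually uses.

```python
import importlib.util


def check_linear_on_gpu(in_features=8, out_features=4, batch=2):
    """Run a tiny Linear layer on CPU and GPU and compare the results.

    Hypothetical sanity check; sizes are arbitrary and not taken from nerfren.
    """
    if importlib.util.find_spec("torch") is None:
        return "torch not installed"

    import torch

    if not torch.cuda.is_available():
        return "CUDA not available"

    layer = torch.nn.Linear(in_features, out_features)
    x = torch.randn(batch, in_features)
    expected = layer(x).detach()  # CPU reference result

    try:
        # On CUDA the forward pass calls into cuBLAS; copying the result
        # back with .cpu() forces any asynchronous error to surface here.
        actual = layer.cuda()(x.cuda()).detach().cpu()
    except RuntimeError as err:
        return f"GPU matmul failed: {err}"

    # Loose tolerance, since some GPU/PyTorch combinations use TF32 by default.
    return "ok" if torch.allclose(expected, actual, atol=1e-2) else "mismatch"


print(check_linear_on_gpu())
```

If this small check already fails with the same cuBLAS error, the problem most likely lies in the PyTorch/CUDA installation rather than in the layer dimensions of the model.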
OK, thanks! I'll give it a try later.