DeepSpeedExamples
[BUG] Step1 RuntimeError: CUDA error: CUBLAS_STATUS_ALLOC_FAILED when calling `cublasCreate(handle)`
bash training_scripts/single_node/run_1.3b_lora.sh
Traceback (most recent call last):
File "main.py", line 328, in cublasCreate(handle)
@qinqinqaq, this looks like an OOM. Can you share your GPU memory size?
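One way to answer the memory question is a short script like the one below. It is only a sketch and assumes PyTorch is installed in the same environment used for training. The helper names `format_gib` and `gpu_memory_report` are made up for illustration and are not part of DeepSpeed or DeepSpeedExamples. It prints free and total memory per visible GPU, which helps tell whether `cublasCreate` failed because the device was already nearly full.

```python
def format_gib(num_bytes: int) -> str:
    """Render a byte count as GiB with two decimals."""
    return f"{num_bytes / 1024**3:.2f} GiB"


def gpu_memory_report() -> list:
    """Return one line per visible CUDA device (hypothetical helper).

    Falls back to a single explanatory line when PyTorch or CUDA
    is not available, so the script never raises on a CPU-only box.
    """
    try:
        import torch  # assumption: same environment as the training run
    except ImportError:
        return ["PyTorch is not installed in this environment"]

    if not torch.cuda.is_available():
        return ["No CUDA device is visible to PyTorch"]

    lines = []
    for idx in range(torch.cuda.device_count()):
        # mem_get_info returns (free_bytes, total_bytes) for the device
        free_b, total_b = torch.cuda.mem_get_info(idx)
        name = torch.cuda.get_device_name(idx)
        lines.append(
            f"GPU {idx} ({name}): {format_gib(free_b)} free of {format_gib(total_b)}"
        )
    return lines


if __name__ == "__main__":
    print("\n".join(gpu_memory_report()))
```

Running this right before launching the training script shows whether another process is already holding most of the GPU memory; `nvidia-smi` reports the same totals from the command line.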