EmerJK
I'd love to be able to load multiple LoRAs from the command line as well.
Same problem here. I'm on Ubuntu 20.04 with NVIDIA driver version 530.30.02, CUDA version 12.1, and an M40 GPU. As a possible solution, I've tried totally wiping the NVIDIA drivers...
> Just curious, does it still crash without `--lora-base`?

For me at least, yep, I still get the crash if I don't use `--lora-base`.
Just did a fresh pull, `make clean`, and `LLAMA_CUBLAS=1 make`. No change with the crash, I'm afraid. But thanks to everyone trying to figure it out!
> I looked into the issue and quite frankly I don't think it's worth the effort to fix. Currently the CUDA code runs everything as f32 by default and it...