RWKV-LM-LoRA
RWKV-LM-LoRA copied to clipboard
PSA: Issue with Multi-GPU & CUDA 12.0
Currently with RWKV and DeepSpeed, there seems to be an issue where it "hangs" when activating DeepSpeed with bf16
Specifically around this line
Currently this is tested to be resolved in Cuda 12.2