tofu
tofu copied to clipboard
Finetuning with LORA causes DeepSpeed error
When finetuning with LORA, the following error is produced:
RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn
This is mentioned in this issue in the DeepSpeed library.
Can be fixed with one line proposed in that issues comments.