tofu Finetuning with LORA causes DeepSpeed error

Open mikeFore4 opened this issue 10 months ago • 0 comments

When finetuning with LoRA, the following error is raised: RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn

The same error is reported in an issue on the DeepSpeed repository.

It can be fixed with the one-line change proposed in that issue's comments.
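The issue does not quote the one-line fix, so the following is only a guess at the mechanism. It is a minimal sketch in plain PyTorch, assuming the error comes from reentrant gradient checkpointing over frozen base weights: the input to the checkpointed block does not require grad, so the loss ends up with no grad_fn. `TinyModel`, `adapter`, and `try_backward` are hypothetical names made up for this illustration. The one-line workaround shown is a forward hook that marks the embedding output as requiring grad; as far as I know, Hugging Face transformers exposes the same idea as `model.enable_input_require_grads()`, but whether that is the line proposed in the DeepSpeed issue is an assumption.

```python
import torch
import torch.nn as nn
from torch.utils.checkpoint import checkpoint


class TinyModel(nn.Module):
    """Hypothetical stand-in: frozen base layers plus a trainable adapter."""

    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(10, 4)  # frozen base weights
        self.block = nn.Linear(4, 4)      # frozen base weights
        self.adapter = nn.Linear(4, 4)    # stand-in for trainable LoRA weights
        for p in list(self.embed.parameters()) + list(self.block.parameters()):
            p.requires_grad_(False)

    def _inner(self, h):
        return self.block(h) + self.adapter(h)

    def forward(self, ids):
        h = self.embed(ids)
        # Reentrant checkpointing only tracks gradients if an input requires grad.
        h = checkpoint(self._inner, h, use_reentrant=True)
        return h.sum()


def try_backward(model, ids):
    """Run forward and backward; return the error message, or None on success."""
    model.zero_grad()
    try:
        model(ids).backward()
        return None
    except RuntimeError as err:
        return str(err)


model = TinyModel()
ids = torch.tensor([[1, 2, 3]])

# Without the workaround, backward fails with the error from this issue.
error_before = try_backward(model, ids)

# Assumed one-line workaround: make the embedding output require grad.
model.embed.register_forward_hook(lambda mod, inp, out: out.requires_grad_(True))

error_after = try_backward(model, ids)
print("before:", error_before)
print("after:", error_after)
```

If the hook is the right fix for a given setup, the adapter parameters should receive gradients after the second call while the frozen base weights stay untouched.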

Apr 03 '24 18:04 mikeFore4