TIGERScore icon indicating copy to clipboard operation
TIGERScore copied to clipboard

DeepSpeed error with LoRa

Open g-batalhao-a opened this issue 3 months ago • 0 comments

Hi,

I was trying to use the fine-tune script with a quantized Mistral and I get the following error: ValueError: DeepSpeed Zero-3 is not compatible with "low_cpu_mem_usage=True" or with passing a "device_map".

After looking at this issue, I removed this line but then the error that appears is the following: RuntimeError: Only Tensors of floating point and complex dtype can require gradients

Could I get help solving this issue? Thanks

g-batalhao-a avatar Apr 01 '24 12:04 g-batalhao-a