llama2-lora-fine-tuning MultiGPU+Deepspeed+4bitQlora

Open yaoching0 opened this issue 2 years ago • 1 comment

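Many thanks to the author. Here is my situation: when I use 8 GPUs + DeepSpeed ZeRO-3 + 4-bit QLoRA, I get the same error as in https://github.com/microsoft/DeepSpeed/issues/3775: RuntimeError: expected there to be only one unique element in <generator object Init._convert_to_deepspeed_param..all_gather_coalesced.. at 0x7f7019a30890>

In that thread, the author tried some modifications but still hit the error, so I suspect DeepSpeed may not currently support 4-bit QLoRA. However, if I run 4-bit QLoRA + DeepSpeed on a single GPU, there is no error; as soon as I use multiple GPUs, the error above appears. I see that you provide 4-bit quantized fine-tuning, but the default parameters actually use 8-bit. Have you ever successfully fine-tuned with two GPUs + DeepSpeed + 4-bit QLoRA?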

Aug 31 '23 14:08 yaoching0

With https://github.com/git-cloner/Llama2-chinese, 8-bit fine-tuning on two GPUs has worked. I have not managed to get 4-bit working.

Sep 01 '23 05:09 little51
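
For readers who want to reproduce or vary the setup described in the question, here is a minimal sketch that generates DeepSpeed config files differing only in the ZeRO stage. The helper name `make_ds_config`, the output file names, and the batch-size values are placeholders for illustration and are not taken from this repository. Whether a stage other than 3 avoids the error with 4-bit QLoRA on multiple GPUs is not confirmed anywhere in this thread; the sketch only makes that comparison easy to run.

```python
import json


def make_ds_config(zero_stage: int) -> dict:
    """Build a minimal DeepSpeed config dict for the given ZeRO stage.

    Hypothetical helper: only standard DeepSpeed config keys are used,
    but the concrete values are placeholders, not the repository's settings.
    """
    if zero_stage not in (0, 1, 2, 3):
        raise ValueError(f"ZeRO stage must be 0-3, got {zero_stage}")
    return {
        "train_micro_batch_size_per_gpu": 1,  # placeholder value
        "gradient_accumulation_steps": 1,     # placeholder value
        "bf16": {"enabled": True},            # assumes bf16-capable GPUs
        "zero_optimization": {"stage": zero_stage},
    }


if __name__ == "__main__":
    # Stage 3 is the setup reported in the question; stage 2 is written
    # alongside it purely as a variant to compare against.
    for stage in (3, 2):
        path = f"ds_zero{stage}.json"  # placeholder file name
        with open(path, "w") as f:
            json.dump(make_ds_config(stage), f, indent=2)
        print(f"wrote {path}")
```

The resulting JSON file would then be passed to the training script through whatever DeepSpeed config option that script exposes.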
