LISA icon indicating copy to clipboard operation
LISA copied to clipboard

Which file is the lora weight?

Open hxx-who opened this issue 1 year ago • 3 comments

Hi, Great work! I was wondering what is the difference between different model files, like bf16_zero_pp_rank_1_mp_rank_00_optim_states.pt, mp_rank_00_model_states.pt, pytorch_model.bin which is obtained after the zero_to_fp32.py, and the file obtained after merge_lora_weights_and_save_hf_model.py. Which of them is the lora weight? or they are just different formats of lora weight? Thanks!

hxx-who avatar Jan 17 '24 10:01 hxx-who

and which of them is the full model weight?

hxx-who avatar Jan 17 '24 10:01 hxx-who

+1

GaoXiaoshan avatar May 07 '24 07:05 GaoXiaoshan

I have met the same problem. If you train with deepspeed, save the model and run python zero_to_fp32.py, you will get the pytorch_model.bin. The pytorch_model.bin is the full model weight which means there is no need to merge.

GaoXiaoshan avatar May 08 '24 09:05 GaoXiaoshan