
Add `save_base_model=True` attribute to `save_pretrained` method

Open philschmid opened this issue 2 years ago • 4 comments

Currently, when calling `model.save_pretrained`, only the adapter weights are stored, not the frozen base model. Would it make sense to add a kwargs parameter, e.g. `save_base_model=True`, to also save the base model weights for easier offline usage, e.g. for deployment?
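
A minimal, dependency-free sketch of how such a flag might behave (the function, file names, and `save_base_model` parameter are hypothetical illustrations of the proposal, not PEFT's actual API; JSON stands in for `torch.save`):

```python
import json
import os
import tempfile

def save_pretrained(adapter_state, base_state, save_directory, save_base_model=False):
    """Hypothetical sketch of the proposed flag: always write the adapter
    weights; optionally also write the frozen base model for offline use."""
    os.makedirs(save_directory, exist_ok=True)
    # Adapter weights are always saved (current PEFT behavior).
    with open(os.path.join(save_directory, "adapter_model.json"), "w") as f:
        json.dump(adapter_state, f)
    # Proposed addition: also persist the frozen base model on request.
    if save_base_model:
        with open(os.path.join(save_directory, "base_model.json"), "w") as f:
            json.dump(base_state, f)

out_dir = tempfile.mkdtemp()
save_pretrained({"lora_A": [0.1]}, {"embed": [0.2]}, out_dir, save_base_model=True)
saved = sorted(os.listdir(out_dir))
```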

philschmid avatar Mar 16 '23 19:03 philschmid

I'm using diffusers, and it would be very helpful to save the base model and the LoRA model together. But this is not supported yet, as `LoraModel` doesn't have `save_pretrained`.

haofanwang avatar Mar 22 '23 13:03 haofanwang

@haofanwang before we have an integration, you can call `model.base_model.save_pretrained()` at the end of your training.
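
The call pattern of this workaround can be sketched with stand-in classes (everything below is a mock written to show which object saves what; the real objects come from peft and transformers):

```python
import os
import tempfile

class BaseModel:
    """Stand-in for the frozen transformers base model."""
    def save_pretrained(self, save_directory):
        os.makedirs(save_directory, exist_ok=True)
        with open(os.path.join(save_directory, "pytorch_model.bin"), "w") as f:
            f.write("base weights")

class PeftModel:
    """Stand-in for a PEFT-wrapped model: save_pretrained stores adapters only."""
    def __init__(self, base_model):
        self.base_model = base_model
    def save_pretrained(self, save_directory):
        os.makedirs(save_directory, exist_ok=True)
        with open(os.path.join(save_directory, "adapter_model.bin"), "w") as f:
            f.write("adapter weights")

out_dir = tempfile.mkdtemp()
model = PeftModel(BaseModel())
model.save_pretrained(out_dir)             # saves the adapter weights only
model.base_model.save_pretrained(out_dir)  # workaround: saves the base weights too
files = sorted(os.listdir(out_dir))
```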

philschmid avatar Mar 22 '23 13:03 philschmid

Thanks for the quick response. Let's take `train_dreambooth.py` as an example.

The LoRA weights are saved via

```python
accelerator.save(state_dict, os.path.join(args.output_dir, f"{args.instance_prompt}_lora.pt"))
with open(os.path.join(args.output_dir, f"{args.instance_prompt}_lora_config.json"), "w") as f:
    json.dump(lora_config, f)
```

How should I save the other modules? Can I just merge the LoRA weights into the base model, so that I can load the model in one line as `pipeline = DiffusionPipeline.from_pretrained(base_path, torch_dtype=torch.float16)`? @philschmid
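
Merging is possible in principle because a LoRA update is additive: the merged weight is `W + (alpha / r) * (B @ A)`, after which the adapter is no longer needed at inference time. A pure-Python sketch of that arithmetic (illustrative only; real implementations operate on torch tensors):

```python
def merge_lora(W, A, B, alpha, r):
    """Fold a LoRA update into the base weight matrix:
    W_merged = W + (alpha / r) * (B @ A), where A is (r x cols)
    and B is (rows x r). Pure-Python illustration."""
    scaling = alpha / r
    rows, cols = len(W), len(W[0])
    return [[W[i][j] + scaling * sum(B[i][t] * A[t][j] for t in range(r))
             for j in range(cols)]
            for i in range(rows)]

# Rank-1 example: B is (2 x 1), A is (1 x 2).
W = [[0.0, 0.0], [0.0, 0.0]]
B = [[1.0], [2.0]]
A = [[3.0, 4.0]]
merged = merge_lora(W, A, B, alpha=1, r=1)
```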

haofanwang avatar Mar 22 '23 14:03 haofanwang

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

github-actions[bot] avatar Apr 16 '23 15:04 github-actions[bot]

I need exactly this feature, but from this thread it's not entirely clear to me how to save the full, fine-tuned model (base model + adapter model). Here is my corresponding Colab notebook (it saves only the adapter weights).

matthiasdroth avatar Jun 05 '23 09:06 matthiasdroth