Training ControlNet parameters instead of finetuning

Open MEZTech-LLC opened this issue 2 years ago • 1 comments

Firstly, thanks for this excellent work.

After reading the paper and experimenting with the code, I thought I'd drop a suggestion. Rather than altering a pretrained LDM model (Stable Diffusion) directly, and fine-tuning weight to account for the additional camera pose and domain, it might be beneficial to instead tune a separate set of UNet parameters (as is done in the ControlNet architecture (https://github.com/lllyasviel/ControlNet) to prevent deterioration of unconditioned model output.

Apologies for making this suggestion in a Github issue - but I didn't see contact info on your site/paper.

Dec 19 '23 00:12 MEZTech-LLC

Hello. Thanks for your suggestions! Indeed, we don't conduct such experiments. I think your suggestion is worth trying. However, due to limited resources, we don't plan to do this in the near future. We welcome cooperations on this topic.

Dec 19 '23 08:12 flamehaze1115