Could you provide a model and parameters to reproduce the davidwu effect in the paper?

Open XiaoluDing opened this issue 1 year ago • 3 comments

Amazing effect!! Could you provide a model and parameters to reproduce the davidwu effect in the paper, please?

image

XiaoluDing avatar May 25 '23 09:05 XiaoluDing

I didn't save the weights of that fine-tuned model, but you can fine-tune the model yourself. Try a slightly larger batch size and more iteration steps to ensure good details, then tune the hyper-parameters of fusion sampling at test time.

drboog avatar May 25 '23 14:05 drboog

Which hyper-parameters of fusion sampling are important? I tested my own picture with bs=2 and iteration=100; the other hyper-parameters are the same as in your test code. But the image is not so good.

ywdong avatar May 26 '23 03:05 ywdong

First of all, you need to make sure that your details are well preserved after fine-tuning, i.e. the prompt "a photo of S*" generates the expected details. Otherwise, you may want to change the batch size, learning rate, and number of iterations. With a batch size of 2, I think you should try more iterations.

After fine-tuning, please read the comments in the example: different hyper-parameters control different things. Basically, tune cfg, ref_cfg and refine_emb_scale. How to tune them depends on what your current generation looks like. You may want to use a loop to generate multiple images for each hyper-parameter combination, to better understand its effect and tune it.

drboog avatar May 26 '23 05:05 drboog
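
The loop suggested above could look roughly like the sketch below. It is only an illustration, not code from the ProFusion repository: the names cfg, ref_cfg and refine_emb_scale come from the comment above, but `param_grid`, `label`, `sweep` and `fake_generate` are hypothetical helpers, and the numeric values are arbitrary placeholders rather than recommended settings. Replace `fake_generate` with a wrapper around the sampling call from the repository's example.

```python
import itertools
from typing import Any, Callable, Dict, Iterable, List, Tuple

Params = Dict[str, float]


def param_grid(cfgs: Iterable[float],
               ref_cfgs: Iterable[float],
               refine_emb_scales: Iterable[float]) -> List[Params]:
    """Build every combination of the three hyper-parameters."""
    return [
        {"cfg": c, "ref_cfg": r, "refine_emb_scale": s}
        for c, r, s in itertools.product(cfgs, ref_cfgs, refine_emb_scales)
    ]


def label(params: Params, seed: int) -> str:
    """Readable tag for one run, usable as a file name stem."""
    return "cfg{cfg}_ref{ref_cfg}_emb{refine_emb_scale}_seed{seed}".format(
        seed=seed, **params)


def sweep(generate: Callable[..., Any],
          grid: Iterable[Params],
          seeds: Iterable[int]) -> List[Tuple[str, Any]]:
    """Generate one image per (combination, seed) pair."""
    seeds = list(seeds)  # reused for every combination
    results = []
    for params in grid:
        for seed in seeds:
            image = generate(seed=seed, **params)
            results.append((label(params, seed), image))
    return results


def fake_generate(cfg: float, ref_cfg: float,
                  refine_emb_scale: float, seed: int) -> Tuple[float, ...]:
    """Hypothetical stand-in: swap in the real sampling call here."""
    return (cfg, ref_cfg, refine_emb_scale, seed)


if __name__ == "__main__":
    # Placeholder values only; choose ranges based on your own results.
    grid = param_grid([5.0, 7.0], [3.0], [1.0, 1.5])
    for name, _image in sweep(fake_generate, grid, seeds=[0, 1]):
        print(name)  # e.g. save the image as f"{name}.png" instead
```

Using several seeds per combination helps separate the effect of a hyper-parameter from sampling noise, and saving each image under its label makes side-by-side comparison easier.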