style transfer not wanted

Open tensorflowi opened this issue 2 years ago • 5 comments

Hi, I trained ControlNet with depth on my own Blender dataset and found that the pose is really good, but the artificial style of my Blender images also heavily influences the resulting model (style transfer), which I do not want. The outputs should look realistic instead. Is there any way to improve that? Greetings, Matthes

tensorflowi avatar Apr 26 '23 11:04 tensorflowi

How do you generate the prompts?

lllyasviel avatar Apr 27 '23 05:04 lllyasviel

(Since I was too lazy to annotate,) I used the prompt "object in front of background" for every image in the training set (6000 images with various backgrounds). During inference, I do not mention any of those tokens in the prompt (in order not to tempt the model to transfer the style, I thought). Instead, I just use the actual thing that I want to create (like "car") as the prompt. The result is that the pose is perfect, but style transfer is an issue, as mentioned. Do you think proper prompts are the solution? And why?

tensorflowi avatar Apr 27 '23 06:04 tensorflowi

If the training prompts are better, then ControlNet won't learn to change the style of SD, because the style is already given in the training prompts.
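
To illustrate the suggestion above, here is a minimal sketch of how per-image training prompts that name the rendering style could be generated. It assumes a `prompt.json` file with one JSON object per line and the keys `source`, `target` and `prompt`, as in the ControlNet training tutorial dataset. The style tag, the file paths and the helper names (`build_caption`, `build_records`, `write_prompt_file`) are hypothetical and only serve the example.

```python
import json

# Hypothetical style tag describing how the training images look.
# The idea is that the prompt, not ControlNet, accounts for the style.
STYLE_TAG = "3d blender render, synthetic"


def build_caption(object_name, background, style_tag=STYLE_TAG):
    """Compose a caption that names both the content and the rendering style."""
    return f"{object_name} in front of {background}, {style_tag}"


def build_records(samples, style_tag=STYLE_TAG):
    """Turn (depth_path, image_path, object_name, background) tuples into
    prompt records with "source", "target" and "prompt" keys (assumed layout)."""
    records = []
    for depth_path, image_path, object_name, background in samples:
        records.append({
            "source": depth_path,   # conditioning image (depth map)
            "target": image_path,   # rendered Blender image
            "prompt": build_caption(object_name, background, style_tag),
        })
    return records


def write_prompt_file(records, path):
    """Write one JSON object per line."""
    with open(path, "w", encoding="utf-8") as f:
        for record in records:
            f.write(json.dumps(record) + "\n")


if __name__ == "__main__":
    # Hypothetical samples; real paths and labels would come from the dataset.
    samples = [
        ("source/0.png", "target/0.png", "car", "a city street"),
        ("source/1.png", "target/1.png", "chair", "a white wall"),
    ]
    write_prompt_file(build_records(samples), "prompt.json")
```

At inference time, one could then leave the style tag out of the prompt (for example "car, photo") so that the base model's realistic style is used. Whether this fully removes the style transfer would need to be checked by retraining.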

lllyasviel avatar Apr 28 '23 09:04 lllyasviel

OK, I'll try it, thank you.

tensorflowi avatar Apr 28 '23 11:04 tensorflowi

Hi @tensorflowi

Can you please share the inference script you are using?

Thanks in advance.

SuroshAhmadZobair avatar May 08 '23 16:05 SuroshAhmadZobair

if training prompts are better then cn wont learn to change style of sd because the style is already given in training prompts

Can I solve this issue by finetuning the text encoder?

qsun1 avatar Jun 06 '23 04:06 qsun1