guided-diffusion icon indicating copy to clipboard operation
guided-diffusion copied to clipboard

generated images are noisy

Open fido20160817 opened this issue 1 year ago • 8 comments

Hi, there is a work based guided-diffusion: https://github.com/WeilunWang/semantic-diffusion-model which implements a semantic diffusion model. But when I try to do sampling, the quality is unsatisfactory. I receive no response by asking its authors, so I have to go to this repository to find answers (reasons may be related to theories of diffusion model).

Here are some generated images on human faces: https://github.com/WeilunWang/semantic-diffusion-model/issues/14

and more on ade20k and cityscapes:

ddpm sampling ADE_val_00000320_use_ddim_False_2

ddim sampling ADE_val_00000320_use_ddim_True_3

ddpm sampling frankfurt_000000_000294_leftImg8bituse_ddim_False

ddim sampling frankfurt_000000_000294_leftImg8bit

Is there any way to improve the quality of ddim sampling. I tried to set 'guidance scale', but it doesn't help. I don't know what causes noise in the generated images. Any tips?

fido20160817 avatar Jun 06 '23 09:06 fido20160817

Hello! I want to ask how much the loss of the model you trained converges to? The images generated by my training are all noise, and I can't see the content of the images at all.

hxy-123-coder avatar Jun 12 '23 09:06 hxy-123-coder

Hello! Have you solved this problem? I am in the same situation.

kuanglongli avatar Jul 11 '23 07:07 kuanglongli

sorry,but I still struggle with it. if you have some solutions, please tell me

在 2023-07-11 15:36:47,"kuanglongli" @.***> 写道:

你好! 我想问一下你训练的模型的损失收敛到多少?我训练产生的图像都是噪音,我根本看不到图像的内容。

Hello! Have you solved this problem? I am also trained to generate pictures that are all noise.

— Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you commented.Message ID: @.***>

hxy-123-coder avatar Jul 12 '23 02:07 hxy-123-coder

Could you please solve this problem? I also have this problem.

UESTC-Med424-JYX avatar Oct 27 '23 13:10 UESTC-Med424-JYX

I solved it by reducing the learning rate to 2e-5

traverso-inspectifai avatar Oct 30 '23 06:10 traverso-inspectifai

I solved it by reducing the learning rate to 2e-5

Thank you, I think this is a good way.

UESTC-Med424-JYX avatar Oct 30 '23 08:10 UESTC-Med424-JYX

Hi, there is a work based guided-diffusion: https://github.com/WeilunWang/semantic-diffusion-model which implements a semantic diffusion model. But when I try to do sampling, the quality is unsatisfactory. I receive no response by asking its authors, so I have to go to this repository to find answers (reasons may be related to theories of diffusion model).

Here are some generated images on human faces: WeilunWang/semantic-diffusion-model#14

and more on ade20k and cityscapes:

ddpm sampling ADE_val_00000320_use_ddim_False_2

ddim sampling ADE_val_00000320_use_ddim_True_3

ddpm sampling frankfurt_000000_000294_leftImg8bituse_ddim_False

ddim sampling frankfurt_000000_000294_leftImg8bit

Is there any way to improve the quality of ddim sampling. I tried to set 'guidance scale', but it doesn't help. I don't know what causes noise in the generated images. Any tips?

Hi, could you help me please ,I also tried the code of https://github.com/WeilunWang/semantic-diffusion-model I want to train it for image-to-image translation and not for segmentation. How can I set the number of classes and this condition ? : --class_cond True , and thank you.

yug125lk avatar Nov 18 '23 15:11 yug125lk