latent-diffusion

Difficulty with Inference for High-Resolution Semantic Image Synthesis

Open vladherasymenko opened this issue 1 year ago • 2 comments

Hello!

First, thank you for sharing this outstanding work! I am working on a Semantic Image Synthesis task and have been using your LDM implementation to generate high-resolution images, as described in Section 4.3.2 of the paper. I have successfully trained a 256x256 model on my dataset, but I am having trouble generalizing it to higher resolutions. Specifically, I am unsure how to run inference at custom resolutions, particularly non-square formats such as 512x1024.

Could you kindly provide guidance on how to set up an inference configuration for achieving this? Your assistance would be immensely appreciated!

Thank you in advance!

Best regards,

Vlad

vladherasymenko, Jul 19 '23 13:07

I'd also be grateful if you could share some insights on how to train an LDM directly on custom resolutions, for example 256x512.

vladherasymenko, Jul 19 '23 14:07

same

elhamAm, Jul 26 '23 09:07