ComfyUI icon indicating copy to clipboard operation
ComfyUI copied to clipboard

DiffuseHigh for 2K+ resolution with any model

Open dathide opened this issue 1 year ago • 3 comments

Feature Idea

"DiffuseHigh enables the pre-trained text-to-image diffusion models (SDXL in this figure) to generate higher-resolution images than the originally trained resolution, e.g., 4×, 16×, without any training or fine-tuning." https://yhyun225.github.io/DiffuseHigh

Existing Solutions

No response

Other

No response

dathide avatar Sep 11 '24 19:09 dathide

@dathide if you're still looking, i made a version for ComfyUI. currently lightly tested/alpha status.

Repo link: https://github.com/blepping/comfyui_jankdiffusehigh

has a few possible improvements:

  • you can use any sampler/schedule for the reference or upscale iteration steps.
  • you can use an upscale model instead of just scaling the image.
  • possible to apply guidance via the latent rather than image, saving a VAE round trip (not sure it's better, it's definitely much faster though).
  • possible to apply the guidance during guidance steps at less than 100% strength, also can fade it out over time.
  • possible to use contrast adaptive sharpening rather than gaussian-blur based sharpening.

blepping avatar Oct 18 '24 16:10 blepping

Yes, I was definitely still looking. Thank you, I installed it and gave it a star. I've done a couple tests with good results besides some color issues that I was also having with other upscaling methods. Edit: I think the specific model I was using had some issues.

dathide avatar Oct 18 '24 16:10 dathide

thanks for the feedback! glad to hear it seems to be working for you. if you run into any issues/have questions/etc please feel free to open an issue or start a discussion in that repo.

blepping avatar Oct 21 '24 12:10 blepping