stable-diffusion-webui

[Feature Request]: Add Würstchen / Stable Cascade support

Open dome272 opened this issue 1 year ago • 15 comments

Is there an existing issue for this?

  • [X] I have searched the existing issues and checked the recent builds/commits

What would your feature do ?

We released Würstchen 2 days ago in Diffusers. The blog post can be found here: https://huggingface.co/blog/wuerstchen TL;DR: it is much faster than SDXL, requires less memory, and generates images at higher resolutions. The tradeoff? Fine details are hard for the model because of its f42 spatial compression of images. However, we are confident we can improve this a lot in future versions. The GitHub with code is here and the Diffusers documentation is here.
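To give a rough sense of what an f42 compression means for fine detail, here is a small arithmetic sketch. It assumes "f42" denotes a per-side downscaling factor of about 42, and it uses a factor of 8 for comparison as an assumption about SDXL-style latents; `latent_side` is a hypothetical helper, not part of Diffusers.

```python
def latent_side(image_side: int, factor: int) -> int:
    """Approximate latent side length for a square image.

    Assumption: the compression factor applies per spatial side and the
    result is floored. Real models may pad or round differently.
    """
    if image_side <= 0 or factor <= 0:
        raise ValueError("image_side and factor must be positive")
    return image_side // factor


def pixels_per_latent_cell(image_side: int, factor: int) -> float:
    """How many image pixels a single latent position has to represent."""
    side = latent_side(image_side, factor)
    return (image_side * image_side) / (side * side)


if __name__ == "__main__":
    for name, factor in [("f42 (Würstchen, assumed)", 42), ("f8 (assumed, for comparison)", 8)]:
        side = latent_side(1024, factor)
        cells = pixels_per_latent_cell(1024, factor)
        print(f"{name}: 1024x1024 image -> {side}x{side} latent, "
              f"~{cells:.0f} pixels per latent cell")
```

The fewer latent cells there are, the more pixels each one has to account for, which is one way to read the fine-detail tradeoff mentioned above.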

Proposed workflow

It could just be loaded like any other model in the GUI.

Additional information

No response

dome272 avatar Sep 15 '23 22:09 dome272

LGTM.

prp-e avatar Sep 16 '23 10:09 prp-e

SD.Next (an A1111 fork) just implemented it, but had to roll back support as there's an undisclosed problem with the latest diffusers package.

paniq avatar Sep 16 '23 12:09 paniq

I hope this gets added. But please try making your own fork until it's in the official version.

operationairstrike avatar Sep 18 '23 08:09 operationairstrike

In case you're wondering, basic support for Würstchen has been implemented in the DiffusionMagic project. There's almost no customization, but it does allow for some experimentation. I can't say that the images are better than SDXL's in terms of quality, but resource consumption is noticeably lower and processing is faster. Maybe custom models will improve image quality (hopefully) like they did for SD, but it's not clear when that will happen. The promise of faster training gives hope that it will be soon, but many models will forever be SD-only.

XZVB12 avatar Sep 19 '23 00:09 XZVB12

Yeah, but it doesn't seem to load on Google Colab.

operationairstrike avatar Sep 19 '23 09:09 operationairstrike

Try DiffusionMagic (diffusers-based diffusion workflows); it supports Würstchen (VRAM < 4GB). A Google Colab is also available. https://github.com/rupeshs/diffusionmagic

rupeshs avatar Sep 30 '23 15:09 rupeshs

For reference, the Stable Cascade project is here: https://github.com/Stability-AI/StableCascade

paniq avatar Feb 14 '24 13:02 paniq

There's a "temporary" extension for A1111 that will allow you to generate with Stable Cascade (it's an independent tab) here:

https://github.com/blue-pen5805/sdweb-easy-stablecascade-diffusers

Hopefully it's natively implemented sometime in the near future.

CCpt5 avatar Feb 15 '24 01:02 CCpt5

AssertionError("Torch not compiled with CUDA enabled"). It does not work with AMD. Even the directml argument won't make it work!
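That assertion is what PyTorch raises when code requests the "cuda" device on a build without CUDA support. Below is a minimal sketch of a more defensive device choice, assuming the failing code hard-codes "cuda" (not confirmed in this thread). `pick_device` is a hypothetical helper, and the DirectML branch assumes the separate torch-directml package is installed.

```python
def pick_device():
    """Return a device usable with `.to(...)`, without assuming CUDA.

    Hypothetical helper for illustration only; it is not part of A1111
    or of any extension mentioned in this thread.
    """
    try:
        import torch
    except ImportError:
        # No PyTorch at all: nothing better to report than CPU.
        return "cpu"

    # True on CUDA builds with a visible GPU (ROCm builds also report here).
    if torch.cuda.is_available():
        return "cuda"

    try:
        # Assumption: torch-directml is installed (Windows, AMD/Intel GPUs).
        import torch_directml
        return torch_directml.device()
    except ImportError:
        pass

    # Fall back to CPU instead of failing with the CUDA assertion.
    return "cpu"


if __name__ == "__main__":
    print(f"Selected device: {pick_device()}")
```

Code that then does `model.to(pick_device())` rather than `model.to("cuda")` would at least fall back to CPU instead of raising, though it would be slow.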

VeteranXT avatar Feb 19 '24 23:02 VeteranXT

Looking forward to this feature. This is a great update no less than SDXL.

Pevernow avatar Feb 20 '24 04:02 Pevernow

Looking forward to this feature. This is a great update no less than SDXL.

There is an extension for this. https://github.com/blue-pen5805/sdweb-easy-stablecascade-diffusers

But for me it does not work.

VeteranXT avatar Feb 20 '24 14:02 VeteranXT

Hurry! Every day without this, there are hordes of people flocking to ComfyUI. I don't want to; I have all my shit set up for A1111. I know the devs don't owe me anything, but as a user of years I think I can at least ask, right? I can be ignored.

placeboyue avatar Feb 29 '24 20:02 placeboyue

Is this added?

PierrunoYT avatar Mar 04 '24 14:03 PierrunoYT

What's the release schedule for this feature? What is the top blocker? It really hurts my projects when the competition can use models that are half a year newer. I will start familiarizing myself with ComfyUI next week if this bottleneck persists.

paboum avatar Mar 08 '24 12:03 paboum

I'm being forced to focus ParrotLUX support on ComfyUI (despite its terrible API) because Stable Cascade is not supported in A1111. Is there any way to make this feature happen?

QuintessentialForms avatar May 06 '24 08:05 QuintessentialForms