stable-diffusion.cpp icon indicating copy to clipboard operation
stable-diffusion.cpp copied to clipboard

Feature Requests

Open KintCark opened this issue 1 year ago • 9 comments

Can u add the ability to save the qauntized models to storage that way if we want to use it again we don't have to keep reconverting every time we start an generation.

KintCark avatar Aug 13 '24 16:08 KintCark

You should already be able to do that by setting the "-M, --mode" argument to "convert" to save the model as a quantized gguf file at the location specified with "-o, --output".

More information in the docs: docs/quantization_and_gguf.md

grauho avatar Aug 14 '24 15:08 grauho

It auto saves to the main folder but I tried qauntized aura flow buy it killed when saving output

KintCark avatar Aug 16 '24 23:08 KintCark

Interesting, please recompile with --config Debug, re-run with --verbose, and post the output

grauho avatar Aug 17 '24 10:08 grauho

Interesting, please recompile with --config Debug, re-run with --verbose, and post the output

I can't post output termux crashes immediately I got 7gb ram to spare but it loads the tensors but as soon as it trys to save output it crashes. Could someone else try converting aura flow 0.3 please I need q5 or q4 and q8

KintCark avatar Aug 18 '24 02:08 KintCark

I'm not familiar with aura flow but I wonder if it's based on a model type that sdcpp doesn't currently support. You could pipe the output to a file to avoid losing it when termux crashes, eg: "./sd etc etc &> foo.txt" to pipe both stdout and stderr to a file.

grauho avatar Aug 18 '24 03:08 grauho

I'm not familiar with aura flow but I wonder if it's based on a model type that sdcpp doesn't currently support. You could pipe the output to a file to avoid losing it when termux crashes, eg: "./sd etc etc &> foo.txt" to pipe both stdout and stderr to a file.

You're right that has to be what it is it's not supported so it don't work so I can quantize sd3 I bet. Will flux and aura flow be added soon?

KintCark avatar Aug 18 '24 04:08 KintCark

Gguf used less memory I can run flux q8_0 and t5xxlfp16 in comfyui on my phone so using gguf is better than safetensors

KintCark avatar Aug 18 '24 04:08 KintCark

Sd3 uses flow so how come flux and aura not auto support?

KintCark avatar Aug 18 '24 04:08 KintCark

Sd3 uses flow so how come flux and aura not auto support?

From my understanding of the similarities between Flux and SD3 is that while they share some components, they are not identical in their architecture. So, that's why despite SD3 being available in sdcpp there is no "auto support" for Flux.

grauho avatar Aug 18 '24 12:08 grauho