stable-diffusion-webui-forge icon indicating copy to clipboard operation
stable-diffusion-webui-forge copied to clipboard

⚡ Request for SVDQuant Checkpoints Support & Development of FLUX fn4 with Nunchaku Technology

Open sashaok123 opened this issue 1 year ago • 4 comments

Description:
This issue proposes adding support for SVDQuant checkpoints and developing a custom FLUX fn4 checkpoint using Nunchaku’s SVDQuant, a 4-bit quantization technique. SVDQuant reduces memory and increases efficiency by absorbing outliers in model weights via low-rank components, providing a robust solution for memory-constrained environments without degrading performance.

Justification:
SVDQuant uniquely addresses the challenge of memory and latency in large models:

  • Efficient Memory Management: By absorbing weight outliers, SVDQuant reduces memory usage by up to 3.5× in models like FLUX.1, achieving 3× speedup over weight-only quantized models.
  • Maintained Quality: Visual fidelity is preserved, and SVDQuant matches 16-bit model quality, even with 4-bit quantization, making it ideal for high-performance applications. image

Implementation Steps:

  1. Load SVDQuant Checkpoints: Integrate support for the loading of SVDQuant-formatted checkpoints.
  2. Develop FLUX fn4 Checkpoint: Train and validate FLUX fn4 using SVDQuant’s quantization, with benchmarks against non-quantized models to ensure quality retention.
  3. Optimize Performance: Use Nunchaku’s kernel fusion to minimize data movement and reduce latency by combining low-rank and low-bit processing.

References:

sashaok123 avatar Nov 09 '24 05:11 sashaok123

pls+1

angelika3377 avatar Jul 07 '25 16:07 angelika3377

Any news about this?

Iory1998 avatar Jul 17 '25 02:07 Iory1998

Please Please

ivancollider avatar Aug 01 '25 12:08 ivancollider

Since November last year and no maintainer has even read the issue.

darkanubis0100 avatar Aug 02 '25 09:08 darkanubis0100