nunchaku
nunchaku copied to clipboard
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
Hi there, anyone successfully setup nunchaku with the docker file of the comfyUI worker ( runpod ) ? https://github.com/blib-la/runpod-worker-comfy/tree/main#use-the-docker-image-on-runpod
Since Flex-Alpha is a de-distilled version of the Flux Schnell, maybe it could be supported. I think the original unet is pruned, but maybe many things remain unchanged. Since it...
批次超过1,采样器就报错。
use ComfyUI-ppm error ( ComfyUI-ppm a nice way to give flux negative),   GPU 0 (NVIDIA GeForce RTX 3060 Laptop GPU) 显存: 12287.375 MB 用户启用 CPU offload [2025-03-14 07:42:39.096]...
Hey there! I've got a super simple question after doing a ton of code searching. Which code can show that the activation's quantization is based on a per group size...
Running most workflows will report an error.
Hello, I am doing an experiment about lora-like layer's performance under different precision and hardware platform. I would appreciate it if you can provide a single fused kernel mentioned in...
https://huggingface.co/lodestones/Chroma
Can I quantify the desired model myself, e.g.majicflus
do you have any plan to add sana 1.5 model? or can i get a script for converting those model? https://huggingface.co/Efficient-Large-Model/SANA1.5_4.8B_1024px_diffusers