nunchaku

[Bug] t5 encoder problem

Open zwukong opened this issue 9 months ago • 8 comments

I use it in a FLUX ComfyUI workflow. The text encoder node seems to need three files: CLIP-L, T5, and svdq-t5. The memory usage is huge. It would be great if we could use only clip-L.safetensor and svdq-t5.safetensors (a 3 GB file).

zwukong avatar Mar 09 '25 10:03 zwukong

When using int4, the "Loading checkpoint shards" step, which combines these two models, is very slow.

Why not just use svdq-t5.safetensors?

xueqing0622 avatar Mar 10 '25 03:03 xueqing0622

Hi, we are optimizing the RAM usage for loading our T5 encoder. For now, you can also use the FP8 text encoder by choosing the corresponding file and setting the T5 precision to BF16.

lmxyy avatar Mar 10 '25 04:03 lmxyy

I hope T5 can be loaded using only svdq-t5.safetensors (2.99 GB).

xueqing0622 avatar Mar 10 '25 04:03 xueqing0622

Using the dual CLIP loader with a GGUF CLIP works! So that is another way to save VRAM.

xueqing0622 avatar Mar 12 '25 22:03 xueqing0622

yeah, it is compatible.

lmxyy avatar Mar 12 '25 22:03 lmxyy

Hello, since moving to version 0.2, the GGUF CLIP loader easily runs out of memory. Even after clearing VRAM, it runs out of memory again on the second queued run.

Using the nunchaku text encoder loader works fine. Please also support using only the 2.9 GB T5 model when using int4_model.

xueqing0622 avatar Apr 07 '25 00:04 xueqing0622

#320 should fix this issue.

lmxyy avatar Apr 27 '25 04:04 lmxyy

Why are these files still needed? model-00001-of-00002.safetensors and model-00002-of-00002.safetensors

tazztone avatar May 20 '25 19:05 tazztone

fixed in v0.3.0

lmxyy avatar Jun 03 '25 06:06 lmxyy

By the way, remember to use the v2 node of the nunchaku text encoder. 🤝

tazztone avatar Jun 03 '25 06:06 tazztone