nunchaku

[Bug] t5 encoder problem

Open zwukong opened this issue 9 months ago • 8 comments

I use it in a FLUX ComfyUI workflow. The text encoder node seems to need three files: clip-L, t5, and svdq-t5. The memory usage is huge. It would be great if we could use only clip-L.safetensors and svdq-t5.safetensors (a 3 GB file).

Mar 09 '25 10:03 zwukong

When using INT4, the "Loading checkpoint shards" step, which combines these two models, is very slow.

Why not just use svdq-t5.safetensors?

Mar 10 '25 03:03 xueqing0622

Hi, we are optimizing the RAM usage for loading our T5 encoder. For now, you can also use the FP8 text encoder by choosing the corresponding file and setting the T5 precision to BF16.

Mar 10 '25 04:03 lmxyy

I hope T5 can use only svdq-t5.safetensors (2.99 GB).

Mar 10 '25 04:03 xueqing0622

Using the dual CLIP GGUF loader works! So that is another way to save VRAM.

Mar 12 '25 22:03 xueqing0622

yeah, it is compatible.

Mar 12 '25 22:03 lmxyy

Hello, since I started using version 0.2, the GGUF CLIP loader easily runs out of memory: VRAM is cleaned, and then it runs out of memory on the second queue.

Using the nunchaku text encoder loader is OK. Please also support using only the 2.9 GB T5 model when using int4_model.

Apr 07 '25 00:04 xueqing0622

#320 should fix this issue.

Apr 27 '25 04:04 lmxyy

Why are these files still needed: model-00001-of-00002.safetensors and model-00002-of-00002.safetensors?

May 20 '25 19:05 tazztone

fixed in v0.3.0

Jun 03 '25 06:06 lmxyy

Btw, remember to use the v2 version of the nunchaku text encoder node. 🤝

Jun 03 '25 06:06 tazztone