Yuxuan Zhang

Results 479 comments of Yuxuan Zhang

显存不够,内存也不够。。 需要28G显存和内存运行BF16

啊,那个模型我们也不太了解,我们现在维护的是GLM-4 不过,正常来说,你这个实现方法应该是对的,同时建议把seed换成一样的,top p 和temperature改成 0

did you use pipe.enable_model_cpu_offload()? if not will use 36GB and will cause this problem if you want to use multi GPUS just remove pipe.enable_model_cpu_offload()

PYTORCH_CUDA_ALLOC_CONF=exandable_segments:True Try this. And what is your nvidia driver and GPU? V100 I guess. It should work( Although we only test in 3090 and A100)

Not now, we will try tied vae, we tested balance in 3 GPU in 20GB. you can try if you can run at 3 * 16G GPU

> > Not now, we will try tied vae, we tested balance in 3 GPU in 20GB. you can try if you can run at 3 * 16G GPU >...

did you follow with cli_demo.py code and using NVIDIA Ampere or higher GPU like 3090 4090

you can try reinstall the diffusers and accelerate libs from source, and A100 must work with using infersence/cli_demo.py in this github repos