
Can the RTX 2080 Ti with 11GB RAM perform inference tasks ?

pily1 opened this issue 1 year ago · 1 comment

System Info / 系統信息

I used the master branch, but inference with either the 2B or the 5B model failed with a CUDA out-of-memory error. My question is: can the RTX 2080 Ti with 11 GB of VRAM perform inference tasks?

Information / 问题信息

  • [X] The official example scripts / 官方的示例脚本
  • [ ] My own modified scripts / 我自己修改的脚本和任务

Reproduction / 复现过程

I used the command python cli_demo.py --prompt "A girl riding a bike." --model_path THUDM/CogVideoX-2b and got: torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 35.31 GiB

Expected behavior / 期待表现

Output an mp4 video.

pily1 avatar Sep 10 '24 07:09 pily1

They added this to the README.md, maybe it can help you make it work:

🔥🔥 News: 2024/8/29: By adding pipe.enable_sequential_cpu_offload() and pipe.vae.enable_slicing() to the inference code of CogVideoX-5B, VRAM usage can be reduced to 5GB. Please check the updated cli_demo.

jjhaggar avatar Sep 10 '24 18:09 jjhaggar

I followed the README.md instructions, but it still reported the error "torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 35.31 GiB".

pily1 avatar Sep 11 '24 06:09 pily1

Same as #204.

zRzRzRzRzRzRzR avatar Sep 11 '24 07:09 zRzRzRzRzRzRzR

They added this to the README.md, maybe it can help you make it work:

🔥🔥 News: 2024/8/29: By adding pipe.enable_sequential_cpu_offload() and pipe.vae.enable_slicing() to the inference code of CogVideoX-5B, VRAM usage can be reduced to 5GB. Please check the updated cli_demo.

Thank you, after running

pipe.enable_sequential_cpu_offload()
pipe.vae.enable_slicing()
pipe.vae.enable_tiling()

I can run it on my 6GB GPU laptop!
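Put together, the three calls above can be wrapped in a small helper applied to the pipeline before inference. This is a minimal sketch, assuming the diffusers CogVideoXPipeline API; the helper name apply_memory_savings and the generation parameters are illustrative, not from the repository:

```python
def apply_memory_savings(pipe):
    """Enable the three memory-saving options discussed in this thread."""
    pipe.enable_sequential_cpu_offload()  # stream weights to the GPU submodule by submodule
    pipe.vae.enable_slicing()             # decode the latent batch one slice at a time
    pipe.vae.enable_tiling()              # decode each frame in spatial tiles
    return pipe


if __name__ == "__main__":
    import torch
    from diffusers import CogVideoXPipeline
    from diffusers.utils import export_to_video

    pipe = CogVideoXPipeline.from_pretrained(
        "THUDM/CogVideoX-2b", torch_dtype=torch.float16
    )
    apply_memory_savings(pipe)
    video = pipe(prompt="A girl riding a bike.").frames[0]
    export_to_video(video, "output.mp4", fps=8)
```

Sequential CPU offload trades speed for memory by keeping most weights in system RAM, which is why it brings a 5B-parameter model within reach of consumer GPUs.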

Reference:
https://github.com/zai-org/CogVideo?tab=readme-ov-file#cogvideox-5b

wayne931121 avatar Oct 06 '25 19:10 wayne931121