Langchain-Chatchat [BUG] CUDA out of memory with container deployment

Open BillShiyaoZhang opened this issue 2 years ago • 3 comments

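Problem Description

After downloading and running the image as described in the README and completing the installation required for startup, the application fails with a CUDA out of memory error.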

Steps to Reproduce

Expected Result

Startup completes and the web page can be opened.

Actual Result

ERROR 2023-05-10 18:15:04,235-1d: CUDA out of memory. Tried to allocate 128.00 MiB (GPU 0; 8.00 GiB total capacity; 7.25 GiB already allocated; 0 bytes free; 7.25 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
INFO 2023-05-10 18:15:04,236-1d: Model failed to load. Go to the "Model Configuration" tab at the top left of the page, reselect a model, and click the "Load Model" button
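
The error above suggests `max_split_size_mb` only when reserved memory is much larger than allocated memory. The following is a rough sketch, not part of Langchain-Chatchat, that parses the numbers out of the message to check whether that condition holds. The helper names (`parse_oom`, `fragmentation_likely`, `set_max_split_size`) and the 20% threshold are assumptions made for illustration; only the `PYTORCH_CUDA_ALLOC_CONF` variable and its `max_split_size_mb` option come from the message itself.

```python
import os
import re

# Matches sizes such as "128.00 MiB", "7.25 GiB" or "0 bytes".
_SIZE = r"([\d.]+)\s*(bytes|KiB|MiB|GiB)"
_UNITS = {"bytes": 1, "KiB": 2**10, "MiB": 2**20, "GiB": 2**30}

# Follows the wording of the message in this report; other PyTorch
# versions may phrase the error differently.
_PATTERN = re.compile(
    rf"Tried to allocate {_SIZE}.*?{_SIZE} total capacity; "
    rf"{_SIZE} already allocated; {_SIZE} free; {_SIZE} reserved"
)
_FIELDS = ["requested", "capacity", "allocated", "free", "reserved"]


def parse_oom(message: str) -> dict:
    """Return the sizes quoted in a CUDA OOM message, in bytes."""
    match = _PATTERN.search(message)
    if match is None:
        raise ValueError("not a recognised CUDA out-of-memory message")
    groups = match.groups()
    return {
        name: float(groups[2 * i]) * _UNITS[groups[2 * i + 1]]
        for i, name in enumerate(_FIELDS)
    }


def fragmentation_likely(stats: dict, threshold: float = 0.2) -> bool:
    """True if reserved memory exceeds allocated memory by more than
    `threshold` (a hypothetical cut-off) as a fraction of reserved."""
    if stats["reserved"] == 0:
        return False
    slack = (stats["reserved"] - stats["allocated"]) / stats["reserved"]
    return slack > threshold


def set_max_split_size(mb: int) -> None:
    """Set the allocator option named in the error message.
    Has to run before the first CUDA allocation, so call it before
    importing torch or pass the variable to the container instead."""
    os.environ["PYTORCH_CUDA_ALLOC_CONF"] = f"max_split_size_mb:{mb}"


OOM_MESSAGE = (
    "CUDA out of memory. Tried to allocate 128.00 MiB (GPU 0; 8.00 GiB "
    "total capacity; 7.25 GiB already allocated; 0 bytes free; 7.25 GiB "
    "reserved in total by PyTorch)"
)

stats = parse_oom(OOM_MESSAGE)
print(fragmentation_likely(stats))  # False
```

For the message in this report, reserved and allocated memory are both 7.25 GiB, so fragmentation does not appear to be the cause and `max_split_size_mb` is unlikely to help. The 8 GiB GPU is simply full, which points toward a smaller or quantized model rather than an allocator setting.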

Environment Information