The GPUs the model runs on have free resources, but an error says a different GPU has insufficient resources
System Info
NVIDIA-SMI 535.183.06 Driver Version: 535.183.06 CUDA Version: 12.2
Running Xinference with Docker?
- [X] docker
- [ ] pip install
- [ ] installation from source
Version info
0.15.2
The command used to start Xinference
docker run
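Reproduction
According to the message shown, qwen2.5 is running on GPUs 1 and 2,
but an error reports that GPU 0 has no resources available, which causes FastGPT to fail with "LLM model response empty".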
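To confirm which GPUs actually have free memory when the error appears, a minimal sketch along these lines may help. It assumes `nvidia-smi` is on the PATH of the host (or container) and that every GPU reports numeric memory values. The function names `parse_gpu_memory` and `query_gpu_memory` are hypothetical helpers for illustration, not part of Xinference.

```python
import subprocess

# nvidia-smi query: one CSV line per GPU, values in MiB, no header or units.
QUERY = [
    "nvidia-smi",
    "--query-gpu=index,memory.used,memory.total",
    "--format=csv,noheader,nounits",
]


def parse_gpu_memory(csv_text):
    """Parse 'index, used, total' lines into {gpu_index: free_mib}."""
    free = {}
    for line in csv_text.strip().splitlines():
        index, used, total = (int(field.strip()) for field in line.split(","))
        free[index] = total - used
    return free


def query_gpu_memory():
    """Run nvidia-smi and return free memory per GPU in MiB."""
    result = subprocess.run(QUERY, capture_output=True, text=True, check=True)
    return parse_gpu_memory(result.stdout)
```

Calling `query_gpu_memory()` right after the error shows whether GPU 0 is really full while GPUs 1 and 2 still have room, which is useful to attach to the report.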
Expected behavior
The model runs normally.
This issue is stale because it has been open for 7 days with no activity.
This issue was closed because it has been inactive for 5 days since being marked as stale.