inference
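Allow a single large-VRAM GPU (one slot) to run multiple language models
Feature request
I would like to run two language models at the same time on my GPU, which has 24 GB of VRAM.
Motivation
Make full use of the GPU's memory and compute capacity.
Your contribution
None.
Working on it.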
This issue is stale because it has been open for 7 days with no activity.
Would manually specifying the GPU index work?
You can give it a try.
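As a rough illustration of pinning two processes to the same GPU index, here is a minimal sketch that relies only on the standard CUDA environment variable `CUDA_VISIBLE_DEVICES`. It is not this project's own API: the helper names `env_for_gpu` and `launch_on_gpu` are hypothetical, and the child command is a placeholder standing in for a real model server.

```python
import os
import subprocess
import sys


def env_for_gpu(gpu_index: int) -> dict:
    """Return a copy of the current environment restricted to one GPU.

    Hypothetical helper: CUDA applications started with this environment
    only see the device with the given index.
    """
    env = dict(os.environ)
    env["CUDA_VISIBLE_DEVICES"] = str(gpu_index)
    return env


def launch_on_gpu(command: list, gpu_index: int) -> subprocess.Popen:
    """Start a child process that is pinned to the given GPU index."""
    return subprocess.Popen(command, env=env_for_gpu(gpu_index))


if __name__ == "__main__":
    # Placeholder command (assumption): replace with the real command
    # that starts one model server.
    child = [
        sys.executable,
        "-c",
        "import os; print(os.environ['CUDA_VISIBLE_DEVICES'])",
    ]
    # Start two processes, both pinned to GPU 0.
    procs = [launch_on_gpu(child, 0) for _ in range(2)]
    for proc in procs:
        proc.wait()
```

Pinning only controls which device each process sees. Both models still have to fit in the card's 24 GB together, so whether this works depends on the model sizes and their runtime memory use.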
This issue is stale because it has been open for 7 days with no activity.
This issue was closed because it has been inactive for 5 days since being marked as stale.