Allow one large-VRAM GPU (a single slot) to run multiple language models

Open gzGRQup opened this issue 1 year ago • 5 comments

Feature request

I would like to run two language models at the same time on my GPU, which has 24 GB of VRAM.

Motivation

To make full use of the GPU's VRAM and compute.

Your contribution

gzGRQup avatar Sep 13 '24 07:09 gzGRQup

Working on it.

qinxuye avatar Sep 13 '24 07:09 qinxuye

This issue is stale because it has been open for 7 days with no activity.

github-actions[bot] avatar Sep 20 '24 19:09 github-actions[bot]

Would manually specifying the GPU index work?

goactiongo avatar Sep 24 '24 04:09 goactiongo

You can give it a try.

qinxuye avatar Sep 25 '24 08:09 qinxuye
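
For readers who want to try the manual GPU index suggestion above, here is a minimal sketch of the general idea. It is not the project's API. It assumes each model runs in its own process, and it pins those processes to the same device with the standard CUDA environment variable `CUDA_VISIBLE_DEVICES`. The helper names `env_for_gpu` and `fits_on_gpu`, the 1 GB headroom, and the memory estimates are all hypothetical, and the thread does not confirm whether this approach works for this feature request.

```python
import os
from typing import Dict, List, Optional


def env_for_gpu(gpu_index: int, base_env: Optional[Dict[str, str]] = None) -> Dict[str, str]:
    """Return a copy of the environment that exposes only one GPU to a process."""
    if gpu_index < 0:
        raise ValueError("gpu_index must be non-negative")
    env = dict(os.environ if base_env is None else base_env)
    # CUDA_VISIBLE_DEVICES is the standard CUDA variable that limits
    # which devices a process can see.
    env["CUDA_VISIBLE_DEVICES"] = str(gpu_index)
    return env


def fits_on_gpu(model_vram_gb: List[float], total_vram_gb: float,
                headroom_gb: float = 1.0) -> bool:
    """Rough check that the combined memory estimates fit, leaving some headroom."""
    return sum(model_vram_gb) + headroom_gb <= total_vram_gb


if __name__ == "__main__":
    # Hypothetical estimates: two models of about 10 GB each on a 24 GB card.
    estimates = [10.0, 10.0]
    print(fits_on_gpu(estimates, total_vram_gb=24.0))
    # Both model processes would be started with the same pinned environment,
    # for example subprocess.Popen(command, env=env_for_gpu(0)).
    print(env_for_gpu(0, base_env={})["CUDA_VISIBLE_DEVICES"])
```

The memory check is only a coarse estimate. Actual usage also depends on context length, batch size, and the inference backend, so a real deployment would need to measure it.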

This issue is stale because it has been open for 7 days with no activity.

github-actions[bot] avatar Oct 02 '24 19:10 github-actions[bot]

This issue was closed because it has been inactive for 5 days since being marked as stale.

github-actions[bot] avatar Oct 08 '24 19:10 github-actions[bot]