i-Code icon indicating copy to clipboard operation
i-Code copied to clipboard

How to use single machine multi card reasoning

Open hcxss opened this issue 1 year ago • 8 comments

How to use single machine multi card reasoning

hcxss avatar Jun 30 '23 08:06 hcxss

Is your GPU memory below 30GB ? If not, 32GB mem or 40GB mem is enough

zinengtang avatar Jun 30 '23 08:06 zinengtang

单卡24GB

---Original--- From: "Zineng @.> Date: Fri, Jun 30, 2023 16:50 PM To: @.>; Cc: "Changxiu @.@.>; Subject: Re: [microsoft/i-Code] How to use single machine multi card reasoning(Issue #66)

Is your GPU memory below 30GB ? If not, 32GB mem or 40GB mem is enough

— Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you authored the thread.Message ID: @.***>

hcxss avatar Jun 30 '23 08:06 hcxss

Let me try to make a PR to incorporate fp16 and it should be less than 24GB mem.

zinengtang avatar Jun 30 '23 09:06 zinengtang

Can't configure single machine multi card

---Original--- From: "Zineng @.> Date: Fri, Jun 30, 2023 17:04 PM To: @.>; Cc: "Changxiu @.@.>; Subject: Re: [microsoft/i-Code] How to use single machine multi card reasoning(Issue #66)

Let me try to make a PR to incorporate fp16 and it should be less than 24GB mem.

— Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you authored the thread.Message ID: @.***>

hcxss avatar Jun 30 '23 09:06 hcxss

what's the reason of it?

zinengtang avatar Jun 30 '23 09:06 zinengtang

you can try deepspeed stage 3 for model parameter states partitioning

zinengtang avatar Jun 30 '23 09:06 zinengtang

Can you elaborate on this a little more? Thanks

PHOENIXFURY007 avatar Jul 10 '23 20:07 PHOENIXFURY007

so in using deepspeed stage 3, the model parameters states will be split among gpus and thus reducing the model memory. deepspeed is supported in pytorch, pytorch lightning, etc.

zinengtang avatar Jul 11 '23 04:07 zinengtang