feature: How to use multi GPU

Open Euraxluo opened this issue 10 months ago • 3 comments

An example of a model that uses multiple gpus at the same time, or we should have a way to make this easy to use

No response

No response

Feb 20 '25 03:02 Euraxluo

@bentoml.service(resources={"gpu": 4})
class MyService:
    ...

And it will inject CUDA_VISIBLE_DEVICES=0,1,2,3 into the environment

Feb 20 '25 03:02 frostming

Will it automatically schedule and use all gpus at the same time?

Feb 25 '25 01:02 Euraxluo

What automation do you mean, that is all it does and it depends on how the framework respects the env var.

Feb 25 '25 01:02 frostming