BentoML [minor issue] The BentoML image becomes too heavy after including the dependency on nvidia-ml-py3.

Open KimSoungRyoul opened this issue 7 months ago • 2 comments

https://github.com/bentoml/BentoML/blob/cc765bba83501f446297de31fdc819cd7dcc2901/pyproject.toml#L40C23-L40C23

To be precise, build times have increased since the pynvml<12 dependency was added.

The Bento image grew by 2 to 4 GB or more (in my case, with CPU-only PyTorch).

스크린샷 2023-11-30 오후 5 54 52

Given that most model serving is GPU-based anyway, adding the dependency makes sense.

But I think it would be more useful if an extra option that excludes the GPU dependency were added,

something like bentoml[cpu-only].

# bentofile.yaml

docker:
   cuda_enable: false 
   # cuda_version: "11.6.2"
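
For illustration, here is a minimal Python sketch of how code could tolerate a CPU-only install in which the NVML bindings are missing. It is not BentoML's actual implementation. The helper name visible_gpu_count is hypothetical, and it assumes the bindings are importable as pynvml when installed. If an extra like the bentoml[cpu-only] proposed above existed, a guard along these lines would let GPU detection fall back to zero devices instead of failing at import time.

```python
import importlib


def visible_gpu_count() -> int:
    """Return the number of NVIDIA GPUs, or 0 when NVML cannot be used.

    Hypothetical helper for illustration only; not part of BentoML.
    """
    try:
        # Assumption: the NVML bindings expose the module name "pynvml".
        # On a CPU-only install the module is expected to be absent.
        pynvml = importlib.import_module("pynvml")
    except ImportError:
        return 0

    try:
        pynvml.nvmlInit()
        try:
            return int(pynvml.nvmlDeviceGetCount())
        finally:
            # Always release NVML once it has been initialised.
            pynvml.nvmlShutdown()
    except Exception:
        # Bindings are installed but no driver or GPU is available.
        return 0


if __name__ == "__main__":
    print(f"GPUs detected: {visible_gpu_count()}")
```

With a guard like this, the GPU bindings could in principle move from a hard requirement to an optional one, so a CPU-only image would not need to pull them in.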

Nov 30 '23 09:11 KimSoungRyoul