BentoML
BentoML copied to clipboard
[minor issue] The BentoML image becomes too heavy after including the dependency on nvidia-ml-py3.
https://github.com/bentoml/BentoML/blob/cc765bba83501f446297de31fdc819cd7dcc2901/pyproject.toml#L40C23-L40C23
To be precise, build times have increased since the pynvml<12
dependency was added.
bento image size was increased more than 2~4GB (in my case (pytorch cpu)
)
given that most model serving is GPU-based anyway, adding a dependency makes sense.
but I think it can be more useful if exclude gpu extra option is added
like a bentoml[cpu-only]
or
# bentofile.yaml
docker:
cuda_enable: false
# cuda_version: "11.6.2"