Krishna Nadiminti
Krishna Nadiminti
Thanks @sudeepg545 ! I will look into this POV. Will update on this @jianshen92 .
Interesting! Having the same bug for 2 GPU settings, 1 GPU one is working fine for me.
I am even facing this issue mainly on transformer models, any one had any breakthrough ?
Hi @jianshen92 ! I hope this one I created earlier would help! https://github.com/bentoml/BentoML/issues/4238 bentoml serve bento:xx works fine. but containerizing and running the container kind of causes this issue.