xianxian.zhang
xianxian.zhang
> What are the bugs within the current implementation? context in https://bentoml-team.slack.com/archives/C02QLC8RB5W/p1695088745009929 as a summary, currently it will take too much memory when do a `bentoml push` since it use...
> Looks like unit tests are failing, maybe because of the requests change...? > > Should we just use a `SpooledTemporaryFile` for everything? - checked the ut failed logs, seems...
Definitely runner can run in parallel especially for the VLLM case we can do batching to enhance performance. We can not make such assumption in BentoML. BTW, If you want...
Hi. thanks for reporting. Bentoml 1.2 Service have not impl Grpc support yet.