Sean Sheng comments

Results: 23 comments by Sean Sheng

Provide the feature to save all model input/output to an external storage for debugging or replay later

@withsmilo we are in the design phase of a model monitoring solution where we offer APIs for logging features and inference results and configurations for shipping the logs to a...

SnapML support

@andrewsi-z we have released a stable BentoML framework API in 1.0. You are welcome to contribute SnapML as one of the natively supported frameworks.

API server SLOs

Should we implement max latency similar to the `deadline` feature in gRPC or have a 10 max latency PER runner?
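To make the two options concrete, here is a minimal sketch using only the Python standard library. It is not BentoML code: `fake_runner`, `per_runner_limit`, `request_deadline`, and `times_out` are hypothetical names, and the runner is simulated with `asyncio.sleep`. The first helper gives every runner call its own fixed budget; the second tracks one deadline for the whole request and hands each call only the time that remains, roughly in the spirit of a gRPC deadline.

```python
import asyncio
import time


async def fake_runner(delay_s):
    # Hypothetical stand-in for a runner call that takes `delay_s` seconds.
    await asyncio.sleep(delay_s)
    return "ok"


async def per_runner_limit(delays, max_latency_s):
    # Option A: every runner call gets its own fixed latency budget.
    results = []
    for delay in delays:
        results.append(await asyncio.wait_for(fake_runner(delay), timeout=max_latency_s))
    return results


async def request_deadline(delays, deadline_s):
    # Option B: one deadline for the whole request; each call only
    # receives whatever part of the budget is still left.
    start = time.monotonic()
    results = []
    for delay in delays:
        remaining = deadline_s - (time.monotonic() - start)
        if remaining <= 0:
            raise asyncio.TimeoutError
        results.append(await asyncio.wait_for(fake_runner(delay), timeout=remaining))
    return results


def times_out(coro):
    # Helper: run a coroutine and report whether it hit a timeout.
    try:
        asyncio.run(coro)
        return False
    except asyncio.TimeoutError:
        return True
```

With three sequential 0.1 s calls, a 0.5 s per-runner limit passes even though the request takes about 0.3 s in total, while a 0.15 s request-level deadline fails during the second call. That difference is the trade-off the question is about.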

--production flag slows down my requests

@udevnl Thanks for your help with benchmarking BentoML performance. If neither CPU nor GPU is saturated, the bottleneck could be in the API or Runner code itself. Here are common...

fix(scheduling): raise an error if the user provides an invalid resources request

There were CI test failures. Could you please take a look, @bojiang?

Add Yatai deployment APIs to Yatai client

@fwindolf How's the progress on this? Any help we can provide?

Add Yatai deployment APIs to Yatai client

@fwindolf, thanks for the update. No, you are not blocking anyone. We'd just like to offer our help if needed. Looking forward to your PR.

An easier way to include a bento_server configuration file inside of the bento

One option is to add a `configuration` section to bentofile.yml.

An easier way to include a bento_server configuration file inside of the bento

- Configuration should be versioned
- Configuration should be mapped to environment variable names
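As an illustration of the second point, the sketch below flattens a nested configuration into environment variable names. The naming scheme (a `BENTOML` prefix, keys joined with underscores and upper-cased), the `flatten_to_env` helper, and the sample keys are all assumptions made for this example; the comment does not specify the actual mapping. A top-level `version` key stands in for the versioning requirement.

```python
def flatten_to_env(config, prefix="BENTOML"):
    """Map a nested config dict to {ENV_VAR_NAME: string value}.

    Hypothetical scheme: PREFIX_SECTION_KEY, upper-cased.
    """
    env = {}
    for key, value in config.items():
        name = f"{prefix}_{key}".upper()
        if isinstance(value, dict):
            # Recurse into nested sections, extending the prefix.
            env.update(flatten_to_env(value, name))
        else:
            env[name] = str(value)
    return env


# Hypothetical configuration; `version` lets the schema evolve over time.
sample_config = {
    "version": 1,
    "api_server": {"port": 3000, "timeout": 60},
}
```

Calling `flatten_to_env(sample_config)` yields names such as `BENTOML_API_SERVER_PORT`, so each setting can be overridden at deploy time without editing the file.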

feat(EXPERIMENTAL): grpc

Thanks for compiling this PR, @aarnphm. On the CLI, should we use the `--grpc` option or a `serve-grpc` sub-command? Using `--grpc` makes sense if most of the other options are also applicable...
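The two CLI shapes can be compared with a small `argparse` sketch. This is not the BentoML CLI: the `demo` program name, the `--port` option, and both builder functions are hypothetical and only show how each layout would parse.

```python
import argparse


def build_flag_style():
    # Layout 1: a single `serve` command with a `--grpc` switch,
    # so every other option is shared automatically.
    parser = argparse.ArgumentParser(prog="demo")
    sub = parser.add_subparsers(dest="command", required=True)
    serve = sub.add_parser("serve")
    serve.add_argument("--port", type=int, default=3000)
    serve.add_argument("--grpc", action="store_true")
    return parser


def build_subcommand_style():
    # Layout 2: separate `serve` and `serve-grpc` sub-commands,
    # each free to define its own option set.
    parser = argparse.ArgumentParser(prog="demo")
    sub = parser.add_subparsers(dest="command", required=True)
    for name in ("serve", "serve-grpc"):
        cmd = sub.add_parser(name)
        cmd.add_argument("--port", type=int, default=3000)
    return parser
```

The flag layout keeps one option list to maintain, while the sub-command layout makes it easier for the two modes to diverge if their options stop overlapping.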