BentoML
The easiest way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Multi-model Inference Graph/Pipelines, LLM/RAG apps, and more!
Support using gRPC instead of the HTTP API for sending prediction requests. When an API model server is deployed as a backend service, many teams prefer gRPC over HTTP. See...
- feat: scaffolding for onnxmlir support - feat: onnxmlir API work in progress, included here as a draft for trying out #2693. Tests will follow accordingly.
Signed-off-by: Aaron Pham. Added zsh completion. Ideally we want to extend click-completion, but right now click-completion is very slow and does not understand how to autocomplete bentos and models.
**Is your feature request related to a problem? Please describe.** Currently, Yatai provides a few ways to create and manage deployments: * Web UI (requires being logged in to a Yatai account) *...
### Describe the bug cc https://bentoml.slack.com/archives/CKRANBHPH/p1658494302553029 TL;DR: When running `bentoml build` locally, it works as expected. However, on an AzureDevOps Python agent, the process seems to hang. ### To reproduce Current...
To solve: - [ ] have DataContainer automatically recognize multiple outputs
https://github.com/readthedocs/sphinx_rtd_theme/issues/761
There is increasing demand from the community for adding custom metrics to the API service. BentoML supports basic service-level metrics out of the box, including request duration, in-progress request count, and total request count, using...
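The three built-in service-level metrics mentioned above can be sketched in plain Python. This is a toy illustration only, not BentoML's implementation (BentoML exposes these via Prometheus); the class and metric names here are illustrative:

```python
import time
from collections import defaultdict
from contextlib import contextmanager


class ServiceMetrics:
    """Toy sketch of service-level metrics: request count,
    in-progress gauge, and per-request duration."""

    def __init__(self):
        self.request_count = defaultdict(int)   # endpoint -> total requests served
        self.in_progress = 0                    # requests currently being handled
        self.durations = defaultdict(list)      # endpoint -> observed durations (s)

    @contextmanager
    def track(self, endpoint: str):
        # Increment the in-progress gauge for the lifetime of the request,
        # then record duration and bump the counter when it finishes.
        self.in_progress += 1
        start = time.perf_counter()
        try:
            yield
        finally:
            self.durations[endpoint].append(time.perf_counter() - start)
            self.in_progress -= 1
            self.request_count[endpoint] += 1


metrics = ServiceMetrics()
with metrics.track("/predict"):
    pass  # stand-in for actual model inference
```

A real implementation would register these as Prometheus counter, gauge, and histogram types so they can be scraped, but the bookkeeping pattern is the same.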
- [ ] `max-latency` & `timeout`
  - [x] api server timeout
  - [x] provide both max-latency and timeout in BentoServer config
  - [x] default `max-latency`: `10s`
  - [ ] default...
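As a rough sketch of what the BentoServer configuration above might look like, a YAML fragment along these lines could express both values (the exact key names and nesting here are assumptions, not the finalized schema):

```yaml
# Hypothetical BentoServer config fragment for the checklist above.
api_server:
  traffic:
    timeout: 10          # request timeout in seconds (assumed key)
    max_latency: 10000   # drop requests expected to exceed this, in ms (assumed key)
```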
`OMP_NUM_THREADS` must be set before NumPy is imported for it to take effect; our current implementation does not guarantee that ordering.
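The ordering constraint can be shown in a few lines. The OpenMP runtime reads `OMP_NUM_THREADS` once when it initializes, so setting it after NumPy has already been imported has no effect; the value `"1"` below is purely illustrative:

```python
# Pin the OpenMP thread pool size BEFORE NumPy's first import.
import os

os.environ["OMP_NUM_THREADS"] = "1"  # illustrative value; must precede `import numpy`

import numpy as np  # OpenMP-backed BLAS picks up the setting at init time

# Normal NumPy usage follows; the thread limit is already locked in.
result = np.ones(3).sum()
```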