
An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more

304 MLServer issues, sorted by recently updated.

Here is a list of tickets that might affect the alibi-explain runtime: Allow to specify channel dimension in data: https://github.com/SeldonIO/alibi/issues/487; AnchorImage will fail with != fp64 models: https://github.com/SeldonIO/alibi/issues/499; AnchorImage image_shape as List (needed...

alibi-explain

Currently we return the output of explainers as v2 `BYTES`, and downstream consumers would need to know the details of `alibi.Explanation` in order to decode and consume the response. Perhaps we need...

alibi-explain
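
For illustration, this is roughly what a downstream consumer currently has to do with the explainer output described above. It is a minimal sketch, assuming the explanation is serialised as a JSON string inside the first `BYTES` output; the endpoint and model name are placeholders:

```python
import json

import requests

# Hypothetical endpoint and model name, for illustration only.
ENDPOINT = "http://localhost:8080/v2/models/anchor-explainer/infer"

payload = {
    "inputs": [
        {
            "name": "input-0",
            "shape": [1, 4],
            "datatype": "FP32",
            "data": [5.1, 3.5, 1.4, 0.2],
        }
    ]
}

response = requests.post(ENDPOINT, json=payload).json()

# The client has to know that the runtime puts a JSON-encoded explanation in
# the first BYTES output, and what an `alibi.Explanation` payload looks like
# (e.g. its `meta` and `data` fields) in order to make sense of it.
raw_explanation = response["outputs"][0]["data"][0]
explanation = json.loads(raw_explanation)
print(explanation["meta"], list(explanation["data"].keys()))
```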

This might be a ticket on the alibi / core side, but it is referenced here as it might affect MLServer batching. Currently we can only do one explanation for...

alibi-explain

Currently, MLServer saves all the loaded models into an internal model registry (i.e. a `dict`). In order to save resources when models are not being used, it could be useful to...
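
As a rough sketch of the kind of policy this could enable, the registry below evicts models that have been idle for too long; the class and method names are hypothetical and not taken from MLServer:

```python
import time


class ModelRegistry:
    """Toy registry that drops models idle for longer than `max_idle` seconds."""

    def __init__(self, max_idle: float = 600.0):
        self._models = {}      # model name -> model instance
        self._last_used = {}   # model name -> timestamp of last use
        self._max_idle = max_idle

    def load(self, name, model):
        self._models[name] = model
        self._last_used[name] = time.monotonic()

    def get(self, name):
        self._last_used[name] = time.monotonic()
        return self._models[name]

    def evict_idle(self):
        now = time.monotonic()
        for name in list(self._models):
            if now - self._last_used[name] > self._max_idle:
                # A real implementation would also need to reload the model
                # lazily on its next request instead of just dropping it.
                del self._models[name]
                del self._last_used[name]
```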

Similarly to what the Triton server supports, we could add support for binary data in HTTP request payloads.
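
For reference, Triton's binary tensor data extension works roughly as sketched below: the JSON part of the request is followed by the raw tensor bytes, and the `Inference-Header-Content-Length` header tells the server where the JSON ends. The endpoint and model name are placeholders, and the exact design for MLServer would still need to be worked out:

```python
import json

import numpy as np
import requests

tensor = np.random.rand(1, 4).astype(np.float32)
raw = tensor.tobytes()

# JSON header describing the request; the tensor bytes are appended after it
# instead of being inlined in the `data` field.
header = json.dumps(
    {
        "inputs": [
            {
                "name": "input-0",
                "shape": list(tensor.shape),
                "datatype": "FP32",
                "parameters": {"binary_data_size": len(raw)},
            }
        ]
    }
).encode()

requests.post(
    "http://localhost:8000/v2/models/my-model/infer",  # hypothetical endpoint
    data=header + raw,
    headers={
        "Content-Type": "application/octet-stream",
        # Marks the boundary between the JSON header and the binary payload.
        "Inference-Header-Content-Length": str(len(header)),
    },
)
```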

When adaptive batching is enabled, the `outputs` field should be forwarded down to the merged request to ensure that the model receives all the info.
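
A toy sketch of the merging step and the proposed fix, using plain dicts rather than MLServer's actual request types:

```python
def merge_requests(requests: list[dict]) -> dict:
    """Merge several V2 inference requests into one batched request.

    Simplified: real adaptive batching concatenates inputs along the batch
    dimension rather than just appending them.
    """
    merged = {"inputs": [inp for req in requests for inp in req["inputs"]]}

    # The fix: forward the (union of the) requested `outputs` down to the
    # merged request, so the model sees which outputs the clients asked for.
    requested = {out["name"] for req in requests for out in req.get("outputs", [])}
    if requested:
        merged["outputs"] = [{"name": name} for name in sorted(requested)]

    return merged
```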

Tempo currently has an implementation for an Insights Logger, which lets its internal runtime log metrics and payloads. Since this is relatively low-level, it could be interesting to move this...

Hey there, awesome framework! It would be nice to have a PyTorch Lightning or a Flash example (https://github.com/PyTorchLightning/lightning-flash). Best, T.C
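
In the meantime, a custom runtime wrapping a LightningModule could look roughly like the sketch below. It assumes the checkpoint path is passed through the model's `uri` parameter and that `MyLightningModule` lives in the user's own code; input decoding is done by hand instead of through MLServer's codecs:

```python
import numpy as np
import torch

from mlserver import MLModel
from mlserver.types import InferenceRequest, InferenceResponse, ResponseOutput


class LightningRuntime(MLModel):
    """Sketch of a custom runtime serving a PyTorch Lightning checkpoint."""

    async def load(self) -> bool:
        from my_project import MyLightningModule  # hypothetical user module

        self._model = MyLightningModule.load_from_checkpoint(
            self.settings.parameters.uri
        )
        self._model.eval()
        return True

    async def predict(self, payload: InferenceRequest) -> InferenceResponse:
        request_input = payload.inputs[0]
        data = np.array(request_input.data, dtype=np.float32).reshape(
            request_input.shape
        )

        with torch.no_grad():
            prediction = self._model(torch.from_numpy(data)).numpy()

        return InferenceResponse(
            model_name=self.name,
            outputs=[
                ResponseOutput(
                    name="predict",
                    shape=list(prediction.shape),
                    datatype="FP32",
                    data=prediction.flatten().tolist(),
                )
            ],
        )
```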

Currently, `mlserver` relies on having `settings.json` and `model-settings.json` files present, falling back to environment variables otherwise. It would be good to also allow users to specify these flags directly through...
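
For context, this is roughly what a `model-settings.json` looks like today (with placeholder values); it is this kind of configuration that could also be exposed as command-line flags:

```json
{
  "name": "my-model",
  "implementation": "mlserver_sklearn.SKLearnModel",
  "parameters": {
    "uri": "./model.joblib",
    "version": "v0.1.0"
  }
}
```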

It would be great to add a PaddlePaddle inference runtime to MLServer. This could be modelled after the existing KFServing integration, which can be seen here: https://github.com/kubeflow/kfserving/blob/master/python/paddleserver/paddleserver/model.py
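
The `load` and `predict` steps of such a runtime would essentially delegate to Paddle's inference API, roughly as below; the file paths and input shape are placeholders, and the exact calls should be checked against the linked KFServing implementation:

```python
import numpy as np
from paddle import inference

# Load the exported Paddle model (paths are placeholders).
config = inference.Config("./model/model.pdmodel", "./model/model.pdiparams")
predictor = inference.create_predictor(config)

data = np.random.rand(1, 3, 224, 224).astype(np.float32)

# Copy the input in, run the predictor and copy the first output back out.
input_handle = predictor.get_input_handle(predictor.get_input_names()[0])
input_handle.copy_from_cpu(data)
predictor.run()

output_handle = predictor.get_output_handle(predictor.get_output_names()[0])
result = output_handle.copy_to_cpu()
print(result.shape)
```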