An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more

304 MLServer issues

When using `model-settings.json` to define the inputs, it appears that the client inference request [doesn't need to define](https://mlserver.readthedocs.io/en/latest/user-guide/content-type.html#model-metadata) the `content_type`, but still needs to define the `datatype` and/or `shape`. It...
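A minimal sketch of the pattern being described, assuming a model whose metadata (including content types) is declared in `model-settings.json`; the model name `my-model`, tensor name, and values are hypothetical:

```python
import requests

# V2 inference request: `datatype` and `shape` must still be set per input,
# even though the content type can be inherited from the model metadata
# declared in model-settings.json.
payload = {
    "inputs": [
        {
            "name": "input-0",
            "datatype": "FP32",  # still required on the request
            "shape": [1, 4],     # still required on the request
            # no "parameters": {"content_type": ...} -- taken from metadata
            "data": [0.1, 0.2, 0.3, 0.4],
        }
    ]
}

response = requests.post(
    "http://localhost:8080/v2/models/my-model/infer", json=payload
)
print(response.json())
```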

Currently we only expose the `predict` method. However, some orchestration frameworks like SC support the use of other "inference steps", like routing or aggregation. It would be good to explore...
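For context, a custom runtime today only overrides `predict`; the sketch below shows the current extension point, with the routing-style hook shown purely as a hypothetical comment, not an existing API:

```python
from mlserver import MLModel
from mlserver.types import InferenceRequest, InferenceResponse


class MyRuntime(MLModel):
    async def load(self) -> bool:
        # Load weights / artefacts here.
        return True

    async def predict(self, payload: InferenceRequest) -> InferenceResponse:
        # `predict` is currently the only inference method MLServer exposes.
        return InferenceResponse(model_name=self.name, outputs=[])

    # Hypothetical: an orchestrator-facing step such as routing would need
    # a new, currently non-existent hook, e.g.:
    # async def route(self, payload: InferenceRequest) -> str: ...
```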

MLServer now has built-in support to unpack and activate a [conda-pack](https://conda.github.io/conda-pack/) tarball. This feature could be leveraged to run the custom environment defined in the `conda.yaml` file usually present in MLflow...
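As a rough illustration of the workflow, conda-pack's Python API can serialise an environment (e.g. one built from an MLflow model's `conda.yaml` via `conda env create`) into the kind of tarball MLServer unpacks; the environment name and output path here are assumptions:

```python
import conda_pack

# Pack a pre-built conda environment into a portable tarball that
# MLServer's conda-pack support could then unpack and activate.
conda_pack.pack(
    name="mlflow-model-env",      # hypothetical environment name
    output="environment.tar.gz",  # tarball for MLServer to unpack
)
```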

Intel has an accelerator (for both training and inference) for scikit-learn models: https://github.com/intel/scikit-learn-intelex. It would be interesting to see if we could use this in our scikit-learn runtime. It's unclear...
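For reference, the extension works by patching scikit-learn before estimators are imported; a minimal standalone sketch (whether and how this could be toggled inside the runtime is the open question):

```python
from sklearnex import patch_sklearn

# Swap in Intel-optimised implementations for supported estimators.
# Must run before the scikit-learn estimators are imported.
patch_sklearn()

from sklearn.svm import SVC  # noqa: E402  (import after patching)
from sklearn.datasets import make_classification  # noqa: E402

X, y = make_classification(n_samples=1_000, n_features=20, random_state=0)
model = SVC().fit(X, y)
print(model.score(X, y))
```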

https://github.com/SeldonIO/MLServer/blob/master/docs/examples/custom/README.md In the example above, I see that in the training phase the data is standardized (via a lambda function), but this doesn't happen at inference time, so the model would...
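One common way to avoid this kind of training/serving skew, sketched here as a general pattern rather than a fix for that specific example, is to bundle the preprocessing into the persisted artefact so inference applies the same transform:

```python
import joblib
from sklearn.datasets import load_iris
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)

# Bundling the scaler into the pipeline means the exact same
# standardisation runs at inference time, not just during training.
model = make_pipeline(StandardScaler(), SVC())
model.fit(X, y)

joblib.dump(model, "model.joblib")  # the single artefact the runtime loads
```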

Using the built-in logger is convenient because it's auto-configured: https://github.com/SeldonIO/MLServer/blob/743778766be536865c847135af93fedbcc89ba96/mlserver/logging.py#L23 However, it falls short when trying to debug messages coming from multiple different models - all of these will be assigned...
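A minimal sketch of one way to attach per-model context, using a stdlib `LoggerAdapter` rather than anything MLServer currently provides (the model name is hypothetical):

```python
import logging

logger = logging.getLogger("mlserver")


class ModelLoggerAdapter(logging.LoggerAdapter):
    """Prefix every record with the model name so interleaved logs
    from multiple models can be told apart."""

    def process(self, msg, kwargs):
        return f"[{self.extra['model_name']}] {msg}", kwargs


# Hypothetical usage inside a runtime, where `self.name` would supply
# the model's name:
model_logger = ModelLoggerAdapter(logger, {"model_name": "my-model"})
model_logger.info("loaded successfully")
```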

As a follow-up to the initial PR that introduced the HuggingFace Optimum Runtime (#4081), we have identified a set of follow-up tasks to improve the server:
* [ ] Extend...

To better support use cases where `mlserver` is used as a library, we shouldn't restrict dependency versions too much. Instead, we should look into adding some sort of lockfile...

Inputs are a list of tensors with a `name` entry; however, it's not possible to use this to select tensors by name. Instead one must either resort to index-based selection...
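A small sketch of the kind of name-based lookup the issue is asking for, written as a standalone helper rather than an existing MLServer API:

```python
from typing import Optional

from mlserver.types import InferenceRequest, RequestInput


def get_input_by_name(
    request: InferenceRequest, name: str
) -> Optional[RequestInput]:
    """Linear scan over request.inputs, matching on the `name` field --
    the alternative to index-based selection described in the issue."""
    for tensor in request.inputs:
        if tensor.name == name:
            return tensor
    return None
```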

add catboost support (a rough runtime sketch follows below):
* add a new runtime by heavily copying the lightgbm runtime
* add a test that builds a model, sends an inference request, and validates the prediction
* add example...
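A rough sketch of what such a runtime might look like, modelled loosely on the pattern other MLServer runtimes follow; the `CatBoost` model class, codec choice, and API details here are assumptions that may need adjusting to the repo's conventions and MLServer version:

```python
from catboost import CatBoost

from mlserver import MLModel
from mlserver.codecs import NumpyCodec
from mlserver.types import InferenceRequest, InferenceResponse
from mlserver.utils import get_model_uri


class CatboostModel(MLModel):
    async def load(self) -> bool:
        # Resolve the model artefact path, as other runtimes do.
        model_uri = await get_model_uri(self._settings)
        self._model = CatBoost()
        self._model.load_model(model_uri)
        return True

    async def predict(self, payload: InferenceRequest) -> InferenceResponse:
        # Decode the first input tensor to a numpy array, predict, re-encode.
        decoded = self.decode(payload.inputs[0], default_codec=NumpyCodec)
        prediction = self._model.predict(decoded)
        return InferenceResponse(
            model_name=self.name,
            outputs=[NumpyCodec.encode_output(name="predict", payload=prediction)],
        )
```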