MLServer
An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more
## How to reproduce:

```
docker run -it python:3.9 bash
>>> pip install mlserver
>>> python -c "import mlserver"
```

Package versions:

```
> pip list | grep mlserver
mlserver...
```
I am interested in using MLServer with HuggingFace models. To unlock the full potential of these models, it is essential to be able to modify the generation parameters (https://huggingface.co/docs/transformers/main_classes/text_generation). I have tried to...
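For context, a minimal sketch of the kind of generation parameters in question, using the transformers pipeline API directly; the model name and parameter values below are illustrative, not taken from this report:

```python
from transformers import pipeline

# Illustrative only: "gpt2" and the parameter values are placeholders.
generator = pipeline("text-generation", model="gpt2")

# These generation parameters (documented at the link above) are what
# one would want to control through MLServer's HuggingFace runtime.
outputs = generator(
    "Once upon a time",
    max_new_tokens=50,   # cap on the number of generated tokens
    temperature=0.7,     # sampling temperature
    top_p=0.9,           # nucleus sampling threshold
    do_sample=True,      # sample instead of greedy decoding
)
print(outputs[0]["generated_text"])
```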
I am trying to run a transformer model using parallel inference on 4 workers on a machine that has 4 GPUs. The 4 workers are able to load the model...
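For reference, the size of MLServer's inference pool is configured through the server-level `settings.json`; a minimal sketch requesting the four workers described above (all other settings left at their defaults):

```json
{
  "parallel_workers": 4
}
```

Note that pinning each worker to a specific GPU (e.g. via `CUDA_VISIBLE_DEVICES`) is a separate concern and is not covered by this setting.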
Hi, I observed some weird behavior when using the REST API with adaptive batching enabled. When sending a **single** request to the v2 REST endpoint `/v2/models/{model-name}/infer`, the Parameters within the...
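For context, adaptive batching is enabled per model in `model-settings.json` via the `max_batch_size` and `max_batch_time` settings; a minimal sketch, with illustrative values:

```json
{
  "name": "my-model",
  "max_batch_size": 8,
  "max_batch_time": 0.5
}
```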
As git isn't installed on the mlserver images, any reference to a git path in your requirements.txt will cause `mlserver build` commands to fail.

Reproduce:
- Specify a git path in your requirements.txt...
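For illustration, the kind of requirements.txt entry that triggers this; the package name and repository URL are hypothetical:

```
# Hypothetical example of a git-based requirement, which pip resolves by
# invoking git -- and therefore fails if git is absent from the image.
mypackage @ git+https://github.com/example/mypackage.git@main
```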
MLServer only supports Pydantic V1, which is a problem for us as we would like to move to Pydantic V2 for all our services using Pydantic. Do you think MLServer...
Hello, first of all, thank you for the project and the time spent on it. We have a very simple MLflow pyfunc model that is being pickled and loaded in...
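For context, a minimal sketch of the kind of pyfunc model in question; the class name and output path are hypothetical:

```python
import mlflow.pyfunc


class EchoModel(mlflow.pyfunc.PythonModel):
    """A trivial pyfunc model that returns its input unchanged."""

    def predict(self, context, model_input):
        return model_input


# Serialize the model so it can later be loaded by a serving runtime
# such as mlserver-mlflow.
mlflow.pyfunc.save_model(path="echo_model", python_model=EchoModel())
```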
Resolves #1506
If using a `model-settings.json` of the following form:

```json
{
  "name": "my-model",
  "implementation": "mlserver_huggingface.HuggingFaceRuntime",
  "parameters": {
    "extra": {
      "task": "text-generation",
      "pretrained_model": "model/path",
      "model_kwargs": {
        "load_in_8bit": true
      }
    }
  }
}
```

...
Support a heterogeneous pool of workers with a variable number of model replicas per worker