MLServer
An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more
I was trying to serve a model in MLflow Model format created using MLflow > 2.10. MLflow 2.10 added support for `Array` and `Object` datatypes in signatures (https://www.mlflow.org/docs/latest/model/signatures.html), which...
This PR removes macOS from the GitHub workflow. `torch >= 2.2.0` is no longer supported on macOS. See the issue [here](https://github.com/pytorch/pytorch/issues/114602)
Bumps [optimum](https://github.com/huggingface/optimum) from 1.19.1 to 1.20.0. Release notes Sourced from optimum's releases. v1.20.0: VITS, Phi-3 ONNX export Extended ONNX export VITS ONNX export by @echarlaix in huggingface/optimum#1607 Phi-3 ONNX export...
I am trying to use a GPU in my custom inference endpoint built using MLServer. I am unable to load the model on the GPU. Can you please let me know if...
## Description This PR aims to fix issue #1747 by serializing `NaN` and `Inf` values to `null` ## Changes Made Configure pydantic base model in order to serialize `nan`...
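To illustrate the idea of mapping non-finite floats to `null`, here is a minimal standard-library sketch. It is not the PR's implementation (which configures the pydantic base model); the helper names `sanitize_non_finite` and `dumps_strict` are hypothetical and exist only for this example.

```python
import json
import math
from typing import Any


def sanitize_non_finite(value: Any) -> Any:
    """Recursively replace NaN and +/-Inf floats with None (hypothetical helper)."""
    if isinstance(value, float) and not math.isfinite(value):
        return None
    if isinstance(value, dict):
        return {key: sanitize_non_finite(item) for key, item in value.items()}
    if isinstance(value, (list, tuple)):
        return [sanitize_non_finite(item) for item in value]
    return value


def dumps_strict(payload: Any) -> str:
    """Serialize to standards-compliant JSON (hypothetical helper)."""
    # allow_nan=False makes json.dumps raise ValueError if a non-finite
    # float slips through, instead of emitting the invalid tokens NaN/Infinity.
    return json.dumps(sanitize_non_finite(payload), allow_nan=False)


if __name__ == "__main__":
    print(dumps_strict({"data": [1.0, float("nan"), float("inf")]}))
```

By default, Python's `json.dumps` writes `NaN` and `Infinity`, which are not valid JSON and can be rejected by strict clients; replacing them with `None` before serialization avoids that.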
Hello everyone, I was exploring using mlserver to deploy ML models as a REST service. I noticed an issue: if you plan to use mlserver with Python and want to...
# Using Pre-existing Python Environments ## Description The current implementation of mlserver only supports specifying a Python environment through a tarball, which is then unpacked before the workers are activated....
Bumps [optimum](https://github.com/huggingface/optimum) from 1.21.2 to 1.21.4. Release notes Sourced from optimum's releases. v1.21.4: Patch release Update Habana extra in setup.py by @regisss in #1991 Full Changelog: https://github.com/huggingface/optimum/compare/v1.21.3...v1.21.4 v1.21.3: Patch release...
Bumps [transformers](https://github.com/huggingface/transformers) from 4.41.2 to 4.42.4. Release notes Sourced from transformers's releases. Patch release v4.42.4 Mostly gemma2 support FA2 softcapping! but also fix the sliding window for long context and...
Bumps [pyspark](https://github.com/apache/spark) from 3.5.1 to 3.5.2. Commits bb7846d Preparing Spark release v3.5.2-rc5 d13808c [SPARK-49099][SQL][FOLLOWUP][3.5] recover tests in DDLSuite f2e2601 [SPARK-49099][SQL] CatalogManager.setCurrentNamespace should respect custom s... b33a3ee [SPARK-48791][CORE][FOLLOW-UP][3.5] Fix regression caused...