inference-operator topic

List inference-operator repositories

kubeai

1.1k
Stars
120
Forks
1.1k
Watchers

AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.