text-embeddings-inference
text-embeddings-inference copied to clipboard
Multiple Model Endpoint support
Feature request
Hello, My question is that if/how we support multiple models selection by a single api endpoint. For example, I have gte, bge etc. and I can deploy them together with the same url and switch easily among these model?
Motivation
support multiple models
Your contribution
Something like the fastchat controller register