
A blazing fast inference solution for text embeddings models

Results: 180 text-embeddings-inference issues, sorted by recently updated

### Feature request Hello! My question is whether and how we can support selecting among multiple models through a single API endpoint. For example, I have gte, bge, etc., and I can deploy...
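Since TEI serves a single model per instance, one common workaround is a thin client-side router that maps a model name to a dedicated TEI instance. A minimal sketch follows; the model names, ports, and the `TEI_INSTANCES` mapping are assumptions for illustration, not TEI features:

```python
# Hypothetical client-side router: one TEI instance per model.
# The model-to-URL mapping and ports below are assumptions.
import requests

TEI_INSTANCES = {
    "bge": "http://localhost:8080",  # e.g. a TEI container serving a bge model
    "gte": "http://localhost:8081",  # e.g. a TEI container serving a gte model
}

def embed(model: str, texts: list[str]) -> list[list[float]]:
    """Route an embed request to the TEI instance serving `model`."""
    base_url = TEI_INSTANCES[model]
    resp = requests.post(f"{base_url}/embed", json={"inputs": texts})
    resp.raise_for_status()
    return resp.json()  # one embedding vector per input string
```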

### Feature request The repo should add a CONTRIBUTING.md ### Motivation To help new contributors ### Your contribution I can help with adding one if there is a widely used...

### Feature request Does it make sense for TEI to add a cache layer for embeddings? Not sure if TEI supports this already. If not, I'd be curious if it...
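Whether TEI itself caches is exactly the open question here; in the meantime, a cache can live on the client side. A minimal sketch, assuming a TEI instance at `http://localhost:8080` and keying on a hash of the input text:

```python
# Minimal client-side embedding cache; TEI is treated as cache-unaware.
import hashlib
import requests

_cache: dict[str, list[float]] = {}

def cached_embed(text: str, base_url: str = "http://localhost:8080") -> list[float]:
    key = hashlib.sha256(text.encode("utf-8")).hexdigest()
    if key not in _cache:
        resp = requests.post(f"{base_url}/embed", json={"inputs": text})
        resp.raise_for_status()
        # /embed returns a list of vectors, one per input string
        _cache[key] = resp.json()[0]
    return _cache[key]
```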

### Feature request In a concurrent scenario, I tried reducing the number of batching tasks so that each embed call runs with a larger batch size, which improves inference performance. In the single-concurrency...
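The request above concerns TEI's internal batching task, but larger batches can also be formed on the caller's side by sending many inputs in a single `/embed` request. A sketch under that assumption (the base URL and `batch_size` are illustrative):

```python
# Client-side batching sketch: send many texts per /embed request
# so the server can process them as one batch.
import requests

def embed_batch(texts: list[str], base_url: str = "http://localhost:8080",
                batch_size: int = 32) -> list[list[float]]:
    embeddings: list[list[float]] = []
    for i in range(0, len(texts), batch_size):
        chunk = texts[i:i + batch_size]
        resp = requests.post(f"{base_url}/embed", json={"inputs": chunk})
        resp.raise_for_status()
        embeddings.extend(resp.json())
    return embeddings
```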

### Feature request Support Volta GPUs ### Motivation Support Volta GPUs ### Your contribution ....

### Model description Hello! Thanks for this great work :) Previously, I implemented [mpnet-rs](https://github.com/NewBornRustacean/mpnet-rs) and found a related issue (feature request), #33. If there is no ongoing work for the mpnet...

### System Info Colab Pro T4 ### Information - [ ] Docker - [X] The CLI directly ### Tasks - [X] An officially supported command - [ ] My own...

# What does this PR do? Enable CPU device for the Python backend Fixes # (issue) ## Before submitting - [ ] This PR fixes a typo or improves the docs...

### Feature request When attempting to use the reranker model [mxbai-rerank-large-v1](https://huggingface.co/mixedbread-ai/mxbai-rerank-large-v1) from Hugging Face with TEI, I got the following error message: ``` Error: Could not create backend Caused by: Could...

### Feature request Add a CLI option to auto-format input text with the config_sentence_transformers.json prompt settings (if provided) before tokenizing. ### Motivation A lot of models now expect a prompt prefix, so...
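Until such a CLI option exists, callers can apply the prompt prefix themselves before hitting TEI. A sketch of that client-side behavior, assuming the file follows the sentence-transformers convention of a `"prompts"` map plus an optional `"default_prompt_name"` (whether a given model's config actually has these keys is an assumption):

```python
# Client-side version of the requested behavior: prepend the prompt from
# config_sentence_transformers.json before calling TEI's /embed endpoint.
# The "prompts" / "default_prompt_name" layout is assumed, per the
# sentence-transformers convention.
import json
import requests

with open("config_sentence_transformers.json") as f:
    cfg = json.load(f)

prompts = cfg.get("prompts", {})
default_name = cfg.get("default_prompt_name")

def embed_with_prompt(text: str, prompt_name: str | None = None,
                      base_url: str = "http://localhost:8080") -> list[float]:
    name = prompt_name or default_name
    prefix = prompts.get(name, "") if name else ""
    resp = requests.post(f"{base_url}/embed", json={"inputs": prefix + text})
    resp.raise_for_status()
    return resp.json()[0]
```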