text-embeddings-inference icon indicating copy to clipboard operation
text-embeddings-inference copied to clipboard

Any plan to support qwen3 reranker model in TEI

Open tinafengfun opened this issue 4 months ago • 4 comments

Model description

HI, TEI. I am working with real-world customer i.e. ISV and startup to deploy RAG system. As qwen3. reranker is becoming popular, customers are asking if TEI can support this serving. Fundamentally this is a decoding model and need extra prompt to initialize the task request which is unlike old-school reranker model.

Open source status

  • [x] The model implementation is available
  • [x] The model weights are available

Provide useful links for the implementation

No response

tinafengfun avatar Aug 04 '25 03:08 tinafengfun

Duplicated. See #643

bgeneto avatar Aug 11 '25 17:08 bgeneto

Hey @tinafengfun, thanks for opening the issue! Indeed we do plan to support it and there are already a couple of PRs adding support for it (I still need to review those), but given that LLM-based reranking is rather new, that implies some core changes, so supporting those might not be straight forward (unless any of the aforementioned PRs do solve those things already). In any case, we'll work on it, and ping back in the relevant issues! 🤗

In the meantime, I'd suggest you to check Sentence Transformers to use Qwen3-Reranker models in the meantime if that helps, in any case, expect those to be available here soon!

alvarobartt avatar Aug 14 '25 09:08 alvarobartt

just added the support in this PR. check it out please https://github.com/huggingface/text-embeddings-inference/pull/695

sigridjineth avatar Oct 04 '25 05:10 sigridjineth

We are in desperate need for TEI to support Qwen3 rerankers.

forrestbao avatar Oct 21 '25 18:10 forrestbao