text-embeddings-inference icon indicating copy to clipboard operation
text-embeddings-inference copied to clipboard

A blazing fast inference solution for text embeddings models

Results 180 text-embeddings-inference issues
Sort by recently updated
recently updated
newest added

### System Info version: text-embedding-inference 1.6.1 OS: ubuntu 24.04 python: 3.12.3 Embedding short (single-word) inputs randomly cause null vectors. ### Information - [x] Docker - [x] The CLI directly ###...

# What does this PR do? Fixes # (issue) ## Before submitting - [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks...

### System Info none ### Information - [ ] Docker - [ ] The CLI directly ### Tasks - [ ] An officially supported command - [ ] My own...

### Model description The model is the top opensource model on the https://huggingface.co/spaces/mteb/leaderboard so please add it. ### Open source status - [x] The model implementation is available - [x]...

### System Info docker ``` docker run \ -d \ --name reranker \ --gpus '"device=0"' \ --env CUDA_VISIBLE_DEVICES=0 \ -p 7863:80 \ -v /data/ai/models:/data \ ghcr.io/huggingface/text-embeddings-inference:86-1.5 \ --model-id "/data/bge-reranker-base" \...

### Model description https://huggingface.co/lier007/xiaobu-embedding-v2 The model has additional Dense,. I translate the model to onnx format by "https://huggingface.co/TencentBAC/Conan-embedding-v1", but it only output 1024 dimensions, but it should be 1792 ###...

### System Info I would like to use jinaii embedding & reranker but not working error "missing field `model_type` at line 51 column 1" King regards, David. ``` (HF_TEI) :~$...

### system environment ``` cargo 1.85.0 (d73d2caf9 2024-12-31) ``` ### build command ``` cargo install --path router -F candle-cuda -F http --no-default-features ``` ### build errors ``` error[E0277]: the trait...

### Feature request I would like to request to have a single docker image for both CPU and GPU cases. This can be done using a combination of Dockerfile and...

### System Info offline and airgapped ENV OS version: rhel8.19 Model: bge-m3 Hardware: NVIDIA GPU T4 Deployment: Kubernetes (kserve) Current version: turing-1.6 ### Information - [x] Docker - [ ]...