infinity icon indicating copy to clipboard operation
infinity copied to clipboard

Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of text-embedding models and frameworks.

Results 144 infinity issues
Sort by recently updated
recently updated
newest added

Hello, First of all thank you very much for this tool! I am trying it out (on CPU) with the following code: ```python DEVICE = os.environ.get("DEVICE", "cpu") MODEL_NAME = 'sentence-transformers/all-MiniLM-L6-v2'...

I was trying out `Snowflake/snowflake-arctic-embed-m-long` with onnx engine ``` infinity_emb --model-name-or-path Snowflake/snowflake-arctic-embed-m-long --revision 08e7a4449e3f07709fb9387bc3172d393a6cc5e2 --engine optimum --device cpu --dtype float32 --batch-size 4 --port 8000 --trust-remote-code ``` But trust_remote_code is not...

### System Info Hey I am working on a custom component in Haystack right now to build an Infinity Reranker. Might turn it into an Haystack Integration if I get...

### Feature request Being able to split models into multiple GPUs, as with vllm/aphrodite engine for LLMs. ### Motivation It would be extremely helpful to be able to split larger...

wontfix

### System Info The full command line used that causes issues: `docker run --rm -p 7997:7997 michaelf34/infinity:latest --model-name-or-path mixedbread-ai/mxbai-embed-large-v1 --port 7997` OS version: macos Model being used: mixedbread-ai/mxbai-embed-large-v1 Hardware used...

### Model description Could you please provide guidance on how to enable API_KEY authentication? Alternatively, is there a plan to implement API_KEY authentication similar to OpenAI's approach? ### Open source...

Can you please support the instructor models here? https://github.com/xlang-ai/instructor-embedding These are arguably the best models for their sizes.

### Model description You have mentioned that sfr-embedding model is supported along with all other huggingface embedding models (ref.nomic). However, both are not working : infinity | ERROR 2024-03-21 14:35:59,554...