# infinity
Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of text-embedding models and frameworks.
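For reference, a minimal sketch of querying a running Infinity server from Python, assuming the OpenAI-compatible `/embeddings` route and the default port 7997; adjust the URL and model id to match your deployment.

```python
import requests

resp = requests.post(
    "http://localhost:7997/embeddings",
    json={
        "model": "BAAI/bge-small-en-v1.5",  # a model served by this instance
        "input": ["Infinity serves embeddings over REST."],
    },
    timeout=30,
)
resp.raise_for_status()
# The response follows the OpenAI embeddings schema: data[i].embedding
vector = resp.json()["data"][0]["embedding"]
print(len(vector))
```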
### System Info
RunPod template
### Information
- [ ] Docker + cli
- [ ] pip + cli
- [ ] pip + usage of Python interface
### Tasks...
## Related Issue
## Checklist
- [ ] I have read the [CONTRIBUTING](https://github.com/michaelfeil/infinity/tree/main?tab=readme-ov-file#contribute-and-develop) guidelines.
- [ ] I have added tests to cover my changes.
- [ ] I have...
### Feature request
There have been discussions about getting decent performance from ColBERT-style models used as rerankers (e.g. https://www.answer.ai/posts/2024-09-16-rerankers.html), and it would be useful if the rerank endpoint could...
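For context, ColBERT-style reranking scores a query against a document with late interaction (MaxSim over token embeddings) rather than a single cross-encoder logit. A minimal sketch of that scoring, assuming L2-normalised token embeddings as inputs:

```python
import numpy as np

def maxsim_score(q_tokens: np.ndarray, d_tokens: np.ndarray) -> float:
    """Late-interaction (MaxSim) score.

    q_tokens: (num_query_tokens, dim), d_tokens: (num_doc_tokens, dim),
    each row an L2-normalised token embedding.
    """
    sim = q_tokens @ d_tokens.T          # (num_query_tokens, num_doc_tokens) cosine similarities
    return float(sim.max(axis=1).sum())  # max over doc tokens, summed over query tokens
```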
### System Info
infinity onnx image, latest
### Information
- [ ] Docker + cli
- [ ] pip + cli
- [ ] pip + usage of Python interface...
### Feature request
Models like https://huggingface.co/BAAI/bge-m3 and https://huggingface.co/jinaai/jina-embeddings-v3 can take extra kwargs as input to the `encode` function, such as `task=...` for Jina v3 or `return_dense=False/True` for bge-m3. It would...
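For reference, this is roughly how the underlying models accept such kwargs when called directly (a sketch assuming the FlagEmbedding package is installed); the request is for Infinity to forward these per-request parameters.

```python
from FlagEmbedding import BGEM3FlagModel

# BGE-M3 exposes model-specific flags on encode(); jina-embeddings-v3
# similarly accepts a task=... argument in its custom encode.
model = BGEM3FlagModel("BAAI/bge-m3")
out = model.encode(
    ["What does Infinity serve?"],
    return_dense=True,    # dense sentence embedding
    return_sparse=False,  # skip lexical weights here
)
print(out["dense_vecs"].shape)
```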
### Feature request
Is there a way to receive the embeddings back in BQ (binary quantization) format? Right now, I receive the full-precision embedding and quantize it in the client, but...
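A sketch of the client-side binary quantization described above, assuming a float32 embedding returned by the API: threshold each dimension at zero and pack the bits, shrinking `dim` float32 values to `dim / 8` bytes.

```python
import numpy as np

embedding = np.random.randn(1024).astype(np.float32)  # stand-in for an API response vector
binary = np.packbits(embedding > 0)                    # 1024 floats -> 128 uint8 bytes
print(binary.shape, binary.dtype)                      # (128,) uint8
```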
### System Info
latest, any platform
### Information
- [ ] Docker + cli
- [ ] pip + cli
- [ ] pip + usage of Python interface
###...
### Model description
I have a custom SentenceTransformer model that is a custom class (and also quite nested), so at the top level the modules.json file looks like ``` [...
@tjtanaa FYI, continued by merging your branch into this and main.
There is a need to add a contribution.md file, so that anyone who wants to contribute has an idea of what steps to follow. I want to work...