byaldi issues

Decouple inference and indexing

2

We have infra to host the model and I’m looking to just host an “indexing service” that calls the hosted model instead of running inference “locally”. This decouples the “stateless”...

dconathan

enhancement

Support for searching with query image

9

Hi @bclavie & team, I currently don't see support for searching through an index with a query image instead of a text query. I understand that there is an encode_image...

nuschandra

load the model in parallel gpu

can we load the colpali model in parallel gpu?

bakwankawa

Error : "Document ID 0 with page ID 1 already exists in the index"

1

Hi, the error "Document ID 0 with page ID 1 already exists in the index" happens when I create an index with the same files as a previous one, even...

Leflak

Add metadata filtering support and fix multi-document and metadata issue

6

This PR includes the following changes 1. If we pass multiple metadata for each document in a folder, the `add_to_index` will error out because in the below code, it tries...

Athe-kunal

Ordering of input files nondeterministic, which can assign incorrect doc id, metadata

2

`for i, item in enumerate(list(input_path.iterdir())): ` can return the files in all sorts of ordering styles. I'm not even sure what it is on my local, but it isn't lexical....

dimroc

enhancement

Unable to add Metadata to index

4

When trying to add metadata to an index, either using a list of metadata dicts or a mapping of uid to metadata dict (shown below), it always produces a key...

NMVRodrigues

bug

ColPali Upstream

Added `Colsmolvlm` support

2

In this PR, I’ve added support for [ColSmolVLM](https://huggingface.co/vidore/colsmolvlm-alpha). The code is fully functional, with the only pending issue stemming from the [colpali-engine](https://github.com/illuin-tech/colpali) package, as the latest package version has yet...

sergiopaniego

How to store Colqwen model's embeddings in PG vector

3

Hi, Is it necessary for byaldi to always create a new .byaldi folder and store the embeddings in a folder? Can we store the embeddings in a traditional vector database...

hrishit123

Unexpected Output When Passing Multiple Base64 Images to Qwen2-VL-7B-Instruct Using VLLM

Hi, thank you for this awesome repository. It works really well. However, I’m encountering an issue when sending two base64-encoded images to the Qwen2-VL-7B-Instruct model (served via VLLM with --limit-mm-per-prompt...

sushruthaaaaa0697

byaldi
byaldi copied to clipboard

Metadata

Decouple inference and indexing

Support for searching with query image

load the model in parallel gpu

Error : "Document ID 0 with page ID 1 already exists in the index"

Add metadata filtering support and fix multi-document and metadata issue

Ordering of input files nondeterministic, which can assign incorrect doc id, metadata

Unable to add Metadata to index

Added `Colsmolvlm` support

How to store Colqwen model's embeddings in PG vector

Unexpected Output When Passing Multiple Base64 Images to Qwen2-VL-7B-Instruct Using VLLM

← Metadata

Owner

Metadata

byaldi byaldi copied to clipboard

Metadata

← Metadata

Owner

Metadata

byaldi
byaldi copied to clipboard