fastembed icon indicating copy to clipboard operation
fastembed copied to clipboard

Support BAAI/bge-m3

Open hogesako opened this issue 1 year ago • 16 comments

Requesting support for BAAI/bge-m3. Thanks.

hogesako avatar Feb 03 '24 08:02 hogesako

[RESOLVED] Blocked on https://github.com/huggingface/transformers/issues/28859

NirantK avatar Feb 05 '24 06:02 NirantK

there's already the onnx version, is it compatible with fastembed ?

x4080 avatar Feb 06 '24 02:02 x4080

@NirantK Can I just add "Qdrant/bge-m3-onnx-Q" into models.json (with correct json formatting) ?

x4080 avatar Feb 07 '24 21:02 x4080

Hey! That model has some loading issues — hence I've not integrated it yet. Somebody from the team should delete that soon from HF as well.

NirantK avatar Feb 08 '24 03:02 NirantK

@NirantK I see thanks

x4080 avatar Feb 08 '24 21:02 x4080

@NirantK is there anything external contributor can do to make this happen? seems like this would be nice first PR 😅

yamalight avatar Feb 19 '24 16:02 yamalight

I see that the block on the transformer library was solved on new releases.

I wonder if part of the loading issues are related to the support from the model of the three kinds of vectors: sparse, dense and multi-vent (colBERT).

elsatch avatar Apr 02 '24 01:04 elsatch

Hey @elsatch - yes, the 2 linear models for Sparse and multi-vec need to have an ONNX export and loader as well. That's something we've to do for now. We can support the dense model without friction but I'd rather avoid the confusion and support all 3 vectors when we support BGE-M3.

NirantK avatar Apr 02 '24 09:04 NirantK

Any news on this?

davide445 avatar Aug 14 '24 21:08 davide445

Any update?

IneffableBunch avatar Aug 26 '24 20:08 IneffableBunch

Any update?

seetimee avatar Sep 24 '24 05:09 seetimee

Hey, thank you for waiting, we're going to start working on it soon

joein avatar Sep 24 '24 07:09 joein