[QUESTION] Protocol for adding a new model implementation (Stella_en_<*>_v5 family) to Candle
Hi,
I have a working implementation of the Stella_en_<*>_v5 family of models, which are among the top-ranking models on the MTEB leaderboard for reranking and retrieval.
It's basically built on top of the candle-transformers::qwen2 implementation with the language modeling head swapped for their pre-trained dense layer.
I was hoping to open a pull request with candle along with an example.
Questions:
- Would it be OK?
- If it is, is there a protocol to follow for candle_transformers? Something akin to "How to add a model to transformers"?
- For embedding models, are there any standard candle API implementations required? My implementation just outputs the logits from the forward pass.
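To make the head swap concrete, here is a rough, dependency-free sketch in plain Rust. It does not use candle and is not the actual implementation: the `DenseHead` type, the masked mean pooling, the L2 normalization, and all function names are assumptions made purely for illustration. It only shows the general shape of taking the backbone's per-token hidden states, pooling them, and passing the result through a dense layer instead of a language modeling head.

```rust
/// Hypothetical dense head computing y = W x + b.
/// `weight` is stored as `out_dim` rows of length `in_dim`.
struct DenseHead {
    weight: Vec<Vec<f32>>,
    bias: Vec<f32>,
}

impl DenseHead {
    /// Apply the dense projection to a single pooled vector.
    fn forward(&self, x: &[f32]) -> Vec<f32> {
        self.weight
            .iter()
            .zip(&self.bias)
            .map(|(row, b)| row.iter().zip(x).map(|(w, v)| w * v).sum::<f32>() + b)
            .collect()
    }
}

/// Average the hidden states of tokens whose mask entry is 1
/// (assumed pooling strategy; padding tokens are ignored).
fn mean_pool(hidden: &[Vec<f32>], mask: &[u8]) -> Vec<f32> {
    let dim = hidden.first().map_or(0, |h| h.len());
    let mut pooled = vec![0.0f32; dim];
    let mut count = 0.0f32;
    for (h, &m) in hidden.iter().zip(mask) {
        if m == 1 {
            for (p, v) in pooled.iter_mut().zip(h) {
                *p += v;
            }
            count += 1.0;
        }
    }
    if count > 0.0 {
        for p in pooled.iter_mut() {
            *p /= count;
        }
    }
    pooled
}

/// Scale a vector to unit L2 norm; a zero vector is returned unchanged.
fn l2_normalize(v: &[f32]) -> Vec<f32> {
    let norm = v.iter().map(|x| x * x).sum::<f32>().sqrt();
    if norm == 0.0 {
        v.to_vec()
    } else {
        v.iter().map(|x| x / norm).collect()
    }
}

/// Pool the backbone output, project it with the dense head, and normalize.
fn embed(hidden: &[Vec<f32>], mask: &[u8], head: &DenseHead) -> Vec<f32> {
    l2_normalize(&head.forward(&mean_pool(hidden, mask)))
}
```

In a candle version, the hidden states would presumably come from the qwen2 backbone's forward pass and the dense layer would be loaded from the pre-trained weights; whether pooling and normalization belong inside the model or in the example is part of what the second question is asking.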
Looking for some guidance. Thanks in advance.
CC: @EricLBuehler @LaurentMazare