llama.cpp icon indicating copy to clipboard operation
llama.cpp copied to clipboard

Feature Request: support embedding stella_en_400M and stella_en_400M.gguf conversion

Open raymond-infinitecode opened this issue 1 year ago • 4 comments

Prerequisites

  • [X] I am running the latest code. Mention the version if possible as well.
  • [X] I carefully followed the README.md.
  • [X] I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
  • [X] I reviewed the Discussions, and have a new and useful enhancement to share.

Feature Description

Need help supporting stella_en_400M, observed that we have embedding model https://ollama.com/Losspost/stella_en_1.5b_v5

but there I couldn't convert stella_en_400M myself

Model Download: https://hf.rst.im/dunzhang/stella_en_400M_v5

D:\llama.cpp>python convert_hf_to_gguf.py d:/llama.cpp/stella_en_400M_v5 --outfile stella_en_400M.gguf --outtype q8_0 INFO:hf-to-gguf:Loading model: stella_en_400M_v5 ERROR:hf-to-gguf:Model NewModel is not supported

Motivation

To have better embedding model

Possible Implementation

No response

raymond-infinitecode avatar Aug 27 '24 15:08 raymond-infinitecode

related models, i believe:

https://huggingface.co/Alibaba-NLP/gte-large-en-v1.5 https://huggingface.co/Alibaba-NLP/gte-base-en-v1.5

electroglyph avatar Aug 28 '24 11:08 electroglyph

Please! These are the SoTA for performance:resource ratio! @ggerganov we want to be able to have robust local retrieval models!

anishjain123 avatar Sep 22 '24 13:09 anishjain123

Has anyone had any luck creating a GGUF version of stella_en_400M_v5? I've had a go but wasn't successful.

sammcj avatar Sep 30 '24 22:09 sammcj

stella_en_400M_v5 is derived from GTE based model which is not officially supported by llama.cpp / ollama, probably that's why no people manage to support that.

raymond-infinitecode avatar Oct 02 '24 12:10 raymond-infinitecode

Same issue persists for Alibaba-NLP/gte-multilingual-base. Any updates on this ? @ggerganov

devrimcavusoglu avatar Oct 18 '24 12:10 devrimcavusoglu

This issue was closed because it has been inactive for 14 days since being marked as stale.

github-actions[bot] avatar Dec 03 '24 01:12 github-actions[bot]