vector-admin

What embeddings model is used for the uploaded documents?

Open algsupport opened this issue 2 years ago • 6 comments

I saw the project on YouTube. It seems excellent.

I was wondering: when a new document is uploaded, which embeddings model is used to convert it?

Is it possible to choose a custom one? If so, how can it be done?

Thank you.

algsupport avatar Oct 02 '23 09:10 algsupport

Custom embedding models aren't supported at this time, simply because we haven't expanded the scope yet. Right now it's just the standard 1536-dimension text-embedding-ada-002 model by OpenAI. Obviously not everyone uses that, and if you tried to edit or add a doc with embeddings whose dimensions don't match, it would stop you from doing so.

Are you using a Hugging Face model for embeddings?
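
To illustrate the dimension check mentioned above, here is a minimal Python sketch. It is not vector-admin's actual code: the names `EXPECTED_DIMENSIONS`, `DimensionMismatchError`, and `validate_embedding` are hypothetical, and the only value taken from this thread is the 1536-dimension size.

```python
# Dimensionality quoted in this thread for the current OpenAI embeddings.
EXPECTED_DIMENSIONS = 1536


class DimensionMismatchError(ValueError):
    """Raised when a vector does not match the collection's dimensionality."""


def validate_embedding(
    vector: list[float], expected: int = EXPECTED_DIMENSIONS
) -> list[float]:
    # Hypothetical helper: reject any vector whose length differs from the
    # dimensionality the existing collection was created with.
    if len(vector) != expected:
        raise DimensionMismatchError(
            f"expected {expected} dimensions, got {len(vector)}"
        )
    return vector
```

A vector from a smaller model (for example, one with 384 dimensions) would raise `DimensionMismatchError` here, which is roughly why mixing embedding models inside one collection does not work.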

timothycarambat avatar Oct 02 '23 16:10 timothycarambat

Yes, I would like to use a Hugging Face model if possible. text-embedding-ada-002 works too, but it would be more convenient to be able to select the embeddings model.

Would you mind if I try to add it myself? (Of course, I will make a pull request if I succeed.) Could you point me towards the part of the code that is responsible for the embeddings CRUD?

Thank you

algsupport avatar Oct 02 '23 17:10 algsupport

Embedding is used in several areas (currently, a job for each vector DB).

Search for wherever openAi.embedTextChunk or openAi.embedTextChunks is called. Those are the only places embeddings are currently used!
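
One way to make those call sites model-agnostic would be to put them behind a small interface. The Python sketch below is only an illustration under that assumption and is not how vector-admin is implemented: `Embedder`, `HashEmbedder`, and `embed_document` are hypothetical names, and the method names simply mirror embedTextChunk and embedTextChunks from this thread. `HashEmbedder` is a toy stand-in whose vectors carry no semantic meaning.

```python
import hashlib
from typing import Protocol, Sequence


class Embedder(Protocol):
    """Hypothetical interface each embedding backend would implement."""

    dimensions: int

    def embed_text_chunk(self, chunk: str) -> list[float]: ...

    def embed_text_chunks(self, chunks: Sequence[str]) -> list[list[float]]: ...


class HashEmbedder:
    """Toy backend for demonstration only; NOT a real embedding model."""

    def __init__(self, dimensions: int = 8) -> None:
        self.dimensions = dimensions

    def embed_text_chunk(self, chunk: str) -> list[float]:
        digest = hashlib.sha256(chunk.encode("utf-8")).digest()
        # Cycle through the digest bytes and scale each one into [0, 1].
        return [digest[i % len(digest)] / 255 for i in range(self.dimensions)]

    def embed_text_chunks(self, chunks: Sequence[str]) -> list[list[float]]:
        return [self.embed_text_chunk(chunk) for chunk in chunks]


def embed_document(embedder: Embedder, chunks: Sequence[str]) -> list[list[float]]:
    # Call sites depend only on the interface, so swapping the backend
    # does not require touching each vector DB job.
    vectors = embedder.embed_text_chunks(chunks)
    if any(len(vector) != embedder.dimensions for vector in vectors):
        raise ValueError("backend returned a vector of unexpected size")
    return vectors
```

With this shape, each vector DB job would receive an `Embedder` instead of calling the OpenAI helper directly, and the declared `dimensions` could be compared with the existing collection before anything is written.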

timothycarambat avatar Oct 02 '23 19:10 timothycarambat

Can we have support for Hugging Face embeddings as well? Or is that not possible?

andsty avatar Nov 28 '23 14:11 andsty

You can, but it is supported via LocalAI and not via the HuggingFace API directly. Is that what you are using for embedding currently?
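
For context, LocalAI exposes an OpenAI-compatible HTTP API, so an embeddings request to it has the same shape as one sent to OpenAI. The Python sketch below only illustrates that request shape and is not vector-admin's code: the base URL, the model name, and both function names are assumptions that depend on how your LocalAI instance is configured.

```python
import json
import urllib.request


def build_embeddings_request(
    base_url: str, model: str, chunks: list[str]
) -> urllib.request.Request:
    # OpenAI-compatible embeddings endpoint; `model` must match a model
    # name configured in the LocalAI instance (an assumption here).
    payload = json.dumps({"model": model, "input": chunks}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/v1/embeddings",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def fetch_embeddings(request: urllib.request.Request) -> list[list[float]]:
    # Sends the request; requires a running server at the given URL.
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    # OpenAI-style responses return one {"embedding": [...]} item per input.
    return [item["embedding"] for item in body["data"]]
```

For example, `fetch_embeddings(build_embeddings_request("http://localhost:8080", "my-embedding-model", ["hello"]))` would return one vector per input chunk, where both the address and the model name are placeholders for your own setup.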

timothycarambat avatar Nov 29 '23 17:11 timothycarambat

Are there any plans to add this feature? Since this post, OpenAI has also released another embeddings model. More specialized embedding models are also being developed by companies such as Voyage AI, and we'd like to be able to use those too.

Thanks!

hasani114 avatar Jul 14 '24 17:07 hasani114