l0rinc
I just noticed the same while trying to figure out https://github.com/hwchase17/langchain/issues/7427: the code should compute `1 - score` to turn the distance into a similarity
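For illustration, a minimal sketch of that conversion in plain Python. This assumes the scores coming back are cosine *distances* (smaller = closer), as Chroma returns them; the function name is just for illustration:

```python
def distance_to_similarity(scores):
    # Cosine distance is defined as 1 - cosine similarity,
    # so inverting it recovers the similarity (larger = closer).
    return [1 - s for s in scores]

# A distance of 0.0 is a perfect match (similarity 1.0),
# while a distance of 0.8 is a weak match (similarity ~0.2).
print(distance_to_similarity([0.0, 0.2, 0.8]))
```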
Thanks a lot for checking, guys, appreciate it! So the culprit for the mismatched expectations was the OpenAI embeddings. I wonder why the direct Chroma way works so much...
@Bearnardd, @Guidosalimbeni is there a way for me to tip you guys for your help?
Let me return the favor somehow, you guys were really helpful!
These searches are working a lot better now. Just a note that `all-MiniLM-L6-v2` seems to require a lot more memory; the pod was suddenly crashing with OOM.
My understanding is that embeddings and retraining (fine-tuning) are different: if you just want extra info, you can embed; if you want new knowledge or style, you probably need to...
I saw a few posts about it, e.g. https://github.com/nomic-ai/gpt4all/issues/173#issuecomment-1496681937. My understanding is that you can use gpt4all with langchain (https://python.langchain.com/en/latest/modules/models/llms/integrations/gpt4all.html) and use indexes as described in https://python.langchain.com/en/latest/modules/indexes/getting_started.html. I personally have to retrain, since...
Yes, we can use a combination of retraining, fine-tuning, and embedding, each having a different effect. I'm currently fine-tuning one in a Colab Pro+ notebook; it requires >40GB video...
Not (yet) an expert either, but it's probably cheaper and more convenient to rent those GPUs for the duration of the training only; you won't really be able to...
Look what I just found: ~~https://github.com/lxe/simple-llm-finetuner~~ https://github.com/zetavg/LLaMA-LoRA-Tuner. With slight modification you can get a public link in Colab to a UI where you can just add your data and fine-tune...