karthink

Results 854 comments of karthink

Could you explain how persistent RAG support is? Do you have to specify the sources for RAG in each chat session, or are the sources/embeddings of sources persistently available to...

There's some disconnect in our understanding of RAG. As I understand it, RAG and embeddings are two independent methods. RAG works as follows: 0. A corpus of (text) documents is...

Closing as it doesn't seem like there's anything to be done on the gptel side.

> The workflow is that when input data is available, it can first find the corresponding information in the vector database and then pass it to the LLM to enhance...

> It's about "pass the corresponding text chunks". And embedding model is use for vector the raw data, and find the similar raw data, before pass the information(text, for example),...

> [@karthink](https://github.com/karthink) Could you tell us a bit more about the bring-your-own-RAG support, and what that will look like from a user perspective? Sure, it's nothing fancy. I'm just carving...

> Your perspective is a bit unusual; embedding is the first step in the RAG > process, but you're looking at it separately. @yibie Perhaps there is some misunderstanding. Are...

There were some recent changes where we set the buffer-file-coding-system in the process buffer for non-streaming requests explicitly to `utf-8-unix`. Could you check if your reported behavior still exists?

@ultronozm Thanks for the PR. I think a better way to do this is to use a processing hook that the user can customize. Then they can include/exclude the Org...

I've added an option `gptel-org-convert-response` that does this, so this PR is no longer required. This is still an interim solution while I figure out how to provide a filtering...