serverless-chat-langchainjs icon indicating copy to clipboard operation
serverless-chat-langchainjs copied to clipboard

Allow updating existing documents

Open sinedied opened this issue 10 months ago • 3 comments

Currently if you upload the same document twice, it will create duplicated chunks of the same documents. Before add a document to the database, we should remove all chunks related to the document using its filename as ID to avoid duplication and allow updates.

Tasks

  • [ ] Remove all chunks with the same filename of uploaded document in POST /documents
    • [ ] for Azure ComosDB for MongoDB vCore path
    • [ ] for FAISS path when running locally

sinedied avatar Apr 09 '24 07:04 sinedied

This issue is stale because it has been open 60 days with no activity. Remove stale label or comment or this issue will be closed.

github-actions[bot] avatar Jun 09 '24 01:06 github-actions[bot]

This issue is stale because it has been open 60 days with no activity. Remove stale label or comment or this issue will be closed.

github-actions[bot] avatar Aug 10 '24 01:08 github-actions[bot]