rag_api icon indicating copy to clipboard operation
rag_api copied to clipboard

feat: Pinecone Vector Database

Open FinnConnor opened this issue 1 year ago • 1 comments

Added Pinecone vector database support. To setup (also in readme) set VECTOR_DB_TYPE=pinecone and PINECONE_API_KEY=<api key>

Tested with Pinecone, openai text-embedding-3-small

  1. Embedding multiple text files
  2. Getting all ids for multiple files
  3. Deleting documents, ensuring not in database
  4. Querying one file, ensuring relevant results to query
  5. Getting documents according to id
  6. Querying multiple documents, ensuring all documents are queried
  7. Change back to Pgvector , to ensure still working

FinnConnor avatar Oct 09 '24 19:10 FinnConnor

IDs per vector assigned as (file_id)_# . Using a prefix file_id with numbering 0,1,2,.... Should not affect naming file_id able to include '_' in file_id such that read my_file_0

FinnConnor avatar Oct 09 '24 19:10 FinnConnor