prompttools icon indicating copy to clipboard operation
prompttools copied to clipboard

Add ingestion harness for vectorDB experiments

Open steventkrawczyk opened this issue 1 year ago • 0 comments

🚀 The feature

We need a way to experiment with different chunking + ingestion strategies. For example, we have some "raw" documents we want to ingest into a vector database, and there are different ways of transforming those "raw" documents into the documents we end up vectorizing. For example, we can ingest them as is, "chunk" them into 10-line chunks, or do other pre-processing to extract keywords and relevant phrases.

Motivation, pitch

Talking to some customers about their needs regarding vector DB evaluation at scale.

Alternatives

No response

Additional context

No response

steventkrawczyk avatar Aug 01 '23 22:08 steventkrawczyk