alkemio icon indicating copy to clipboard operation
alkemio copied to clipboard

VC: Smart Embeddings

Open bobbykolev opened this issue 8 months ago • 0 comments

Description

We need smarter ingestion support for files and callouts to improve the quality of the results.

Must have scope

  • [ ] Preserve the context when splitting chunks.
  • [ ] Gather more context when reading chunks.

Improved handling of formats:

  • [ ] Improve Excel ingestion - extract in separate epic

Architectural roadmap epic:

  • [ ] Use elastic search ingestion instead of VectorDB?

Investigate multiple embeddings. Context increase, tokens limit?

Additional context

About both better ingesting and how the chunks are returned

bobbykolev avatar Jun 13 '24 12:06 bobbykolev