cocoindex icon indicating copy to clipboard operation
cocoindex copied to clipboard

[FEATURE] support ChromaDB as a builtin target

Open georgeh0 opened this issue 2 months ago • 6 comments

Similar to LanceDB support. Please make ChromaDB an optional dependency.


❤️ Contributors, please refer to 📙Contributing Guide. Unless the PR can be sent immediately (e.g. just a few lines of code), we recommend you to leave a comment on the issue like I'm working on it or Can I work on this issue? to avoid duplicating work. Our Discord server is always open and friendly.

georgeh0 avatar Oct 23 '25 06:10 georgeh0

Hi @georgeh0! I'd like to work on this ChromaDB integration.

I've reviewed the LanceDB implementation (~450 lines) and understand the pattern. However, before I start, I wanted to check if you'd be open to a phased approach:

Phase 1 (MVP - this PR):

  • Basic ChromaDB connector with core CRUD operations
  • Schema mapping (PyArrow types → ChromaDB metadata)
  • Optional dependency setup
  • Basic tests
  • Estimated scope: ~150-200 lines

Phase 2 (follow-up PR):

  • Vector index support with configurable metrics
  • Advanced indexing options
  • Performance optimizations

This would allow for faster initial integration and incremental improvements. The ChromaDB API is simpler than LanceDB in some ways (no PyArrow conversion needed), so the basic implementation should be more compact.

Would this approach work for you, or would you prefer a complete implementation matching LanceDB's feature set in a single PR?

Looking forward to your guidance!

ambicuity avatar Oct 23 '25 07:10 ambicuity

@ambicuity thanks for taking this! assigned.

georgeh0 avatar Oct 24 '25 05:10 georgeh0

open for take!

badmonster0 avatar Dec 07 '25 05:12 badmonster0

Would like to take this up 😄

Haleshot avatar Dec 10 '25 04:12 Haleshot

Dropping a ping again to be assigned (just for the record); will be working on it anyway!

Haleshot avatar Dec 12 '25 17:12 Haleshot

@Haleshot assigned. Thanks for taking this!

georgeh0 avatar Dec 12 '25 17:12 georgeh0