haystack icon indicating copy to clipboard operation
haystack copied to clipboard

docs: updating DocumentSplitter docstring, adding supported DocumentSores

Open davidsbatista opened this issue 1 year ago • 3 comments

Checklist

davidsbatista avatar Aug 22 '24 09:08 davidsbatista

Pull Request Test Coverage Report for Build 10527111686

Details

  • 0 of 0 changed or added relevant lines in 0 files are covered.
  • 1 unchanged line in 1 file lost coverage.
  • Overall coverage remained the same at 90.21%

Files with Coverage Reduction New Missed Lines %
components/preprocessors/document_splitter.py 1 98.96%
<!-- Total: 1
Totals Coverage Status
Change from base Build 10522538514: 0.0%
Covered Lines: 6975
Relevant Lines: 7732

💛 - Coveralls

coveralls avatar Aug 22 '24 10:08 coveralls

Sorry, we talked about it offline with David.

DocumentSplitter is and should be compatible with each Document Store we support.

Only some specific metadata produced by this component is not supported (and discarded) by some Document Stores. We should express this clearly.

anakin87 avatar Aug 23 '24 08:08 anakin87

I've added a single disclaimer explaining that _split_overlap with Chroma is lost and added the same info to the docs. I would merge it now.

davidsbatista avatar Aug 23 '24 13:08 davidsbatista