haystack-core-integrations

Additional packages (components, document stores and the likes) to extend the capabilities of Haystack version 2.0 and onwards

Results 229 haystack-core-integrations issues

## Summary and motivation
[Chonkie](https://github.com/chonkie-inc/chonkie) is a chunking library. With [release 1.2](https://github.com/chonkie-inc/chonkie/releases/tag/v1.2.0), Chonkie comes with [8 chunkers](https://github.com/chonkie-inc/chonkie?tab=readme-ov-file#chunkers) that could all be made available in a Haystack integration. In particular, SemanticChunker...

new integration
P3

**Describe the bug** When using `ChromaDocumentStore` from the Haystack integrations, the `write_documents` method does not support list values in metadata fields; it supports only str, int, float, and bool types. However, during...

integration:chroma
P3
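As a rough illustration of the Chroma metadata limitation described above, the sketch below converts unsupported metadata values into JSON strings before documents are written. The helper name `sanitize_meta` is hypothetical and is not part of Haystack or the Chroma integration; it assumes only the four supported types named in the issue (str, int, float, bool).

```python
import json
from typing import Any, Dict

# Types the issue reports as supported in Chroma metadata fields.
SUPPORTED_TYPES = (str, int, float, bool)


def sanitize_meta(meta: Dict[str, Any]) -> Dict[str, Any]:
    """Hypothetical helper: JSON-encode any value that is not a supported type."""
    clean: Dict[str, Any] = {}
    for key, value in meta.items():
        if isinstance(value, SUPPORTED_TYPES):
            clean[key] = value
        else:
            # Lists, dicts, and None become JSON strings; values that are
            # not JSON-serializable raise TypeError here.
            clean[key] = json.dumps(value)
    return clean


print(sanitize_meta({"title": "intro", "page": 3, "tags": ["a", "b"]}))
```

A workaround like this trades queryability for compatibility: the encoded list is stored as one string, so filtering on individual list elements would still not work.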

We have expanded `StreamingChunk` to include new fields such as

```python
content: str
meta: Dict[str, Any] = field(default_factory=dict, hash=False)
component_info: Optional[ComponentInfo] = field(default=None)
# All fields below here are new...
```

feature request
P3

Refer to the parent issue for more details.

feature request
P3

**Describe the bug** When using the PgvectorDocumentStore as stated in [the pgvector documentation](https://github.com/pgvector/pgvector?tab=readme-ov-file#troubleshooting):
> The query needs to have an ORDER BY and LIMIT, and the ORDER BY...

bug
P3
integration:pgvector

Original issue in Haystack: https://github.com/deepset-ai/haystack/issues/9505
Example implementation in Haystack: https://github.com/deepset-ai/haystack/pull/9693 (using `dataclasses.replace`)
Potentially affected integrations:
- [ ] Bedrock
- [ ] Cohere
- [ ] FastEmbed
- [ ]...

contributions wanted!
P2
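The entry above points to an example implementation built on the standard library's `dataclasses.replace`. The sketch below illustrates only that function: it returns a new dataclass instance with selected fields changed and leaves the original untouched. The `Doc` class and the `add_language` function are hypothetical stand-ins, not Haystack's own types or the code from the linked pull request.

```python
from dataclasses import dataclass, field, replace
from typing import Any, Dict


@dataclass
class Doc:
    """Hypothetical stand-in for a document-like dataclass."""
    content: str
    meta: Dict[str, Any] = field(default_factory=dict)


def add_language(doc: Doc, language: str) -> Doc:
    # Build a fresh meta dict and return a copy of the dataclass,
    # so the caller's object is not modified.
    return replace(doc, meta={**doc.meta, "language": language})


original = Doc(content="hello", meta={"source": "a.txt"})
updated = add_language(original, "en")

print(original.meta)  # the input keeps its original metadata
print(updated.meta)   # the copy carries the added key
```

Note that `replace` makes a shallow copy, which is why the sketch builds a new `meta` dict instead of updating the existing one in place.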

Refer to the parent issue for details. Relevant docs: https://openrouter.ai/docs/use-cases/reasoning-tokens

P2
integration:openrouter

Refer to the parent issue for details. Relevant docs: https://docs.mistral.ai/capabilities/reasoning/ To be investigated. Might require major adaptations.

P3
integration:mistral

Haystack's `finish_reason` is used both in `StreamingChunk` and `ChatMessage.meta["finish_reason"]` to indicate why the text generation stopped. Google GenAI uses `STOP` as a general-purpose finish reason, for both normal text completions...

P2
integration:google-genai

## Summary and motivation
S3 Vectors is the latest KB solution from AWS. Although in preview, this would be a great addition to the existing DocStore integrations.
## Detailed design...

contributions wanted!
new integration
P3