haystack-core-integrations
haystack-core-integrations copied to clipboard
Add streaming support in LlamaCPPGenerator AND LlamaCPPChatGenerator
Is your feature request related to a problem? Please describe. Llama CPP generators don't support streaming.
Describe the solution you'd like
Implement support for streaming, as done for other generators (see streaming_callback
).
Additional context Drafted in https://github.com/deepset-ai/haystack-core-integrations/pull/723#issuecomment-2099831919