chainlit multimodal conversation support

Open fire opened this issue 9 months ago • 1 comments

Is your feature request related to a problem? Please describe.

A real assistant would not only converse through text but could also speak and work with video and images.

Describe the solution you'd like

Support text, images, audio, and video.

Describe alternatives you've considered

Use OpenAI's ChatGPT.

Additional context

I know that multimodal AI models are still a challenge for FOSS tooling.

May 15 '24 16:05 fire

See https://huggingface.co/vonjack/Hermes-2-Pro-BakLLaVA-Mistral-7B

May 15 '24 16:05 fire

Did you check https://github.com/Chainlit/cookbook/tree/main/audio-assistant ?

May 29 '24 08:05 willydouhard

Hello, check the Multi-modality section of the documentation to see how to include sound and files.

Sep 13 '24 12:09 ModEnter