[Feature] Add multimodal support

Open Barshan-Mandal opened this issue 4 months ago • 0 comments

Feature Request

Add video, audio, and image input for better multimodal Q&A. Many PCs can't run ChatRTX because of its requirements, so this kind of support has become essential for day-to-day use. ChatRTX even supports describing YouTube videos.

It would be very good if you supported both multimodal input and output for text, video, audio, and image. These would work on a model-specific basis, but you could also add multimodal support by chaining multiple LLMs so that they work on a prompt sequentially.
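To make the chaining idea concrete, here is a minimal sketch of what I have in mind. It is not GPT4All's API: `Attachment`, `CONVERTERS`, `build_prompt`, `answer`, and the three converter functions are hypothetical names, and the converters are placeholders where real captioning, speech-to-text, or video models would go. Each attachment is first turned into text by a modality-specific model, and the combined text is then passed to an ordinary text LLM.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Attachment:
    kind: str  # "image", "audio", or "video"
    path: str


# Hypothetical stand-ins: a real setup would call a captioning model,
# a speech-to-text model, and a video summarizer here.
def describe_image(path: str) -> str:
    return f"[caption of {path}]"


def transcribe_audio(path: str) -> str:
    return f"[transcript of {path}]"


def summarize_video(path: str) -> str:
    return f"[summary of {path}]"


# Stage 1 of the chain: one converter per modality, each producing text.
CONVERTERS: Dict[str, Callable[[str], str]] = {
    "image": describe_image,
    "audio": transcribe_audio,
    "video": summarize_video,
}


def build_prompt(question: str, attachments: List[Attachment]) -> str:
    """Convert every attachment to text and prepend it to the question."""
    context = []
    for item in attachments:
        converter = CONVERTERS.get(item.kind)
        if converter is None:
            raise ValueError(f"unsupported attachment kind: {item.kind}")
        context.append(f"{item.kind}: {converter(item.path)}")
    return "\n".join(context + [f"Question: {question}"])


def answer(question: str, attachments: List[Attachment],
           text_llm: Callable[[str], str]) -> str:
    """Stage 2 of the chain: hand the text-only prompt to any text LLM."""
    return text_llm(build_prompt(question, attachments))
```

Here `text_llm` can be any function that takes a prompt string and returns a reply, so the last stage could be whichever local model is already loaded. The trade-off is that the text model only sees what the converters wrote down, so detail lost in a caption or transcript cannot be recovered later in the chain.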

Barshan-Mandal, Jul 06 '25 09:07