Feature Request: Add Support for Audio and Video File Uploads
Hi assistant-ui team,
First, thanks for the excellent work on assistant-ui and ai-sdk. Both have been a great help in building AI-powered interfaces.
I'm writing to request a feature that I believe would be valuable for many users: support for uploading audio and video files directly within the assistant-ui chat interface.
Currently, uploads of audio and video files such as .mp3 or .mp4 are rejected by internal validation logic (e.g., isAcceptedFile in adapters).
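To make the request concrete, here is a minimal sketch of the kind of accept-list check I mean. It is not the actual assistant-ui implementation: the function name isAcceptedFileSketch, the FileLike type, and the accept strings are all hypothetical, and the matching rules are simply modeled on how the HTML accept attribute treats MIME types and file extensions.

```typescript
// Hypothetical sketch, NOT the real assistant-ui code.
// FileLike mirrors the two fields of a browser File that matter here.
type FileLike = { name: string; type: string };

// Returns true when the file matches any comma-separated pattern in `accept`.
// Supported patterns: ".ext", "type/*", and exact MIME types.
function isAcceptedFileSketch(file: FileLike, accept: string): boolean {
  const trimmed = accept.trim();
  if (trimmed === "" || trimmed === "*") return true;

  const patterns = trimmed
    .split(",")
    .map((p) => p.trim().toLowerCase())
    .filter((p) => p.length > 0);

  const mime = file.type.toLowerCase();
  const name = file.name.toLowerCase();

  return patterns.some((pattern) => {
    if (pattern.startsWith(".")) return name.endsWith(pattern); // extension match
    if (pattern.endsWith("/*")) return mime.startsWith(pattern.slice(0, -1)); // wildcard subtype
    return mime === pattern; // exact MIME type
  });
}

// Assumed accept strings for illustration only.
const imageAndTextOnly = "image/*,text/plain";
const withMedia = "image/*,text/plain,audio/*,video/*";

const mp3: FileLike = { name: "note.mp3", type: "audio/mpeg" };
console.log(isAcceptedFileSketch(mp3, imageAndTextOnly)); // false
console.log(isAcceptedFileSketch(mp3, withMedia)); // true
```

If the real check works along these lines, supporting audio and video might be as small as widening the accepted patterns, although how the file is then passed on to the model is a separate question.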
While there are workarounds (like pre-uploading files to a server and sending a URL as text), native support for common audio/video formats would greatly simplify the user experience for applications involving:
- Audio/video content analysis or summarization.
- Speech-to-text conversion triggered via AI.
- General multimodal interactions where audio/video context is needed.
Many modern AI models (including some from OpenAI, Anthropic, and others) are increasingly capable of processing audio and video inputs directly. Having assistant-ui support these file types would align well with this trend and enable more seamless integrations.
Could you please share whether there are any plans to add support for audio and video file uploads in the near future? Understanding the roadmap, or the reasoning behind the current file type restrictions, would be very helpful.
Thanks again for your time and consideration!