feat: Add vision support
- Gemini Pro Vision supported
- GPT-4 Vision supported
- Fixed not reporting to the user when Gemini request error
- Image features are now globally supported in this application


@TheRamU is attempting to deploy a commit to the NextChat Team on Vercel.
A member of the Team first needs to authorize it.
LGTM
Show off
Bot detected the issue body's language is not English, translate it automatically.
Is it possible to select the Google visual model so that the conversation does not use multiple rounds of chat, and only uploads single-discussion conversations, without needing to clear the context every time it is used?
能否在选中谷歌视觉模型的时候,对话不使用多轮聊天,仅上传单次对话,不用每次使用需要清除上下文
Bot detected the issue body's language is not English, translate it automatically.
Can the conversation not use multiple rounds of chat when the Google visual model is selected, and only a single conversation can be uploaded without the need to clear the context each time it is used?
Why are my responses using the vision-preview api incomplete