chat-ui
chat-ui copied to clipboard
Analyze uploaded PDF files through OpenAI API
When I upload a PDF file and leverage it, I will get the base64 data. But I didn't find the code to process it in endpoints/openai, while it can handle the image base64 data. Besides, I failed to transfer it back to text. How can I analyze the file through OpenAI API?
What model are you using - I don't think the OpenAI models have native PDF Support do they? I added this for the Anthropic endpoint a couple of months ago.
I would also like to add audio at the same run. Since Gemini supports both and has an openai api this could make sense. Especially when running a litellm proxy