Alok Saboo
Alok Saboo
I'm unsure why the docs suggest that Ollama does not support vision. Can Skyvern use the vision capabilities of qwen3-vl?
Right now, we only support ElevenLabs for TTS. It would be nice to extend this to support other free/self-hosted OpenAI API compatible services. A few come to mind: https://github.com/travisvn/chatterbox-tts-api https://github.com/eduardolat/kokoro-web...
I tried different Ollama models, and a few of them don't work (right now I am seeing this error only with `gpt-oss:20b`): I do see the following in the browser...
Can we please add docker compose for this project... thanks
Introduce Pydantic models for structured output schemas related to recipes and nutrition information. Implement validation functions to ensure responses from the AI conform to these schemas, enhancing response integrity and...
### Tandoor Version 2.3.5 ### Setup Docker / Docker-Compose ### Reverse Proxy Nginx Proxy Manager (NPM) ### Other _No response_ ### Bug description When I try to import anything using...
Hi team, I’d like to propose adding native support for [AudioMuse-AI](https://github.com/NeptuneHub/AudioMuse-AI) as an optional backend for audio attribute extraction in Beets. Beets users rely on smart playlists, recommendations, and other...