[Feature]: Support for more ASR services?
The Feature
Popular ASR services that should be included - Deepgram, AssemblyAI, Google ASR, Self hosted whisper should be supported in litellm.
Motivation, pitch
These are popular ASR systems used widely, would be good to have support for these in LiteLLM.
Twitter / LinkedIn details
No response
which are you planning on using today?
we are using all 4 in our production, essentially it's a fallback mechanism for a resilient system (there are several cases when one of the service throttles/fails due to whatever reason and needs a backup), in this order - self hosted whisper, followed by deepgram, and then followed by google ASR as the last in order. Assembly AI is an exception as it has excellent timestamping (probably the best out there) of all the services out there, so we use it for all our async usecases. The other 3 mentioned before it are used more for streaming usecases. @krrishdholakia
interesting - why do you want this on litellm, if you already have it working?
multiple reasons,
- common api gateway interface for all ASRs, makes code cleaner?
- no need to handle hardcoded fallbacks on clients, rather have them controlled via litellm serverside?
+1 Waiting on Deepgram to be added before we can really review the idea of litellm in production.
Hi @LiamSystems can we hop on a call to learn what you need to use LiteLLM in production ? I'd love to unblock you
my cal for your convenience: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat my linkedin if you prefer DMs: https://www.linkedin.com/in/reffajnaahsi/
+1 for Deepgram for STT and TTS, might also want to consider ElevenLabs for TTS. Deepgram has features like diarization, entities detection, PII redaction, etc... and supports direct S3 access with resigned URLs with callback, and is extremely quick.
+1 for ElevenLabs and Deepgram
+1 deepgram
+1 deepgram and gladia
Working on deepgram as a v0.
Deepgram seems like they have a couple different endpoints here. Would appreciate help making sure our implementation is good here.
Working on an initial version for today's release.