whisper-jax
whisper-jax copied to clipboard
Can this be deployed on replicate.com?
Has anyone deployed it on replicate.com?
I want to use the model via an API but only paying per second of use. Not pay to keep a server running at all times.
Any ideas on how to achieve that on replicate.com or elsewhere, please let me know.
You can use the Hugging Face one for free as far as I'm aware, the queue is usually always 0-2.
I need to deploy a faster version of my own for use in production via Hugging Face, but when I try to duplicate the space to change the hardware, I get this API_KEY setting that I don't know what it is:
And if you leave it blank it just fails to build.
Any idea on how to set it or otherwise on how to deploy whisper-jax in a high performance environemnt that only charges for use time?
Hey! Have you implement any of these so far? Any updates?
Perhaps the Hugging Face space is just a proxy to the real back end, hence the API_URL
secret? Just a guess
@troublesprouter I've managed to deploy it on replicate. But it wasn't as fast as expected because replicate doesn't support TPUs. However, I tried to deploy faster-whisper and it was blazingly fast.
https://replicate.com/alqasemy2020/whisper-jax ignore its name (it's not whisper-jax) ( :
If you want to duplicate the hugging face version have a look at this https://huggingface.co/spaces/sanchit-gandhi/whisper-jax/discussions/38#648b03a7084699ca1535eddc