vosk-api
vosk-api copied to clipboard
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Hello, i have created a basic compose multiplatform desktop app and i have a LaunchedEffect(Unit) { ... } (launched effect just launches the code as a coroutine when the UI...
Hi! Great project, especially excited about the gpu support. But i have a question, is it possible to use something like PartialResult() when working on gpu (rtx2080ti, cuda12.3), as it...
Functions in C for partial results in gpu batch mode with a minimal example
Hello my dear, my name is Thiago, I am a master's student in computer science, currently focusing on software engineering. However, in the past few weeks, I have been studying...
I can't load the vosk model and it doesn't give me an error. I did some editing to find the error. ``` def open_dll(): try: dlldir = os.path.abspath(os.path.dirname(__file__)) if sys.platform...
We are using it on Android devices. Initially, we used the webSocket server. If the speaker is far away from the device, about 1 meter away, it will be difficult...
Hi. I have been developing this [free transcription website](https://fastranscribe.com) using the model `vosk-model-en-us-0.42-gigaspeech`, so it should accept only English videos, however, I've noticed some people sending videos in Portuguese, Spanish,...
fun recognizeSpeech(recordedAudio: ByteArray): String { // Assuming VoskModel is a function to load the model from assets val model = VoskModel.loadModel("assets/final.mdl") // Check Vosk documentation for the appropriate recognizer constructor...
hello, i'm new to speech recognition, vosx and python, but i want to translate speech from a simple video i downloaded from the internet (and later even tts'ing to my...
Hi, dear author, setting grammar in Vosk is very useful, so I copy UpdateGrammarFst() to Kaldi and test it with open source chain model(http://kaldi-asr.org/models/m13), but the result is bad( I...