Vilas Ninawe
Vilas Ninawe
Transcription time varies device to device. On high end device, transcription time will be less. You can debug what is taking more time. Whether it is Mel spectrogram calculation or...
Yes. It is possible. We are working on capturing audio from mic and transcribe in realtime.
@KihongK The primary reason for the low accuracy of the Whisper-Tiny model is due to how the audio data is being segmented. Currently, we are feeding 3-second audio clips without...
Use vad
Can you try with base or small model?
There is issue in post processing code in whisper_java app. But, this problem is not in whisper_native.
Yes. It is there in GitHub project.
Yes. I will update
Multilingual model support transcription and translation. For Transcription, it supports Any -> Any For Translation, it supports Any -> English I see your requirement is transcription. But, the generated multilingual...
Yes. Multilingual model support transcription and translation. For Transcription, it supports Any -> Any For Translation, it supports Any -> English I see your requirement is first one, transcription. But,...