whisper-rs icon indicating copy to clipboard operation
whisper-rs copied to clipboard

Real-time audio input example

Open bruskajp opened this issue 2 years ago • 2 comments

The whisper.cpp repo has a real time audio input example. I think we should have a version of that in the rust code here.

I'm on my second iteration of it, following similar techniques to those used in the the cpp example code.

How do you feel about this being a part of the whisper-rs repo?

The other packages that I use are: cpal: microphone recording webrtc-vad: voice recognition detection ringbuf: a ring buffer

bruskajp avatar Apr 02 '23 22:04 bruskajp

This would be certainly a very good idea I think. If and when you do make a PR, request a review from me that way I see it quicker.

tazz4843 avatar Apr 04 '23 13:04 tazz4843

This is exactly what I need, I want to add live captioning to VRChat.
I spent all day getting vosk to work but results are poor, Whisper looks promising.

I can't wait for this 😄

EDIT: Does this fork/pull request currently work? it gives me various actions like breathing, but no words.

ShayBox avatar Apr 08 '23 02:04 ShayBox