RealtimeSTT icon indicating copy to clipboard operation
RealtimeSTT copied to clipboard

Whisper as wakeword detection

Open olegchomp opened this issue 1 year ago • 4 comments

Hi! Is there any easy way to switch wakeword detection from porcupine to whisper?

olegchomp avatar Jan 29 '24 18:01 olegchomp

Not really, I also really wouldn't recommend that. Whisper would have to nonstop transcribe, which would result in high GPU load / energy consumption compare to porcupine. Also it can't compete with a wakeword engine in terms of accuracy and detection speed.

I think for the future switching to openWakeWord could be the right thing to do. Need to do some tests before.

KoljaB avatar Jan 29 '24 19:01 KoljaB

Sadly. As i understand porcupine have only cloud solution for training new words and openwakeword support only EN language.

olegchomp avatar Jan 29 '24 19:01 olegchomp

Oh, didn't know openwakeword only supports english. That's disappointing, we really need a multilingual, trainable wake word solution. I started with a Whisper wake word implementation first but esp accuracy was rather disappointing. I feel currently there is no easy solution to this problem out there.

KoljaB avatar Jan 29 '24 19:01 KoljaB

Yes, we do need multilingualism

sailorsale avatar Feb 02 '24 04:02 sailorsale

OpenWakeword done with v0.2.0 Multilingualism can be done with spelling out the sounds phonetically with underscores separating each part. See OpenWakeword google collab notebook.

KoljaB avatar Jun 30 '24 07:06 KoljaB