ai-jsx icon indicating copy to clipboard operation
ai-jsx copied to clipboard

Noise cancellation on top of Voice ASR

Open anunayajoshi opened this issue 2 years ago • 3 comments

Love what @juberti is building with fixie voice, and the open-source benchmarking sites for ASR and TTS. Something I personally want to test is how noise cancellation algorithms like Picovoice's Koala affects these ASR's transcription, and I think that benchmarking site would be a good place for me to contribute and test that out.

Just wanted to check with @juberti if that's a welcome addition to the site for me to contribute? Not sure how exactly you'd imagine the layout, but I'd like to compare the transcriptions both with and without the noise cancellation going in. If you'd want it as a separate page or a checkbox on top of the ASR to enable noise cancellation? Happy to discuss!

anunayajoshi avatar Nov 08 '23 07:11 anunayajoshi

Sure, that would be an interesting thing to try. Note that the browser is already performing some ANR (via either WebRTC or built-in device NR as on iPhone) so the effects of an additional NR stage may be modest.

juberti avatar Nov 09 '23 06:11 juberti

One area where additional research could be quite useful is on VAD algorithms. Currently we use the WebRTC VAD in the ASR component, but it's pretty old and quite prone to false positives. I bet we would get much better perf from something like Silero VAD.

juberti avatar Nov 09 '23 07:11 juberti

Regardless, please feel free to contribute any interesting improvements!

juberti avatar Nov 09 '23 07:11 juberti