speech_recognition Add flush_flag to listener to flush the recorded audio immediately without waiting for the phrase to complete.

Open sreekanthputta opened this issue 6 months ago • 2 comments

I am working on a real-time speech-to-text application and have run into an issue. When the user is done talking, the recognizer keeps waiting until pause_threshold has elapsed before it returns the audio. This gets even worse in noisy environments with dynamic_energy_threshold turned off.
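To put a rough number on that wait, here is a small sketch. It assumes the listener ends a phrase only after a whole number of silent buffers covering pause_threshold; the helper name trailing_wait_seconds and the example values (0.8 s pause, 1024-sample chunks, 16 kHz) are illustrative assumptions, not measurements from my setup.

```python
import math

def trailing_wait_seconds(pause_threshold, chunk_size, sample_rate):
    """Estimate the silence the listener must observe before ending a phrase."""
    # Duration of one audio buffer in seconds.
    seconds_per_buffer = chunk_size / sample_rate
    # Whole number of silent buffers needed to cover pause_threshold.
    pause_buffer_count = math.ceil(pause_threshold / seconds_per_buffer)
    return pause_buffer_count * seconds_per_buffer

# Assumed example values: 0.8 s pause, 1024-sample chunks at 16 kHz.
wait = trailing_wait_seconds(0.8, 1024, 16000)
```

In a noisy room this is only a lower bound, since any buffer above energy_threshold resets the silence count.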

My users don't want to wait as they know that they are done talking. They want to be able to hit enter and reduce the time taken to show them the transcription.

This is just one example of where this could be helpful. I'm sure this feature can be useful in many ways.

I have tried the stopper, but it can take up to a second to stop and it won't flush the audio. Also, the stopper won't stop the recorder while audio is actively being recorded, i.e. while energy > energy_threshold.

Hence this change.

How to use?

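# A one-element list, so the listener and the caller share the same mutable object.
self.flush_flag = [False]
self.recorder.listen_in_background(self.source, self.record_callback, phrase_time_limit=self.record_timeout, flush_flag=self.flush_flag)

def onEnter(self):
    # Set the flag in place (rebinding self.flush_flag would not reach the listener).
    # The flag will be reset to False once the audio is flushed.
    self.flush_flag[0] = True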
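For anyone who wants to see the idea in isolation, below is a minimal sketch of a capture loop that honors such a flag. It is not the actual patch: capture_phrase, its parameters, and the (energy, data) pairs standing in for microphone buffers are all hypothetical.

```python
def capture_phrase(chunks, energy_threshold, pause_chunks, flush_flag):
    """Collect chunks until a long enough pause, or until flush_flag[0] is set.

    chunks: iterable of (energy, data) pairs standing in for microphone buffers.
    flush_flag: one-element list shared with the caller, e.g. [False].
    """
    frames = []
    silent_run = 0
    for energy, data in chunks:
        frames.append(data)
        if flush_flag[0]:
            # Flush requested: stop now, even if the speaker is still loud.
            flush_flag[0] = False  # reset so the next phrase records normally
            break
        # Count consecutive quiet chunks; any loud chunk resets the count.
        silent_run = silent_run + 1 if energy <= energy_threshold else 0
        if silent_run >= pause_chunks:
            break
    return b"".join(frames)
```

The caller has to mutate the shared list (flush_flag[0] = True) rather than assign a new list, otherwise the loop never sees the change.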

Please feel free to modify the logic to make it cleaner and more robust. TIA.

Jul 28 '24 17:07 sreekanthputta
