RAD-NeRF icon indicating copy to clipboard operation
RAD-NeRF copied to clipboard

Running the code in stream mode

Open pegahs1993 opened this issue 2 years ago • 2 comments

How can we have a stream using asr.py ? So that the audio file does not need to be input as a file. For example, my words can be repeated in real-time by Talking-Head.

Thanks a lot @ashawkey

pegahs1993 avatar Jan 14 '23 23:01 pegahs1993

@tylersky1993 Hi, unfortunately the current streaming mode is not performing well, since the ASR model we use is not specifically designed for real-time ASR (it requires at least 1 second input to make good prediction). If you really want to try the streaming mode, you could find an example code here. Note that the GUI mode is necessary and the ASR sliding window (-l, -m, -r) is smaller for lower latency.

ashawkey avatar Jan 15 '23 01:01 ashawkey

@ashawkey do you think chopping up the audio and feeding it second by second could work

amuvarma13 avatar Jul 29 '23 16:07 amuvarma13