RAD-NeRF Running the code in stream mode

Running the code in stream mode

Open pegahs1993 opened this issue 2 years ago • 2 comments

How can we have a stream using asr.py ? So that the audio file does not need to be input as a file. For example, my words can be repeated in real-time by Talking-Head.

Thanks a lot @ashawkey

Jan 14 '23 23:01 pegahs1993

@tylersky1993 Hi, unfortunately the current streaming mode is not performing well, since the ASR model we use is not specifically designed for real-time ASR (it requires at least 1 second input to make good prediction). If you really want to try the streaming mode, you could find an example code here. Note that the GUI mode is necessary and the ASR sliding window (-l, -m, -r) is smaller for lower latency.

Jan 15 '23 01:01 ashawkey

@ashawkey do you think chopping up the audio and feeding it second by second could work

Jul 29 '23 16:07 amuvarma13

RAD-NeRF RAD-NeRF copied to clipboard

Running the code in stream mode

RAD-NeRF
RAD-NeRF copied to clipboard