RAD-NeRF
RAD-NeRF copied to clipboard
Running the code in stream mode
How can we have a stream using asr.py
? So that the audio file does not need to be input as a file.
For example, my words can be repeated in real-time by Talking-Head.
Thanks a lot @ashawkey
@tylersky1993 Hi, unfortunately the current streaming mode is not performing well, since the ASR model we use is not specifically designed for real-time ASR (it requires at least 1 second input to make good prediction).
If you really want to try the streaming mode, you could find an example code here. Note that the GUI mode is necessary and the ASR sliding window (-l, -m, -r
) is smaller for lower latency.
@ashawkey do you think chopping up the audio and feeding it second by second could work