junh1024
You could:
- train an ML model to add "stuff", but that would sum to more than the original.
- add production FX like distortion & reverberation, but that would also...
I think it might be better to go algorithmic instead of ML for realtime. https://github.com/tachi-hi/euterpe/ is realtime for vox, but unsure of latency. (I think it's algorithmic) https://www.yellownoiseaudio.com/ is realtime...
The 16 kHz is a fake extension (see https://github.com/deezer/spleeter/issues/2#issuecomment-548798493). I suggest subtracting the separated stem(s) (excl. others) from the input; that way the stems sum back to the input and you have unity. DAWs may not support PDC properly...
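A minimal sketch of that subtraction, assuming the stems are already time-aligned with the input and have the same length. The function name `residual_stem` and the sample values are made up for illustration; this is plain arithmetic, not part of Spleeter.

```python
def residual_stem(mix, stems):
    """Return the mix minus the sum of the given stems, sample by sample."""
    residual = list(mix)
    for stem in stems:
        if len(stem) != len(mix):
            raise ValueError("each stem must be the same length as the mix")
        for i, sample in enumerate(stem):
            residual[i] -= sample
    return residual


# Toy mono signals (hypothetical values).
mix = [0.5, -0.25, 0.75, 0.0]
vocals = [0.25, -0.25, 0.5, 0.0]
drums = [0.125, 0.0, 0.125, 0.0]

# "other" is whatever is left after removing the separated stems.
other = residual_stem(mix, [vocals, drums])

# Unity check: the stems plus the residual rebuild the input.
rebuilt = [v + d + o for v, d, o in zip(vocals, drums, other)]
print(rebuilt == mix)
```

If the stems come back delayed relative to the input (e.g. no working PDC), the subtraction will not cancel, so the alignment has to be fixed first.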
S1 should be OK. According to https://github.com/deezer/spleeter/blob/master/configs/5stems/base_config.json#L8 , the FFT size is 4096, so the latency should be 4096 samples, or 4096/44100 ≈ 93 ms. Can you explain how you got it...
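The arithmetic above as a quick check. This only counts one FFT block of buffering at 44.1 kHz; processing time and any host buffering would come on top. The function name `block_latency_ms` is made up for illustration.

```python
def block_latency_ms(fft_size: int, sample_rate: int) -> float:
    """Time needed to fill one FFT block, in milliseconds."""
    return 1000.0 * fft_size / sample_rate


latency = block_latency_ms(4096, 44100)
print(f"{latency:.1f} ms")  # 92.9 ms
```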
@Scylla2020 I believe VDJ/djay are using classic FFT-based algorithmic methods to do realtime separation, e.g. things like [this](https://www.yellownoiseaudio.com/). Also see my previous comments in this thread. It's been...
I think the included 2stems model should do an ok job on dialogue. Have you tried?
1. I think it would be hard to improve it by a lot, but you're welcome to prove me wrong & impress me.
2. C sometimes has SFX and/or music...
> I was hoping that one trained for the purpose of film would be a little more specific and possibly better for that specific application.

voice & program is film. voice...
> Another use-case beyond fan edits: The best way to learn a language has always been to listen/watch/expose yourself to that language as much as possible. But some languages don't...
Spleeter works on short blocks, so you can use it for (almost) realtime. See also:
- VSTspleeter: https://github.com/gvne/vstSpleeter
- euterpe: https://github.com/tachi-hi/euterpe/ (lower CPU V/I separation)
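To illustrate the block idea, here is a rough sketch of feeding audio to a separator one block at a time. Everything here is an assumption for illustration: `separate_block` is a hypothetical callback standing in for the real model, and the sketch ignores the overlap between blocks that an FFT-based separator would likely need to avoid clicks at block edges.

```python
def iter_blocks(samples, block_size):
    """Yield consecutive blocks; the last one may be shorter."""
    for start in range(0, len(samples), block_size):
        yield samples[start:start + block_size]


def process_stream(samples, block_size, separate_block):
    """Run a per-block callback over the signal and join the results."""
    out = []
    for block in iter_blocks(samples, block_size):
        out.extend(separate_block(block))
    return out


def halve(block):
    """Hypothetical stand-in for the separator: just halves each sample."""
    return [0.5 * x for x in block]


signal = [float(i) for i in range(10)]
result = process_stream(signal, 4, halve)
print(len(result) == len(signal))
```

The output only becomes available once a full block has been collected, which is why block size sets the lower bound on latency.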