flowtron
Changing speed of speech
Is there a way to sample z such that the speed of speech changes, i.e., to make it faster or slower?
I tried different values of sigma, but that seems to control the variability of the speech and didn't really change its speed.
The paper doesn't discuss this either. Any thoughts?
Yes, we're looking into it!
An alternative way to change speed is to train Flowtron with a token duration predictor, modify the attention mechanism to work with hard alignments, and then scale the durations during inference. The ground-truth durations can be obtained from Flowtron alignments. This works quite well for us and provides a mechanism to directly control speech rate.
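To make the idea concrete, here is a minimal sketch in plain Python of the three steps: deriving per-token durations from a soft attention map, scaling them, and expanding tokens into a hard alignment. This is not Flowtron's API. The function names, the `pace` parameter, and the argmax-based duration extraction are all assumptions made for illustration.

```python
import math


def durations_from_attention(attention):
    """Count frames per token from a soft attention map (hypothetical helper).

    `attention` is a list of frames, each a list of weights over tokens.
    Each frame is assigned to its highest-weight token (argmax), which is
    one simple way to turn soft alignments into hard durations.
    """
    n_tokens = len(attention[0])
    counts = [0] * n_tokens
    for frame in attention:
        best = max(range(n_tokens), key=lambda i: frame[i])
        counts[best] += 1
    return counts


def scale_durations(durations, pace):
    """Scale durations by 1 / pace (assumed convention: pace > 1 is faster).

    Rounds half up and keeps at least one frame for every token that had
    a nonzero duration, so no spoken token is dropped entirely.
    """
    if pace <= 0:
        raise ValueError("pace must be positive")
    scaled = []
    for d in durations:
        if d == 0:
            scaled.append(0)
        else:
            scaled.append(max(1, math.floor(d / pace + 0.5)))
    return scaled


def expand_tokens(tokens, durations):
    """Repeat each token for its number of frames (a hard alignment)."""
    expanded = []
    for token, d in zip(tokens, durations):
        expanded.extend([token] * d)
    return expanded


# Toy attention map: 3 frames over 2 tokens.
attention = [[0.9, 0.1], [0.6, 0.4], [0.2, 0.8]]
durations = durations_from_attention(attention)
slow = scale_durations(durations, pace=0.5)
print(durations, slow, expand_tokens(["a", "b"], slow))
```

In a real model the expanded sequence would be encoder outputs rather than token symbols, and at inference time the durations would come from the trained duration predictor rather than from an attention map.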
We'll soon make Flowtron Parallel available: https://twitter.com/RafaelValleArt/status/1281268833504751616?s=20
Great, really excited and looking forward to it.
@rafaelvalle any updates on Flowtron Parallel?
Hello, is there any function to change the speech rate during inference?