flowtron icon indicating copy to clipboard operation
flowtron copied to clipboard

Changing speed of speech

Open sgulasekaran opened this issue 4 years ago • 6 comments

Is there a way to sample z such that the speed of speech is changed, .i.e, make it faster or slower ?

I tried with different sigma but that seems to control variability in speech and couldn't really change speed.

The paper also didn't talk about it. Any thoughts on this ?

sgulasekaran avatar May 27 '20 09:05 sgulasekaran

Yes, we're looking into it!

rafaelvalle avatar May 28 '20 23:05 rafaelvalle

An alternative way to change speed is to train Flowtron with a token duration predictor, modifying the attention mechanism to work the hard alignments and then scale the durations during inference. The ground truth durations can be obtained from Flowtron alignments. This works quite well for us and provides a mechanism to directly control speech rate.

rafaelvalle avatar Jun 12 '20 18:06 rafaelvalle

We'll soon make Flowtron Parallel available https://twitter.com/RafaelValleArt/status/1281268833504751616?s=20

rafaelvalle avatar Jul 09 '20 21:07 rafaelvalle

Great, really excited and looking forward to it.

sgulasekaran avatar Jul 09 '20 23:07 sgulasekaran

@rafaelvalle any updates on Flowtron Parallel?

artemg avatar Oct 07 '20 16:10 artemg

hello, is there any function to change the speech rate when inference?

evelynyhc avatar Mar 24 '21 09:03 evelynyhc