SpeechSplit How to align multiple sequences while they are from different source?

How to align multiple sequences while they are from different source?

Open inconnu11 opened this issue 4 years ago • 2 comments

If the length of content code, rhythm code and pitch code is different from each other, how do they align since there is no attention mechanism in decoder?

Jun 10 '21 10:06 inconnu11

The rhythm code provides the alignment information. The decoder just use this information automatically to align the content code and/or pitch code.

Jun 10 '21 17:06 auspicious3000

The rhythm code provides the alignment information. The decoder just use this information automatically to align the content code and/or pitch code.

Is it right in your code(model.py: line308-309) when content code, rhythm code and pitch code are from different utterance, since the three have alignment problem. I don't find details of the variant to get alignment information propose in your papar, Appendix B.3, can you tell me the details. thank you.

Oct 14 '21 04:10 dbkest

SpeechSplit SpeechSplit copied to clipboard

How to align multiple sequences while they are from different source?

SpeechSplit
SpeechSplit copied to clipboard