SpeechSplit icon indicating copy to clipboard operation
SpeechSplit copied to clipboard

How to align multiple sequences while they are from different source?

Open inconnu11 opened this issue 4 years ago • 2 comments

If the length of content code, rhythm code and pitch code is different from each other, how do they align since there is no attention mechanism in decoder?

inconnu11 avatar Jun 10 '21 10:06 inconnu11

The rhythm code provides the alignment information. The decoder just use this information automatically to align the content code and/or pitch code.

auspicious3000 avatar Jun 10 '21 17:06 auspicious3000

The rhythm code provides the alignment information. The decoder just use this information automatically to align the content code and/or pitch code.

Is it right in your code(model.py: line308-309) when content code, rhythm code and pitch code are from different utterance, since the three have alignment problem. I don't find details of the variant to get alignment information propose in your papar, Appendix B.3, can you tell me the details. thank you.

dbkest avatar Oct 14 '21 04:10 dbkest