SpeechSplit
SpeechSplit copied to clipboard
How to align multiple sequences while they are from different source?
If the length of content code, rhythm code and pitch code is different from each other, how do they align since there is no attention mechanism in decoder?
The rhythm code provides the alignment information. The decoder just use this information automatically to align the content code and/or pitch code.
The rhythm code provides the alignment information. The decoder just use this information automatically to align the content code and/or pitch code.
Is it right in your code(model.py: line308-309) when content code, rhythm code and pitch code are from different utterance, since the three have alignment problem. I don't find details of the variant to get alignment information propose in your papar, Appendix B.3, can you tell me the details. thank you.