vall-e
vall-e copied to clipboard
Are the ar and nar models trained in parallel ( at the same time) or separately?
In addition, if I would like to train it on different languages (French) do I have to use another G2P tool?
the ar and the nar models are trained separately, and you should use another G2P tool Maybe refer to a g2p tool called phonemizer. It is cross-lingual.