Montreal-Forced-Aligner
Montreal-Forced-Aligner copied to clipboard
How to train a Chinese and English hybrid acoustic model by using MFA?
Hi, all
I have a request to generate the alignments for the Chinese and English hybrid dataset, but the pretrained model is either pure Chinese or English one. Therefore, I have to train my own model. However, I can't find any tutorial. I don't know which phone set is suitable for me, and how to make a dictory, and so on. Any useful advice will be grateful. Thanks!
Hey! There are multiple ways you could do this! The easiest way, in my opinion, would be using IPA for both. MFA can link to a dictionary-per-speaker model. You could use a Mandarin dictionary and an English dictionary and then use this methodology: https://montreal-forced-aligner.readthedocs.io/en/latest/user_guide/dictionary.html#per-speaker-dictionaries
Mind you, this should also improve the model for each other.