Montreal-Forced-Aligner icon indicating copy to clipboard operation
Montreal-Forced-Aligner copied to clipboard

How to train a Chinese and English hybrid acoustic model by using MFA?

Open Raise-me-up opened this issue 1 year ago • 1 comments

Hi, all

I have a request to generate the alignments for the Chinese and English hybrid dataset, but the pretrained model is either pure Chinese or English one. Therefore, I have to train my own model. However, I can't find any tutorial. I don't know which phone set is suitable for me, and how to make a dictory, and so on. Any useful advice will be grateful. Thanks!

Raise-me-up avatar Jan 12 '24 03:01 Raise-me-up

Hey! There are multiple ways you could do this! The easiest way, in my opinion, would be using IPA for both. MFA can link to a dictionary-per-speaker model. You could use a Mandarin dictionary and an English dictionary and then use this methodology: https://montreal-forced-aligner.readthedocs.io/en/latest/user_guide/dictionary.html#per-speaker-dictionaries

Mind you, this should also improve the model for each other.

NataliaShmueli avatar Jan 16 '24 08:01 NataliaShmueli