Montreal-Forced-Aligner icon indicating copy to clipboard operation
Montreal-Forced-Aligner copied to clipboard

missing some phones in the generated TextGrid files [BUG]

Open zaynabmu opened this issue 2 years ago • 6 comments

Debugging checklist

[ ] Have you updated to latest MFA version? yes 2.0.1 [ ] Have you tried rerunning the command with the --clean flag? yes

*Describe the issue

Actually I trained a new acoustic model on my language (Arabic) correctly as documentation step by step , but the generated TectGrid output contains (spn) and some phones were missing since I want to use TextGrids files in training Fastspeech model but the generated voices worse and then I figured out the reason from missing phones in TG files could you pls help me. image

For Reproducing your issue Please fill out the following:

  1. Corpus structure
    • What language is the corpus in? Arabic
    • How many files/speakers? 23/23
    • Are you using lab files or TextGrid files for input? lab files
  2. Dictionary
    • Are you using a dictionary from MFA? If so, which one? no
    • If it's a custom dictionary, what is the phoneset? Ara
  3. Acoustic model
    • If you're using an acoustic model, is it one download through MFA? If so, which one? no
    • If it's a model you've trained, what data was it trained on? audio files with wav format

Log file Please attach the log file for the run that encountered an error (by default these will be stored in ~/Documents/MFA).

Desktop (please complete the following information):

  • OS: [e.g. Windows, OSX, Linux] Windows
  • Version [e.g. MacOSX 10.15, Ubuntu 20.04, Windows 10, etc] Windows 10
  • Any other details about the setup (Cloud, Docker, etc)

Additional context Add any other context about the problem here. when I run mfa validate everythings seems okay

zaynabmu avatar Jun 20 '22 23:06 zaynabmu

The "spn" are generated from bracketed words or words not in the dictionary. In the root temporary directory (~/Documents/MFA/{corpus_name}), there will be an "oovs_found.txt". If you add those words to your dictionary with pronunciations, then they should show up correctly in the aligned TextGrid.

mmcauliffe avatar Jun 21 '22 00:06 mmcauliffe

@mmcauliffe @zaynabmu I have encountered the same problem. Have you solved it? Please send me an email [email protected]

peanut1101 avatar Aug 10 '22 07:08 peanut1101

@mmcauliffe @zaynabmu I have encountered the same problem. Have you solved it? Please send me an email [email protected]

Yes its solved , follow the same instructions that mmcauliffe said and it will be work with you

zaynabmu avatar Aug 10 '22 07:08 zaynabmu

@zaynabmu Thanks

peanut1101 avatar Aug 10 '22 09:08 peanut1101