Montreal-Forced-Aligner
Montreal-Forced-Aligner copied to clipboard
[BUG] mfa transcribe completes successfully but the transcript has "<ukn>"
Debugging checklist
[x] Have you updated to latest MFA version?
[x] Have you tried rerunning the command with the --clean
flag?
Describe the issue
When running mfa transcribe
, the output .lab
file contains a couple of <ukn>
strings only.
For Reproducing your issue Please fill out the following:
- Corpus structure
- What language is the corpus in? English
- How many files/speakers? 1 file, 1 speaker
- Are you using lab files or TextGrid files for input? NA
- Dictionary
- Are you using a dictionary from MFA? If so, which one? english_uk_mfa
- If it's a custom dictionary, what is the phoneset? NA
- Acoustic model
- If you're using an acoustic model, is it one download through MFA? If so, which one? english_mfa
- If it's a model you've trained, what data was it trained on? NA
Log file
Please attach the log file for the run that encountered an error (by default these will be stored in ~/Documents/MFA
).
Desktop (please complete the following information):
- OS: macOS
- Version: Sonoma 14.2.1 (23C71)
- Any other details about the setup (Cloud, Docker, etc): local conda installation
Additional context
The wav file I tried to transcribe has only one sentence (it's a test file).