Montreal-Forced-Aligner
Montreal-Forced-Aligner copied to clipboard
Command line utility for forced alignment using Kaldi
I tried using the pretrained french acoustic model (trained on GP using GP dictionary), but got error messages about not finding the pretrained model. The MFA documentation page (https://montreal-forced-aligner.readthedocs.io/en/latest/aligning.html#align-using-pretrained-models) says...
Hello, thanks for creating MFA. I am trynig to replicate FastSpeech2 with MFA. When generating alignments for a new corpus, the process is supposed to be: 1. Download dictionary 2....
**Debugging checklist** [x] Have you updated to latest MFA version? [x] Have you tried rerunning the command with the `--clean` flag? **Describe the issue** A clear and concise description of...
``` from montreal_forced_aligner.data import Pronunciation, Word ``` migration tips?? ``` from montreal_forced_aligner.data import Pronunciation, Word ImportError: cannot import name 'Pronunciation' from 'montreal_forced_aligner.data' ```
**Debugging checklist** [x] Have you updated to latest MFA version? [x] Have you tried rerunning the command with the `--clean` flag? **Describe the issue** A clear and concise description of...
I have been trying to run the forced aligner using mfa validate (and later mfa align just to see if it would work) because I'd like to see if I...
Montreal-Forced-Aligner/montreal_forced_aligner/corpus/acoustic_corpus.py", line 776, in get_feat_dim return int(feats) ValueError: invalid literal for int() with base 10: ''
Hi, from my experiments it seems that the MFA is running on only one cpu when there is only one speaker to align. I am running the alignment on the...
On the First Step's page, the [Examples](https://montreal-forced-aligner.readthedocs.io/en/latest/first_steps/example.html) are telling us to download things that don't seem to exist anymore (english, mandarin_pinyin_g2) For example in Example 2, I get the following...
Is there currently a way to align a single sound file (i.e., not a directory of files) based on acoustic model (e.g. in .wav with.lab transcription)? I guess i could...