Michael McAuliffe

186 comments by Michael McAuliffe

You can set up the environment with `conda install kaldi pynini` on Mac/Linux (and minus `pynini` on Windows) and that will have all the conda-only dependencies installed, at...

Yes, but those are dependencies of Pynini, so you don't have to specify them.

Do you mean the case where the word `A` has multiple transcriptions? In the above example, it should always pick `A` because `A(1)` is not considered a pronunciation variant (different...

Sure, I can add some functionality for that in the next release. I'll have to think about how to structure it and expose it, but can probably at least have...

The text file looks pretty long. How long is the audio file? Is it possible for you to chunk it into an input TextGrid like: https://montreal-forced-aligner.readthedocs.io/en/latest/user_guide/corpus_structure.html#textgrid-format...
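As a rough illustration of what chunking into a TextGrid could look like, here is a minimal sketch that writes one interval tier in Praat's long text format. The function name `build_textgrid`, the tier name `speaker`, and the example timings are all made up for illustration, and it assumes the chunks are sorted and contiguous (each chunk starts where the previous one ends).

```python
def build_textgrid(intervals, tier_name="speaker"):
    """Build a Praat long-format TextGrid string with a single interval tier.

    intervals: list of (start_seconds, end_seconds, text) tuples, assumed
    sorted and contiguous. tier_name is a placeholder label.
    """
    xmin = intervals[0][0]
    xmax = intervals[-1][1]
    lines = [
        'File type = "ooTextFile"',
        'Object class = "TextGrid"',
        '',
        f'xmin = {xmin}',
        f'xmax = {xmax}',
        'tiers? <exists>',
        'size = 1',
        'item []:',
        '    item [1]:',
        '        class = "IntervalTier"',
        f'        name = "{tier_name}"',
        f'        xmin = {xmin}',
        f'        xmax = {xmax}',
        f'        intervals: size = {len(intervals)}',
    ]
    for i, (start, end, text) in enumerate(intervals, start=1):
        # Praat escapes a double quote inside a label by doubling it.
        escaped = text.replace('"', '""')
        lines += [
            f'        intervals [{i}]:',
            f'            xmin = {start}',
            f'            xmax = {end}',
            f'            text = "{escaped}"',
        ]
    return "\n".join(lines) + "\n"


# Hypothetical chunks: two utterances covering the first 9.5 seconds.
chunks = [(0.0, 5.0, "hello world"), (5.0, 9.5, 'say "hi"')]
textgrid_text = build_textgrid(chunks)
```

Writing `textgrid_text` to a `.TextGrid` file next to the audio file (same base name) would then give one short transcript per interval instead of one long text file.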

It looks like you might be able to use https://github.com/CohenPr-XPF/XPF to generate transcriptions for Sinhalese (https://github.com/CohenPr-XPF/XPF/tree/master/Data/si_Sinhala). That's probably the easiest route. I haven't used XPF before, but it looks like...

So you'll have to play around with the `frame_shift` property: https://montreal-forced-aligner.readthedocs.io/en/latest/user_guide/configuration/global.html#feature-configuration. That's the value that determines how accurate MFA is with respect to time. However, since it's trained on 10ms,...
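To make the effect of the frame shift concrete, here is a small arithmetic sketch. It assumes, purely for illustration, that a boundary can only land on a multiple of the frame shift; the helper `snap_to_frames` is hypothetical and not part of MFA.

```python
def snap_to_frames(time_s, frame_shift_ms=10):
    """Round a time in seconds to the nearest multiple of the frame shift.

    Illustrative assumption: boundaries fall on the frame grid, so the
    frame shift bounds how finely a boundary can be placed.
    """
    frame_shift_s = frame_shift_ms / 1000.0
    frame_index = round(time_s / frame_shift_s)
    return frame_index * frame_shift_s


true_boundary = 0.1234  # seconds, made-up value
at_10ms = snap_to_frames(true_boundary, frame_shift_ms=10)  # about 0.120
at_5ms = snap_to_frames(true_boundary, frame_shift_ms=5)    # about 0.125
```

Under that assumption, the worst-case rounding error is half the frame shift, so halving the frame shift halves the grid spacing.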

Right, so I've looked into this a bit in the past, and the short answer is that there's no supported way to do it in Kaldi (see: https://groups.google.com/forum/#!topic/kaldi-help/yi9hyWxTPQQ, https://groups.google.com/forum/#!topic/kaldi-help/tBOrm7WBSf8, https://sourceforge.net/p/kaldi/discussion/1355348/thread/fd0f3c27/,...

Oh interesting, yeah I don't think I have any explicit correction for that, so yeah I'll add some dithering to it before it takes the log.
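For anyone unfamiliar with the idea, this is a generic sketch of dithering before a log, not the code that will go into the release. The function name `log_energy`, the dither scale, and the energy floor are illustrative assumptions.

```python
import math
import random


def log_energy(frame, dither=1e-3, rng=None):
    """Log energy of a frame, with small random noise added first.

    Without dither, an all-zero frame has zero energy and math.log(0)
    raises an error. The dither scale and floor here are arbitrary.
    """
    rng = rng or random.Random(0)  # fixed seed keeps the example repeatable
    dithered = [x + dither * rng.gauss(0.0, 1.0) for x in frame]
    energy = sum(x * x for x in dithered)
    # Floor the energy as a second safeguard against log(0).
    return math.log(max(energy, 1e-20))


silent_frame = [0.0] * 400  # e.g. digital silence
silent_log_energy = log_energy(silent_frame)
```

With the dither in place, digital silence gives a small finite log energy rather than an error or negative infinity.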

Are you thinking like within a script? There's an example you might be able to use in the GUI widgets for generating pronunciations for single words: https://github.com/MontrealCorpusTools/Montreal-Forced-Aligner/blob/main/montreal_forced_aligner/gui/widgets.py#L1297, it would just...