charsiu
Charsiu: A neural phonetic aligner.
Based on the paper, I've successfully reproduced the results for Charsiu's FC-10ms, textless FC-10ms, MFA, and WebMaus, but I'm having trouble reproducing the pretrained FS-20ms model. I first downloaded the charsiu/en_w2v2_fs_10ms from...
Would you have the scripts to reproduce the results from the papers (I'm particularly interested in Table 2), or maybe the procedures to reproduce them from this repo?
Something seems wrong with how SIL is used in the word transcriptions. This is the first example in the LibriSpeech test set. Here is the true transcript:...
I guess it's pinyin; is there an official introduction?
Hello, from your paper, it seems that W2V2-FS's alignment is better than W2V2-FC's, but currently there is only an English W2V2-FS model. Have you tested the W2V2-FS alignment...
Does it support Chinese-English alignment?
Thanks for this great package! I forked the repo to tweak a few things to help my use case, and some of them might be useful to merge back into...
[experiments](https://github.com/lingjzhu/charsiu/tree/main/experiments) Those were original research code for training the model. Good job. I want to pre-train on my Chinese dataset. I don't know whether the code in experiments is OK....
Hi, I find that getting a pretrained predictive aligner (aligner='charsiu/en_w2v2_fc_10ms') to work with LibriSpeech seems straightforward. However, I'm unable to get the attention aligner working - how do I go...