Aaron (Yinghao) Li

Columbia University New York, US

Results: 110 comments of Aaron (Yinghao) Li

Possible bug in LJSpeech training data

Unfortunately it seems like a bug. I took the data directly from the VITS repo (https://github.com/jaywalnut310/vits/blob/main/filelists/ljs_audio_text_test_filelist.txt.cleaned) without scrutinizing it. @Kreevoz I guess you are correct😂. I just tested it and the...

Possible bug in LJSpeech training data

Maybe I'll redo the preprocessing of the LJSpeech dataset and train a new model with the corrected data file when I get time.

Possible bug in LJSpeech training data

> Unfortunately it seems like a bug. I took the data directly from the VITS repo (https://github.com/jaywalnut310/vits/blob/main/filelists/ljs_audio_text_test_filelist.txt.cleaned) without scrutinizing it. > > @Kreevoz I guess you are correct😂. I just tested...

Possible bug in LJSpeech training data

@Kreevoz I found another problem. The quote in the LibriTTS dataset was actually `"content"`, not ` ``content'' `: https://raw.githubusercontent.com/yl4579/StyleTTS2/main/Data/OOD_texts.txt, so the inference code for sentences with quotes is also wrong.
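The mismatch described above (LaTeX-style ` ``content'' ` versus straight `"content"`) could be handled by normalizing the quotes before inference. This is only a minimal sketch: `normalize_quotes` is a hypothetical helper, not a function from the StyleTTS2 repository, and it assumes a doubled apostrophe never appears in the text for any other reason.

```python
def normalize_quotes(text: str) -> str:
    """Replace LaTeX-style ``...'' quotes with straight double quotes.

    Hypothetical helper for illustration; not part of StyleTTS2.
    Assumes '' is only ever used as a closing quote.
    """
    return text.replace("``", '"').replace("''", '"')


if __name__ == "__main__":
    # Text written with LaTeX-style quotes becomes straight-quoted text.
    print(normalize_quotes("He said, ``content'' and left."))
```

Whether the text front end expects exactly this form would need to be checked against the data the model was trained on.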

Which model is 7B (Default) and which is 13B (Beta)?

The online API is not working right now. If it’s different though, since I’m running inference on A40, how do I get it working in the same way as the...

Which model is 7B (Default) and which is 13B (Beta)?

I just checked the output and I'm pretty sure the default model produces output very similar to `13B (Beta)` in the Hugging Face Space (though it is down now). How do I get...

Which model is 7B (Default) and which is 13B (Beta)?

Now I have confirmed they give similar responses, but the responses differ from those I got a month ago (around early Nov). Did you change the model for your...

Which model is 7B (Default) and which is 13B (Beta)?

In your experience which one is better? I changed to `eval_mdl_path = '../../pretrained_mdls/ltu_ori_paper.bin'` but got the following error: ``` RuntimeError Traceback (most recent call last) Cell In[3], line 50 47...

Possible bug in masked index generation?

Thanks for your question. This was intentional. The masked indices are used for loss calculation here: https://github.com/yl4579/PL-BERT/blob/main/train.ipynb (see the `if len(_masked_indices) > 0:` line), so the masked token also includes...
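As a rough illustration of computing a loss only at masked positions, with a guard like the `if len(_masked_indices) > 0:` check mentioned above, here is a pure-Python sketch. It is not the PL-BERT training code: `masked_nll`, its arguments, and the choice of mean negative log-likelihood are all assumptions made for the example.

```python
import math


def masked_nll(log_probs, targets, masked_indices):
    """Mean negative log-likelihood over the masked positions only.

    Hypothetical helper for illustration; not taken from PL-BERT.
    log_probs: one list of log-probabilities over the vocabulary per position.
    targets: the correct vocabulary index at each position.
    masked_indices: positions that contribute to the loss.
    """
    if len(masked_indices) == 0:
        # Nothing was masked in this sample, so there is no loss term.
        return None
    total = 0.0
    for i in masked_indices:
        total -= log_probs[i][targets[i]]
    return total / len(masked_indices)


if __name__ == "__main__":
    log_probs = [
        [math.log(0.5), math.log(0.5)],
        [math.log(0.25), math.log(0.75)],
    ]
    targets = [0, 1]
    # Only position 1 is masked, so only it contributes to the loss.
    print(masked_nll(log_probs, targets, [1]))
```

Positions outside `masked_indices` are ignored entirely, so which positions count as masked directly determines what the model is trained to predict.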

Possible bug in masked index generation?

@tekinek The token separator doesn't need to be predicted because it has a one-to-one correspondence between the grapheme and phoneme (i.e., the space token in the phoneme domain always corresponds...
