yanzhuangzhuang-beep comments

Results: 21 comments of yanzhuangzhuang-beep

yaml

I have the LJSpeech-1.1 data, but it only contains wav files, so I use its path as wav_dir. You said I need to download data, but I don't have the link. Could you provide the URL? Thanks.

yaml

OK, I have finished nvidia_preprocessing. When I run train_fastspeech I get an error: no key "hp.model.phoneme_acoustic_embed". I found that config.yaml has no key model.phoneme_acoustic_embed and no value for it.

In the yaml, I set model.phoneme_acoustic_embed = True. Then self.utterance_encoder(ys.transpose(1, 2)).transpose(1, 2) fails with IndexError: Dimension out of range (expected to be in range of [-1, 0], but got 1)
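A minimal sketch of how a missing key like this can be checked before training, assuming the parsed config.yaml behaves like a nested dict. The helper `get_nested` and the sample `config` contents (including `hidden_size`) are hypothetical and not taken from the repository; which default is correct for `phoneme_acoustic_embed` is also not known from this thread.

```python
def get_nested(config, dotted_key, default=None):
    """Walk a nested dict by a dotted key; return `default` if any part is missing."""
    node = config
    for part in dotted_key.split("."):
        if not isinstance(node, dict) or part not in node:
            return default
        node = node[part]
    return node


# Hypothetical stand-in for the parsed config.yaml (the real file has other keys).
config = {"model": {"hidden_size": 256}}

# Fall back to False when the key is absent instead of raising a "no key" error.
flag = get_nested(config, "model.phoneme_acoustic_embed", default=False)
print("model.phoneme_acoustic_embed =", flag)
```

Reading the key with an explicit default at least makes it visible that the value was never set in the yaml.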

uttr = self.utterance_encoder(ys.transpose(1, 2)).transpose(1, 2) IndexError: Dimension out of range (expected to be in range of [-1, 0], but got 1)

Have you solved this problem?

Phonetic acoustic embedding , utterance level embedding data missing

Hello, have you found out what the value of the phonetic acoustic embedding should be?

Will it support BIAOBEI dataset?

Have you got it?

Will it support BIAOBEI dataset?

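The main point is that I don't know how to use MFA ... ![1638442790(1)](https://user-images.githubusercontent.com/62825785/144409721-7275cee1-20f5-4e44-a04f-475e3b87a9ca.png) This is what I get with the BIAOBEI data: it runs, but the loss is NaN. I suspect the data is messy.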

Will it support BIAOBEI dataset?

After switching to BIAOBEI, long sentences still have problems.

torch.nn.modules.module.ModuleAttributeError: 'FastSpeech2' object has no attribute 'module'. How can this be resolved?
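One common cause of this error (an assumption here, since the full traceback is not shown) is code that reads `model.module` when the model was not wrapped in `torch.nn.DataParallel`, which is the wrapper that exposes the original model under `.module`. A minimal sketch of a defensive unwrap, using plain stand-in classes instead of real torch modules so it runs anywhere; `unwrap`, `PlainModel`, and `WrappedModel` are hypothetical names:

```python
def unwrap(model):
    """Return model.module if that attribute exists, otherwise the model itself."""
    return getattr(model, "module", model)


class PlainModel:
    """Stand-in for a model used directly (e.g. single GPU or CPU)."""
    name = "FastSpeech2"


class WrappedModel:
    """Stand-in for a DataParallel-style wrapper that stores the model in .module."""
    def __init__(self, module):
        self.module = module


plain = PlainModel()
wrapped = WrappedModel(plain)

# Both calls reach the same underlying object, with or without the wrapper.
print(unwrap(plain).name)
print(unwrap(wrapped).name)
```

With this pattern the same saving or inference code works whether or not the wrapper is present.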

Thank you, I have found this problem and solved it

RuntimeError: The size of tensor a (51) must match the size of tensor b (53) at non-singleton dimension 1

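I had the same error before. I think the lexicon is incomplete, so some indices come out empty and the data is misaligned. But after I added the missing vocabulary I ran into a new problem, and I don't know whether it is a sampling-rate issue. You can start by printing the characters that are missing from text/system.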
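The suggestion to print the missing characters could look roughly like this. It is a sketch only: `missing_symbols`, the sample `symbols` list, and the sample `texts` are hypothetical, and the real symbol table in the repository will differ.

```python
def missing_symbols(texts, symbols):
    """Count every character in `texts` that is not in the known symbol set."""
    known = set(symbols)
    missing = {}
    for text in texts:
        for ch in text:
            if ch not in known:
                missing[ch] = missing.get(ch, 0) + 1
    return missing


# Hypothetical symbol table and transcripts, for illustration only.
symbols = list("abc ")
texts = ["abca", "a bd!"]

missing = missing_symbols(texts, symbols)
for ch, count in sorted(missing.items()):
    print(repr(ch), count)
```

Any character reported here would be dropped or mapped to nothing during text encoding, which is one way the text and feature lengths can end up different.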