xuexidi

Results 32 comments of xuexidi

一样的问题,同问~

I tried for several times,the voice qulity stuck in a sad point whether VCTK dataset or my own chinese speech dataset.........so sad.....

> I am using AISHELL-3 mandarin corpus to training the VC model; for preprocessing, the speaker embedder using the pretrained 3000000-BL.ckpt. run through the main.py which train 1000000 iters [although...

> Check that the tensors are the same shape before computing their loss? > […](#) > On Wed, Jan 27, 2021 at 11:15 AM JohnHerry ***@***.***> wrote: I am using...

> @seungwonpark yeah sure, I will train MelGAN on GTA. I am also planning to train it in multiple voices as I have a huge repo of large (> 40...

> @xuexidi it doesn't give good result, I had trained model around 1.5 Million step but normal mel training gives better result than GTA. Though one option which I havn't...

> @xuexidi it doesn't matter choose any pre-processing (mel-extraction) and use the same for both TTS and melgan. The mels on which TTS trained always be same as on which...

@ruddradev Hello, did you solve this error?I meet the same error, and I have no idea how to fix it......please help me..... :(

我也遇到了楼上的问题,抽了5000条VCTK数据集的语音来从头训练WavRNN(MOL模式),Batch size=64,训练1了450k steps效果还是很糟糕,真心请教您一下,有什么需要注意的地方吗? Loss曲线: ![31a6e8c899b5e9e5f479d0fc641843c](https://user-images.githubusercontent.com/48052959/94285398-2cc48a80-ff86-11ea-8fc5-7dc55519a386.jpg) 400K steps时候生成的语音: ![2bccfba6a7efa4e1dc6874c85644243](https://user-images.githubusercontent.com/48052959/94285445-39e17980-ff86-11ea-94c7-bcf7cfa2396d.jpg)

I met the same problem.....who can help me!