Jaehyeon Kim

Results: 16 comments by Jaehyeon Kim

I have no empirical evidence, but I think the difference comes from whether the model captures dependencies between output mel-spectrogram frames. The probabilistic modeling of each model is quite...

If your concern is the prosody of the synthesized samples, such as intonation, techniques such as prosody embeddings and style tokens could be useful. In my ongoing experiments, such...

@echelon Hi echelon. I haven't tested on such small datasets, so I can't give you a solution. Sorry about that. In my case, I didn't care much about the dimension,...

The 10000-epoch value has no particular meaning. You can reduce the number of epochs, or just stop training partway through. I trained my model with the base config on 2 V100 GPUs, and it took...

Sorry, I don't think there is enough information here to track down the error. Could you give more details? For example, do you get any errors when you build the Cython code?

While fixing those two problems in [a new branch](https://github.com/jaywalnut310/MelGAN-Pytorch/tree/research), I found that the synthesis quality is worse than before. There are three things that I changed: the two problems above, and melspectrogram...