ParallelWaveGAN

Harmonic noise in MB Melgan if not trimming silence in training

Open fervmsanas opened this issue 2 years ago • 3 comments

[Screenshot: two plots. Top: silence trimming set to False; bottom: silence trimming set to True.]

Hi, I get harmonic noise in silent segments (top plot) when I set the silence trimming option to False for MB Melgan training. It does not happen when I set trimming to True (bottom plot) and keep all other training conditions the same. I have tried different configurations, and this is the only case where the issue appears. By including the silence in training, I was expecting to reduce the slight noise that sometimes appears in these segments. Any thoughts or experience with this? Thank you in advance.

fervmsanas avatar Aug 19 '21 22:08 fervmsanas
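For readers unfamiliar with what a silence trimming step does before training, here is a minimal pure-Python sketch of energy-based trimming. It is not the ParallelWaveGAN preprocessing code: the function name `trim_silence` and the parameters `top_db`, `frame_length`, and `hop_length` are assumptions chosen for illustration, and the test signal is synthetic.

```python
import math


def trim_silence(samples, top_db=60.0, frame_length=2048, hop_length=512):
    """Drop leading/trailing frames whose RMS is more than top_db below the loudest frame.

    Hypothetical helper for illustration only; not taken from any library.
    """
    if not samples:
        return []
    # RMS energy of each (possibly overlapping) analysis frame.
    rms = []
    last_start = max(len(samples) - frame_length, 0)
    for start in range(0, last_start + 1, hop_length):
        frame = samples[start:start + frame_length]
        rms.append(math.sqrt(sum(x * x for x in frame) / len(frame)))
    peak = max(rms)
    if peak == 0.0:
        return []  # the whole signal is digital silence
    # Convert the dB margin into a linear RMS threshold relative to the peak.
    threshold = peak * 10.0 ** (-top_db / 20.0)
    voiced = [i for i, r in enumerate(rms) if r >= threshold]
    begin = voiced[0] * hop_length
    end = min(len(samples), voiced[-1] * hop_length + frame_length)
    return samples[begin:end]


# Synthetic example: 4000 silent samples, a 4000-sample tone, 4000 silent samples.
tone = [0.5 * math.sin(2.0 * math.pi * 440.0 * n / 16000.0) for n in range(4000)]
signal = [0.0] * 4000 + tone + [0.0] * 4000
trimmed = trim_silence(signal, top_db=60.0, frame_length=400, hop_length=100)
print(len(signal), len(trimmed))
```

With trimming enabled, the vocoder sees almost no pure-silence frames during training; with it disabled, the leading and trailing silence stays in the training data, which is the condition described in this issue.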

This is an interesting observation. I have never seen this kind of phenomenon, but in my experience, trimming the silent parts as much as possible improves the quality. Not directly related to this issue, but the hifigan discriminator is very strong, so I'm curious about the combination of mb melgan G + hifigan D.

kan-bayashi avatar Aug 22 '21 07:08 kan-bayashi

@kan-bayashi

but the hifigan discriminator is very strong, so I'm curious about the combination of mb melgan G + hifigan D.

I've been using that combination for the past few months, and it seems to help a lot with making the generator fine-tunable and improving multi-speaker performance. With the hifigan discriminator and a decreased learning rate, I can adapt the mb-melgan generator to a new voice within 15k to 30k steps, given at least 30 minutes of data (this requires a pretrained mb-melgan trained with the hifigan D, with both models loaded). This is my config, and I can share samples.

ZDisket avatar Aug 22 '21 08:08 ZDisket

mark

yt605155624 avatar Nov 10 '21 07:11 yt605155624