Sang-Hoon Lee

Results 9 comments of Sang-Hoon Lee

Thank you for your quick reply! When I compared the models trained with the complex MS-STFT and the real MS-STFT discriminator, they had similar performance in terms of Mel reconstruction error and PESQ....

Thank you for your interest. Actually, due to computational resource constraints, I stopped training the BigVGAN vocoder 😢 (I trained it for only 300k steps). When I evaluated it, BigVGAN has...

There are so many ways... First, check the preprocessing method for your Mel-spectrogram. Second, change the initial frequency value for resampling: https://github.com/sh-lee-prml/BigVGAN/blob/main/models_bigvgan.py#L104 Calculate this according to your sampling rate, hop size,...

Thank you... I have mis-implemented this parameter... I'll fix it right now. Thanks again

```python
self.alpha1 = nn.ParameterList(
    [nn.Parameter(torch.ones(1, channels, 1)) for i in range(len(self.convs1))]
)
```

I changed the alphas to ParameterList:

https://github.com/sh-lee-prml/BigVGAN/blob/main/models_bigvgan.py#L51
https://github.com/sh-lee-prml/BigVGAN/blob/main/models_bigvgan.py#L52
https://github.com/sh-lee-prml/BigVGAN/blob/main/models_bigvgan.py#L100
https://github.com/sh-lee-prml/BigVGAN/blob/main/models_bigvgan.py#L102
https://github.com/sh-lee-prml/BigVGAN/blob/main/models_bigvgan.py#L108

![image](https://user-images.githubusercontent.com/56749640/179433560-386eca1b-6b6e-4b5c-8fdc-ee4d5d3f9bc7.png)

Now, alpha is trainable 😢 Thank you again 👍
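For anyone hitting the same bug: the reason `nn.ParameterList` matters is that a plain Python list of `nn.Parameter` objects is never registered with the module, so the optimizer silently skips those parameters. A minimal standalone sketch (the `Demo` class and sizes here are made up for illustration, not from the repo):

```python
import torch
import torch.nn as nn

class Demo(nn.Module):
    def __init__(self, channels=4, n_blocks=3):
        super().__init__()
        # Plain Python list: these parameters are NOT registered with the
        # module, so model.parameters() never yields them and they stay frozen.
        self.alpha_plain = [nn.Parameter(torch.ones(1, channels, 1))
                            for _ in range(n_blocks)]
        # nn.ParameterList: these ARE registered and become trainable.
        self.alpha_list = nn.ParameterList(
            [nn.Parameter(torch.ones(1, channels, 1)) for _ in range(n_blocks)]
        )

m = Demo()
# Only the ParameterList entries show up: 3 blocks * (1*4*1) elements = 12.
n_trainable = sum(p.numel() for p in m.parameters())
print(n_trainable)  # 12
```

The same reasoning applies to `nn.ModuleList` vs a plain list of submodules.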

Hi @HaiFengZeng The official snake1d code initializes it to be greater than 0 via abs():

```python
a = torch.zeros_like(x[0]).normal_(mean=0, std=50).abs()
```

But I think it does not need to be greater than...

I referred to Appendix A (page 13) of the BigVGAN paper. **The upsampled feature is followed by M number of AMP residual blocks, where each AMP block uses different kernel sizes...
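The "M blocks with different kernel sizes" pattern can be sketched roughly as below. This is only an illustrative skeleton in the spirit of HiFi-GAN's multi-receptive-field fusion that BigVGAN's AMP blocks build on; the class name, channel sizes, and the plain residual Conv1d stand in for the real AMP block internals:

```python
import torch
import torch.nn as nn

class MRFStack(nn.Module):
    """Apply M parallel residual blocks with different kernel sizes
    to the upsampled feature and average their outputs (sketch only)."""
    def __init__(self, channels=8, kernel_sizes=(3, 7, 11)):
        super().__init__()
        # One conv per kernel size; odd kernels with k//2 padding
        # keep the sequence length unchanged.
        self.blocks = nn.ModuleList([
            nn.Conv1d(channels, channels, k, padding=k // 2)
            for k in kernel_sizes
        ])

    def forward(self, x):
        # Residual connection per block, then average over the M blocks.
        out = sum(x + b(x) for b in self.blocks)
        return out / len(self.blocks)

x = torch.randn(1, 8, 16)
y = MRFStack()(x)
print(y.shape)  # torch.Size([1, 8, 16])
```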

Thank you for your concern about this issue. I agree with your idea about using a low-pass filter only once. However, in this case, I was confused about the activation function...

After some comparison, I found that both models (resampling once or twice) have similar performance. But when resampling twice, training/inference speed is much slower, so I changed it as...