Amphion
Amphion copied to clipboard
support fs2 24k training, aligned mel setting with gan vocoder
- fixed fs2 24k training and inference issues, fixed hifigan vocoder training
- aligned mels feature for acoustic model and vocoder, so that people can use the pretrained model. fixed other data preprocess issues.
- fixed librosa usage issues
Please use black to format your code. For example, to format a file named wrong_format.py
, you can run:
pip install black
black wrong_format.py
- fixed fs2 24k training and inference issues, fixed hifigan vocoder training
- aligned mels feature for acoustic model and vocoder, so that people can use the pretrained model. fixed other data preprocess issues.
- fixed librosa usage issues
Thanks for your suggestion. We tried to use the pre-trained vocoder for fs2 with the settings of 100-dimensional mel-spectrogram, but achieved poor results compared with the current settings. Therefore, we have decided to retain the current settings for FS2.
- fixed fs2 24k training and inference issues, fixed hifigan vocoder training
- aligned mels feature for acoustic model and vocoder, so that people can use the pretrained model. fixed other data preprocess issues.
- fixed librosa usage issues
Thanks for your suggestion. As @lmxue mentioned, we have tried to align the FS2 with the default setting of the vocoder in the early stage but resulted in poor performance. If you have got good results with the aligned setting, please attach the pretrained model as well as some demos in this PR and we will process your PR then.
Hi, thanks for your suggeston. We're closing this issue for the reasons mentioned by @lmxue and @VocodexElysium , where we tried to use the default vocoder but resulted in poor performance. If you have got good results with the aligned setting, you're welcome to reopen the PR. Thanks!