MaxMax2016
MaxMax2016
https://wenet.org.cn/opencpop/download/
@Lukelluke 简单的讲,就是使用帧级别的phone,midi通过transformer预测帧级别的pitch,相关pitch预测可以参考 https://github.com/MoonInTheRiver/DiffSinger/blob/master/modules/fastspeech/pe.py https://github.com/yl4579/PitchExtractor
https://github.com/xcmyz/FastVocoder/tree/main/model/generator :multiband_hifigan.py @BridgetteSong @v-nhandt21 may help you
look forward to
i meet the same error, which python should use
另外PortaSpeech是不是比FastSpeech2需要更多的训练语料呢?因为要处理 字 级别的信息。
ok,i will try.
i think PortaSpeech + MB MelGAN can work on cpu very well, VITS need gpu to work.
waiting for you
will opensource training dataset?