ILG2021

Results 38 issues of ILG2021

I am trying to port the openvpi nsf-hifigan to work with Amphion, but I found that nsf-hifigan needs f0 for inference, which gan_vocoder_inference.py does not supply.

bug

Hello, sovits 4.1 uses the diffusion base model from your project. Could you add a license for it? https://huggingface.co/datasets/ms903/Diff-SVC-refactor-pre-trained-model

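Hello, I would like to train a French TTS model. Do I need to modify the code, and if so, what changes are needed to support French? Also, roughly how many hours of dry (clean) speech recordings are needed to train a reasonably good TTS model? This TTS is for a specialized domain (technology) and does not need strong generalization.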

enhancement

Recently, Hugging Face has also been developing optimum-tpu.

I have tried four models on a desktop CPU (Intel(R) Core(TM) i7-10700): moondream-0_5b-int4.mf.gz, moondream-0_5b-int8.mf.gz, moondream-2b-int4.mf.gz, and moondream-2b-int8.mf.gz. Inference takes a long time, at least 7 s; int4 is faster than...

This is the weight link: https://github.com/bshall/hubert/releases/tag/v0.1

I would like to know the license of these pretrained weights: https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/hubert_base.pt https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/rmvpe.pt

I recently trained a 44k model, but the inference result is a little strange: the pitch of the audio seems to be raised. I have changed the mel parameters in ljspeech.yaml: n_fft:...

enhancement
help wanted
good first issue
question

[Speaker-Encoder by @mueller91](https://drive.google.com/drive/folders/15oeBYf6Qn1edONkVLXe82MzdIi3O_9m3)