GPT-SoVITS icon indicating copy to clipboard operation
GPT-SoVITS copied to clipboard

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Results 1028 GPT-SoVITS issues
Sort by recently updated
recently updated
newest added

可以贴下训练好的语音和原始数据集语音,看看大家的效果如何吗? 最好说下样本时长多少,微调的参数怎么设置,比如多少轮这样。

In the webui.py file, several strings with i18n code were missing to enable translation as soon as they are identified.

Running on local URL: http://0.0.0.0:9874 "C:\Users\lenovo\anaconda3\envs\GPTSoVits\python.exe" tools/damo_asr/cmd-asr.py "E:\GPT-SoVITS\output\slicer_opt" 2024-01-21 01:04:42,127 - modelscope - INFO - PyTorch version 2.1.2+cu121 Found. 2024-01-21 01:04:42,127 - modelscope - INFO - Loading ast index from...

In follow-up

报错如下: `(GPTSoVits) root@autodl-container-28d94f9faf-1cfbd1cd:~/autodl-tmp/GPT-SoVITS# python GPT_SoVITS/inference_webui.py Traceback (most recent call last): File "/root/autodl-tmp/GPT-SoVITS/GPT_SoVITS/inference_webui.py", line 28, in from text.cleaner import clean_text File "/root/autodl-tmp/GPT-SoVITS/GPT_SoVITS/text/cleaner.py", line 1, in from text import chinese, japanese, cleaned_text_to_sequence,...

Loading model cost 0.567 seconds. DEBUG:jieba:Loading model cost 0.567 seconds. Prefix dict has been built successfully. DEBUG:jieba:Prefix dict has been built successfully. 1%|▎ | 13/1500 [00:00 286] 1%|▍ | 15/1500...

File "E:\GPT-SoVITS\tools\my_utils.py", line 14, in load_audio ffmpeg.input(file, threads=0) File "E:\GPT-SoVITS\runtime\lib\site-packages\ffmpeg\_run.py", line 325, in run raise Error('ffmpeg', out, err) ffmpeg._run.Error: ffmpeg error (see stderr output for detail) During handling of the...

https://github.com/collabora/WhisperSpeech

There is currently a problem with incorrect readings that plagues all available text-to-speech (TTS) solutions. It’s essentially impossible to fully rely on a TTS that makes these errors. For instance,...

Pls add to roadmap Spanish compatibility Tnks

nice work, thank you for the open source. After testing, there are still inherent drawbacks of AR models, such as word swallowing and duplication issues.NAR models don’t sounds natural they...