无法将克隆好的音频文件文件保存下载到本地
问题描述 清晰简明地描述问题。
克隆生成的音频在合成后无法保存或下载到本地。应用程序能够正确生成音频,但播放后没有提供保存或下载的功能选项。
进入音频生成页面。 输入文本并选择所需的合成选项。 点击“生成音频”按钮生成并播放克隆的音频。 在播放后,尝试查找用于下载音频文件的按钮或链接,但没有此类选项。 预期行为 清晰简明地描述预期行为。
在生成克隆音频后,应该有选项可以将音频文件下载到本地系统。
Describe the bug A clear and concise description of what the bug is.
The generated cloned audio cannot be saved or downloaded locally after synthesis. The application correctly generates the audio, but there is no option or functionality to download it after playback.
Go to the audio generation page. Input text and select the desired synthesis options. Click on the 'Generate Audio' button to generate and play the cloned audio. Try to find a button or link to download the audio file after playback, but no such option is available. Expected behavior A clear and concise description of what you expected to happen.
After generating the cloned audio, there should be an option to download the audio file to the local system.
我也是不知道咋下载,用webui生成的 下载的都是0k的 wav
right click audio, save and rename file. this is gradio related
right click audio, save and rename file. this is gradio related
just tried, not work. There's nothing happen after click the Save Audio As.. menu
Actually, something happened, after one min or two, a file named 21.txt was downloaded, and it's an empty file.
Maybe the filename 21 is from the stream url http://192.168.0.100:7866/stream/2yhitg3ap2i/1247734777580/21 .
My Env: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/128.0.0.0 Safari/537.36
Related gradio PR: https://github.com/gradio-app/gradio/pull/8906
how to download final audio file
I got the same problem,how to save the final audio file?
https://www.youtube.com/watch?v=ReitI31JCcw here is the answer
right click audio, save and rename file with xxx.wav,it's work for me . 右键生成的语音,另存语音,重命名为xxx.wav,是可以播放的。
#audio_output = gr.Audio(label="Audio Output", autoplay=True, streaming=True) # wav audio_output = gr.Audio(label="Audio Output")
#audio_output = gr.Audio(label="Audio Output", autoplay=True, streaming=True) # wav audio_output = gr.Audio(label="Audio Output") This method can only apply to single sentencer. As for multiple sentences, voice snippets generated later will overwrite what is already in the output.
I solved it in this PR https://github.com/FunAudioLLM/CosyVoice/pull/772. It's just one file(inference.py) so you guys can try it out yourself if you are in a hurry.
刚通过自己拼接的方式解决了这个问题,详见这个PR https://github.com/FunAudioLLM/CosyVoice/pull/772. 只需要改 inference.py 这一个文件,所以如果比较急的话,可以直接把PR里面的文件拷贝过去试试。