fish-speech icon indicating copy to clipboard operation
fish-speech copied to clipboard

Inferece result with noise

Open ChengsongLu opened this issue 1 year ago • 0 comments

Self Checks

  • [X] This template is only for bug reports. For questions, please visit Discussions.
  • [X] I have thoroughly reviewed the project documentation (installation, training, inference) but couldn't find information to solve my problem. English 中文 日本語 Portuguese (Brazil)
  • [X] I have searched for existing issues, including closed ones. Search issues
  • [X] I confirm that I am using English to submit this report (我已阅读并同意 Language Policy).
  • [X] [FOR CHINESE USERS] 请务必使用英文提交 Issue,否则会被关闭。谢谢!:)
  • [X] Please do not modify this template and fill in all required fields.

Cloud or Self Hosted

Self Hosted (Source)

Environment Details

inference.zip I followed the steps that inference with an specific speaker (text.wav), and the result (output.wav) quality doesn's sound good, it have some noises there.

Steps to Reproduce

python tools/vqgan/inference.py
-i "text.wav"
--checkpoint-path "checkpoints/fish-speech-1.5/firefly-gan-vq-fsq-8x1024-21hz-generator.pth"

python tools/llama/generate.py
--text "你们这个是什么群啊,你们这是害人不浅啊你们这个群!谁是群主,出来!真的太过分了。"
--prompt-text "人间灯火倒映湖中,她的渴望让静水泛起涟漪。若代价只是孤独,那就让这份愿望肆意流淌。流入她所注视的世间,也流入她如湖水般澄澈的目光。"
--prompt-tokens "fake.npy"
--checkpoint-path "checkpoints/fish-speech-1.5"
--num-samples 2

python tools/vqgan/inference.py
-i "codes_0.npy"
--checkpoint-path "checkpoints/fish-speech-1.5/firefly-gan-vq-fsq-8x1024-21hz-generator.pth"

✔️ Expected Behavior

No response

❌ Actual Behavior

No response

ChengsongLu avatar Dec 09 '24 10:12 ChengsongLu